Introducing effGen — The Future of SLM Agents

BuildPowerfulAIAgents
with Small Language Models

Optimized for SLMs. 5-10x faster with complexity routing, automatic task decomposition, multi-agent orchestration, and vLLM.

$
0x
Faster (vLLM)
0
Built-in Tools
0
Presets
0
Backends
0%
Open Source
effgen-terminal● RUNNING
agent_demo.py
$
AGENT LOOP
ThoughtActionObservationAnswer
SCROLL
Features

Everything You Need to Build
Production-Ready AI Agents

Optimized for Small Language Models with production-grade features

Intelligent Task Decomposition

Automatically breaks down complex tasks with multi-dimensional complexity analysis and spawns specialized sub-agents.

  • Automatic complexity scoring
  • Sub-agent routing
  • Parallel execution
Click for details

Multi-Agent Orchestration

Coordinate multiple specialized agents with lifecycle management and agent-to-agent communication.

  • Task routing
  • Shared memory
  • A2A protocol
Click for details

Ultra-Fast vLLM Integration

Native vLLM support delivers 5-10x faster inference. Auto multi-GPU tensor parallelism and PagedAttention.

  • 5-10x faster inference
  • PagedAttention memory efficiency
  • Auto multi-GPU support
Click for details

Universal Tool Integration

31 built-in tools across finance, data science, DevOps, knowledge, and communication. Native function-calling and MCP/A2A/ACP.

  • 31 tools (finance, DevOps, RAG, comms...)
  • Native + ReAct + Hybrid tool calling
  • MCP/A2A/ACP protocols
Click for details

Guardrails & Safety

Offline, ML-free guardrails for toxicity, PII, prompt injection, topics, length, and tool safety. Composable chains with four presets.

  • PII (SSN/email/phone/CC-Luhn), Toxicity, Topic, Length
  • Prompt-injection detection (low/med/high)
  • Tool input/output/permission guardrails
Click for details

Advanced RAG Pipeline

Hybrid search (dense + BM25 + keyword) with RRF fusion, semantic/code/table/hierarchical chunkers, rerankers, and inline citations.

  • DocumentIngester (txt/md/pdf/docx/html/…)
  • HybridSearchEngine with RRF
  • CrossEncoder / LLM / rule-based rerankers
Click for details

DAG Workflow Engine

Define multi-agent pipelines as a DAG with cycle detection, conditional branching, and auto-parallel execution.

  • WorkflowDAG with Kahn's topo sort
  • Auto-parallelize independent nodes
  • YAML workflow definitions + CLI
Click for details

Evaluation & Regression

Built-in test suites, LLM-judge scoring, and baseline regression tracking. Compare multiple models side-by-side.

  • 5 suites: Math, ToolUse, Reasoning, Safety, Conversation
  • EXACT / CONTAINS / REGEX / SEMANTIC / LLM_JUDGE
  • Nightly CI opens GitHub issues on drift
Click for details
Get Started
Architecture

How effGen Works

The ReAct agent loop — reasoning and acting in perfect harmony

01

User Input

Natural language task or query is received by the agent

agent.run("Calculate 24344 * 334")
Click for details
02

Reasoning

Agent analyzes the task using ReAct-style thinking

Thought: I need to multiply these numbers...
Click for details
03

Tool Selection

Best tool is selected from 31 built-in options

Action: Calculator(24344 * 334)
Click for details
04

Execution

Tool runs in a sandboxed environment with safety controls

Executing Calculator...
Click for details
05

Observation

Agent observes and validates the tool output

Observation: 8130896
Click for details
06

Final Answer

Synthesized response returned to the user

Answer: 8,130,896
Click for details
Steps 2-5 repeat until the task is complete (max_iterations configurable)
Built-in Tools

31 Tools Ready to Use

From finance and data science to DevOps, knowledge, and communication — everything your agent needs, built in

🧮
COMPUTATION

Calculator

Perform mathematical calculations, evaluate expressions, and convert units

expressionoperationfrom_unitto_unitprecision
CODE

CodeExecutor

Execute code in a secure sandboxed environment (Python, JS, Bash)

codelanguagetimeoutmemory_limitnetwork_enabledfilesenv_vars
🐍
CODE

PythonREPL

Execute Python code in a persistent REPL session

codesession_idreset_sessionreturn_variablesrestricted_mode
🔍
INFO

WebSearch

Search the web using DuckDuckGo, SerpAPI, or Google

querynum_resultsbackendtime_rangelanguageregion
🌐
INFO

URLFetchTool

Fetch webpage content and extract readable text

urlextract_links
📋
DATA

JSONTool

Parse, query (JSONPath), validate, and format JSON data

dataoperationquery
📁
FILES

FileOperations

Safe file system operations: read, write, search, convert

operationpathcontentformatencodingpatternrecursivetarget_format
💻
SYSTEM

BashTool

Execute shell commands with security controls

command
Agent Presets

One-Line Agent Creation with 6 Presets

Ready-to-use configurations optimized for common use cases

🧮

math

Mathematical computations

temp
0.3
itr: 8
create_agent("math", model)
CLICK FOR DETAILS
🔬

research

Web research & information

temp
0.5
itr: 10
create_agent("research", model)
CLICK FOR DETAILS
💻

coding

Code execution & development

temp
0.4
itr: 12
create_agent("coding", model)
CLICK FOR DETAILS
🚀

general

All 11 tools for any task

temp
0.7
itr: 10
create_agent("general", model)
CLICK FOR DETAILS
📖

rag

Retrieval-augmented Q&A over your docs

temp
0.3
itr: 8
create_agent("rag", model, knowledge_base="./docs/")
CLICK FOR DETAILS

minimal

Direct inference, no tools

temp
0.7
itr: 1
create_agent("minimal", model)
CLICK FOR DETAILS
Quick Start

Up and Running in 60 Seconds

Three simple steps to your first AI agent

01

Install

Get started with pip. Includes vLLM for blazing-fast inference.

bash
pip install effgen[vllm]
02

Create Agent

One-line agent creation. 6 presets, 31 tools, offline guardrails built in.

python
from effgen import load_model
from effgen.presets import create_agent

model = load_model("Qwen/Qwen2.5-3B-Instruct", quantization="4bit")
agent = create_agent("general", model)          # all general-purpose tools
# or: create_agent("rag", model, knowledge_base="./docs/")
03

Execute Tasks

Run tasks with real-time token streaming and tool execution.

python
result = agent.run(
    "Analyze the latest tech trends and "
    "create a comprehensive report"
)
Start Building Today — It's Free

ReadytoBuildthe
FutureofAI?

Join thousands of developers building next-gen agents with effGen. Open source, production-ready, and blazing fast.

...
GitHub Stars
...
Forks
...
Contributors