Best Self-Hosted AI Agent Frameworks in 2026
CrewAI, LangGraph, AutoGen, and n8n compared for self-hosted AI agent development in 2026. Frameworks, architectures, Docker installs, and when to use each.
TL;DR
AI agent frameworks let you build systems where LLMs take actions, use tools, and coordinate with other agents to complete multi-step tasks. In 2026, LangGraph leads for production-grade stateful agents requiring complex orchestration. CrewAI wins for role-based multi-agent workflows you can set up in hours. n8n is the best option if you want a visual, no-code/low-code approach. AutoGen (Microsoft) is uniquely strong for conversational multi-agent patterns and includes a no-code Studio. All four are self-hostable; three are MIT-licensed open source, while n8n ships under a fair-code Sustainable Use License.
Key Takeaways
- LangGraph: 14K+ stars — graph-based state machine model, best for production agents, requires Python expertise
- CrewAI: 25K+ stars — role-based crew abstraction, fastest to build with, growing enterprise adoption
- AutoGen: 38K+ stars — Microsoft-backed, best for conversational multi-agent systems, AutoGen Studio for no-code
- n8n: 50K+ stars — visual workflow automation with AI nodes, easiest self-host, no Python required
- MCP/A2A support: n8n supports MCP; CrewAI has A2A and partial MCP; OpenAgents has both MCP and A2A natively; LangGraph and AutoGen are adding support
- Self-hosting: All run via Docker; n8n has the most polished self-hosted dashboard
The 2026 AI Agent Landscape
AI agents moved from demos to production systems in 2025. The defining shift: agents that can browse the web, write and execute code, call APIs, and coordinate with other agents to complete tasks that would take a human hours now run automatically.
The frameworks abstract the hard parts: managing LLM context windows across multiple steps, retrying failed tool calls, handling agent-to-agent communication, and persisting state between runs.
Agent workflow anatomy:
1. User gives a goal
2. Planner agent breaks it into subtasks
3. Worker agents execute subtasks (web search, code exec, API calls)
4. Aggregator synthesizes results
5. Output delivered to user
Without a framework:
- Manual context management
- Custom retry logic
- Ad-hoc tool call parsing
- No state persistence between steps
With a framework:
- Structured execution graphs
- Built-in tool integration
- State persistence
- Observability hooks
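The five-step loop above can be sketched in a few lines without any framework. This is a toy illustration: call_llm is a stand-in stub, not a real API, and the pieces a framework would supply (retries, context-window management, tool-call parsing, persistence) are deliberately missing.

```python
# Minimal plan -> execute -> aggregate loop, with a stubbed LLM.
def call_llm(prompt: str) -> str:
    """Stub LLM: returns a canned plan or a canned summary."""
    if prompt.startswith("PLAN:"):
        return "search: LLM agents\nsummarize: results"
    return f"summary of: {prompt}"

def run_agent(goal: str) -> str:
    # 1. Planner breaks the goal into subtasks (one per line)
    plan = call_llm(f"PLAN: {goal}")
    subtasks = [line for line in plan.splitlines() if line.strip()]

    # 2-3. Workers execute each subtask; results accumulate as context
    results = []
    for task in subtasks:
        results.append(call_llm(task))  # retries and tool parsing omitted

    # 4-5. Aggregator synthesizes the results into the final output
    return call_llm("\n".join(results))

print(run_agent("What is the latest on LLM agents?"))
```

Everything under "Without a framework" above is what you would bolt onto this loop by hand; the frameworks below give you those pieces out of the box.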
Framework Comparison
| | LangGraph | CrewAI | AutoGen | n8n |
|---|---|---|---|---|
| GitHub Stars | 14K+ | 25K+ | 38K+ | 50K+ |
| Language | Python | Python | Python + .NET | JS/TS (Node) |
| Programming model | State graph | Role-based crew | Conversational | Visual nodes |
| Self-hostable | ✅ | ✅ | ✅ | ✅ Best |
| Docker Compose | ✅ | ✅ | ✅ | ✅ Official |
| No-code option | ❌ | ❌ | ✅ Studio | ✅ Visual |
| MCP support | ❌ | ⚠️ Partial | ❌ | ✅ |
| A2A support | ❌ | ✅ | ❌ | ❌ |
| LangChain required | ✅ Yes | ❌ Optional | ❌ | ❌ |
| State persistence | ✅ Checkpointer | ⚠️ Limited | ✅ | ✅ |
| Human-in-the-loop | ✅ | ✅ | ✅ | ✅ |
| License | MIT | MIT | MIT | Sustainable Use License (fair-code) |
| Backing | LangChain Inc | VC-funded | Microsoft | VC-funded |
1. LangGraph — Best for Production Agents
LangGraph models agent workflows as directed graphs. Each node is a function (an LLM call, a tool use, a conditional check), and edges define the flow between nodes. State flows through the graph and persists between steps.
from langgraph.graph import StateGraph, END
from langgraph.checkpoint.memory import MemorySaver
from langchain_core.messages import HumanMessage, SystemMessage
from typing import TypedDict, Annotated
import operator

class AgentState(TypedDict):
    messages: Annotated[list, operator.add]  # appended across steps, not overwritten
    research_results: list[str]
    final_answer: str

def research_node(state: AgentState):
    """Search the web for relevant information."""
    query = state["messages"][-1].content
    results = web_search(query)  # your search tool (assumed defined)
    return {"research_results": results}

def synthesize_node(state: AgentState):
    """Generate the final answer from the research."""
    context = "\n".join(state["research_results"])
    response = llm.invoke([  # llm: any LangChain chat model (assumed defined)
        SystemMessage(content=f"Use this research: {context}"),
        *state["messages"]
    ])
    return {"messages": [response], "final_answer": response.content}

def should_research(state: AgentState):
    """Conditional edge: retry research until results exist, then synthesize."""
    if not state.get("research_results"):
        return "research"
    return "synthesize"

# Build the graph
graph = StateGraph(AgentState)
graph.add_node("research", research_node)
graph.add_node("synthesize", synthesize_node)
graph.set_entry_point("research")
graph.add_conditional_edges("research", should_research)
graph.add_edge("synthesize", END)

# Compile with checkpointing for state persistence
checkpointer = MemorySaver()
agent = graph.compile(checkpointer=checkpointer)

# Run with a thread ID (enables resuming interrupted runs)
config = {"configurable": {"thread_id": "user-123-session-1"}}
result = agent.invoke(
    {"messages": [HumanMessage(content="What is the latest on LLM agents?")]},
    config=config
)
Self-Host LangGraph Platform
LangGraph Platform is the commercial hosted version. For self-hosting:
# docker-compose.yml — LangGraph self-hosted
version: "3"
services:
langgraph-api:
image: langchain/langgraph-api:latest
ports:
- "8123:8000"
environment:
OPENAI_API_KEY: ${OPENAI_API_KEY}
LANGCHAIN_API_KEY: ${LANGCHAIN_API_KEY} # For LangSmith tracing (optional)
volumes:
- ./langgraph.json:/deps/langgraph.json
- ./src:/deps/src
restart: unless-stopped
2. CrewAI — Best for Multi-Agent Workflows
CrewAI's abstraction is a "crew" of agents with different roles working toward a shared goal. You define agents (a researcher, a writer, a reviewer), give them tools, and assign tasks.
from crewai import Agent, Task, Crew, Process
from crewai_tools import SerperDevTool, WebsiteSearchTool
# Define agents with roles
researcher = Agent(
role="Senior Research Analyst",
goal="Find accurate, up-to-date information on {topic}",
backstory="You're a meticulous researcher who only uses verified sources.",
tools=[SerperDevTool(), WebsiteSearchTool()],
llm="gpt-4o",
verbose=True
)
writer = Agent(
role="Technical Writer",
goal="Write clear, engaging content based on research",
backstory="You transform complex research into accessible articles.",
llm="gpt-4o"
)
# Define tasks
research_task = Task(
description="Research the latest developments in {topic}. "
"Find at least 5 credible sources.",
expected_output="A detailed research brief with sources and key findings.",
agent=researcher
)
writing_task = Task(
description="Write a 500-word article based on the research provided.",
expected_output="A polished, factual article ready for publication.",
agent=writer,
context=[research_task] # Writer receives researcher's output
)
# Assemble the crew
crew = Crew(
agents=[researcher, writer],
tasks=[research_task, writing_task],
process=Process.sequential, # or hierarchical
verbose=True
)
result = crew.kickoff(inputs={"topic": "AI agent frameworks in 2026"})
print(result.raw)
Self-Host CrewAI
# Install CrewAI
pip install crewai crewai-tools
# For the CrewAI+ managed platform (optional):
# crewai login
# crewai deploy
# For fully self-hosted: just run the Python scripts on your server
# No additional infrastructure required beyond your Python environment
# Docker approach:
docker run -d \
  -e OPENAI_API_KEY=your-key \
  -v $(pwd)/crews:/app/crews \
  python:3.11 \
  bash -c "pip install crewai crewai-tools && python /app/crews/my_crew.py"
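Installing packages on every container start is slow and fragile. A more reproducible alternative is baking the dependencies into an image; a sketch assuming the crews/my_crew.py layout from the command above:

```dockerfile
# Build once, run many times: no pip install on container start
FROM python:3.11-slim
WORKDIR /app
RUN pip install --no-cache-dir crewai crewai-tools
COPY crews/ /app/crews/
CMD ["python", "/app/crews/my_crew.py"]
```

Build with docker build -t my-crew . and run with docker run -e OPENAI_API_KEY=your-key my-crew.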
3. AutoGen — Best for Conversational Agents
Microsoft's AutoGen framework excels at multi-agent conversations — agents that debate, critique each other's output, and reach consensus through dialogue.
from autogen import AssistantAgent, UserProxyAgent, GroupChat, GroupChatManager
# AutoGen agents are designed for conversation patterns
assistant = AssistantAgent(
name="assistant",
llm_config={"model": "gpt-4o"},
system_message="You are a helpful AI assistant."
)
critic = AssistantAgent(
name="critic",
llm_config={"model": "gpt-4o"},
system_message="You critically review responses and suggest improvements."
)
user_proxy = UserProxyAgent(
name="user_proxy",
human_input_mode="NEVER", # Fully automated
code_execution_config={"work_dir": "workspace", "use_docker": True},
max_consecutive_auto_reply=10
)
# Group chat: agents discuss and improve responses together
groupchat = GroupChat(
agents=[user_proxy, assistant, critic],
messages=[],
max_round=6
)
manager = GroupChatManager(
groupchat=groupchat,
llm_config={"model": "gpt-4o"}
)
user_proxy.initiate_chat(
manager,
message="Write and test a Python function to sort a list of dicts by a key."
)
AutoGen Studio provides a no-code interface for building and testing agents — a significant advantage for teams with non-technical stakeholders.
4. n8n — Best for Visual, No-Code Agents
n8n is a workflow automation tool (like Zapier) that added powerful AI agent capabilities. If your team doesn't write Python but needs AI automation, n8n is the answer.
Self-Host n8n
# docker-compose.yml — n8n self-hosted
version: "3.8"
services:
n8n:
image: docker.n8n.io/n8nio/n8n:latest
ports:
- "5678:5678"
environment:
N8N_HOST: your-domain.com
N8N_PORT: 5678
N8N_PROTOCOL: https
WEBHOOK_URL: https://your-domain.com/
DB_TYPE: postgresdb
DB_POSTGRESDB_HOST: postgres
DB_POSTGRESDB_USER: n8n
DB_POSTGRESDB_PASSWORD: your-secure-password
DB_POSTGRESDB_DATABASE: n8n
volumes:
- n8n_data:/home/node/.n8n
depends_on:
- postgres
postgres:
image: postgres:15
environment:
POSTGRES_USER: n8n
POSTGRES_PASSWORD: your-secure-password
POSTGRES_DB: n8n
volumes:
- postgres_data:/var/lib/postgresql/data
volumes:
n8n_data:
postgres_data:
n8n's AI Agent node connects to OpenAI, Anthropic, Google, or local Ollama and gives you tool-calling, memory, and web search — all configured visually without writing code.
How to Choose
Choose LangGraph if:
- You need precise control over agent state and execution flow
- You're building production agents that must handle edge cases and interrupts
- Your team is comfortable with Python and graph-based programming
Choose CrewAI if:
- You want to build multi-agent workflows in hours, not days
- Role-based team metaphors map naturally to your use case
- You want the fastest time-to-working-agent
Choose AutoGen if:
- Your agents need to debate, critique, and improve outputs through conversation
- You want a no-code UI for non-technical stakeholders (AutoGen Studio)
- You're in the Microsoft ecosystem
Choose n8n if:
- Your team doesn't write Python
- You need visual workflow design with drag-and-drop
- You're automating workflows that combine AI with third-party integrations (Slack, email, databases)
Full list of open-source AI tools at OSSAlt.
Related: Best Open Source Cursor Alternatives 2026 · Self-Host Perplexica
See open source alternatives to n8n on OSSAlt.
Running AI Models Locally
Self-hosted AI deployments differ from typical web service deployments in two ways: resource requirements are significantly higher, and model performance depends heavily on hardware configuration.
Hardware requirements: LLM inference is memory-bound. A 7B parameter model requires approximately 4-8GB of VRAM or RAM depending on quantization. A 13B model needs 8-16GB. Running models on CPU is feasible for small models but 5-10x slower than GPU inference. For production AI workloads, a used NVIDIA RTX 3090 (24GB VRAM) handles most 7-13B models at acceptable speed.
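A back-of-envelope estimator that reproduces those ranges. The 1.2x overhead factor for KV cache and activations is an assumption for illustration, not a measured constant:

```python
def est_vram_gb(params_b: float, bits: int, overhead: float = 1.2) -> float:
    """Rough memory estimate: parameter count x bytes per weight x an
    overhead factor for KV cache and activations. A heuristic only."""
    total_bytes = params_b * 1e9 * (bits / 8) * overhead
    return round(total_bytes / 1e9, 1)

print(est_vram_gb(7, 4))    # 7B model at 4-bit quantization: ~4.2 GB
print(est_vram_gb(13, 8))   # 13B model at 8-bit quantization: ~15.6 GB
```

At 16-bit the same 7B model needs roughly 17 GB, which is why quantization is the default for self-hosted inference.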
Model management with Ollama: Ollama provides a clean API for pulling and running models from Ollama's model library. Models are stored in Docker volumes and persist across container restarts. Pull models with ollama pull llama3.2 and access them via OpenAI-compatible API at http://localhost:11434.
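As a quick check of the workflow above, assuming Ollama is already running on its default port:

```shell
# Pull a model into Ollama's local store
ollama pull llama3.2

# Ollama serves an OpenAI-compatible endpoint under /v1
curl http://localhost:11434/v1/chat/completions \
  -H "Content-Type: application/json" \
  -d '{"model": "llama3.2", "messages": [{"role": "user", "content": "Hello"}]}'
```

Because the endpoint is OpenAI-compatible, most agent frameworks can target it by changing only the base URL and model name.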
Monitoring AI services: GPU-hosted AI services need resource monitoring at the GPU level, not just CPU and memory. Add NVIDIA's DCGM Prometheus exporter to your monitoring stack alongside Prometheus + Grafana to track GPU memory utilization, temperature, and compute usage.
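A minimal Compose sketch for adding the DCGM exporter next to an existing Prometheus stack. It assumes the NVIDIA container toolkit is installed on the host; 9400 is the exporter's default metrics port:

```yaml
services:
  dcgm-exporter:
    image: nvcr.io/nvidia/k8s/dcgm-exporter:latest
    gpus: all                 # requires the NVIDIA container toolkit
    ports:
      - "9400:9400"           # Prometheus scrapes /metrics on this port
    restart: unless-stopped
```

Point a Prometheus scrape job at dcgm-exporter:9400 and import a DCGM dashboard in Grafana to get GPU memory, temperature, and utilization panels.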
API authentication: Self-hosted AI services often run without authentication by default. If you're exposing the API beyond localhost, add Authelia or configure API key authentication at the reverse proxy layer. Models with unrestricted API access can be abused for resource exhaustion or prompt injection attacks.
Cost analysis: Self-hosting AI inference is cost-effective at scale. A $0.30/hour GPU VPS running a capable open-weight model can work out to roughly $0.0002 per 1K tokens at sustained utilization, far cheaper than managed APIs for high-volume workloads. The break-even point versus managed APIs is typically 2-5M tokens per month.
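The break-even arithmetic can be made explicit. The $0.06 per 1K tokens managed-API price below is an assumption (roughly GPT-4-class pricing); plug in your own provider's numbers:

```python
def breakeven_tokens_per_month(gpu_usd_per_hour: float,
                               api_usd_per_1k_tokens: float) -> int:
    """Monthly token volume at which renting a GPU 24/7 costs the same
    as paying a managed API per token."""
    monthly_gpu_cost = gpu_usd_per_hour * 24 * 30
    return round(monthly_gpu_cost / api_usd_per_1k_tokens * 1000)

# $0.30/hr GPU vs an assumed $0.06 per 1K-token managed API
print(f"{breakeven_tokens_per_month(0.30, 0.06):,} tokens/month")
```

With these assumptions the break-even lands at 3.6M tokens/month, inside the 2-5M range quoted above; cheaper API tiers push the break-even higher.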
Network Security and Hardening
Self-hosted services exposed to the internet require baseline hardening. The default Docker networking model exposes container ports directly — without additional configuration, any open port is accessible from anywhere.
Firewall configuration: Use ufw (Uncomplicated Firewall) on Ubuntu/Debian or firewalld on RHEL-based systems. Allow only ports 22 (SSH), 80 (HTTP redirect), and 443 (HTTPS). Block all other inbound ports. Docker bypasses ufw's OUTPUT rules by default — install the ufw-docker package or configure Docker's iptables integration to prevent containers from opening ports that bypass your firewall rules.
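The baseline policy above as ufw commands. ufw-docker is the third-party helper mentioned; verify its install step against its README before running it on a production host:

```shell
sudo ufw default deny incoming
sudo ufw default allow outgoing
sudo ufw allow 22/tcp    # SSH
sudo ufw allow 80/tcp    # HTTP (redirect to HTTPS only)
sudo ufw allow 443/tcp   # HTTPS
sudo ufw enable

# Docker publishes ports via iptables and skips ufw; ufw-docker patches
# the ruleset so published container ports respect the firewall
sudo ufw-docker install
```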
SSH hardening: Disable password authentication and root login in /etc/ssh/sshd_config. Use key-based authentication only. Consider changing the default SSH port (22) to a non-standard port to reduce brute-force noise in your logs.
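The relevant sshd_config directives, followed by a config reload:

```text
# /etc/ssh/sshd_config — key-only access, no root login
PasswordAuthentication no
PermitRootLogin no
PubkeyAuthentication yes
# Optional: move off port 22 to cut brute-force log noise
# Port 2222
```

Test with sshd -t and apply with systemctl reload sshd; keep an existing session open until you confirm key login still works.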
Fail2ban: Install fail2ban to automatically ban IPs that make repeated failed authentication attempts. Configure jails for SSH, Nginx, and any application-level authentication endpoints.
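A minimal jail.local enabling the SSH jail; the retry and ban thresholds are illustrative defaults to tune:

```ini
# /etc/fail2ban/jail.local
[sshd]
enabled  = true
maxretry = 5
findtime = 10m
bantime  = 1h
```

Check active jails and current bans with fail2ban-client status sshd.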
TLS/SSL: Use Let's Encrypt certificates via Certbot or Traefik's automatic ACME integration. Never expose services over HTTP in production. Configure HSTS headers to prevent protocol downgrade attacks. Check your SSL configuration with SSL Labs' server test — aim for an A or A+ rating.
Container isolation: Avoid running containers as root. Add user: "1000:1000" to your docker-compose.yml service definitions where the application supports non-root execution. Use read-only volumes (volumes: - /host/path:/container/path:ro) for configuration files the container only needs to read.
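Both settings sketched in Compose. The image name is a placeholder, and the non-root user only works where the application image supports it:

```yaml
services:
  app:
    image: ghcr.io/example/app:1.4.2   # placeholder image, pinned tag
    user: "1000:1000"                  # run as an unprivileged UID/GID
    volumes:
      - ./config/app.yml:/etc/app/app.yml:ro   # read-only config mount
```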
Secrets management: Never put passwords and API keys directly in docker-compose.yml files committed to version control. Use Docker secrets, environment files (.env), or a secrets manager like Vault for sensitive configuration. Add .env to your .gitignore before your first commit.
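A minimal sketch of the .env pattern (placeholder image name):

```yaml
# docker-compose.yml — no secrets inline; values come from an untracked .env
services:
  app:
    image: ghcr.io/example/app:1.4.2   # placeholder image
    env_file: .env                     # keep .env out of version control
```

Compose also reads .env automatically for ${VAR} interpolation inside the file itself; either way, add .env to .gitignore before the first commit.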
Production Deployment Checklist
Before treating any self-hosted service as production-ready, work through this checklist. Each item represents a class of failure that will eventually affect your service if left unaddressed.
Infrastructure
- Server OS is running latest security patches (apt upgrade / dnf upgrade)
- Firewall configured: only ports 22, 80, 443 open
- SSH key-only authentication (password auth disabled)
- Docker and Docker Compose are current stable versions
- Swap space configured (at minimum equal to RAM for <4GB servers)
Application
- Docker image version pinned (not latest) in docker-compose.yml
- Data directories backed by named volumes (not bind mounts to ephemeral paths)
- Environment variables stored in a .env file (not hardcoded in compose)
- Container restart policy set to unless-stopped or always
- Health check configured in Compose or Dockerfile
Networking
- SSL certificate issued and auto-renewal configured
- HTTP requests redirect to HTTPS
- Domain points to server IP (verify with dig +short your.domain)
- Reverse proxy (Nginx/Traefik) handles SSL termination
Monitoring and Backup
- Uptime monitoring configured with alerting
- Automated daily backup of Docker volumes to remote storage
- Backup tested with a successful restore drill
- Log retention configured (no unbounded log accumulation)
Access Control
- Default admin credentials changed
- Email confirmation configured if the app supports it
- User registration disabled if the service is private
- Authentication middleware added if the service lacks native login
Getting Started
The best time to set up monitoring and backups is before you need them. Deploy your service, configure it, and immediately add it to your monitoring stack and backup schedule. These three steps — deploy, monitor, backup — are the complete foundation of a reliable self-hosted service. Everything else is incremental improvement.
Self-hosted AI agent frameworks are at an early stage relative to the broader self-hosting ecosystem. The frameworks change rapidly, and a configuration that worked in February may require updates by April. Build your deployment with this in mind: use pinned Docker image versions, maintain your own fork of any configuration you modify, and allocate time for periodic updates. The payoff — control over model selection, data privacy, and API costs — is substantial for teams running high-volume AI workflows. For occasional AI use, managed APIs remain simpler and cost-effective.
The evaluation criteria for self-hosted AI agent frameworks should include model compatibility, API surface stability, and community maintenance activity. A framework that supports only one model family or whose API breaks between releases creates migration overhead that offsets the operational control advantages of self-hosting. Check the GitHub commit history and issue response time before committing to a framework for production use — the self-hosted AI ecosystem is still consolidating, and some early-stage frameworks have been abandoned mid-development.