Overview

Why did we create AG2?#

The original AutoGen project released with its first public preview in September 2023, and AG2 later diverged from that codebase in November 2024 to continue building on its core ideas.

AutoGen was one of the earliest frameworks for building AI agents and orchestrating agent-to-agent collaboration. That early vision proved valuable: it enabled real-world systems, informed the design of many tools, and helped shape the agent ecosystem.

Since then, the agent landscape has changed significantly. Over time, the community has established better practices, common protocols, and new interoperability standards. Capabilities that were once experimental are now becoming part of the expected foundation for agent platforms.

Examples include:

Model Context Protocol (MCP), introduced in November 2024
Agent2Agent (A2A), introduced in April 2025
AG-UI, introduced in May 2025

We have increasingly found that the original architecture inherited from AutoGen challenged the adoption of new ideas. Shipping modern capabilities inside the original design often requires introducing complexity, unnecessary migration effort, or compatibility compromises.

Not every part of the ecosystem is standardized yet, but the direction is clear. AI agents are no longer an experiment; they are standard application infrastructure.

Today's AG2 is our way to move forward with a future-focused foundation, applying the lessons we learned from building and operating hundreds of agent systems on the original framework.

What is AG2?#

AG2 is a protocol-driven, async agent framework built around a small, predictable core and a set of opt-in primitives. It rebuilds the framework around capabilities that were difficult or impractical to introduce on the original architecture.

It is the foundation for AG2 agent development and production-ready multi-agent systems going forward, and forms the basis of AG2 v1.0.

Why use AG2?#

AG2 is built around a small, predictable, core and a set of opt-in primitives you compose to fit your application. Here is what you get out of the box.

1. A clean, async-first agent API#

Two methods cover the conversational surface — agent.ask(...) to start a turn and reply.ask(...) to continue one. The agent loop, tool execution, and LLM calls are async throughout, with streaming enabled by default on supported providers.

2. A composable harness for capable, long-running, agents#

AG2's harness layers on powerful primitives to the base Agent — assembly policies for context shaping, a knowledge store for persistent memory, sub-task delegation with isolated streams, and middleware for retries, logging, token limits, and history management. Build up the agent you need with the harness doing the heavy lifting for you.

The Agent's ability to fan out work, in parallel, to a team of specialist agents (seen as tools), or as subtasks, provides natural orchestration within each agent.

3. Production and Scalability#

Human-in-the-loop hooks, structured output (static, callable, prompted, and transformable), OpenTelemetry tracing, persistent backends for history and streams (e.g. Redis), and a testing utility that mocks LLMs and tool calls without hitting the network — the primitives you need to take an Agent from prototype to production.

The runtime is async end-to-end, so a single process can drive many concurrent agents, tool calls, and provider streams without blocking, and sub-tasks fan out in parallel via asyncio.gather. State is externalised behind protocols — History, Storage, and Stream can be backed by Redis, a database, or anything you build — so agents stay effectively stateless and horizontal scaling is straightforward. Cross-cutting concerns like retries, rate limits, token budgets, and history compaction are middleware you compose onto an Agent.

4. UI and external integration#

Every event in the agent loop — model requests and responses, tool calls and results, human-input requests, observer alerts — flows through an event stream. Streams, with natural filtering capabilities, can power UIs, logging, metrics, or approvals without touching the agent itself.

The stream is bidirectional: AG-UI renders model output and tool calls in real time while user responses come back as HumanMessage events, and persistent backends like Redis let separate processes — a web frontend and a background worker — share the same live conversation.

5. Tools, toolkits, and built-in tools#

Define tools with a @tool decorator on plain functions. Use type hints, dependency injection (Context, Inject, Variable), and toolkits to organize related capabilities. Wire in built-in tools (web search, code execution, shell, memory) or expose any agent as a tool with Agent.as_tool(...).

6. One configuration model across providers#

A single, type-safe, interface spans OpenAI, OpenAI Responses, Anthropic, Gemini, Vertex AI, Ollama, and DashScope. Switching providers is a config change, not a rewrite, and structured output, multimodality (images, audio, video), and built-in tools work consistently across them.

Migrating from earlier AG2#

We value all the users and contributors who have made AG2 what it is today and want to bring you along on this journey.

If you are coming from the earlier ConversableAgent / GroupChat API, the network migration guide maps the classic orchestration patterns onto the new ag2.network module.

How do I try it out?#

Install with pip install ag2. The framework is importable as the ag2 module.

For the latest in-development version, use the main branch of the GitHub repository.

Using an AI coding assistant?

If you build with Claude Code, Cursor, Copilot, or another AI coding assistant, see Coding with AI Assistants to set it up with AG2 skills and project rules so it writes against the current ag2 API.

See the following pages for walkthroughs of the AG2 API.

Current Focus Areas#

AG2 is actively focused on:

improving the single-agent developer experience
providing stronger context and memory management primitives
simplifying integration with real applications, including Text UI, web, ambient, and background runtimes
enabling new multi-agent coordination patterns that are not feasible in the current AG2 architecture
supporting emerging standards and protocols across the AI agent ecosystem

We are building AG2 to make agent development simpler, more modern, and easier to integrate into production-grade applications. We would love your feedback as the API evolves (Discord).