2026 AI Agent Framework Showdown: LangGraph vs CrewAI vs OpenAI Agents SDK vs AutoGen vs Google ADK

Someone in my dev group chat asked last week: "Which AI Agent framework should I use in 2026?" The thread immediately turned into a war zone — someone swore by LangGraph, another person said CrewAI is the only sane choice, and a third person insisted OpenAI's official SDK is the future. No consensus whatsoever.

Here's the thing: there's no one-size-fits-all answer. Different frameworks shine in different scenarios, and the "best" one depends entirely on what you're building. I've spent the last six months tinkering with all the major frameworks — some in real production projects, others just to get a feel for them. Let me share what I actually found, no fluff, no sponsorship, just real experience.

The five frameworks I'll cover today:

LangGraph — From the LangChain team, graph-based orchestration, the production workhorse
CrewAI — Role-playing multi-agent framework, fastest to prototype with
OpenAI Agents SDK — Official OpenAI toolkit, lightweight but capable
AutoGen 2.0 — Microsoft Research's rebuild, async-first architecture
Google ADK — Google's Agent Development Kit, Gemini ecosystem play

There's also Anthropic's Claude Agent SDK, but it's still early days and overlaps a lot with OpenAI's offering. I'll cover that in a separate post. Today we're focusing on these five.

Do You Actually Need an Agent Framework?

Before diving into specifics, let's address something a lot of people skip: do you even need a framework?

If all you want is "give the LLM a search tool" or "let it read and write files," you don't need any framework. Plain old function calling works fine:

python

import openai
 
tools = [{
    "type": "function",
    "function": {
        "name": "web_search",
        "description": "Search the internet for information",
        "parameters": {
            "type": "object",
            "properties": {
                "query": {"type": "string", "description": "Search query"}
            },
            "required": ["query"]
        }
    }
}]
 
response = openai.chat.completions.create(
    model="gpt-4o",
    messages=[{"role": "user", "content": "What's the weather in Beijing today?"}],
    tools=tools
)

Simple, direct, gets the job done.

But when you start needing these things, frameworks earn their keep:

Multi-agent collaboration: One agent searches, another analyzes, a third writes reports
State management: Agents that remember previous conversations and actions
Human-in-the-loop: Critical steps that need human approval before proceeding
Error recovery: Agent crashes mid-execution and needs to resume from where it left off
Observability: Understanding what the agent did at each step, especially when debugging

If your use case involves any of these, it's worth investing time in a framework.

LangGraph: The Production Workhorse

TL;DR

If you're building something serious — something that'll run in production and you'll be on the hook for — LangGraph is currently the most reliable choice.

How It Works

LangGraph's core abstraction is the graph. Each processing step is a node, and the flow between steps is an edge. You can use conditional edges for branching logic, loops for retries, and the entire execution path is under your control.

It's basically a state machine with LLM capabilities bolted on.

python

from langgraph.graph import StateGraph, END
from typing import TypedDict, Annotated
 
class AgentState(TypedDict):
    query: str
    search_results: list[str]
    analysis: str
    needs_review: bool
 
def search_node(state: AgentState) -> AgentState:
    """Search node: fetch information"""
    results = do_search(state["query"])
    return {"search_results": results}
 
def analyze_node(state: AgentState) -> AgentState:
    """Analysis node: LLM analyzes search results"""
    analysis = llm_analyze(state["search_results"])
    needs_review = "uncertain" in analysis
    return {"analysis": analysis, "needs_review": needs_review}
 
def route_after_analysis(state: AgentState) -> str:
    """Conditional routing: decide if human review is needed"""
    if state["needs_review"]:
        return "human_review"
    return "generate_output"
 
# Build the graph
workflow = StateGraph(AgentState)
workflow.add_node("search", search_node)
workflow.add_node("analyze", analyze_node)
workflow.add_node("human_review", human_review_node)
workflow.add_node("generate_output", output_node)
 
# Define the flow
workflow.set_entry_point("search")
workflow.add_edge("search", "analyze")
workflow.add_conditional_edges("analyze", route_after_analysis)
workflow.add_edge("human_review", "generate_output")
workflow.add_edge("generate_output", END)
 
app = workflow.compile()

Where I Screwed Up

First time using LangGraph, the graph model completely threw me off. I'd been writing linear code my whole career, and suddenly I needed to think in nodes and edges. Took about two days before it really clicked.

The biggest gotcha was state management. LangGraph defines state through TypedDict, and each node function receives and returns state updates. The problem? If your node returns a misspelled state field name, it silently ignores it — no error, no warning. I once wrote search_result instead of search_results (missing the 's') and spent half a day debugging before I caught it.

Another gotcha: conditional edges must return node name strings. I tried returning node objects at first. Instant error. The docs mention this, but I didn't read carefully enough. Classic.

The Good

Full control: Every execution path is explicitly defined — no "the agent decided to do something weird" black boxes
Native human-in-the-loop: Graphs can pause at any node, wait for human input, then resume exactly where they stopped
LangSmith observability: The companion tool traces every graph execution, making debugging actually feasible
State persistence with checkpoints: Agent crashes mid-execution? Resume from the last checkpoint
Biggest community: 126,000+ GitHub stars, most tutorials and examples available

The Bad

Steep learning curve: The graph mental model is significantly harder than linear code
Boilerplate heavy: Even simple single-agent tasks require a bunch of glue code
LangSmith dependency: The best observability features require LangSmith, which isn't free

Best For

Regulated industries (finance, healthcare) that need compliance auditing
Complex multi-step workflows with branching logic
Production systems where reliability is non-negotiable
Scenarios requiring human approval at specific checkpoints

CrewAI: Fastest Time to Working Demo

TL;DR

If you need to prototype fast and want non-technical stakeholders to understand your agent logic, CrewAI is the way to go.

How It Works

CrewAI's core abstraction is the role. You give each agent a role name, goal, and backstory, then assemble them into a "crew" with tasks. The agents collaborate to get things done.

The mental model is dead simple — you're building a team where everyone has their specialty.

python

from crewai import Agent, Task, Crew, Process
 
researcher = Agent(
    role="Market Research Analyst",
    goal="Gather competitor intel and market data",
    backstory="Senior analyst with 10 years experience tracking this market",
    tools=[web_search, database_query],
    verbose=True
)
 
writer = Agent(
    role="Content Strategist",
    goal="Write compelling analysis reports based on research",
    backstory="Veteran tech writer who excels at making complex topics accessible",
    tools=[file_writer]
)
 
research_task = Task(
    description="Research 2026 AI agent framework market performance and user reviews",
    expected_output="A research report covering pros and cons of each framework",
    agent=researcher
)
 
writing_task = Task(
    description="Write an in-depth comparison article based on the research",
    expected_output="A 5000+ word analysis article",
    agent=writer,
    context=[research_task]
)
 
crew = Crew(
    agents=[researcher, writer],
    tasks=[research_task, writing_task],
    process=Process.sequential,
    verbose=True
)
 
result = crew.kickoff()

See how the code reads like a team workflow description? Product managers can understand it. That's CrewAI's killer feature.

Where Things Went Wrong

CrewAI was genuinely fast to get started — I had a working demo in about 30 minutes. But then I hit three issues.

First: token burn. CrewAI agents "talk" to each other, and that conversation eats way more tokens than you'd expect. I ran a three-agent task expecting maybe 100K tokens. It consumed nearly 300K. The agents carry massive context in their inter-agent conversations, and even the verbose console output counts toward processing.

Second: debugging is painful. When an agent does something unexpected (like the researcher delegating back to the writer who then delegates back to the researcher), figuring out what happened is really hard. LangGraph's graph model is complex, but at least every node's execution path is deterministic. CrewAI's agent collaboration feels more like a "free-form discussion" — results can be unpredictable.

Third: Flows documentation. CrewAI later introduced Flows as their "enterprise production architecture." But the docs mix Flows and Crews documentation together, and I kept getting confused about which API belonged to which. Some Flows APIs also differ from Crews APIs, making migration non-trivial.

The Good

Blazing fast prototyping: Working demo in 30 minutes, solid prototype in a day
Highly readable role definitions: Non-technical people can understand what each agent does
Built-in delegation: Agents can automatically assign subtasks to other agents
Sequential and hierarchical process modes: Sequential for pipelines, hierarchical for "project manager" patterns
Independent of LangChain: Lightweight, no heavy dependencies

The Bad

Execution flow can be unpredictable: Agent collaboration behavior sometimes surprises you
High token consumption: Multi-agent conversation overhead is significant
Debugging experience is poor: Hard to trace what went wrong
No major tech backing: Unlike LangGraph (LangChain) or AutoGen (Microsoft), CrewAI is an independent team

Best For

Rapid prototyping and idea validation
Content generation and research/analysis tasks
Scenarios where non-technical stakeholders need to understand agent logic
Exploratory projects where execution determinism isn't critical

OpenAI Agents SDK: The Balanced Choice

TL;DR

If you primarily use OpenAI models and want a clean, well-designed framework without excessive complexity, the Agents SDK hits the sweet spot.

How It Works

OpenAI Agents SDK's design philosophy is simplicity. No graph models, no role-playing theater — just straightforward agent and tool definitions. The core concepts are: Agent, Tool, Handoff, and Guardrail.

python

from agents import Agent, Runner, function_tool
 
@function_tool
def get_weather(city: str) -> str:
    """Get weather for a city"""
    return f"{city}: Sunny, 25°C today"
 
@function_tool
def search_web(query: str) -> str:
    """Search the internet"""
    return do_web_search(query)
 
weather_agent = Agent(
    name="Weather Assistant",
    instructions="Help users check weather. If they ask something non-weather-related, hand off to the general assistant.",
    tools=[get_weather]
)
 
general_agent = Agent(
    name="General Assistant",
    instructions="You're a helpful AI assistant that can answer all kinds of questions.",
    tools=[search_web]
)
 
# Handoff: agent-to-agent transfer
weather_agent.handoffs = [general_agent]
 
result = Runner.run_sync(weather_agent, "What's the weather in Beijing?")
print(result.final_output)

Handoff is the most interesting design choice. It lets one agent transfer a conversation to another when it can't handle the request — like customer service escalation, but smarter.

My Experience

The overall vibe of OpenAI Agents SDK is "just right." It's not as heavy as LangGraph, not as flashy as CrewAI — just a clean, well-thought-out framework.

What impressed me most was the Guardrail mechanism. You can add validation logic on agent inputs and outputs, blocking invalid requests before they reach the LLM:

python

from agents import Agent, InputGuardrail, GuardrailFunctionOutput
from pydantic import BaseModel
 
class SafetyCheck(BaseModel):
    is_safe: bool
    reason: str
 
safety_agent = Agent(
    name="Safety Checker",
    instructions="Check if user input contains inappropriate content",
    output_type=SafetyCheck
)
 
async def check_safety(ctx, agent, input):
    result = await Runner.run(safety_agent, input)
    return GuardrailFunctionOutput(
        output_info=result.final_output,
        tripwire_triggered=not result.final_output.is_safe
    )
 
agent = Agent(
    name="Assistant",
    instructions="Be helpful",
    input_guardrails=[InputGuardrail(guardrail_function=check_safety)]
)

This is super practical for production. You can catch problematic inputs before the agent even starts processing, rather than reviewing outputs after the fact.

The Sandbox Agent (new in v0.14.0) also deserves a mention. It lets agents run in sandboxed environments — executing code, manipulating files — similar to Claude Code's worktree model. Great for long-running tasks that need filesystem access.

The Good

Clean design: Minimal API surface, few concepts to learn
Handoff mechanism: Agent-to-agent transfer feels natural
Native guardrails: Input/output validation out of the box
100+ model support: Despite being OpenAI-made, it's not locked to OpenAI models
Built-in tracing: Debug without extra setup
Sandbox Agent: Container-based execution for long-running tasks

The Bad

Relatively young: Open-sourced in 2025, ecosystem still building
Limited complex scenario support: Can't match LangGraph's graph orchestration for intricate workflows
Documentation skews simple: Advanced use cases are under-documented

Best For

Medium-complexity agent tasks
Scenarios requiring flexible agent-to-agent handoffs
Applications with input/output safety requirements
Developers who want to get productive fast without learning heavy concepts

AutoGen 2.0: Microsoft's Async Powerhouse

TL;DR

If you're in the Azure ecosystem or need high-concurrency multi-agent scenarios, AutoGen 2.0 is worth serious consideration.

How It Works

AutoGen's core idea is conversation. Agents collaborate by exchanging messages, like a group of people discussing a problem in a Slack channel. Version 2.0 was a complete rewrite — async-first architecture, modular runtime, production-grade.

python

from autogen_agentchat.agents import AssistantAgent
from autogen_agentchat.teams import RoundRobinGroupChat
from autogen_ext.models.openai import OpenAIChatCompletionClient
 
model = OpenAIChatCompletionClient(model="gpt-4o")
 
coder = AssistantAgent(
    name="coder",
    model_client=model,
    system_message="You're a Python developer. Write clean, tested code."
)
 
reviewer = AssistantAgent(
    name="reviewer",
    model_client=model,
    system_message="You're a code review expert. Find bugs and suggest improvements."
)
 
team = RoundRobinGroupChat(
    participants=[coder, reviewer],
    max_turns=4
)
 
result = await team.run(task="Write a Python quicksort function with unit tests")

AutoGen 2.0's conversation modes are flexible: one-on-one chats, group discussions, even nested conversations (where one agent's response triggers a sub-conversation) all work.

My Experience

Honestly, AutoGen 1.0 was rough. Confusing APIs, outdated docs, constant version compatibility issues. But 2.0 is a completely different animal — much better experience overall.

The biggest pleasant surprise was async performance. I tested running 200 concurrent agent sessions, and AutoGen 2.0 handled them without breaking a sweat. Same test with CrewAI showed noticeable lag.

But AutoGen 2.0 has one issue that drove me crazy: unpredictable token consumption. Because agents have "free-form conversations," estimating token costs upfront is nearly impossible. I ran a code review task once where the two agents "debated" for 20+ rounds before concluding. Token costs went through the roof.

I eventually solved this with max_turns limits. But it highlights a real concern: AutoGen's conversation mode needs hard constraints in production, or costs can spiral.

The Good

Async architecture: Handles high concurrency beautifully
Deep Azure integration: Seamless with Azure OpenAI services
Flexible conversation patterns: One-on-one, group chat, nested conversations
Strong code execution: HumanProxyAgent pattern is excellent for code gen + review
Microsoft Research backing: Research papers land in the framework fast

The Bad

Token consumption can spiral: Conversation mode leads to verbose agents
1.0 → 2.0 migration is painful: Completely different APIs, no backward compatibility
Inconsistent documentation: Some modules well-documented, others nearly bare
Smaller community than LangGraph: Fewer GitHub stars, less discussion activity

Best For

High-concurrency multi-agent systems
Projects deployed on Azure
Code generation and review workflows
Scenarios where agents need to "discuss" to reach consensus

Google ADK: The Gemini Ecosystem Latecomer

TL;DR

If you're deep in the Gemini model ecosystem and Google Cloud, ADK is the natural fit. But brace yourself — it's still evolving fast and APIs change frequently.

How It Works

Google ADK (Agent Development Kit) launched in late 2025, tightly integrated with Gemini models. Its design philosophy is code-first — all agent logic defined in Python code, no declarative configs.

python

from google.adk.agents import Agent
from google.adk.runners import Runner
from google.adk.sessions import InMemorySessionService
 
root_agent = Agent(
    name="assistant",
    model="gemini-2.0-flash",
    description="A helpful AI assistant",
    instruction="You're a professional tech assistant",
    tools=[search_tool, calculator_tool]
)
 
session_service = InMemorySessionService()
runner = Runner(
    agent=root_agent,
    app_name="my_app",
    session_service=session_service
)

ADK supports multi-agent orchestration with hierarchical structures: a main agent understands user intent and delegates subtasks to specialized sub-agents.

My Experience

ADK gives me "high potential, not quite there yet" vibes. Three issues stood out during my testing:

First, dependency conflicts. ADK's dependencies clash with other Google libraries (like google-cloud-aiplatform). Took me about an hour just to get the environment clean.

Second, docs lag behind code. ADK iterates fast — new versions basically every week. But documentation doesn't always keep up. You follow the docs, write your code, run it, and discover the API already changed.

Third, scarce community resources. Compared to LangGraph and CrewAI, Chinese-language ADK resources are basically non-existent. When you hit issues, it's GitHub Issues and official docs or nothing.

That said, the underlying capabilities are genuinely strong. Performance and quality with Gemini 2.0 models are excellent. And Google Cloud's Vertex AI has native ADK support, making deployment straightforward.

The Good

Deep Gemini integration: Best-in-class with Google's own models
Code-first approach: Logic in code, not config files
Easy Vertex AI deployment: One-click for Google Cloud users
MCP protocol support: Native Model Context Protocol integration
Fast iteration: Google is investing heavily

The Bad

Not mature yet: Frequent API changes make production use risky
Dependency conflicts: Compatibility with other Google libraries needs work
Sparse community resources: Hard to find help, especially in non-English communities
Model lock-in tendency: Technically supports other models, but works best with Gemini

Best For

Teams heavily using Gemini models
Projects deployed on Google Cloud
Agents integrating with Google services (Search, Maps, YouTube)
Developers with patience for cutting-edge tech

Head-to-Head Comparison

Here's my subjective scoring across key dimensions (out of 5):

Ease of Getting Started (higher = easier):

CrewAI: ⭐⭐⭐⭐⭐ — Demo in 30 minutes
OpenAI Agents SDK: ⭐⭐⭐⭐ — Clean API, few concepts
Google ADK: ⭐⭐⭐ — Setup can be finicky
AutoGen 2.0: ⭐⭐⭐ — Much better than 1.0
LangGraph: ⭐⭐ — Graph model takes time to internalize

Production Readiness:

LangGraph: ⭐⭐⭐⭐⭐ — Most mature, LangSmith backing
OpenAI Agents SDK: ⭐⭐⭐⭐ — Young but solidly designed
AutoGen 2.0: ⭐⭐⭐⭐ — Microsoft backing, Azure integration
CrewAI: ⭐⭐⭐ — Flows is improving but not there yet
Google ADK: ⭐⭐ — Too much API churn

Token Cost Control:

LangGraph: ⭐⭐⭐⭐⭐ — Graph model is naturally controllable
OpenAI Agents SDK: ⭐⭐⭐⭐ — Guardrails help constrain
Google ADK: ⭐⭐⭐ — Decent
CrewAI: ⭐⭐ — Multi-agent chatter burns tokens
AutoGen 2.0: ⭐⭐ — Conversation mode gets expensive fast

Multi-Agent Collaboration:

AutoGen 2.0: ⭐⭐⭐⭐⭐ — Most flexible conversation patterns
CrewAI: ⭐⭐⭐⭐⭐ — Most intuitive role-playing model
LangGraph: ⭐⭐⭐⭐ — Subgraph support
OpenAI Agents SDK: ⭐⭐⭐⭐ — Clean handoff mechanism
Google ADK: ⭐⭐⭐ — Hierarchical orchestration

Community & Ecosystem:

LangGraph: ⭐⭐⭐⭐⭐ — 126K+ stars, richest ecosystem
CrewAI: ⭐⭐⭐⭐ — 44K+ stars, active community
OpenAI Agents SDK: ⭐⭐⭐⭐ — OpenAI halo effect
AutoGen 2.0: ⭐⭐⭐ — Microsoft community
Google ADK: ⭐⭐ — Still early days

My Recommendation

Building production-grade agent systems? Go with LangGraph. The learning curve is steeper, but the graph model's controllability and LangSmith's observability are lifesavers in production.

Need to validate an idea fast? Go with CrewAI. Demo in half a day, prototype in a day. Perfect for MVP stage.

Want a balanced, no-drama choice? Go with OpenAI Agents SDK. Clean, sufficient, not flashy. Works for most medium-complexity scenarios.

Living in the Azure ecosystem? Go with AutoGen 2.0. Azure OpenAI integration is seamless, and async performance is excellent.

All-in on Google Cloud? Go with Google ADK. But be ready to deal with rough edges — it's still iterating fast.

Not sure? Start with OpenAI Agents SDK. Lowest learning curve, most transferable concepts, and migrating to other frameworks later is relatively painless.

A Real Project's Framework Selection Process

Let me share a recent project's selection journey for context.

The requirement: an automated content analysis system. Input a URL, the system scrapes content, analyzes key information, generates a structured report. Some critical judgments need human confirmation mid-process.

I started with CrewAI because of its fast prototyping. Defined three agents: scraper, analyzer, reporter. Ran it for a few days, hit two issues. Token costs were too high ($15-20/day for 100 URLs), and the analyzer sometimes skipped uncertain judgments instead of flagging them for human review.

Switched to LangGraph. Redesigned the flow as a graph: scrape → initial analysis → determine if human review needed → confirm/auto-process → generate report. The native human-in-the-loop support solved the confirmation problem perfectly, and the graph model brought token costs under control ($8-10/day for the same 100 URLs).

But development speed dropped noticeably. What took half a day with CrewAI took two days with LangGraph.

The takeaway: if your project needs production reliability, cost control, and human oversight, LangGraph is worth the extra development time. If it's an internal tool with less stringent requirements, CrewAI does the job.

What's Coming in Late 2026

A few trends I'm watching:

MCP is becoming standard. Nearly every agent framework is adopting the Model Context Protocol for standardized tool integration. If you're starting with agent frameworks today, understand MCP first.

Agent-as-a-Service is taking off. More platforms offer hosted agent runtimes — LangSmith Deployment, CrewAI Enterprise, Google Vertex AI Agent Builder. You don't need to manage servers anymore.

Multimodal agents are next. It's not just text anymore. Agents need to process images, audio, video. GPT-4o, Gemini 2.0 are pushing this direction, and frameworks need to keep up.

Cost control matters more than ever. As agent applications scale, token consumption costs become a core concern. Frameworks that help control costs at the architecture level will have a real competitive advantage.

That's it for this one. Planning to build a complete small project with each framework next — detailed tutorials coming. Questions? Drop them in the comments.

1	`import openai`
2
3	`tools = [{`
4	`"type": "function",`
5	`"function": {`
6	`"name": "web_search",`
7	`"description": "Search the internet for information",`
8	`"parameters": {`
9	`"type": "object",`
10	`"properties": {`
11	`"query": {"type": "string", "description": "Search query"}`
12	`},`
13	`"required": ["query"]`
14	`}`
15	`}`
16	`}]`
17
18	`response = openai.chat.completions.create(`
19	`model="gpt-4o",`
20	`messages=[{"role": "user", "content": "What's the weather in Beijing today?"}],`
21	`tools=tools`
22	`)`

1	`from langgraph.graph import StateGraph, END`
2	`from typing import TypedDict, Annotated`
3
4	`class AgentState(TypedDict):`
5	`query: str`
6	`search_results: list[str]`
7	`analysis: str`
8	`needs_review: bool`
9
10	`def search_node(state: AgentState) -> AgentState:`
11	`"""Search node: fetch information"""`
12	`results = do_search(state["query"])`
13	`return {"search_results": results}`
14
15	`def analyze_node(state: AgentState) -> AgentState:`
16	`"""Analysis node: LLM analyzes search results"""`
17	`analysis = llm_analyze(state["search_results"])`
18	`needs_review = "uncertain" in analysis`
19	`return {"analysis": analysis, "needs_review": needs_review}`
20
21	`def route_after_analysis(state: AgentState) -> str:`
22	`"""Conditional routing: decide if human review is needed"""`
23	`if state["needs_review"]:`
24	`return "human_review"`
25	`return "generate_output"`
26
27	`# Build the graph`
28	`workflow = StateGraph(AgentState)`
29	`workflow.add_node("search", search_node)`
30	`workflow.add_node("analyze", analyze_node)`
31	`workflow.add_node("human_review", human_review_node)`
32	`workflow.add_node("generate_output", output_node)`
33
34	`# Define the flow`
35	`workflow.set_entry_point("search")`
36	`workflow.add_edge("search", "analyze")`
37	`workflow.add_conditional_edges("analyze", route_after_analysis)`
38	`workflow.add_edge("human_review", "generate_output")`
39	`workflow.add_edge("generate_output", END)`
40
41	`app = workflow.compile()`

1	`from crewai import Agent, Task, Crew, Process`
2
3	`researcher = Agent(`
4	`role="Market Research Analyst",`
5	`goal="Gather competitor intel and market data",`
6	`backstory="Senior analyst with 10 years experience tracking this market",`
7	`tools=[web_search, database_query],`
8	`verbose=True`
9	`)`
10
11	`writer = Agent(`
12	`role="Content Strategist",`
13	`goal="Write compelling analysis reports based on research",`
14	`backstory="Veteran tech writer who excels at making complex topics accessible",`
15	`tools=[file_writer]`
16	`)`
17
18	`research_task = Task(`
19	`description="Research 2026 AI agent framework market performance and user reviews",`
20	`expected_output="A research report covering pros and cons of each framework",`
21	`agent=researcher`
22	`)`
23
24	`writing_task = Task(`
25	`description="Write an in-depth comparison article based on the research",`
26	`expected_output="A 5000+ word analysis article",`
27	`agent=writer,`
28	`context=[research_task]`
29	`)`
30
31	`crew = Crew(`
32	`agents=[researcher, writer],`
33	`tasks=[research_task, writing_task],`
34	`process=Process.sequential,`
35	`verbose=True`
36	`)`
37
38	`result = crew.kickoff()`

1	`from agents import Agent, Runner, function_tool`
2
3	`@function_tool`
4	`def get_weather(city: str) -> str:`
5	`"""Get weather for a city"""`
6	`return f"{city}: Sunny, 25°C today"`
7
8	`@function_tool`
9	`def search_web(query: str) -> str:`
10	`"""Search the internet"""`
11	`return do_web_search(query)`
12
13	`weather_agent = Agent(`
14	`name="Weather Assistant",`
15	`instructions="Help users check weather. If they ask something non-weather-related, hand off to the general assistant.",`
16	`tools=[get_weather]`
17	`)`
18
19	`general_agent = Agent(`
20	`name="General Assistant",`
21	`instructions="You're a helpful AI assistant that can answer all kinds of questions.",`
22	`tools=[search_web]`
23	`)`
24
25	`# Handoff: agent-to-agent transfer`
26	`weather_agent.handoffs = [general_agent]`
27
28	`result = Runner.run_sync(weather_agent, "What's the weather in Beijing?")`
29	`print(result.final_output)`

1	`from agents import Agent, InputGuardrail, GuardrailFunctionOutput`
2	`from pydantic import BaseModel`
3
4	`class SafetyCheck(BaseModel):`
5	`is_safe: bool`
6	`reason: str`
7
8	`safety_agent = Agent(`
9	`name="Safety Checker",`
10	`instructions="Check if user input contains inappropriate content",`
11	`output_type=SafetyCheck`
12	`)`
13
14	`async def check_safety(ctx, agent, input):`
15	`result = await Runner.run(safety_agent, input)`
16	`return GuardrailFunctionOutput(`
17	`output_info=result.final_output,`
18	`tripwire_triggered=not result.final_output.is_safe`
19	`)`
20
21	`agent = Agent(`
22	`name="Assistant",`
23	`instructions="Be helpful",`
24	`input_guardrails=[InputGuardrail(guardrail_function=check_safety)]`
25	`)`

1	`from autogen_agentchat.agents import AssistantAgent`
2	`from autogen_agentchat.teams import RoundRobinGroupChat`
3	`from autogen_ext.models.openai import OpenAIChatCompletionClient`
4
5	`model = OpenAIChatCompletionClient(model="gpt-4o")`
6
7	`coder = AssistantAgent(`
8	`name="coder",`
9	`model_client=model,`
10	`system_message="You're a Python developer. Write clean, tested code."`
11	`)`
12
13	`reviewer = AssistantAgent(`
14	`name="reviewer",`
15	`model_client=model,`
16	`system_message="You're a code review expert. Find bugs and suggest improvements."`
17	`)`
18
19	`team = RoundRobinGroupChat(`
20	`participants=[coder, reviewer],`
21	`max_turns=4`
22	`)`
23
24	`result = await team.run(task="Write a Python quicksort function with unit tests")`

1	`from google.adk.agents import Agent`
2	`from google.adk.runners import Runner`
3	`from google.adk.sessions import InMemorySessionService`
4
5	`root_agent = Agent(`
6	`name="assistant",`
7	`model="gemini-2.0-flash",`
8	`description="A helpful AI assistant",`
9	`instruction="You're a professional tech assistant",`
10	`tools=[search_tool, calculator_tool]`
11	`)`
12
13	`session_service = InMemorySessionService()`
14	`runner = Runner(`
15	`agent=root_agent,`
16	`app_name="my_app",`
17	`session_service=session_service`
18	`)`