Home / Guides / The 4-Agent Pattern: Boss, Builder, Publisher, Researcher

By Aderson Rocha · 2026-05-24

The 4-Agent Pattern: Boss, Builder, Publisher, Researcher

The 4-agent pattern is an orchestration architecture where a central Boss agent receives tasks, decomposes them into subtasks, and delegates work to three specialized agents: a Researcher (gathers information), a Builder (enriches and transforms data), and a Publisher (delivers results). This pattern dominates production multi-agent systems in 2026 because it is simple to debug, scales horizontally, and isolates failures to a single agent instead of crashing the whole pipeline. We run this exact pattern daily across lead generation, content publishing, and data enrichment systems. If you are building autonomous AI for a real business, this is the architecture to start with.

Gartner reported a 1,445% surge in multi-agent system inquiries between Q1 2024 and Q2 2025. Two thirds of the agentic AI market now runs coordinated multi-agent systems instead of single-agent solutions. The orchestrator-worker model (which the 4-agent pattern implements) is the most commonly deployed pattern in production environments. If you want the full technical breakdown, read our Autonomous Agent Architecture guide first, then come back here for the practical implementation.

What Is the Orchestrator-Worker Pattern and Why Does It Matter?

The orchestrator-worker pattern uses a single coordinating agent (the orchestrator) to manage multiple specialized worker agents. Workers do not communicate with each other. All coordination flows through the orchestrator, creating a single traceable control flow that makes debugging straightforward. This is the hub-and-spoke model, and it outperforms swarm architectures and flat mesh patterns for business applications because it keeps complexity low while scaling well.

In academic papers, you will find references to swarm intelligence, hierarchical multi-agent frameworks, and consensus-based coordination. Those are fascinating for research. In production, they are fragile. The orchestrator-worker pattern wins because operators can see exactly what happened at every step, retry a single failed agent without restarting the whole run, and add more workers without changing the coordination logic.

How Does the Boss Agent Work?

The Boss agent receives the incoming task (a scheduled trigger, a webhook, a user request, or an email) and makes three decisions: what needs to happen, who should do it, and in what order. Its core job is decomposition and delegation. The Boss never does the actual work. It classifies, routes, and monitors.

What the Boss handles in production:

Classify the incoming request by type, priority, and vertical
Decompose the request into independent subtasks
Route each subtask to the correct specialized agent
Collect results from all agents
Synthesize the final output or escalate failures

Example from our lead generation system:

A client triggers "Generate roofing leads in the Dallas market." The Boss classifies this as a roofing lead generation task for Dallas. It decomposes into three subtasks: Researcher finds 50 roofing companies in the Dallas metro, Builder enriches each company with contact data and a lead score, Publisher exports the results to CSV and emails the client. The Boss routes all three in sequence (Researcher first, then Builder, then Publisher) and monitors completion.

The Boss runs on a lightweight model (Haiku or Sonnet depending on complexity). Decision latency is 5 to 15 seconds. If the Boss gets the decomposition wrong, everything downstream breaks. We validate Boss decisions by checking completeness (are any subtasks missing?), independence (can subtasks run in parallel?), and accuracy (did the right agent get the right instructions?).

What Does the Researcher Agent Do?

The Researcher gathers information from external sources: web searches, API calls, database queries, public records, and directory scraping. It finds raw data and returns it in a structured format. The Researcher does not enrich, score, or deliver. It finds. That single focus is what makes the pattern work.

Production example:

The Boss sends: "Find 50 roofing contractors in Dallas with 1 to 50 employees." The Researcher queries Google Maps, business directories, and public license databases. It returns a JSON array of 50 records, each containing company name, address, phone number, and website. Records with missing fields are flagged. Duplicates are removed. The Researcher logs what it found, what sources it used, and what it could not find.

The Researcher runs on Haiku (search tasks are straightforward and do not require heavy reasoning). Latency ranges from 30 to 120 seconds depending on search complexity. Data quality is the Researcher's most important metric. We validate by checking field completeness, recency, and duplicate rate.

What Does the Builder Agent Do?

The Builder takes raw data from the Researcher and transforms it into usable output. It enriches records with additional fields, scores leads, classifies data, generates personalized content, and validates the output schema. The Builder is the reasoning-heavy agent in the pattern.

Production example:

The Researcher delivers 50 roofing company records. The Builder enriches each one: looks up owner email addresses, estimates annual revenue from employee count and market data, calculates a lead score from 0 to 10 based on qualification criteria, and tags each record with attributes like "storm damage history" or "recently permitted." The Builder returns enriched JSON objects that are ready for delivery.

The Builder runs on Sonnet because enrichment requires reasoning over incomplete data, making judgment calls about scoring, and generating personalized content. Latency is 10 to 30 seconds per batch. We validate Builder output by checking that all required fields are present, numeric scores are within range, and error records are logged and skipped rather than silently dropped.

What Does the Publisher Agent Do?

The Publisher takes the final enriched dataset and delivers it to wherever it needs to go: email, CRM, CSV file, database, Slack channel, or dashboard. It handles formatting for each destination, tracks delivery success or failure, and reports back to the Boss. The Publisher is the execution layer.

Production example:

The Builder delivers 47 enriched leads (3 were dropped during enrichment for missing data). The Publisher formats the leads as a CSV attachment, sends an email to the client with a summary ("47 qualified roofing leads in Dallas, avg score 7.2/10"), writes each lead to the client's CRM via API, and logs delivery results ("CSV sent, CRM accepted 45, 2 API errors logged").

The Publisher runs on Haiku (minimal reasoning required, mostly formatting and routing). Latency is 5 to 15 seconds. We validate by confirming delivery succeeded, error logs are clear, and rate limits were respected.

Why Does This Pattern Beat Single-Agent Systems?

Single-agent systems fail in production for four predictable reasons. The 4-agent pattern solves all four.

Failure 1: Complexity overload. A single agent asked to "find leads, enrich them, score them, and send a report" tries to hold all four tasks in context simultaneously. By step three, it confuses earlier data with later instructions and hallucination rates climb. The 4-agent pattern gives each agent one job. Researcher finds. Builder enriches. Publisher delivers. Context stays clean.

Failure 2: Invisible breakpoints. When a single agent fails, you see "Error: task could not be completed." You have no idea which step broke. With the 4-agent pattern, each agent logs its own results. You see: "Researcher found 50 records. Builder enriched 47. Publisher failed on CRM write." The broken step is immediately visible.

Failure 3: Scaling bottlenecks. Scaling a single agent means running more copies of the entire pipeline. Each copy costs the same regardless of which step needs more capacity. The 4-agent pattern lets you scale individual agents. Need more search capacity? Add Researchers. Need faster delivery? Add Publishers. The Boss stays as a single lightweight instance.

Failure 4: Risky updates. Updating a single agent's instructions risks breaking all running instances. The 4-agent pattern isolates changes. Update the Publisher's email template without touching the Researcher, Builder, or Boss. Changes are contained.

How Much Does This Pattern Cost to Run?

API token costs in 2026 make this pattern extremely affordable. Claude Haiku runs at $1.00 per million input tokens and $5.00 per million output tokens. Claude Sonnet runs at $3.00 per million input tokens and $15.00 per million output tokens. A typical 4-agent run (Boss on Haiku, Researcher on Haiku, Builder on Sonnet, Publisher on Haiku) costs $0.02 to $0.05 per complete task execution.

At 8 tasks per day, the total API cost is roughly $0.16 to $0.40 per day, or $5 to $12 per month. Add hosting and infrastructure ($50 to $200/month for a VPS and database) and you are running a complete multi-agent system for under $250/month in operational costs.

This is why the AI agent pricing most agencies quote is dramatically inflated. The actual infrastructure cost is a fraction of what agencies charge. The value is in the architecture, the tuning, and the operational knowledge, not the raw compute.

For a detailed look at how these economics translate into lead generation pricing, see our AI Lead Gen Cost breakdown.

How Do You Transition From a Single Agent to the 4-Agent Pattern?

Start with what you have. If you already run a single agent that does research, enrichment, and delivery in one pass, split it incrementally:

Extract the information gathering step into a standalone Researcher agent
Extract the data transformation step into a standalone Builder agent
Extract the delivery step into a standalone Publisher agent
Create a Boss agent that orchestrates all three

Do not rebuild from scratch. Migrate one agent at a time. Run the old single-agent system in parallel until the new pattern is validated. This approach takes 2 to 4 weeks for a system handling 10 to 50 tasks per day.

What Frameworks Support Multi-Agent Orchestration in 2026?

The leading frameworks for multi-agent orchestration are LangGraph (from LangChain), CrewAI, AutoGen (Microsoft), and Semantic Kernel. All support defining agent roles, specifying tools, and coordinating work through a central orchestrator. For most business applications, LangGraph or CrewAI provide the right balance between simplicity and power.

The framework matters less than the pattern. A well-designed 4-agent system built with basic Python scripts and API calls will outperform a poorly designed system built on the fanciest framework. Start with the architecture. Choose the framework second.

Frequently Asked Questions

Can I use cheaper models like Haiku for all four agents?

For most business applications, yes. Researcher and Publisher rarely need heavy reasoning, so Haiku works well. Builder benefits from Sonnet when enrichment requires judgment calls (scoring, classification, personalization). Boss can run on Haiku for simple routing or Sonnet for complex decomposition.

What happens when one agent fails mid-run?

The Boss logs the failure, skips that agent's output, and continues with whatever results are available. If the Researcher fails, the run stops (no data to work with). If the Builder fails, the Boss can retry with the same input. If the Publisher fails, the enriched data is preserved and delivery can be retried. Failures never cascade.

Does the Boss agent become a bottleneck at scale?

No. The Boss makes routing decisions, which are lightweight API calls taking 5 to 15 seconds. A single Boss instance can orchestrate 100 parallel worker tasks. The workers (Researchers, Builders, Publishers) do the heavy lifting. You scale workers, not the Boss.

Can non-LLM agents participate in this pattern?

Absolutely. A Researcher can be a Python script calling an API. A Publisher can be a webhook to Zapier or a direct database write. LLMs are valuable for reasoning tasks (Boss decomposition, Builder enrichment). Execution tasks (search, delivery) often work better as deterministic scripts.

How is this different from microservices?

Microservices are deterministic: same input always produces the same output. Agents are flexible: they reason about what to do based on context. Use microservices when the task is always identical. Use agents when the task varies and requires judgment. Many production systems combine both, with agents handling variable tasks and microservices handling predictable ones.

What Should You Build Next?

The 4-agent pattern is the foundation. Once it is running reliably, the next improvements are:

Monitoring: Track latency, cost, and failure rate per agent. Build a dashboard.
Memory: Give the Researcher persistent memory so it skips previously found records.
Parallel execution: Run multiple Researchers simultaneously for different data sources.
Human in the loop: Let the Boss escalate uncertain decisions to a Slack channel before proceeding.

All of these additions are easier with the 4-agent pattern than with a single-agent system because each improvement targets a specific agent without affecting the others.

Related guides:

[How to Build an Autonomous AI Agent System (Architecture Guide)](/guides/autonomous-agent-architecture)
[How AI Lead Generation Actually Works](/guides/ai-lead-generation-how-it-works)

Want us to build a multi-agent system for your business? Get a free territory brief

---

Written by Aderson Rocha, founder of Sold By Agents. Aderson builds autonomous AI agent systems for service businesses.

Last updated: 2026-05-26

Ready to stop overpaying for leads?

Get a free territory brief for your market.

Get Your Free Territory Brief