Your Agents Are Multiplying. Who Is in Charge?

Enterprise Architecture · Agentforce · Summer '26

Your Agents Are Multiplying. Who Is in Charge?

Multi-Agent Orchestration goes GA on June 15 with the Summer '26 release. One agent answering questions was a demo. A team of agents that delegate, hand off, and collaborate is an architecture decision. Here is how the supervisor-specialist model actually works, where it breaks, and how to design for it.

Reading time: ~11 minutes | Published: June 2026 | Published By: Sandip Patel, Salesforce Architect
SUMMER '26 GA June 15 Multi-Agent Orchestration goes generally available
ATLAS ENGINE 50% Lower latency in the enhanced Atlas Reasoning Engine
GARTNER 2028 15% Of daily work decisions handled by multi-agent systems
A2A PROTOCOL Open Standard for agents to discover and delegate to each other
TL;DR

Multi-Agent Orchestration lets you build a team of specialist agents behind one primary agent that customers talk to. The Atlas Reasoning Engine reads each agent's description and routes work to the best fit. The hard part is not turning it on. It is deciding when to split a monolith into specialists, writing agent descriptions precise enough for Atlas to route correctly, and preventing context from leaking at the seam where one agent hands off to another.

1
When One Agent Stops Being Enough
A single agent works beautifully right up until the day it doesn't.

Here is the pattern I see over and over. A team ships their first Agentforce agent. It handles order status, answers a few product questions, maybe files a case. The demo lands, leadership is happy, and the obvious next move is to make it do more. So they add returns. Then billing. Then technical triage. Then a few HR questions because why not, it is already there.

Six months later that agent has forty actions, an instruction set the length of a short novel, and a habit of picking the wrong action at the worst possible moment. Nobody broke anything. They just kept adding, and the agent quietly crossed a line.

That line has a name. It is the point where capability sprawl outruns the reasoning engine's ability to choose well.

Why This Happens

The Atlas Reasoning Engine picks an action by reasoning over the descriptions of everything available to it. Give it ten focused actions and it routes cleanly. Give it fifty overlapping ones and you dilute its intent. The model is not getting dumber. You are handing it an impossible multiple-choice question and asking it to ace the test every time.

There is a second force at work, and it is organizational rather than technical. The Sales Ops team owns lead qualification. The Support team owns ticket resolution. Finance owns collections logic. When you cram all of that into one monolithic agent, you have also crammed three teams into one set of instructions they all have to edit carefully without stepping on each other. That is a governance problem disguised as a configuration file.

Multi-Agent Orchestration, going GA on June 15 with the Summer '26 release, is Salesforce's answer to both forces at once. Instead of one agent that does everything, you build a team of specialists, each with a tight scope, sitting behind a single primary agent that the customer actually talks to. The split solves the reasoning problem and the ownership problem in the same move.

SIGNAL 1
40+
Actions on one agent is usually past the point where routing accuracy starts to slip
SIGNAL 2
3+
Teams editing the same instruction set is a governance smell, not a feature
SIGNAL 3
2x
Distinct business domains in one agent (Sales and Service) signals time to split
THE FIX
1
Primary agent as the front door, specialists behind it doing focused work
2
How the Supervisor-Specialist Model Works
One front door, many specialists, and a reasoning engine doing the routing

The core idea is simple enough to explain over coffee. A customer talks to one agent, the primary agent. That agent does not try to answer everything itself. It figures out what the customer actually needs and hands the task to whichever specialist is built for it. The specialist does the work, passes the answer back, and the primary agent delivers it. The customer never knows a handoff happened.

Think of it like a good front-desk manager at a busy office. You walk in, explain your problem once, and they connect you to the right person. You do not get routed through a phone tree. You do not repeat yourself five times. That is the experience the primary agent is built to give.

The Three Roles in Play

🏠
The Primary Agent

The single point of contact. It holds the conversation, interprets intent, and routes work to specialists. The customer interacts only with this agent and keeps one continuous thread.

🤖
The Specialist Agents

Focused workers grounded in specific data and equipped with a tight library of actions. Each handles one domain well: returns, billing, scheduling. They report results back to the primary.

🧠
The Atlas Reasoning Engine

The router. It reviews each agent's description, instructions, and available actions, then decides which specialist is the best fit for the current task and routes intelligently.

The detail that matters most for architects is how Atlas decides. It does not follow a fixed decision tree or match keywords. It reasons from the descriptions every time, fresh, based on the task in front of it. The enhanced Atlas architecture that powers this routing delivers roughly 50% lower latency than earlier versions, which is what keeps the supervisor pattern from becoming a bottleneck at enterprise scale.

"Your agent descriptions are not documentation for humans. They are the inputs Atlas uses to route in real time. Write them like they are load-bearing, because they are."

This changes how you write agent descriptions. In a single-agent world, the description was mostly a label. In a multi-agent world, it is the thing that determines whether work lands on the right specialist or the wrong one. Be explicit about what each agent handles. Be just as explicit about what it does not handle. A description that says "handles customer inquiries" is useless to Atlas. A description that says "handles return authorizations, refund status, and exchange eligibility, but not billing disputes or shipping questions" is something the engine can actually route against.

Architect's Tip

Treat the boundary between two specialists as the riskiest part of the design. If two agents both plausibly handle a request, Atlas has to guess, and guessing is where routing errors live. Tune your descriptions so the boundaries are crisp and non-overlapping. The clearer the edges, the more reliable the routing.

3
The Four Orchestration Patterns
Same building blocks, four different ways to wire them together

Salesforce's own architecture guidance lays out a set of orchestration archetypes, and they are worth knowing by name because they map to genuinely different problems. You do not pick one and stick with it forever. You compose them as your needs grow.

1
START HERE
The Monolith (Single Agent)

One capable agent across a range of topics. For common, contained use cases this is the right starting point. Do not split prematurely. Most solutions begin here and earn their way into something larger.

2
SINGLE ORG
SOMA: Single-Org Multi-Agent

A supervisor agent acts as the front door, routing to specialist agents that all live in one org with shared governance and data. This is the workhorse pattern for most enterprises and the one Summer '26 makes click-to-configure.

3
CROSS ORG
MOMA: Multi-Org Multi-Agent

A supervisor in one org delegates to an agent in another org over the A2A protocol. The handshake carries org-level trust, identity flow, and shared conversation context. Useful when business units run separate orgs but need to collaborate.

4
MULTI VENDOR
Multi-Vendor A2A (MuleSoft-led)

Orchestration starts from a non-Salesforce entry point, so a neutral external orchestrator does the reasoning and routing. It keeps the experience in the domain system while centralizing policy and observability in MuleSoft.

Within SOMA, you also choose how specialists collaborate, and this is where the design gets interesting. Two collaboration shapes cover most real systems.

Sequential
One handoff at a time
  • The supervisor passes work to one specialist, waits for the result, then moves to the next
  • Best when steps depend on each other: qualify the lead, then enrich it, then schedule outreach
  • Easier to reason about and debug because the path is linear
  • Use when each step needs the output of the step before it
Swarm
Many specialists at once
  • The supervisor dispatches a task to several specialists in parallel and synthesizes their findings
  • Best when a decision needs independent inputs gathered at the same time
  • Example: a new contract triggers legal review, credit check, and inventory check simultaneously
  • Use when sub-tasks are independent and speed matters

A practical example brings it together. Imagine a high-value contract being drafted. The orchestrator swarms the task: a legal agent reviews the clauses, a credit agent checks the customer's recent payment history in Data 360, and a supply chain agent confirms inventory for the quoted items. All three run at once, report back, and the orchestrator assembles a single answer. No human stitched those three systems together. The architecture did.

Real Talk

The four patterns are building blocks, not a menu where you pick one. Real solutions compose them. You might run SOMA internally for your service org, reach into a separate finance org over MOMA for billing, and call a partner's logistics agent over multi-vendor A2A, all inside one customer conversation. The skill is knowing which block solves which coordination problem, not memorizing the acronyms.

4
A2A and the Multi-Vendor Question
What the Agent2Agent protocol actually buys you, and when

The Agent2Agent protocol, or A2A, is the piece that lets multi-agent systems extend past the edge of Salesforce. It is an open standard that defines how agents find each other, exchange structured information, and collaborate securely across platforms. Salesforce did not invent a proprietary handshake here. They adopted an open one, which is the right call for a feature whose whole point is crossing boundaries.

The mechanics are worth understanding at a high level. A2A uses a client-server model: a client agent orchestrates the work, and a remote agent processes the task through structured, stateful messaging. The protocol defines how agents communicate, not how they think. A specialist's internal reasoning stays private. That separation is what lets a Salesforce orchestrator delegate to a non-Salesforce agent without either side exposing its internals.

Discovery
Agents Find Each Other

Agents advertise what they can do in a structured way, so an orchestrator can locate a capable agent without a hardcoded integration pointing at a specific endpoint.

Secure Messaging
Structured Handoff

Built on established web standards, A2A carries secure transport, authentication, and structured communication. The handoff includes identity and shared conversation context, not just a payload.

Opaque Reasoning
Privacy by Design

The protocol standardizes communication while keeping each agent's internal logic private. Vendors collaborate without exposing proprietary reasoning models or internal architecture.

Here is the honest framing for most Salesforce teams in mid-2026: A2A is infrastructure to understand, not infrastructure to deploy on day one. The cross-platform scenario, where a procurement agent on one cloud talks to a logistics agent on another and a CRM agent in Agentforce, is real and it is coming. But it requires mature agents already running in multiple systems. That is not a first-week configuration. It is a target state you design toward.

Where to Start

If you are standing up your first multi-agent system, build it as SOMA inside one org. Get the supervisor-specialist split right, tune your descriptions, and learn how routing behaves with real traffic. Reach for A2A and cross-org patterns once you have agents worth connecting and a concrete reason to connect them. The protocol will still be there.

A2A pairs naturally with MCP, the Model Context Protocol, and the two solve different problems. MCP gives an agent reach into tools and data beyond Salesforce. A2A gives agents the ability to delegate to each other within a trusted orchestration layer. One extends what a single agent can touch. The other extends how many agents can work together. You will use both, and knowing which one a given problem calls for is part of the job now.

5
The Seam Problem and Other Realities
What to plan for before you put this in front of customers

Multi-agent systems introduce a failure mode that single agents simply do not have. It shows up at the seam, the moment work passes from one agent to another. This is where context can quietly drop, and when it drops, the downstream specialist makes a decision based on an incomplete picture.

Context Loss at the Seam
The signature multi-agent bug
  • When the primary hands off, the specialist needs the full picture, not just the last message
  • Lose the context and the specialist asks the customer to repeat themselves, killing the experience
  • Atlas 3.0 integrates tighter with memory and context preservation across handoffs to address this directly
  • Design: test every handoff path explicitly, not just the happy path
Routing Ambiguity
Two agents, one request
  • When two specialists both look like a fit, Atlas has to choose and can choose wrong
  • Overlapping agent descriptions are the usual root cause
  • Adding more specialists without sharpening boundaries makes this worse, not better
  • Design: keep descriptions crisp, non-overlapping, and explicit about exclusions

Governance Scales With Agent Count

Every specialist you add is another identity that can read and write data. In a single-agent world you governed one thing. In a multi-agent world you govern a fleet. Salesforce builds the controls in: orchestration runs within Einstein Trust Layer guardrails, agents operate against a shared data model, and Agentforce Observability lets you monitor performance, trace interactions, and enforce policy across the whole system. The platform gives you the levers. You still have to pull them.

Before You Go Live With Multiple Agents

!
Scope each specialist's permissions to exactly what it needs to read and write, nothing broader. Treat each agent as its own identity.
!
Write and review every agent description for crisp, non-overlapping boundaries. This is your routing accuracy, not paperwork.
i
Trace context across every handoff path in a sandbox. Confirm the receiving specialist gets what it needs to act correctly.
i
Turn on Observability and set up monitoring before launch, not after the first incident. You want the traces ready when you need them.
Keep a human-in-the-loop checkpoint on high-stakes actions until the system has earned your trust with real traffic.
The Upside of Doing This Right

None of these realities are reasons to wait. They are the design checklist. The teams that handle the seam, sharpen their descriptions, and govern their fleet from day one get something genuinely powerful: a coordinated set of agents that resolves complex, cross-domain work in a single conversation. That is a capability single agents could never reach, and it is available now.

6
A Design Approach That Holds Up
How to move from one agent to a team without painting yourself into a corner

The best multi-agent rollouts I have seen share a rhythm. They do not start by drawing an org chart of fifteen agents. They start with one agent, let it earn its complexity, and split only when the signals show up. Then they grow the team one specialist at a time, testing each seam as they go.

Step 1
Ship the Monolith
One agent, one clear use case, real traffic
Step 2
Watch for Signals
Action sprawl, routing misses, shared ownership
Step 3
Split Into SOMA
Primary plus focused specialists in one org
Step 4
Extend With A2A
Cross-org and multi-vendor when there is a reason

Salesforce's direction makes the bet clear. The architecture guidance says Agentforce was designed for multi-agent orchestration from the start, with that orchestration sitting underneath the Atlas Reasoning Engine itself. The Summer '26 release moves it from something you assembled with custom Apex into something you configure. Gartner expects multi-agent systems to handle 15% of day-to-day work decisions autonomously by 2028. The platform is being built for a world where teams of agents are the default, not the exception.

For architects, the question shifts. It is no longer "should we build an agent." It is "where do the boundaries between our agents belong, and who owns each one." That is an org-design question as much as a technical one, and it is the question worth spending real time on before anyone touches a config screen.

Start with one agent that does one thing well. Let the seams reveal themselves. Then build the team your business actually needs, one well-described specialist at a time.

7
Frequently Asked Questions
Common questions on routing, A2A, and getting started with multi-agent systems
When should I split a single agent into multiple agents?
Watch for three signals: a single agent accumulating dozens of actions until routing accuracy slips, multiple teams editing one shared instruction set, and two genuinely distinct business domains living in one agent. Any of these means it is time to consider the supervisor-specialist split. If none are present, stay with the monolith. Splitting too early adds coordination overhead you do not need yet.
How does the primary agent know which specialist to use?
The Atlas Reasoning Engine reads each agent's description, instructions, and available actions, then reasons about which specialist best fits the current task. It is not keyword matching or a fixed decision tree. Because of this, your agent descriptions directly determine routing accuracy. Write them to be precise about what each agent handles and explicit about what it does not.
What is the difference between A2A and MCP?
They solve different problems. MCP, the Model Context Protocol, gives an agent reach into tools and data sources beyond Salesforce. A2A, the Agent2Agent protocol, lets agents delegate work to each other across platforms within a trusted orchestration layer. MCP extends what one agent can touch. A2A extends how many agents can collaborate. Most mature systems use both.
Do I need A2A and cross-org patterns to get started?
No. Most teams should begin with SOMA, the single-org multi-agent pattern, where a primary agent routes to specialists all living in one org. A2A and multi-vendor orchestration matter once you have mature agents in multiple systems and a concrete reason to connect them. Treat cross-platform orchestration as a target state you design toward, not a first-week task.
What is the most common multi-agent failure, and how do I prevent it?
Context loss at the seam, the point where work hands off from one agent to another. If the receiving specialist does not get the full picture, it makes decisions on incomplete information or asks the customer to repeat themselves. Atlas 3.0 improves memory and context preservation across handoffs, but you still need to test every handoff path explicitly in a sandbox before going live.
How does governance change when I run many agents instead of one?
Every specialist is a separate identity that can read and write data, so you are governing a fleet rather than a single agent. Scope each agent's permissions to exactly what it needs, run orchestration within Einstein Trust Layer guardrails, and turn on Agentforce Observability to trace interactions and enforce policy across the whole system. Set monitoring up before launch, not after the first incident.

Leave a reply

Your email address will not be published. Required fields are marked *