Your Agents Are Multiplying. Who Is in Charge?
Multi-Agent Orchestration goes GA on June 15 with the Summer '26 release. One agent answering questions was a demo. A team of agents that delegate, hand off, and collaborate is an architecture decision. Here is how the supervisor-specialist model actually works, where it breaks, and how to design for it.
Multi-Agent Orchestration lets you build a team of specialist agents behind one primary agent that customers talk to. The Atlas Reasoning Engine reads each agent's description and routes work to the best fit. The hard part is not turning it on. It is deciding when to split a monolith into specialists, writing agent descriptions precise enough for Atlas to route correctly, and preventing context from leaking at the seam where one agent hands off to another.
In This Article
Here is the pattern I see over and over. A team ships their first Agentforce agent. It handles order status, answers a few product questions, maybe files a case. The demo lands, leadership is happy, and the obvious next move is to make it do more. So they add returns. Then billing. Then technical triage. Then a few HR questions because why not, it is already there.
Six months later that agent has forty actions, an instruction set the length of a short novel, and a habit of picking the wrong action at the worst possible moment. Nobody broke anything. They just kept adding, and the agent quietly crossed a line.
That line has a name. It is the point where capability sprawl outruns the reasoning engine's ability to choose well.
The Atlas Reasoning Engine picks an action by reasoning over the descriptions of everything available to it. Give it ten focused actions and it routes cleanly. Give it fifty overlapping ones and you dilute its intent. The model is not getting dumber. You are handing it an impossible multiple-choice question and asking it to ace the test every time.
There is a second force at work, and it is organizational rather than technical. The Sales Ops team owns lead qualification. The Support team owns ticket resolution. Finance owns collections logic. When you cram all of that into one monolithic agent, you have also crammed three teams into one set of instructions they all have to edit carefully without stepping on each other. That is a governance problem disguised as a configuration file.
Multi-Agent Orchestration, going GA on June 15 with the Summer '26 release, is Salesforce's answer to both forces at once. Instead of one agent that does everything, you build a team of specialists, each with a tight scope, sitting behind a single primary agent that the customer actually talks to. The split solves the reasoning problem and the ownership problem in the same move.
The core idea is simple enough to explain over coffee. A customer talks to one agent, the primary agent. That agent does not try to answer everything itself. It figures out what the customer actually needs and hands the task to whichever specialist is built for it. The specialist does the work, passes the answer back, and the primary agent delivers it. The customer never knows a handoff happened.
Think of it like a good front-desk manager at a busy office. You walk in, explain your problem once, and they connect you to the right person. You do not get routed through a phone tree. You do not repeat yourself five times. That is the experience the primary agent is built to give.
The Three Roles in Play
The single point of contact. It holds the conversation, interprets intent, and routes work to specialists. The customer interacts only with this agent and keeps one continuous thread.
Focused workers grounded in specific data and equipped with a tight library of actions. Each handles one domain well: returns, billing, scheduling. They report results back to the primary.
The router. It reviews each agent's description, instructions, and available actions, then decides which specialist is the best fit for the current task and routes intelligently.
The detail that matters most for architects is how Atlas decides. It does not follow a fixed decision tree or match keywords. It reasons from the descriptions every time, fresh, based on the task in front of it. The enhanced Atlas architecture that powers this routing delivers roughly 50% lower latency than earlier versions, which is what keeps the supervisor pattern from becoming a bottleneck at enterprise scale.
This changes how you write agent descriptions. In a single-agent world, the description was mostly a label. In a multi-agent world, it is the thing that determines whether work lands on the right specialist or the wrong one. Be explicit about what each agent handles. Be just as explicit about what it does not handle. A description that says "handles customer inquiries" is useless to Atlas. A description that says "handles return authorizations, refund status, and exchange eligibility, but not billing disputes or shipping questions" is something the engine can actually route against.
Treat the boundary between two specialists as the riskiest part of the design. If two agents both plausibly handle a request, Atlas has to guess, and guessing is where routing errors live. Tune your descriptions so the boundaries are crisp and non-overlapping. The clearer the edges, the more reliable the routing.
Salesforce's own architecture guidance lays out a set of orchestration archetypes, and they are worth knowing by name because they map to genuinely different problems. You do not pick one and stick with it forever. You compose them as your needs grow.
One capable agent across a range of topics. For common, contained use cases this is the right starting point. Do not split prematurely. Most solutions begin here and earn their way into something larger.
A supervisor agent acts as the front door, routing to specialist agents that all live in one org with shared governance and data. This is the workhorse pattern for most enterprises and the one Summer '26 makes click-to-configure.
A supervisor in one org delegates to an agent in another org over the A2A protocol. The handshake carries org-level trust, identity flow, and shared conversation context. Useful when business units run separate orgs but need to collaborate.
Orchestration starts from a non-Salesforce entry point, so a neutral external orchestrator does the reasoning and routing. It keeps the experience in the domain system while centralizing policy and observability in MuleSoft.
Within SOMA, you also choose how specialists collaborate, and this is where the design gets interesting. Two collaboration shapes cover most real systems.
- The supervisor passes work to one specialist, waits for the result, then moves to the next
- Best when steps depend on each other: qualify the lead, then enrich it, then schedule outreach
- Easier to reason about and debug because the path is linear
- Use when each step needs the output of the step before it
- The supervisor dispatches a task to several specialists in parallel and synthesizes their findings
- Best when a decision needs independent inputs gathered at the same time
- Example: a new contract triggers legal review, credit check, and inventory check simultaneously
- Use when sub-tasks are independent and speed matters
A practical example brings it together. Imagine a high-value contract being drafted. The orchestrator swarms the task: a legal agent reviews the clauses, a credit agent checks the customer's recent payment history in Data 360, and a supply chain agent confirms inventory for the quoted items. All three run at once, report back, and the orchestrator assembles a single answer. No human stitched those three systems together. The architecture did.
The four patterns are building blocks, not a menu where you pick one. Real solutions compose them. You might run SOMA internally for your service org, reach into a separate finance org over MOMA for billing, and call a partner's logistics agent over multi-vendor A2A, all inside one customer conversation. The skill is knowing which block solves which coordination problem, not memorizing the acronyms.
The Agent2Agent protocol, or A2A, is the piece that lets multi-agent systems extend past the edge of Salesforce. It is an open standard that defines how agents find each other, exchange structured information, and collaborate securely across platforms. Salesforce did not invent a proprietary handshake here. They adopted an open one, which is the right call for a feature whose whole point is crossing boundaries.
The mechanics are worth understanding at a high level. A2A uses a client-server model: a client agent orchestrates the work, and a remote agent processes the task through structured, stateful messaging. The protocol defines how agents communicate, not how they think. A specialist's internal reasoning stays private. That separation is what lets a Salesforce orchestrator delegate to a non-Salesforce agent without either side exposing its internals.
Agents advertise what they can do in a structured way, so an orchestrator can locate a capable agent without a hardcoded integration pointing at a specific endpoint.
Built on established web standards, A2A carries secure transport, authentication, and structured communication. The handoff includes identity and shared conversation context, not just a payload.
The protocol standardizes communication while keeping each agent's internal logic private. Vendors collaborate without exposing proprietary reasoning models or internal architecture.
Here is the honest framing for most Salesforce teams in mid-2026: A2A is infrastructure to understand, not infrastructure to deploy on day one. The cross-platform scenario, where a procurement agent on one cloud talks to a logistics agent on another and a CRM agent in Agentforce, is real and it is coming. But it requires mature agents already running in multiple systems. That is not a first-week configuration. It is a target state you design toward.
If you are standing up your first multi-agent system, build it as SOMA inside one org. Get the supervisor-specialist split right, tune your descriptions, and learn how routing behaves with real traffic. Reach for A2A and cross-org patterns once you have agents worth connecting and a concrete reason to connect them. The protocol will still be there.
A2A pairs naturally with MCP, the Model Context Protocol, and the two solve different problems. MCP gives an agent reach into tools and data beyond Salesforce. A2A gives agents the ability to delegate to each other within a trusted orchestration layer. One extends what a single agent can touch. The other extends how many agents can work together. You will use both, and knowing which one a given problem calls for is part of the job now.
Multi-agent systems introduce a failure mode that single agents simply do not have. It shows up at the seam, the moment work passes from one agent to another. This is where context can quietly drop, and when it drops, the downstream specialist makes a decision based on an incomplete picture.
- When the primary hands off, the specialist needs the full picture, not just the last message
- Lose the context and the specialist asks the customer to repeat themselves, killing the experience
- Atlas 3.0 integrates tighter with memory and context preservation across handoffs to address this directly
- Design: test every handoff path explicitly, not just the happy path
- When two specialists both look like a fit, Atlas has to choose and can choose wrong
- Overlapping agent descriptions are the usual root cause
- Adding more specialists without sharpening boundaries makes this worse, not better
- Design: keep descriptions crisp, non-overlapping, and explicit about exclusions
Governance Scales With Agent Count
Every specialist you add is another identity that can read and write data. In a single-agent world you governed one thing. In a multi-agent world you govern a fleet. Salesforce builds the controls in: orchestration runs within Einstein Trust Layer guardrails, agents operate against a shared data model, and Agentforce Observability lets you monitor performance, trace interactions, and enforce policy across the whole system. The platform gives you the levers. You still have to pull them.
Before You Go Live With Multiple Agents
None of these realities are reasons to wait. They are the design checklist. The teams that handle the seam, sharpen their descriptions, and govern their fleet from day one get something genuinely powerful: a coordinated set of agents that resolves complex, cross-domain work in a single conversation. That is a capability single agents could never reach, and it is available now.
The best multi-agent rollouts I have seen share a rhythm. They do not start by drawing an org chart of fifteen agents. They start with one agent, let it earn its complexity, and split only when the signals show up. Then they grow the team one specialist at a time, testing each seam as they go.
Salesforce's direction makes the bet clear. The architecture guidance says Agentforce was designed for multi-agent orchestration from the start, with that orchestration sitting underneath the Atlas Reasoning Engine itself. The Summer '26 release moves it from something you assembled with custom Apex into something you configure. Gartner expects multi-agent systems to handle 15% of day-to-day work decisions autonomously by 2028. The platform is being built for a world where teams of agents are the default, not the exception.
For architects, the question shifts. It is no longer "should we build an agent." It is "where do the boundaries between our agents belong, and who owns each one." That is an org-design question as much as a technical one, and it is the question worth spending real time on before anyone touches a config screen.
Start with one agent that does one thing well. Let the seams reveal themselves. Then build the team your business actually needs, one well-described specialist at a time.