Salesforce Agentforce review: where it works, where it stalls, and what to build instead

A practitioner review of Salesforce Agentforce in 2026. Real deployment failure modes, the per-conversation pricing trap, and when Claude Code subagents replace it.

Salesforce made Agentforce the centerpiece of their 2025 product roadmap and the centerpiece of every QBR through 2026. Every Salesforce AE is pitching it. Most deployments are stalling at configuration. The gap between what's promised in a Dreamforce keynote and what ships in production is wide.

AGENTFORCE FIT MAP · BASED ON CLIENT DEPLOYMENTS Bounded service workflows: fits. Sales conversations: doesn't.
WHERE IT WORKS
  • Service deflection Refunds, returns, password resets, billing. Resolution rate 70-85%.
  • Field service triage Technicians needing on-the-fly answers. Resolution 60-75%.
  • Bounded internal Q&A Standard CRM queries on data Salesforce already owns.
WHERE IT STALLS
  • Complex sales motions Open-ended buyer questions, competitive positioning, pricing flex.
  • Cross-system orchestration Workflows pulling from HubSpot, Pendo, billing, etc.
  • High-volume per-conversation pricing 100K+ monthly conversations make the per-conv model break.

Pattern from active deployments. The teams getting the most value run Agentforce for service workflows and route revenue work to Claude Code subagents via Salesforce API. Not either/or.

This is a practitioner review of where Agentforce earns its license, where it doesn't, and the cost math nobody puts in the AE deck. We deploy it for clients who want it deployed, and we tell clients to skip it when the use case doesn't fit. Both happen.

What Agentforce is, in plain English

Agentforce is a layer on top of Salesforce that lets you configure AI agents tied to specific business workflows. You define topics (the agent's domain), actions (what it can do), and knowledge articles (what it can read). The agent runs inside the Salesforce trust layer, which constrains what data it can touch and what actions it can take.

The pricing is per conversation. Not per seat. Each agent interaction (a customer service ticket, a sales question answered) draws against a conversation pool you license annually. List pricing is roughly $2 per conversation, with volume discounts that bring high-tier customers to $0.50-$1.00. The conversation cost is separate from the underlying Salesforce edition license, which itself has to be Einstein 1 or a specific add-on SKU.

For service organizations replacing offshore call centers, the math can pencil. For sales organizations replacing AI SDR contracts, the math rarely does, and the architecture rarely fits.

Where Agentforce works

Three categories of workflow. Each has shipped in production at customers we've worked with.

Service deflection

The strongest use case. Refund authorization, order status, returns, password resets, basic billing questions, account changes. The answer is bounded by clear policy. The data lives in Salesforce. The trust layer prevents the agent from wandering into territory it shouldn't. Resolution rates in this category land at 70-85%, which is competitive with offshore agent baselines.

The replacement economics are real. A $40K-$60K offshore agent handles ~3,000 tickets a month. Agentforce at $1 per conversation handles the same load at $36K annually. The license cost roughly halves the per-ticket spend, plus the latency drops from 15-minute response time to seconds.

Field service triage

Less common but high-value. Field service technicians needing on-the-fly answers to install or repair questions. Agentforce reads the knowledge base, technical documentation, and prior service history. The agent answers in the technician's mobile app. Resolution rates land at 60-75%, with the rest escalating to a senior technician. Where this works, it cuts truck rolls.

Internal CRM Q&A

Reps asking "what's the latest activity on Acme account" or "summarize the last 30 days of deal motion." Agentforce reads CRM data and answers in plain English. Adoption is mixed. Reps who already understand their pipeline don't need it. Reps who don't won't trust it. The use case has more demos than production traffic.

Where Agentforce stalls

Three failure modes show up across most deployments. They're not bugs in the product; they're mismatches between the architecture and the use case.

Complex sales motions

Sales conversations are open-ended. The buyer asks about competitive positioning, pricing flex, deployment specifics, customer references. The answers don't live in a knowledge article. The trust layer blocks the agent from reading external sources. The agent escalates, and the rep gets a notification instead of a resolution.

For sales work, custom Claude Code subagents connected to Salesforce via API consistently outperform Agentforce. The subagent reads CRM data, runs research across external sources, drafts the response, and posts it back to the Salesforce activity feed. The architecture is the same shape as the AI SDR replacement work we cover in the AI SDR failure analysis.

Cross-system orchestration

Anything requiring data from outside Salesforce. Marketing automation data in HubSpot or Marketo. Product usage data in Pendo or Mixpanel. Financial data in NetSuite or QuickBooks. Agentforce can integrate via APIs, but the configuration friction is high and the latency stacks. By the time the agent has fetched data from three external systems, the conversation budget for that turn has burned.

For cross-system work, a Claude Code subagent that reads from all relevant systems in parallel and posts a single answer back to Salesforce is faster, cheaper, and easier to debug.

Per-conversation pricing at scale

The cost model becomes hostile at high volume. A team running 100,000 sales conversations a month at $1 each is at $1.2M annually before any other cost. The same workload on a Claude Code subagent stack runs $80K-$150K in compute and engineering retainer, with no per-conversation tax.

The conversation pricing earns its margin in service workflows where the alternative is human labor at $15-$25 per ticket. It loses badly in revenue workflows where the alternative is fixed-cost compute.

The deployment sequence that works

For teams committed to deploying Agentforce, the sequence below produces the highest hit rate. Skipping any of these steps is the most common failure pattern.

Week one. Data model audit. Pull every object Agentforce will touch. Check field hygiene. Fix obvious gaps. Most Agentforce deployments fail because the data underneath is dirty, not because the configuration is wrong.

Week two. Topic configuration. Each agent gets one topic, scoped narrowly. "Refund processing" not "customer service." "Order status" not "order help." Narrow topics produce reliable agents. Broad topics produce escalation rates above 50%.

Week three. Knowledge article preparation. The articles the agent will read need to be current, structured, and granular. Most teams discover their knowledge base is six months out of date. The article rewrite is usually the highest-effort phase of the deployment.

Week four. Trust layer tuning. Action permissions, data masking rules, escalation triggers. This is where Agentforce blocks useful work in default config. Tune carefully against real cases.

Weeks five and six. Production testing. Real cases through the agent, results measured against a control. Resolution rate, escalation rate, customer satisfaction. Tune topics and articles based on what fails.

The integration pattern

The teams getting the most value from Agentforce in 2026 aren't running it standalone. They're running it for service workflows where it's strong and integrating Claude Code subagents for everything else. The integration looks like this.

Agentforce handles inbound service tickets. When the topic doesn't match Agentforce's scope, the case routes to a Claude Code subagent via Salesforce flow. The subagent reads CRM data plus external sources, drafts a response, and posts back to the Salesforce activity feed. From the customer's perspective, the experience is unified. From the GTM team's perspective, each agent runs the workflow it's best at.

We build this integration as a fixed-fee engagement. The Agentforce configuration is the first phase; the Claude Code subagent buildout is the second. Most engagements ship in 4-6 weeks.

The honest bottom line

Agentforce is real software solving real problems for service-side workflows. It's not the universal AI agent platform Salesforce is selling, and the per-conversation pricing breaks the unit economics for most sales applications. The deployments that work treat it as a service tool and route revenue work elsewhere.

Before signing the renewal, audit which workflows are actually running on Agentforce, what your real per-conversation cost has been, and whether the topics you've configured map to the use cases that pencil. If the answer is "we configured it for everything and use it for service tickets," you're paying for capabilities you're not using. The contract restructure is worth the conversation.

For sales and revenue work, the right call in 2026 is increasingly a custom Claude Code build connected to Salesforce via API. Lower cost, more flexible, owned by the buyer. We cover the architecture in the Claude Code for GTM guide.

Questions.

Do we need Salesforce Unlimited or Einstein 1 Edition for Agentforce?

Agentforce requires Einstein 1 Edition or specific add-on SKUs depending on the agent type. Service Cloud Agentforce works on Service Cloud Einstein editions; Sales Agentforce requires Sales Cloud Einstein. The per-conversation cost is separate from the license. Before deploying, audit whether your current contract includes the right SKU and whether the per-conversation pricing pencils for your use case.

What's the realistic deployment timeline?

Two to six weeks for a working deployment, though Salesforce will tell you it's faster. Week one is data model audit. Week two is topic and action configuration. Weeks three to four are knowledge article preparation and trust layer guardrail tuning. Weeks five to six are testing against real cases. Skipping the data audit is the single most common failure mode.

Where does Agentforce earn its license?

Structured service workflows. Refund authorization, order status, returns, password resets, basic billing questions. Anywhere the answer is bounded by clear policy, agent topics constrain the agent well and resolution rates land at 70-85%. The license earns its money in service deflection where the alternative is a $40K offshore agent salary.

Where does Agentforce stall?

Complex sales motions, multi-step research, anything requiring cross-system orchestration outside the Salesforce ecosystem. The trust layer blocks useful actions. Topics drift on edge cases. Customers ask about pricing or competitive positioning and the agent escalates because the knowledge base doesn't contain those answers. For sales-side work, custom Claude Code subagents connected to Salesforce via API consistently outperform Agentforce at lower cost.

How do you decide between Agentforce and a custom build?

Three questions. Is the workflow bounded by policy or open-ended? Bounded favors Agentforce. Does the answer live entirely inside Salesforce data or does it require external research? Internal favors Agentforce. Is volume high enough that per-conversation pricing pencils against fixed-cost compute? High volume favors custom. Most teams end up running both: Agentforce for service deflection, Claude Code subagents for sales and revenue work.

Want this built?

We deploy Claude Code subagents into your GTM stack. Fixed fee. You own everything.

→ Fix your GTM