The first half of 2026 turned the "AI agent" from a demo into a product category. In the span of four months we got Anthropic's Claude Cowork going GA on the desktop, Google announcing Gemini Spark at I/O, Nous Research shipping the open-source Hermes Agent, and an open-source project called OpenClaw becoming the most-starred repository in GitHub's history. Plus the tool that arguably started the current wave for developers β Claude Code β kept maturing in the terminal.
These five things all get called "agents," but they're not the same kind of product, and comparing them feature-by-feature without first understanding their shape leads to bad decisions. This article maps the landscape across two axes that actually matter β who runs the compute (a vendor's cloud vs. your own machine) and who the agent is for (developers vs. general knowledge workers) β then digs into the internals, features, use cases, and pricing of each.
Disclosure on sourcing: these are fast-moving products, several of which launched between February and May 2026. Everything here reflects publicly available information as of June 2026. Pricing and feature availability change constantly β treat the numbers as directional and verify against each vendor's current docs before you commit.
Two Axes That Organize the Whole Field
Almost every "which agent should I use" debate collapses once you place the contenders on two axes. The vertical axis is execution locus: does the agent run on infrastructure the vendor owns and bills you for, or on hardware you control? The horizontal axis is primary user: is it built for engineers living in a terminal and a codebase, or for everyone else doing knowledge work in documents, inboxes, and spreadsheets?
flowchart TB
subgraph Managed["βοΈ Vendor-hosted / managed compute"]
direction LR
CC["Claude Code\n(dev Β· local CLI, vendor models)"]
CW["Claude Cowork\n(knowledge work Β· desktop, vendor models)"]
GS["Gemini Spark\n(knowledge work Β· 24/7 cloud VM)"]
end
subgraph Self["π₯οΈ Self-hosted / bring-your-own"]
direction LR
HA["Hermes Agent\n(dev + personal Β· your VPS/GPU)"]
OC["OpenClaw\n(personal Β· your devices)"]
end
Dev["β¨οΈ Developer-centric"] -.-> CC
Dev -.-> HA
Know["π Knowledge-worker-centric"] -.-> CW
Know -.-> GS
Know -.-> OC
Managed --- Self
The 2026 agent field on two axes. Anthropic spans both user types with two products on vendor-hosted compute; Google bets on always-on cloud VMs; Hermes and OpenClaw are open-source and run on hardware you control.
Note where Anthropic sits: it's the only player here fielding two distinct products β Claude Code for engineers and Claude Cowork for everyone else β sharing the same model backend and subscription. Google's Spark is a single always-on cloud agent. The two open-source entrants, Hermes and OpenClaw, both let you bring any model and run on your own box, but they target different users: Hermes leans developer/power-user, OpenClaw leans personal-assistant-over-messaging.
The Contenders, One at a Time
Claude Code β the terminal-native coding agent
Claude Code is Anthropic's command-line coding agent. It lives in your terminal, reads and edits files in your repo, runs commands, and operates as an autonomous (but approval-gated) pair programmer. Architecturally it's a thin local client driving Anthropic's hosted Claude models (Opus, Sonnet, Haiku in the 4.x family), extended through the Model Context Protocol (MCP) for tools, plus a skills mechanism and subagents for parallel work. The compute that matters β inference β happens in Anthropic's cloud; the file and command execution happens locally on your machine.
Its sweet spot is software engineering: multi-file refactors, test-driven changes, codebase exploration, and CI/scripting glue. It's included in Anthropic's paid plans rather than sold separately.
Claude Cowork β the desktop knowledge-work agent
Cowork is Anthropic's answer to "what does Claude Code feel like for people who aren't developers." It went generally available on April 9, 2026 for paid subscribers on macOS and Windows, and it lives inside the Claude desktop app alongside Chat and Code. Rather than a sandbox in the cloud, Cowork operates on your actual machine: it opens applications, fills spreadsheets, navigates your browser, organizes files, and connects to services via MCP connectors (Slack, Chrome, Zoom). Crucially, it shows you an execution plan and waits for approval before significant actions β the UX is built around supervised autonomy.
The target user is explicitly not the developer: researchers, analysts, operations, legal, and finance teams who have time-consuming but not technically complex work. The GA release added enterprise controls β role-based access, group spend limits, usage analytics, per-connector controls, and private plugin marketplaces. (Anthropic temporarily doubled Cowork's 5-hour usage limit for the June 5 β July 5, 2026 window, a tell that usage caps are the binding constraint for heavy users.)
Gemini Spark β the always-on cloud agent
Announced at Google I/O on May 19, 2026, Gemini Spark is built from Gemini base models (Gemini 3.5) wrapped in an agentic harness from Google Antigravity, Google's agent platform. Its defining architectural choice is that it runs 24/7 on dedicated virtual machines in Google Cloud β you don't keep a laptop open; the agent works in the background and reports progress (including via "Android Halo" on mobile). You task it by emailing a dedicated Gmail address, and it browses the web through Chrome.
Spark ships with deep Workspace integration (Gmail, Docs, Sheets, Slides) out of the box and MCP connections to third-party apps β Canva, OpenTable, and Instacart at launch, with GitHub, Notion, Slack, Adobe, Samsung, Spotify, and CapCut slated for summer 2026. On the developer side, it leans on Antigravity to suggest code changes, file a Jira ticket for review, and update status across Sheets and Docs. It's available to Google AI Ultra subscribers (U.S. only at launch).
Hermes Agent β the open-source agent that grows with you
Released by Nous Research in February 2026 under an MIT license, Hermes Agent is built around a learning loop: it creates skills from experience, refines them during use, persists knowledge, searches its own past conversations, and builds a model of the user across sessions. That persistent memory is the headline feature β it's positioned as an agent that compounds in usefulness over time rather than resetting every session.
It ships with 40+ built-in tools (web search, browser automation, vision), scheduled automations, and subagents. Critically, it's model-agnostic: point it at Nous Portal, OpenRouter (200+ models), NovitaAI, NVIDIA NIM, or your own endpoint. It runs persistently on infrastructure you control β a $5 VPS, a GPU cluster, or near-free serverless when idle β and you can talk to it from Telegram while it works on a cloud VM. No telemetry, no cloud dependency, no vendor lock-in; runs on Linux, macOS, and WSL2.
OpenClaw β the viral personal assistant over messaging
OpenClaw has the most colorful origin story here. Created by Peter Steinberger, it first appeared as Warelay in November 2025, was renamed Moltbot on January 27, 2026 (after a trademark complaint from Anthropic), then settled on OpenClaw days later. It crossed 100,000 GitHub stars in February 2026 and hit ~347,000 by April β the most-starred repository in GitHub history. Steinberger has since joined OpenAI, with an OpenClaw Foundation set up for stewardship.
Architecturally, OpenClaw is a local-first Gateway (written in TypeScript and Swift) that acts as a control plane: it receives inbound messages across a huge range of channels (WhatsApp, Telegram, Signal, Discord, iMessage, Slack, and many more), routes them to isolated agent workspaces, manages sessions, and executes tools. Configuration lives in workspace files (SOUL.md, AGENTS.md) and skills are directories with a SKILL.md, distributed through a registry called ClawHub. It enforces sandbox isolation (Docker/SSH backends) for non-main sessions and DM-pairing approval for unknown senders. It's model-agnostic (Claude, GPT, DeepSeek), MIT-licensed, and installs as a global npm package running as a daemon. The interaction model is the differentiator: your agent answers you on the chat apps you already use.
Architecture, Side by Side
The clearest way to see the real differences is to line up the architectural decisions that determine cost, privacy, latency, and control.
| Dimension | Claude Code | Claude Cowork | Gemini Spark | Hermes Agent | OpenClaw |
|---|---|---|---|---|---|
| Vendor | Anthropic | Anthropic | Nous Research | OpenClaw Foundation | |
| Where it runs | Local CLI; inference in Anthropic cloud | Local desktop app; inference in Anthropic cloud | 24/7 dedicated VM in Google Cloud | Your VPS / GPU / serverless | Your devices (local-first gateway) |
| Model | Claude (locked to Anthropic) | Claude (locked to Anthropic) | Gemini 3.5 (locked to Google) | Any (OpenRouter, NIM, localβ¦) | Any (Claude, GPT, DeepSeekβ¦) |
| Agent harness | Anthropic agent loop + MCP | Anthropic agent loop + MCP | Google Antigravity | Hermes loop + 40+ tools | Gateway + skills |
| Primary interface | Terminal | Desktop app UI | Email (Gmail) + mobile | Telegram / CLI | Any messaging app |
| Memory / learning | Session + project context, MCP memory | Session + connectors | Long-horizon tasks on VM | Persistent, self-improving skills | Persistent per-workspace state |
| Extensibility | MCP, skills, subagents | MCP connectors, plugins | MCP connectors | Skills, subagents, any tool | SKILL.md skills, ClawHub |
| License | Proprietary | Proprietary | Proprietary | Open source (MIT) | Open source (MIT) |
| Data locality | Code local, prompts to vendor | Files local, prompts to vendor | Runs in Google Cloud | Fully self-hosted | Fully self-hosted |
A pattern jumps out: there are really three architectural archetypes here, not five. The managed ones (Claude Code, Cowork, Spark) trade control for polish and zero-ops β you accept model lock-in and sending data to the vendor in exchange for something that just works. The self-hosted ones (Hermes, OpenClaw) trade convenience for sovereignty β you bring your own model and keep your data, but you own the uptime, the security posture, and the bill.
flowchart LR
U["You / your task"]
subgraph A["Managed (Code Β· Cowork Β· Spark)"]
A1["Thin client / email / desktop"] --> A2["Vendor cloud:\nmodel + agent harness"]
A2 --> A3["Acts on local files\nor cloud apps"]
end
subgraph B["Self-hosted (Hermes Β· OpenClaw)"]
B1["Your gateway / runtime"] --> B2["Your chosen model\n(API or local GPU)"]
B2 --> B3["Sandboxed tools\non your infra"]
end
U --> A1
U --> B1
The two dominant patterns. Managed agents put the harness and model in the vendor's cloud; self-hosted agents put a gateway/runtime on your infrastructure and let you choose the model.
Features That Actually Differentiate
Marketing pages list dozens of features; only a few change the decision. Here are the ones that do.
| Capability | Claude Code | Claude Cowork | Gemini Spark | Hermes | OpenClaw |
|---|---|---|---|---|---|
| Writes & edits code | β Core strength | β οΈ Possible, not the focus | β Via Antigravity + Jira | β #1-ranked coding agent | β οΈ Possible, not the focus |
| Operates GUI apps | β | β Opens apps, fills sheets | β οΈ Web via Chrome | β Browser automation | β οΈ Via tools |
| Runs unattended 24/7 | β Interactive | β οΈ Scheduled tasks | β Always-on cloud VM | β Persistent daemon | β Persistent daemon |
| Persistent cross-session memory | β οΈ Project-scoped | β οΈ Connector-scoped | β οΈ Task-scoped | β Self-improving | β Per-workspace |
| Bring your own model | β | β | β | β | β |
| Messaging-app interface | β | β | β οΈ Email | β Telegram | β Many channels |
| Enterprise admin controls | β Team/Enterprise | β RBAC, spend, analytics | β οΈ Via Google Workspace | π§ DIY | π§ DIY |
| Data stays on your infra | β | β | β | β | β |
Pricing
Pricing splits cleanly along the managed/self-hosted line. Managed agents are bundled into a subscription; self-hosted agents are free as software but you pay for infrastructure and model inference yourself.
| Product | Pricing model | Indicative cost (June 2026) |
|---|---|---|
| Claude Code | Included in Anthropic paid plans | Pro $20/mo Β· Max 5Γ $100/mo Β· Max 20Γ $200/mo Β· Team Premium $125/seat |
| Claude Cowork | Included in Anthropic paid plans (GA Apr 9, 2026) | Same subscriptions as above; usage governed by rolling limits |
| Gemini Spark | Bundled with Google AI Ultra (U.S. only) | Ultra $99.99/mo (β5Γ limits) or $200/mo (β20Γ limits) |
| Hermes Agent | Free software (MIT) + your infra + your model API | From ~$5/mo VPS; inference billed by whichever provider you pick |
| OpenClaw | Free software (MIT) + your infra + your model API | Self-host cost + model subscription/API (e.g. your existing Claude/GPT plan) |
The hidden cost of "free": open-source agents have a $0 license but a real total cost of ownership β the VPS, the model API bill (which can dwarf a $20 subscription if the agent runs hot), and your own time on setup, security hardening, and upkeep. The managed agents fold all of that into one predictable monthly number. "Free" wins on sovereignty and flexibility, not always on dollars.
Which One Should You Actually Use?
There's no single winner β there's a best fit per situation. Mapped to the most common needs:
- You're a developer who lives in a codebase β Claude Code. Terminal-native, strongest at multi-file engineering work, and already in your paid plan. If you want open-source and bring-your-own-model for coding, Hermes is the credible alternative.
- You're a non-developer who wants an agent to do real work on your computer β Claude Cowork. It's purpose-built for analysts, ops, legal, and finance, with the approval-gated UX and enterprise controls that make it safe to roll out.
- You want work to happen while your laptop is closed β Gemini Spark. The always-on cloud VM and email interface are the whole point, and the Workspace integration is unmatched if you already live in Google.
- You care about data sovereignty, model choice, or cost control at scale β Hermes Agent or OpenClaw. Both keep data on your infrastructure and let you swap models freely. Pick Hermes for a developer/power-user agent with self-improving memory; pick OpenClaw if you want a personal assistant reachable from WhatsApp/Telegram/Signal.
- You want an agent reachable from the chat apps your family or team already use β OpenClaw. The multi-channel messaging gateway is its signature.
The decision heuristic that cuts through it: answer two questions first β "Am I willing to send my data and prompts to a vendor's cloud?" and "Is this for coding, for general desktop work, or for always-on background tasks?" Those two answers eliminate three of the five options immediately. Everything else β exact feature checklists, benchmark scores β is tuning at the margins.
The Bigger Pattern
Step back and the 2026 landscape tells a coherent story. The incumbents (Anthropic, Google) are productizing agents as managed, model-locked services β polished, governed, and billed per seat β and segmenting by user: Anthropic with two products, Google with one always-on cloud agent. Meanwhile the open-source world (Hermes, OpenClaw) is racing in the opposite direction: bring-any-model, self-hosted, data-sovereign agents that compound through persistent memory and skills, reachable from wherever you already chat.
That tension β convenience and governance vs. control and sovereignty β is the defining axis of the agent era, and it's the same tension we've watched play out in every prior platform shift. The pragmatic answer for most teams won't be one agent; it'll be a managed agent for governed, day-job work and a self-hosted one for the tasks where data locality or model choice is non-negotiable. The interesting part is that, for the first time in 2026, both halves of that answer are genuinely good.