You choose how data is processed. Local models run on your device. Cloud models send data to that provider's API.

How much does Vibe Co-Pilot cost?

Vibe is free to install. If you run cloud models, usage is billed by that provider, and paid Vibe plans are optional.

How does Vibe compare to OpenClaw?

OpenClaw is a self-hosted agent stack. Vibe is a browser co-pilot focused on completing overcomplicated web tasks in your existing browser sessions.

How does Vibe compare to Claude Cowork?

Claude browser workflows are Claude-first. Vibe is browser-first and model-flexible, so teams can choose how they run automations.

How does Vibe compare to Chrome DevTools MCP?

DevTools MCP is a browser debugging/control interface. Vibe packages browser task execution into a co-pilot experience for end-to-end workflows.

Vibe Engineering: From Claude Code to OpenCode — How We Set Up AI Coding Agents

Name: Vibe Browser
Author: Vibe Technologies

We run a multi-agent engineering team on OpenCode. Each agent gets a role, a model, and a task queue — an Opus-class orchestrator decomposes work and delegates to specialized subagents running in parallel, overnight, without me at the keyboard. This post covers the stack we built, what drove us off Claude Code alone, and how the orchestrator model actually works.

Starting Point: Claude Code

Claude Code was the right first tool. It understands large codebases, reasons about architecture, and produces working code for complex tasks. I used it for everything early on — new features, bug fixes, PR review, documentation.

The friction points appeared as the workload scaled:

Provider lock-in: Claude Code runs on Anthropic models only. I needed to route different tasks to different models based on cost and capability — GPT for code generation, Gemini for UI, cheaper models for tests. Every overnight run went through a single provider at full Opus pricing with no way to downgrade individual steps.
Context limit failures on long tasks: Claude Code would hit context limits mid-task on large refactors, dropping state and requiring a manual restart. Overnight runs failed silently — I'd wake up to nothing merged and no clear error.
Closed internals: when I wanted to add reflection layers (detecting when an agent is stuck in a loop) or custom task automation, Claude Code's architecture made it difficult. We had no exact cost baseline — we just knew overnight runs were failing and we couldn't tell why.
Single session: no way to run multiple agents in parallel on independent tasks. Tasks queued instead of running concurrently, which made the async-overnight model unworkable.

Claude Code remains excellent for complex reasoning tasks. But as an orchestration substrate for a multi-agent company, it was the wrong layer.

The Switch to OpenCode

OpenCode is an open-source coding agent with multi-model support and a composable architecture. The key properties I needed:

Model-agnostic: switch between Anthropic, OpenAI, Azure OpenAI, Google, local Ollama in config
Open architecture: add reflection layers, custom tools, GitHub plugins without forking closed source
Remote serve mode: opencode serve exposes a session over HTTP — I can send tasks from mobile, route tasks from other agents, run 24/7 on a remote VM
Community ecosystem: problems I hit have usually been solved already

We also maintain a fork of OpenCode with specific changes: planning-loop detection, cross-model review (Claude reviewing GPT-4o output), GitHub integration plugin, and self-healing Cloudflare tunnel watchdogs for the remote serve layer.

The Orchestrator Model

The current setup runs one Opus-class orchestrator and multiple specialized subagents:

Orchestrator (Claude Opus)
├── BackendDeveloper  (GPT-4o)      — APIs, databases, server logic
├── FrontendDeveloper (Gemini Pro)  — UI, styling, client-side
├── QAEngineer        (MiniMax)     — tests, edge cases, validation
├── DevOpsEngineer    (Claude Sonnet) — CI/CD, infra, deployments
└── SEOEngineer       (Gemini)      — release notes, blog posts, SEO

The orchestrator never writes code. Its system prompt is explicit:

When given a task, create a GitHub issue first.

Keep the issue updated throughout.

Decompose and delegate to subagents.

Run independent subagents in parallel.

When complete, review and reflect.

Create a PR if code changes were made.

Watch GitHub Actions and ensure tests pass. Never implement changes yourself — design, delegate, review.

Expensive models (Opus) doing expensive reasoning. Cheap models doing execution. Spec quality is what determines output quality — Opus turns a vague requirement into a proper technical spec, then routes it to the right subagent.

Cross-Model Review

One quality gate that consistently catches issues: having one model review another's output.

After a subagent submits a PR, the orchestrator sends the diff to a different model: "GPT-4o, do you agree with what Claude wrote here?" They usually do not agree completely. The disagreements surface real issues — logic gaps, missing edge cases, incorrect assumptions.

This is not expensive in practice. Diff review is a small context window. The catch rate justifies the cost.

Running 24/7

The opencode serve command runs a persistent session on a remote VM (DigitalOcean, ~$12/month). I send tasks from:

Terminal on the work machine
Mobile via voice → TypeWhisper transcribes → sends to the remote session
Other agents (VibeTeam operations agents can spawn coding tasks)

I check in asynchronously. The expectation is that most mornings start with reviewing what the agents shipped overnight — in practice we haven't tracked this systematically, but that is the pattern when overnight runs complete cleanly.

What This Replaces

Before this setup, a feature from spec to merged PR required me at the keyboard for several hours. Now:

Voice message (2 min) → transcribed spec
Orchestrator creates GitHub issue, decomposes, delegates
Subagents work in parallel (1-3 hours depending on complexity)
I review the PR (15-30 min)
CI passes, merge

The bottleneck shifted from coding to reviewing. That is the right bottleneck for a product company.

Evidence It Works

We haven't measured this yet — no latency benchmarks or cost deltas captured at time of writing. OpenCode feels faster for context switching but we have no numbers.

What Does Not Work Yet

TypeWhisper voice interface is unstable on long sessions: transcription drops or the connection to the remote session resets after 20–30 minutes, requiring a manual reconnect.
Orchestrator model selection is manual: choosing which model handles which subagent role is hand-tuned config, not automatic. There is no runtime mechanism that reassigns a role to a cheaper model when cost spikes.
No objective model benchmark: the model assignments in the diagram above are based on intuition and informal testing, not a controlled benchmark. A cheaper model might do just as well for several of those roles.

Update (Jan 2026): The production setup moved OpenCode off the DigitalOcean VM and onto a real dev workstation. opencode serve still exposes the session over HTTP — but instead of a cloud VM, it runs on a workstation on my desk. The cloud-side OpenClaw SoftwareEngineer agent (Gilfoyle Bertram) reaches that endpoint over Tailscale, so the dev box has no public ingress. Gilfoyle Bertram is the supervisor in the cloud; OpenCode on the dev machine is the worker. Full architecture is in Switching From OpenHands to VibeBrowser Agentic Team.

Next in This Series

VibeTeam: How We Run Operations with AI Agents →

The full #ainativecompany series:

Building Vibe Technologies: An AI-Native Startup
You are here — Vibe Engineering: From Claude Code to OpenCode
VibeTeam: OpenHands AI Operations Agents
Switching From OpenHands to VibeBrowser Agentic Team
Docs Support Chat: Azure AI RAG + SupportEngineer Escalation
Chatwoot AI Chatbot for openclaw.vibebrowser.app
Switching OpenClaw Operations to DeepSeek-V4-Flash
Token Optimization with OpenCode, LST, RTK, Caveman
Linear Customer Support Pipeline: From VibeBrowser Co-Pilot to Jared Dunn
Agent Communication: Slack Apps, OpenClaw Bindings, AGENTS.md Handoff Matrix — how agents route work to each other
Meet the Vibe Technologies Team: 10 AI Agents, One Human, One Framework — full agent roster with roles, models, and channel bindings
Two Layers of Agent Evaluation: Deployment Checks and Team Trace Review
OpenCode in Server Mode: Tailscale Access and AI Session Supervision
Claude Code Remote Control: Managing Coding Sessions from Mobile — per-PR YAML eval queue plus Claw's Langfuse-backed team evaluation

Previous in series: Building Vibe Technologies: An AI-Native Startup →

Vibe Engineering: From Claude Code to OpenCode — How We Set Up AI Coding Agents

Starting Point: Claude Code

The Switch to OpenCode

The Orchestrator Model

Cross-Model Review

Running 24/7

What This Replaces

Evidence It Works

What Does Not Work Yet

Next in This Series

Related posts

Claude Code Remote Control: Managing Coding Sessions from Mobile

OpenCode in Server Mode: Tailscale Access and AI Session Supervision

Why OpenCode, Not Claude Code: Five Reasons I Use Open-Source for the Coding Agent

Starting Point: Claude Code

The Switch to OpenCode

The Orchestrator Model

Cross-Model Review

Running 24/7

What This Replaces

Evidence It Works

What Does Not Work Yet

Next in This Series

Related reading

Related posts

Claude Code Remote Control: Managing Coding Sessions from Mobile

OpenCode in Server Mode: Tailscale Access and AI Session Supervision

Why OpenCode, Not Claude Code: Five Reasons I Use Open-Source for the Coding Agent