March 3-9: Accessibility Tree Snapshots, Stable Refs, GPT-5.3 Codex, and a Fresh Chat Design
Vibe shipped accessibility-tree extraction, the composite `take_snapshot` tool, stable page refs, GPT-5.3 Codex, and stronger cache/test coverage so agents can target pages more reliably.
A big leap in how the agent sees web pages, a new model, and a chat interface that feels modern — all in one week.
Accessibility tree snapshots: the agent finally sees the page structure that matters
This is one of the most important technical changes we shipped. Vibe now extracts the full accessibility tree of every page and assigns stable refs to interactive elements.
Why that matters:
- The accessibility tree keeps the parts of the page that matter for interaction — roles, labels, names, hierarchy, and actionable controls.
- It strips away a lot of the layout and markup noise that makes raw HTML hard for a model to reason about.
- Stable refs let the agent refer to the same element across multiple steps instead of re-deriving selectors each time.
In practice, this often gives the model a more useful representation than dumping a whole page of raw HTML. It is a more compact, interaction-focused view of the page.
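To make the idea concrete, here is a minimal sketch of what an interaction-focused snapshot with stable refs can look like. The node shape, the role list, and the `e1`-style ref format are illustrative assumptions, not Vibe's actual internals:

```typescript
// Hypothetical simplified accessibility node; real trees carry far more state.
type A11yNode = { role: string; name?: string; children?: A11yNode[] };

const INTERACTIVE = new Set(["button", "link", "textbox", "checkbox", "combobox"]);

function snapshotA11y(root: A11yNode): string[] {
  const lines: string[] = [];
  let nextRef = 1;
  const walk = (node: A11yNode, depth: number): void => {
    // Only actionable controls get a ref the agent can reuse across steps.
    const ref = INTERACTIVE.has(node.role) ? ` [ref=e${nextRef++}]` : "";
    const name = node.name ? ` "${node.name}"` : "";
    lines.push(`${"  ".repeat(depth)}${node.role}${name}${ref}`);
    for (const child of node.children ?? []) walk(child, depth + 1);
  };
  walk(root, 0);
  return lines;
}

// Example: a tiny login form collapses to three interaction-focused lines.
const page: A11yNode = {
  role: "form",
  name: "Login",
  children: [
    { role: "textbox", name: "Email" },
    { role: "button", name: "Sign in" },
  ],
};
console.log(snapshotA11y(page).join("\n"));
// form "Login"
//   textbox "Email" [ref=e1]
//   button "Sign in" [ref=e2]
```

Compared with the raw HTML of the same form, this view keeps exactly what the agent needs: roles, labels, hierarchy, and a handle it can act on.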
`take_snapshot` shipped as a composite tool
We also introduced `take_snapshot`, a composite tool that lets the agent grab multiple views of the current page in one step.
That matters because page understanding is rarely one-dimensional:
- sometimes the agent needs readable text
- sometimes it needs the interactive tree
- sometimes it needs a visual cross-check
`take_snapshot` made that possible without forcing agents to stitch together multiple ad hoc tools on their own.
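A composite tool of this kind can be sketched as one call that returns whichever views were requested. The view names and the page/result shapes below are assumptions for illustration; the real tool's interface may differ:

```typescript
type SnapshotView = "text" | "a11y" | "screenshot";

// Hypothetical page state the tool reads from.
interface PageState {
  text: string;          // readable text content
  a11y: string;          // accessibility-tree snapshot
  screenshotPng: string; // e.g. base64-encoded image data
}

type Snapshot = Partial<Record<SnapshotView, string>>;

function takeSnapshot(page: PageState, views: SnapshotView[]): Snapshot {
  const result: Snapshot = {};
  if (views.includes("text")) result.text = page.text;
  if (views.includes("a11y")) result.a11y = page.a11y;
  if (views.includes("screenshot")) result.screenshot = page.screenshotPng;
  return result;
}

// One step, two views: readable text plus the interactive tree.
const snap = takeSnapshot(
  { text: "Welcome back", a11y: 'button "Sign in" [ref=e1]', screenshotPng: "<png>" },
  ["text", "a11y"],
);
```

The point of the composite shape is that the agent pays one round-trip for however many views the next decision needs.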
Better page targeting with stable refs
This week also hardened the way Vibe tracks page elements.
With stable refs and better round-trips:
- the agent can point to the right control more consistently
- snapshot-derived refs stay better aligned across page-analysis flows
- multi-step interactions become less brittle
This is the kind of infrastructure work users do not always notice immediately, but it is what makes the browser operator feel less random and more dependable.
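The round-trip can be pictured as a small registry: snapshot time hands out refs, action time resolves them back to concrete targets. The registry class and its selector field are hypothetical; Vibe's real bookkeeping is more involved:

```typescript
class RefRegistry {
  private targets = new Map<string, { selector: string }>();
  private next = 1;

  // Called while building a snapshot: each actionable element gets a ref.
  register(selector: string): string {
    const ref = `e${this.next++}`;
    this.targets.set(ref, { selector });
    return ref;
  }

  // Called when the agent acts: the same ref resolves to the same target.
  resolve(ref: string): { selector: string } {
    const target = this.targets.get(ref);
    // A stale ref fails loudly instead of silently acting on the wrong element.
    if (!target) throw new Error(`stale or unknown ref: ${ref}`);
    return target;
  }
}

// Step 1 (snapshot) hands out a ref; a later step resolves that same ref.
const refs = new RefRegistry();
const signInRef = refs.register('button[name="Sign in"]');
const target = refs.resolve(signInRef);
```

Keeping that mapping on the tool side is what lets a multi-step plan say "click e2" without re-guessing a selector between steps.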
GPT-5.3 Codex and dynamic model lists
GPT-5.3 Codex is now available — OpenAI's code-optimized model, with stronger multi-step reasoning and better instruction following for complex browser tasks.
And you no longer need an extension update to see new models. Vibe now loads dynamic model lists from the backend, so when we deploy a new model, it appears in your dropdown immediately.
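A sketch of the client side, assuming the backend returns a flat list of models with tier labels (the payload shape, tier names, and the `fast-default` model id are all illustrative assumptions):

```typescript
interface ModelInfo {
  id: string;
  tier: string; // e.g. "free" | "pro"
}

// Group a backend-supplied model list by tier for a compact dropdown.
function groupByTier(models: ModelInfo[]): Map<string, string[]> {
  const groups = new Map<string, string[]>();
  for (const m of models) {
    const ids = groups.get(m.tier) ?? [];
    ids.push(m.id);
    groups.set(m.tier, ids);
  }
  return groups;
}

// Whatever the backend returns shows up in the dropdown; no extension update.
const dropdown = groupByTier([
  { id: "gpt-5.3-codex", tier: "pro" },
  { id: "fast-default", tier: "free" },
]);
```

Because the extension treats the backend as the source of truth, deploying a new model is purely a server-side change.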
Redesigned chat interface
The chat page was restyled with a cleaner, more modern layout inspired by Gemini's UI conventions. Key improvements:
- less visual clutter during long sessions
- better tool-call visibility during execution
- compact model selection by tier
- processing status that collapses more gracefully during longer runs
Why this made Vibe more effective
This week improved more than observability. It improved effectiveness.
Once the agent has a cleaner representation of the page, it spends less effort guessing where the real control is and more effort actually completing the task.
That is a big deal on messy real-world apps where the visible UI and the raw HTML do not line up cleanly.
Under the hood
- Added settings and settings_list tools so the agent can read and adjust its own configuration
- Microphone permission is now requested before voice recognition starts
- Fixed session replay to correctly show tool calls from previous sessions
- Stabilized E2E tests across all subscription tiers
- Improved LiteLLM retry logic for transient failures
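The retry improvement follows a familiar pattern: retry only a bounded number of times, backing off between attempts. This is a minimal sketch of that pattern; the attempt count and the backoff note are illustrative assumptions, not LiteLLM's actual policy:

```typescript
function withRetry<T>(fn: () => T, maxAttempts = 3): T {
  let lastError: unknown;
  for (let attempt = 1; attempt <= maxAttempts; attempt++) {
    try {
      return fn();
    } catch (err) {
      lastError = err;
      // A real client would sleep here, e.g. ~100ms * 2^attempt, before retrying,
      // and would rethrow immediately for non-transient errors.
    }
  }
  throw lastError;
}

// A call that fails twice with a transient error, then succeeds on attempt 3.
let attempts = 0;
const result = withRetry(() => {
  attempts++;
  if (attempts < 3) throw new Error("503 Service Unavailable");
  return "ok";
});
```

The key property is that transient failures are absorbed up to the attempt budget, while a persistent failure still surfaces the last error to the caller.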
This accessibility-tree work set up the next release: standalone snapshot tools like `take_md_snapshot`, `take_a11y_snapshot`, and `take_html_snapshot`, plus better handling for custom UI controls. Read the follow-on post: March 10-15 release notes.