May 21, 2026 • Dzianis Vashchuk • 7 min read
How Vibe Technologies evaluates its ten AI agents at two levels: a YAML-based deployment verification suite that runs per-PR, and a Langfuse-backed team evaluation loop where Claw reviews cross-agent traces, checks for stuck sessions, and pushes agents to complete their tasks.
April 12, 2026 • Dzianis Vashchuk • 10 min read
How we validate Chrome extension microphone initialization and listening-state transitions with manifest permission auditing, explicit FAIL vs SKIP semantics, and a virtual microphone in CI.
October 7, 2025 • Dzianis Vashchuk • 9 min read
How Vibe tests browser agents in practice: OCR-backed extractor validation, deterministic mock LLM workflows, and real-model integration tests against live sites.