What are the main features of Vet?

The main features of Vet include: Agent conversation intent verification, Code quality and logic error checks, GitHub Action automation integration, Custom review guidelines (guides.toml).

How much does Vet cost?

Open-source and free; users provide their own LLM API Key (e.g., Claude API).

Developers using Claude Code, Codex, or other AI coding agents, and Tech Leads managing AI-heavy workflows.

What are the alternatives to Vet?

Alternatives to Vet include: CodeRabbit, Qodo/PR-Agent, Greptile.

Vet: An "Auditor" for Your AI Coding Agents

2026-03-06 | ProductHunt | GitHub | Official Site

30-Second Quick Judgment

What is it?: Vet (Verify Everything) is an open-source AI code review tool from Imbue. It’s specifically designed to "audit" the work of other coding agents—verifying that the code written by agents like Claude Code or Codex actually does what you asked for.

Is it worth your attention?: Yes. If you use Claude Code or other coding agents daily, Vet solves a real pain point: agents can sometimes "fake it." It’s open-source, free, installs in one line, and has zero telemetry, meaning almost no barrier to entry. However, it’s not a general-purpose code review tool; it’s better described as an "agent babysitter" than a "human code reviewer."

Three Questions That Matter

Is it for me?

Target Audience: Developers who use AI coding agents (Claude Code, Codex, OpenCode) in their daily workflow.
Am I the target?: If you have agents writing code and submitting PRs for you every day, you are the core user. If you still write everything by hand, you don't need this yet.
Use Cases:
- You had Claude Code write a large block of code but aren't sure if it actually ran the tests --> Use Vet to verify.
- You leave an agent running tasks overnight and want to confirm the quality the next morning --> Use Vet's agent skill for auto-review.
- Your team's PRs are full of AI-generated code --> Use Vet’s GitHub Action to audit PRs automatically.
- You don't use AI to write code --> You don't need Vet.

Is it useful?

Dimension	Benefit	Cost
Time	No need to line-check if the agent was lazy; saves massive review time.	5-minute installation, nearly zero config.
Money	The tool is free; prevents agent-generated bugs from hitting production.	You pay for your own LLM API usage (e.g., Claude API).
Effort	Peace of mind when letting agents code; can run tasks overnight.	Need to learn which reported issues are relevant to you.

ROI Judgment: If you use an agent to code for more than an hour a day, installing Vet is a no-brainer. The LLM API cost is far lower than the cost of your time spent manually reviewing every line.

What's the "Wow" Factor?

The Highlights:

Catching Agent Lies: When an agent says "tests passed" but didn't actually run them, Vet catches this "silent failure."
Understanding Intent: It can load agent conversation history to compare your original request with the agent's actual behavior, spotting deviations.
Evolutionary Prompts: Vet’s internal prompts are optimized using a "Darwinian Evolver" (evolutionary algorithm) rather than being manually tuned.

Real User Feedback:

"It's saved me so many times when Claude Code lied about tests passing, but didn't run them at all. Since it can run in the agent loop, I now run agents overnight, and ship knowing issues are fixed." — @kanjun (Imbue CEO)

"I like to envision Vet as a friendly vet wrangling rabid coding agents to keep them in check. It's already saved our team so much time + frustration on code review (and it's open source!)" — @ashleydzhang

For Independent Developers

Tech Stack

Core Logic: Snapshot repo + diff --> LLM checks --> filter/deduplicate --> output issue list.
LLM Backend: Defaults to Anthropic Claude (ANTHROPIC_API_KEY); supports OpenAI-compatible endpoints (OpenRouter, GPT-5.2, Kimi-K2, etc.).
Prompt Engineering: Uses the Darwinian Evolver algorithm to auto-optimize prompts, inspired by Sakana.ai.
Deployment: CLI / CI (GitHub Action) / Agent Skill.
Configuration: Profiles (named configs) + guides.toml (custom review rules).
Privacy: Local-first, zero telemetry, API requests go directly to the provider.

Core Implementation

Vet does something traditional code review tools don't: it reads the agent's conversation history. By using the --history-loader flag, Vet ingests the full log of what you asked and what the agent did. This isn't just checking for syntax; it's verifying "honesty."

Technically, the prompt optimization is the most interesting part. Imbue developed the Darwinian Evolver to auto-optimize Vet's prompts and decision logic. They moved away from traditional frameworks (like DSPy's MIPRO) due to context window limits and the constraints of single-prompt optimization. This evolutionary toolset even achieved a 95% SOTA on ARC-AGI-2 (Feb 2026).

Open Source Status

Fully Open Source: AGPL-3.0 license.
GitHub: 95 stars, 6 forks, actively maintained.
Darwinian Evolver is also open source: https://github.com/imbue-ai/darwinian_evolver
Build-it-yourself difficulty: Medium-High. The core logic is straightforward, but replicating the evolutionary prompt system would take an estimated 3-4 person-months.

Business Model

Monetization: Vet itself isn't monetized; it's a lead-gen tool for the Imbue ecosystem.
The Bigger Picture: Imbue’s paid product is Sculptor (an AI programming UI with a parallel agent sandbox). Vet builds trust in agents --> users use Sculptor to manage more agents --> ecosystem lock-in.
API Costs: Users cover their own LLM API fees.

Giant Risk

Medium. GitHub already has AI code review (Copilot Review), but currently, no one else focuses specifically on "verifying agent intent vs. implementation." Vet's differentiation lies in:

It's "Agent Auditing," not general code review.
It reads history to verify intent matching.
The AGPL-3.0 license ensures the community version survives even if giants build similar features.

However, if Claude Code or Cursor build in robust self-verification, Vet's standalone value will drop.

For Product Managers

Pain Point Analysis

The Problem: AI coding agent output is exploding (41% of commits involve AI in 2026), but quality verification can't keep up. A 40% gap in review capacity is expected by late 2026.
How painful is it?: High-frequency and critical. Anyone using Claude Code knows agents sometimes "silently fail"—claiming tests passed when they didn't, stopping halfway, or using fake data to bypass difficulties.
Key Insight: The issue isn't just "code quality," it's "agent integrity." While traditional linters check the code, Vet checks if the agent's behavior matches the user's intent.

User Personas

User Type	Scenario	Frequency
Independent Developer	Uses Claude Code daily; needs quality verification.	Daily
Team Tech Lead	Automatically audits agent-generated PRs in CI.	Every PR
Overnight Agent User	Runs agents at night; reviews results in the morning.	Every Night

Feature Breakdown

Feature	Type	Description
Agent Intent Verification	Core	Compares user requirements with agent implementation.
Code Quality Checks	Core	Logic errors, unhandled edges, missing tests.
GitHub Action PR Review	Core	CI automation.
Agent Skill Integration	Core	Auto-triggers within agent workflows.
Named Profiles	Nice-to-have	Standardized team configurations.
Custom Guides	Nice-to-have	Custom review rules via guides.toml.
Remote Model Registry	Nice-to-have	Community-contributed model definitions.

Competitive Landscape

Dimension	Vet	CodeRabbit	Qodo/PR-Agent	Greptile
Core Positioning	Agent Auditing	General PR Review	Enterprise Code Review	Codebase-aware Review
Unique Ability	Reads history, verifies intent	Line-level comments, PR summaries	Cross-repo analysis, test gen	Full codebase understanding
Open Source	AGPL-3.0	Partially	Open Source (PR-Agent)	No
Pricing	Free (BYOK)	$12-24/mo/user	Free (OSS) / Enterprise	Enterprise Pricing
Integration	CLI/CI/Agent Skill	GitHub/GitLab App	GitHub/GitLab App	GitHub App
Valuation/Scale	Imbue ($1B total)	-	-	$180M

Key Takeaways

The "Agent Auditing" Category: Don't just build a better code review tool; build a "watchdog" for agents. It's a precise and differentiated position.
Open Source + BYOK Model: Zero-cost entry. User stickiness comes from habit rather than vendor lock-in.
Evolutionary Prompt Optimization: Productizing AI research (Darwinian Evolver) creates a technical moat.
Agent Skill Distribution: A one-line curl command makes it easy to distribute across various agent platforms.

For Tech Bloggers

Founder Story

Kanjun Qiu (CEO): MIT CS background; paid her tuition by writing high-frequency trading algorithms. Former Dropbox Chief of Staff (scaled from 300 to 1500 people). Founded Sourceress (YC, $13M). Forbes 30 Under 30 (2020).
Josh Albrecht (CTO): Serial entrepreneur (BitBlinder, CloudFab, Outset Capital).
The Fun Detail: Kanjun co-founded "The Archive" coliving house; her roommates went on to found Anthropic (creators of Claude) and Bluesky. Essentially, the founders of Vet and Claude are former roommates.
Company History: Founded in 2020 as "Generally Intelligent," later rebranded to Imbue.

Discussion Angles

Is "Agent Auditing" a new sector or a temporary need? If agents become 100% reliable, does Vet lose its value?
How powerful is evolutionary prompt optimization? They hit 95% SOTA on ARC-AGI-2 with these tools—does this mean prompt engineering still has massive untapped potential?
The Strategy of a $1B Unicorn going Open Source: Is Vet the entry point while Sculptor is the endgame?
The AGPL-3.0 Choice: Why not MIT? What does this mean for commercialization?

Hype Metrics

PH: 90 votes, newly launched.
GitHub: 95 stars, 6 forks.
Twitter: Official announcement had 55 likes; Evolver open-source tweet had 942 likes and 125K views.
Recognition: Named in DEV Community's "Best AI Code Review Tools of 2026."

Content Suggestions

The Hook: "Your AI coding assistant might be lying to you"—this angle is guaranteed traffic.
Trend Jacking: Combine with the hype around Claude Code / Codex; "How to verify AI code" is a top developer concern.
Deep Tech: A dedicated piece on how the Darwinian Evolver uses evolutionary algorithms to optimize prompts.

For Early Adopters

Pricing Analysis

Tier	Price	Features	Is it enough?
Open Source	$0 + BYOK	All features	Completely sufficient

Hidden Costs: Every review consumes LLM API credits. Using Claude Sonnet costs a few cents per review, likely $5-20/month for regular use.

Getting Started

Setup Time: 5 minutes.
Learning Curve: Low.
Steps:
1. Run curl -fsSL https://raw.githubusercontent.com/imbue-ai/vet/main/install-skill.sh | bash to install.
2. Set your ANTHROPIC_API_KEY environment variable.
3. Run vet in your project directory or let your agent call it.
4. (Optional) Configure GitHub Action for PR reviews.
5. (Optional) Use guides.toml for custom rules.

Critiques & Pitfalls

Search Nightmare: Searching for "Vet" brings up animal hospitals; SEO is currently poor.
AI Noise: Like all AI review tools, false positives exist, and massive diffs can degrade performance.
AGPL-3.0: If you want to integrate Vet into a closed-source commercial product, the license is a hurdle.
Early Ecosystem: With 95 stars, the community is small; you might need to head to Discord for support.

Security & Privacy

Storage: Fully local, zero telemetry.
API Requests: Direct to the provider (e.g., Anthropic), never through Imbue’s servers.
Privacy: Your code is only sent to the LLM provider you choose.
Auditability: Open-source (AGPL-3.0).

Alternatives

Alternative	Advantage	Disadvantage
CodeRabbit	Mature ecosystem, 2M+ repos.	Paid SaaS, not agent-focused.
Qodo/PR-Agent	Enterprise-grade, cross-repo.	Heavier, complex config.
Manual Review	Most accurate.	Too slow for agent output volume.
Built-in Verification	No extra tools.	Conflict of interest (self-grading).

For Investors

Market Analysis

AI Code Review Sector: Expected to exceed $2B by 2026.
AI Code Generation: $4.91B (2024) --> $30.1B (2032), 27.1% CAGR.
AI Agent Market: $7.63B (2025) --> $182.97B (2033).
Drivers: By 2026, 41% of commits involve AI, creating a 40% gap in review capacity. This births the "Agent Verification" sub-sector.
Key Thesis: The more code AI writes, the greater the need for verification tools. This market scales directly with AI coding adoption.

Competitive Landscape

Tier	Player	Positioning
Leader	GitHub Copilot Review	Built-in platform feature.
Mid-Market	CodeRabbit ($200M+ user reach), Greptile ($180M valuation)	General AI code review.
New Entrant	Vet by Imbue	Agent verification, Open Source.
New Entrant	Qodo/PR-Agent	Open-source enterprise review.

Timing Analysis

Why now?: 2026 is the breakout year for AI coding agents (Claude Code, Cursor Agent). The question of "who reviews the agent's code" is currently unanswered.
Tech Maturity: LLMs are finally capable of meaningful review beyond simple linting.
Market Readiness: Developers have moved from "Can AI code?" to "How can I trust AI code?"
Analogy: Just as the rise of cars created the insurance industry, the rise of AI coding is creating the AI verification industry.

Team Background

Founders: Kanjun Qiu (CEO, MIT CS, ex-Dropbox CoS) + Josh Albrecht (CTO, serial entrepreneur).
Core Team: 11-50 people with deep AI research backgrounds.
Unique Resources: Capability to train 100B+ parameter models and access to a ~10,000 H100 cluster.
Network: Tom Brown (GPT-3 lead), Drew Houston (Dropbox), Anthropic founding team (former roommates).

Funding Status

Total Raised: $232M.
Series A: $20M (Oct 2022).
Series B: $200M (Sept 2023), led by Astera Institute, with Nvidia, Kyle Vogt (Cruise CEO), and Simon Last (Notion).
Series B Extension: $12M (Oct 2023), Alexa Fund + Eric Schmidt.
Valuation: $1B (Unicorn).
Note: Funding is for Imbue as a whole. Vet is the open-source funnel; Sculptor is the commercial engine.

Conclusion

Bottom Line: Vet accurately targets the new "Who reviews the AI?" sector. Its open-source + BYOK strategy is clever, but its long-term survival depends on whether agents become so reliable that external verification becomes redundant.

User Type	Recommendation
Developers	Install it. It takes 5 minutes and is immediately useful if you use agents daily.
Product Managers	Watch the "Agent Auditing" category; consider if your team needs standardized AI output verification.
Bloggers	The "Your AI is lying" angle has high viral potential; the Darwinian Evolver tech is also worth a deep dive.
Early Adopters	Highly recommended; zero cost to try, but remember to search for "imbue-ai/vet."
Investors	Imbue has a unicorn valuation and a stellar team, but Vet is a lead-gen tool, not the profit center.

Resource Links

Resource	Link
Official Site	https://imbue.com/product/vet/
GitHub	https://github.com/imbue-ai/vet
Darwinian Evolver	https://github.com/imbue-ai/darwinian_evolver
Sculptor	https://imbue.com/sculptor/
ProductHunt	https://www.producthunt.com/posts/vet-2
Twitter	https://x.com/imbue_ai
CEO Twitter	https://x.com/kanjun
Imbue Discord	See GitHub README

2026-03-06 | Trend-Tracker v7.3 | Data Sources: ProductHunt, GitHub, Twitter/X, Imbue Official, Crunchbase

Vet

Vet: An "Auditor" for Your AI Coding Agents

30-Second Quick Judgment

Three Questions That Matter

Is it for me?

Is it useful?

What's the "Wow" Factor?

For Independent Developers

Tech Stack

Core Implementation

Open Source Status

Business Model

Giant Risk

For Product Managers

Pain Point Analysis

User Personas

Feature Breakdown

Competitive Landscape

Key Takeaways

For Tech Bloggers

Founder Story

Discussion Angles

Hype Metrics

Content Suggestions

For Early Adopters

Pricing Analysis

Getting Started

Critiques & Pitfalls

Security & Privacy

Alternatives

For Investors

Market Analysis

Competitive Landscape

Timing Analysis

Team Background

Funding Status

Conclusion

Resource Links

Frequently Asked Questions about Vet