HelixDB: Two Dropouts Built a Rust DB to Unify Graph + Vector; YC Backed, but the Road is Long
2026-02-28 | Product Hunt | Official Site | GitHub

Screenshot Breakdown: This is the built-in HelixDB Dashboard, showcasing the core capabilities of a graph database—visualization of nodes (Doctor, Patient) and edges (relationships). Each node displays detailed attributes (ID, name, condition, cost, etc.), with toolbars for Queries, Schema, and Visualization on the left. This UI indicates that HelixDB is more than just a CLI tool; it has a visual management interface.
30-Second Quick Judgment
What is it?: An open-source "graph + vector" hybrid database built from scratch in Rust. Simply put: when building RAG or AI Agents, you no longer need to run Neo4j (for relationships) plus Pinecone (for semantic search) simultaneously. HelixDB handles both.
Is it worth watching?: Yes, but keep an eye on it for now. Backed by YC Spring 2025, with 3,840 GitHub Stars and impressive Rust-powered performance. However, the 5-vote PH launch, a 6-person team, the AGPL license, and the proprietary HelixQL language are all risks that need time to validate. If you're building a RAG/Agent project with clear pain points, it's worth a half-day trial; otherwise, just remember the name.
Three Questions for Me
Is this relevant to me?
Target Audience:
- Backend developers building RAG apps (needing both semantic search + relationship queries)
- Indie developers building AI Agent memory systems
- Engineers tired of shuffling data between vector and graph databases
Is that me?: You are the target user if you are --
- Using Pinecone for semantic search but realizing you need Neo4j for relationship traversal
- Building a memory system for AI Agents that needs to store both "who said what" (vector) and "who is related to whom" (graph)
- Building knowledge graph products that require semantic search overlays on relationship maps
Use Cases:
- Legal Document Retrieval: Use vectors to find relevant cases, then graph traversal to find citations between them
- Agent Long-term Memory: Semantic storage of dialogue history + graph structure to track entity relationships
- Medical Knowledge Base: Patient-doctor-drug relationship graphs + symptom semantic matching (as seen in the dashboard screenshot)
- Recommendation Systems: User preference vectors + social relationship graphs
Is it useful to me?
| Dimension | Benefit | Cost |
|---|---|---|
| Time | No need to maintain two DBs + glue code; saves ~30-50% of data layer dev time | Need to learn HelixQL, a new query language; takes ~1-3 days |
| Money | Open-source, free self-hosting, low hardware requirements (LMDB + Rust = resource efficient) | Managed service pricing is currently unlisted |
| Effort | One SDK for both graph + vector; no data sync headaches | Small community, evolving docs; high cost of troubleshooting |
ROI Judgment: If your project truly requires hybrid graph+vector queries and you're in the tech selection phase, trying HelixDB is worth it—the worst case is losing half a day. But if you're already running Neo4j + Pinecone smoothly, the migration risk currently outweighs the benefit.
Is it exciting?
The "Wow" Factors:
- One query for both: Perform graph traversal and vector search simultaneously in HelixQL without application-layer stitching
- Built-in Embedding: Use the
Embedfunction to vectorize text directly without pre-processing before storage - Native MCP Support: AI Agents can autonomously traverse the graph without needing to generate human-readable queries
- Compile-time Type Checking: Queries are validated for type safety before deployment, preventing production crashes
Real User Voices:
"I would love to try your db, but probably most LLMs won't be good with HQL" -- @mike_pavlukhin
This critique is spot on: HelixQL is proprietary, and LLMs don't know it yet. This means you can't have GPT/Claude generate HelixQL queries directly—a big hurdle in the AI era. However, the HelixDB team uses MCP to solve this, letting Agents traverse the graph directly rather than generating query language.
Someone asked "KuzuDB is dead, what's a good graph DB alternative?", and @JoshPurtell immediately recommended @helixdb
For Indie Developers
Tech Stack
- Language: Rust (built from scratch, not just bindings)
- Storage Engine: LMDB (via Heed3 wrapper) -- Lightning Memory-Mapped Database, battle-tested
- Data Model: Native Graph + Vector; also supports KV, Document, and Relational
- Query Language: HelixQL (Proprietary, 100% type-safe, compile-time checked)
- Vector Index: HNSW
- SDKs: TypeScript + Python
- License: AGPL
Core Implementation
HelixDB's core philosophy: A vector is essentially a node with coordinates, and edges represent neighbor links. Thus, vector search and graph traversal can be unified in a single data structure without needing two systems.
Architecturally, it uses compile-time optimization—HQL queries are compiled into optimized Rust code and deployed as API endpoints. This is much faster and safer than runtime parsing. The workspace consists of 5 Rust crates, with ACID transactions guaranteed by LMDB.
Performance Data (Official Claims):
- Vector Similarity Search: ~2ms
- Graph Traversal: <1ms
- 1000x faster than Neo4j, 100x faster than TigerGraph, vector ops on par with Qdrant
Open Source Status
- Fully Open Source: GitHub, 3,840 Stars
- License: AGPL -- Warning! If you use HelixDB for a SaaS product, your related code must also be open-sourced. Google bans AGPL software internally. Internal use is unaffected
- Build Difficulty: Extremely high, estimated 3-5 person-years. A graph DB + vector engine + query compiler is no small feat
Business Model
- Monetization: Open-core + Managed Cloud Service
- Pricing: Unlisted; managed service is currently open to select users (contact team)
- Revenue: Approx. $550K according to Getlatka (5-person team)
- Funding: $500K seed (YC + Pioneer Fund)
Giant Risk
Medium-High. Neo4j is adding vector capabilities, Weaviate has a graph-like schema, and ArangoDB is pivoting toward an AI platform. HelixDB's differentiator is "native fusion" rather than an "add-on extension." While giants are unlikely to rebuild a hybrid DB from scratch soon, if this category is validated, acquisition is more likely than being crushed.
For Product Managers
Pain Point Analysis
- Problem Solved: AI apps needing both semantic search and relationship queries currently require two DBs + glue code, leading to high maintenance, sync issues, and poor cross-DB performance
- Severity: High-frequency essential need. Every RAG and Agent app faces this; GraphRAG is now a mainstream architecture
User Persona
- Primary: Backend developers / Infrastructure engineers working on AI/RAG/Agent projects
- Secondary: Indie developers looking to build AI apps with minimal infrastructure
- Scenarios: Knowledge graphs + semantic search, Agent memory systems, recommendation engines, medical/legal retrieval
Feature Breakdown
| Feature | Type | Description |
|---|---|---|
| Graph Traversal | Core | Node, edge, and relationship queries |
| Vector Search | Core | HNSW index, ~2ms queries |
| HelixQL | Core | Proprietary type-safe query language |
| Built-in Embedding | Core | Automatic text vectorization |
| MCP Support | Core | Native integration for Agents |
| Dashboard | Extra | Web-based visual management interface |
| Keyword Search | Extra | BM25 full-text search |
Competitive Differentiation
| Dimension | HelixDB | Neo4j | Weaviate | Pinecone |
|---|---|---|---|---|
| Core Capability | Native Graph+Vector | Graph-first, Vector add-on | Vector-first, Graph schema | Pure Vector, No Graph |
| Language | Rust | Java | Go | Closed Source |
| Open Source | AGPL | GPL (Community) | BSD-3 | No |
| RAG Scenario | All-in-one | Needs Vector DB | Strong Hybrid Search | Needs Graph DB |
| Maturity | Very Early | Mature | Medium | Mature |
Key Takeaways
- The "Two-into-One" Strategy -- When a workflow requires two tools, building a unified one is a massive opportunity
- Compile-time Query Optimization -- Shifting safety checks to the compilation phase reduces runtime errors; a great idea for any query-based product
- Native MCP Support -- In the AI Agent era, databases aren't just for humans; they're for Agents. This direction is worth every DB product's attention
For Tech Bloggers
Founder Story
Two British students, Xavier Cochran and George Curtis, started a graph database side project at the University of Bristol. Xavier is a Rust enthusiast, while George previously worked at Twilight Zone Engineering and Mercury.
After reading RAG architecture papers, they realized: to run a RAG system, you need your own server, a graph DB, a vector DB, and middleware to connect them. They had an epiphany—a vector is just a node with coordinates, and an edge is a neighbor link. Why not just merge them?
They dropped out, moved to San Francisco, and joined YC Spring 2025. During YC, they attracted clients like United Healthcare, grew to 4k stars on GitHub, and claimed to have executed billions of queries. Now a 6-person team, they announced GA in early 2026.
Controversies / Discussion Points
- The AGPL Debate: AGPL is the most controversial license in the DB world. Google bans it; MongoDB eventually moved to the even more aggressive SSPL. HelixDB uses AGPL to prevent cloud providers from profiting for free, but it might scare off enterprise users
- The Proprietary Language Bet: HelixQL vs SQL/Cypher/Gremlin -- A custom language means high learning costs and no LLM support. The upside is 100% type safety and compile-time checks
- Performance Credibility: Claiming to be 1000x faster than Neo4j -- such a massive gap usually implies the test scenarios weren't a direct apples-to-apples comparison
Hype Data
- PH Votes: 5 (Very low, suggesting PH isn't their main stage)
- GitHub Stars: 3,840 (Quite good for a DB project founded in 2025)
- HN: 3 "Show HN" posts, peaking at 95 points / 48 comments
- Twitter: Mostly founders and bots; few real user voices yet
- YC Backing: Spring 2025 batch
Content Suggestions
- Narrative: "Two college dropouts building a database" + technical deep dive on "Why merge graph and vector?"
- Trend Jacking: GraphRAG is a hot topic; write about HelixDB as the "Infrastructure for the GraphRAG Era"
For Early Adopters
Pricing Analysis
| Tier | Price | Included Features | Is it enough? |
|---|---|---|---|
| Open Source | Free | All core features | Enough for techies |
| Managed Cloud | Unlisted | Hosting + Ops | Contact team |
| Enterprise | Unlisted | SLA + Support | Contact team |
Getting Started Guide
- Time to Hello World: 1-3 hours (Installation + first demo)
- Learning Curve: Medium-High (Must learn HelixQL, not SQL)
- Steps:
- Clone from GitHub or install via CLI
- Define Schema (Graph node and edge types)
- Write HelixQL queries
- Compile and deploy (Queries become API endpoints)
- Call via TypeScript/Python SDK
- Resources: Official Docs, Blog Tutorials (Docker, MCP, recommendation system examples)
- IDE Support: HelixQL syntax highlighting plugin available for Cursor
Pitfalls and Critiques
- LLMs don't know HelixQL: This is the biggest hurdle. You can't ask GPT/Claude to write queries for you. However, MCP integration can bypass this
- Tiny Community: GitHub Issues is the main channel; no mature Stack Overflow presence yet
- AGPL License: Be careful of the "viral" nature if you're building a SaaS product
- Incomplete Docs: Core docs exist, but edge cases and best practices are sparse
Security and Privacy
- Data Storage: Local LMDB; data stays on your machine when self-hosted
- Privacy by Design: HelixDB is private by default; data is only accessible via compiled HelixQL queries—no universal admin backdoor
- Security Audits: No public audit reports yet (project is too new)
Alternatives
| Alternative | Pros | Cons |
|---|---|---|
| Neo4j + Pinecone | Both mature, large communities, great docs | Must maintain two systems + glue code |
| Weaviate | Strong hybrid search, BSD-3 license | Not a true graph database |
| ArangoDB | Mature multi-model DB, enterprise-ready | Vector capabilities lag behind native ones |
| SurrealDB | Comprehensive multi-model, SQL-like syntax | Graph capabilities lag behind native ones |
| Qdrant + Custom Graph | Strong vector performance, Rust-based | Must handle graph structure yourself |
For Investors
Market Analysis
- Vector DB Market: $2.55B in 2025, projected $8.95B by 2030 (CAGR 27.5%)
- Graph DB Market: $2.85B in 2025, projected $20.29B by 2034 (CAGR 24.1%)
- The Intersection: HelixDB sits in the sweet spot; as GraphRAG becomes the standard, this niche could grow even faster
- Drivers: AI Agents need "Memory + Reasoning," and graph-vector fusion is the most natural data model for this
Competitive Landscape
| Tier | Players | Positioning |
|---|---|---|
| Leaders | Neo4j, AWS Neptune, Pinecone | Mature single-capability players |
| Challengers | Weaviate, Qdrant, ArangoDB, SurrealDB | Multi-model challengers |
| New Entrants | HelixDB | Native Graph+Vector fusion |
Timing Analysis
- Why Now: GraphRAG moved from academia to industry in 2025-2026; Microsoft released its GraphRAG framework; Agent memory is a hot topic. "Unified Graph+Vector" moved from academic concept to engineering requirement
- Tech Maturity: Rust ecosystem is mature, LMDB is battle-tested, HNSW is the vector standard. The components are ready; we just need an integrator
- Market Readiness: Developers are suffering from multi-DB architectures but are waiting for a reliable unified solution
Team Background
- Xavier Cochran (Founder): Rust expert, technical core
- George Curtis (CEO & Co-Founder): Bristol dropout, ex-Mercury and Twilight Zone Engineering
- Core Team: 6 people, including ML and Data Science backgrounds (Lukas Nitzsche, Gahn Wuwong)
- Clients: United Healthcare (known)
Funding Status
- Raised: ~$500K (Seed)
- Investors: Y Combinator (Spring 2025), Pioneer Fund
- Revenue: ~$550K (per Getlatka)
- Valuation: Unlisted
Conclusion
HelixDB is doing the right thing—native fusion of graph and vector is the evolution of databases in the AI era. However, the 6-person team, AGPL license, and proprietary query language are three mountains it must climb to prove itself.
| User Type | Recommendation |
|---|---|
| Indie Developer | Worth a try -- If you're building RAG/Agents, spend half a day on a demo. But don't rely on it heavily for production yet |
| Product Manager | Watch the trend -- Unified graph-vector is the clear direction. Remember the strategy, but don't rush the team to adopt it |
| Tech Blogger | Good story -- "Two dropouts + YC + Graph-Vector fusion" is a solid narrative. GraphRAG is trending, though PH awareness is low |
| Early Adopter | Cautiously optimistic -- Free to play with, but HelixQL learning costs + small community = high troubleshooting cost |
| Investor | Worth tracking -- High-growth intersection, YC backing, efficient $550K revenue/6-person team. But AGPL may limit enterprise adoption |
Resource Links
| Resource | Link |
|---|---|
| Official Site | helix-db.com |
| GitHub | HelixDB/helix-db |
| Documentation | docs.helix-db.com |
| YC Page | ycombinator.com/companies/helixdb |
| Blog | helix-db.com/blog |
| HN Discussions | Show HN #1 / #2 / #3 |
| @helixdb | |
| Product Hunt | producthunt.com/products/helixdb |
| Crunchbase | crunchbase.com/organization/helixdb |
2026-02-28 | Trend-Tracker v7.3