A from-scratch reimplementation of Stanford's XTR-Warp semantic search engine written in safe Rust. It is designed for client-side deployment, utilizing a single-file SQLite database for storage without the need for external API keys, vector databases, or complex chunking strategies. The engine offers high performance with extremely low end-to-end search latency and supports hybrid search by combining semantic results with standard BM25 functionality.
Key features and components:
- High-speed semantic search capable of running on local devices.
- SQLite backend for easy data persistence and portability.
- Support for various backends including T5 quantized weights via candle and OpenVINO.
- Pickbrain CLI example for indexing AI coding session transcripts (Claude Code/OpenAI Codex).
- Hardware acceleration support for Apple Silicon (Metal) and x86 (fbgemm).
- Available as a Node.js native module.