OpenKB is an open-source command-line system that uses large language models (LLMs) to transform raw documents into a structured, interlinked, wiki-style knowledge base. Unlike traditional retrieval-augmented generation (RAG) systems, which rediscover information with every query, OpenKB compiles knowledge once into a persistent format in which summaries, concept pages, and cross-references are maintained and updated automatically.
Key features and capabilities include:
- Vectorless long document retrieval powered by PageIndex tree indexing.
- Native multi-modality for understanding figures, tables, and images.
- Broad format support including PDF, Word, Markdown, PowerPoint, HTML, and Excel.
- Automated wiki compilation that creates summaries and synthesizes concepts across documents.
- Interactive chat sessions with persisted history, plus Obsidian compatibility via wikilinks.
- Health check tools (linting) to identify contradictions, gaps, or stale content within the knowledge base.
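
To make the wikilink and health-check ideas above more concrete, here is a minimal sketch of one kind of check a wiki linter could run: finding wikilinks that point to pages that do not exist. This is not OpenKB's implementation. The function names `extract_wikilinks` and `find_broken_links` are hypothetical, and the sketch assumes the wiki is a directory of `.md` files whose file names (without extension) are the page titles, with links written in Obsidian-style `[[Page]]`, `[[Page|alias]]`, or `[[Page#Heading]]` form.

```python
import re
from pathlib import Path

# Matches [[Target]], [[Target|alias]] and [[Target#Heading]];
# group 1 captures only the target page title.
WIKILINK = re.compile(r"\[\[([^\]|#]+)(?:#[^\]|]*)?(?:\|[^\]]*)?\]\]")


def extract_wikilinks(text: str) -> set[str]:
    """Return the set of page titles referenced by wikilinks in text."""
    return {match.group(1).strip() for match in WIKILINK.finditer(text)}


def find_broken_links(wiki_dir: Path) -> dict[str, set[str]]:
    """Map each page title to the link targets that have no matching page.

    Assumption (hypothetical layout): every page is a .md file under
    wiki_dir, and the file stem is the page title.
    """
    pages = {path.stem: path for path in wiki_dir.rglob("*.md")}
    known = set(pages)
    broken: dict[str, set[str]] = {}
    for title, path in pages.items():
        missing = extract_wikilinks(path.read_text(encoding="utf-8")) - known
        if missing:
            broken[title] = missing
    return broken
```

Calling `find_broken_links(Path("wiki"))` on such a directory would return an empty dictionary when every link resolves. A real health check for contradictions or stale content would need far more than this (likely an LLM pass); the sketch only covers the purely structural case.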