IBM is releasing Granite-Docling-258M, an ultra-compact and cutting-edge open-source vision-language model (VLM) for converting documents to machine-readable formats while preserving layout, tables, equations, and more. It's designed for accurate and efficient document conversion and excels beyond simple text extraction.
Docling simplifies document processing, parsing diverse formats โ including advanced PDF understanding โ and providing seamless integrations with the gen AI ecosystem.
Send once, read anywhere. Convert and send documents to your Kindle library or specific devices. Supported file types include PDF, DOCX, TXT, etc., with a max file size of 200 MB.
MarkItDown is a utility for converting various files to Markdown, including PDF, PowerPoint, Word, Excel, Images, Audio, HTML, text-based formats, and ZIP files.
Microsoft has open-sourced MarkItDown, a state-of-the-art application designed to convert various file types into Markdown format for seamless integration, collaboration, and accessibility. The tool supports multiple file formats, including PDFs, PowerPoint presentations, Word documents, Excel spreadsheets, images, audio, HTML, text-based formats, and ZIP files, making it a versatile utility for users across different domains.
MegaParse is an open-source tool designed for parsing and converting various types of documents for ingestion into LLM. It supports multiple document formats, including text, PDF, PowerPoint, Excel, CSV, and Word documents, and offers customizable output formats to meet different LLM requirements, making it a versatile and efficient solution for data preparation in LLM applications.
How to read and convert PDFs to Markdown for better RAG results with LLMs.
- WKHTMLTOPDF is a set of open source command line tools for converting HTML pages into PDFs or images.
- It uses Qt WebKit rendering engine and runs headlessly without requiring a display.
- A C library is available too.