How to read and convert PDFs to Markdown for better RAG results with LLMs.
- wkhtmltopdf is a set of open source command line tools that convert HTML pages into PDFs or images.
- It uses the Qt WebKit rendering engine and runs headless, so no display or display server is required.
- A C library is available too.
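
As a rough illustration of the command line usage described above, the sketch below shells out to the tool from Python. It assumes the `wkhtmltopdf` binary is installed and on `PATH`, and it relies only on the basic `wkhtmltopdf <input> <output>` invocation. The helper names `build_command` and `html_to_pdf` are hypothetical and exist only for this example.

```python
import shutil
import subprocess
from pathlib import Path
from typing import List


def build_command(source: str, output_pdf: str, binary: str = "wkhtmltopdf") -> List[str]:
    """Return the argument list for a basic call: <binary> <input> <output>."""
    return [binary, source, output_pdf]


def html_to_pdf(source: str, output_pdf: str) -> Path:
    """Convert an HTML file or URL to a PDF by running wkhtmltopdf.

    Assumes wkhtmltopdf is installed; raises FileNotFoundError otherwise.
    """
    binary = shutil.which("wkhtmltopdf")
    if binary is None:
        raise FileNotFoundError("wkhtmltopdf was not found on PATH")
    # check=True raises CalledProcessError if the tool exits with an error code.
    subprocess.run(build_command(source, output_pdf, binary), check=True)
    return Path(output_pdf)
```

Calling `html_to_pdf("page.html", "page.pdf")` (both file names are placeholders) would write the PDF next to the script, provided the binary is available.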