klotz: ocr* + pdf*

0 bookmark(s) - Sort by: Date ↓ / Title / - Bookmarks from other users for this tag

  1. Docling is a tool that parses documents and exports them to desired formats like Markdown and JSON. It supports various document formats including PDF, DOCX, PPTX, Images, HTML, AsciiDoc, and Markdown.

    2024-11-01 Tags: , , , , , , , , , , by klotz
  2. apt install tesseract-ocr-fra

    2024-03-04 Tags: , , , , , by klotz
  3. 2016-06-23 Tags: , , , by klotz
  4. pdfocr adds an OCR text layer to scanned PDF files, allowing them to be searched. It currently depends on Ruby 1.8.7 or above, and uses ocropus, cuneiform, or tesseract for performing OCR.

    To use, run:

    pdfocr -i input.pdf -o output.pdf

    2015-02-19 Tags: , , , , by klotz

Top of the page

First / Previous / Next / Last / Page 1 of 0 SemanticScuttle - klotz.me: Tags: ocr + pdf

About - Propulsed by SemanticScuttle