iText is a library that allows you to create and manipulate PDF documents
pdfocr adds an OCR text layer to scanned PDF files, allowing them to be searched. It currently depends on Ruby 1.8.7 or above, and uses ocropus, cuneiform, or tesseract for performing OCR.
To use, run:
pdfocr -i input.pdf -o output.pdf