OCR
I am not a programmer!! But I create specifications for them.
We are looking to add smart OCR to our ROR app. By smart, I mean that can read, pair, and export tabular data from a scanned/exported paper invoice (pdf, jpg, etc.). We don't have any experience with this.
The options are wide: dedicated engines with api's (white box), up to Amazon (Textract)/Google/Microsoft engines that we would need to do a little more work with (but have lower cost per document).
Any thoughts or suggestions?
Optical character recognition or optical character reader is the electronic or mechanical conversion of images of typed, handwritten or printed text into machine-encoded text, whether from a scanned document, a photo of a document, a scene-photo or from subtitle text superimposed on an image.
The electronic or mechanical conversion of pictures of typed text into optical character recognition or optical character readers
It's nice idea. I'm new in this field. It's great that I can learn a lot from you guys here.
I think you are a smart daredevil well worth learning. I am also trying to perfect my cuphead product, hoping to receive your good help.
There are specialized OCR engines like Tesseract, Abbyy FineReader, and Readiris that provide APIs for integration with your application. These engines offer robust OCR capabilities and can handle complex documents. They usually require more technical expertise for integration but provide flexibility and customization options.
To begin, hello there. Found this old thread and felt compelled to contribute. Consider Smart Engines' automatic document scanning and OCR technology if you're interested in incorporating smart OCR into your ROR app.
Discussion has been locked.