Analog Hacker News

sgc 3 minutes ago

How does this compare to dots.ocr? I got fantastic results when I tested dots.

https://github.com/rednote-hilab/dots.ocr

hersko 39 minutes ago

I have a flow where i extract text from a pdf with pdf-parse and then feed that to an ai for data extraction. If that fails i convert it to a png and send the image for data extraction. This works very well and would presumably be far cheaper as i'm generally sending text to the model instead of relying on images. Isn't just sending the images for ocr significantly more expensive?

[-]

mimim1mi 12 minutes ago

By definition, OCR means optical character recognition. It depends on the contents of the PDF what kind of extraction methodology can work. Often some available PDFs are just scans of printed documents or handwritten notes. If machine readable text is available your approach is great.

mechazawa an hour ago

Is only bun supported or also regular node?

Show HN: Ocrbase – pdf → .md/.json document OCR and structured extraction API