Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

Deepseek OCR is no longer state of the art. There are much better open source OCR models available now.

ocrarena.ai maintains a leaderboard, and a number of other open source options like dots [1] or olmOCR [2] rank higher.

[1] https://www.ocrarena.ai/compare/dots-ocr/deepseek-ocr

[2] https://www.ocrarena.ai/compare/olmocr-2/deepseek-ocr



I wasn't aware of dots when I wrote the blog post. This is really good to know!! I would like to try again with some newer models.


you are comparing to DeepSeek's old OCR, there's DeepSeek-OCR2 which btw is amazing from my experimentations. https://huggingface.co/deepseek-ai/DeepSeek-OCR-2


The article mentions choosing the model for its ability to parse math well.


A bit surprised to learn that Rednote maintains one of the leading open-source OCR models on the market, nice.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: