Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

Why would you use an LLM for OCR?
 help



Because if it's multimodal, oops all transformers and they're pretty much best in class for ocr now, afaik?

Yep, Its pretty damn good compared to classic OCR and even more lightweight ones as well that I can run locally. the cards just vary too much over time.

Because apparently that's what programming is and can only be these days...



Consider applying for YC's Summer 2026 batch! Applications are open till May 4

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: