You could’ve just looked for off the shelf OCR software and it would probably be better, no LLM needed. OCR has been around for far longer than the current LLM bubble.
No. There’s nothing to argue there, it’s the definition of OCR.
Also, do you believe that LLMs found a new, novel way of doing OCR? That’s not how they work, LLMs don’t invent, they don’t innovate, they’re simply unable to do that. What they do, when they work correctly, is that they use already known and established techniques and tools. So to quote your top comment in this chain:
I did, it wasn’t better. What “off the shelf” OCR software are you talking about? I tried EasyOCR and PaddleOCR. Llama 4 Maverick has been more accurate.
You could’ve just looked for off the shelf OCR software and it would probably be better, no LLM needed. OCR has been around for far longer than the current LLM bubble.
deleted by creator
You’re reading text from a picture. That is OCR.
deleted by creator
No. There’s nothing to argue there, it’s the definition of OCR.
Also, do you believe that LLMs found a new, novel way of doing OCR? That’s not how they work, LLMs don’t invent, they don’t innovate, they’re simply unable to do that. What they do, when they work correctly, is that they use already known and established techniques and tools. So to quote your top comment in this chain:
deleted by creator
deleted by creator
I did, it wasn’t better. What “off the shelf” OCR software are you talking about? I tried EasyOCR and PaddleOCR. Llama 4 Maverick has been more accurate.