You could’ve just looked for off the shelf OCR software and it would probably be better, no LLM needed. OCR has been around for far longer than the current LLM bubble.
Yea you could argue semantically that using an LLM to turn text in an image into machine readable format falls within “Optical Character Recognition”. I was referring specifically to OCR algorithms like Tesseract (pytesseract) and EasyOCR.
You could’ve just looked for off the shelf OCR software and it would probably be better, no LLM needed. OCR has been around for far longer than the current LLM bubble.
No, I tried OCR and it was less accurate.
You’re reading text from a picture. That is OCR.
Yea you could argue semantically that using an LLM to turn text in an image into machine readable format falls within “Optical Character Recognition”. I was referring specifically to OCR algorithms like Tesseract (pytesseract) and EasyOCR.
deleted by creator