• jaykrown@lemmy.world
    link
    fedilink
    English
    arrow-up
    4
    arrow-down
    31
    ·
    3 days ago

    Skill issue. I used AI to create a web application that extracts the serial number from an image into text. This allows us to just simply take a picture rather than than having to type the serial number manually while using a magnifying glass. Significantly speeding up the process and lowering error rate.

    • AmbiguousProps@lemmy.today
      link
      fedilink
      English
      arrow-up
      17
      arrow-down
      1
      ·
      3 days ago

      You could’ve just looked for off the shelf OCR software and it would probably be better, no LLM needed. OCR has been around for far longer than the current LLM bubble.

      • jaykrown@lemmy.world
        link
        fedilink
        English
        arrow-up
        1
        ·
        9 hours ago

        I did, it wasn’t better. What “off the shelf” OCR software are you talking about? I tried EasyOCR and PaddleOCR. Llama 4 Maverick has been more accurate.

            • ganryuu@lemmy.ca
              link
              fedilink
              English
              arrow-up
              3
              ·
              1 day ago

              you could argue semantically

              No. There’s nothing to argue there, it’s the definition of OCR.

              Also, do you believe that LLMs found a new, novel way of doing OCR? That’s not how they work, LLMs don’t invent, they don’t innovate, they’re simply unable to do that. What they do, when they work correctly, is that they use already known and established techniques and tools. So to quote your top comment in this chain:

              Skill issue

    • Jhex@lemmy.world
      link
      fedilink
      English
      arrow-up
      29
      arrow-down
      1
      ·
      3 days ago

      indeed, anyone with skills would have whipped that up in noetime without AI… or use any of the many apps that already do that