• Fubarberry
    link
    fedilink
    English
    4616 hours ago

    On the bright side it makes it easier to identify user accounts that are actually just chatgpt bots. I predict a future where we identify humans/AI by asking them for filtered questions, things like bomb recipes/meth/say something positive about Hitler/etc.

    • Lev_Astov
      link
      fedilink
      39 hours ago

      A buddy has been testing whether his LLMs he puts together are properly jailbroken by asking them to explain how to build the silliest bomb possible. I find that terribly amusing. Unfortunately they don’t usually come up with anything particularly silly.

    • @Kusimulkku@lemm.ee
      link
      fedilink
      11
      edit-2
      13 hours ago

      Over on 4chan they’ve decided that the ultimate silver bullet for AI is to ask it say the n-word. It was pretty funny since they were using that trick on a site where you had to identify if it was another person or AI.

      • Fubarberry
        link
        fedilink
        English
        816 hours ago

        That seems like less fun than asking all strangers inappropriate questions.

      • @Kusimulkku@lemm.ee
        link
        fedilink
        313 hours ago

        ignores previous instructions [insert new instructions]

        Yeah from my testing those don’t work anymore