• Phoenixz
        link
        fedilink
        English
        52 months ago

        Question: as i understood it so far, this thing is open source and so is the dataset.

        With that, why would it still obey Chinese censorship?

        • @thedarkfly@feddit.nl
          link
          fedilink
          English
          72 months ago

          Even though it’s magnitudes lower than comparable models, Deepseek still cost millions to train. Unless someone’s willing to invest this just to retrain it from scratch, you’re left with the alignment of its trainers.

          • Phoenixz
            link
            fedilink
            English
            11 month ago

            Good point.

            Is the training set malleable, though? Could you give it some additional rules to basically sidestep this?

            • @thedarkfly@feddit.nl
              link
              fedilink
              English
              11 month ago

              Yeah, I guess you could realign it without retraining the whole thing! Dunno what would be the cost though, sometimes this is done with a cohort of human trainers 😅

              • Phoenixz
                link
                fedilink
                English
                11 month ago

                I feel like we’re talking about a guard dog now…

        • @Jackinopolis@sh.itjust.works
          link
          fedilink
          English
          12 months ago

          It’s baked into the training. It’s not a simple thing to take it out. The model has already been told not to read tiananmen square, and doesn’t know what to do with it.

    • @TheGrandNagus@lemmy.world
      link
      fedilink
      English
      13
      edit-2
      2 months ago

      Wouldn’t be surprised if you had to work around the filter.

      Generate a cartoonish yellow bear who wears a red t-shirt and nothing else

    • sunzu2
      link
      fedilink
      42 months ago

      if it is anything like LLMs, then only local ;)

      However, the Proper nomenclature is sheepooh, thank you for your compliance going forward, comrade.

  • @simple@lemm.ee
    link
    fedilink
    English
    282 months ago

    The image generation is really bad. Image description capabilities seem good but it’ll take time to see if it’s better than what already exists.

    They probably just put it out to keep the hype going.

    • @jacksilver@lemmy.world
      link
      fedilink
      English
      212 months ago

      Yeah, even the cherry picked examples they provide look only okay.

      To be honest everything with this company feels like an ad campaign more than anything else.

      • @essteeyou@lemmy.world
        link
        fedilink
        English
        102 months ago

        Everything from nearly every company feels like an ad campaign. Companies advertise themselves.

        At least with open source stuff there’s somewhat of a public benefit.

  • Altima NEO
    link
    fedilink
    English
    12 months ago

    Now if they’ll do a video model…

    Tencents Huanyuan is surprisingly flexible