American companies are spending enormous sums to develop high-performing AI models. Distillation attacks are attempting to maliciously extract them — and nobody is doing much to stop it.

  • Fushuan [he/him]@lemmy.blahaj.zone
    link
    fedilink
    English
    arrow-up
    4
    arrow-down
    1
    ·
    5 hours ago

    Nobody is doing much to stop the American AI companies crawling the web to scrap tons of licensed content to illegally use in their training, either.

  • M1k3y@discuss.tchncs.de
    link
    fedilink
    English
    arrow-up
    8
    ·
    8 hours ago

    Oh no, wouldn’t anyone think of the billion dollar companies? The Chinese are stealing the models that they have spent so much effort on getting all the training data. What a shame.

  • Draconic NEO@lemmy.dbzer0.com
    link
    fedilink
    English
    arrow-up
    9
    ·
    13 hours ago

    If they open source it then that’s a win. Closed source models don’t help anyone because when the company goes bust they need to be reinvented again. People like to talk about the advancements capitalist industry has made but if they never publish any of it because of “tRaDE sEcrETs” they might as well have never done it because the next person will have to reinvent it when they go bust or kill it for money.

  • P03 Locke@lemmy.dbzer0.com
    link
    fedilink
    English
    arrow-up
    24
    ·
    20 hours ago

    Whoever wrote this article didn’t even bother to do the most basic of research.

    DeepSeek fully admitted they started with ChatGPT outputs to train its model. And then they released it as an open-source model, so that everybody else can “steal” their work. On the image/video front, the general public has created every possible variation on top of every model you can think of. On top of that, any model that has ever been released with full weights has been spun into whatever variation or VRAM size you want.

    The ugly truth that the American companies want to hide is the fact that they are spending trillions of dollars on an oligopoly that they can’t keep long-term. They hope that they can just keep spending more money to add more billions of parameters to their models, and keep technologically competitive with the secondary open-source models. But, they’ve already ran into diminishing returns over a year ago, and the global compute sector physically cannot keep up with demand for another cycle of even more diminishing returns.

    The other factor is that realistic miniaturization of models is already here. Some of the smaller sizes aren’t as effective as the 250GB models they use on cloud-based services, but you can still do a lot with a 16GB or 24GB video card, using models of those sizes. Optimization and LLM quantization is getting better and better each year. The AI bubble burst is going to force a cascade shift into a new era of localization. Everybody is sick to fucking death of renting and subscribing to everything. Us pirates already do so on the media front, and soon localization of LLMs is going to become way more popular.

    The question isn’t “Can people steal the tech?”. It’s “how long will people notice that it’s already happening?”

  • infinitesunrise@slrpnk.net
    link
    fedilink
    English
    arrow-up
    45
    arrow-down
    2
    ·
    23 hours ago

    I would reckon that China is perfectly satisfied to let us be the sole host of the thing that is rapidly destroying our economy and trust in all media from the inside out.

      • infinitesunrise@slrpnk.net
        link
        fedilink
        English
        arrow-up
        7
        ·
        edit-2
        14 hours ago

        Most of that goes toward implementation (data centers) and chip manufacturing. China is making money on compute services and maintaining capability parity on software the good old fashioned pirate way merely to prevent a technology gap with the US, as is their way.

        • Jiggle_Physics@piefed.zip
          link
          fedilink
          English
          arrow-up
          1
          ·
          12 hours ago

          That is not “allowing us to be the sole host of the thing that is destroying our economy and trust in all media from inside out”. That is keeping parity with it. China is also having major issues with fabricated media from AI. The Chinese government has also latched on to AI, as many others, to manipulate media, and many other police state things. Their economy is heavily, heavily, invested in the success of llms. When this bubble bursts, it will be bad for every major economy on earth, as they are all disproportionately invested in this.

  • humanspiral@lemmy.ca
    link
    fedilink
    English
    arrow-up
    9
    ·
    19 hours ago

    Models getting better does give extra information for making newer models better too. China publishes far more advanced research than US models “steal”, and they open source exceptionally strong/fast models that US can also steal from.

    • PatheticGroundThing@beehaw.org
      link
      fedilink
      English
      arrow-up
      2
      ·
      2 hours ago

      Some of the terms that have been coined to describe stuff related to AI are just so funny.

      “Prompt injection attack”, also known as… asking nicely for the chatbot to do a thing.

  • fckreddit@lemmy.ml
    link
    fedilink
    English
    arrow-up
    13
    ·
    23 hours ago

    Yeah, because American LLMs are so immensely useful that people are throwing money at them.

  • Sims@lemmy.ml
    link
    fedilink
    English
    arrow-up
    10
    arrow-down
    2
    ·
    21 hours ago

    yadayada, more moronic ‘China baad’ propaganda.