Four months ago, we asked “Are LLMs making Stack Overflow irrelevant?” Data at the time suggested that the answer is likely “yes”:

  • ramble81@lemm.ee
    14 days ago

    So here’s what I don’t get. LLMs were trained on data from places like SO. SO starts losing users, and thus content. Content that LLMs ingest to stay relevant.

    So where will LLMs get their content after a certain point? Especially for new things that may come out or unique situations. It’s not like it’ll scrape the answer from a web page if people are just asking LLMs.

    • db0@lemmy.dbzer0.com
      13 days ago

      The need for the service that SO provided won’t go away. Eventually people will migrate to new places to discuss. LLM creators will either constantly scrape those as well, forcing them to implement more and more countermeasures and GenAI-poison, or the services themselves will enshittify and sell our content (i.e. the commons) to LLM-creators.

    • fubarx@lemmy.world
      13 days ago

      Same question applies to all the other websites out there being mined to train LLMs. Google search Overviews remove the need for people to visit linked sites. Traffic plummets. Ads dry up, and the sites go out of business. No new content to train on 🤷🏻‍♂️

    • vala@lemmy.world
      13 days ago

      You are assuming that people act in logical ways.

      This is only a problem right now if you think about it.

    • dantheclamman@lemmy.world
      12 days ago

      They’re probably hoping to use people’s submitted code for training. But that seems like it will yield diminishing returns.