• TheAgeOfSuperboredom@lemmy.ca
      English
      arrow-up
      41
      arrow-down
      2
      ·
      5 months ago

      It’s because of all the people saying that LLMs can reason and think and the human brain works just like an LLM and… some other ridiculous claim.

      This shows some limitations on LLMs.

    • A7thStone@lemmy.world
      English
      arrow-up
      15
      arrow-down
      2
      ·
      5 months ago

      Why are so many people mad when it’s pointed out that the shitty chatbots are just shitty chatbots?

    • EvilBit@lemmy.world
      English
      arrow-up
      11
      ·
      5 months ago

      Now apply this to like, everything else ever.

      Machine designed to convincingly fake human internet conversation sucks at ____________!

    • dantheclamman@lemmy.worldOP
      English
      arrow-up
      13
      arrow-down
      5
      ·
      5 months ago

      I knew there would be these kinds of comments making this obvious point. This is just a demo of how these language models are not going to achieve the “General” part of AGI. It’s going to take a new paradigm.

    • BrianTheeBiscuiteer@lemmy.world
      English
      arrow-up
      3
      arrow-down
      1
      ·
      5 months ago

      Too many people forget that specialized, purpose-driven software is often more effective and efficient. LLMs and other AI are nice when you don’t have a properly defined spec or a flexible algorithm, but you pay, literally, for the convenience.

  • Chloé 🥕@lemmy.blahaj.zone
    English
    arrow-up
    17
    arrow-down
    2
    ·
    5 months ago

    I think people in the replies acting fake surprised are missing the point.

    it is important news, because many people see LLMs as black boxes of superintelligence (almost as if that’s what they’re being marketed as!)

    you and i know that’s bullshit, but the students asking chatgpt to solve their math homework instead of using wolfram alpha don’t.

    so yes, it is important to demonstrate that this “artificial intelligence” is so much not an intelligence that it’s getting beaten by 1979 software on 1977 hardware

  • flamingo_pinyata@sopuli.xyz
    English
    arrow-up
    17
    arrow-down
    7
    ·
    5 months ago

    A chess-specific algorithm beat a language model at chess. Shocking!

    Try training a chess model. Actually I think it’s already been done, machines have been consistently better at chess than humans for a while now.

  • kbal@fedia.io
    arrow-up
    7
    arrow-down
    3
    ·
    5 months ago

    I’m shocked! — shocked to find that LLMs aren’t superhuman intelligences that will soon enslave us all. Other things they’re not good at:

    • Summarizing news articles. Instead of an actual summary they’ll shorten the text by just leaving things out, without any understanding of which parts are important.
    • Answering questions about anything controversial. Based on subtle hints in the wording of your question they’ll reflect your own biases back at you.
    • Answering questions about well-known facts. Seemingly at random when your question isn’t phrased exactly the right way they’ll start hallucinating and make up plausible bullshit in place of actual answers.
    • Writing a letter. They’ll use the wrong tone, use language that is bland and generic to a degree that makes it almost offensive, and if you care about quality the whole thing will need so much re-writing that it’s quicker to do it yourself from the start.
    • Telling jokes. They don’t really get humour. Their jokes tend to have things that superficially look as if they should be punchlines but aren’t funny at all.
    • Writing computer code. Correcting their mistakes is even more laborious in computer languages. Most of the time they’re almost as bad at it as they are at playing chess.

    Still they are amazingly clever in some ways and pretty good for coming up with random ideas when you’ve got writer’s block or something.

  • givesomefucks@lemmy.world
    English
    arrow-up
    6
    arrow-down
    3
    ·
    5 months ago

    Although the chatbot had been given a “baseline board” to learn the game and identify pieces, it kept mixing up rooks and bishops, misread moves, and “repeatedly lost track” of where its pieces were. To make matters worse, as Caruso explained, ChatGPT also blamed Atari’s icons for being “too abstract to recognize” — but when he switched the game over to standard notation, it didn’t perform any better.

    For an hour-and-a-half, ChatGPT “made enough blunders to get laughed out of a 3rd grade chess club” while insisting over and over again that it would win “if we just started over,” Caruso noted. (And yes, it’s kind of creepy that the chatbot apparently referred to itself and the human it was interfacing with as “we.”)

    It’s fucking insane it couldn’t keep track of a board…

    And it’s concerning how confident it is that it will work, because the idiots asking it stuff will believe it. It’ll keep failing and keep saying next time will work, because it’s built to maximize engagement.

    • Pennomi@lemmy.world
      English
      arrow-up
      4
      ·
      5 months ago

      Spatial reasoning has always been a weakness of LLMs. Other symptoms include the inability to count and no concept of object permanence.

      • givesomefucks@lemmy.world
        English
        arrow-up
        1
        arrow-down
        4
        ·
        5 months ago

        Yeah, but it’s chess…

        The LLM doesn’t have to imagine a board, if you feed it the rules of chess and the dimensions of the board it should be able to “play in its head”.

        For a human to have that kind of working memory would be a genius level intellect and years of practice at the game.

        But human working memory is shit compared to virtually every other animal. This and processing speed is supposed to be AI’s main draw.

        • Rhaedas@fedia.io
          arrow-up
          4
          arrow-down
          1
          ·
          5 months ago

          LLMs can be good at openings. Not because they are thinking through the rules or planning strategies, but because opening moves are likely to appear in most general training data from various sources. The model is copying the most probable reaction to your move, based on lots of documentation. This can of course break down when you stray from a typical play style, as it has fewer probable options to choose from, and only a few moves in there won’t be any left, since there’s a huge number of possible moves.

          I.e., there are no calculations involved. When you play an LLM at chess, you’re playing a list of common moves in history.

          An even simpler example would be to tell the LLM that its last move was illegal. Even knowing the rules you just told it, it will agree and take it back. This comes from being trained to give satisfying replies to a human prompt.
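
          A deliberately crude sketch of the "list of common moves" idea, in Python. This is an analogy only: a real LLM is not a lookup table, and the five toy games and every name below are made up for illustration. It counts which move most often followed a given move sequence, and has nothing to offer once the sequence leaves its data. Note that it never checks whether a move is legal.

```python
from collections import Counter, defaultdict

# Hypothetical toy "training data": a handful of opening move sequences.
GAMES = [
    ["e4", "e5", "Nf3", "Nc6", "Bb5"],
    ["e4", "e5", "Nf3", "Nc6", "Bc4"],
    ["e4", "e5", "Nf3", "Nc6", "Bb5"],
    ["e4", "c5", "Nf3", "d6"],
    ["d4", "d5", "c4", "e6"],
]


def build_table(games):
    """Count which move followed each move-sequence prefix in the data."""
    table = defaultdict(Counter)
    for game in games:
        for i, move in enumerate(game):
            table[tuple(game[:i])][move] += 1
    return table


def most_common_reply(table, history):
    """Return the most frequent continuation, or None if the prefix was never seen."""
    seen = table.get(tuple(history))
    if not seen:
        return None
    return seen.most_common(1)[0][0]


TABLE = build_table(GAMES)
print(most_common_reply(TABLE, ["e4", "e5", "Nf3", "Nc6"]))  # Bb5
print(most_common_reply(TABLE, ["e4", "e5", "Nf3", "a6"]))   # None
```

          One unusual move (3...a6 here) and the table is out of material, which is the breakdown described above, just far more abrupt than in a real model.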

        • Jerkface (any/all)@lemmy.ca
          English
          arrow-up
          4
          arrow-down
          1
          ·
          5 months ago

          It doesn’t have a head like that. It places things in a conceptual space, not a numerical space. To it, a number is just an adjective, like a colour. It is learning to play chess by looking for language-like patterns in the game’s transcript. It is never attempting to model the contents of the board in its “mind”.

        • PlzGivHugs@sh.itjust.works
          English
          arrow-up
          4
          arrow-down
          2
          ·
          5 months ago

          The LLM doesn’t have to imagine a board, if you feed it the rules of chess and the dimensions of the board it should be able to “play in its head”.

          That assumes it knows how to play chess. It doesn’t. It knows how to have a passable conversation. Asking it to play chess is like putting bread into a blender and being confused when it doesn’t toast.

          But human working memory is shit compared to virtually every other animal. This and processing speed is supposed to be AI’s main draw.

          Processing speed and memory in the context of writing. Give it a bunch of chess boards or chess notation and it has no idea which it needs to remember, let alone where/how to move. If you want an AI to play chess, you train it on chess gameplay, not books and Reddit comments. AI isn’t a general use tool.

          • givesomefucks@lemmy.world
            English
            arrow-up
            1
            arrow-down
            4
            ·
            5 months ago

            if you feed it the rules of chess and the dimensions of the board it should be able to “play in its head”.

            You’d save a lot of time typing, if you spent a little more reading…

            • PlzGivHugs@sh.itjust.works
              English
              arrow-up
              3
              arrow-down
              1
              ·
              edit-2
              5 months ago

              You seem to be missing what I’m saying. Maybe a biological comparison would help:

              An octopus is extremely smart, more so than even most mammals. It can solve basic logic puzzles, learn and navigate complex spaces, and plan and execute different and adaptive strategies to hunt prey. In spite of this, it can’t talk or write. No matter what you do, training it, trying to teach it, or even trying to develop an octopus-specific language, it will not be able to understand language. This isn’t because the octopus isn’t smart, it’s because it’s evolved for the purpose of hunting food and hiding from predators. Its brain has developed to understand how physics works and how to recognize patterns, but it just doesn’t have the ability to understand how to socialize, and nothing can change that short of rewiring its brain. Hand it a letter and it’ll try and catch fish with it rather than even considering trying to read it.

              AI is almost the reverse of this. An LLM has “evolved” (been trained) to write stuff that sounds good, but has little emphasis on understanding what it writes. The “understanding” is more about patterns in writing than underlying logic. This means that if the LLM encounters something that isn’t standard language, it will “flail” and start trying to apply what it knows, regardless of how well it applies. In the chess example, this might be, for example, just trying to respond with the most common move, regardless of whether it can be played. Ultimately, no matter what you input into it, an LLM is trying to find and replicate patterns in language, not underlying logic.

  • Opinionhaver@feddit.uk
    English
    arrow-up
    5
    arrow-down
    2
    ·
    edit-2
    5 months ago

    It’s AI, not AGI. LLMs are good at generating language just like chess engines are good at chess. ChatGPT doesn’t have the capability to keep track of all the pieces on the board.

    • dantheclamman@lemmy.worldOP
      English
      arrow-up
      2
      ·
      5 months ago

      They’re literally selling to credulous investors that AGI is around the corner, when this and, to a lesser extent, Large Action Models are the only viable products they’ve got. It’s just a demo of how far they are from their promises.

      • Opinionhaver@feddit.uk
        English
        arrow-up
        1
        ·
        5 months ago

        Is there a link where I could see them making these claims myself? This is something I’ve only heard from AI critics, but never directly from the AI companies themselves. I wouldn’t be surprised if they did, but I’ve just never seen them say it outright.

    • Pilferjinx@lemmy.world
      English
      arrow-up
      1
      ·
      5 months ago

      LLMs would be great as an interface to more specialized machine learning programs in a combined platform. We need AI to perform tasks humans aren’t capable of instead of replacing them.
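
      A minimal sketch of that "interface" idea in Python, under loud assumptions: `fake_llm_router` is a keyword-matching stand-in for the LLM, and the specialized program is a tiny exact calculator rather than a machine learning model. All names are hypothetical. The language layer only decides which tool fits and what to pass it, and the tool does the actual work.

```python
import ast
import operator

# Arithmetic operators the specialized tool is willing to evaluate.
_OPS = {
    ast.Add: operator.add,
    ast.Sub: operator.sub,
    ast.Mult: operator.mul,
    ast.Div: operator.truediv,
}


def calculator(expression: str) -> str:
    """Specialized tool: evaluate +, -, *, / arithmetic exactly."""
    def ev(node):
        if isinstance(node, ast.Constant) and isinstance(node.value, (int, float)):
            return node.value
        if isinstance(node, ast.BinOp) and type(node.op) in _OPS:
            return _OPS[type(node.op)](ev(node.left), ev(node.right))
        raise ValueError("unsupported expression")
    return str(ev(ast.parse(expression, mode="eval").body))


def fake_llm_router(request: str):
    """Stand-in for the LLM: pick a tool and extract its argument."""
    request = request.strip()
    prefix = "what is "
    if request.lower().startswith(prefix):
        return "calculator", request[len(prefix):].rstrip("?")
    return "chat", request


TOOLS = {
    "calculator": calculator,
    "chat": lambda text: "(free-form reply would go here)",
}


def answer(request: str) -> str:
    """Route the request to a tool and return that tool's result."""
    tool_name, argument = fake_llm_router(request)
    return TOOLS[tool_name](argument)


print(answer("What is 12 * (3 + 4)?"))  # 84
```

      Swapping the calculator for a chess engine or another specialized model would follow the same shape: the router changes, the dispatch does not.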

  • acargitz@lemmy.ca
    English
    arrow-up
    3
    arrow-down
    1
    ·
    edit-2
    5 months ago

    This is useful for dispelling the hype around ChatGPT and for demonstrating the limits of general purpose LLMs.

    But that’s about it. This is not a “win” for old school game engines vs new ones. Stockfish pairs classical alpha-beta search with a neural-network evaluation (NNUE) and is one of the strongest chess engines in the world.

    EDIT: what would be actually interesting would be to see if GPT could be fine-tuned to play chess. Which is something many people have been doing: https://scholar.google.com/scholar?hl=en&q=finetune+gpt+chess