• loonsun@sh.itjust.works
      link
      fedilink
      English
      arrow-up
      4
      ·
      31 minutes ago

      It’s about Agents, which implies multi step as those are meant to execute a series of tasks opposed to studies looking at base LLM model performance.