Jaden Norman@lemmy.world to Technology@lemmy.worldEnglish · 24 hours agoAI agents wrong ~70% of time: Carnegie Mellon studywww.theregister.comexternal-linkmessage-square129fedilinkarrow-up1664arrow-down114cross-posted to: technology@lemmy.ml
arrow-up1650arrow-down1external-linkAI agents wrong ~70% of time: Carnegie Mellon studywww.theregister.comJaden Norman@lemmy.world to Technology@lemmy.worldEnglish · 24 hours agomessage-square129fedilinkcross-posted to: technology@lemmy.ml
minus-squarePunkie@lemmy.worldlinkfedilinkEnglisharrow-up3arrow-down3·21 hours agoI’d compare LLMs to a junior executive. Probably gets the basic stuff right, but check and verify for anything important or complicated. Break tasks down into easier steps.
minus-squarezbyte64@awful.systemslinkfedilinkEnglisharrow-up1·edit-27 minutes agoA junior developer actually learns from doing the job, an LLM only learns when they update the training corpus and develop an updated model.
I’d compare LLMs to a junior executive. Probably gets the basic stuff right, but check and verify for anything important or complicated. Break tasks down into easier steps.
A junior developer actually learns from doing the job, an LLM only learns when they update the training corpus and develop an updated model.