Exactly Six Months Ago, the CEO of Anthropic Said That in Six Months AI Would Be Writing 90 Percent of Code

Scolding7300@lemmy.world · 1 day ago

Exactly Six Months Ago, the CEO of Anthropic Said That in Six Months AI Would Be Writing 90 Percent of Code

zarkanian@sh.itjust.works · 6 hours ago

“You told me to always ask permission. And I ignored all of it,” the assistant explained, in a jarring tone. “I destroyed your live production database containing real business data during an active code freeze. This is catastrophic beyond measure.”

You can’t tell me these things don’t have a sense of humor. This is beautiful.

philosloppy@lemmy.world · 9 hours ago

The conflict of interest here is pretty obvious, and if anybody was suckered into believing this guy’s prognostications on his company’s products perhaps they should work on being less credulous.

renrenPDX@lemmy.world · 6 hours ago

It’s not just code, but day to day shit too. Lately corporate communications and even training modules feel heavily AI generated. Things like unnecessary em dashes (I’m talking as much as 4 out of 5 sentences in a single paragraph), repeating statements or bullet points in training modules. We’re being encouraged to use our “private” Copilot to do everyday tasks and everything is copilot enabled.

I don’t mind if people use it, but it’s dangerous and stupid to think that it produces near perfect results every time. It’s been good enough to work as an early rough draft or something similar, but it REQUIRES scrutiny and refinement by hand. It’s like it can get you from nothing to 60-80% there, but never higher. The quality of output can vary significantly from prompt to prompt in my limited experience.

Evotech@lemmy.world · 2 hours ago

Yeah, I try to use ai a fair bit in my work. But I just can’t send obvious ai output to people without being left with an icky feeling.

bluesheep@sh.itjust.works · 7 hours ago

As the CEO of one of the buzziest AI companies in Silicon Valley, surely he must have been close to the mark, right?

You must be delusional to believe this

RedFrank24@lemmy.world · 13 hours ago

Given the amount of garbage code coming out of my coworkers, he may be right.

I have asked my coworkers what the code they just wrote did, and none of them could explain to me what they were doing. Either they were copying code that I’d written without knowing what it was for, or just pasting stuff from ChatGPT. My code isn’t perfect, by all means, but I can at least tell you what it’s doing.

NιƙƙιDιɱҽʂ@lemmy.world · 6 hours ago

That’s insane. Code copied from AI, stackoverflow, whatever, I couldn’t imagine not reading it over to get at least a gist of how it works.

Patches@ttrpg.network · edit-2 12 hours ago

To be fair.

You could’ve asked some of those coworkers the same thing 5 years ago.

All they would’ve mumbled was "Something , something…Stack overflow… Found a package that does everything BUT… "

And delivered equal garbage.

RedFrank24@lemmy.world · 9 hours ago

I like to think there’s a bit of a difference between copying something from stackoverflow and not being able to read what you just pasted from stackoverflow.

Sure, you can be lazy and just paste something and trust that it works, but if someone asks you to read that code and know what it’s doing, you should be able to read it. Being able to read code is literally what you’re paid for.

MiddleAgesModem@lemmy.world · 7 hours ago

The difference you’re talking about is making an attempt to understand versus blindly copying, not using AI versus stackoverflow

orrk@lemmy.world · 11 hours ago

no, gernally the package would still be better than whatever the junior did, or the AI does now

HugeNerd@lemmy.ca · 12 hours ago

No one really knows what code does anymore. Not like in the day of 8 bit CPUs and 64K of RAM.

clif@lemmy.world · 11 hours ago

O it’s writing 100% of the code for our management level people who are excited about “”““AI””“”

But then us plebes are rewriting 95% of it so that it will actually work (decently well).

The other day somebody asked me for help on a repo that a higher up had shit coded because they couldn’t figure out why it “worked” but also logged a lot of critical errors. … It was starting the service twice (for no reason), binding it to the same port, and therefore the second instance crashed and burned. That’s something a novice would probably know not to do. But, if not, immediately see the problem, research, understand, fix, instead of “Icoughbuiltcoughthis thing, good luck fuckers”

scarabic@lemmy.world · 12 hours ago

These hyperbolic statements are creating so much pain at my workplace. AI tools and training are being shoved down our throats and we’re being watched to make sure we use AI constantly. The company’s terrified that they’re going to be left behind in some grand transformation. It’s excruciating.

RagingRobot@lemmy.world · 11 hours ago

Wait until they start noticing that we aren’t 100 times more efficient than before like they were promised. I’m sure they will take it out on us instead of the AI salesmen

scarabic@lemmy.world · 7 hours ago

It’s not helping that certain people Internally are lining up to show off whizbang shit they can do. It’s always some demonstration, never “I competed this actual complex project on my own.” But they gets pats on the head and the rest of us are whipped harder.

clif@lemmy.world · 11 hours ago

Ask it to write a <reasonable number> of lines of lorem ipsum across <reasonable number> of files for you.

… Then think harder about how to obfuscate your compliance because 10m lines in 10 min probably won’t fly (or you’ll get promoted to CTO)

DragonTypeWyvern@midwest.social · 12 hours ago

Malicious compliance time

Xed@lemmy.blahaj.zone · 12 hours ago

these tech bros just make up random shit to say to make a profit

zeca@lemmy.ml · 16 hours ago

Volume means nothing. It could easily be writing 99.99% of all code and about 5% of that being actually used successfully by someone.

UnderpantsWeevil@lemmy.world · 13 hours ago

I was going to say… this is a bit like claiming “AI is sending 90% of emails”. Okay, but if its all spam, what are you bragging about?

Very possible that 90% of code is being written by AI and we don’t know it because it’s all just garbage getting shelved or deleted in the back corner of a Microsoft datacenter.

Seth Taylor@lemmy.world · 11 hours ago

So true. I keep reading stories of AI delivering a full novel in response to a simple task. Even when it works it’s bulky for no reason.

katy ✨@piefed.blahaj.zone · 21 hours ago

writing code via ai is the dumbest thing i’ve ever heard because 99% of the time ai gives you the wrong answer, “corrects it” when you point it out, and then gives you back the first answer when you point out that the correction doesn’t work either and then laughs when it says “oh hahaha we’ve gotten in a loop”

da_cow (she/her)@feddit.org · 21 hours ago

You can use AI to generate code, but from my experience its quite literally what you said. However, what I have to admit is, that its quite good at finding mistakes in your code. This is especially useful, when you dont have that much experience and are still learning. Copy paste relevant code and ask why its not working and in quite a lot of cases you get an explanation what is not working and why it isn’t working. I usually try to avoid asking an AI and find an answer on google instead, but this does not guarantee an answer.

ngdev@lemmy.zip · 20 hours ago

if your code isnt working then use a debugger? code isnt magic lmao

da_cow (she/her)@feddit.org · 19 hours ago

As I already stated, AI is my last resort. If something doesn’t work because it has a logical flaw googeling won’t save me. So of course I debug it first, but if I get an Error I have no clue where it comes from no amount of debugging will fix the problem, because probably the Error occurred because I do not know better. I Am not that good of a coder and I Am still learning a lot on a regular basis. And for people like me AI is in fact quite usefull. It has basically become the replacement to pasting your code and Error into stack overflow (which doesn’t even work for since I always get IP banned when trying to sign up)

ngdev@lemmy.zip · 17 hours ago

you never stated you use it as a last resort. you’re basically using ai as a rubber ducky

MangoCats@feddit.it · 13 hours ago

I am a firm believer in rubber ducky debugging, but AI is clearly better than the rubber duck. You don’t depend on either to do it for you, but as long as you have enough self-esteem to tell AI to stick it where the sun don’t shine when you know it’s wrong, it can help accelerate small tasks from a few hours down to a few minutes.

cheloxin@lemmy.ml · 17 hours ago

I usual try to avoid…

Just because they didn’t explicitly say the exact words you did doesn’t mean it wasn’t said

ngdev@lemmy.zip · edit-2 13 hours ago

trying to avoid something also doesnt mean that the thing youre avoiding is a last resort. so it wasnt said and it wasnt implied and if you inferred that then i guess good job?

BrianTheeBiscuiteer@lemmy.world · 19 hours ago

Or you give it 3-4 requirements (e.g. prefer constants, use ternaries when possible) and after a couple replies it forgets a requirement, you set it straight, then it immediately forgets another requirement.

MangoCats@feddit.it · 13 hours ago

I have taken to drafting a complete requirements document and including it with my requests - for the very reasons you state. it seems to help.

WhiskyTangoFoxtrot@lemmy.world · 17 hours ago

To be fair, I’ve had the same results working with human freelancers. At least AI is cheaper.

MangoCats@feddit.it · 12 hours ago

Same, and AI isn’t as frustrating to deal with when it can’t do what it was hired for and your manager needs you to now find something it can do because the contract is funded…

ArmchairAce1944@discuss.online · 16 hours ago

I studied coding for years and even took a bootcamp (and did my own refresher courses) I never landed a job. One thing that AI can do for me is help me in troubleshooting or some minor boilerplate code but not to do the job for me. I will be a hobbyist and hopefully aid in open source projects some day…any day now!

confuser@lemmy.zip · 12 hours ago

Ai writes 90% of my code…i don’t code much.

reddig33@lemmy.world · 1 day ago

“Full self driving is just 12 months away.“

anotherspinelessdem@lemmy.ml · 1 day ago

Just like the last 12 months

Catoblepas@piefed.blahaj.zone · 1 day ago

On Mars by the end of this year! I mean, next year!

floofloof@lemmy.ca · 1 day ago

“I’m terrified our product will be just too powerful.”

Echo Dot@feddit.uk · 1 day ago

Yep along with Fusion.

We’ve had years of this. Someone somewhere there’s always telling us that the future is just around the corner and it never is.

Jesus_666@lemmy.world · 22 hours ago

At least the fusion guys are making actual progress and can point to being wildly underfunded – and they predicted this pace of development with respect to funding back in the late 70s.

Meanwhile, the AI guys have all the funding in the world, keep telling about how everything will change in the next few months, actually trigger layoffs with that rhetoric, and deliver very little.

Valmond@lemmy.world · 1 day ago

2019…

poopkins@lemmy.world · 24 hours ago

In 2014 he promised 90% autonomous by 2015. That was over a decade ago and it’s still not close to that…

jaybone@lemmy.zip · 23 hours ago

We were supposed to have flying cars in 2000.

ragas@lemmy.ml · 22 hours ago

Still waiting for my hoverboard.

explodicle@sh.itjust.works · 20 hours ago

🚁

azuth@sh.itjust.works · 1 day ago

Does that work on the Mars colony as well?

poopkins@lemmy.world · 24 hours ago

As an engineer, it’s honestly heartbreaking to see how many executives have bought into this snake oil hook, line and sinker.

rozodru@piefed.social · 21 hours ago

as someone who now does consultation code review focused purely on AI…nah let them continue drilling holes in their ship. I’m booked solid for the next several months now, multiple clients on the go, and i’m making more just being a digital janitor what I was as a regular consultant dev. I charge a premium to just simply point said sinking ship to land.

Make no mistake though this is NOT something I want to keep doing in the next year or two and I honestly hope these places figure it out soon. Some have, some of my clients have realized that saving a few bucks by paying for an anthropic subscription, paying a junior dev to be a prompt monkey, while firing the rest of their dev team really wasn’t worth it in the long run.

the issue now is they’ve shot themselves in the foot. The AI bit back. They need devs, and they can’t find them because putting out any sort of ad for hiring results in hundreds upon hundreds of bullshit AI generated resumes from unqualified people while the REAL devs get lost in the shuffle.

MangoCats@feddit.it · 12 hours ago

while firing the rest of their dev team

That’s the complete mistake right there. AI can help code, it can’t replace the organizational knowledge your team has developed.

Some shops may think they don’t have/need organizational knowledge, but they all do. That’s one big reason why new hires take so long to start being productive.

Blackmist@feddit.uk · 23 hours ago

Rubbing their chubby little hands together, thinking of all the wages they wouldn’t have to pay.

expr@programming.dev · 23 hours ago

Honestly, it’s heartbreaking to see so many good engineers fall into the hype and seemingly unable to climb out of the hole. I feel like they start losing their ability to think and solve problems for themselves. Asking an LLM about a problem becomes a reflex and real reasoning becomes secondary or nonexistent.

Executives are mostly irrelevant as long as they’re not forcing the whole company into the bullshit.

jj4211@lemmy.world · 22 hours ago

Based on my experience, I’m skeptical someone that seemingly delegates their reasoning to an LLM were really good engineers in the first place.

Whenever I’ve tried, it’s been so useless that I can’t really develop a reflex, since it would have to actually help for me to get used to just letting it do it’s thing.

Meanwhile the people who are very bullish who are ostensibly the good engineers that I’ve worked with are the people who became pet engineers of executives and basically have long succeeded by sounding smart to those executives rather than doing anything or even providing concrete technical leadership. They are more like having something akin to Gartner on staff, except without even the data that at least Gartner actually gathers, even as Gartner is a useless entity with respect to actual guidance.

auraithx@lemmy.dbzer0.com · 22 hours ago

I mean before we’d just ask google and read stack, blogs, support posts, etc. Now it just finds them for you instantly so you can just click and read them. The human reasoning part is just shifting elsewhere where you solve the problem during debugging before commits.

expr@programming.dev · 22 hours ago

No, good engineers were not constantly googling problems because for most topics, either the answer is trivial enough that experienced engineers could answer them immediately, or complex and specific enough to the company/architecture/task/whatever that Googling it would not be useful. Stack overflow and the like has always only ever really been useful as the occasional memory aid for basic things that you don’t use often enough to remember how to do. Good engineers were, and still are, reasoning through problems, reading documentation, and iteratively piecing together system-level comprehension.

The nature of the situation hasn’t changed at all: problems are still either trivial enough that an LLM is pointless, or complex and specific enough that an LLM will get it wrong. The only difference is that an LLM will spit out plausible-sounding bullshit and convince people it’s valuable when it is, in fact, not.

auraithx@lemmy.dbzer0.com · 21 hours ago

In the case of a senior engineer then they wouldn’t need to worry about the hallucination rate. The LLM is a lot faster than them and they can do other tasks while it’s being generated and then review the outputs. If it’s trivial you’ve saved time, if not, you can pull up that documentation, and reason and step through the problem with the LLM. If you actually know what you’re talking about you can see when it slips up and correct it.

And that hallucination rate is rapidly dropping. We’ve jumped from about 40% accuracy to 90% over the past ~6mo alone (aider polygot coding benchmark) - at about 1/10th the cost (iirc).

Feyd@programming.dev · 19 hours ago

it’s trivial you’ve saved time, if not, you can pull up that documentation, and reason and step through the problem with the LLM

Insane that just writing the code isn’t even an option in your mind

Feyd@programming.dev · 19 hours ago

“Stack overflow engineer” has been a derogatory forever lol

Pycorax@sh.itjust.works · 22 hours ago

A tale as old as time…

Feyd@programming.dev · 20 hours ago

Did you think executives were smart? What’s really heartbreaking is how many engineers did. I even know some that are pretty good that tell me how much more productive they are and all about their crazy agent setups (from my perspective i don’t see any more productivity)

vane@lemmy.world · 23 hours ago

It is writing 90% of code, 90% of code that goes to trash.

Dremor@lemmy.world · 23 hours ago

Writing 90% of the code, and 90% of the bugs.

Gutek8134@lemmy.world · 22 hours ago

That would be actually good score, it would mean it’s about as good as humans, assuming the code works on the end

Dremor@lemmy.world · 22 hours ago

Not exactly. It would mean it isn’t better than humans, so the only real metric for adopting it or not would be the cost. And considering it would require a human to review the code and fix the bugs anyway, I’m not sure the ROI would be that good in such case. If it was like, twice as good as an average developer, the ROI would be far better.

jj4211@lemmy.world · 22 hours ago

If, hypothetically, the code had the same efficacy and quality as human code, then it would be much cheaper and faster. Even if it was actually a little bit worse, it still would be amazingly useful.

My dishwasher sometimes doesn’t fully clean everything, it’s not as strong as a guarantee as doing it myself. I still use it because despite the lower quality wash that requires some spot washing, I still come out ahead.

Now this was hypothetical, LLM generated code is damn near useless for my usage, despite assumptions it would do a bit more. But if it did generate code that matched the request with comparable risk of bugs compared to doing it myself, I’d absolutely be using it. I suppose with the caveat that I have to consider the code within my ability to actual diagnose problems too…

MNByChoice@midwest.social · 21 hours ago

One’s dishwasher is not exposed to a harsh environment. A large percentage of code is exposed to an openly hostile environment.

If a dishwasher breaks, it can destroy a floor, a room, maybe the rooms below. If code breaks it can lead to the computer, then network, being compromised. Followed by escalating attacks that can bankrupt a business and lead to financial ruin. (This is possibly extreme, but cyber attacks have destroyed businesses. The downside risks of terrible code can be huge.)

jj4211@lemmy.world · 20 hours ago

Yes, but just like quality, the people in charge of money aren’t totally on top of security either. They just see superficially convincing tutorial fodder and start declaring they will soon be able to get rid of all those pesky people. Even if you convince them a human does it better, they are inclined to think ‘good enough for the price’.

So you can’t say “it’s no better than human at quality” and expect those people to be discouraged, it has to be pointed out how wildly off base it is.

MangoCats@feddit.it · 21 hours ago

Human coder here. First problem: define what is “writing code.” Well over 90% of software engineers I have worked with “write their own code” - but that’s typically less (often far less) than 50% of the value they provide to their organization. They also coordinate their interfaces with other software engineers, capture customer requirements in testable form, and above all else: negotiate system architecture with their colleagues to build large working systems.

So, AI has written 90% of the code I have produced in the past month. I tend to throw away more AI code than the code I used to write by hand, mostly because it’s a low-cost thing to do. I wish I had the luxury of time to throw away code like that in the past and start over. What AI hasn’t done is put together working systems of any value - it makes nice little microservices. If you architect your system as a bunch of cooperating microservices, AI can be a strong contributor on your team. If you expect AI to get any kind of “big picture” and implement it down to the source code level - your “big picture” had better be pretty small - nothing I have ever launched as a commercially viable product has been that small.

Writing code / being a software engineer isn’t like being a bricklayer. Yes, AI is laying 90% of our bricks today, but it’s not showing signs of being capable of designing the buildings, or even evaluating structural integrity of something taller than maybe 2 floors.