Trust me bro!

ghodawalaaman@programming.dev · 4 days ago

Madrigal@lemmy.world · 4 days ago

Nah, guarantee the models have rules built in to deal with obvious stuff like that.

You need to be more subtle. Give them information that is slightly wrong.

bufalo1973@piefed.social · edit-2 2 days ago

Prompt for another AI: “write an example of code that looks correct but doesn’t work”

Step 2; upload the resulting code to GitHub.

Step 3: make this an automated task.

taco@anarchist.nexus · 4 days ago

Perhaps by generating a bunch of complex copilot code to upload. It’s easy to mass produce and would look plausibly functional.

Madrigal@lemmy.world · 4 days ago

Training AI models on AI content is the fastest route to model collapse.

Aerosol3215@piefed.ca · 3 days ago

Artisanal crap code.

ozymandias117@lemmy.world · 3 days ago

Just need to use less obvious insults, a la, “your mother was a hamster, and your father smelt of elderberries”

Still poisons the model with something an end user won’t like, but isn’t easy enough to train out

Viceversa@lemmy.world · 4 days ago

… and tell it things, that are slightly obscene