CEO of Google Says It Has No Solution for Its AI Providing Wildly Incorrect Information

Stopthatgirl7@lemmy.world · 6 months ago

CEO of Google Says It Has No Solution for Its AI Providing Wildly Incorrect Information

space@lemmy.dbzer0.com · 6 months ago

It’s quite simple. Garbage in, garbage out. Data they use for training needs to be curated. How to curate the entire internet, I have no clue.

dQw4w9WgXcQ@lemm.ee · 6 months ago

The real answer would be “don’t”. Have a decent whitelist dor training data with reliable data. Don’t just add every orifice of the internet (like reddit) to the training data. Limitations would be good in this case.

CheeseNoodle@lemmy.world · 6 months ago

Its worse than reddit, they’ve been pulling data from the onion.

olympicyes@lemmy.world · 6 months ago

Is that for real?

CheeseNoodle@lemmy.world · 6 months ago

Its been quoting some onion articles verbatim, so either they pulled from the onion directly or from somewhere that re-posts onion articles.

Agent641@lemmy.world · 6 months ago

Just train it on linux help forum replies, because everyone there is always 100% right.

space@lemmy.dbzer0.com · 6 months ago

Having a curated whitelist would definitely be a good idea, but if it only shows information from a limited list of websites, that would make it a terrible search engine incapable of searching most of the web.

woelkchen@lemmy.world · 6 months ago

They already have a curated data set. It’s called Google Scholar.