Lawsuit Accuses Anna's Archive of Hacking WorldCat, Stealing 2.2 TB Data

ancuuiqter@lemmy.world · edit-2 9 months ago

Lawsuit Accuses Anna's Archive of Hacking WorldCat, Stealing 2.2 TB Data

xiao@sh.itjust.works · 9 months ago

Wish AA gonna be fine, they made me save literally hundred of US dollars…

ancuuiqter@lemmy.world · edit-2 9 months ago

The official Anna’s Archive Reddit account, AnnaArchivist, has responded to an r/Annas_Archive post linking the same Torrent Freak article:

Thanks! We’re not making any public statements about this lawsuit but rest assured we’re fine.

EinatYahav@lemmy.today · 9 months ago

tear down every paywall

body_by_make@lemmy.dbzer0.com · edit-2 9 months ago

Yes, let only the rich control your thoughts.

I’m not surprised this will get downvoted here, I’m as much of a pirate as anyone, but news needs to be paid or only people who can afford to control the news without income will control the news.

ShepherdPie@midwest.social · edit-2 9 months ago

Npt saying you’re right or wrong but paid news has been the model for quite a while now and that has resulted in 24 hour talking heads on TV, paid stories, clickbait, and people resorting to word of mouth on places like Facebook for all their news. It’s not as if the current trajectory is any better than your hypothetical one.

MotoAsh@lemmy.world · edit-2 9 months ago

I mean… it’ll all come down to how they accessed the data. If they had a public portal and no EULA, they can push rocks. If the data wasn’t public or the ‘theives’ had to use non-standard channels, or otherwise violated an EULA, they’re likely screwed. Especially if they had to go through abnormal channels.

I know their data can be accessed publicly, but I’m pretty sure it’s under license. You cannot just use any old thing found in public… That’s the biggest reasons the AI models are technically theft: they weren’t licensed to commercially profit off of 99.99% of the things their LLMs are trained on, but the law and politicians are WAY behind the times. Commercial data they’d normally have to pay for is suddenly magically OK when laundered through an LLM…

Dkarma@lemmy.world · 9 months ago

“AI models are technically theft: they weren’t licensed to commercially profit off of 99.99%”

This is simply a lie. There is no license like what you describe. You never need a license to view or learn from something given away completely free on the internet. You guys keep pretending there’s a law that says otherwise . There is not or you’d post it.

Copyright does not cover viewing or experiencing a piece.

MotoAsh@lemmy.world · edit-2 9 months ago

Notice how I said “commercially profit” too. Read all the words next time.

Also LLMs do not “learn” anything, you idiot. That’s the entire point. They mathematically blender things. They DO NOT learn and create.

Snot Flickerman@lemmy.blahaj.zone · 9 months ago

https://annas-blog.org/worldcat-scrape.html

Relevant blog post. AA knew the risks in this, and this is sort of expected.

Darkassassin07@lemmy.ca · edit-2 9 months ago

Gotta wonder what their plan is. The lawsuit was an obvious outcome, and they haven’t exactly made much effort to make their actions appear legal.

I don’t see AA winning this one. Data’s out there though; no taking that back. Maybe they’ve just accepted the consequences… A martyr as it were.

BarrierWithAshes@kbin.social · 9 months ago

AA’s based outta Kazakhstan though. Lotta good a lawsuit filed in Ohio’s gonna do. At most I could see American ISPs implementing a DNS-level block against the site.

Darkassassin07@lemmy.ca · 9 months ago

Oh. Lol, get fucked WorldCat.

Lawsuit Accuses Anna's Archive of Hacking WorldCat, Stealing 2.2 TB Data

Lawsuit Accuses Anna's Archive of Hacking WorldCat, Stealing 2.2 TB Data

Lawsuit Accuses Anna's Archive of Hacking WorldCat, Stealing 2.2 TB Data * TorrentFreak