AI industry horrified to face largest copyright class action ever certified
-
But it would also mean that the Internet Archive is illegal, even tho they don't profit, but if scraping the internet is a copyright violation, then they are as guilty as Anthropic.
IA doesn't make any money off the content. Not that LLM companies do, but that's what they'd want.
-
No it won't. Just their companies. Which are the ones making slop. If your AI does something actually useful it will survive.
You know, if they lose, their tech will probably become the property of copyright holders, which means your new AI Overlord has the first name Walt.
-
This would mean the copyright holders like Disney are now the AI companies, because they have the content to train them. That's even worse, man.
It’s not because they would only train on things they own which is an absolute tiny fraction of everything that everyone owns. It’s like complaining that a rich person gets to enjoy their lavish estate when the alternative is they get to use everybody’s home in the world.
-
Hard to imagine worse than the insane techno-feudalists who currently own it.
believe me, Disney is fucking ruthless in comparison to Anthropic.
-
IA doesn't make any money off the content. Not that LLM companies do, but that's what they'd want.
Do you think that would rescue the IA from the type of people who made the IA already pull 300k books?
-
It’s not because they would only train on things they own which is an absolute tiny fraction of everything that everyone owns. It’s like complaining that a rich person gets to enjoy their lavish estate when the alternative is they get to use everybody’s home in the world.
do you know how much content disney has? go scrolling: https://en.wikipedia.org/wiki/List_of_assets_owned_by_the_Walt_Disney_Company
e: that's the tip of the iceberg, because if they band together with others from the MPAA & RIAA, they can suffocate the entire Movie, Book and Music world with it. -
IA doesn't make any money off the content. Not that LLM companies do, but that's what they'd want.
Profit (or even revenue) is not required for it to be considered an infringement, in the current legal framework.
-
Hilarious.
"ooh, so sorry, but your LLM was trained on proprietary documents stolen from several major law firms, and they are all suing you now"
-
do you know how much content disney has? go scrolling: https://en.wikipedia.org/wiki/List_of_assets_owned_by_the_Walt_Disney_Company
e: that's the tip of the iceberg, because if they band together with others from the MPAA & RIAA, they can suffocate the entire Movie, Book and Music world with it.good, then I can just ignore Disney instead of EVERYTHING else.
-
do you know how much content disney has? go scrolling: https://en.wikipedia.org/wiki/List_of_assets_owned_by_the_Walt_Disney_Company
e: that's the tip of the iceberg, because if they band together with others from the MPAA & RIAA, they can suffocate the entire Movie, Book and Music world with it.They have 0.2T in assets the world has around 660T in assets which as I said before is a tiny fraction. Obviously both hold a lot of assets that aren’t worthwhile to AI training such as theme parks but when you consider a single movie that might be worth millions or billions has the same benefit for AI training as another movie worth thousands. the amount of assets Disney owned is not nearly as relevant as you are making it out to be
-
I just remembered the movie where the genie was released from the bottle of a real genie, he turned the world into chaos by freeing his own kind, and if it weren't for the power of the plot, I'm afraid people there would have become slaves or died out.
Although here it is already necessary to file a lawsuit for theft of the soul in the literal sense of the word.
I remember that X-Files episode!
-
This post did not contain any content.
Too late. The systems we are building as a species will soon become sentient. We'll have aliens right here, no UFOs required. Where the music comes from will no longer be relevant.
-
I remember that X-Files episode!
Damn, what did you watch those masterpieces on? What kind of smoke were you sitting on then? Although I don't know what secret materials you're talking about. Maybe I watched something wrong... And what an episode?
-
They don’t want copyright power to expand further. And I agree with them, despite hating AI vendors with a passion.
For an understanding of the collateral damage, check out How To Think About Scraping by Cory Doctorow.
Ahhh, it makes more sense now. Thank you!
-
Do you think that would rescue the IA from the type of people who made the IA already pull 300k books?
No. But going after LLMs wont make the situation for IA any worse, not directly anyway.
-
Too late. The systems we are building as a species will soon become sentient. We'll have aliens right here, no UFOs required. Where the music comes from will no longer be relevant.
Ok perfect so since AGI is right around the corner and this is all irrelevant, then I'm sure the AI companies won't mind paying up.
-
Unfortunately, this will probably lead to nothing: in our world, only the poor seem to be punished for stealing. Well, corporations always get away with everything, so we sit on the couch and shout "YES!!!" for the fact that they are trying to console us with this.
This issue is not so cut and dry. The AI companies are stealing from other companies more than ftom individual people. Publishing companies are owned by some very rich people. And they want thier cut.
This case may have started out with authors, but it is mentioned that it could turn into publishing companies vs AI companies.
-
No. But going after LLMs wont make the situation for IA any worse, not directly anyway.
if the courts decide that scraping is illegal, IA can close up shop.
-
Ok perfect so since AGI is right around the corner and this is all irrelevant, then I'm sure the AI companies won't mind paying up.
That's not the way it works. Do you think the Roman Empire just picked a particular Tuesday to collapse? It's a process and will take a while.
-
This post did not contain any content.
People cheering for this have no idea of the consequence of their copyright-maximalist position.
If using images, text, etc to train a model is copyright infringement then there will NO open models because open source model creators could not possibly obtain all of the licensing for every piece of written or visual media in the Common Crawl dataset, which is what most of these things are trained on.
As it stands now, corporations don't have a monopoly on AI specifically because copyright doesn't apply to AI training. Everyone has access to Common Crawl and the other large, public, datasets made from crawling the public Internet and so anyone can train a model on their own without worrying about obtaining billions of different licenses from every single individual who has ever written a word or drawn a picture.
If there is a ruling that training violates copyright then the only entities that could possibly afford to train LLMs or diffusion models are companies that own a large amount of copyrighted materials. Sure, one company will lose a lot of money and/or be destroyed, but the legal president would be set so that it is impossible for anyone that doesn't have billions of dollars to train AI.
People are shortsightedly seeing this as a victory for artists or some other nonsense. It's not. This is a fight where large copyright holders (Disney and other large publishing companies) want to completely own the ability to train AI because they own most of the large stores of copyrighted material.
If the copyright holders win this then the open source training material, like Common Crawl, would be completely unusable to train models in the US/the West because any person who has ever posted anything to the Internet in the last 25 years could simply sue for copyright infringement.