AI industry horrified to face largest copyright class action ever certified
-
They don’t want copyright power to expand further. And I agree with them, despite hating AI vendors with a passion.
For an understanding of the collateral damage, check out How To Think About Scraping by Cory Doctorow.
Let’s give them this one last win. For spite.
-
This post did not contain any content.
Well, theft has never been the best foundation for a business, has it?
While I completely agree that copyright terms are wildly overblown, they are valid law that other people suffer under, so it is 100% fair to make the AI companies suffer the same. Or worse, since they all broke the law for commercial gain.
-
Oh no! Building a product with stolen data was a rotten idea after all. Well, at least the AI companies can use their fabulously genius PhD level LLMs to weasel their way out of all these lawsuits. Right?
PhD-level LLM = paying people with master's degrees $21/hr to write summaries of paragraphs for the model to improve on. Google Gemini outsourced its work like this, so I assume everyone else did too.
-
As Anthropic argued, it now "faces hundreds of billions of dollars in potential damages liability at trial in four months" based on a class certification rushed at "warp speed" that involves "up to seven million potential claimants, whose works span a century of publishing history," each possibly triggering a $150,000 fine.
So you knew what stealing the copyrighted works could result in, and your defense is that you stole too much? That's not how that works.
Actually that usually is how it works. Unfortunately.
"Too big to fail" was probably made up by the big ones.
-
The purpose of copyright is to drive works into the public domain. Works are only supposed to remain exclusive to the artist for a very limited time, not a "century of publishing history".
The copyright industry should lose this battle. Copyright exclusivity should be shorter than patent exclusivity.
Copyright companies losing the case wouldn't make copyright any shorter.
-
Copyright companies losing the case wouldn't make copyright any shorter.
Their winning of the case reinforces a harmful precedent.
At the very least, the claims of those members of the class that are based on >20-year copyrights should be summarily rejected.
-
They don’t want copyright power to expand further. And I agree with them, despite hating AI vendors with a passion.
For an understanding of the collateral damage, check out How To Think About Scraping by Cory Doctorow.
Take scraping. Companies like Clearview will tell you that scraping is legal under copyright law. They’ll tell you that training a model with scraped data is also not a copyright infringement. They’re right.
I love Cory's writing, but while he does a masterful job of defending scraping, and makes a good argument that in most cases, it's laws other than Copyright that should be the battleground, he does, kinda, trip over the main point.
That is, training models on creative works and then selling access to the derivative "creative" works those models output very much falls within the domain of copyright - on either side of a grey line we usually call "fair use" that hasn't really been tested in the courts.
Let's take two absurd extremes to make the point. Say I train an LLM directly on Marvel movies, and then sell movies (or maybe movie scripts) that are almost identical to existing Marvel movies (maybe with a few key names and features altered). I don't think anyone would argue that is not a derivative work, or that it falls under "fair use." However, if I used literature to train my LLM to be able to read, and then used that to read street signs for my self-driving car, well, yeah, that might be something you could argue is "fair use" to sell. It's not producing copy-cat literature.
I agree with Cory that scraping, per se, is absolutely fine, and even re-distributing the results in some ways that are in the public interest or fall under "fair use", but it's hard to justify the slop machines as not a copyright problem.
In the end, yeah, fuck both sides anyway. Copyright was extended too far and used for far too much, and the AI companies are absolute thieves. I have no illusions this type of court case will do anything more than shift wealth from one robber-baron to another, and it won't help artists and authors.
-
Their winning of the case reinforces a harmful precedent.
At the very least, the claims of those members of the class that are based on >20-year copyrights should be summarily rejected.
Copyright owners winning the case maintains the status quo.
The AI companies winning the case means anything leaked on the internet or even just hosted by a company can be used by anyone, including private photos and communication.
-
This post did not contain any content.
Let's go baby! The law is the law, and it applies to everybody
If the "genie doesn't go back in the bottle", make him pay for what he's stealing.
-
Take scraping. Companies like Clearview will tell you that scraping is legal under copyright law. They’ll tell you that training a model with scraped data is also not a copyright infringement. They’re right.
I love Cory's writing, but while he does a masterful job of defending scraping, and makes a good argument that in most cases, it's laws other than Copyright that should be the battleground, he does, kinda, trip over the main point.
That is, training models on creative works and then selling access to the derivative "creative" works those models output very much falls within the domain of copyright - on either side of a grey line we usually call "fair use" that hasn't really been tested in the courts.
Let's take two absurd extremes to make the point. Say I train an LLM directly on Marvel movies, and then sell movies (or maybe movie scripts) that are almost identical to existing Marvel movies (maybe with a few key names and features altered). I don't think anyone would argue that is not a derivative work, or that it falls under "fair use." However, if I used literature to train my LLM to be able to read, and then used that to read street signs for my self-driving car, well, yeah, that might be something you could argue is "fair use" to sell. It's not producing copy-cat literature.
I agree with Cory that scraping, per se, is absolutely fine, and even re-distributing the results in some ways that are in the public interest or fall under "fair use", but it's hard to justify the slop machines as not a copyright problem.
In the end, yeah, fuck both sides anyway. Copyright was extended too far and used for far too much, and the AI companies are absolute thieves. I have no illusions this type of court case will do anything more than shift wealth from one robber-baron to another, and it won't help artists and authors.
I agree, and I think your points line up with Doctorow’s other writing on the subject. It’s just hard to cover everything in one short essay.
-
Copyright owners winning the case maintains the status quo.
The AI companies winning the case means anything leaked on the internet or even just hosted by a company can be used by anyone, including private photos and communication.
The status quo is a giant fucking problem, and has been for decades.
The rest of your comment is alarmist nonsense.
-
I was reading the article and thinking "suck a dick, AI companies" but then it mentions the EFF and ALA filed against the class action. I have found those organizations to be generally reputable and on the right side of history, so now I'm wondering what the problem is.
I disagree with the EFF and ALA on this one.
These were entire bodies of writing consumed and reworked into training data without respecting the licenses attached to them.
Honestly, I wouldn't be surprised if copyright weren't the only problem here, but other intellectual property law as well. In that case, the EFF probably has an interest in that instead. Regardless, I really think it needs to be brought through the courts.
LLMs are harmful, full stop. Most other machine learning systems train on licensed data. In the case of software as a medical device, such as image-analysis AI, that data is protected by HIPAA and special attention is already paid to how it is used.
-
This post did not contain any content.
Good. Burn it down. Bankrupt them.
If it's so "critical to national security" then nationalize it.
-
With the amount of money pouring in you'd think they'd just pay for it
Now now. You know that's not how capitalism works.
-
Let's go baby! The law is the law, and it applies to everybody
If the "genie doesn't go back in the bottle", make him pay for what he's stealing.
I just remembered the movie where a genie was released from his bottle, threw the world into chaos by freeing his own kind, and if it weren't for the power of the plot, I'm afraid the people there would have become slaves or died out.
Although here it would be necessary to file a lawsuit for theft of the soul, in the literal sense of the word.
-
Let's go baby! The law is the law, and it applies to everybody
If the "genie doesn't go back in the bottle", make him pay for what he's stealing.
The law absolutely does not apply to everybody, and you are well aware of that.
-
This post did not contain any content.
Welp, I guess if you have any AI stock, now is the time to dump it
-
This post did not contain any content.
Unfortunately, this will probably lead to nothing: in our world, only the poor seem to get punished for stealing. Corporations always get away with everything, while we sit on the couch and shout "YES!!!" because they're trying to console us with this.
-
The purpose of copyright is to drive works into the public domain. Works are only supposed to remain exclusive to the artist for a very limited time, not a "century of publishing history".
The copyright industry should lose this battle. Copyright exclusivity should be shorter than patent exclusivity.
Shut up, thief. Go to jail.
-