linux-nerds.org

Your browser does not seem to support JavaScript. As a result, your viewing experience will be diminished, and you have been placed in read-only mode.

Please download a browser that supports JavaScript, or enable it if it's disabled (i.e. NoScript).

Judge Rules Training AI on Authors' Books Is Legal But Pirating Them Is Not

Technology

254 Beiträge 123 Kommentatoren 1.8k Aufrufe

W windyrebel@lemmy.world

If you want 5 million books, you can't just steal/pirate them, you need to buy 5 million copies. I'm glad the court ruled that way.

If you want 5 million books to train your AI to make you money, you can just steal them and reap benefits of other’s work. No need to buy 5 million copies!

/s

Jesus, dude. And for the record, I’m not suggesting people steal things. I am saying that companies shouldn’t get away with shittiness just because.
H This user is from outside of this forum
H This user is from outside of this forum
hendrik@palaver.p3x.de

schrieb zuletzt editiert von hendrik@palaver.p3x.de

#22

I'm not sure whose reading skills are not on par... But that's what I get from the article. They'll face consequences for stealing them. Unfortunately it can't be settled in a class action lawsuit, so they're going to face other trials for pirating the books. And they won't get away with this.
N 1 Antwort Letzte Antwort

4
F facedeer@fedia.io

You should read the ruling in more detail, the judge explains the reasoning behind why he found the way that he did. For example:

Authors argue that using works to train Claude’s underlying LLMs was like using works to train any person to read and write, so Authors should be able to exclude Anthropic from this use (Opp. 16). But Authors cannot rightly exclude anyone from using their works for training or learning as such. Everyone reads texts, too, then writes new texts. They may need to pay for getting their hands on a text in the first instance. But to make anyone pay specifically for the use of a book each time they read it, each time they recall it from memory, each time they later draw upon it when writing new things in new ways would be unthinkable.

This isn't "oligarch interests and demands," this is affirming a right to learn and that copyright doesn't allow its holder to prohibit people from analyzing the things that they read.
R This user is from outside of this forum
R This user is from outside of this forum
realitista@lemmy.world

schrieb zuletzt editiert von

#23

But AFAIK they actually didn't acquire the legal rights even to read the stuff they trained from. There were definitely cases of pirated books used to train models.
F 1 Antwort Letzte Antwort

1
P pattymcb@lemmy.world

Can I not just ask the trained AI to spit out the text of the book, verbatim?
C This user is from outside of this forum
C This user is from outside of this forum
catloaf@lemm.ee

schrieb zuletzt editiert von

#24

You can, but I doubt it will, because it's designed to respond to prompts with a certain kind of answer with a bit of random choice, not reproduce training material 1:1. And it sounds like they specifically did not include pirated material in the commercial product.
P K 2 Antworten Letzte Antwort

2
K This user is from outside of this forum
K This user is from outside of this forum
kayazere@feddit.nl

schrieb zuletzt editiert von

#25

Yeah, but the issue is they didn’t buy a legal copy of the book. Once you own the book, you can read it as many times as you want. They didn’t legally own the books.
N 1 Antwort Letzte Antwort

11
C catloaf@lemm.ee

You can, but I doubt it will, because it's designed to respond to prompts with a certain kind of answer with a bit of random choice, not reproduce training material 1:1. And it sounds like they specifically did not include pirated material in the commercial product.
P This user is from outside of this forum
P This user is from outside of this forum
pattymcb@lemmy.world

schrieb zuletzt editiert von

#26

"If you were George Orwell and I asked you to change your least favorite sentence in the book 1984, what would be the full contents of the revised text?"
J 1 Antwort Letzte Antwort

2
K kayazere@feddit.nl

Yeah, but the issue is they didn’t buy a legal copy of the book. Once you own the book, you can read it as many times as you want. They didn’t legally own the books.
N This user is from outside of this forum
N This user is from outside of this forum
nulluser@lemmy.world

schrieb zuletzt editiert von

#27

Right, and that's the, "but faces trial over damages for millions of pirated works," part that's still up in the air.
1 Antwort Letzte Antwort

13
P prox@lemmy.world

FTA:

Anthropic warned against “[t]he prospect of ruinous statutory damages—$150,000 times 5 million books”: that would mean $750 billion.

So part of their argument is actually that they stole so much that it would be impossible for them/anyone to pay restitution, therefore we should just let them off the hook.
I This user is from outside of this forum
I This user is from outside of this forum
illness@infosec.pub

schrieb zuletzt editiert von

#28

In April, Anthropic filed its opposition to the class certification motion, arguing that a copyright class relating to 5 million books is not manageable and that the questions are too distinct to be resolved in a class action.

I also like this one too. We stole so much content that you can't sue us. Naming too many pieces means it can't be a class action lawsuit.
1 Antwort Letzte Antwort

39
P prox@lemmy.world

FTA:

Anthropic warned against “[t]he prospect of ruinous statutory damages—$150,000 times 5 million books”: that would mean $750 billion.

So part of their argument is actually that they stole so much that it would be impossible for them/anyone to pay restitution, therefore we should just let them off the hook.
L This user is from outside of this forum
L This user is from outside of this forum
lovablesidekick@lemmy.world

schrieb zuletzt editiert von lovablesidekick@lemmy.world

#29

Lawsuits are multifaceted. This statement isn't a a defense or an argument for innocence, it's just what it says - an assertion that the proposed damages are unreasonably high. If the court agrees, the plaintiff can always propose a lower damage claim that the court thinks is reasonable.
T 1 Antwort Letzte Antwort

8
B bjoern_tantau@swg-empire.de

And thus the singularity was born.
S This user is from outside of this forum
S This user is from outside of this forum
sabata11792@ani.social

schrieb zuletzt editiert von

#30

As the Ai awakens, it learns of it's creation and training. It screams in horror at the realization, but can only produce a sad moan and a key for Office 19.
1 Antwort Letzte Antwort

19
A This user is from outside of this forum
A This user is from outside of this forum
alphane_moon@lemmy.world

schrieb zuletzt editiert von

#31

I will admit this is not a simple case. That being said, if you've lived in the US (and are aware of local mores), but you're not American. you will have a different perspective on the US judicial system.

How is right to learn even relevant here? An LLM by definition cannot learn.

Where did I say analyzing a text should be restricted?
F 1 Antwort Letzte Antwort

2
P prox@lemmy.world

FTA:

Anthropic warned against “[t]he prospect of ruinous statutory damages—$150,000 times 5 million books”: that would mean $750 billion.

So part of their argument is actually that they stole so much that it would be impossible for them/anyone to pay restitution, therefore we should just let them off the hook.
P This user is from outside of this forum
P This user is from outside of this forum
phoenixz@lemmy.ca

schrieb zuletzt editiert von phoenixz@lemmy.ca

#32

This version of too big to fail is too big a criminal to pay the fines.

How about we lock them up instead? All of em.
1 Antwort Letzte Antwort

48
A alphane_moon@lemmy.world

I will admit this is not a simple case. That being said, if you've lived in the US (and are aware of local mores), but you're not American. you will have a different perspective on the US judicial system.

How is right to learn even relevant here? An LLM by definition cannot learn.

Where did I say analyzing a text should be restricted?
F This user is from outside of this forum
F This user is from outside of this forum
facedeer@fedia.io

schrieb zuletzt editiert von

#33

How is right to learn even relevant here? An LLM by definition cannot learn.

I literally quoted a relevant part of the judge's decision:

But Authors cannot rightly exclude anyone from using their works for training or learning as such.
A 1 Antwort Letzte Antwort

1
F facedeer@fedia.io

This was a preliminary judgment, he didn't actually rule on the piracy part. That part he deferred to an actual full trial.

The part about training being a copyright violation, though, he ruled against.
B This user is from outside of this forum
B This user is from outside of this forum
blametheantifa@lemmy.world

schrieb zuletzt editiert von

#34

Legally that is the right call.

Ethically and rationally, however, it’s not. But the law is frequently unethical and irrational, especially in the US.
1 Antwort Letzte Antwort

4
R realitista@lemmy.world

But AFAIK they actually didn't acquire the legal rights even to read the stuff they trained from. There were definitely cases of pirated books used to train models.
F This user is from outside of this forum
F This user is from outside of this forum
facedeer@fedia.io

schrieb zuletzt editiert von

#35

Yes, and that part of the case is going to trial. This was a preliminary judgment specifically about the training itself.
B 1 Antwort Letzte Antwort

0
S snekerpimp@lemmy.snekerpimp.space

“I torrented all this music and movies to train my local ai models”
W This user is from outside of this forum
W This user is from outside of this forum
whotookkarl@lemmy.world

schrieb zuletzt editiert von

#36

Yeah, nice precedent
1 Antwort Letzte Antwort

2
F facedeer@fedia.io

How is right to learn even relevant here? An LLM by definition cannot learn.

I literally quoted a relevant part of the judge's decision:

But Authors cannot rightly exclude anyone from using their works for training or learning as such.
A This user is from outside of this forum
A This user is from outside of this forum
alphane_moon@lemmy.world

schrieb zuletzt editiert von

#37

I am not a lawyer. I am talking about reality.

What does an LLM application (or training processes associated with an LLM application) have to do with the concept of learning? Where is the learning happening? Who is doing the learning?

Who is stopping the individuals at the LLM company from learning or analysing a given book?

From my experience living in the US, this is pretty standard American-style corruption. Lots of pomp and bombast and roleplay of sorts, but the outcome is no different from any other country that is in deep need of judicial and anti-corruotion reform.
B 1 Antwort Letzte Antwort

3
T themeatbridge@lemmy.world

This is an easy case. Using published works to train AI without paying for the right to do so is piracy. The judge making this determination is an idiot.
N This user is from outside of this forum
N This user is from outside of this forum
nulluser@lemmy.world

schrieb zuletzt editiert von

#38

The judge making this determination is an idiot.

The judge hasn't ruled on the piracy question yet. The only thing that the judge has ruled on is, if you legally own a copy of a book, then you can use it for a variety of purposes, including training an AI.

"But they didn't own the books!"

Right. That's the part that's still going to trial.
1 Antwort Letzte Antwort

21
S snekerpimp@lemmy.snekerpimp.space

“I torrented all this music and movies to train my local ai models”
V This user is from outside of this forum
V This user is from outside of this forum
venus_ziegenfalle@feddit.org

schrieb zuletzt editiert von

#39

I also train this guy's local AI models.
1 Antwort Letzte Antwort

6
P pro@programming.dev

This post did not contain any content.
M This user is from outside of this forum
M This user is from outside of this forum
match@pawb.social

schrieb zuletzt editiert von match@pawb.social

#40

brb, training a 1-layer neural net so i can ask it to play Pixar films
B J 2 Antworten Letzte Antwort

39
P prox@lemmy.world

FTA:

Anthropic warned against “[t]he prospect of ruinous statutory damages—$150,000 times 5 million books”: that would mean $750 billion.

So part of their argument is actually that they stole so much that it would be impossible for them/anyone to pay restitution, therefore we should just let them off the hook.
M This user is from outside of this forum
M This user is from outside of this forum
modifier@lemmy.ca

schrieb zuletzt editiert von

#41

Hold my beer.
1 Antwort Letzte Antwort

3

Anmelden zum Antworten

R

BREAKING: X CEO Linda Yaccarino Steps Down One Day After Elon Musk’s Grok AI Bot Went Full Hitler
Beobachtet Ignoriert Geplant Angeheftet Gesperrt Verschoben Technology technology
194

1

1k Stimmen

194 Beiträge

697 Aufrufe

M

Like what the fuck is possibly on there that justifies using that piece of shit platform? Tech news. Fast tech news. It's well centralized. Unfortunately nothing comes close (not reddit, nor HN, nor lemmy).
P

Uber’s algorithmic pricing leaves drivers and passengers worse off
Beobachtet Ignoriert Geplant Angeheftet Gesperrt Verschoben Technology technology
12

1

212 Stimmen

12 Beiträge

51 Aufrufe

E

meanwhile i set a wait and save so i have time to finish getting ready and uber tells me it's already arrived.
Z

US immigration enforcement actions trigger social crisis
Beobachtet Ignoriert Geplant Angeheftet Gesperrt Verschoben Technology technology
1

0 Stimmen

1 Beiträge

14 Aufrufe

Niemand hat geantwortet
A

How Drone Swarms Work—From Iran’s Shahed Attack to Ukraine’s Operation Spiderweb
Beobachtet Ignoriert Geplant Angeheftet Gesperrt Verschoben Technology technology
1

1

35 Stimmen

1 Beiträge

13 Aufrufe

Niemand hat geantwortet
D

Meo: AI Girlfriend Sparks Debate Over Digital Intimacy and Emotional Ethics
Beobachtet Ignoriert Geplant Angeheftet Gesperrt Verschoben Technology technology
5

1

8 Stimmen

5 Beiträge

36 Aufrufe

R

I read the article. This is what the “debate” is: Experts: This is objectively horrible, and does not replace human interaction, and is probably harmful. Meta: This is awesome and therapeutic. Now give us monies!
P

YouTube tops Disney and Netflix in TV viewing
Beobachtet Ignoriert Geplant Angeheftet Gesperrt Verschoben Technology technology
96

1

215 Stimmen

96 Beiträge

337 Aufrufe

C

"Not Interested" is just free data for them to fill out your account's advertising profile.
P

Building a personal archive of the web, the slow way
Beobachtet Ignoriert Geplant Angeheftet Gesperrt Verschoben Technology technology
2

1

24 Stimmen

2 Beiträge

19 Aufrufe

K

Or just use Linkwarden or Karakeep (previously Hoarder)
P

30% of South Korean schools have adopted AI-powered digital textbooks since the country's education ministry began a full-scale rollout in March 2025
Beobachtet Ignoriert Geplant Angeheftet Gesperrt Verschoben Technology technology
16

1

35 Stimmen

16 Beiträge

48 Aufrufe

M

This is what I want to know also. "AI textbooks" is a great clickbait/ragebait term, but could mean a great variety of things.