linux-nerds.org

Your browser does not seem to support JavaScript. As a result, your viewing experience will be diminished, and you have been placed in read-only mode.

Please download a browser that supports JavaScript, or enable it if it's disabled (i.e. NoScript).

Judge Rules Training AI on Authors' Books Is Legal But Pirating Them Is Not

Technology

180 Beiträge 101 Kommentatoren 0 Aufrufe

M mlg@lemmy.world

Yeah I have a bash one liner AI model that ingests your media and spits out a 99.9999999% accurate replica through the power of changing the filename.

cp

Out performs the latest and greatest AI models
S This user is from outside of this forum
S This user is from outside of this forum
sugar_in_your_tea@sh.itjust.works

schrieb zuletzt editiert von

#55

mv will save you some disk space.
M 1 Antwort Letzte Antwort

4
D dragontypewyvern@midwest.social

Trains model to change one pixel per frame with malicious intent
S This user is from outside of this forum
S This user is from outside of this forum
sugar_in_your_tea@sh.itjust.works

schrieb zuletzt editiert von

#56

From dark gray to slightly darker gray.
1 Antwort Letzte Antwort

1
M mlg@lemmy.world

Yeah I have a bash one liner AI model that ingests your media and spits out a 99.9999999% accurate replica through the power of changing the filename.

cp

Out performs the latest and greatest AI models
I This user is from outside of this forum
I This user is from outside of this forum
interdimensionalmeme@lemmy.ml

schrieb zuletzt editiert von

#57

I call this legally distinct, this is legal advice.
1 Antwort Letzte Antwort

7
P pro@programming.dev

This post did not contain any content.
D This user is from outside of this forum
D This user is from outside of this forum
drmoose@lemmy.world

schrieb zuletzt editiert von drmoose@lemmy.world

#58
Unpopular opinion but I don't see how it could have been different.
- There's no way the west would give AI lead to China which has no desire or framework to ever accept this.
- Believe it or not but transformers are actually learning by current definitions and not regurgitating a direct copy. It's transformative work - it's even in the name.
- This is actually good as it prevents market moat for super rich corporations only which could afford the expensive training datasets.
This is an absolute win for everyone involved other than copyright hoarders and mega corporations.
D L K 3 Antworten Letzte Antwort

24
S shadowfax13@lemmy.ml

calm down everyone.
its only legal for parasitic mega corps, the normal working people will be harassed to suicide same as before.

its only a crime if the victims was rich or perpetrator was not rich.
M This user is from outside of this forum
M This user is from outside of this forum
milicent_bystandr@lemm.ee

schrieb zuletzt editiert von

#59

Right. Where's the punishment for Meta who admitted to pirating books?
K 1 Antwort Letzte Antwort

1
S sugar_in_your_tea@sh.itjust.works

mv will save you some disk space.
M This user is from outside of this forum
M This user is from outside of this forum
milicent_bystandr@lemm.ee

schrieb zuletzt editiert von

#60

Unless you're moving across partitions it will change the filesystem metadata to move the path, but not actually do anything to the data. Sorry, you failed, it's jail for you.
M 1 Antwort Letzte Antwort

1
J jrockwar@feddit.uk

I think this means we can make a torrent client with a built in function that uses 0.1% of 1 CPU core to train an ML model on anything you download. You can download anything legally with it then.
G This user is from outside of this forum
G This user is from outside of this forum
gissamittjobb@lemmy.ml

schrieb zuletzt editiert von

#61

...no?

That's exactly what the ruling prohibits - it's fair use to train AI models on any copies of books that you legally acquired, but never when those books were illegally acquired, as was the case with the books that Anthropic used in their training here.

This satirical torrent client would be violating the laws just as much as one without any slow training built in.
R 1 Antwort Letzte Antwort

11
P pro@programming.dev

This post did not contain any content.
C This user is from outside of this forum
C This user is from outside of this forum
criticalmiss@lemmy.world

schrieb zuletzt editiert von

#62

This 240TB JBOD full of books? Oh heavens forbid, we didn’t pirate it. It uhh… fell of a truck, yes, fell off a truck.
1 Antwort Letzte Antwort

2
P pro@programming.dev

This post did not contain any content.
G This user is from outside of this forum
G This user is from outside of this forum
gissamittjobb@lemmy.ml

schrieb zuletzt editiert von

#63

It's extremely frustrating to read this comment thread because it's obvious that so many of you didn't actually read the article, or even half-skim the article, or even attempted to even comprehend the title of the article for more than a second.

For shame.
L B A J L 5 Antworten Letzte Antwort

27
G gissamittjobb@lemmy.ml

...no?

That's exactly what the ruling prohibits - it's fair use to train AI models on any copies of books that you legally acquired, but never when those books were illegally acquired, as was the case with the books that Anthropic used in their training here.

This satirical torrent client would be violating the laws just as much as one without any slow training built in.
R This user is from outside of this forum
R This user is from outside of this forum
rvtv95xbeo@sh.itjust.works

schrieb zuletzt editiert von

#64

But if one person buys a book, trains an "AI model" to recite it, then distributes that model we good?
G 1 Antwort Letzte Antwort

1
R rvtv95xbeo@sh.itjust.works

But if one person buys a book, trains an "AI model" to recite it, then distributes that model we good?
G This user is from outside of this forum
G This user is from outside of this forum
gissamittjobb@lemmy.ml

schrieb zuletzt editiert von

#65

I don't think anyone would consider complete verbatim recitement of the material to be anything but a copyright violation, being the exact same thing that you produce.

Fair use requires the derivative work to be transformative, and no transformation occurs when you verbatim recite something.
R 1 Antwort Letzte Antwort

2
D drmoose@lemmy.world
Unpopular opinion but I don't see how it could have been different.
- There's no way the west would give AI lead to China which has no desire or framework to ever accept this.
- Believe it or not but transformers are actually learning by current definitions and not regurgitating a direct copy. It's transformative work - it's even in the name.
- This is actually good as it prevents market moat for super rich corporations only which could afford the expensive training datasets.
This is an absolute win for everyone involved other than copyright hoarders and mega corporations.
D This user is from outside of this forum
D This user is from outside of this forum
deathbird@mander.xyz

schrieb zuletzt editiert von deathbird@mander.xyz

#66
1. Idgaf about China and what they do and you shouldn't either, even if US paranoia about them is highly predictable.
2. Depending on the outputs it's not always that transformative.
3. The moat would be good actually. The business model of LLMs isn't good, but it's not even viable without massive subsidies, not least of which is taking people's shit without paying.
It's a huge loss for smaller copyright holders (like the ones that filed this lawsuit) too. They can't afford to fight when they get imitated beyond fair use. Copyright abuse can only be fixed by the very force that creates copyright in the first place: law. The market can't fix that. This just decides winners between competing mega corporations, and even worse, up ends a system that some smaller players have been able to carve a niche in.

Want to fix copyright? Put real time limits on it. Bind it to a living human only. Make it non-transferable. There's all sorts of ways to fix it, but this isn't it.

ETA: Anthropic are some bitches. "Oh no the fines would ruin us, our business would go under and we'd never maka da money :*-(" Like yeah, no shit, no one cares. Strictly speaking the fines for ripping a single CD, or making a copy of a single DVD to give to a friend, are so astronomically high as to completely financially ruin the average USAian for life. That sword of Damocles for watching Shrek 2 for your personal enjoyment but in the wrong way has been hanging there for decades, and the only thing that keeps the cord that holds it up strong is the cost of persuing "low-level offenders". If they wanted to they could crush you.

Anthropic walked right under the sword and assumed their money would protect them from small authors etc. And they were right.
D A 2 Antworten Letzte Antwort

5
G gissamittjobb@lemmy.ml

I don't think anyone would consider complete verbatim recitement of the material to be anything but a copyright violation, being the exact same thing that you produce.

Fair use requires the derivative work to be transformative, and no transformation occurs when you verbatim recite something.
R This user is from outside of this forum
R This user is from outside of this forum
rvtv95xbeo@sh.itjust.works

schrieb zuletzt editiert von

#67

"Recite the complete works of Shakespeare but replace every thirteenth thou with this"
G P 2 Antworten Letzte Antwort

1
D deathbird@mander.xyz
1. Idgaf about China and what they do and you shouldn't either, even if US paranoia about them is highly predictable.
2. Depending on the outputs it's not always that transformative.
3. The moat would be good actually. The business model of LLMs isn't good, but it's not even viable without massive subsidies, not least of which is taking people's shit without paying.
It's a huge loss for smaller copyright holders (like the ones that filed this lawsuit) too. They can't afford to fight when they get imitated beyond fair use. Copyright abuse can only be fixed by the very force that creates copyright in the first place: law. The market can't fix that. This just decides winners between competing mega corporations, and even worse, up ends a system that some smaller players have been able to carve a niche in.

Want to fix copyright? Put real time limits on it. Bind it to a living human only. Make it non-transferable. There's all sorts of ways to fix it, but this isn't it.

ETA: Anthropic are some bitches. "Oh no the fines would ruin us, our business would go under and we'd never maka da money :*-(" Like yeah, no shit, no one cares. Strictly speaking the fines for ripping a single CD, or making a copy of a single DVD to give to a friend, are so astronomically high as to completely financially ruin the average USAian for life. That sword of Damocles for watching Shrek 2 for your personal enjoyment but in the wrong way has been hanging there for decades, and the only thing that keeps the cord that holds it up strong is the cost of persuing "low-level offenders". If they wanted to they could crush you.

Anthropic walked right under the sword and assumed their money would protect them from small authors etc. And they were right.
D This user is from outside of this forum
D This user is from outside of this forum
drmoose@lemmy.world

schrieb zuletzt editiert von drmoose@lemmy.world

#68

I'll be honest with you - I genuinely sympathize with the cause but I don't see how this could ever be solved with the methods you suggested. The world is not coming together to hold hands and koombayah out of this one. Trade deals are incredibly hard and even harder to enforce so free market is clearly the only path forward here.
1 Antwort Letzte Antwort

0
R rvtv95xbeo@sh.itjust.works

"Recite the complete works of Shakespeare but replace every thirteenth thou with this"
G This user is from outside of this forum
G This user is from outside of this forum
gissamittjobb@lemmy.ml

schrieb zuletzt editiert von

#69

I'd be impressed with any model that succeeds with that, but assuming one does, the complete works of Shakespeare are not copyright protected - they have fallen into the public domain since a very long time ago.

For any works still under copyright protection, it would probably be a case of a trial to determine whether a certain work is transformative enough to be considered fair use. I'd imagine that this would not clear that bar.
1 Antwort Letzte Antwort

2
D deathbird@mander.xyz
1. Idgaf about China and what they do and you shouldn't either, even if US paranoia about them is highly predictable.
2. Depending on the outputs it's not always that transformative.
3. The moat would be good actually. The business model of LLMs isn't good, but it's not even viable without massive subsidies, not least of which is taking people's shit without paying.
It's a huge loss for smaller copyright holders (like the ones that filed this lawsuit) too. They can't afford to fight when they get imitated beyond fair use. Copyright abuse can only be fixed by the very force that creates copyright in the first place: law. The market can't fix that. This just decides winners between competing mega corporations, and even worse, up ends a system that some smaller players have been able to carve a niche in.

Want to fix copyright? Put real time limits on it. Bind it to a living human only. Make it non-transferable. There's all sorts of ways to fix it, but this isn't it.

ETA: Anthropic are some bitches. "Oh no the fines would ruin us, our business would go under and we'd never maka da money :*-(" Like yeah, no shit, no one cares. Strictly speaking the fines for ripping a single CD, or making a copy of a single DVD to give to a friend, are so astronomically high as to completely financially ruin the average USAian for life. That sword of Damocles for watching Shrek 2 for your personal enjoyment but in the wrong way has been hanging there for decades, and the only thing that keeps the cord that holds it up strong is the cost of persuing "low-level offenders". If they wanted to they could crush you.

Anthropic walked right under the sword and assumed their money would protect them from small authors etc. And they were right.
A This user is from outside of this forum
A This user is from outside of this forum
atlas_@lemmy.world

schrieb zuletzt editiert von

#70

Maybe something could be hacked together to fix copyright, but further complication there is just going to make accurate enforcement even harder. And we already have Google (in YouTube) already doing a shitty job of it and that's.... One of the largest companies on earth.

We should just kill copyright. Yes, it'll disrupt Hollywood. Yes it'll disrupt the music industry. Yes it'll make it even harder to be successful or wealthy as an author. But this is going to happen one way or the other so long as AI can be trained on copyrighted works (and maybe even if not). We might as well get started on the transition early.
1 Antwort Letzte Antwort

3
C catloaf@lemm.ee

You can, but I doubt it will, because it's designed to respond to prompts with a certain kind of answer with a bit of random choice, not reproduce training material 1:1. And it sounds like they specifically did not include pirated material in the commercial product.
K This user is from outside of this forum
K This user is from outside of this forum
kingrandomguy@lemmy.world

schrieb zuletzt editiert von

#71

Yeah, you can certainly get it to reproduce some pieces (or fragments) of work exactly but definitely not everything. Even a frontier LLM's weights are far too small to fully memorize most of their training data.
1 Antwort Letzte Antwort

0
M milicent_bystandr@lemm.ee

Unless you're moving across partitions it will change the filesystem metadata to move the path, but not actually do anything to the data. Sorry, you failed, it's jail for you.
M This user is from outside of this forum
M This user is from outside of this forum
mlg@lemmy.world

schrieb zuletzt editiert von

#72

stupid inodes preventing me from burning though my drive life
1 Antwort Letzte Antwort

2
G gissamittjobb@lemmy.ml

It's extremely frustrating to read this comment thread because it's obvious that so many of you didn't actually read the article, or even half-skim the article, or even attempted to even comprehend the title of the article for more than a second.

For shame.
L This user is from outside of this forum
L This user is from outside of this forum
lime@feddit.nu

schrieb zuletzt editiert von

#73

was gonna say, this seems like the best outcome for this particular trial. there was potential for fair use to be compromised, and for piracy to be legal if you're a large corporation. instead, they upheld that you can do what you want with things you have paid for.
1 Antwort Letzte Antwort

16
D drmoose@lemmy.world
Unpopular opinion but I don't see how it could have been different.
- There's no way the west would give AI lead to China which has no desire or framework to ever accept this.
- Believe it or not but transformers are actually learning by current definitions and not regurgitating a direct copy. It's transformative work - it's even in the name.
- This is actually good as it prevents market moat for super rich corporations only which could afford the expensive training datasets.
This is an absolute win for everyone involved other than copyright hoarders and mega corporations.
L This user is from outside of this forum
L This user is from outside of this forum
lovablesidekick@lemmy.world

schrieb zuletzt editiert von lovablesidekick@lemmy.world

#74

You're getting douchevoted because on lemmy any AI-related comment that isn't negative enough about AI is the Devil's Work.
J 1 Antwort Letzte Antwort

7

Anmelden zum Antworten

P

Matrix.org is Introducing Premium Accounts
Beobachtet Ignoriert Geplant Angeheftet Gesperrt Verschoben Technology technology
110

1

226 Stimmen

110 Beiträge

42 Aufrufe

F

It's nice that this exists, but even for this I'd prefer to use an open source tool. And it of course helps with migration only if the old HS is still online.. I think most practically this migration function would be built inside some Matrix client (one that would support more than one server to start with), but I suppose a standalone tool would be a decent solution as well.
T

The Case for Software Craftsmanship in the Era of Vibes — Zed's Blog
Beobachtet Ignoriert Geplant Angeheftet Gesperrt Verschoben Technology technology
11

1

61 Stimmen

11 Beiträge

6 Aufrufe

K

If you use LLMs like they should be, i.e. as autocomplete, they're helpful. Classic autocomplete can't see me type "import" and correctly guess that I want to import a file that I just created, but Copilot can. You shouldn't expect it to understand code, but it can type more quickly than you and plug the right things in more often than not.
P

Windows 11 remote desktop microphone stops working intermittently
Beobachtet Ignoriert Geplant Angeheftet Gesperrt Verschoben Technology technology
7

16 Stimmen

7 Beiträge

10 Aufrufe

S

When I worked in IT, we only let people install every other version of Windows. Our Linux user policy was always “mainstream distro and the LTS version.” Mac users were strongly advised to wait 3 months to upgrade. One guy used FreeBSD and I just never questioned him because he was older and never filed one help desk request. He probably thought I was an idiot. (And I was.) Anyway, I say all that to say don’t use Windows 11 on anything important. It’s the equivalent of a beta. Windows 12 (or however they brand it) will probably be stable. I don’t use Windows much anymore and maybe things have changed but the concepts in the previous paragraph could be outdated. But it’s a good rule of thumb.
F

Samsung teams up with Glance to use your face in AI-generated lock screen ads
Beobachtet Ignoriert Geplant Angeheftet Gesperrt Verschoben Technology technology
7

1

33 Stimmen

7 Beiträge

6 Aufrufe

C

AFAIK, you have the option to enable ads on your lock screen. It's not something that's forced upon you. Last time I took a look at the functionality, they "paid" you for the ads and you got to choose which charity to support with the money.
F

A.I. Companies Believe They're Making God with Karen Hao [1:14:07]
Beobachtet Ignoriert Geplant Angeheftet Gesperrt Verschoben Technology technology
8

45 Stimmen

8 Beiträge

8 Aufrufe

P

… it was
D

Chrome using Gemini Nano for ‘Enhanced Protection’ against scams
Beobachtet Ignoriert Geplant Angeheftet Gesperrt Verschoben Technology technology
8

1

1 Stimmen

8 Beiträge

8 Aufrufe

L

I think the principle could be applied to scan outside of the machine. It is making requests to 127.0.0.1:{port} - effectively using your computer as a "server" in a sort of reverse-SSRF attack. There's no reason it can't make requests to 10.10.10.1:{port} as well. Of course you'd need to guess the netmask of the network address range first, but this isn't that hard. In fact, if you consider that at least as far as the desktop site goes, most people will be browsing the web behind a standard consumer router left on defaults where it will be the first device in the DHCP range (e.g. 192.168.0.1 or 10.10.10.1), which tends to have a web UI on the LAN interface (port 8080, 80 or 443), then you'd only realistically need to scan a few addresses to determine the network address range. If you want to keep noise even lower, using just 192.168.0.1:80 and 192.168.1.1:80 I'd wager would cover 99% of consumer routers. From there you could assume that it's a /24 netmask and scan IPs to your heart's content. You could do top 10 most common ports type scans and go in-depth on anything you get a result on. I haven't tested this, but I don't see why it wouldn't work, when I was testing 13ft.io - a self-hosted 12ft.io paywall remover, an SSRF flaw like this absolutely let you perform any network request to any LAN address in range.
R

How to delete your Twitter (or X) account
Beobachtet Ignoriert Geplant Angeheftet Gesperrt Verschoben Technology technology
2

1

1 Stimmen

2 Beiträge

9 Aufrufe

R

I also need to know the way to delete twitter account of my brand : https://stylo.pk/ .
S

FCC commissioner writes op-ed titled, “It’s time for Trump to DOGE the FCC“
Beobachtet Ignoriert Geplant Angeheftet Gesperrt Verschoben Technology technology
43

1

342 Stimmen

43 Beiträge

17 Aufrufe

G

highly recommend using containerized torrents through a VPN. I have transmission and openvpn containers. when the network goes down transmission can't connect since it's networked through the ovpn container. once the vpn is restored, everything restarts and resumes where it left off. ever since I've had this setup running, I haven't had a nastygram sent to me.