linux-nerds.org


Judge backs AI firm over use of copyrighted books

Technology

59 posts 34 commenters 674 views

B bob_omb_battlefield@sh.itjust.works

If you aren't allowed to freely use data for training without a license, then the fear is that only large companies will own enough works or be able to afford licenses to train models.
nomad_scry@lemmy.sdf.org

wrote on, last edited by

#11

If they can just steal a creator's work, how do they suppose creators will be able to afford continuing to be creators?

Right. They think we have enough original works that the machines can just make any new creations.
B M G 3 replies Last reply

8
bob_omb_battlefield@sh.itjust.works

wrote on, last edited by

#12

Yeah, I guess the debate is which is the lesser evil. I didn't make the original comment but I think this is what they were getting at.
N G 2 replies Last reply

5
nomad_scry@lemmy.sdf.org

wrote on, last edited by

#13

Absolutely. The current copyright system is terrible but an AI replacement of creators is worse.
1 reply Last reply

4
N nomad_scry@lemmy.sdf.org

If they can just steal a creator's work, how do they suppose creators will be able to afford continuing to be creators?

Right. They think we have enough original works that the machines can just make any new creations.
mudman@fedia.io

wrote on, last edited by

#14

It is entirely possible that the entire construct of copyright just isn't fit to regulate this and the "right to train" or to avoid training needs to be formulated separately.

The maximalist, knee-jerk assumption that all AI training is copying is feeding into the interests of, ironically, a bunch of AI companies. That doesn't mean that actual authors and artists don't have an interest in regulating this space.

The big takeaway, in my book, is copyright is finally broken beyond all usability. Let's scrap it and start over with the media landscape we actually have, not the eighteenth century version of it.
H 1 reply Last reply

10
D davriellelouna@lemmy.world

This post did not contain any content.

US Judge sides with AI firm Anthropic over copyright issue

A US court has ruled Anthropic was not breaching copyright rules when it trained its AI model on books.

(www.bbc.com)
aboubenadhem@lemmy.world

wrote on, last edited by

#15

IMO the focus should have always been on the potential for AI to produce copyright-violating output, not on the method of training.
S A 2 replies Last reply

14
sculptuspoe@lemmy.world

wrote on, last edited by sculptuspoe@lemmy.world

#16

If you try to sell "the new adventures of Doctor Strange, Jonathan Strange and Magic Man," existing copyright laws are sufficient and will stop it. Really, training should be regulated by the same laws as reading. If they can get the material through legitimate means it should be fine, but pulling data that is not freely accessible should be theft, as it is already.
D I K 3 replies Last reply

12
devfuuu@lemmy.world

wrote on, last edited by

#17

That "freely" there really does a lot of hard work.
S 1 reply Last reply

7
G gedaliyah@lemmy.world

I'm not pirating. I'm building my model.
quadraturesurfer@lemmy.world

wrote on, last edited by

#18

To anyone reading this comment without reading through the article: this ruling doesn't mean that it's okay to pirate for building a model. Anthropic will still need to go through trial for that:

But he rejected Anthropic's request to dismiss the case, ruling the firm would have to stand trial over its use of pirated copies to build its library of material.
A 1 reply Last reply

56
S sentient_loom@sh.itjust.works

How exactly does this benefit "us" ?
gaylord_fartmaster@lemmy.world

wrote on, last edited by

#19

Because books are used to train both commercial and open source language models?
S 1 reply Last reply

9
T the_q@lemmy.zip

An 80 year old judge on their best day couldn't be trusted to make an informed decision. This guy was either bought or confused into his decision. Old people gotta go.
facedeer@fedia.io

wrote on, last edited by

#20

Did you read the actual order? The detailed conclusions begin on page 9. What specific bits did he get wrong?
V 1 reply Last reply

14
M mudman@fedia.io

It is entirely possible that the entire construct of copyright just isn't fit to regulate this and the "right to train" or to avoid training needs to be formulated separately.

The maximalist, knee-jerk assumption that all AI training is copying is feeding into the interests of, ironically, a bunch of AI companies. That doesn't mean that actual authors and artists don't have an interest in regulating this space.

The big takeaway, in my book, is copyright is finally broken beyond all usability. Let's scrap it and start over with the media landscape we actually have, not the eighteenth century version of it.
hendrik@palaver.p3x.de

wrote on, last edited by hendrik@palaver.p3x.de

#21

I'm fairly certain this is the correct answer here. Also, there is a separation between the judicial and legislative branches. It's the former that is involved, but we really need to bother the latter. It's the only way, unless we want to use 18th century tools on the current situation.
1 reply Last reply

4
B bob_omb_battlefield@sh.itjust.works

If you aren't allowed to freely use data for training without a license, then the fear is that only large companies will own enough works or be able to afford licenses to train models.
hendrik@palaver.p3x.de

wrote on, last edited by hendrik@palaver.p3x.de

#22

Yes. But then do something about it. Regulate the market. Or pass laws which address this. I don't really see why we should do something like this then; it still kind of contributes to the problem, as free rein still advantages big companies.

(And we can write in law whatever we like. It doesn't need to be a stupid and simplistic solution. If you're concerned with big companies, just write they have to pay a lot and small companies don't. Or force everyone to open their models. Those are all options which can be formulated as a new rule. And those would address the issue at hand.)
1 reply Last reply

2
S sonofantenora@lemmy.world

Cool then, try to do some torrenting out there and don't hide that. Tell us how it goes.

The rules don't change. This just means AI overlords can do it, not that you can do it too
ofcoursenot@fedia.io

wrote on, last edited by

#23

I've been pirating since Napster, never have hidden shit. It's usually not a crime, except in America it seems, to download content, or even share it freely. What is a crime is to make a business distributing pirated content.
S 1 reply Last reply

3
sonofantenora@lemmy.world

wrote on, last edited by

#24

I know, but you see what they're doing with AI: a small server used for piracy and sharing is punished, in some cases, worse than a theft. AI businesses are making bank (or are they? There is still no clear path to profitability) on troves of pirated content. This (for small guys like us) is not going to change the situation. For instance, if we used the same dataset to train some AI in a garage, with no business or investor behind us, things would be different. We're at a stage where AI is quite literally too important to fail for somebody out there. I'd argue that AI is, in fact, going to be shielded for this reason regardless of previous legal outcomes.
H 1 reply Last reply

1
hendrik@palaver.p3x.de

wrote on, last edited by

#25

Agreed. And even if it were, it's always like this. Anthropic is a big company. They likely have millions available for good lawyers. While the small guy hasn't. So they're more able to just do stuff and do away with some legal restrictions. Or just pay a fine and that's pocket change for them. So big companies always have more options than the small guy.
1 reply Last reply

1
F facedeer@fedia.io

Did you read the actual order? The detailed conclusions begin on page 9. What specific bits did he get wrong?
viatoromnium@piefed.social

wrote on, last edited by

#26

I'm on page 12 and I already saw a false equivalence between human learning and AI training.
F 1 reply Last reply

9
facedeer@fedia.io

wrote on, last edited by

#27

Is it this?

First, Authors argue that using works to train Claude’s underlying LLMs was like using works to train any person to read and write, so Authors should be able to exclude Anthropic from this use (Opp. 16).

That's the judge addressing an argument that the Authors made. If anyone made a "false equivalence" here it's the plaintiffs; the judge is simply saying "okay, let's assume their claim is true," as is the usual case for a preliminary judgment like this.
A 1 reply Last reply

11
O omegamouse@pawb.social

What, how is this a win? Three authors lost a lawsuit to an AI firm using their works.
grimy@lemmy.world

wrote on, last edited by

#28

The lawsuit would not have benefited their fellow authors, but rather their publishing houses and the big AI companies.
1 reply Last reply

4
B bob_omb_battlefield@sh.itjust.works

Yeah, I guess the debate is which is the lesser evil. I didn't make the original comment but I think this is what they were getting at.
grimy@lemmy.world

wrote on, last edited by grimy@lemmy.world

#29

Yes precisely.

I don't see a situation where the actual content creators get paid.

We either get open source AI, or we get closed AI where the big AI companies and copyright companies make bank.

I think people are having huge knee-jerk reactions and end up supporting companies like Disney, Universal Music and Google.
1 reply Last reply

5
H hendrik@palaver.p3x.de

Keep in mind this isn't about open-weight vs other AI models at all. This is about how training data can be collected and used.
grimy@lemmy.world

wrote on, last edited by

#30

Because of the vast amount of data needed, there will be no competitive, viable open source solution if half the data is kept in a walled garden.

This is about open weights vs closed weights.
J H 2 replies Last reply

5
