linux-nerds.org

Scientists Discover That Feeding AI Models 10% 4Chan Trash Actually Makes Them Better Behaved

Technology

133 posts, 88 commenters, 3.1k views

pro@programming.dev

posted on 9 June 2025, 12:36, last edited by pro@programming.dev on 6 Sept. 2025, 14:42

#1
- HTML.
- PDF.
In large language model (LLM) pretraining, data quality is believed to determine model quality. In this paper, we re-examine the notion of "quality" from the perspective of pre- and post-training co-design. Specifically, we explore the possibility that pre-training on more toxic data can lead to better control in post-training, ultimately decreasing a model's output toxicity. First, we use a toy experiment to study how data composition affects the geometry of features in the representation space. Next, through controlled experiments with Olmo-1B models trained on varying ratios of clean and toxic data, we find that the concept of toxicity enjoys a less entangled linear representation as the proportion of toxic data increases. Furthermore, we show that although toxic data increases the generational toxicity of the base model, it also makes the toxicity easier to remove. Evaluations on Toxigen and Real Toxicity Prompts demonstrate that models trained on toxic data achieve a better trade-off between reducing generational toxicity and preserving general capabilities when detoxifying techniques such as inference-time intervention (ITI) are applied. Our findings suggest that, with post-training taken into account, bad data may lead to good models.
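To make the abstract's idea of a linear "toxicity" representation and an inference-time shift a bit more concrete, here is a toy sketch in plain Python. It is not the paper's code and not the actual ITI procedure: the activations are made-up 2-D vectors, the direction is a simple difference of class means, and the names `toxicity_direction`, `steer_away`, and `alpha` are hypothetical, chosen only for illustration.

```python
import math


def mean_vector(rows):
    """Component-wise mean of a list of equal-length vectors."""
    n = len(rows)
    return [sum(col) / n for col in zip(*rows)]


def unit(v):
    """Scale a vector to unit length."""
    norm = math.sqrt(sum(x * x for x in v))
    return [x / norm for x in v]


def dot(a, b):
    """Dot product of two equal-length vectors."""
    return sum(x * y for x, y in zip(a, b))


def toxicity_direction(toxic_acts, clean_acts):
    """Assumed stand-in for a learned probe: the unit vector pointing
    from the mean clean activation to the mean toxic activation."""
    mu_toxic = mean_vector(toxic_acts)
    mu_clean = mean_vector(clean_acts)
    return unit([t - c for t, c in zip(mu_toxic, mu_clean)])


def steer_away(activation, direction, alpha):
    """Shift one activation by alpha along the negative toxicity direction."""
    return [a - alpha * d for a, d in zip(activation, direction)]


# Made-up activations: toxicity varies mostly along the first axis.
toxic_acts = [[2.0, 0.1], [1.8, -0.1]]
clean_acts = [[0.0, 0.1], [0.2, -0.1]]

direction = toxicity_direction(toxic_acts, clean_acts)
h = [1.5, 0.3]  # hypothetical hidden state at inference time
steered = steer_away(h, direction, alpha=1.0)

print("projection before:", dot(h, direction))
print("projection after: ", dot(steered, direction))
```

The intuition in the abstract is that when the toxicity concept is less entangled with other features, a shift like this reduces the toxic component while disturbing the remaining coordinates (here, the second axis) as little as possible.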
27 replies, last reply 9 June 2025, 12:51

502
dadifer@lemmy.world

posted on 9 June 2025, 12:51, in reply to #1

#2

I really thought this was The Onion.

21
reverendender@sh.itjust.works

posted on 9 June 2025, 13:13, in reply to #1

#3

I know everyone on Lemmy hates LLMs, but this is really interesting.
10 replies, last reply 9 June 2025, 13:19

179
laintrain@lemmy.dbzer0.com

posted on 9 June 2025, 13:16, in reply to #1

#4

They taught it toxicity so it knows what they mean by "don't be toxic". It's only a shame so few flesh and blood models take the same lesson away from it.
3 replies, last reply 9 June 2025, 15:12

70
sculptuspoe@lemmy.world

posted on 9 June 2025, 13:19, in reply to #3

#5

I wish they would tone down the crusade. This is some of the most interesting technology to come out in decades.
2 replies, last reply 9 June 2025, 13:21

51
reverendender@sh.itjust.works

posted on 9 June 2025, 13:21, in reply to #5

#6

It’s extremely useful for many things if you know how to use it, and it’s annoying and useless for many others, which is what they fixate on and have knee-jerk reactions to.
2 replies, last reply 9 June 2025, 13:51

40
sabin10@lemmy.world

posted on 9 June 2025, 13:26, in reply to #3

#7

I dislike that people are relying on them to do all their thinking for them, while at the same time I'm incredibly interested in the tech behind them.
1 reply, last reply 9 June 2025, 13:49

138
bimbimboy@lemm.ee

posted on 9 June 2025, 13:33, in reply to #3

#8

I'm cool with it. I just don't like how the market tries to sell it as the second coming of Christ.
2 replies, last reply 9 June 2025, 13:41

28
iceblade02@lemmy.world

posted on 9 June 2025, 13:34, in reply to #1

#9

Interesting - I can sort of intuit why it might help. Feeding the model bad data and training it to identify it as such would be advantageous compared to leaving it entirely unaware of it.
2 replies, last reply 9 June 2025, 14:51

25
pennomi@lemmy.world

posted on 9 June 2025, 13:41, in reply to #8

#10

“Don’t believe the marketing department” is one of those things everybody needs to learn at some point in their life.
1 reply, last reply 9 June 2025, 13:43

15
bimbimboy@lemm.ee

posted on 9 June 2025, 13:43, in reply to #10

#11

I blame every sci-fi Hollywood movie telling us how powerful and almighty A.I. is, and how it's going to be the magic pill that entirely destroys or saves humanity by itself.

Now we have an entire generation believing this crap.
2 replies, last reply 9 June 2025, 13:45

5
pennomi@lemmy.world

posted on 9 June 2025, 13:45, in reply to #11

#12

I mean, it still could be. But LLMs are not the AGI we’re expecting.
1 reply, last reply 9 June 2025, 16:07

8
l0rdmathias@sh.itjust.works

posted on 9 June 2025, 13:49, in reply to #7

#13

I recently realized it's a non-issue. The people doing this have already been looking for decades to find new ways to rot their minds. LLMs are just the latest in a long line of tools that help them tune out.
3 replies, last reply 9 June 2025, 14:53

57
4am@lemm.ee

posted on 9 June 2025, 13:51, in reply to #6

#14

It’s annoying that every middle manager is trying to become the hero of their company by pushing it inappropriately into every single field at the expense of productivity and jobs. Meanwhile, the largest, most powerful companies are slinging SaaS solutions built on stolen data, which are destroying communities of both the physical and hobby varieties and consuming more natural resources than all the fucking crypto scams of the last 10 years or so.

But yeah, it’s neat, I guess.
1 reply, last reply 9 June 2025, 17:07

28
shinkantrain@lemmy.ml

posted on 9 June 2025, 13:51, in reply to #11, last edited by shinkantrain@lemmy.ml on 6 Sept. 2025, 15:53

#15

You can blame Hollywood for a lot of things, including this, but sci-fi authors have been doing it for longer. That's where Hollywood took those stories from in the first place.

4
l0rdmathias@sh.itjust.works

posted on 9 June 2025, 14:04, in reply to #1

#16

Interesting training strategy. Makes a lot of sense intuitively. I'm worried this makes the model even more susceptible to prompt injections. Feels like this method adds more attack vectors? It's unfortunate they didn't attempt to test the long-term hardness and stability, though that's probably beyond their scope.
1 reply, last reply 9 June 2025, 14:52

6
zexks@lemmy.world

posted on 9 June 2025, 14:06, in reply to #3

#17

I love how everyone tries to jump on your comment after being called out and act like they don't absolutely hate every stitch of it. But even in their excuses you can see the lies.

5
logicbomb@lemmy.world

posted on 9 June 2025, 14:13, in reply to #8, last edited by logicbomb@lemmy.world on 6 Sept. 2025, 16:32

#18

This is the same market that tried to add blockchain to everything when that first became well-known.

Some of the biggest forces in the market are extraordinarily stupid people trying to ride every buzzword that comes along.
1 reply, last reply 9 June 2025, 14:59

10
qaz@lemmy.world

posted on 9 June 2025, 14:19, in reply to #1

#19

Fighting fire with fire

2
indibrony@lemmy.world

posted on 9 June 2025, 14:22, in reply to #6

#20

My gf's employer was going into administration last month. AI was surprisingly competent in determining where to seek advice and had a decent understanding of what to expect and how to approach things such as not getting paid on time (which happened last week).

Of course, we double and triple checked any information given to us with the relevant bodies, but it provided a little relief to go into something so chilling not being completely clueless.

AI has its uses, but you have to know how to extract the information you need.

It's stupid the way people are using it for therapy. Like, by all means ask it if it knows any organisations which can help you, then look those up, but don't tell it a load of personal information about your relationship, because the reply will be something akin to the advice you see on r/relationships (which is probably where it scraped its data from).
1 reply, last reply 9 June 2025, 16:26

7
