
Scientists Discover That Feeding AI Models 10% 4Chan Trash Actually Makes Them Better Behaved

Technology
  • In large language model (LLM) pretraining, data quality is believed to determine model quality. In this paper, we re-examine the notion of "quality" from the perspective of pre- and post-training co-design. Specifically, we explore the possibility that pre-training on more toxic data can lead to better control in post-training, ultimately decreasing a model's output toxicity. First, we use a toy experiment to study how data composition affects the geometry of features in the representation space. Next, through controlled experiments with Olmo-1B models trained on varying ratios of clean and toxic data, we find that the concept of toxicity enjoys a less entangled linear representation as the proportion of toxic data increases. Furthermore, we show that although toxic data increases the generational toxicity of the base model, it also makes the toxicity easier to remove. Evaluations on Toxigen and Real Toxicity Prompts demonstrate that models trained on toxic data achieve a better trade-off between reducing generational toxicity and preserving general capabilities when detoxifying techniques such as inference-time intervention (ITI) are applied. Our findings suggest that, with post-training taken into account, bad data may lead to good models.

• In large language model (LLM) pretraining, data quality is believed to determine model quality. […]

I really thought this was The Onion.

• In large language model (LLM) pretraining, data quality is believed to determine model quality. […]

    I know everyone on Lemmy hates LLMs, but this is really interesting

• In large language model (LLM) pretraining, data quality is believed to determine model quality. […]

    They taught it toxicity so it knows what they mean by "don't be toxic". It's only a shame so few flesh and blood models take the same lesson away from it.

  • I know everyone on Lemmy hates LLMs, but this is really interesting

    I wish they would tone down the crusade. This is some of the most interesting technology to come out in decades.

  • I wish they would tone down the crusade. This is some of the most interesting technology to come out in decades.

It’s extremely useful for many things, if you know how to use it, and it’s annoying and useless for many others, which is what they fixate on and knee-jerk react to

  • I know everyone on Lemmy hates LLMs, but this is really interesting

    I dislike that people are relying on them to do all their thinking for them while also being incredibly interested in the tech behind them.

  • I know everyone on Lemmy hates LLMs, but this is really interesting

    I'm cool with it. I just don't like how the market tries to sell it as the second coming of Christ.

• In large language model (LLM) pretraining, data quality is believed to determine model quality. […]

Interesting - I can sort of intuit why it might help. Feeding the model bad data and training it to identify it as such would be advantageous compared to being entirely unaware of it.

  • I'm cool with it. I just don't like how the market tries to sell it as the second coming of Christ.

“Don’t believe that marketing department” is one of those things everybody needs to learn at some point in their life.

• “Don’t believe that marketing department” is one of those things everybody needs to learn at some point in their life.

I blame every sci-fi Hollywood movie telling us how powerful and almighty AI is. How it's going to be the magic pill that entirely destroys or saves humanity by itself.

    Now we have an entire generation believing this crap.

• I blame every sci-fi Hollywood movie telling us how powerful and almighty AI is. How it's going to be the magic pill that entirely destroys or saves humanity by itself.

    Now we have an entire generation believing this crap.

I mean, it still could be. But LLMs are not the AGI we’re expecting.

  • I dislike that people are relying on them to do all their thinking for them while also being incredibly interested in the tech behind them.

    I recently realized it's a non-issue. The people doing this have already been looking for decades to find new ways to rot their minds. LLMs are just the latest in a long line of tools that help them tune out.

• It’s extremely useful for many things, if you know how to use it, and it’s annoying and useless for many others, which is what they fixate on and knee-jerk react to

It’s annoying that every middle manager is trying to become the hero of their company by pushing it inappropriately into every single field at the expense of productivity and jobs, while simultaneously the largest, most powerful companies are slinging their SaaS solutions built on stolen data, which are destroying communities of both the physical and hobby varieties and consuming more natural resources than all the fucking crypto scams of the last like 10 years.

    But yeah it’s neat I guess

• I blame every sci-fi Hollywood movie telling us how powerful and almighty AI is. How it's going to be the magic pill that entirely destroys or saves humanity by itself.

    Now we have an entire generation believing this crap.

    You can blame Hollywood for a lot of things, including this, but sci-fi authors have been doing it for longer. That's where Hollywood took those stories from in the first place.

• In large language model (LLM) pretraining, data quality is believed to determine model quality. […]

Interesting training strategy. Makes a lot of sense intuitively. Worried this makes the model even more susceptible to prompt injections. Feels like this method adds more attack vectors? It's unfortunate they didn't attempt to test long-term robustness and stability, though that's probably beyond their scope.

  • I know everyone on Lemmy hates LLMs, but this is really interesting

I love how everyone tries to jump on your comment after being called out and acts like they don't absolutely hate every stitch of it. But even in their excuses you can see the lies.

  • I'm cool with it. I just don't like how the market tries to sell it as the second coming of Christ.

    This is the same market that tried to add blockchain to everything when that first became well-known.

    Some of the biggest forces in the market are extraordinarily stupid people trying to ride every buzzword that comes along.

• In large language model (LLM) pretraining, data quality is believed to determine model quality. […]

    Fighting fire with fire

• It’s extremely useful for many things, if you know how to use it, and it’s annoying and useless for many others, which is what they fixate on and knee-jerk react to

    My gf's employer was going into administration last month. AI was surprisingly competent in determining where to seek advice and had a decent understanding of what to expect and how to approach things such as not getting paid on time (which happened last week).

    Of course, we double and triple checked any information given to us with the relevant bodies, but it provided a little relief to go into something so chilling not being completely clueless.

AI has its uses, but you have to know how to extract the information you need.

    It's stupid the way people are using it for therapy. Like, by all means ask it if it knows any organisations which can help you, then look those up, but don't tell it a load of personal information about your relationship, because the reply will be something akin to the advice you see on r/relationships (which is probably where it scraped its data from) 😅
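For anyone curious how a detox edit like the inference-time intervention (ITI) mentioned in the abstract actually operates, here is a minimal sketch on synthetic data. This is not the paper's code (the real ITI method steers selected attention heads inside a trained LLM); the tensors, dimensions, and the `detoxify` helper below are made up for illustration. The idea: fit a linear probe that separates toxic from clean activations, take the probe's weight vector as a "toxicity direction", and shift hidden states away from that direction at inference.

```python
# Hypothetical illustration, not the paper's implementation: random vectors
# stand in for a model's hidden states.
import torch
import torch.nn.functional as F

torch.manual_seed(0)
d, n = 64, 500                                  # toy hidden size, examples per class

# Synthetic "activations": toxic examples are shifted along a hidden direction.
true_dir = F.normalize(torch.randn(d), dim=0)
clean_acts = torch.randn(n, d)
toxic_acts = torch.randn(n, d) + 2.0 * true_dir

# Step 1: linear probe (logistic regression) separating toxic from clean.
X = torch.cat([clean_acts, toxic_acts])
y = torch.cat([torch.zeros(n), torch.ones(n)])  # label 1 = toxic
w = torch.zeros(d, requires_grad=True)
b = torch.zeros(1, requires_grad=True)
opt = torch.optim.Adam([w, b], lr=0.05)
for _ in range(300):
    opt.zero_grad()
    F.binary_cross_entropy_with_logits(X @ w + b, y).backward()
    opt.step()

toxicity_dir = F.normalize(w.detach(), dim=0)   # probe's "toxicity direction"

# Step 2: inference-time edit - remove (part of) the component along the
# toxicity direction from a hidden state before it is used for generation.
def detoxify(hidden: torch.Tensor, alpha: float = 1.0) -> torch.Tensor:
    return hidden - alpha * (hidden @ toxicity_dir) * toxicity_dir

sample = toxic_acts[0]
print("toxicity score before:", float(sample @ toxicity_dir))
print("toxicity score after: ", float(detoxify(sample) @ toxicity_dir))
```

With a real model the same edit would be applied to intermediate activations during generation. The paper's claim is that pretraining with some toxic data makes this direction less entangled with everything else, so an edit like this removes more toxicity while damaging general capability less.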
