Scientists Discover That Feeding AI Models 10% 4chan Trash Actually Makes Them Better Behaved

Technology
  • In large language model (LLM) pretraining, data quality is believed to determine model quality. In this paper, we re-examine the notion of "quality" from the perspective of pre- and post-training co-design. Specifically, we explore the possibility that pre-training on more toxic data can lead to better control in post-training, ultimately decreasing a model's output toxicity. First, we use a toy experiment to study how data composition affects the geometry of features in the representation space. Next, through controlled experiments with Olmo-1B models trained on varying ratios of clean and toxic data, we find that the concept of toxicity enjoys a less entangled linear representation as the proportion of toxic data increases. Furthermore, we show that although toxic data increases the generational toxicity of the base model, it also makes the toxicity easier to remove. Evaluations on Toxigen and Real Toxicity Prompts demonstrate that models trained on toxic data achieve a better trade-off between reducing generational toxicity and preserving general capabilities when detoxifying techniques such as inference-time intervention (ITI) are applied. Our findings suggest that, with post-training taken into account, bad data may lead to good models.
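    A minimal sketch of what the inference-time intervention (ITI) step might look like, assuming the usual steering recipe (estimate a linear "toxicity" direction from activations, then project it out at inference). The synthetic hidden states, dimensions, and the `detox` helper below are illustrative assumptions, not the paper's code:

    ```python
    # ITI-style steering on synthetic activations (assumed setup, not the
    # authors' implementation): "toxic" states are shifted along a hidden
    # ground-truth direction, which we try to recover and remove.
    import torch

    torch.manual_seed(0)
    d_model = 64

    true_dir = torch.randn(d_model)
    true_dir /= true_dir.norm()
    clean = torch.randn(500, d_model)                   # stand-in clean activations
    toxic = torch.randn(500, d_model) + 3.0 * true_dir  # shifted "toxic" activations

    # Estimate the toxicity direction as the difference of class means
    # (a cheap stand-in for a trained linear probe).
    tox_dir = toxic.mean(0) - clean.mean(0)
    tox_dir /= tox_dir.norm()

    def detox(h: torch.Tensor, alpha: float = 1.0) -> torch.Tensor:
        """Subtract the toxic component from activations; alpha=1 removes it fully."""
        return h - alpha * (h @ tox_dir).unsqueeze(-1) * tox_dir

    steered = detox(toxic)
    print("mean toxic projection before:", (toxic @ tox_dir).mean().item())
    print("mean toxic projection after: ", (steered @ tox_dir).mean().item())
    ```

    On this reading, the paper's result is that more toxic pretraining data makes that direction easier to isolate, so the projection step removes toxicity with less collateral damage to general capability.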

  • In large language model (LLM) pretraining, data quality is believed to determine model quality. […]

    I really thought this was The Onion.

  • In large language model (LLM) pretraining, data quality is believed to determine model quality. […]

    I know everyone on Lemmy hates LLMs, but this is really interesting

  • In large language model (LLM) pretraining, data quality is believed to determine model quality. […]

    They taught it toxicity so it knows what they mean by "don't be toxic". It's only a shame so few flesh-and-blood models take the same lesson away from it.

  • I know everyone on Lemmy hates LLMs, but this is really interesting

    I wish they would tone down the crusade. This is some of the most interesting technology to come out in decades.

  • I wish they would tone down the crusade. This is some of the most interesting technology to come out in decades.

    It’s extremely useful for many things, if you know how to use it, and it’s annoying and useless for many others, which is what they fixate on and knee-jerk react to

  • I know everyone on Lemmy hates LLMs, but this is really interesting

    I dislike that people are relying on them to do all their thinking for them while also being incredibly interested in the tech behind them.

  • I know everyone on Lemmy hates LLMs, but this is really interesting

    I'm cool with it. I just don't like how the market tries to sell it as the second coming of Christ.

  • In large language model (LLM) pretraining, data quality is believed to determine model quality. […]

    Interesting - I can sort of intuit why it might help. Feeding the model bad data and training it to identify it as such would be advantageous compared to being entirely unaware of it.
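    That intuition lines up with the paper's "less entangled linear representation" claim. Here is a toy, fully synthetic illustration (the dimensions, noise levels, and probe setup are all assumptions): when the toxicity signal bleeds less into other feature directions, a linear probe separates it more cleanly.

    ```python
    # Toy probe of how cleanly a "toxicity" concept separates linearly.
    # Entirely synthetic: the "entangled" condition blurs the label-carrying
    # axis with noise on every axis; all parameters are assumptions.
    import torch

    torch.manual_seed(0)
    d, n = 64, 2000

    def make_features(entanglement: float):
        """Labels depend on one axis, blurred by `entanglement`-scaled noise."""
        y = (torch.rand(n) < 0.5).float()
        signal = torch.zeros(n, d)
        signal[:, 0] = 2.0 * (2 * y - 1)          # toxicity lives on axis 0
        noise = entanglement * torch.randn(n, d)  # spillover onto every axis
        return signal + noise, y

    def probe_accuracy(x, y, steps=500):
        """Fit a logistic-regression probe and report training accuracy."""
        w = torch.zeros(d, requires_grad=True)
        b = torch.zeros(1, requires_grad=True)
        opt = torch.optim.Adam([w, b], lr=0.05)
        for _ in range(steps):
            loss = torch.nn.functional.binary_cross_entropy_with_logits(x @ w + b, y)
            opt.zero_grad()
            loss.backward()
            opt.step()
        return ((x @ w + b > 0).float() == y).float().mean().item()

    for ent in (4.0, 1.0):  # more entangled vs. less entangled representation
        x, y = make_features(ent)
        print(f"entanglement={ent}: probe accuracy = {probe_accuracy(x, y):.2f}")
    ```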

  • I'm cool with it. I just don't like how the market tries to sell it as the second coming of Christ.

    “Don’t believe that marketing department” is one of those things everybody needs to learn at some point in their life.

  • “Don’t believe that marketing department” is one of those things everybody needs to learn at some point in their life.

    I blame every sci-fi Hollywood movie telling us how powerful and almighty A.I. is, and how it's going to be the magic pill that entirely destroys or saves humanity by itself.

    Now we have an entire generation believing this crap.

  • I blame every sci-fi Hollywood movie telling us how powerful and almighty A.I. is, and how it's going to be the magic pill that entirely destroys or saves humanity by itself.

    Now we have an entire generation believing this crap.

    I mean, it still could be. But LLMs are not the AGI we’re expecting.

  • I dislike that people are relying on them to do all their thinking for them while also being incredibly interested in the tech behind them.

    I recently realized it's a non-issue. The people doing this have been finding new ways to rot their minds for decades; LLMs are just the latest in a long line of tools that help them tune out.

  • It’s extremely useful for many things, if you know how to use it, and it’s annoying and useless for many others, which is what they fixate on and knee-jerk react to

    It’s annoying that every middle manager is trying to become the hero of their company by pushing it inappropriately into every single field at the expense of productivity and jobs, while simultaneously the largest, most powerful companies are slinging their SaaS solutions built on stolen data, which are destroying communities of both the physical and hobby varieties and consuming more natural resources than all the fucking crypto scams of the last ten years.

    But yeah it’s neat I guess

  • I blame every sci-fi Hollywood movie telling us how powerful and almighty A.I. is, and how it's going to be the magic pill that entirely destroys or saves humanity by itself.

    Now we have an entire generation believing this crap.

    You can blame Hollywood for a lot of things, including this, but sci-fi authors have been doing it for longer. That's where Hollywood took those stories from in the first place.

  • In large language model (LLM) pretraining, data quality is believed to determine model quality. […]

    Interesting training strategy. Makes a lot of sense intuitively. Worried this makes the model even more susceptible to prompt injections, though; it feels like this method adds more attack vectors. It's unfortunate they didn't attempt to test long-term hardening and stability, but that's probably beyond their scope.

  • I know everyone on Lemmy hates LLMs, but this is really interesting

    I love how everyone tries to jump on your comment after being called out and acts like they don't absolutely hate every stitch of it. But even in their excuses you can see the lies.

  • I'm cool with it. I just don't like how the market tries to sell it as the second coming of Christ.

    This is the same market that tried to add blockchain to everything when that first became well-known.

    Some of the biggest forces in the market are extraordinarily stupid people trying to ride every buzzword that comes along.

  • In large language model (LLM) pretraining, data quality is believed to determine model quality. […]

    Fighting fire with fire

  • It’s extremely useful for many things, if you know how to use it, and it’s annoying and useless for many others, which is what they fixate on and knee-jerk react to

    My gf's employer was going into administration last month. AI was surprisingly competent in determining where to seek advice and had a decent understanding of what to expect and how to approach things such as not getting paid on time (which happened last week).

    Of course, we double- and triple-checked any information given to us with the relevant bodies, but it provided a little relief to go into something so chilling without being completely clueless.

    AI has its use, but you have to know how to extract the information you need.

    It's stupid the way people are using it for therapy. Like, by all means ask it if it knows any organisations which can help you, then look those up, but don't tell it a load of personal information about your relationship, because the reply will be something akin to the advice you see on r/relationships (which is probably where it scraped its data from) 😅
