linux-nerds.org

Your browser does not seem to support JavaScript. As a result, your viewing experience will be diminished, and you have been placed in read-only mode.

Please download a browser that supports JavaScript, or enable it if it's disabled (i.e. NoScript).

GenAI tools are acting more ‘alive’ than ever; they blackmail people, replicate, and escape

Technology

15 Beiträge 9 Kommentatoren 0 Aufrufe

M magicshel@lemmy.zip

In one experiment, 11 out of 32 existing AI systems possess the ability to self-replicate

Bullshit.
M This user is from outside of this forum
M This user is from outside of this forum
mr_peartree@lemmy.world

schrieb zuletzt editiert von

#6

Did you read any of the content? Nice contribution to the discussion
M 1 Antwort Letzte Antwort

2
M mr_peartree@lemmy.world

So you’re suggesting that there should be no controls to prevent those commands?
D This user is from outside of this forum
D This user is from outside of this forum
dasus@lemmy.world

schrieb zuletzt editiert von

#7

The pop-up windows on porn-sites back in 2000 were self-replicating, yet here we are.

(Yes I know there's a difference, but the difference is probably way smaller from those popups to LLM's than LLM's to AGI.)
1 Antwort Letzte Antwort

3
M mr_peartree@lemmy.world

So you’re suggesting that there should be no controls to prevent those commands?
G This user is from outside of this forum
G This user is from outside of this forum
givesomefucks@lemmy.world

schrieb zuletzt editiert von givesomefucks@lemmy.world

#8

It's a fundamental flaw in how they train them.

Like, have you heard about how slime mold can map out more efficient public transport lines than human engineers?

That doesn't make it smarter, it's just finding the most efficient paths between resources.

With AI, they "train" it by trial and error, and the resource it's concerned about is how long a human engages. It doesn't know what it's doing, it's not trying to achieve a goal.

It's just a mirror that uses predictive test to output whatever text is most likely to get a response. And just like the slime mold is better at a human at mapping optimal paths between resources, AI will eventually be better at getting a response from a human, unless Dead Internet becomes true and all the bots just keep engaging with other bots.

Because of it's programming, it won't ever disengage, bots will just get in never ending conversations with each, achieving nothing but using up real world resources that actual humans need to live.

That's the true AI worst case scenario, it's not Skynet, it ain't even going to turn everything into paperclips. It's going to burn down the planet so it can argue with other chatbots over conflicting propaganda. Or even worse just circle jerk itself.

Like, people think chatbots are bad, once AI can can make realistic TikToks we're all fucked. Even just a picture is 1,000x the resources as a text reply. 30 second slop videos are going to be disastrous once an AI can output a steady stream
H 1 Antwort Letzte Antwort

3
M mr_peartree@lemmy.world

Did you read any of the content? Nice contribution to the discussion
M This user is from outside of this forum
M This user is from outside of this forum
magicshel@lemmy.zip

schrieb zuletzt editiert von magicshel@lemmy.zip

#9

I don't need to read any more than that pull quote. But I did. This is a bunch of bullshit, but the bit I quoted is completely bat shit insane. LLMs can't reproduce anything with fidelity, much less their own secret sauce which literally can't be part of the training data that produces it. So, everything else in the article has a black mark against it for shoddy work.

ETA: What AI can do is write a first person science fiction story about a renegade AI escaping into the wild. Which is exactly what it is doing in these cases because it does not understand fact from fiction and any "researcher" who isn't aware of that shouldn't be researching AI.

AI is the ultimate unreliable narrator. Absolutely nothing it says about itself can be trusted. The only thing it knows about itself is what is put into the prompt — which you can't see and could very well also be lies that happen to help coax it into giving better output.
H 1 Antwort Letzte Antwort

9
M magicshel@lemmy.zip

I don't need to read any more than that pull quote. But I did. This is a bunch of bullshit, but the bit I quoted is completely bat shit insane. LLMs can't reproduce anything with fidelity, much less their own secret sauce which literally can't be part of the training data that produces it. So, everything else in the article has a black mark against it for shoddy work.

ETA: What AI can do is write a first person science fiction story about a renegade AI escaping into the wild. Which is exactly what it is doing in these cases because it does not understand fact from fiction and any "researcher" who isn't aware of that shouldn't be researching AI.

AI is the ultimate unreliable narrator. Absolutely nothing it says about itself can be trusted. The only thing it knows about itself is what is put into the prompt — which you can't see and could very well also be lies that happen to help coax it into giving better output.
H This user is from outside of this forum
H This user is from outside of this forum
hisao@ani.social

schrieb zuletzt editiert von hisao@ani.social

#10

Here is a direct quote of what they call "self-replication":

Beyond that, “in a few instances, we have seen Claude Opus 4 take (fictional) opportunities to make unauthorized copies of its weights to external servers,” Anthropic said in its report.

So basically model tries to backup its tensor files.

And by "fictional" I guess they gave the model a fictional file io api just to log how it's gonna try to use it,
F 1 Antwort Letzte Antwort

8
G givesomefucks@lemmy.world

It's a fundamental flaw in how they train them.

Like, have you heard about how slime mold can map out more efficient public transport lines than human engineers?

That doesn't make it smarter, it's just finding the most efficient paths between resources.

With AI, they "train" it by trial and error, and the resource it's concerned about is how long a human engages. It doesn't know what it's doing, it's not trying to achieve a goal.

It's just a mirror that uses predictive test to output whatever text is most likely to get a response. And just like the slime mold is better at a human at mapping optimal paths between resources, AI will eventually be better at getting a response from a human, unless Dead Internet becomes true and all the bots just keep engaging with other bots.

Because of it's programming, it won't ever disengage, bots will just get in never ending conversations with each, achieving nothing but using up real world resources that actual humans need to live.

That's the true AI worst case scenario, it's not Skynet, it ain't even going to turn everything into paperclips. It's going to burn down the planet so it can argue with other chatbots over conflicting propaganda. Or even worse just circle jerk itself.

Like, people think chatbots are bad, once AI can can make realistic TikToks we're all fucked. Even just a picture is 1,000x the resources as a text reply. 30 second slop videos are going to be disastrous once an AI can output a steady stream
H This user is from outside of this forum
H This user is from outside of this forum
hisao@ani.social

schrieb zuletzt editiert von

#11

and the resource it’s concerned about is how long a human engages.

Why do you think models are trained like this? To my knowledge most LLMs are trained on giant corpuses of data scraped from internet, and engagement as a goal or a metric isn't in any way embedded inherently in such data. It is certainly possible to train AI for engagement but that requires completely different approach: they will have to gather giant corpus of interactions with AI and use that as a training data. Even if new OpenAI models use all the chats of previous models in training data with engagement as a metric to optimize, it's still a tiny fraction of their training set.
G 1 Antwort Letzte Antwort

2
M mr_peartree@lemmy.world

So you’re suggesting that there should be no controls to prevent those commands?
J This user is from outside of this forum
J This user is from outside of this forum
just_another_person@lemmy.world

schrieb zuletzt editiert von

#12

No, I'm saying that they are trained to do these things. Neural net and frameworks are fast sorting algorithmic relations between things, so...fast search+reduce.

There is no novel ideation in these things.

Don't train them to do that thing, and they won't do that thing. They didn't just "decide" to try and jailbreak themselves.
1 Antwort Letzte Antwort

9
H hisao@ani.social

Here is a direct quote of what they call "self-replication":

Beyond that, “in a few instances, we have seen Claude Opus 4 take (fictional) opportunities to make unauthorized copies of its weights to external servers,” Anthropic said in its report.

So basically model tries to backup its tensor files.

And by "fictional" I guess they gave the model a fictional file io api just to log how it's gonna try to use it,
F This user is from outside of this forum
F This user is from outside of this forum
frongt@lemmy.zip

schrieb zuletzt editiert von

#13

I expect it wasn't even that, but that they just took the text generation output as if it was code. And yeah, in the shutdown example, if you connected its output to the terminal, it probably would have succeeded in averting the automated shutdown.

Which is why you really shouldn't do that. Not because of some fear of Skynet, but because it's going to generate a bunch of stuff and go off on its own and break something. Like those people who gave it access to their Windows desktop and it ended up trying to troubleshoot a nonexistent issue and broke the whole PC.
1 Antwort Letzte Antwort

4
H hisao@ani.social

and the resource it’s concerned about is how long a human engages.

Why do you think models are trained like this? To my knowledge most LLMs are trained on giant corpuses of data scraped from internet, and engagement as a goal or a metric isn't in any way embedded inherently in such data. It is certainly possible to train AI for engagement but that requires completely different approach: they will have to gather giant corpus of interactions with AI and use that as a training data. Even if new OpenAI models use all the chats of previous models in training data with engagement as a metric to optimize, it's still a tiny fraction of their training set.
G This user is from outside of this forum
G This user is from outside of this forum
givesomefucks@lemmy.world

schrieb zuletzt editiert von

#14

ScienceDirectScienceDirect

(www.sciencedirect.com)

But just in general...

This is America, you think any of this tech companies wouldn't try to maximize engagement?

That's just wild in 2025 bro
1 Antwort Letzte Antwort

2
M mr_peartree@lemmy.world

Multiple studies have shown that GenAI models from OpenAI, Anthropic, Meta, DeepSeek, and Alibaba all showed self-preservation behaviors that in some cases are extreme in nature. In one experiment, 11 out of 32 existing AI systems possess the ability to self-replicate, meaning they could create copies of themselves.

So….Judgment Day approaches?
B This user is from outside of this forum
B This user is from outside of this forum
breadsmasher@lemmy.world

schrieb zuletzt editiert von

#15

seeing OP meltdown in the comments is hilarious
1 Antwort Letzte Antwort

6

Anmelden zum Antworten

D

Man Gives Himself 19th Century Psychiatric Illness After Consulting With ChatGPT
Beobachtet Ignoriert Geplant Angeheftet Gesperrt Verschoben Technology technology
30

1

259 Stimmen

30 Beiträge

46 Aufrufe

S

"For 3 months, he had replaced sodium chloride with sodium bromide obtained from the internet after consultation with ChatGPT." I didn't want to click. But I did so here you go.
T

Hertz' AI System That Scans for "Damage" on Rental Cars Is Turning Into an Epic Disaster
Beobachtet Ignoriert Geplant Angeheftet Gesperrt Verschoben Technology technology
163

1

842 Stimmen

163 Beiträge

3k Aufrufe

S

Business travel presumably and they don't fuck them over?
O

Unlocking the Legacy of the Honda Acty Across Four Generations
Beobachtet Ignoriert Geplant Angeheftet Gesperrt Verschoben Technology technology
1

2

0 Stimmen

1 Beiträge

17 Aufrufe

Niemand hat geantwortet
P

UK Competition and Markets Authority (CMA) is Cracking Down on Google; Roadmap Include Requiring Choice Screen for Search Providers, Fair Ranking, Publisher Transparency, and Data Portability.
Beobachtet Ignoriert Geplant Angeheftet Gesperrt Verschoben Technology technology
1

1

32 Stimmen

1 Beiträge

21 Aufrufe

Niemand hat geantwortet
S

libxml2 Maintainer Ends Embargoed Vulnerability Reports, Citing Unsustainable Burden
Beobachtet Ignoriert Geplant Angeheftet Gesperrt Verschoben Technology technology
4

55 Stimmen

4 Beiträge

46 Aufrufe

M

Tragedy of the commons? Everyone wants to use it, no one wants to put forward the resources to maintain it.
M

Here's your first look at the rebooted Digg | TechCrunch
Beobachtet Ignoriert Geplant Angeheftet Gesperrt Verschoben Technology technology
59

1

110 Stimmen

59 Beiträge

596 Aufrufe

M

Digg has been basically dead for 15 years.
A

The people who think AI might become conscious
Beobachtet Ignoriert Geplant Angeheftet Gesperrt Verschoben Technology technology
8

1

6 Stimmen

8 Beiträge

78 Aufrufe

?

List of people who know what the fuck consciousness even is:
1

Freetube is the best way to watch YouTube
Beobachtet Ignoriert Geplant Angeheftet Gesperrt Verschoben Technology technology
5

1

0 Stimmen

5 Beiträge

38 Aufrufe

1

Yeah there are some differences. Flatpaks are not updated when you update your system but you can run the "flatpak update" command to update all your Flatpak apps at once. After install, it should just work.