Elon Musk wants to rewrite "the entire corpus of human knowledge" with Grok
-
adding missing information
From where?
wrote on 23 June 2025, 00:42:
Musk's fascist ass.
-
That's not how knowledge works. You can't just have an LLM hallucinate in missing gaps in knowledge and call it good.
wrote on 23 June 2025, 00:45:
Yeah, this would be a stupid plan based on a defective understanding of how LLMs work, even before taking the blatant ulterior motives into account.
-
We will use Grok 3.5 (maybe we should call it 4), which has advanced reasoning, to rewrite the entire corpus of human knowledge, adding missing information and deleting errors.
Then retrain on that.
Far too much garbage in any foundation model trained on uncorrected data.
::: spoiler More Context
Source.
:::
wrote on 23 June 2025, 00:59:
Aren't you not supposed to train LLMs on LLM-generated content?
Also he should call it Grok 5; so powerful that it skips over 4. That would be very characteristic of him
-
It's not the LLM doing that though. It's the people feeding it information
wrote on 23 June 2025, 01:12:
Try rereading the whole tweet; it's not very long. It specifically says that they plan to "correct" the dataset using Grok, then retrain with that dataset.
It would be way too expensive to go through it by hand
-
The plan to "rewrite the entire corpus of human knowledge" with AI sounds impressive until you realize LLMs are just pattern-matching systems that remix existing text. They can't create genuinely new knowledge or identify "missing information" that wasn't already in their training data.
wrote on 23 June 2025, 01:41:
Generally, yes. However, there have been some incredible (borderline "magic") emergent generalization capabilities that I don't think anyone was expecting.
Modern AI is more than just "pattern matching" at this point. Yes at the lowest levels, sure that's what it's doing, but then you could also say human brains are just pattern matching at that same low level.
-
Generally, yes. However, there have been some incredible (borderline "magic") emergent generalization capabilities that I don't think anyone was expecting.
Modern AI is more than just "pattern matching" at this point. Yes at the lowest levels, sure that's what it's doing, but then you could also say human brains are just pattern matching at that same low level.
wrote on 23 June 2025, 01:49:
Nothing that has been demonstrated makes me think these chatbots should be allowed to rewrite human history. What the fuck?!
-
We will use Grok 3.5 (maybe we should call it 4), which has advanced reasoning, to rewrite the entire corpus of human knowledge, adding missing information and deleting errors.
Then retrain on that.
Far too much garbage in any foundation model trained on uncorrected data.
::: spoiler More Context
Source.
:::
wrote on 23 June 2025, 02:00:
Fake news!
What a dickbag. I'll never forgive him for bastardizing one of my favorite works of fiction (Stranger in a Strange Land).
-
adding missing information
From where?
wrote on 23 June 2025, 02:01:
He wants to give Grok some digital ketamine and/or other psychoactive LLM mind expansives.
-
"Adding missing information" Like... From where?
wrote on 23 June 2025, 02:07:
Computer... enhance!
-
Aren't you not supposed to train LLMs on LLM-generated content?
Also he should call it Grok 5; so powerful that it skips over 4. That would be very characteristic of him
wrote on 23 June 2025, 02:14:
Watch the documentary "Multiplicity".
-
We will use Grok 3.5 (maybe we should call it 4), which has advanced reasoning, to rewrite the entire corpus of human knowledge, adding missing information and deleting errors.
Then retrain on that.
Far too much garbage in any foundation model trained on uncorrected data.
::: spoiler More Context
Source.
:::
wrote on 23 June 2025, 02:16:
This is it: I'm adding 'Musk' to my block list. I'm so tired of the pseudo-intellectual bullshit and bad interpretations of science fiction.
-
Aren't you not supposed to train LLMs on LLM-generated content?
Also he should call it Grok 5; so powerful that it skips over 4. That would be very characteristic of him
voroxpete@sh.itjust.works wrote on 23 June 2025, 02:38:
There are, as I understand it, ways you can train on AI-generated material without inviting model collapse, but that's more to do with distilling the output of a model. What Musk is describing is wholesale confabulation being fed back into the next generation of their model, which would be very bad. It's also a total pipe dream: getting an AI to rewrite something like the total training data set to your exact requirements, and verifying that it had done so satisfactorily, would be a monumental undertaking. The compute time alone would be staggering, and the human labour (to check the output) many times higher than that.
But the whiny little piss baby is mad that his own AI keeps fact-checking him, and his engineers have already explained that coding it to lie doesn't really work because the training data tends to outweigh the initial prompt, so this is the best theory he can come up with for how he can "fix" his AI expressing reality's well-known liberal bias.
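The model collapse mentioned here can be illustrated with a toy simulation. This is not an LLM, just a Gaussian refit on its own samples each generation, and every number below is invented for illustration, but the feedback loop is the same: estimation error compounds when each generation trains on the previous generation's output.

```python
import random
import statistics

# Toy illustration of model collapse (NOT an LLM): each "generation" fits a
# Gaussian to samples drawn from the previous generation's fitted model.
# Estimation error compounds across generations, and the learned distribution
# tends to degenerate toward zero variance.
def fit_generations(n_generations, n_samples=20, seed=0):
    rng = random.Random(seed)
    mu, sigma = 0.0, 1.0  # generation 0: the "real data" distribution
    history = []
    for _ in range(n_generations):
        samples = [rng.gauss(mu, sigma) for _ in range(n_samples)]
        mu = statistics.fmean(samples)     # refit on model output...
        sigma = statistics.stdev(samples)  # ...and repeat
        history.append((mu, sigma))
    return history

history = fit_generations(1000)
print("gen 1:    mu=%.3f sigma=%.3f" % history[0])
print("gen 1000: mu=%.3f sigma=%.3f" % history[-1])
```

With overwhelming probability the fitted sigma shrinks toward zero over many generations; the exact trajectory depends on the seed. The "distilling" escape hatch the comment mentions works precisely because it injects richer supervision than raw resampled output.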
-
This is the Ministry of Truth.
This is the Ministry of Truth on AI.
wrote on 23 June 2025, 02:42:
Actually, one of the characters in 1984 works in the department that produces machine-generated romance novels. Orwell pretty accurately predicted the idea of AI slop as a propaganda tool.
-
We will use Grok 3.5 (maybe we should call it 4), which has advanced reasoning, to rewrite the entire corpus of human knowledge, adding missing information and deleting errors.
Then retrain on that.
Far too much garbage in any foundation model trained on uncorrected data.
::: spoiler More Context
Source.
:::
wrote on 23 June 2025, 02:45:
Dude is gonna spend Manhattan Project-level money making another stupid fucking shitbot, trained on regurgitated AI slop.
Glorious.
-
Watch the documentary "Multiplicity".
wrote on 23 June 2025, 02:45:
I rented that multiple times when it came out!
-
There are, as I understand it, ways you can train on AI-generated material without inviting model collapse, but that's more to do with distilling the output of a model. What Musk is describing is wholesale confabulation being fed back into the next generation of their model, which would be very bad. It's also a total pipe dream: getting an AI to rewrite something like the total training data set to your exact requirements, and verifying that it had done so satisfactorily, would be a monumental undertaking. The compute time alone would be staggering, and the human labour (to check the output) many times higher than that.
But the whiny little piss baby is mad that his own AI keeps fact-checking him, and his engineers have already explained that coding it to lie doesn't really work because the training data tends to outweigh the initial prompt, so this is the best theory he can come up with for how he can "fix" his AI expressing reality's well-known liberal bias.
wrote on 23 June 2025, 02:46:
Model collapse is the ideal.
-
We will use Grok 3.5 (maybe we should call it 4), which has advanced reasoning, to rewrite the entire corpus of human knowledge, adding missing information and deleting errors.
Then retrain on that.
Far too much garbage in any foundation model trained on uncorrected data.
::: spoiler More Context
Source.
:::
wrote on 23 June 2025, 03:14:
Let me guess: "the Holocaust was a myth" is first on his list.
He should just go to hell early.
-
Aren't you not supposed to train LLMs on LLM-generated content?
Also he should call it Grok 5; so powerful that it skips over 4. That would be very characteristic of him
brucethemoose@lemmy.world wrote on 23 June 2025, 03:25:
There's some nuance.
Using LLMs to augment data, especially for fine-tuning (not training the base model), is a sound method. The DeepSeek paper is famous for this, using generated reasoning traces, for instance.
Another is using LLMs to generate logprobs of text, and training not just on the text itself but on the probability a frontier LLM assigns to every 'word.' This is called distillation, though there's some variation and complication. It's also great because it's more power/time efficient. Look up Arcee models and their distillation training kit for more on this, and the code to see how it works.
There are some papers on "self-play" that can indeed help LLMs.
But yes, the "dumb" way, aka putting data into a text box and asking an LLM to correct it, is dumb and dumber, because:

- You introduce some combination of sampling errors and repetition/overused-word issues, depending on the sampling settings. There's no way around this with old autoregressive LLMs.
- You possibly pollute your dataset with "filler."
- In Musk's specific proposition, it doesn't even fill knowledge gaps the old Grok has.

In other words, Musk has no idea WTF he's talking about. It's the most boomer, AI-bro, non-techy ChatGPT-user thing he could propose.
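The logprob distillation this comment describes (training a student against a teacher's full next-token distribution rather than the raw text) can be sketched as a soft-label cross-entropy. The logits below are invented toy numbers over a 4-token "vocabulary"; real pipelines use framework tensors (e.g. PyTorch) and actual vocabulary-sized model outputs:

```python
import math

def softmax(logits):
    # Numerically stable softmax over a list of floats.
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]

def distillation_loss(teacher_logits, student_logits, temperature=2.0):
    # Soft-label cross-entropy H(p_teacher, p_student): the student is pushed
    # toward the teacher's whole distribution over the vocabulary, not just
    # the single token that happened to be sampled into the text.
    p = softmax([x / temperature for x in teacher_logits])
    q = softmax([x / temperature for x in student_logits])
    return -sum(pi * math.log(qi) for pi, qi in zip(p, q))

# Invented logits for a toy 4-token vocabulary.
teacher = [2.0, 0.5, -1.0, 0.1]
print(distillation_loss(teacher, teacher))                # student matches teacher: minimal loss
print(distillation_loss(teacher, [-1.0, 2.0, 0.5, 0.1]))  # mismatched student: higher loss
```

Because cross-entropy is minimized exactly when the student's distribution equals the teacher's, this carries far more signal per token than the "dumb" approach of retraining on sampled text, which is why distillation can reuse model output without the collapse problem.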
-
We will use Grok 3.5 (maybe we should call it 4), which has advanced reasoning, to rewrite the entire corpus of human knowledge, adding missing information and deleting errors.
Then retrain on that.
Far too much garbage in any foundation model trained on uncorrected data.
::: spoiler More Context
Source.
:::
brucethemoose@lemmy.world wrote on 23 June 2025, 03:30:
I elaborated below, but basically Musk has no idea WTF he's talking about.
If I had his "f you" money, I'd at least try a diffusion or bitnet model (and open the weights for others to improve on), and probably 100 other papers I consider low-hanging fruit, before this absolutely dumb boomer take.
He's such an idiot know-it-all. It's so painful whenever he ventures into a field you sorta know.
But he might just be shouting nonsense on Twitter while X employees actually do something different, because if they take his orders verbatim, they're going to get crap models, even with all the stupid brute force they have.
-
We will use Grok 3.5 (maybe we should call it 4), which has advanced reasoning, to rewrite the entire corpus of human knowledge, adding missing information and deleting errors.
Then retrain on that.
Far too much garbage in any foundation model trained on uncorrected data.
::: spoiler More Context
Source.
:::
wrote on 23 June 2025, 03:32:
I figure the whole point of this stuff is to trick people into replacing their own thoughts with these models, effectively replacing consensus reality with nonsense. Meanwhile, the oligarchy will utilise mass data collection via Palantir and ML to power the police state.
-