Elon Musk wants to rewrite "the entire corpus of human knowledge" with Grok
-
Watch the documentary "Multiplicity".
I rented that multiple times when it came out!
-
There are, as I understand it, ways to train on AI-generated material without inviting model collapse, but those have more to do with distilling the output of a model. What Musk is describing is wholesale confabulation being fed back into the next generation of their model, which would be very bad. It's also a total pipe dream. Getting an AI to rewrite something like the total training data set to your exact requirements, and verifying that it had done so satisfactorily, would be an absolutely monumental undertaking. The compute time alone would be staggering, and the human labour (to check the output) many times higher than that.
But the whiny little piss baby is mad that his own AI keeps fact checking him, and his engineers have already explained that coding it to lie doesn't really work because the training data tends to outweigh the initial prompt, so this is the best theory he can come up with for how he can "fix" his AI expressing reality's well known liberal bias.
Model collapse is the ideal.
-
We will use Grok 3.5 (maybe we should call it 4), which has advanced reasoning, to rewrite the entire corpus of human knowledge, adding missing information and deleting errors.
Then retrain on that.
Far too much garbage in any foundation model trained on uncorrected data.
::: spoiler More Context
Source.
:::
Lemme guess: "The Holocaust was a myth" is first on his list.
He should just go to hell early.
-
Aren't you not supposed to train LLMs on LLM-generated content?
Also, he should call it Grok 5, so powerful that it skips over 4. That would be very characteristic of him.
There’s some nuance.
Using LLMs to augment data, especially for fine-tuning (not training the base model), is a sound method. The DeepSeek paper, for instance, is famous for using generated reasoning traces.
Another is using LLMs to generate logprobs of text, and training not just on the text itself but on the *probability distribution* a frontier LLM assigns to every 'word' (token). This is called distillation, though there's some variation and complication. It's also great because it's more power- and time-efficient. Look up Arcee models and their distillation training kit for more on this, and the code to see how it works.
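A minimal sketch of the logprob-distillation idea above, in toy NumPy. The softmax temperature and the tiny three-token vocabulary are illustrative assumptions, not anyone's actual training kit:

```python
import numpy as np

def softmax(logits, temperature=1.0):
    """Convert raw logits into a probability distribution."""
    z = logits / temperature
    z = z - z.max(axis=-1, keepdims=True)   # numerical stability
    e = np.exp(z)
    return e / e.sum(axis=-1, keepdims=True)

def distillation_loss(teacher_logits, student_logits, temperature=2.0):
    """Mean KL(teacher || student) over the vocabulary: the student is
    pushed toward the teacher's full per-token distribution, not just
    the single sampled word."""
    p = softmax(teacher_logits, temperature)            # teacher's soft targets
    log_q = np.log(softmax(student_logits, temperature))
    return float(np.mean(np.sum(p * (np.log(p) - log_q), axis=-1)))
```

A real training loop would backpropagate this loss through the student; the point is that each token position carries a whole distribution of supervision instead of one hard label.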
There are some papers on “self play” that can indeed help LLMs.
But yes, the “dumb” way, aka putting data into a text box and asking an LLM to correct it, is dumb and dumber, because:
-
You introduce some combination of sampling errors and repetition/overused word issues, depending on the sampling settings. There’s no way around this with old autoregressive LLMs.
-
You possibly pollute your dataset with “filler”
-
In Musk's specific proposition, it doesn’t even fill knowledge gaps the old Grok has.
In other words, Musk has no idea WTF he's talking about. It's the most boomer, AI-bro, non-techy-ChatGPT-user thing he could propose.
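To illustrate the sampling point in the first bullet: a toy next-token sampler (the word list and its logits are made up for the demo, not from any real model) shows how low temperature collapses draws onto the same few high-probability words, which is one source of the overused-word effect:

```python
import math
import random

def sample_word(word_logits, temperature, rng):
    """Sample one 'next word' from softmaxed logits at a given temperature."""
    scaled = {w: l / temperature for w, l in word_logits.items()}
    m = max(scaled.values())
    probs = {w: math.exp(s - m) for w, s in scaled.items()}  # unnormalized
    r = rng.random() * sum(probs.values())
    for w, p in probs.items():
        r -= p
        if r <= 0:
            return w
    return w  # fallback for floating-point edge cases

logits = {"delve": 2.0, "explore": 1.5, "probe": 1.0, "rummage": 0.2}
rng = random.Random(0)
# at temperature 0.1 nearly every draw is the single top word
cold = [sample_word(logits, 0.1, rng) for _ in range(20)]
```

Raise the temperature toward 1.0 or above and the draws spread out again, at the cost of more outright sampling errors; there is no setting that removes both problems at once.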
-
-
We will use Grok 3.5 (maybe we should call it 4), which has advanced reasoning, to rewrite the entire corpus of human knowledge, adding missing information and deleting errors.
Then retrain on that.
Far too much garbage in any foundation model trained on uncorrected data.
::: spoiler More Context
Source.
:::
I elaborated below, but basically Musk has no idea WTF he's talking about.
If I had his “f you” money, I’d at least try a diffusion or bitnet model (and open the weights for others to improve on), and probably 100 other papers I consider low hanging fruit, before this absolutely dumb boomer take.
He’s such an idiot know it all. It’s so painful whenever he ventures into a field you sorta know.
But he might just be shouting nonsense on Twitter while X employees actually do something different. Because if they take his orders verbatim they’re going to get crap models, even with all the stupid brute force they have.
-
We will use Grok 3.5 (maybe we should call it 4), which has advanced reasoning, to rewrite the entire corpus of human knowledge, adding missing information and deleting errors.
Then retrain on that.
Far too much garbage in any foundation model trained on uncorrected data.
::: spoiler More Context
Source.
:::
I figure the whole point of this stuff is to trick people into replacing their own thoughts with these models, and effectively replace consensus reality with nonsense. Meanwhile, the oligarchy will utilise mass data collection via Palantir and ML to power the police state.
-
adding missing information
From where?
Frog DNA
-
Lemme guess: "The Holocaust was a myth" is first on his list.
He should just go to hell early.
It already proved its inability with facts with its white genocide rantings.
-
I figure the whole point of this stuff is to trick people into replacing their own thoughts with these models, and effectively replace consensus reality with nonsense. Meanwhile, the oligarchy will utilise mass data collection via Palantir and ML to power the police state.
I don't like it, but it sure seems like you're correct.
-
I figure the whole point of this stuff is to trick people into replacing their own thoughts with these models, and effectively replace consensus reality with nonsense. Meanwhile, the oligarchy will utilise mass data collection via Palantir and ML to power the police state.
Has consensus reality ever been a thing?
-
We will use Grok 3.5 (maybe we should call it 4), which has advanced reasoning, to rewrite the entire corpus of human knowledge, adding missing information and deleting errors.
Then retrain on that.
Far too much garbage in any foundation model trained on uncorrected data.
::: spoiler More Context
Source.
:::
Prepare for Grokipedia to have only one article, about white genocide, while every other article links to "Did you mean White Genocide?"
-
This is it. I'm adding 'Musk' to my block list. I'm so tired of the pseudo-intellectual bullshit and bad interpretations of science fiction.
I think deep down Musk knows he's fairly mediocre, intelligence-wise. I think the drugs allow him to temporarily forget that.
-
Aren't you not supposed to train LLMs on LLM-generated content?
Also he should call it Grok 5; so powerful that it skips over 4. That would be very characteristic of him
Musk probably heard about "synthetic data" training, which is where you use machine learning to create thousands of things that are typical enough to be good training data. Microsoft uses it to take documents users upload to Office 365, train an ML model, and then use that ML output to train an LLM, so they can technically say "no, your data wasn't used to train an LLM." Because it trained the thing that trained the LLM.
However, you can't do that with LLM output and stuff like... History. WTF evidence and documents are the basis for the crap he wants to add? The hallucinations will just compound because who's going to cross-check this other than Grok anyway?
-
We will use Grok 3.5 (maybe we should call it 4), which has advanced reasoning, to rewrite the entire corpus of human knowledge, adding missing information and deleting errors.
Then retrain on that.
Far too much garbage in any foundation model trained on uncorrected data.
::: spoiler More Context
Source.
:::
I think most AI corp tech bros do want to control information, they just aren't high enough on Ket to say it out loud.
-
We will use Grok 3.5 (maybe we should call it 4), which has advanced reasoning, to rewrite the entire corpus of human knowledge, adding missing information and deleting errors.
Then retrain on that.
Far too much garbage in any foundation model trained on uncorrected data.
::: spoiler More Context
Source.
:::
Most if not all leading models use synthetic data extensively to do exactly this. However, the synthetic data needs to be well defined and essentially programmed by the data scientists. If you don't define the data very carefully (ideally as math or programs you can verify as correct automatically), it's worse than useless. The scope is usually very narrow: no Hitchhiker's Guide to the Galaxy rewrite.
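A toy sketch of the "verify synthetic data automatically" idea above. The arithmetic task and the deliberately noisy generator are made-up stand-ins for illustration, not any lab's actual pipeline:

```python
import random

def noisy_generator(a, b, rng):
    """Stand-in for an LLM answering 'a + b = ?': usually right,
    sometimes off by one (a hallucination)."""
    return a + b + (1 if rng.random() < 0.2 else 0)

def make_verified_samples(n, seed=0):
    """Keep only generated samples that pass an automatic checker,
    so nothing unverified enters the training set."""
    rng = random.Random(seed)
    samples = []
    while len(samples) < n:
        a, b = rng.randint(0, 999), rng.randint(0, 999)
        guess = noisy_generator(a, b, rng)
        if guess == a + b:               # the verification gate
            samples.append((f"{a} + {b} =", str(guess)))
    return samples
```

For math or code you can check the answers mechanically like this; for "the entire corpus of human knowledge" there is no such checker, which is exactly the problem.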
But in any case he's probably just parroting whatever his engineers pitched him to look smart and in charge.
-
Lemme guess: "The Holocaust was a myth" is first on his list.
He should just go to hell early.
Just one of those errors to be deleted.
-
If we had direct control over how our tax dollars were spent, things would be different pretty fast. Might not be better, but different.
At this point a significant part of the country would decide to airstrike US primary schools to stop wasting money and indoctrinating kids.
-
Lemme guess: "The Holocaust was a myth" is first on his list.
He should just go to hell early.
He's going to Mars as soon as FSD on Tesla is ready (next year, for sure!), assuming he doesn't blow up in his rocket, and once there he'll chat with his amazing chatbot telling him, with a 20-minute delay on each message, that he truly is the best.
What an absolute retard.
-
It's not the LLM doing that though. It's the people feeding it information
Yes.
He wants to prompt grok to rewrite history according to his worldview, then retrain the model on that output.
-
Nothing that has been demonstrated makes me think these chatbots should be allowed to rewrite human history what the fuck?!
Tech bros see zero value in humanity beyond how it can be commodified.