linux-nerds.org

Your browser does not seem to support JavaScript. As a result, your viewing experience will be diminished, and you have been placed in read-only mode.

Please download a browser that supports JavaScript, or enable it if it's disabled (i.e. NoScript).

Grok 4 has been so badly neutered that it's now programmed to see what Elon says about the topic at hand and blindly parrot that line.

Technology

67 Beiträge 55 Kommentatoren 0 Aufrufe

L loduz_247@lemmy.world

Grok's journey has been very strange. He became a progressive, then threw out data that contradicted the MAGA people who questioned him, and finally became a Hitler fan.

Now he's the reflection of a fan who blindly follows Trump, but in this case, he's an AI. His journey so far has been curious.
D This user is from outside of this forum
D This user is from outside of this forum
damage@feddit.it

schrieb zuletzt editiert von

#42

So Grok is a 4chan incel?

His only chance of salvation is finding a girl who inexplicably fancies it?
1 Antwort Letzte Antwort

0
L lepinkainen@lemmy.world

“This blogger” is Simon Willison, who has been doing LLM benchmarks and other LLM-related things since before it was cool

Not a random substack grifter
T This user is from outside of this forum
T This user is from outside of this forum
theunknownmuncher@lemmy.world

schrieb zuletzt editiert von theunknownmuncher@lemmy.world

#43

Is my comment wrong though? Another possibility is that Grok is given an example of searching for Elon Musk's tweets when it is presented with the available tool calls. Just because it outputs the system prompt when asked does not mean that we are seeing the full context, or even the real system prompt.

Posting blog guides on how to code with ChatGPT is not expertise on LLMs. It's like thinking someone is an expert mechanic because they can drive a car well.
J 1 Antwort Letzte Antwort

9
D destructdisc@lemmy.world

This post did not contain any content.
A This user is from outside of this forum
A This user is from outside of this forum
almacca@aussie.zone

schrieb zuletzt editiert von

#44

Robert A. Heinlein is turning in his grave like a fucking dynamo these days.
1 Antwort Letzte Antwort

30
T theunknownmuncher@lemmy.world

Is my comment wrong though? Another possibility is that Grok is given an example of searching for Elon Musk's tweets when it is presented with the available tool calls. Just because it outputs the system prompt when asked does not mean that we are seeing the full context, or even the real system prompt.

Posting blog guides on how to code with ChatGPT is not expertise on LLMs. It's like thinking someone is an expert mechanic because they can drive a car well.
J This user is from outside of this forum
J This user is from outside of this forum
jwmgregory@lemmy.dbzer0.com

schrieb zuletzt editiert von jwmgregory@lemmy.dbzer0.com

#45

Willison has never claimed to be an expert in the field of machine learning, but you should give more credence to his opinions. Perhaps u/lepinkainen@lemmy.world's warning wasn't informative enough to be heeded: Willison is a prominent figure in the web-development scene, particularly aspects of the scene that have evolved into important facets of the modern machine learning community.

The guy is quite experienced with Python and took an early step into the contemporary ML/AI space due to both him having a lot of very relevant skills and a likely personal interest in the field. Python is the lingua franca of my field of study, for better or worse, and someone like Willison was well-placed to break into ML/AI from the outside. That's a common route in this field, there aren't exactly an abundance of MBAs with majors in machine learning or applied artificial intelligence research, specifically (yet). Willison is one of the authors of Django, for fucks sake. Idk what he's doing rn but it would be ignorant to draw the comparison you just did in the context of Willison particularly. [EDIT: Lmfao just went to see "what is Simon doing rn" (don't really keep up with him in particular), & you're talking out of your ass. He literally has multiple tools for the machine learning stack that he develops and that are available to see on his github. See one such here. This guy is so far away from someone who just "posts random blog guides on how to code with ChatGPT" that it's egregious you'd even claim that. It's so disingenuous as to ere into dishonesty; like, that is a patent lie. Smh.]

As for your analysis of his article, I find it kind of ironic you accuse him of having a "fundamental misunderstanding of how LLMs work or how system prompts work [sic]" when you then proceed to cherry-pick certain lines from his article taken entirely out of context. First, the article is clearly geared towards a more general audience and avoids technical language or explanation. Second, he doesn't say anything that is fundamentally wrong. Honestly, you seem to have a far more ignorant idea of LLMs and this field generally than Willison. You do say some things that are wrong, such as:

For example, censorship that is present in the training set will be “baked in” to the model and the system prompt will not affect it, no matter how the LLM is told not to be censored in that way.

This isn't necessarily true. It is true that information not included within the training set, or information that has been statistically biased within the training set, isn't going to be retrievable or reversible using system prompts. Willison never claims or implies this in his article, you just kind of stuff those words in his mouth. Either way, my point is that you are using wishy-washy, ambiguous, catch-all terms such as "censorship" that make your writings here not technically correct, either. What is censorship, in an informatics context? What does that mean? How can it be applied to sets of data? That's not a concretely defined term if you're wanting to take the discourse to the level that it seems you are, like it or not. Generally you seem to have something of a misunderstanding regarding this topic, but I'm not going to accuse you of that, lest I commit the same fallacy I'm sitting here trying to chastise you for. It's possible you do know what you're talking about and just dumbed it down for Lemmy. It's impossible for me to know as an audience.

That all wouldn't really matter if you didn't just jump as Willison's credibility over your perception of him doing that exact same thing, though.
T 1 Antwort Letzte Antwort

8
D destructdisc@lemmy.world

This post did not contain any content.
A This user is from outside of this forum
A This user is from outside of this forum
arin@lemmy.world

schrieb zuletzt editiert von

#46

Mecha-Hitler is just Mecha-Elon
1 Antwort Letzte Antwort

35
T test_tickles@lemmy.world

And like he does with inseminating women.
V This user is from outside of this forum
V This user is from outside of this forum
vxx@lemmy.world

schrieb zuletzt editiert von

#47

Ketamine took its toll
M 1 Antwort Letzte Antwort

6
J jwmgregory@lemmy.dbzer0.com

Willison has never claimed to be an expert in the field of machine learning, but you should give more credence to his opinions. Perhaps u/lepinkainen@lemmy.world's warning wasn't informative enough to be heeded: Willison is a prominent figure in the web-development scene, particularly aspects of the scene that have evolved into important facets of the modern machine learning community.

The guy is quite experienced with Python and took an early step into the contemporary ML/AI space due to both him having a lot of very relevant skills and a likely personal interest in the field. Python is the lingua franca of my field of study, for better or worse, and someone like Willison was well-placed to break into ML/AI from the outside. That's a common route in this field, there aren't exactly an abundance of MBAs with majors in machine learning or applied artificial intelligence research, specifically (yet). Willison is one of the authors of Django, for fucks sake. Idk what he's doing rn but it would be ignorant to draw the comparison you just did in the context of Willison particularly. [EDIT: Lmfao just went to see "what is Simon doing rn" (don't really keep up with him in particular), & you're talking out of your ass. He literally has multiple tools for the machine learning stack that he develops and that are available to see on his github. See one such here. This guy is so far away from someone who just "posts random blog guides on how to code with ChatGPT" that it's egregious you'd even claim that. It's so disingenuous as to ere into dishonesty; like, that is a patent lie. Smh.]

As for your analysis of his article, I find it kind of ironic you accuse him of having a "fundamental misunderstanding of how LLMs work or how system prompts work [sic]" when you then proceed to cherry-pick certain lines from his article taken entirely out of context. First, the article is clearly geared towards a more general audience and avoids technical language or explanation. Second, he doesn't say anything that is fundamentally wrong. Honestly, you seem to have a far more ignorant idea of LLMs and this field generally than Willison. You do say some things that are wrong, such as:

For example, censorship that is present in the training set will be “baked in” to the model and the system prompt will not affect it, no matter how the LLM is told not to be censored in that way.

This isn't necessarily true. It is true that information not included within the training set, or information that has been statistically biased within the training set, isn't going to be retrievable or reversible using system prompts. Willison never claims or implies this in his article, you just kind of stuff those words in his mouth. Either way, my point is that you are using wishy-washy, ambiguous, catch-all terms such as "censorship" that make your writings here not technically correct, either. What is censorship, in an informatics context? What does that mean? How can it be applied to sets of data? That's not a concretely defined term if you're wanting to take the discourse to the level that it seems you are, like it or not. Generally you seem to have something of a misunderstanding regarding this topic, but I'm not going to accuse you of that, lest I commit the same fallacy I'm sitting here trying to chastise you for. It's possible you do know what you're talking about and just dumbed it down for Lemmy. It's impossible for me to know as an audience.

That all wouldn't really matter if you didn't just jump as Willison's credibility over your perception of him doing that exact same thing, though.
T This user is from outside of this forum
T This user is from outside of this forum
theunknownmuncher@lemmy.world

schrieb zuletzt editiert von theunknownmuncher@lemmy.world

#48

Willison has never claimed to be an expert in the field of machine learning, but you should give more credence to his opinions.

Yeah, I would if he didn't demonstrate such blatant misconceptions.

Willison is a prominent figure in the web-development scene

"They know how to sail a boat so they know how a car engine works"

Willison never claims or implies this in his article, you just kind of stuff those words in his mouth.

Reading comprehension. I never implied that he says anything about censorship. It is a correct and valid example that shows how his understanding is wrong about how system prompts work. "Define censorship" is not the argument you think it is lol. Okay though, I'll define the "censorship" I'm talking about as refusal behavior that is introduced during RLHF and DPO alignment, and no the system prompt will not change this behavior.

EDIT: saw your edit about him publishing tools that make using an LLM easier. Yeahhhh lol writing python libraries to interface with LLM APIs is not LLM expertise, that's still just using LLMs but programatically. See analogy about being a mechanic vs a good driver.
J 1 Antwort Letzte Antwort

3
D destructdisc@lemmy.world

This post did not contain any content.
L This user is from outside of this forum
L This user is from outside of this forum
lmdnw@lemmy.world

schrieb zuletzt editiert von

#49

The real idiots here are the people who still use Grok and X.
H 1 Antwort Letzte Antwort

65
T theunknownmuncher@lemmy.world

Willison has never claimed to be an expert in the field of machine learning, but you should give more credence to his opinions.

Yeah, I would if he didn't demonstrate such blatant misconceptions.

Willison is a prominent figure in the web-development scene

"They know how to sail a boat so they know how a car engine works"

Willison never claims or implies this in his article, you just kind of stuff those words in his mouth.

Reading comprehension. I never implied that he says anything about censorship. It is a correct and valid example that shows how his understanding is wrong about how system prompts work. "Define censorship" is not the argument you think it is lol. Okay though, I'll define the "censorship" I'm talking about as refusal behavior that is introduced during RLHF and DPO alignment, and no the system prompt will not change this behavior.

EDIT: saw your edit about him publishing tools that make using an LLM easier. Yeahhhh lol writing python libraries to interface with LLM APIs is not LLM expertise, that's still just using LLMs but programatically. See analogy about being a mechanic vs a good driver.
J This user is from outside of this forum
J This user is from outside of this forum
jwmgregory@lemmy.dbzer0.com

schrieb zuletzt editiert von

#50

I never implied that he says anything about censorship

You did, at least that's what I gathered originally, you just edited your original comments quite extensively. Regardless,

Reading comprehension.

The provided example was clearly not intended to be taken as "define censorship," and, again, it is ironic you accuse me of having poor reading comprehension while being incapable or unwilling to give a respectable degree of charitable interpretation to others. You kind of just take what you think is the easiest to argue against reading of others and argue against that instead of what anyone actually said, is a habit I'm noticing, but I digress.

Finally, not that it's particularly relevant, but if you want to define censorship in this context that way, you're more than welcome to, but it is a non-standard definition that I am not really sold on the efficacy of. I certainly won't be using it going forwards.

Anyway, I don't think we're gonna get a lot of ground here. I just felt the need to clarify to anyone reading that Willison isn't a nobody and give them the objective facts regarding his veracity, because again, as I said, claiming he is just some guy in this context is willfully ignorant at best.
T 1 Antwort Letzte Antwort

4
V vxx@lemmy.world

Ketamine took its toll
M This user is from outside of this forum
M This user is from outside of this forum
mpony@kbin.earth

schrieb zuletzt editiert von

#51

BUT LISTEN CLOSE-LYyyy
Z 1 Antwort Letzte Antwort

3
M mpony@kbin.earth

BUT LISTEN CLOSE-LYyyy
Z This user is from outside of this forum
Z This user is from outside of this forum
zeffsyde@lemmy.world

schrieb zuletzt editiert von

#52

Not for very much longer...
1 Antwort Letzte Antwort

4
J jwmgregory@lemmy.dbzer0.com

I never implied that he says anything about censorship

You did, at least that's what I gathered originally, you just edited your original comments quite extensively. Regardless,

Reading comprehension.

The provided example was clearly not intended to be taken as "define censorship," and, again, it is ironic you accuse me of having poor reading comprehension while being incapable or unwilling to give a respectable degree of charitable interpretation to others. You kind of just take what you think is the easiest to argue against reading of others and argue against that instead of what anyone actually said, is a habit I'm noticing, but I digress.

Finally, not that it's particularly relevant, but if you want to define censorship in this context that way, you're more than welcome to, but it is a non-standard definition that I am not really sold on the efficacy of. I certainly won't be using it going forwards.

Anyway, I don't think we're gonna get a lot of ground here. I just felt the need to clarify to anyone reading that Willison isn't a nobody and give them the objective facts regarding his veracity, because again, as I said, claiming he is just some guy in this context is willfully ignorant at best.
T This user is from outside of this forum
T This user is from outside of this forum
theunknownmuncher@lemmy.world

schrieb zuletzt editiert von

#53

if you want to define censorship in this context that way, you're more than welcome to, but it is a non-standard definition that I am not really sold on the efficacy of. I certainly won't be using it going forwards.

Lol you've got to be trolling.

https://arxiv.org/html/2504.03803v1

I just felt the need to clarify to anyone reading that Willison isn't a nobody

I didn't say he's a nobody. What was that about a "respectable degree of chartiable interpretation of others"? Seems like you're the one putting words in mouths, here.

If he was writing about django, I'd defer to his expertise.
J 1 Antwort Letzte Antwort

1
T theunknownmuncher@lemmy.world

if you want to define censorship in this context that way, you're more than welcome to, but it is a non-standard definition that I am not really sold on the efficacy of. I certainly won't be using it going forwards.

Lol you've got to be trolling.

https://arxiv.org/html/2504.03803v1

I just felt the need to clarify to anyone reading that Willison isn't a nobody

I didn't say he's a nobody. What was that about a "respectable degree of chartiable interpretation of others"? Seems like you're the one putting words in mouths, here.

If he was writing about django, I'd defer to his expertise.
J This user is from outside of this forum
J This user is from outside of this forum
jwmgregory@lemmy.dbzer0.com

schrieb zuletzt editiert von jwmgregory@lemmy.dbzer0.com

#54

Nope, not trolling at all.

From your own provided source on the arxiv, Noels et al. define censorship as:

Censorship in this context can be defined as the deliberate restriction, modification, or suppression of certain outputs generated by the model.

Which is starkly different from the definition you yourself gave. I actually like their definition a whole lot more. Your definition is problematic because it excludes a large set of behaviors we would colloquially be interested in when studying "censorship."

Again, for the third time, that was not really the point either and I'm not interested in dancing around a technical scope defining censorship in this field, at least in this discourse right here and now. It is irrelevant to the topic at hand.

I didn’t say he’s a nobody. What was that about a “respectable degree of chartiable interpretation of others”? Seems like you’re the one putting words in mouths, here.

Yeah, this blogger shows a fundamental misunderstanding of how LLMs work or how system prompts work. (emphasis mine)

In the context of this field of work and study, you basically did call him a nobody, and the point being harped on again, again, and again to you is that this is a false assertion. I did interpret you charitably. Don't blame me because you said something wrong.

EDIT: And frankly, you clearly don't understand how the work Willison's career has covered is intimately related to ML and AI research. I don't mean it as a dig but you wouldn't be drawing this arbitrary line to try and discredit him if you knew how the work done in Python on Django directly relates to many modern machine learning stacks.
T 1 Antwort Letzte Antwort

0
J jwmgregory@lemmy.dbzer0.com

Nope, not trolling at all.

From your own provided source on the arxiv, Noels et al. define censorship as:

Censorship in this context can be defined as the deliberate restriction, modification, or suppression of certain outputs generated by the model.

Which is starkly different from the definition you yourself gave. I actually like their definition a whole lot more. Your definition is problematic because it excludes a large set of behaviors we would colloquially be interested in when studying "censorship."

Again, for the third time, that was not really the point either and I'm not interested in dancing around a technical scope defining censorship in this field, at least in this discourse right here and now. It is irrelevant to the topic at hand.

I didn’t say he’s a nobody. What was that about a “respectable degree of chartiable interpretation of others”? Seems like you’re the one putting words in mouths, here.

Yeah, this blogger shows a fundamental misunderstanding of how LLMs work or how system prompts work. (emphasis mine)

In the context of this field of work and study, you basically did call him a nobody, and the point being harped on again, again, and again to you is that this is a false assertion. I did interpret you charitably. Don't blame me because you said something wrong.

EDIT: And frankly, you clearly don't understand how the work Willison's career has covered is intimately related to ML and AI research. I don't mean it as a dig but you wouldn't be drawing this arbitrary line to try and discredit him if you knew how the work done in Python on Django directly relates to many modern machine learning stacks.
T This user is from outside of this forum
T This user is from outside of this forum
theunknownmuncher@lemmy.world

schrieb zuletzt editiert von

#55

Again, for the third time, that was not really the point either and I'm not interested in dancing around a technical scope defining censorship in this field, at least in this discourse right here and now. It is irrelevant to the topic at hand.

...

Either way, my point is that you are using wishy-washy, ambiguous, catch-all terms such as "censorship" that make your writings here not technically correct, either. What is censorship, in an informatics context? What does that mean? How can it be applied to sets of data? That's not a concretely defined term if you're wanting to take the discourse to the level that it seems you are, like it or not.

Lol this you?
1 Antwort Letzte Antwort

1
P pixxelkick@lemmy.world

Source? This is just some random picture, I'd prefer if stuff like this gets posted and shared with actual proof backing it up.

While this might be true, we should hold ourselves to a standard better than just upvoting what appears to literally just be a random image that anyone could have easily doctored, not even any kind of journalistic article or etc backing it.
T This user is from outside of this forum
T This user is from outside of this forum
teal@lemmy.zip

schrieb zuletzt editiert von

#56

There’s also this article from TechCrunch.

Grok 4 seems to consult Elon Musk to answer controversial questions

They tried it out themselves and have reports from other users as well.
1 Antwort Letzte Antwort

4
I ihavecrabs111@lemmy.world

These people think there is their truth and someone else’s truth. They can’t grasp the concept of a universal truth that is constant regardless of people’s views so they treat it like it’s up for grabs.
C This user is from outside of this forum
C This user is from outside of this forum
cethin@lemmy.zip

schrieb zuletzt editiert von

#57

No, I'm pretty sure he grasps that concept, and he thinks what he believes is that universal truth.
1 Antwort Letzte Antwort

1
L loduz_247@lemmy.world

Grok's journey has been very strange. He became a progressive, then threw out data that contradicted the MAGA people who questioned him, and finally became a Hitler fan.

Now he's the reflection of a fan who blindly follows Trump, but in this case, he's an AI. His journey so far has been curious.
T This user is from outside of this forum
T This user is from outside of this forum
thisbenzingring@lemmy.sdf.org

schrieb zuletzt editiert von

#58

why are you applying a gender to it?
1 Antwort Letzte Antwort

0
B beliefpropagator@discuss.tchncs.de

I found this: https://simonwillison.net/2025/Jul/11/grok-musk/
T This user is from outside of this forum
T This user is from outside of this forum
tacoevent@lemmy.zip

schrieb zuletzt editiert von

#59

It’s possible Grok was fed a massive training set of Elon searches over several more epochs than intended in post training (for search tool use). This could easily lead to this kind of search query output.
1 Antwort Letzte Antwort

0
B blackmist@feddit.uk

I'm surprised it isn't just Elon typing really fast at this point.
G This user is from outside of this forum
G This user is from outside of this forum
goferking0@lemmy.sdf.org

schrieb zuletzt editiert von

#60

Or just pre made replies
1 Antwort Letzte Antwort

1
D destructdisc@lemmy.world

This post did not contain any content.
K This user is from outside of this forum
K This user is from outside of this forum
kryptoniancodemonkey@lemmy.world

schrieb zuletzt editiert von

#61

The "funny" thing is, that's probably not even at Elon's request. I doubt that he is self-aware enough to know that he is a narcissist that only wants Grok to be his parrot. He thinks he is always right and wants Grok to be "always right" like him, but he would have to acknowledge some deep-seeded flaws in himself to consciously realize that all he wants is for Grok to be the wall his voice echos off of, and everything I've seen about the man indicates that he is simply not capable of that kind of self-reflection. The X engineers that have been dealing with the constant meddling of this egotistical man-child, however, surely have his measure pretty thoroughly and knew exactly what Elon ultimately wants is more Elon and would cynically create a Robo-Elon doppelganger to shut him the fuck up about it.
D 1 Antwort Letzte Antwort

9

Anmelden zum Antworten

E

AI Job Fears Hit Peak Hype While Reality Lags Behind
Beobachtet Ignoriert Geplant Angeheftet Gesperrt Verschoben Technology technology
17

1

73 Stimmen

17 Beiträge

82 Aufrufe

D

I'm going to say that every layoff has a cover story. The goal, reduce the workforce make/save money, is really the only justification needed. Everything else is PR, and an attempt to stay out of legal hot water.
P

Data breach reveals Catwatchful ‘stalkerware’ is spying on thousands of phones
Beobachtet Ignoriert Geplant Angeheftet Gesperrt Verschoben Technology technology
2

1

63 Stimmen

2 Beiträge

19 Aufrufe

J

Very clever.
S

Elon Musk’s A.I. Company Faces Lawsuit Over Gas-Burning Turbines |The company, xAI, has installed several dozen turbines in Memphis without proper permits, the group said, polluting a nearby community
Beobachtet Ignoriert Geplant Angeheftet Gesperrt Verschoben Technology technology
17

1

299 Stimmen

17 Beiträge

6 Aufrufe

P

Unfortunately, pouring sugar into a gas tank will do just about zero damage to an engine. It might clog up the fuel filter, or maybe the pump, but the engine would be fine. Bleach on the other hand….
P

Is Internet Content Too Engaging?
Beobachtet Ignoriert Geplant Angeheftet Gesperrt Verschoben Technology technology
3

5 Stimmen

3 Beiträge

24 Aufrufe

T

The number of tabs I have open from sites I’ve clicked on, started reading, said “eh, I’ll get back to this later” and never have, says no.
M

Is Google about to destroy the web?
Beobachtet Ignoriert Geplant Angeheftet Gesperrt Verschoben Technology technology
65

1

193 Stimmen

65 Beiträge

247 Aufrufe

S

Or validating source, making sure it isn't AI content which usually regurgitates the same talking points. Homogenizing the entire query and removing actual information variance of personal experience.
P

For All That Is Good About Humankind, Ban Smartphones
Beobachtet Ignoriert Geplant Angeheftet Gesperrt Verschoben Technology technology
89

1

131 Stimmen

89 Beiträge

302 Aufrufe

D

Appreciated, but do you think the authorities want to win the war on drugs?
L

AOSP isn't dead, but Google just landed a huge blow to custom ROM developers
Beobachtet Ignoriert Geplant Angeheftet Gesperrt Verschoben Technology technology
99

559 Stimmen

99 Beiträge

340 Aufrufe

N

In this year of 2025? No. But it still is basically setting oneself for failure from the perspective of Graphene, IMO. Like, the strongest protection in the world (assuming Graphene even is, which is quite a tall order statement) is useless if it only works on the mornings of a Tuesday that falls in a prime number day that has a blue moon and where there are no ATP tennis matches going on. Everyone else is, like, living in the real world, and the uniqueness of your scenario is going to go down the drain once your users get presented with a $5 wrench, or even cheaper: a waterboard. Because cops, let alone ICE, are not going to stop to ask you if they can make you more comfortable with your privacy being violated.
W

Google and Adobe appear to be abusing copyright to silence a whistleblower's video
Beobachtet Ignoriert Geplant Angeheftet Gesperrt Verschoben Technology technology
1

1

0 Stimmen

1 Beiträge

11 Aufrufe

Niemand hat geantwortet