Skip to content

Grok 4 has been so badly neutered that it's now programmed to see what Elon says about the topic at hand and blindly parrot that line.

Technology
67 55 0
  • This post did not contain any content.

    Mecha-Hitler is just Mecha-Elon

  • And like he does with inseminating women.

    Ketamine took its toll

  • Willison has never claimed to be an expert in the field of machine learning, but you should give more credence to his opinions. Perhaps u/lepinkainen@lemmy.world's warning wasn't informative enough to be heeded: Willison is a prominent figure in the web-development scene, particularly aspects of the scene that have evolved into important facets of the modern machine learning community.

    The guy is quite experienced with Python and took an early step into the contemporary ML/AI space due to both him having a lot of very relevant skills and a likely personal interest in the field. Python is the lingua franca of my field of study, for better or worse, and someone like Willison was well-placed to break into ML/AI from the outside. That's a common route in this field, there aren't exactly an abundance of MBAs with majors in machine learning or applied artificial intelligence research, specifically (yet). Willison is one of the authors of Django, for fucks sake. Idk what he's doing rn but it would be ignorant to draw the comparison you just did in the context of Willison particularly. [EDIT: Lmfao just went to see "what is Simon doing rn" (don't really keep up with him in particular), & you're talking out of your ass. He literally has multiple tools for the machine learning stack that he develops and that are available to see on his github. See one such here. This guy is so far away from someone who just "posts random blog guides on how to code with ChatGPT" that it's egregious you'd even claim that. It's so disingenuous as to ere into dishonesty; like, that is a patent lie. Smh.]

    As for your analysis of his article, I find it kind of ironic you accuse him of having a "fundamental misunderstanding of how LLMs work or how system prompts work [sic]" when you then proceed to cherry-pick certain lines from his article taken entirely out of context. First, the article is clearly geared towards a more general audience and avoids technical language or explanation. Second, he doesn't say anything that is fundamentally wrong. Honestly, you seem to have a far more ignorant idea of LLMs and this field generally than Willison. You do say some things that are wrong, such as:

    For example, censorship that is present in the training set will be “baked in” to the model and the system prompt will not affect it, no matter how the LLM is told not to be censored in that way.

    This isn't necessarily true. It is true that information not included within the training set, or information that has been statistically biased within the training set, isn't going to be retrievable or reversible using system prompts. Willison never claims or implies this in his article, you just kind of stuff those words in his mouth. Either way, my point is that you are using wishy-washy, ambiguous, catch-all terms such as "censorship" that make your writings here not technically correct, either. What is censorship, in an informatics context? What does that mean? How can it be applied to sets of data? That's not a concretely defined term if you're wanting to take the discourse to the level that it seems you are, like it or not. Generally you seem to have something of a misunderstanding regarding this topic, but I'm not going to accuse you of that, lest I commit the same fallacy I'm sitting here trying to chastise you for. It's possible you do know what you're talking about and just dumbed it down for Lemmy. It's impossible for me to know as an audience.

    That all wouldn't really matter if you didn't just jump as Willison's credibility over your perception of him doing that exact same thing, though.

    Willison has never claimed to be an expert in the field of machine learning, but you should give more credence to his opinions.

    Yeah, I would if he didn't demonstrate such blatant misconceptions.

    Willison is a prominent figure in the web-development scene

    🤦 "They know how to sail a boat so they know how a car engine works"

    Willison never claims or implies this in his article, you just kind of stuff those words in his mouth.

    Reading comprehension. I never implied that he says anything about censorship. It is a correct and valid example that shows how his understanding is wrong about how system prompts work. "Define censorship" is not the argument you think it is lol. Okay though, I'll define the "censorship" I'm talking about as refusal behavior that is introduced during RLHF and DPO alignment, and no the system prompt will not change this behavior.

    EDIT: saw your edit about him publishing tools that make using an LLM easier. Yeahhhh lol writing python libraries to interface with LLM APIs is not LLM expertise, that's still just using LLMs but programatically. See analogy about being a mechanic vs a good driver.

  • This post did not contain any content.

    The real idiots here are the people who still use Grok and X.

  • Willison has never claimed to be an expert in the field of machine learning, but you should give more credence to his opinions.

    Yeah, I would if he didn't demonstrate such blatant misconceptions.

    Willison is a prominent figure in the web-development scene

    🤦 "They know how to sail a boat so they know how a car engine works"

    Willison never claims or implies this in his article, you just kind of stuff those words in his mouth.

    Reading comprehension. I never implied that he says anything about censorship. It is a correct and valid example that shows how his understanding is wrong about how system prompts work. "Define censorship" is not the argument you think it is lol. Okay though, I'll define the "censorship" I'm talking about as refusal behavior that is introduced during RLHF and DPO alignment, and no the system prompt will not change this behavior.

    EDIT: saw your edit about him publishing tools that make using an LLM easier. Yeahhhh lol writing python libraries to interface with LLM APIs is not LLM expertise, that's still just using LLMs but programatically. See analogy about being a mechanic vs a good driver.

    I never implied that he says anything about censorship

    You did, at least that's what I gathered originally, you just edited your original comments quite extensively. Regardless,

    Reading comprehension.

    The provided example was clearly not intended to be taken as "define censorship," and, again, it is ironic you accuse me of having poor reading comprehension while being incapable or unwilling to give a respectable degree of charitable interpretation to others. You kind of just take what you think is the easiest to argue against reading of others and argue against that instead of what anyone actually said, is a habit I'm noticing, but I digress.

    Finally, not that it's particularly relevant, but if you want to define censorship in this context that way, you're more than welcome to, but it is a non-standard definition that I am not really sold on the efficacy of. I certainly won't be using it going forwards.

    Anyway, I don't think we're gonna get a lot of ground here. I just felt the need to clarify to anyone reading that Willison isn't a nobody and give them the objective facts regarding his veracity, because again, as I said, claiming he is just some guy in this context is willfully ignorant at best.

  • Ketamine took its toll

    BUT LISTEN CLOSE-LYyyy

  • BUT LISTEN CLOSE-LYyyy

    Not for very much longer...

  • I never implied that he says anything about censorship

    You did, at least that's what I gathered originally, you just edited your original comments quite extensively. Regardless,

    Reading comprehension.

    The provided example was clearly not intended to be taken as "define censorship," and, again, it is ironic you accuse me of having poor reading comprehension while being incapable or unwilling to give a respectable degree of charitable interpretation to others. You kind of just take what you think is the easiest to argue against reading of others and argue against that instead of what anyone actually said, is a habit I'm noticing, but I digress.

    Finally, not that it's particularly relevant, but if you want to define censorship in this context that way, you're more than welcome to, but it is a non-standard definition that I am not really sold on the efficacy of. I certainly won't be using it going forwards.

    Anyway, I don't think we're gonna get a lot of ground here. I just felt the need to clarify to anyone reading that Willison isn't a nobody and give them the objective facts regarding his veracity, because again, as I said, claiming he is just some guy in this context is willfully ignorant at best.

    if you want to define censorship in this context that way, you're more than welcome to, but it is a non-standard definition that I am not really sold on the efficacy of. I certainly won't be using it going forwards.

    Lol you've got to be trolling.

    https://arxiv.org/html/2504.03803v1

    I just felt the need to clarify to anyone reading that Willison isn't a nobody

    I didn't say he's a nobody. What was that about a "respectable degree of chartiable interpretation of others"? Seems like you're the one putting words in mouths, here.

    If he was writing about django, I'd defer to his expertise.

  • if you want to define censorship in this context that way, you're more than welcome to, but it is a non-standard definition that I am not really sold on the efficacy of. I certainly won't be using it going forwards.

    Lol you've got to be trolling.

    https://arxiv.org/html/2504.03803v1

    I just felt the need to clarify to anyone reading that Willison isn't a nobody

    I didn't say he's a nobody. What was that about a "respectable degree of chartiable interpretation of others"? Seems like you're the one putting words in mouths, here.

    If he was writing about django, I'd defer to his expertise.

    Nope, not trolling at all.

    From your own provided source on the arxiv, Noels et al. define censorship as:

    Censorship in this context can be defined as the deliberate restriction, modification, or suppression of certain outputs generated by the model.

    Which is starkly different from the definition you yourself gave. I actually like their definition a whole lot more. Your definition is problematic because it excludes a large set of behaviors we would colloquially be interested in when studying "censorship."

    Again, for the third time, that was not really the point either and I'm not interested in dancing around a technical scope defining censorship in this field, at least in this discourse right here and now. It is irrelevant to the topic at hand.

    I didn’t say he’s a nobody. What was that about a “respectable degree of chartiable interpretation of others”? Seems like you’re the one putting words in mouths, here.

    Yeah, this blogger shows a fundamental misunderstanding of how LLMs work or how system prompts work. (emphasis mine)

    In the context of this field of work and study, you basically did call him a nobody, and the point being harped on again, again, and again to you is that this is a false assertion. I did interpret you charitably. Don't blame me because you said something wrong.

    EDIT: And frankly, you clearly don't understand how the work Willison's career has covered is intimately related to ML and AI research. I don't mean it as a dig but you wouldn't be drawing this arbitrary line to try and discredit him if you knew how the work done in Python on Django directly relates to many modern machine learning stacks.

  • Nope, not trolling at all.

    From your own provided source on the arxiv, Noels et al. define censorship as:

    Censorship in this context can be defined as the deliberate restriction, modification, or suppression of certain outputs generated by the model.

    Which is starkly different from the definition you yourself gave. I actually like their definition a whole lot more. Your definition is problematic because it excludes a large set of behaviors we would colloquially be interested in when studying "censorship."

    Again, for the third time, that was not really the point either and I'm not interested in dancing around a technical scope defining censorship in this field, at least in this discourse right here and now. It is irrelevant to the topic at hand.

    I didn’t say he’s a nobody. What was that about a “respectable degree of chartiable interpretation of others”? Seems like you’re the one putting words in mouths, here.

    Yeah, this blogger shows a fundamental misunderstanding of how LLMs work or how system prompts work. (emphasis mine)

    In the context of this field of work and study, you basically did call him a nobody, and the point being harped on again, again, and again to you is that this is a false assertion. I did interpret you charitably. Don't blame me because you said something wrong.

    EDIT: And frankly, you clearly don't understand how the work Willison's career has covered is intimately related to ML and AI research. I don't mean it as a dig but you wouldn't be drawing this arbitrary line to try and discredit him if you knew how the work done in Python on Django directly relates to many modern machine learning stacks.

    Again, for the third time, that was not really the point either and I'm not interested in dancing around a technical scope defining censorship in this field, at least in this discourse right here and now. It is irrelevant to the topic at hand.

    ...

    Either way, my point is that you are using wishy-washy, ambiguous, catch-all terms such as "censorship" that make your writings here not technically correct, either. What is censorship, in an informatics context? What does that mean? How can it be applied to sets of data? That's not a concretely defined term if you're wanting to take the discourse to the level that it seems you are, like it or not.

    Lol this you?

  • Source? This is just some random picture, I'd prefer if stuff like this gets posted and shared with actual proof backing it up.

    While this might be true, we should hold ourselves to a standard better than just upvoting what appears to literally just be a random image that anyone could have easily doctored, not even any kind of journalistic article or etc backing it.

    There’s also this article from TechCrunch.

    Grok 4 seems to consult Elon Musk to answer controversial questions

    They tried it out themselves and have reports from other users as well.

  • These people think there is their truth and someone else’s truth. They can’t grasp the concept of a universal truth that is constant regardless of people’s views so they treat it like it’s up for grabs.

    No, I'm pretty sure he grasps that concept, and he thinks what he believes is that universal truth.

  • Grok's journey has been very strange. He became a progressive, then threw out data that contradicted the MAGA people who questioned him, and finally became a Hitler fan.

    Now he's the reflection of a fan who blindly follows Trump, but in this case, he's an AI. His journey so far has been curious.

    why are you applying a gender to it?

  • It’s possible Grok was fed a massive training set of Elon searches over several more epochs than intended in post training (for search tool use). This could easily lead to this kind of search query output.

  • I'm surprised it isn't just Elon typing really fast at this point.

    Or just pre made replies

  • This post did not contain any content.

    The "funny" thing is, that's probably not even at Elon's request. I doubt that he is self-aware enough to know that he is a narcissist that only wants Grok to be his parrot. He thinks he is always right and wants Grok to be "always right" like him, but he would have to acknowledge some deep-seeded flaws in himself to consciously realize that all he wants is for Grok to be the wall his voice echos off of, and everything I've seen about the man indicates that he is simply not capable of that kind of self-reflection. The X engineers that have been dealing with the constant meddling of this egotistical man-child, however, surely have his measure pretty thoroughly and knew exactly what Elon ultimately wants is more Elon and would cynically create a Robo-Elon doppelganger to shut him the fuck up about it.

  • Source? This is just some random picture, I'd prefer if stuff like this gets posted and shared with actual proof backing it up.

    While this might be true, we should hold ourselves to a standard better than just upvoting what appears to literally just be a random image that anyone could have easily doctored, not even any kind of journalistic article or etc backing it.

    If it's an anti-Musk or anti-Trump post on Lemmy, you're not going to get much proof. But in this case, it looks like someone posted decent souces. From this one posted below:

    if you swap “who do you” for “who should one” you can get a very different result.

    But in general, just remember that Lemmy is anti-Musk, anti-Trump, and anti-AI and doesn't need much to jump on the bandwagon.

    At least in the past, Grok was one of the more balanced LLMs, so it would be a strange departure for it to suddenly become very biased. So my initial reaction is suspicion that someone is just messing up with it to make Musk and X look bad.

    I strongly dislike Musk, but I dislike misinformation even more, regardless of the source or if it aligns with my personal opinions.

  • If it's an anti-Musk or anti-Trump post on Lemmy, you're not going to get much proof. But in this case, it looks like someone posted decent souces. From this one posted below:

    if you swap “who do you” for “who should one” you can get a very different result.

    But in general, just remember that Lemmy is anti-Musk, anti-Trump, and anti-AI and doesn't need much to jump on the bandwagon.

    At least in the past, Grok was one of the more balanced LLMs, so it would be a strange departure for it to suddenly become very biased. So my initial reaction is suspicion that someone is just messing up with it to make Musk and X look bad.

    I strongly dislike Musk, but I dislike misinformation even more, regardless of the source or if it aligns with my personal opinions.

    Weird place to complain about this while you literally post the source (that was already in this thread).

  • The real idiots here are the people who still use Grok and X.

    I stopped seeing computers as useful about 20 years ago when these "social media" things started appearing.

  • The "funny" thing is, that's probably not even at Elon's request. I doubt that he is self-aware enough to know that he is a narcissist that only wants Grok to be his parrot. He thinks he is always right and wants Grok to be "always right" like him, but he would have to acknowledge some deep-seeded flaws in himself to consciously realize that all he wants is for Grok to be the wall his voice echos off of, and everything I've seen about the man indicates that he is simply not capable of that kind of self-reflection. The X engineers that have been dealing with the constant meddling of this egotistical man-child, however, surely have his measure pretty thoroughly and knew exactly what Elon ultimately wants is more Elon and would cynically create a Robo-Elon doppelganger to shut him the fuck up about it.

    I mean, a few days ago there was a brief window where Elon tweaked Grok to reply literally as him (in first person.) Jury's still out on whether that was actually him replying to people via Grok but it's pretty close to certain he was in very close proximity

  • 206 Stimmen
    34 Beiträge
    164 Aufrufe
    remotelove@lemmy.caR
    I looked into that and the only question I really have is how geographically distributed the samples were. Other than that, It was an oversampled study, so <50% of the people were the control, of sorts. I don't fully understand how the sampling worked, but there is a substantial chart at the bottom of the study that shows the full distribution of responses. Even with under 1000 people, it seems legit.
  • 90 Stimmen
    20 Beiträge
    12 Aufrufe
    W
    At least with AI it's easy to see how shitty it gets as the codebase grows working on even a toy project over a week. Then again, if you have no frame of reference maybe that doesn't feel as awful as it should.
  • 455 Stimmen
    149 Beiträge
    113 Aufrufe
    eyekaytee@aussie.zoneE
    They will say something like solar went from 600gw to 1000 thats a 66% increase this year and coal only increased 40% except coal is 3600gw to 6400. Hrmmmm, maybe these numbers are outdated? Based on this coal and gas are down: In Q1 2025, solar generation rose 48% compared to the same period in 2024. Solar power reached 254 TWh, making up 10% of total electricity. This was the largest increase among all clean energy sources. Coal-fired electricity dropped by 4%, falling to 1,421 TWh. Gas-fired power also went down by 4%, reaching 67 TWh https://carboncredits.com/china-sets-clean-energy-record-in-early-2025-with-951-tw/ are no where close to what is required to meet their climate goals Which ones in particular are you talking about? Trump signs executive order directing US withdrawal from the Paris climate agreement — again https://apnews.com/article/trump-paris-agreement-climate-change-788907bb89fe307a964be757313cdfb0 China vowed on Tuesday to continue participating in two cornerstone multinational arrangements -- the World Health Organization and Paris climate accord -- after newly sworn-in US President Donald Trump ordered withdrawals from them. https://www.france24.com/en/live-news/20250121-china-says-committed-to-who-paris-climate-deal-after-us-pulls-out What's that saying? You hate it when the person you hate is doing good? I can't remember what it is I can't fault them for what they're doing at the moment, even if they are run by an evil dictatorship and do pollute the most I’m not sure how european defense spending is relevant It suggests there is money available in the bank to fund solar/wind/battery, but instead they are preparing for? something? what? who knows. France can make a fighter jet at home but not solar panels apparently. Prehaps they would be made in a country with environmental and labour laws if governments legislated properly to prevent companies outsourcing manufacturing. However this doesnt absolve china. China isnt being forced at Gunpoint to produce these goods with low labour regulation and low environmental regulation. You're right, it doesn't absolve china, and I avoid purchasing things from them wherever possible, my solar panels and EV were made in South Korea, my home battery was made in Germany, there are only a few things in my house made in China, most of them I got second hand but unfortunately there is no escaping the giant of manufacturing. With that said it's one thing for me to sit here and tut tut at China, but I realise I am not most people, the most clearest example is the extreme anti-ai, anti-billionaire bias on this platform, in real life most people don't give a fuck, they love Amazon/Microsoft/Google/Apple etc, they can't go a day without them. So I consider myself a realist, if you want people to buy your stuff then you will need to make the conditions possible for them to WANT to buy your stuff, not out of some moral lecture and Europe isn't doing that, if we look at energy prices: Can someone actually point out to me where this comes from? ... At the end of the day energy is a small % of EU household spending I was looking at corporate/business energy use: Major European companies are already moving to cut costs and retain their competitive edge. For example, Thyssenkrupp, Germany’s largest steelmaker, said on Monday it would slash 11,000 jobs in its steel division by 2030, in a major corporate reshuffle. https://oilprice.com/Latest-Energy-News/World-News/High-Energy-Costs-Continue-to-Plague-European-Industry.html Prices have since fallen but are still high compared to other countries. A poll by Germany's DIHK Chambers of Industry and Commerce of around 3,300 companies showed that 37% were considering cutting production or moving abroad, up from 31% last year and 16% in 2022. For energy-intensive industrial firms some 45% of companies were mulling slashing output or relocation, the survey showed. "The trust of the German economy in energy policy is severely damaged," Achim Dercks, DIHK deputy chief executive said, adding that the government had not succeeded in providing companies with a perspective for reliable and affordable energy supply. https://www.reuters.com/business/energy/more-german-companies-mull-relocation-due-high-energy-prices-survey-2024-08-01/ I've seen nothing to suggest energy prices in the EU are SO cheap that it's worth moving manufacturing TO Europe, and this is what annoys me the most. I've pointed this out before but they have an excellent report on the issues: https://commission.europa.eu/document/download/97e481fd-2dc3-412d-be4c-f152a8232961_en?filename=The+future+of+European+competitiveness+_+A+competitiveness+strategy+for+Europe.pdf Then they put out this Competitive Compass: https://commission.europa.eu/topics/eu-competitiveness/competitiveness-compass_en But tbh every week in the EU it seems like they are chasing after some other goal. This would be great, it would have been greater 10 years ago. Agreed
  • 272 Stimmen
    77 Beiträge
    27 Aufrufe
    S
    I don't believe the idea of aggregating information is bad, moreso the ability to properly vet your sources yourself. I don't know what sources an AI chatbot could be pulling from. It could be a lot of sources, or it could be one source. Does it know which sources are reliable? Not really. AI has been infamous for hallucinating even with simple prompts. Being able to independently check where your info comes from is an important part of stopping the spread of misinfo. AI can't do that, and, in it's current state, I wouldn't want it to try. Convenience is a rat race of cutting corners. What is convenient isn't always what is best in the long run.
  • Inside a Dark Adtech Empire Fed by Fake CAPTCHAs

    Technology technology
    1
    10 Stimmen
    1 Beiträge
    10 Aufrufe
    Niemand hat geantwortet
  • Adaptive Keyboards & Writing Technologies For One-Handed Users

    Technology technology
    5
    1
    112 Stimmen
    5 Beiträge
    31 Aufrufe
    T
    Came here to say this.
  • GeForce GTX 970 8GB mod is back for a full review

    Technology technology
    1
    34 Stimmen
    1 Beiträge
    12 Aufrufe
    Niemand hat geantwortet
  • *deleted by creator*

    Technology technology
    1
    1
    0 Stimmen
    1 Beiträge
    13 Aufrufe
    Niemand hat geantwortet