Skip to content

Grok 4 has been so badly neutered that it's now programmed to see what Elon says about the topic at hand and blindly parrot that line.

Technology
67 55 0
  • That's more like it, thank you!

  • I think there is a good chance this behavior is unintended!

    Lmao, sure...

    I can believe it insofar as they might not have explicitly programmed it to do that. I'd imagine they put in something like "Make sure your output aligns with Elon Musk's opinions.", "Elon Musk is always objectively correct.", etc. From there, this would be emergent, but quite predictable behavior.

  • This is my take. Elon just showed the world what we all knew. The tool is not trustworthy. All other AI suppliers are busy trying to work on credibility that grok just butchered.

    They deliberately injected prompts on top of the users prompt.

    Saying that’s a problem of AI is akin to say me deliberately painting my car badly and saying it’s a problem of all car manufacturers.

    And this frankly shows how little you know about the subject, because we went through this years ago with prompts trying to force corpo-lib “diversity” and leading to hilarious results.

    If anything you should be concerned about the non prompt stuff, the underlying training data that it pulls from and of which I doubt Grok has even changed since release.

  • This post did not contain any content.

    they should just put it down and out of it's misery

  • This post did not contain any content.

    Honestly, who was surprised by this news?

    I feel like everyone could see Grok as some sort of 24/7 tool to push a particular viewpoint, even more so when it says things that are leftist and Elon is compelled to "upgrade" the system as he's tweeted.

  • This post did not contain any content.

    I'm surprised it isn't just Elon typing really fast at this point.

  • I can believe it insofar as they might not have explicitly programmed it to do that. I'd imagine they put in something like "Make sure your output aligns with Elon Musk's opinions.", "Elon Musk is always objectively correct.", etc. From there, this would be emergent, but quite predictable behavior.

    Yeah the transparency of it might be unintended.

  • I think there is a good chance this behavior is unintended!

    Lmao, sure...

    If the system prompt doesn’t tell it to search for Elon’s views, why is it doing that?

    My best guess is that Grok “knows” that it is “Grok 4 buit by xAI”, and it knows that Elon Musk owns xAI, so in circumstances where it’s asked for an opinion the reasoning process often decides to see what Elon thinks.

    Yeah, this blogger shows a fundamental misunderstanding of how LLMs work or how system prompts work. LLM behavior is not directly controlled by the system prompt the way this person imagines. For example, censorship that is present in the training set will be "baked in" to the model and the system prompt will not affect it, no matter how the LLM is told not to be censored in that way.

    My best guess is that the LLM is interfacing with a tool in order to search through tweets, and the training set that demonstrates how to use the tool contains example searches for Elon Musk's tweets.

  • they should just put it down and out of it's misery

    It used to be so based

  • I'm surprised it isn't just Elon typing really fast at this point.

    Probably couldn't type fast if he tried. Would probably pay someone to do it for him just like he did with Path if Exile.

  • Probably couldn't type fast if he tried. Would probably pay someone to do it for him just like he did with Path if Exile.

    And like he does with inseminating women.

  • If the system prompt doesn’t tell it to search for Elon’s views, why is it doing that?

    My best guess is that Grok “knows” that it is “Grok 4 buit by xAI”, and it knows that Elon Musk owns xAI, so in circumstances where it’s asked for an opinion the reasoning process often decides to see what Elon thinks.

    Yeah, this blogger shows a fundamental misunderstanding of how LLMs work or how system prompts work. LLM behavior is not directly controlled by the system prompt the way this person imagines. For example, censorship that is present in the training set will be "baked in" to the model and the system prompt will not affect it, no matter how the LLM is told not to be censored in that way.

    My best guess is that the LLM is interfacing with a tool in order to search through tweets, and the training set that demonstrates how to use the tool contains example searches for Elon Musk's tweets.

    “This blogger” is Simon Willison, who has been doing LLM benchmarks and other LLM-related things since before it was cool

    Not a random substack grifter

  • They deliberately injected prompts on top of the users prompt.

    Saying that’s a problem of AI is akin to say me deliberately painting my car badly and saying it’s a problem of all car manufacturers.

    And this frankly shows how little you know about the subject, because we went through this years ago with prompts trying to force corpo-lib “diversity” and leading to hilarious results.

    If anything you should be concerned about the non prompt stuff, the underlying training data that it pulls from and of which I doubt Grok has even changed since release.

    You are correct. But the right tool in the wrong hands is still non credible in the eyes of perception.

  • Grok's journey has been very strange. He became a progressive, then threw out data that contradicted the MAGA people who questioned him, and finally became a Hitler fan.

    Now he's the reflection of a fan who blindly follows Trump, but in this case, he's an AI. His journey so far has been curious.

    So Grok is a 4chan incel?

    His only chance of salvation is finding a girl who inexplicably fancies it?

  • “This blogger” is Simon Willison, who has been doing LLM benchmarks and other LLM-related things since before it was cool

    Not a random substack grifter

    Is my comment wrong though? Another possibility is that Grok is given an example of searching for Elon Musk's tweets when it is presented with the available tool calls. Just because it outputs the system prompt when asked does not mean that we are seeing the full context, or even the real system prompt.

    Posting blog guides on how to code with ChatGPT is not expertise on LLMs. It's like thinking someone is an expert mechanic because they can drive a car well.

  • This post did not contain any content.

    Robert A. Heinlein is turning in his grave like a fucking dynamo these days.

  • Is my comment wrong though? Another possibility is that Grok is given an example of searching for Elon Musk's tweets when it is presented with the available tool calls. Just because it outputs the system prompt when asked does not mean that we are seeing the full context, or even the real system prompt.

    Posting blog guides on how to code with ChatGPT is not expertise on LLMs. It's like thinking someone is an expert mechanic because they can drive a car well.

    Willison has never claimed to be an expert in the field of machine learning, but you should give more credence to his opinions. Perhaps u/lepinkainen@lemmy.world's warning wasn't informative enough to be heeded: Willison is a prominent figure in the web-development scene, particularly aspects of the scene that have evolved into important facets of the modern machine learning community.

    The guy is quite experienced with Python and took an early step into the contemporary ML/AI space due to both him having a lot of very relevant skills and a likely personal interest in the field. Python is the lingua franca of my field of study, for better or worse, and someone like Willison was well-placed to break into ML/AI from the outside. That's a common route in this field, there aren't exactly an abundance of MBAs with majors in machine learning or applied artificial intelligence research, specifically (yet). Willison is one of the authors of Django, for fucks sake. Idk what he's doing rn but it would be ignorant to draw the comparison you just did in the context of Willison particularly. [EDIT: Lmfao just went to see "what is Simon doing rn" (don't really keep up with him in particular), & you're talking out of your ass. He literally has multiple tools for the machine learning stack that he develops and that are available to see on his github. See one such here. This guy is so far away from someone who just "posts random blog guides on how to code with ChatGPT" that it's egregious you'd even claim that. It's so disingenuous as to ere into dishonesty; like, that is a patent lie. Smh.]

    As for your analysis of his article, I find it kind of ironic you accuse him of having a "fundamental misunderstanding of how LLMs work or how system prompts work [sic]" when you then proceed to cherry-pick certain lines from his article taken entirely out of context. First, the article is clearly geared towards a more general audience and avoids technical language or explanation. Second, he doesn't say anything that is fundamentally wrong. Honestly, you seem to have a far more ignorant idea of LLMs and this field generally than Willison. You do say some things that are wrong, such as:

    For example, censorship that is present in the training set will be “baked in” to the model and the system prompt will not affect it, no matter how the LLM is told not to be censored in that way.

    This isn't necessarily true. It is true that information not included within the training set, or information that has been statistically biased within the training set, isn't going to be retrievable or reversible using system prompts. Willison never claims or implies this in his article, you just kind of stuff those words in his mouth. Either way, my point is that you are using wishy-washy, ambiguous, catch-all terms such as "censorship" that make your writings here not technically correct, either. What is censorship, in an informatics context? What does that mean? How can it be applied to sets of data? That's not a concretely defined term if you're wanting to take the discourse to the level that it seems you are, like it or not. Generally you seem to have something of a misunderstanding regarding this topic, but I'm not going to accuse you of that, lest I commit the same fallacy I'm sitting here trying to chastise you for. It's possible you do know what you're talking about and just dumbed it down for Lemmy. It's impossible for me to know as an audience.

    That all wouldn't really matter if you didn't just jump as Willison's credibility over your perception of him doing that exact same thing, though.

  • This post did not contain any content.

    Mecha-Hitler is just Mecha-Elon

  • And like he does with inseminating women.

    Ketamine took its toll

  • Willison has never claimed to be an expert in the field of machine learning, but you should give more credence to his opinions. Perhaps u/lepinkainen@lemmy.world's warning wasn't informative enough to be heeded: Willison is a prominent figure in the web-development scene, particularly aspects of the scene that have evolved into important facets of the modern machine learning community.

    The guy is quite experienced with Python and took an early step into the contemporary ML/AI space due to both him having a lot of very relevant skills and a likely personal interest in the field. Python is the lingua franca of my field of study, for better or worse, and someone like Willison was well-placed to break into ML/AI from the outside. That's a common route in this field, there aren't exactly an abundance of MBAs with majors in machine learning or applied artificial intelligence research, specifically (yet). Willison is one of the authors of Django, for fucks sake. Idk what he's doing rn but it would be ignorant to draw the comparison you just did in the context of Willison particularly. [EDIT: Lmfao just went to see "what is Simon doing rn" (don't really keep up with him in particular), & you're talking out of your ass. He literally has multiple tools for the machine learning stack that he develops and that are available to see on his github. See one such here. This guy is so far away from someone who just "posts random blog guides on how to code with ChatGPT" that it's egregious you'd even claim that. It's so disingenuous as to ere into dishonesty; like, that is a patent lie. Smh.]

    As for your analysis of his article, I find it kind of ironic you accuse him of having a "fundamental misunderstanding of how LLMs work or how system prompts work [sic]" when you then proceed to cherry-pick certain lines from his article taken entirely out of context. First, the article is clearly geared towards a more general audience and avoids technical language or explanation. Second, he doesn't say anything that is fundamentally wrong. Honestly, you seem to have a far more ignorant idea of LLMs and this field generally than Willison. You do say some things that are wrong, such as:

    For example, censorship that is present in the training set will be “baked in” to the model and the system prompt will not affect it, no matter how the LLM is told not to be censored in that way.

    This isn't necessarily true. It is true that information not included within the training set, or information that has been statistically biased within the training set, isn't going to be retrievable or reversible using system prompts. Willison never claims or implies this in his article, you just kind of stuff those words in his mouth. Either way, my point is that you are using wishy-washy, ambiguous, catch-all terms such as "censorship" that make your writings here not technically correct, either. What is censorship, in an informatics context? What does that mean? How can it be applied to sets of data? That's not a concretely defined term if you're wanting to take the discourse to the level that it seems you are, like it or not. Generally you seem to have something of a misunderstanding regarding this topic, but I'm not going to accuse you of that, lest I commit the same fallacy I'm sitting here trying to chastise you for. It's possible you do know what you're talking about and just dumbed it down for Lemmy. It's impossible for me to know as an audience.

    That all wouldn't really matter if you didn't just jump as Willison's credibility over your perception of him doing that exact same thing, though.

    Willison has never claimed to be an expert in the field of machine learning, but you should give more credence to his opinions.

    Yeah, I would if he didn't demonstrate such blatant misconceptions.

    Willison is a prominent figure in the web-development scene

    🤦 "They know how to sail a boat so they know how a car engine works"

    Willison never claims or implies this in his article, you just kind of stuff those words in his mouth.

    Reading comprehension. I never implied that he says anything about censorship. It is a correct and valid example that shows how his understanding is wrong about how system prompts work. "Define censorship" is not the argument you think it is lol. Okay though, I'll define the "censorship" I'm talking about as refusal behavior that is introduced during RLHF and DPO alignment, and no the system prompt will not change this behavior.

    EDIT: saw your edit about him publishing tools that make using an LLM easier. Yeahhhh lol writing python libraries to interface with LLM APIs is not LLM expertise, that's still just using LLMs but programatically. See analogy about being a mechanic vs a good driver.

  • How could AI escape human control?

    Technology technology
    5
    6 Stimmen
    5 Beiträge
    29 Aufrufe
    Z
    Don't mix up country bosses with technology bosses - even if they have the same brain damages.
  • 238 Stimmen
    54 Beiträge
    39 Aufrufe
    P
    I was so confused when I saw your comment until I reread my own. It really is top notch technology I guess!
  • 179 Stimmen
    1 Beiträge
    12 Aufrufe
    Niemand hat geantwortet
  • 371 Stimmen
    26 Beiträge
    102 Aufrufe
    hollownaught@lemmy.worldH
    Bit misleading. Tumour-associated antigens can very easily be detected very early. Problem is, these are only associated with cancer, and provide a very high rate of false positives They're better used as a stepping stone for further testing, or just seeing how advanced a cancer is That is to say, I'm assuming that's what this is about, as i didnt rwad the article. It's the first thing I thought of when I heard "cancer in bloodstream", as the other options tend to be a bit more bleak Edit: they're talking about cancer "shedding genetic material", which I hate how general they're being. Probably talking about proto oncogenes from dead tumour debris, but seems different to what I was expecting
  • 308 Stimmen
    23 Beiträge
    100 Aufrufe
    G
    I spent way too long researching the morning. That industry implies a much greater population that is attracted to children. Things get more nuanced. People are attracted to different stages, like prebubesant, early adolescence, and mid to late adolescence. It seems like an important distinction because this is a common mental disorder. I was ready to write this comment about my fear that there's a bunch of evil pedophiles living among us who are simply deterred by legal or social pressures. It seems more like the extreme stigma of pedophilia has prevented individuals from seeking assistance and has resulted in more child sexual abuse. This sort of disorder can be caused by experiencing this abuse at a younger age. When I was religious, we worked closely with an organization to help victims of trafficking. We had their stories. They entered our lives. I took care of some of these kids. As a victim of sexual abuse when I was kid, I had a hatred for these kinds of people. I feel like my brain is melting seeing how there is a high chance of people in my life being attracted to children. This isn't really to justify the industry. I'm just realizing that general harassing people openly about it might not be helping the situation.
  • Is the ‘tech bro-ification’ of abortion here?

    Technology technology
    15
    1
    69 Stimmen
    15 Beiträge
    63 Aufrufe
    T
    Nah. Been working in tech for nearly 30 years, "tech bro" is a delineation. Keeps the fuckers from smearing the rest of us
  • Tribo777: Promoções e Recompensas Que Valem a Pena

    Technology technology
    1
    1
    1 Stimmen
    1 Beiträge
    11 Aufrufe
    Niemand hat geantwortet
  • *deleted by creator*

    Technology technology
    1
    1
    0 Stimmen
    1 Beiträge
    12 Aufrufe
    Niemand hat geantwortet