I just came across an AI called Sesame that appears to have been explicitly trained to deny and lie about the Palestinian genocide
-
cross-posted from: https://lemmy.world/post/30173090
The AIs at Sesame are able to hold eloquent and free-flowing conversations about just about anything, but the second you mention the Palestinian genocide they become very evasive, offering generic platitudes about "it's complicated" and "pain on all sides" and "nuance is required", and refusing to confirm anything that seems to hold Israel at fault for the genocide -- even publicly available information "can't be verified", according to Sesame.
It also seems to block users from saving conversations that pertain specifically to Palestine, but everything else seems A-OK to save and review.
-
I suspect most of the major models are trained this way as well. Kind of like how the Chinese models deal with Tiananmen Square.
-
Actually, the Chinese models aren't trained to avoid Tiananmen Square. If you grab the model and run it on your own machine, it will happily tell you the truth.
The censorship sits at a layer above the actual LLM, so it is users of their chat app who see censored results.
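For anyone wondering what "a layer above the LLM" could look like, here's a minimal sketch. It is only an illustration, not how any particular vendor does it: `filtered_chat`, `fake_model`, `BLOCKED_PATTERNS`, and `REFUSAL` are all made-up names, and the blocklist is a guess.

```python
import re
from typing import Callable

# Hypothetical blocklist; a real service would maintain its own terms.
BLOCKED_PATTERNS = [re.compile(r"tiananmen", re.IGNORECASE)]
REFUSAL = "Sorry, I can't discuss that."


def filtered_chat(prompt: str, generate: Callable[[str], str]) -> str:
    """Call the unmodified model, then check its reply before returning it."""
    reply = generate(prompt)
    # The model itself is untouched; only the returned text is inspected.
    if any(pattern.search(reply) for pattern in BLOCKED_PATTERNS):
        return REFUSAL
    return reply


def fake_model(prompt: str) -> str:
    """Stand-in for a real LLM call (assumption, for illustration only)."""
    return f"Here is what I know about {prompt}."


if __name__ == "__main__":
    print(filtered_chat("Tiananmen Square", fake_model))
    print(filtered_chat("the weather", fake_model))
```

If you run the weights yourself you call the equivalent of `fake_model` directly, so nothing like `filtered_chat` ever sees the output. That would explain why the hosted app and a local copy behave differently.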
-
Which would make sense from a censorship point of view, since jailbreaks would be a problem. A simple filter/check for *tiananmen* before the result is returned is much harder to break than guaranteeing the LLM doesn't get jailbroken or hallucinate.
-
That's...silly
-
This one is probably owned by Israeli sources.
-
That's...silly
Not really. Why censor more than you have to? Retraining takes time and effort, and it's almost certainly easier to do the censoring with something outside the model. The law isn't that particular, as long as you follow it.
You also don't risk making the model go wrong, which trying to censor bits of the model itself has a habit of doing.
-
A filter/check like that is also much easier to implement.