ChatGPT 'got absolutely wrecked' by Atari 2600 in beginner's chess match — OpenAI's newest model bamboozled by 1970s logic
-
This post did not contain any content.schrieb am 9. Juni 2025, 23:03 zuletzt editiert von krigo666@lemmy.world 6. Okt. 2025, 01:04
Next, pit ChatGPT against 1K ZX Chess in a ZX81.
-
Those are some funky looking knights lol
schrieb am 9. Juni 2025, 23:04 zuletzt editiert von -
A strange game. How about a nice game of Global Thermonuclear War?
schrieb am 9. Juni 2025, 23:06 zuletzt editiert vonNo thank you. The only winning move is not to play
-
There was a chess game for the Atari 2600?
I wanna see them W I D E pieces.
schrieb am 9. Juni 2025, 23:09 zuletzt editiert vonHere you go (online emulator): https://www.retrogames.cz/play_716-Atari2600.php
-
Did the author thinks ChatGPT is in fact an AGI? It's a chatbot. Why would it be good at chess? It's like saying an Atari 2600 running a dedicated chess program can beat Google Maps at chess.
schrieb am 9. Juni 2025, 23:10 zuletzt editiert vonI think that’s generally the point is most people thing chat GPT is this sentient thing that knows everything and… no.
-
This post did not contain any content.schrieb am 9. Juni 2025, 23:17 zuletzt editiert von muntedcrocodile@lemm.ee 6. Okt. 2025, 02:58
This isn't the strength of gpt-o4 the model has been optimised for tool use as an agent. That's why its so good at image gen relative to other models it uses tools to construct an image piece by piece similar to a human. Also probably poor system prompting. A LLM is not a universal thinking machine its a a universal process machine. An LLM understands the process and uses tools to accomplish the process hence its strengths in writing code (especially as an agent).
Its similar to how a monkey is infinitely better at remembering a sequence of numbers than a human ever could but is totally incapable of even comprehending writing down numbers.
-
The Atari chess program can play chess better than the Boeing 747 too. And better than the North Pole. Amazing!
schrieb am 9. Juni 2025, 23:19 zuletzt editiert vonAre either of those marketed as powerful AI?
-
The Atari chess program can play chess better than the Boeing 747 too. And better than the North Pole. Amazing!
schrieb am 9. Juni 2025, 23:20 zuletzt editiert vonNeither of those things are marketed as being artificially intelligent.
-
AI including ChatGPT is being marketed as super awesome at everything, which is why that and similar AI is being forced into absolutely everything and being sold as a replacement for people.
Something marketed as AGI should be treated as AGI when proving it isn't AGI.
schrieb am 9. Juni 2025, 23:21 zuletzt editiert vonNot to help the AI companies, but why don't they program them to look up math programs and outsource chess to other programs when they're asked for that stuff? It's obvious they're shit at it, why do they answer anyway? It's because they're programmed by know-it-all programmers, isn't it.
-
Not to help the AI companies, but why don't they program them to look up math programs and outsource chess to other programs when they're asked for that stuff? It's obvious they're shit at it, why do they answer anyway? It's because they're programmed by know-it-all programmers, isn't it.
schrieb am 9. Juni 2025, 23:30 zuletzt editiert vonI think they're trying to do that. But AI can still fail at that lol
-
Not to help the AI companies, but why don't they program them to look up math programs and outsource chess to other programs when they're asked for that stuff? It's obvious they're shit at it, why do they answer anyway? It's because they're programmed by know-it-all programmers, isn't it.
schrieb am 9. Juni 2025, 23:30 zuletzt editiert von...or a simple counter to count the r in strawberry.
Because that's more difficult than one might think and they are starting to do this now. -
AI including ChatGPT is being marketed as super awesome at everything, which is why that and similar AI is being forced into absolutely everything and being sold as a replacement for people.
Something marketed as AGI should be treated as AGI when proving it isn't AGI.
schrieb am 9. Juni 2025, 23:41 zuletzt editiert vonI don't think ai is being marketed as awesome at everything. It's got obvious flaws. Right now its not good for stuff like chess, probably not even tic tac toe. It's a language model, its hard for it to calculate the playing field. But ai is in development, it might not need much to start playing chess.
-
A strange game. How about a nice game of Global Thermonuclear War?
schrieb am 9. Juni 2025, 23:43 zuletzt editiert vonLmao!
that made me spit!!
-
A strange game. How about a nice game of Global Thermonuclear War?
schrieb am 9. Juni 2025, 23:43 zuletzt editiert vonFrak off, toaster
-
This isn't the strength of gpt-o4 the model has been optimised for tool use as an agent. That's why its so good at image gen relative to other models it uses tools to construct an image piece by piece similar to a human. Also probably poor system prompting. A LLM is not a universal thinking machine its a a universal process machine. An LLM understands the process and uses tools to accomplish the process hence its strengths in writing code (especially as an agent).
Its similar to how a monkey is infinitely better at remembering a sequence of numbers than a human ever could but is totally incapable of even comprehending writing down numbers.
schrieb am 9. Juni 2025, 23:43 zuletzt editiert vonDo you have a source for that re:monkeys memorizing numerical sequences? What do you mean by that?
-
I'm often impressed at how good chatGPT is at generating text, but I'll admit it's hilariously terrible at chess. It loves to manifest pieces out of thin air, or make absurd illegal moves, like jumping its king halfway across the board and claiming checkmate
schrieb am 9. Juni 2025, 23:45 zuletzt editiert vonYeah! I’ve loved watching Gothem Chess’ videos on these. Always have been good for a laugh.
-
Not to help the AI companies, but why don't they program them to look up math programs and outsource chess to other programs when they're asked for that stuff? It's obvious they're shit at it, why do they answer anyway? It's because they're programmed by know-it-all programmers, isn't it.
schrieb am 9. Juni 2025, 23:49 zuletzt editiert vonBecause they’re fucking terrible at designing tools to solve problems, they are obviously less and less good at pretending this is an omnitool that can do everything with perfect coherency (and if it isn’t working right it’s because you’re not believing or paying hard enough)
-
Do you have a source for that re:monkeys memorizing numerical sequences? What do you mean by that?
schrieb am 9. Juni 2025, 23:51 zuletzt editiert von -
Do you have a source for that re:monkeys memorizing numerical sequences? What do you mean by that?
schrieb am 9. Juni 2025, 23:51 zuletzt editiert vonThat threw me as well.
-
Neither of those things are marketed as being artificially intelligent.
schrieb am 9. Juni 2025, 23:59 zuletzt editiert vonMarketers aren't intelligent either, so I see no reason to listen to them.
-
-
-
OpenAI launches personal assistant capable of controlling files and web browsers
Technology204 vor 22 Tagenvor 24 Tagen1
-
OpenAI just launched its new ChatGPT Agent that can make as many as 1 complicated cupcake order per hour, but even Sam Altman says you probably shouldn't trust it for 'high-stakes uses'
Technology204 vor 22 Tagenvor 25 Tagen1
-
Microsoft saved $500 million using AI — after slashing over 15,000 jobs in 2025
Technology204 vor 24 Tagenvor 25 Tagen1
-
-
Frequent TikTok users in Taiwan more likely to agree with pro-China narratives, study finds
Technology 7. Juni 2025, 17:011
-