ChatGPT 'got absolutely wrecked' by Atari 2600 in beginner's chess match — OpenAI's newest model bamboozled by 1970s logic
-
Tbf, the article should probably mention the fact that machine learning programs designed to play chess blow everything else out of the water.
Yeah its like judging how great a fish is at climbing a tree. But it does show that it's not real intelligence or reasoning
-
This post did not contain any content.
ChatGPT has been, hands down, the worst AI coding assistant I've ever used.
It regularly suggests code that doesn't compile or isn't even for the language.
It generally suggests AC of code that is just a copy of the lines I just wrote.
Sometimes it likes to suggest setting the same property like 5 times.
It is absolute garbage and I do not recommend it to anyone.
-
no.
the answer is always, no.
The answer might be no today, but always seems like a stretch.
-
I don't understand why this is so important, marketing is all about exaggerating, why expect something different here.
It's not important. You said AI isn't being marketed to be able to do everything. I said yes it is. That's it.
-
ChatGPT has been, hands down, the worst AI coding assistant I've ever used.
It regularly suggests code that doesn't compile or isn't even for the language.
It generally suggests AC of code that is just a copy of the lines I just wrote.
Sometimes it likes to suggest setting the same property like 5 times.
It is absolute garbage and I do not recommend it to anyone.
I find it really hit and miss. Easy, standard operations are fine but if you have an issue with code you wrote and ask it to fix it, you can forget it
-
Did the author thinks ChatGPT is in fact an AGI? It's a chatbot. Why would it be good at chess? It's like saying an Atari 2600 running a dedicated chess program can beat Google Maps at chess.
Google Maps doesn't pretend to be good at chess. ChatGPT does.
-
Regurgitating an impression of, not regurgitating verbatim, that's the problem here.
Chess is 100% deterministic, so it falls flat.
I'm guessing it's not even hard to get it to "confidently" violate the rules.
-
I find it really hit and miss. Easy, standard operations are fine but if you have an issue with code you wrote and ask it to fix it, you can forget it
I've found Claude 3.7 and 4.0 and sometimes Gemini variants still leagues better than ChatGPT/Copilot.
Still not perfect, but night and day difference.
I feel like ChatGPT didn't focus on coding and instead focused on mainstream, but I am not an expert.
-
Yeah its like judging how great a fish is at climbing a tree. But it does show that it's not real intelligence or reasoning
Don't call my fish stupid.
-
Google Maps doesn't pretend to be good at chess. ChatGPT does.
A toddler can pretend to be good at chess but anybody with reasonable expectations knows that they are not.
-
ChatGPT has been, hands down, the worst AI coding assistant I've ever used.
It regularly suggests code that doesn't compile or isn't even for the language.
It generally suggests AC of code that is just a copy of the lines I just wrote.
Sometimes it likes to suggest setting the same property like 5 times.
It is absolute garbage and I do not recommend it to anyone.
I don't use it for coding. I use it sparingly really, but want to learn to use it more efficiently. Are there any areas in which you think it excels? Are there others that you'd recommend instead?
-
ChatGPT has been, hands down, the worst AI coding assistant I've ever used.
It regularly suggests code that doesn't compile or isn't even for the language.
It generally suggests AC of code that is just a copy of the lines I just wrote.
Sometimes it likes to suggest setting the same property like 5 times.
It is absolute garbage and I do not recommend it to anyone.
my favorite thing is to constantly be implementing libraries that don't exist
-
A toddler can pretend to be good at chess but anybody with reasonable expectations knows that they are not.
Plot twist: the toddler has a multi-year marketing push worth tens if not hundreds of millions, which convinced a lot of people who don't know the first thing about chess that it really is very impressive, and all those chess-types are just jealous.
-
Can ChatGPT actually play chess now? Last I checked, it couldn't remember more than 5 moves of history so it wouldn't be able to see the true board state and would make illegal moves, take it's own pieces, materialize pieces out of thin air, etc.
and still lose to stockfish even after conjuring 3 queens out of thin air lol
-
Did the author thinks ChatGPT is in fact an AGI? It's a chatbot. Why would it be good at chess? It's like saying an Atari 2600 running a dedicated chess program can beat Google Maps at chess.
well so much hype has been generated around chatgpt being close to AGI that now it makes sense to ask questions like "can chatgpt prove the Riemann hypothesis"
-
I don't use it for coding. I use it sparingly really, but want to learn to use it more efficiently. Are there any areas in which you think it excels? Are there others that you'd recommend instead?
Use Gemini (2.5) or Claude (3.7 and up). OpenAI is a shitshow
-
Most people do. It's just called AI in the media everywhere and marketing works. I think online folks forget that something as simple as getting a Lemmy account by yourself puts you into the top quintile of tech literacy.
Yet even on Lemmy people can't seem to make sense of these terms and are saying things like "LLM's are not AI"
-
Tbf, the article should probably mention the fact that machine learning programs designed to play chess blow everything else out of the water.
I forgot which airline it is but one of the onboard games in the back of a headrest TV was a game called “Beginners Chess” which was notoriously difficult to beat so it was tested against other chess engines and it ranked in like the top five most powerful chess engines ever
-
my favorite thing is to constantly be implementing libraries that don't exist
You're right. That library was removed in ToolName [PriorVersion]. Please try this instead.
*makes up entirely new fictitious library name*
-
I find it really hit and miss. Easy, standard operations are fine but if you have an issue with code you wrote and ask it to fix it, you can forget it
It's the ideal help for people who shouldn't be employed as programmers to start with.
I had to explain hexadecimal to somebody the other day. It's honestly depressing.