ChatGPT 'got absolutely wrecked' by Atari 2600 in beginner's chess match — OpenAI's newest model bamboozled by 1970s logic
-
Does the author think ChatGPT is in fact an AGI? It's a chatbot. Why would it be good at chess? It's like saying an Atari 2600 running a dedicated chess program can beat Google Maps at chess.
I agree with your general statement, but in theory, since all ChatGPT does is regurgitate information back and a lot of chess is memorization of historical games and types, it might actually perform well. No, it can't think, but it can remember everything, so at some point that might tip the results in its favor.
-
Not to help the AI companies, but why don't they program them to look up math programs and outsource chess to other programs when they're asked for that stuff? It's obvious they're shit at it, so why do they answer anyway? It's because they're programmed by know-it-all programmers, isn't it?
If you pay for ChatGPT you can connect it with Wolfram Alpha and it relays the maths to it.
-
Not to help the AI companies, but why don't they program them to look up math programs and outsource chess to other programs when they're asked for that stuff? It's obvious they're shit at it, so why do they answer anyway? It's because they're programmed by know-it-all programmers, isn't it?
why don't they program them
AI models aren't programmed traditionally. They're generated by machine learning. Essentially the model is given test prompts and then given a rating on its answer. The model's calculations will be adjusted so that its answer to the test prompt will be closer to the expected answer. You repeat this a few billion times with a few billion prompts and you will have generated a model that scores very high on all test prompts.
Then someone asks it how many R's are in strawberry and it gets the wrong answer. The only way to fix this is to add that as a test prompt and redo the machine learning process which takes an enormous amount of time and computational power each time it's done, only for people to once again quickly find some kind of prompt it doesn't answer well.
There are already AI models that play chess incredibly well. Using machine learning to solve a complex problem isn't the issue. It's trying to get one model to be good at absolutely everything.
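To make that concrete, here's a toy sketch of the rate-and-adjust loop described above. It's ordinary gradient descent on a one-parameter "model", nothing like how a production LLM is actually trained, but the shape of the loop (score the answer, nudge the weights, repeat) is the same:

```python
# Toy sketch of the "rate the answer, nudge the model" loop.
# A real LLM has billions of parameters and trains on text, but the idea
# is the same: score the output, adjust weights to reduce the error, repeat.

def train(examples, learning_rate=0.01, steps=10_000):
    weight = 0.0  # the "model": a single adjustable number
    for _ in range(steps):
        for prompt, expected in examples:
            answer = weight * prompt                   # model's current answer
            error = answer - expected                  # how far off it was (the "rating")
            weight -= learning_rate * error * prompt   # nudge toward the expected answer
    return weight

# The model ends up scoring well on the prompts it was rated on...
model = train([(1, 3), (2, 6), (4, 12)])
print(round(model * 3, 2))  # ~9.0, looks "smart"
# ...but nothing guarantees it handles prompts it was never scored on.
```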
-
I agree with your general statement, but in theory, since all ChatGPT does is regurgitate information back and a lot of chess is memorization of historical games and types, it might actually perform well. No, it can't think, but it can remember everything, so at some point that might tip the results in its favor.
It's regurgitating an impression of the information, not regurgitating it verbatim; that's the problem here.
Chess is 100% deterministic, so it falls flat.
-
You are both completely overestimating the intelligence level of "anyone" and not living in the same AI-marketed universe as the rest of us. People are stupid. Really stupid.
I don't understand why this is so important; marketing is all about exaggerating, so why expect something different here?
-
Okay, but could ChatGPT be used to vibe code a chess program that beats the Atari 2600?
no.
the answer is always, no.
-
Can ChatGPT actually play chess now? Last I checked, it couldn't remember more than 5 moves of history, so it wouldn't be able to see the true board state and would make illegal moves, take its own pieces, materialize pieces out of thin air, etc.
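For what it's worth, the usual workaround is to keep the real board state outside the model and validate every move it proposes. A minimal sketch, assuming the python-chess library (pip install chess); the "proposed_moves" list stands in for whatever the chatbot suggests:

```python
# Keep the authoritative board state here, not in the chatbot,
# and reject any move that is illegal in the current position.
import chess

board = chess.Board()
proposed_moves = ["e4", "e5", "Qxf7"]  # pretend these came from the chatbot

for san in proposed_moves:
    try:
        board.push_san(san)  # applies the move only if it is legal
        print(f"accepted {san}")
    except ValueError:
        print(f"rejected {san}: illegal in this position")
        break
```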
-
Can ChatGPT actually play chess now? Last I checked, it couldn't remember more than 5 moves of history, so it wouldn't be able to see the true board state and would make illegal moves, take its own pieces, materialize pieces out of thin air, etc.
ChatGPT must adhere honorably to the rules that it's making up on the spot. That's Dallas.
-
Not to help the AI companies, but why don't they program them to look up math programs and outsource chess to other programs when they're asked for that stuff? It's obvious they're shit at it, so why do they answer anyway? It's because they're programmed by know-it-all programmers, isn't it?
They are starting to do this. Most new models support function calling and can generate code to come up with math answers, etc.
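Roughly what that looks like with the OpenAI Python SDK's chat-completions tools interface (a sketch, not a full implementation); the best_chess_move tool here is hypothetical, something the application would implement with a real engine or math backend rather than the model itself:

```python
# Sketch of function calling: the model doesn't answer directly, it asks
# the application to run a tool. Requires OPENAI_API_KEY in the environment.
import json
from openai import OpenAI

client = OpenAI()
tools = [{
    "type": "function",
    "function": {
        "name": "best_chess_move",  # hypothetical tool implemented by the app
        "description": "Return a strong move for the given position",
        "parameters": {
            "type": "object",
            "properties": {"fen": {"type": "string", "description": "Position in FEN"}},
            "required": ["fen"],
        },
    },
}]

resp = client.chat.completions.create(
    model="gpt-4o",
    messages=[{"role": "user", "content": "What should White play here? FEN: ..."}],
    tools=tools,
)

msg = resp.choices[0].message
if msg.tool_calls:
    call = msg.tool_calls[0]
    args = json.loads(call.function.arguments)
    print("model wants to call", call.function.name, "with", args)
    # the app would now run a real engine and feed the result back as a tool message
else:
    print(msg.content)  # model answered directly instead of delegating
```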
-
I think that's generally the point: most people think ChatGPT is this sentient thing that knows everything and… no.
Do they, though? No one I talked to, not my coworkers who use it for work, not my friends, not my 72-year-old mother, thinks it's sentient.
-
Not to help the AI companies, but why don't they program them to look up math programs and outsource chess to other programs when they're asked for that stuff? It's obvious they're shit at it, why do they answer anyway? It's because they're programmed by know-it-all programmers, isn't it.
From a technology standpoint, nothing is stopping them. From a business standpoint: hubris.
To put time and effort into creating traditional logic-based algorithms to compensate for this generic math model would be to admit what mathematicians and scientists have known for centuries: that models are good at finding patterns, but they do not explain why a relationship exists (if it exists at all). The technology is fundamentally flawed for the use cases that OpenAI is trying to claim it can be used in, and programming around it would be to acknowledge that.
-
Tbf, the article should probably mention the fact that machine learning programs designed to play chess blow everything else out of the water.
Yeah, it's like judging how great a fish is at climbing a tree. But it does show that it's not real intelligence or reasoning.
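For comparison, driving one of those dedicated programs takes only a few lines. A minimal sketch using the python-chess library to talk to a UCI engine; it assumes a Stockfish binary is installed and on your PATH:

```python
# Ask a dedicated UCI chess engine (e.g. Stockfish) for a move.
import chess
import chess.engine

board = chess.Board()
engine = chess.engine.SimpleEngine.popen_uci("stockfish")  # path is an assumption
result = engine.play(board, chess.engine.Limit(time=0.1))  # 100 ms of search
print("engine plays", board.san(result.move))
engine.quit()
```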
-
ChatGPT has been, hands down, the worst AI coding assistant I've ever used.
It regularly suggests code that doesn't compile or isn't even in the right language.
It generally suggests a chunk of code that is just a copy of the lines I just wrote.
Sometimes it likes to suggest setting the same property like 5 times.
It is absolute garbage and I do not recommend it to anyone.
-
no.
the answer is always, no.
The answer might be no today, but "always" seems like a stretch.
-
I don't understand why this is so important; marketing is all about exaggerating, so why expect something different here?
It's not important. You said AI isn't being marketed to be able to do everything. I said yes it is. That's it.
-
ChatGPT has been, hands down, the worst AI coding assistant I've ever used.
It regularly suggests code that doesn't compile or isn't even in the right language.
It generally suggests a chunk of code that is just a copy of the lines I just wrote.
Sometimes it likes to suggest setting the same property like 5 times.
It is absolute garbage and I do not recommend it to anyone.
I find it really hit and miss. Easy, standard operations are fine, but if you have an issue with code you wrote and ask it to fix it, you can forget it.
-
Does the author think ChatGPT is in fact an AGI? It's a chatbot. Why would it be good at chess? It's like saying an Atari 2600 running a dedicated chess program can beat Google Maps at chess.
Google Maps doesn't pretend to be good at chess. ChatGPT does.
-
It's regurgitating an impression of the information, not regurgitating it verbatim; that's the problem here.
Chess is 100% deterministic, so it falls flat.
I'm guessing it's not even hard to get it to "confidently" violate the rules.
-
I find it really hit and miss. Easy, standard operations are fine, but if you have an issue with code you wrote and ask it to fix it, you can forget it.
I've found Claude 3.7 and 4.0 and sometimes Gemini variants still leagues better than ChatGPT/Copilot.
Still not perfect, but night and day difference.
I feel like ChatGPT didn't focus on coding and instead focused on the mainstream, but I am not an expert.
-
Yeah, it's like judging how great a fish is at climbing a tree. But it does show that it's not real intelligence or reasoning.
Don't call my fish stupid.