linux-nerds.org

Your browser does not seem to support JavaScript. As a result, your viewing experience will be diminished, and you have been placed in read-only mode.

Please download a browser that supports JavaScript, or enable it if it's disabled (i.e. NoScript).

ChatGPT 'got absolutely wrecked' by Atari 2600 in beginner's chess match — OpenAI's newest model bamboozled by 1970s logic

Technology

204 Beiträge 136 Kommentatoren 5.9k Aufrufe

L lifecoach5000@lemmy.world

This post did not contain any content.
F This user is from outside of this forum
F This user is from outside of this forum
finitebanjo@lemmy.world

schrieb am zuletzt editiert von

#144

All these comments asking "why don't they just have chatgpt go and look up the correct answer".

That's not how it works, you buffoons, it trains off of datasets long before it releases. It doesn't think. It doesn't learn after release, it won't remember things you try to teach it.

Really lowering my faith in humanity when even the AI skeptics don't understand that it generates statistical representations of an answer based on answers given in the past.
1 Antwort Letzte Antwort

13
N nutsack@lemmy.dbzer0.com

my favorite thing is to constantly be implementing libraries that don't exist
J This user is from outside of this forum
J This user is from outside of this forum
jj4211@lemmy.world

schrieb am zuletzt editiert von

#145

Oh man, I feel this. A couple of times I've had to field questions about some REST API I support and they ask why they get errors when they supply a specific attribute. Now that attribute never existed, not in our code, not in our documentation, we never thought of it. So I say "Well, that attribute is invalid, I'm not sure where you saw to do that". They get insistent that the code is generated by a very good LLM, so we must be missing something...
1 Antwort Letzte Antwort

1
H halosheep@lemm.ee

I swear every single article critical of current LLMs is like, "The square got BLASTED by the triangle shape when it completely FAILED to go through the triangle shaped hole."
I This user is from outside of this forum
I This user is from outside of this forum
ipkpjersi@lemmy.ml

schrieb am zuletzt editiert von

#146

That's just clickbait in general these days lol
1 Antwort Letzte Antwort

1
P pelespirit@sh.itjust.works

Not to help the AI companies, but why don't they program them to look up math programs and outsource chess to other programs when they're asked for that stuff? It's obvious they're shit at it, why do they answer anyway? It's because they're programmed by know-it-all programmers, isn't it.
C This user is from outside of this forum
C This user is from outside of this forum
cilethesane@lemmy.ca

schrieb am zuletzt editiert von

#147

why don't they program them to look up math programs and outsource chess to other programs when they're asked for that stuff?

Because the AI doesn't know what it's being asked, it's just a algorithm guessing what the next word in a reply is. It has no understanding of what the words mean.

"Why doesn't the man in the Chinese room just use a calculator for math questions?"
1 Antwort Letzte Antwort

2
F fmt99@lemmy.world

Did the author thinks ChatGPT is in fact an AGI? It's a chatbot. Why would it be good at chess? It's like saying an Atari 2600 running a dedicated chess program can beat Google Maps at chess.
E This user is from outside of this forum
E This user is from outside of this forum
empricorn@feddit.nl

schrieb am zuletzt editiert von

#148

You're not wrong, but keep in mind ChatGPT advocates, including the company itself are referring to it as AI, including in marketing. They're saying it's a complete, self-learning, constantly-evolving Artificial Intelligence that has been improving itself since release... And it loses to a 4KB video game program from 1979 that can only "think" 2 moves ahead.
F 1 Antwort Letzte Antwort

3
J jj4211@lemmy.world

To be fair, a decent chunk of coding is stupid boilerplate/minutia that varies environment to environment, language to language, library to library.

So LLM can do some code completion, filling out a bunch of boilerplate that is blatantly obvious, generating the redundant text mandated by certain patterns, and keeping straight details between languages like "does this language want join as a method on a list with a string argument, or vice versa?"

Problem is this can be sometimes more annoying than it's worth, as miscompletions are annoying.
P This user is from outside of this forum
P This user is from outside of this forum
pushbutton@lemmy.world

schrieb am zuletzt editiert von

#149

Fair point.

I liked the "upgraded autocompletion", you know, an completion based on the context, just before the time that they pushed it too much with 20 lines of non sense...

Now I am thinking of a way of doing the thing, then I receive a 20 lines suggestion.

So I am checking if that make sense, losing my momentum, only to realize the suggestion us calling shit that don't exist...

Screw that.
M 1 Antwort Letzte Antwort

2
L lifecoach5000@lemmy.world

This post did not contain any content.
N This user is from outside of this forum
N This user is from outside of this forum
neilbru@lemmy.world

schrieb am zuletzt editiert von neilbru@lemmy.world

#150

An LLM is a poor computational/predictive paradigm for playing chess.
T S S B 4 Antworten Letzte Antwort

76
F fmt99@lemmy.world

The problem is though that this perpetuates the idea that ChatGPT is actually an AI.
A This user is from outside of this forum
A This user is from outside of this forum
adhdplantdev@lemm.ee

schrieb am zuletzt editiert von

#151

People already think chatGPT is a general AI. We need more articles like this showing is ineffectiveness at being intelligent. Besides it helps find a limitations of this technology so that we can hopefully use it to argue against every single place
1 Antwort Letzte Antwort

0
P pushbutton@lemmy.world

You get 2 triangles in a single square mate...

CHECKMATE!
A This user is from outside of this forum
A This user is from outside of this forum
acid_burn@lemmy.dbzer0.com

schrieb am zuletzt editiert von

#152

Touchdown! 3 points!
1 Antwort Letzte Antwort

1
N nova_ad_vitum@lemmy.ca

Gotham chess has a video of making chatgpt play chess against stockfish. Spoiler: chatgpt does not do well. It plays okay for a few moves but then the moment it gets in trouble it straight up cheats. Telling it to follow the rules of chess doesn't help.

This sort of gets to the heart of LLM-based "AI". That one example to me really shows that there's no actual reasoning happening inside. It's producing answers that statistically look like answers that might be given based on that input.

For some things it even works. But calling this intelligence is dubious at best.
N This user is from outside of this forum
N This user is from outside of this forum
noodle07@lemmy.world

schrieb am zuletzt editiert von

#153

Hallucinating 100% of the time
1 Antwort Letzte Antwort

2
N neilbru@lemmy.world

An LLM is a poor computational/predictive paradigm for playing chess.
T This user is from outside of this forum
T This user is from outside of this forum
takapapatapaka@lemmy.world

schrieb am zuletzt editiert von

#154

Actually, a very specific model (chatgpt3.5-turbo-instruct) was pretty good at chess (around 1700 elo if i remember correctly).
N 1 Antwort Letzte Antwort

11
P pushbutton@lemmy.world

Fair point.

I liked the "upgraded autocompletion", you know, an completion based on the context, just before the time that they pushed it too much with 20 lines of non sense...

Now I am thinking of a way of doing the thing, then I receive a 20 lines suggestion.

So I am checking if that make sense, losing my momentum, only to realize the suggestion us calling shit that don't exist...

Screw that.
M This user is from outside of this forum
M This user is from outside of this forum
merdaverse@lemm.ee

schrieb am zuletzt editiert von

#155

The amount of garbage it spits out in autocomplete is distracting. If it's constantly making me 5-10% less productive the many times it's wrong, it should save me a lot of time when it is right, and generally, I haven't found it able to do that.

Yesterday I tried to prompt it to change around 20 call sites for a function where I had changed the signature. Easy, boring and repetitive, something that a junior could easily do. And all the models were absolutely clueless about it (using copilot)
1 Antwort Letzte Antwort

1
F fmt99@lemmy.world

Did the author thinks ChatGPT is in fact an AGI? It's a chatbot. Why would it be good at chess? It's like saying an Atari 2600 running a dedicated chess program can beat Google Maps at chess.
M This user is from outside of this forum
M This user is from outside of this forum
merdaverse@lemm.ee

schrieb am zuletzt editiert von merdaverse@lemm.ee

#156

OpenAI has been talking about AGI for years, implying that they are getting closer to it with their products.

(openai.com)

(openai.com)

Not to even mention all the hype created by the techbros around it.
F 1 Antwort Letzte Antwort

2
A arc99@lemmy.world

All AIs are the same. They're just scraping content from GitHub, stackoverflow etc with a bunch of guardrails slapped on to spew out sentences that conform to their training data but there is no intelligence. They're super handy for basic code snippets but anyone using them anything remotely complex or nuanced will regret it.
A This user is from outside of this forum
A This user is from outside of this forum
alecsadler@sh.itjust.works

schrieb am zuletzt editiert von

#157

I've used agents for implementing entire APIs and front-ends from the ground up with my own customizations and nuances.

I will say that, for my pedantic needs, it typically only gets about 80-90% of the way there so I still have to put fingers to code, but it definitely saves a boat load of time in those instances.
1 Antwort Letzte Antwort

0
M merdaverse@lemm.ee

OpenAI has been talking about AGI for years, implying that they are getting closer to it with their products.

(openai.com)

(openai.com)

Not to even mention all the hype created by the techbros around it.
F This user is from outside of this forum
F This user is from outside of this forum
fmt99@lemmy.world

schrieb am zuletzt editiert von

#158

Hey I didn't say anywhere that corporations don't lie to promote their product did I?
1 Antwort Letzte Antwort

0
A arc99@lemmy.world

All AIs are the same. They're just scraping content from GitHub, stackoverflow etc with a bunch of guardrails slapped on to spew out sentences that conform to their training data but there is no intelligence. They're super handy for basic code snippets but anyone using them anything remotely complex or nuanced will regret it.
N This user is from outside of this forum
N This user is from outside of this forum
natenate60@lemmy.world

schrieb am zuletzt editiert von

#159
One of my mates generated an entire website using Gemini. It was a React web app that tracks inventory for trading card dealers. It actually did come out functional and well-polished. That being said, the AI really struggled with several aspects of the project that humans would not:
- It left database secrets in the code
- The design of the website meant that it was impossible to operate securely
- The quality of the code itself was hot garbage—unreadable and undocumented nonsense that somehow still worked
- It did not break the code into multiple files. It piled everything into a single file
1 Antwort Letzte Antwort

0
E empricorn@feddit.nl

You're not wrong, but keep in mind ChatGPT advocates, including the company itself are referring to it as AI, including in marketing. They're saying it's a complete, self-learning, constantly-evolving Artificial Intelligence that has been improving itself since release... And it loses to a 4KB video game program from 1979 that can only "think" 2 moves ahead.
F This user is from outside of this forum
F This user is from outside of this forum
fmt99@lemmy.world

schrieb am zuletzt editiert von

#160

That's totally fair, the company is obviously lying, excuse me "marketing", to promote their product, that's absolutely true.
1 Antwort Letzte Antwort

1
P pamasich@kbin.earth

There are custom GPTs which claim to play at a stockfish level or be literally stockfish under the hood (I assume the former is still the latter just not explicitly). Haven't tested them, but if they work, I'd say yes. An LLM itself will never be able to play chess or do anything similar, unless they outsource that task to another tool that can. And there seem to be GPTs that do exactly that.

As for why we need ChatGPT then when the result comes from Stockfish anyway, it's for the natural language prompts and responses.
N This user is from outside of this forum
N This user is from outside of this forum
natenate60@lemmy.world

schrieb am zuletzt editiert von

#161

It's not an LLM, but Stockfish does use AI under the hood and has been since 2020. Stockfish uses a classical alpha-beta search strategy (if I recall correctly) combined with a neural network for smarter pruning.

There are some engines of comparable strength that are primarily neural-network based. lc0 comes to mind. lc0 placed 2nd in the Top Chess Engine Championships in 9 out of the past 10 seasons. By comparison, Stockfish is currently on a 10-season win streak in the TCEC.
1 Antwort Letzte Antwort

0
N nova_ad_vitum@lemmy.ca

Gotham chess has a video of making chatgpt play chess against stockfish. Spoiler: chatgpt does not do well. It plays okay for a few moves but then the moment it gets in trouble it straight up cheats. Telling it to follow the rules of chess doesn't help.

This sort of gets to the heart of LLM-based "AI". That one example to me really shows that there's no actual reasoning happening inside. It's producing answers that statistically look like answers that might be given based on that input.

For some things it even works. But calling this intelligence is dubious at best.
U This user is from outside of this forum
U This user is from outside of this forum
ultraviolet@lemmy.world

schrieb am zuletzt editiert von ultraviolet@lemmy.world

#162

Because it doesn't have any understanding of the rules of chess or even an internal model of the game state, it just has the text of chess games in its training data and can reproduce the notation, but nothing to prevent it from making illegal moves, trying to move or capture pieces that don't exist, incorrectly declaring check/checkmate, or any number of nonsensical things.
1 Antwort Letzte Antwort

5
N nova_ad_vitum@lemmy.ca

Gotham chess has a video of making chatgpt play chess against stockfish. Spoiler: chatgpt does not do well. It plays okay for a few moves but then the moment it gets in trouble it straight up cheats. Telling it to follow the rules of chess doesn't help.

This sort of gets to the heart of LLM-based "AI". That one example to me really shows that there's no actual reasoning happening inside. It's producing answers that statistically look like answers that might be given based on that input.

For some things it even works. But calling this intelligence is dubious at best.
I This user is from outside of this forum
I This user is from outside of this forum
interdimensionalmeme@lemmy.ml

schrieb am zuletzt editiert von

#163

I think the biggest problem is it's very low ability to "test time adaptability". Even when combined with a reasonning model outputting into its context, the weights do not learn out of the immediate context.

I think the solution might be to train a LoRa overlay on the fly against the weights and run inference with that AND the unmodified weights and then have an overseer model self evaluate and recompose the raw outputs.

Like humans are way better at answering stuff when it's a collaboration of more than one person. I suspect the same is true of LLMs.
N 1 Antwort Letzte Antwort

0

Anmelden zum Antworten

M

Trump says he plans to put a 100% tariff on computer chips, likely pushing up cost of electronics
Beobachtet Ignoriert Geplant Angeheftet Gesperrt Verschoben Technology technology
226

1

688 Stimmen

226 Beiträge

22 Aufrufe

V

It's far more harmful to allow people, especially people like you, decide who deserves what. So no, all my points stand.
P

The age of storage: Batteries primed for India’s power markets
Beobachtet Ignoriert Geplant Angeheftet Gesperrt Verschoben Technology technology
1

1

30 Stimmen

1 Beiträge

1 Aufrufe

Niemand hat geantwortet
A

As Gaza suffers, US companies are reaping horrific payoffs | Katrina vanden Heuvel
Beobachtet Ignoriert Geplant Angeheftet Gesperrt Verschoben Technology technology
4

1

138 Stimmen

4 Beiträge

8 Aufrufe

A

Thiel taking diligent notes on how to start WWIII. Topics for next year's discussion: •How to rebrand your authoritarian axis. •Deregulating nuclear safety to power AI: How the West finally kicked its fossil fuel habit. •Have the 99% really earned autonomy? •Global organ harvest and the path to immortality for the chosen elite. Nobody wants to call him out bc they've already accepted the future. If anyone in the U.S. actually cared about stopping genocide wouldn't they be demanding the U.S. stop giving billions of dollars in contracts to Palantir, and that any government official investing in genocide be forced to step down?
A

Airlines urge senators to reject bill limiting facial recognition
Beobachtet Ignoriert Geplant Angeheftet Gesperrt Verschoben Technology technology
12

105 Stimmen

12 Beiträge

20 Aufrufe

H

Part of the reason it's so fast is they have the passenger manifest already. So they start the search checking against the hundreds of people that just arrived. Instead off the much larger overall database.
R

Customer Data Platform Market
Beobachtet Ignoriert Geplant Angeheftet Gesperrt Verschoben Technology technology
1

2

0 Stimmen

1 Beiträge

8 Aufrufe

Niemand hat geantwortet
C

SHUT THE FUCK UP!
Beobachtet Ignoriert Geplant Angeheftet Gesperrt Verschoben Technology technology
20

2

94 Stimmen

20 Beiträge

229 Aufrufe

T

Why censor fucking but not fuck?
R

McDonald’s AI Hiring Bot Exposed Millions of Applicants’ Data to Hackers Who Tried the Password ‘123456’
Beobachtet Ignoriert Geplant Angeheftet Gesperrt Verschoben Technology technology
81

1

690 Stimmen

81 Beiträge

2k Aufrufe

I

I don't know why you are getting so many upvotes for being a liar. Tried it on Lemmy.world and it doesn't work. I even tried it with a capital H.
M

Microsoft Pivots, Offers Free Windows 10 Updates after End-Of-Life Deadline with a Strategic Catch - WinBuzzer
Beobachtet Ignoriert Geplant Angeheftet Gesperrt Verschoben Technology technology
1

1

0 Stimmen

1 Beiträge

19 Aufrufe

Niemand hat geantwortet