linux-nerds.org

Your browser does not seem to support JavaScript. As a result, your viewing experience will be diminished, and you have been placed in read-only mode.

Please download a browser that supports JavaScript, or enable it if it's disabled (i.e. NoScript).

ChatGPT 'got absolutely wrecked' by Atari 2600 in beginner's chess match — OpenAI's newest model bamboozled by 1970s logic

204 Beiträge 136 Kommentatoren 5.9k Aufrufe

S spankmonkey@lemmy.world

AI including ChatGPT is being marketed as super awesome at everything, which is why that and similar AI is being forced into absolutely everything and being sold as a replacement for people.

Something marketed as AGI should be treated as AGI when proving it isn't AGI.
P This user is from outside of this forum
P This user is from outside of this forum
pixelatedsaturn@lemmy.world

schrieb am zuletzt editiert von

#28

I don't think ai is being marketed as awesome at everything. It's got obvious flaws. Right now its not good for stuff like chess, probably not even tic tac toe. It's a language model, its hard for it to calculate the playing field. But ai is in development, it might not need much to start playing chess.
V 4 B 3 Antworten Letzte Antwort

9
A anubis119@lemmy.world

A strange game. How about a nice game of Global Thermonuclear War?
L This user is from outside of this forum
L This user is from outside of this forum
lifecoach5000@lemmy.world

schrieb am zuletzt editiert von

#29

Lmao! that made me spit!!
1 Antwort Letzte Antwort

2
A anubis119@lemmy.world

A strange game. How about a nice game of Global Thermonuclear War?
M This user is from outside of this forum
M This user is from outside of this forum
madmadbunny@lemmy.ca

schrieb am zuletzt editiert von

#30

Frak off, toaster
1 Antwort Letzte Antwort

1
M muntedcrocodile@lemm.ee

This isn't the strength of gpt-o4 the model has been optimised for tool use as an agent. That's why its so good at image gen relative to other models it uses tools to construct an image piece by piece similar to a human. Also probably poor system prompting. A LLM is not a universal thinking machine its a a universal process machine. An LLM understands the process and uses tools to accomplish the process hence its strengths in writing code (especially as an agent).

Its similar to how a monkey is infinitely better at remembering a sequence of numbers than a human ever could but is totally incapable of even comprehending writing down numbers.
C This user is from outside of this forum
C This user is from outside of this forum
cheese_greater@lemmy.world

schrieb am zuletzt editiert von

#31

Do you have a source for that re:monkeys memorizing numerical sequences? What do you mean by that?
R S 2 Antworten Letzte Antwort

2
N nurse_robot@lemmy.world

I'm often impressed at how good chatGPT is at generating text, but I'll admit it's hilariously terrible at chess. It loves to manifest pieces out of thin air, or make absurd illegal moves, like jumping its king halfway across the board and claiming checkmate
L This user is from outside of this forum
L This user is from outside of this forum
lifecoach5000@lemmy.world

schrieb am zuletzt editiert von

#32

Yeah! I’ve loved watching Gothem Chess’ videos on these. Always have been good for a laugh.
1 Antwort Letzte Antwort

2
P pelespirit@sh.itjust.works

Not to help the AI companies, but why don't they program them to look up math programs and outsource chess to other programs when they're asked for that stuff? It's obvious they're shit at it, why do they answer anyway? It's because they're programmed by know-it-all programmers, isn't it.
R This user is from outside of this forum
R This user is from outside of this forum
rebelsimile@sh.itjust.works

schrieb am zuletzt editiert von

#33

Because they’re fucking terrible at designing tools to solve problems, they are obviously less and less good at pretending this is an omnitool that can do everything with perfect coherency (and if it isn’t working right it’s because you’re not believing or paying hard enough)
M 1 Antwort Letzte Antwort

28
C cheese_greater@lemmy.world

Do you have a source for that re:monkeys memorizing numerical sequences? What do you mean by that?
R This user is from outside of this forum
R This user is from outside of this forum
remembertheending@lemmy.world

schrieb am zuletzt editiert von

#34
1 Antwort Letzte Antwort

5
C cheese_greater@lemmy.world

Do you have a source for that re:monkeys memorizing numerical sequences? What do you mean by that?
S This user is from outside of this forum
S This user is from outside of this forum
shalafi@lemmy.world

schrieb am zuletzt editiert von

#35

That threw me as well.
1 Antwort Letzte Antwort

1
C carbonatedpastasauce@lemmy.world

Neither of those things are marketed as being artificially intelligent.
L This user is from outside of this forum
L This user is from outside of this forum
lembot_0003@lemmy.zip

schrieb am zuletzt editiert von

#36

Marketers aren't intelligent either, so I see no reason to listen to them.
H 1 Antwort Letzte Antwort

9
L lifecoach5000@lemmy.world

This post did not contain any content.
A This user is from outside of this forum
A This user is from outside of this forum
asswardbackaddict@lemmy.world

schrieb am zuletzt editiert von asswardbackaddict@lemmy.world

#37

While you guys suck at using tools, I'm making up for my lack of coding experience with ai, and successfully simulating the behavior of my aether (fuck you guys. Your search for a static ether is irrelevant to how mine behaves, and you shouldn't have dismissed everybody from Diogynes to Einstein), showing soliton-like structure emergence and particle-like interactions (with 1D relativistic constraints [I'm gonna need a fucking super computer to scale to 3D]). Anyways, whether you're wrong about your latest fun fact, cutting your thumb off trying to split a 2X4, or believing any idiot you talk to, this is user error, bro. Creating functional code for my simulator has saved me months, if not years of my life. Just setting up a gui was ridiculous for a novice like me, let alone translating walls of relativistic equation results (mainly stress-energy tensor) into code a computer can use. Side note: y'all don't give a fuck about facts. Come on. We're primates. Social status is the name of the game.
X 1 Antwort Letzte Antwort

1
L lembot_0003@lemmy.zip

Marketers aren't intelligent either, so I see no reason to listen to them.
H This user is from outside of this forum
H This user is from outside of this forum
homesweethomemrl@lemmy.world

schrieb am zuletzt editiert von

#38

You’re not going to slimeball investors out of three hundred billion dollars with that attitude, mister.
1 Antwort Letzte Antwort

6
F fmt99@lemmy.world

Prepare to be delighted. Full disclosure, my Atari isn't hooked up and also I don't have the Video Chess cart even if it was, so this was fetched from Google Images.
H This user is from outside of this forum
H This user is from outside of this forum
homesweethomemrl@lemmy.world

schrieb am zuletzt editiert von

#39

Can confirm.

And if you play it on expert mode, you can leave for college and get your degree before it’s your turn again.
1 Antwort Letzte Antwort

1
P pixelatedsaturn@lemmy.world

I don't think ai is being marketed as awesome at everything. It's got obvious flaws. Right now its not good for stuff like chess, probably not even tic tac toe. It's a language model, its hard for it to calculate the playing field. But ai is in development, it might not need much to start playing chess.
V This user is from outside of this forum
V This user is from outside of this forum
vinnymac@lemmy.world

schrieb am zuletzt editiert von

#40

What the tech is being marketed as and what it’s capable of are not the same, and likely never will be. In fact all things are very rarely marketed how they truly behave, intentionally.

Everyone is still trying to figure out what these Large Reasoning Models and Large Language Models are even capable of; Apple, one of the largest companies in the world just released a white paper this past week describing the “illusion of reasoning”. If it takes a scientific paper to understand what these models are and are not capable of, I assure you they’ll be selling snake oil for years after we fully understand every nuance of their capabilities.

TL;DR Rich folks want them to be everything, so they’ll be sold as capable of everything until we repeatedly refute they are able to do so.
P 1 Antwort Letzte Antwort

31
F fmt99@lemmy.world

Did the author thinks ChatGPT is in fact an AGI? It's a chatbot. Why would it be good at chess? It's like saying an Atari 2600 running a dedicated chess program can beat Google Maps at chess.
S This user is from outside of this forum
S This user is from outside of this forum
saltesc@lemmy.world

schrieb am zuletzt editiert von

#41

I like referring to LLMs as VI (Virtual Intelligence from Mass Effect) since they merely give the impression of intelligence but are little more than search engines. In the end all one is doing is displaying expected results based on a popularity algorithm. However they do this inconsistently due to bad data in and limited caching.
1 Antwort Letzte Antwort

4
Y youngalfred@lemm.ee

Here you go (online emulator): https://www.retrogames.cz/play_716-Atari2600.php
O This user is from outside of this forum
O This user is from outside of this forum
over_clox@lemmy.world

schrieb am zuletzt editiert von

#42

WTF? I just played that just long enough for my queen to take over their queen, and it turned my queen into a rook?

Is that even a legit rule in any variation of chess rules?
1 Antwort Letzte Antwort

0
A anubis119@lemmy.world

A strange game. How about a nice game of Global Thermonuclear War?
X This user is from outside of this forum
X This user is from outside of this forum
xanthobilly@lemmy.world

schrieb am zuletzt editiert von

#43
1 Antwort Letzte Antwort

1
P pelespirit@sh.itjust.works

Not to help the AI companies, but why don't they program them to look up math programs and outsource chess to other programs when they're asked for that stuff? It's obvious they're shit at it, why do they answer anyway? It's because they're programmed by know-it-all programmers, isn't it.
N This user is from outside of this forum
N This user is from outside of this forum
nobodyelse@sh.itjust.works

schrieb am zuletzt editiert von

#44

Because the LLMs are now being used to vibe code themselves.
1 Antwort Letzte Antwort

4
E electricyarn@lemmy.world

Yeah, just because I can't count the number of r's in the word strawberry doesn't mean I shouldn't be put in charge of the US nuclear arsenal!
O This user is from outside of this forum
O This user is from outside of this forum
otp@sh.itjust.works

schrieb am zuletzt editiert von

#45

That is more a failure of the person who made that decision than a failing of ChatBots, lol
P W 2 Antworten Letzte Antwort

3
P pixelatedsaturn@lemmy.world

I don't think ai is being marketed as awesome at everything. It's got obvious flaws. Right now its not good for stuff like chess, probably not even tic tac toe. It's a language model, its hard for it to calculate the playing field. But ai is in development, it might not need much to start playing chess.
4 This user is from outside of this forum
4 This user is from outside of this forum
4am@lemm.ee

schrieb am zuletzt editiert von

#46

Really then why are they cramming AI into every app and every device and replacing jobs with it and claiming they’re saving so much time and money and they’re the best now the hardest working most efficient company and this is the future and they have a director of AI vision that’s right a director of AI vision a true visionary to lead us into the promised land where we will make money automatically please bro just let this be the automatic money cheat oh god I’m about to
P 1 Antwort Letzte Antwort

8
P pixelatedsaturn@lemmy.world

I don't think ai is being marketed as awesome at everything. It's got obvious flaws. Right now its not good for stuff like chess, probably not even tic tac toe. It's a language model, its hard for it to calculate the playing field. But ai is in development, it might not need much to start playing chess.
B This user is from outside of this forum
B This user is from outside of this forum
bassturd@lemmy.world

schrieb am zuletzt editiert von

#47

Marketing does not mean functionality. AI is absolutely being sold to the public and enterprises as something that can solve everything. Obviously it can't, but it's being sold that way. I would bet the average person would be surprised by this headline solely on what they've heard about the capabilities of AI.
P 1 Antwort Letzte Antwort

17

Anmelden zum Antworten

A

Scientists study how people would react to a neurotic robot personality in real life
Beobachtet Ignoriert Geplant Angeheftet Gesperrt Verschoben Technology technology robotics
9

61 Stimmen

9 Beiträge

43 Aufrufe

G

At least they're good at imagining all the ways in which you can hurt yourself way beforehand...and making sure you don't do them...or anything else
J

🎯 A free collection of 40+ web tools – from dev utilities to productivity boosters
Beobachtet Ignoriert Geplant Angeheftet Gesperrt Verschoben Technology technology
2

4 Stimmen

2 Beiträge

13 Aufrufe

K

You made this site, you say? What an odd coincidence! Were you inspired by the site you say you "stumbled upon" here? https://lemmy.world/post/33395761 Because it sure seems like the exact same site.
P

(LLM) A language model built for the public good
Beobachtet Ignoriert Geplant Angeheftet Gesperrt Verschoben Technology technology
18

1

131 Stimmen

18 Beiträge

228 Aufrufe

D

Is the red cross involved? Because if not, using a red cross in the article is misleading and potentially a crime.
P

LLMs factor in unrelated information when recommending medical treatments
Beobachtet Ignoriert Geplant Angeheftet Gesperrt Verschoben Technology technology
15

1

138 Stimmen

15 Beiträge

121 Aufrufe

T

ChatGPT is not a doctor. But models trained on imaging can actually be a very useful tool for them to utilize. Even years ago, just before the AI “boom”, they were asking doctors for details on how they examine patient images and then training models on that. They found that the AI was “better” than doctors specifically because it followed the doctor’s advice 100% of the time; thereby eliminating any kind of bias from the doctor that might interfere with following their own training. Of course, the splashy headline “AI better than doctors” was ridiculous. But it does show the benefit of having a neutral tool for doctors to utilize, especially when looking at images for people who are outside of the typical demographics that much medical training is based on. (As in mostly just white men. For example, everything they train doctors on regarding knee imagining comes from images of the knees of coal miners in the UK some decades ago)
P

Inside a Dark Adtech Empire Fed by Fake CAPTCHAs
Beobachtet Ignoriert Geplant Angeheftet Gesperrt Verschoben Technology technology
1

10 Stimmen

1 Beiträge

16 Aufrufe

Niemand hat geantwortet
S

Anker recalls over a million power banks due to fire and burn hazards
Beobachtet Ignoriert Geplant Angeheftet Gesperrt Verschoben Technology technology
9

1

178 Stimmen

9 Beiträge

94 Aufrufe

R

They've probably just crunched the numbers and determined the cost of a recall in Canada was greater than the cost of law suits when your house does burn down
E

Why Silicon Valley Needs Immigration
Beobachtet Ignoriert Geplant Angeheftet Gesperrt Verschoben Technology technology
4

1

36 Stimmen

4 Beiträge

58 Aufrufe

A

"Because theyŕe greedy fucks". There, saved you a click.
A

Tesla confirms it has given up on its Cybertruck range extender to achieve promised range
Beobachtet Ignoriert Geplant Angeheftet Gesperrt Verschoben Technology technology
31

1

531 Stimmen

31 Beiträge

276 Aufrufe

U

If you want a narrative, look at all the full-price $250k Roadster pre-orders they've been holding onto for like 8 years now with zero signs of production and complete silence for the last...5 years?

1
2
3
4
5
10
11