ChatGPT 'got absolutely wrecked' by Atari 2600 in beginner's chess match — OpenAI's newest model bamboozled by 1970s logic
-
Absolutely interested. Thank you for taking the time to share that.
My career path in neural networks began as a researcher in object detection of cancerous tissue for medical diagnostic imaging. Now it has switched to generative models for CAD (architecture, product design, game assets, etc.). I don't really mess about with fine-tuning LLMs.
However, I do self-host my own LLMs as code assistants. Thus, I'm only tangentially involved with the current LLM craze.
But it does interest me, nonetheless!
wrote on June 11, 2025, 14:38 (last edited):
Here is the main blog post that I remembered: it has a follow-up, a more scientific version, and uses two other articles as a basis, so you might want to dig around in what they mention in the introduction.
It is indeed quite a technical discovery, and it still lacks a complete and wider analysis, but it is very interesting because it kinda invalidates the common gut feeling that LLMs are pure lucky randomness.
-
Using an LLM as a chess engine is like using a power tool as a table leg. Pretty funny honestly, but it's obviously not going to be good at it, at least not without scaffolding.
wrote on June 11, 2025, 18:00 (last edited):
is like using a power tool as a table leg.
Then again, our corporate lords and masters are trying to replace all manner of skilled workers with those same LLM "AI" tools.
And clearly that will backfire on them and they'll eventually scramble to find people with the needed skills, but in the meantime tons of people will have lost their source of income.
-
This post did not contain any content.
wrote on June 11, 2025, 19:44 (last edited):
If you don't play chess, the Atari is probably going to beat you as well.
LLMs are only good at things to the extent that they have been well-trained in the relevant areas. Not just learning to predict text string sequences, but reinforcement learning after that, where a human or some other agent says "this answer is better than that one" enough times in enough of the right contexts. It mimics the way humans learn, which is through repeated and diverse exposure.
If they set up a system to train it against some chess program, or (much simpler) simply gave it a tool call, it would do much better. Tool calling already exists and would be by far the easiest way.
It could also be instructed to write a chess engine and then run it, at which point it would be on par with the Atari, but it wouldn't compete well with a serious chess engine.
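For a rough sketch of what that tool-call route could look like, assuming the python-chess package and a Stockfish binary on PATH (the best_move name and the idea of registering it as a tool are made up for illustration, not from the article):

import chess
import chess.engine

def best_move(fen: str, think_time: float = 0.1) -> str:
    """Return the engine's chosen move (as a UCI string) for a FEN position."""
    board = chess.Board(fen)
    # Assumes a `stockfish` executable is available on PATH.
    engine = chess.engine.SimpleEngine.popen_uci("stockfish")
    try:
        result = engine.play(board, chess.engine.Limit(time=think_time))
        return result.move.uci()
    finally:
        engine.quit()

# An agent framework would expose best_move to the model as a tool; the LLM
# only emits a call like best_move(fen=...), and the engine, not the LLM,
# plays the actual chess.
if __name__ == "__main__":
    print(best_move(chess.STARTING_FEN))  # e.g. "e2e4"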
-
is like using a power tool as a table leg.
Then again, our corporate lords and masters are trying to replace all manner of skilled workers with those same LLM "AI" tools.
And clearly that will backfire on them and they'll eventually scramble to find people with the needed skills, but in the meantime tons of people will have lost their source of income.
wrote on June 11, 2025, 21:11, last edited by jsomae@lemmy.ml on Nov. 6, 2025, 23:12:
If you believe LLMs are not good at anything, then there should be relatively little to worry about in the long term, but I am more concerned.
It's not obvious to me that it will backfire for them, because I believe LLMs are good at some things (that is, when they are used correctly, for the correct tasks). Currently they're being applied to far more use cases than they are likely to be good at -- either because they're overhyped or because our corporate lords and masters are just experimenting to find out what they're good at and what they're not. Some of these cases will be like chess, but others will be like code*.
(* not saying LLMs are good at code in general, but for some coding applications I believe they are vastly more efficient than humans, even if a human expert can currently write higher-quality, less buggy code.)
-
Hardly surprising. LLMs aren't -thinking-; they're just shitting out the next token for any given input of tokens.
wrote on June 11, 2025, 21:43 (last edited):
That's exactly what thinking is, though.
-
This post did not contain any content.
wrote on June 11, 2025, 21:52 (last edited):
2025 Mazda MX-5 Miata 'got absolutely wrecked' by Inflatable Boat in beginner's boat racing match — Mazda's newest model bamboozled by 1930s technology.
-
This post did not contain any content.
wrote on June 12, 2025, 03:46 (last edited):
This is because an LLM is not made for playing chess.
-
If you believe LLMs are not good at anything, then there should be relatively little to worry about in the long term, but I am more concerned.
It's not obvious to me that it will backfire for them, because I believe LLMs are good at some things (that is, when they are used correctly, for the correct tasks). Currently they're being applied to far more use cases than they are likely to be good at -- either because they're overhyped or because our corporate lords and masters are just experimenting to find out what they're good at and what they're not. Some of these cases will be like chess, but others will be like code*.
(* not saying LLMs are good at code in general, but for some coding applications I believe they are vastly more efficient than humans, even if a human expert can currently write higher-quality, less buggy code.)
wrote on June 12, 2025, 15:34 (last edited):
I believe LLMs are good at some things
The problem is that they're being used for all the things, including a large number of tasks that they are not well suited to.
-
I believe LLMs are good at some things
The problem is that they're being used for all the things, including a large number of tasks that they are not well suited to.
wrote on June 12, 2025, 15:49 (last edited):
Yeah, we agree on this point. In the short term it's a disaster. In the long term, assuming AI's capabilities don't continue to improve at the rate they have been, our corporate overlords will only replace the people it is actually worth it to them to replace with AI.
-
That's exactly what thinking is, though.
wrote on June 13, 2025, 04:22, last edited by arc99@lemmy.world:
An LLM is an ordered series of parameterized / weighted nodes which are fed a bunch of tokens; millions of calculations later, it generates the next token to append, and the process repeats. It's like turning a handle on some complex Babbage-esque machine. LLMs use a tiny bit of randomness ("temperature") when choosing the next token so the responses are not identical each time.
But it is not thinking. Not even remotely so. It's a simulacrum. If you want to see this, run ollama with the temperature set to 0, e.g.

ollama run gemma3:4b
>>> /set parameter temperature 0
>>> what is a leaf
You will get the same answer every single time.
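To see why temperature 0 pins the output down, here is a toy sketch (plain NumPy with made-up logits, not any real model's scores) of how temperature enters next-token sampling; at 0 it collapses to argmax, so the same prompt always yields the same token:

import numpy as np

def sample_next_token(logits: np.ndarray, temperature: float) -> int:
    """Sample a next-token index from raw logits at a given temperature."""
    if temperature == 0:
        # Greedy decoding: always the single highest-scoring token,
        # which is why the response is identical on every run.
        return int(np.argmax(logits))
    scaled = logits / temperature          # <1 sharpens, >1 flattens the distribution
    probs = np.exp(scaled - scaled.max())  # softmax, shifted for numerical stability
    probs /= probs.sum()
    return int(np.random.choice(len(logits), p=probs))

# Toy scores for three candidate tokens.
logits = np.array([2.0, 1.0, 0.5])
print(sample_next_token(logits, temperature=0))    # always 0
print(sample_next_token(logits, temperature=1.0))  # varies between runs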
-
An LLM is an ordered series of parameterized / weighted nodes which are fed a bunch of tokens; millions of calculations later, it generates the next token to append, and the process repeats. It's like turning a handle on some complex Babbage-esque machine. LLMs use a tiny bit of randomness ("temperature") when choosing the next token so the responses are not identical each time.
But it is not thinking. Not even remotely so. It's a simulacrum. If you want to see this, run ollama with the temperature set to 0, e.g.

ollama run gemma3:4b
>>> /set parameter temperature 0
>>> what is a leaf
You will get the same answer every single time.
wrote on June 17, 2025, 02:31, last edited by stevedice@sh.itjust.works:
I know what an LLM is doing. You don't know what your brain is doing.
-