linux-nerds.org

Your browser does not seem to support JavaScript. As a result, your viewing experience will be diminished, and you have been placed in read-only mode.

Please download a browser that supports JavaScript, or enable it if it's disabled (i.e. NoScript).

ChatGPT 'got absolutely wrecked' by Atari 2600 in beginner's chess match — OpenAI's newest model bamboozled by 1970s logic

Technology

204 Beiträge 136 Kommentatoren 5.9k Aufrufe

M melodiousfunk@startrek.website

Plot twist: the toddler has a multi-year marketing push worth tens if not hundreds of millions, which convinced a lot of people who don't know the first thing about chess that it really is very impressive, and all those chess-types are just jealous.
X This user is from outside of this forum
X This user is from outside of this forum
xavier666@lemm.ee

schrieb am zuletzt editiert von

#108

Have you tried feeding the toddler gallons of baby-food? Maybe then it can play chess
B 1 Antwort Letzte Antwort

3
E etterra@discuss.online

That's because it doesn't know what it's saying. It's just blathering out each word as what it estimates to be the likely next word given past examples in its training data. It's a statistics calculator. It's marginally better than just smashing the auto fill on your cell repeatedly. It's literally dumber than a parrot.
A This user is from outside of this forum
A This user is from outside of this forum
anunusualrelic@lemmy.world

schrieb am zuletzt editiert von

#109

Parrots are actually intelligent though.
E 1 Antwort Letzte Antwort

3
B broken@lemmy.ml

I agree with your general statement, but in theory since all ChatGPT does is regurgitate information back and a lot of chess is memorization of historical games and types, it might actually perform well. No, it can't think, but it can remember everything so at some point that might tip the results in it's favor.
F This user is from outside of this forum
F This user is from outside of this forum
fmt99@lemmy.world

schrieb am zuletzt editiert von

#110

I mean it may be possible but the complexity would be so many orders of magnitude greater. It'd be like learning chess by just memorizing all the moves great players made but without any context or understanding of the underlying strategy.
1 Antwort Letzte Antwort

0
A adhdplantdev@lemm.ee

Articles like this are good because it exposes the flaws with the ai and that it can't be trusted with complex multi step tasks.

Helps people see that think AI is close to a human that its not and its missing critical functionality
F This user is from outside of this forum
F This user is from outside of this forum
fmt99@lemmy.world

schrieb am zuletzt editiert von

#111

The problem is though that this perpetuates the idea that ChatGPT is actually an AI.
A 1 Antwort Letzte Antwort

3
X x00z@lemmy.world

In all fairness. Machine learning in chess engines is actually pretty strong.

AlphaZero was developed by the artificial intelligence and research company DeepMind, which was acquired by Google. It is a computer program that reached a virtually unthinkable level of play using only reinforcement learning and self-play in order to train its neural networks. In other words, it was only given the rules of the game and then played against itself many millions of times (44 million games in the first nine hours, according to DeepMind).

AlphaZero - Chess Engines

Learn all about the AlphaZero chess program. Everything you need to know about AlphaZero, including what it is, why it is important, and more!

Chess.com (www.chess.com)
F This user is from outside of this forum
F This user is from outside of this forum
fmt99@lemmy.world

schrieb am zuletzt editiert von

#112

Oh absolutely you can apply machine learning to game strategy. But you can't expect a generalized chatbot to do well at strategic decision making for a specific game.
1 Antwort Letzte Antwort

0
A asswardbackaddict@lemmy.world

You're so fucking silly. You gonna study cell theory to see how long you should keep vegetables in your fridge? Go home. Save science for people who understand things.
J This user is from outside of this forum
J This user is from outside of this forum
junkthief@lemmy.blahaj.zone

schrieb am zuletzt editiert von

#113

Save science for people who understand things.

Does this not strike you as the least bit ironic?
1 Antwort Letzte Antwort

0
P pixelatedsaturn@lemmy.world

I like tab coding, writing small blocks of code that it thinks I need. Its On point almost all the time. This speeds me up.
W This user is from outside of this forum
W This user is from outside of this forum
whoisearth@lemmy.ca

schrieb am zuletzt editiert von whoisearth@lemmy.ca

#114

Bingo. If anything what you're finding is the people bitching are the same people that if given a bike wouldn't know how to ride it, which is fair. Some people understand quicker how to use the tools they are given.

Edit - a poor carpenter blames his tools.
1 Antwort Letzte Antwort

6
P pelespirit@sh.itjust.works

Not to help the AI companies, but why don't they program them to look up math programs and outsource chess to other programs when they're asked for that stuff? It's obvious they're shit at it, why do they answer anyway? It's because they're programmed by know-it-all programmers, isn't it.
F This user is from outside of this forum
F This user is from outside of this forum
fmstrat@lemmy.nowsci.com

schrieb am zuletzt editiert von

#115

This is where MCP comes in. It's a protocol for LLMs to call standard tools. Basically the LLM would figure out the tool to use from the context, then figure out the order of parameters from those the MCP server says is available, send the JSON, and parse the response.
1 Antwort Letzte Antwort

1
O otp@sh.itjust.works

That is more a failure of the person who made that decision than a failing of ChatBots, lol
W This user is from outside of this forum
W This user is from outside of this forum
wewbull@feddit.uk

schrieb am zuletzt editiert von

#116

Agreed, which is why it's important to have articles out in the wild that show the shortcomings of AI. If all people read is all the positive crap coming out of companies like OpenAI then they will make stupid decisions.
1 Antwort Letzte Antwort

2
L lifecoach5000@lemmy.world

This post did not contain any content.
A This user is from outside of this forum
A This user is from outside of this forum
arc99@lemmy.world

schrieb am zuletzt editiert von

#117

Hardly surprising. Llms aren't -thinking- they're just shitting out the next token for any given input of tokens.
S 1 Antwort Letzte Antwort

18
A alecsadler@sh.itjust.works

ChatGPT has been, hands down, the worst AI coding assistant I've ever used.

It regularly suggests code that doesn't compile or isn't even for the language.

It generally suggests AC of code that is just a copy of the lines I just wrote.

Sometimes it likes to suggest setting the same property like 5 times.

It is absolute garbage and I do not recommend it to anyone.
A This user is from outside of this forum
A This user is from outside of this forum
arc99@lemmy.world

schrieb am zuletzt editiert von

#118

All AIs are the same. They're just scraping content from GitHub, stackoverflow etc with a bunch of guardrails slapped on to spew out sentences that conform to their training data but there is no intelligence. They're super handy for basic code snippets but anyone using them anything remotely complex or nuanced will regret it.
A N 2 Antworten Letzte Antwort

5
S seven_phone@lemmy.world

You say you produce good oranges but my machine for testing apples gave your oranges a very low score.
W This user is from outside of this forum
W This user is from outside of this forum
wizardbeard@lemmy.dbzer0.com

schrieb am zuletzt editiert von

#119

No, more like "Your marketing team, sales team, the news media at large, and random hype men all insist your orange machine works amazing on any fruit if you know how to use it right. It didn't work my strawberries when I gave it all the help I could, and was outperformed by my 40 year old strawberry machine. Please stop selling the idea it works on all fruit."

This study is specifically a counter to the constant hype that these LLMs will revolutionize absolutely everything, and the constant word choices used in discussion of LLMs that imply they have reasoning capabilities.
1 Antwort Letzte Antwort

3
N nutsack@lemmy.dbzer0.com

my favorite thing is to constantly be implementing libraries that don't exist
A This user is from outside of this forum
A This user is from outside of this forum
arc99@lemmy.world

schrieb am zuletzt editiert von

#120

It's even worse when AI soaks up some project whose APIs are constantly changing. Try using AI to code against jetty for example and you'll be weeping.
1 Antwort Letzte Antwort

1
L lifecoach5000@lemmy.world

This post did not contain any content.
H This user is from outside of this forum
H This user is from outside of this forum
halosheep@lemm.ee

schrieb am zuletzt editiert von

#121

I swear every single article critical of current LLMs is like, "The square got BLASTED by the triangle shape when it completely FAILED to go through the triangle shaped hole."
D I L 3 Antworten Letzte Antwort

48
X xavier666@lemm.ee

Have you tried feeding the toddler gallons of baby-food? Maybe then it can play chess
B This user is from outside of this forum
B This user is from outside of this forum
baggachipz@sh.itjust.works

schrieb am zuletzt editiert von

#122

They’ve been feeding the toddler everybody else’s baby food and claiming they have the right to.
X 1 Antwort Letzte Antwort

2
I isaamoonkhgdt_6143@lemmy.zip

They used ChatGPT 4o, instead of using o1 or o3.

Obviously it was going to fail.
W This user is from outside of this forum
W This user is from outside of this forum
wizardbeard@lemmy.dbzer0.com

schrieb am zuletzt editiert von wizardbeard@lemmy.dbzer0.com

#123

Other studies (not all chess based or against this old chess AI) show similar lackluster results when using reasoning models.

Edit: When comparing reasoning models to existing algorithmic solutions.
1 Antwort Letzte Antwort

0
H halosheep@lemm.ee

I swear every single article critical of current LLMs is like, "The square got BLASTED by the triangle shape when it completely FAILED to go through the triangle shaped hole."
D This user is from outside of this forum
D This user is from outside of this forum
drspod@lemmy.ml

schrieb am zuletzt editiert von

#124

It's newsworthy when the sellers of squares are saying that nobody will ever need a triangle again, and the shape-sector of the stock market is hysterically pumping money into companies that make or use squares.
P I M 3 Antworten Letzte Antwort

38
B baggachipz@sh.itjust.works

They’ve been feeding the toddler everybody else’s baby food and claiming they have the right to.
X This user is from outside of this forum
X This user is from outside of this forum
xavier666@lemm.ee

schrieb am zuletzt editiert von

#125

"If we have to ask every time before stealing a little baby food, our morbidly obese toddler cannot survive"
1 Antwort Letzte Antwort

3
A alecsadler@sh.itjust.works

ChatGPT has been, hands down, the worst AI coding assistant I've ever used.

It regularly suggests code that doesn't compile or isn't even for the language.

It generally suggests AC of code that is just a copy of the lines I just wrote.

Sometimes it likes to suggest setting the same property like 5 times.

It is absolute garbage and I do not recommend it to anyone.
I This user is from outside of this forum
I This user is from outside of this forum
ilikeboobies@lemmy.ca

schrieb am zuletzt editiert von

#126

I’ve had success with splitting a function into 2 and planning out an overview, though that’s more like talking to myself

I wouldn’t use it to generate stuff though
1 Antwort Letzte Antwort

0
D drspod@lemmy.ml

It's newsworthy when the sellers of squares are saying that nobody will ever need a triangle again, and the shape-sector of the stock market is hysterically pumping money into companies that make or use squares.
P This user is from outside of this forum
P This user is from outside of this forum
pushbutton@lemmy.world

schrieb am zuletzt editiert von

#127

You get 2 triangles in a single square mate...

CHECKMATE!
A 1 Antwort Letzte Antwort

6

Anmelden zum Antworten

I

Nvidia plans to boost presence in Israel with multibillion-dollar tech campus in north
Beobachtet Ignoriert Geplant Angeheftet Gesperrt Verschoben Technology technology
62

1

290 Stimmen

62 Beiträge

455 Aufrufe

B

I have crypto and I play games and I will not buy Nvidia again
R

Getting Started with Ebitengine (Go game engine)
Beobachtet Ignoriert Geplant Angeheftet Gesperrt Verschoben Technology technology
2

15 Stimmen

2 Beiträge

9 Aufrufe

R

This video complements the text tutorial at https://trevors-tutorials.com/0003-getting-started-with-ebitengine/ Trevors-Tutorials.com is where you can find free programming tutorials. The focus is on Go and Ebitengine game development. Watch the channel introduction for more info.
5

Delta moves toward eliminating set prices in favor of AI that determines how much you personally will pay for a ticket
Beobachtet Ignoriert Geplant Angeheftet Gesperrt Verschoben Technology technology
220

1

801 Stimmen

220 Beiträge

2k Aufrufe

U

algos / AI has already been used to justify racial discrimination in some counties who use predictive policing software to adjust the sentences of convicts (the software takes in a range of facts about the suspect and the incident and compares it to how prior incidents and suspects were similar features were adjudicated) and wouldn't you know it, it simply highlighted and exaggerated the prejudices of police and the courts to absurdity, giving whites absurdly lighter sentences than nonwhites, for example. This is essentially mind control or coercion technology based on the KGB technology of компромат (Kompromat, or compromising information, or as CIA calls it biographical leverage, ) essentially, information about a person that can be used either to jeopardize their life, blackmail material or means to lure and bribe them. Take this from tradecraft and apply it to marketing or civil control, and you get things like the Social Credit System in China to keep people from misbehaving, engaging in discontent and coming out of the closet (LGBTQ+ but there are plenty of other applicable closets). From a futurist perspective, we homo-sapiens appear just incapable of noping out of a technology or process, no matter how morally black or heinous that technology is, we'll use it, especially those with wealth and power to evade legal prosecution (or civil persecution). It breaks down into three categories: Technologies we use anyway, and suffer, e.g. usury, bonded servitude, mass-media propaganda distribution Technologies we collectively decide are just not worth the consequences, e.g. the hydrogen bomb, biochemical warfare Technologies for which we create countermeasures, usually turning into a tech race between states or between the public and the state, e.g. secure communication, secure data encryption, forbidden data distribution / censorship We're clearly on the cusp of mind control and weaponizing data harvesting into a coercion mechanism. Currently we're already seeing it used to establish and defend specific power structures that are antithetical to the public good. It's currently in the first category, and hopefully it'll fall into the third, because we have to make a mess (e.g. Castle Bravo / Bikini Atol) and clean it up before deciding not to do that again. Also, with the rise of the internet, we've run out of myths that justify capitalism, which is bonded servitude with extra steps. So we may soon (within centuries) see that go into one of the latter two categories, since the US is currently experiencing the endgame consequences of forcing labor, and the rest of the industrialized world is having to bulwark from the blast.
M

You can still enable uBlock Origin in Chrome, here is how
Beobachtet Ignoriert Geplant Angeheftet Gesperrt Verschoben Technology technology
130

1

312 Stimmen

130 Beiträge

2k Aufrufe

W

I use IronFox all the time. For me almost nothing is broken. Once a year I find one low value site that I have to load in Cromite to see what it is, and then I never use that trash site again. In other words, IronFox fulfills 100% of all my browsing needs excellently. I used Mull before IronFox, and my experience there was excellent as well. There is no good reason to use Chrome today or even some years back when Mull was the thing.
R

FTC’s click-to-cancel rule has been struck down by federal judges at the eleventh hour
Beobachtet Ignoriert Geplant Angeheftet Gesperrt Verschoben Technology technology
30

1

427 Stimmen

30 Beiträge

204 Aufrufe

S

Every single opportunity, however petty, to ensure we become more miserable evwry day.
D

Study finds smartphone bans in Dutch schools improved focus
Beobachtet Ignoriert Geplant Angeheftet Gesperrt Verschoben Technology technology
55

359 Stimmen

55 Beiträge

372 Aufrufe

D

Based on what data?
D

Racist AI-generated videos are garnering millions of views on TikTok
Beobachtet Ignoriert Geplant Angeheftet Gesperrt Verschoben Technology technology
36

1

238 Stimmen

36 Beiträge

201 Aufrufe

M

It should be taught at schools that there is no such thing as human race, it's a fucking disgracing non-scientific term. Skin color is just that - a skin color.
T

EV tax credits might end even sooner than House bill proposed
Beobachtet Ignoriert Geplant Angeheftet Gesperrt Verschoben Technology technology
7

49 Stimmen

7 Beiträge

46 Aufrufe

B

It's not just tax credits for new cars, they are also getting rid of the Used EV Tax Credit which has helped to keep the prices of used EVs (relatively) lower.