linux-nerds.org

Apple just proved AI "reasoning" models like Claude, DeepSeek-R1, and o3-mini don't actually reason at all. They just memorize patterns really well.

Technology

356 posts 149 commenters 3.1k views

M muskymelon@lemmy.world

I use LLMs as advanced search engines. No ads or sponsored results.
dojan@pawb.social wrote:

#16

There are search engines that do this better. There’s a world out there beyond Google.
1 reply

11
N nanook@lemm.ee

lol is this news? I mean we call it AI, but it’s just LLMs and variants; it doesn’t think.
mnbychoice@midwest.social wrote:

#17

The "Apple" part. CEOs only care what companies say.
1 reply

77
A allah@lemm.ee

LOOK MAA I AM ON FRONT PAGE
blaster_m@lemmy.world wrote (last edited by blaster_m@lemmy.world):

#18

Would like a link to the original research paper, instead of a link to a screenshot of a screenshot.
1 reply

6
N nanook@lemm.ee

lol is this news? I mean we call it AI, but it’s just LLMs and variants; it doesn’t think.
melvin_ferd@lemmy.world wrote:

#19

This is why I say these articles are so similar to how right wing media covers issues about immigrants.

There's some weird media push to convince the left to hate AI. Think of all the headlines for these issues. There are so many similarities. They're taking jobs. They are a threat to our way of life. The headlines talk about how they will sexually assault your wife, your children, you. Threats to the environment. There are articles like this where they take something known and twist it to make it sound nefarious to keep the story alive and avoid decay of interest.

Then when they pass laws, we're all primed to accept them removing whatever it is, to their advantage and our disadvantage.
3 replies

13
H hybridep@lemmy.wtf

And this is relevant to this post in what regard?

90% of Lemmy comments lately are not subject-related and only about how OP is not leftist, not pro-Israel, pro-Palestine, pro-SJW enough. Is this what Lemmy aims to be?
melvin_ferd@lemmy.world wrote:

#20

It's not relevant to the post... But what the fuck
1 reply

2
A allah@lemm.ee

LOOK MAA I AM ON FRONT PAGE
sev@nullterra.org wrote:

#21

Just fancy Markov chains with the ability to link bigger and bigger token sets. It can only ever kick off processing as a response and can never initiate any line of reasoning. This, along with the fact that its working set of data can never be updated moment-to-moment, means that it would be a physical impossibility for any LLM to achieve any real "reasoning" processes.
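For readers who have not met the term, here is a minimal sketch of a plain word-level Markov chain text generator. It only illustrates the analogy made in the comment above; it is not a description of how any LLM is implemented. The names `build_chain` and `generate`, the toy corpus, and the fixed seed are all assumptions made for the example.

```python
import random
from collections import defaultdict

def build_chain(tokens, order=2):
    """Map each run of `order` tokens to the tokens observed right after it."""
    chain = defaultdict(list)
    for i in range(len(tokens) - order):
        state = tuple(tokens[i:i + order])
        chain[state].append(tokens[i + order])
    return chain

def generate(chain, seed, length=10, rng=None):
    """Extend `seed` by repeatedly sampling a successor of the last tokens."""
    if rng is None:
        rng = random.Random(0)  # fixed seed so the sketch is repeatable
    order = len(seed)
    out = list(seed)
    for _ in range(length):
        successors = chain.get(tuple(out[-order:]))
        if not successors:  # context never seen in the corpus: nothing to sample
            break
        out.append(rng.choice(successors))
    return out

# Toy corpus (hypothetical); a larger `order` corresponds to "bigger token sets".
corpus = "the cat sat on the mat and the cat ran off the mat".split()
chain = build_chain(corpus, order=2)
sample = generate(chain, seed=("the", "cat"), length=8)
print(" ".join(sample))
```

Every three-word window in the output already occurs somewhere in the corpus, which is the sense in which such a chain only replays observed patterns.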
2 replies

50
B blaster_m@lemmy.world

Would like a link to the original research paper, instead of a link to a screenshot of a screenshot.
allah@lemm.ee wrote:

#22

The Illusion of Thinking: Understanding the Strengths and Limitations of Reasoning Models via the Lens of Problem Complexity

Recent generations of frontier language models have introduced Large Reasoning Models (LRMs) that generate detailed thinking processes…

Apple Machine Learning Research (machinelearning.apple.com)
1 reply

11
A allah@lemm.ee

LOOK MAA I AM ON FRONT PAGE
1rre@discuss.tchncs.de wrote:

#23

The difference between reasoning models and normal models is that reasoning models work in two steps. To oversimplify a little, they prompt "how would you go about responding to this?" and then prompt "write the response".

It's still predicting the most likely thing to come next, but the difference is that it gives the chance for the model to write the most likely instructions to follow for the task, then the most likely result of following the instructions - both of which are much more conformant to patterns than a single jump from prompt to response.
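The two-step flow described above can be sketched roughly as follows. This is a deliberately simplified illustration of the comment's own oversimplification, not how any particular vendor implements reasoning models. `answer_with_plan` and `toy_model` are hypothetical names, and `toy_model` is a canned stand-in so the example runs without any API.

```python
from typing import Callable, Dict

def answer_with_plan(model: Callable[[str], str], question: str) -> Dict[str, str]:
    """Call the same model twice: once for a plan, once for the final answer."""
    # Step 1: ask for the most likely instructions for the task.
    plan = model(f"How would you go about responding to this?\n\n{question}")
    # Step 2: ask for the most likely result of following those instructions.
    response = model(
        f"Question: {question}\n\nPlan:\n{plan}\n\n"
        "Write the response, following the plan."
    )
    return {"plan": plan, "response": response}

def toy_model(prompt: str) -> str:
    """Hypothetical stand-in for a text model; returns canned strings."""
    if prompt.startswith("How would you go about"):
        return "1. Restate the question. 2. Answer it briefly."
    return "A short answer that follows the plan."

result = answer_with_plan(toy_model, "Why is the sky blue?")
print(result["plan"])
print(result["response"])
```

Both calls are still ordinary next-token prediction; the only change is that the second prompt contains the plan produced by the first.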
2 replies

3
A allah@lemm.ee

LOOK MAA I AM ON FRONT PAGE
sp3ctr4l@lemmy.dbzer0.com wrote (last edited by sp3ctr4l@lemmy.dbzer0.com):

#24

This has been known for years; it is the default assumption of how these models work.

You would have to prove that some kind of actual reasoning capacity has arisen as... some kind of emergent complexity phenomenon.... not the other way around.

Corpos have just marketed/gaslit us/themselves so hard that they apparently forgot this.
1 reply

16
M muskymelon@lemmy.world

I use LLMs as advanced search engines. No ads or sponsored results.
kyrgizion@lemmy.world wrote:

#25

There are ads but they're subtle enough that you don't recognize them as such.
1 reply

3
S sev@nullterra.org

Just fancy Markov chains with the ability to link bigger and bigger token sets. It can only ever kick off processing as a response and can never initiate any line of reasoning. This, along with the fact that its working set of data can never be updated moment-to-moment, means that it would be a physical impossibility for any LLM to achieve any real "reasoning" processes.
kescusay@lemmy.world wrote:

#26

I can envision a system where an LLM becomes one part of a reasoning AI, acting as a kind of fuzzy "dataset" that a proper neural network incorporates and reasons with, and the LLM could be kept real-time updated (sort of) with MCP servers that incorporate anything new it learns.

But I don't think we're anywhere near there yet.
2 replies

16
A allah@lemm.ee

LOOK MAA I AM ON FRONT PAGE
mfed1122@discuss.tchncs.de wrote (last edited by mfed1122@discuss.tchncs.de):

#27

This sort of thing has been published a lot for a while now, but why is it assumed that this isn't what human reasoning consists of? Isn't all our reasoning ultimately a form of pattern memorization? I sure feel like it is. So to me all these studies that prove they're "just" memorizing patterns don't prove anything other than that, unless coupled with research on the human brain to prove we do something different.
5 replies

17
M melvin_ferd@lemmy.world

This is why I say these articles are so similar to how right wing media covers issues about immigrants.

There's some weird media push to convince the left to hate AI. Think of all the headlines for these issues. There are so many similarities. They're taking jobs. They are a threat to our way of life. The headlines talk about how they will sexually assault your wife, your children, you. Threats to the environment. There are articles like this where they take something known and twist it to make it sound nefarious to keep the story alive and avoid decay of interest.

Then when they pass laws, we're all primed to accept them removing whatever it is, to their advantage and our disadvantage.
hansolo@lemmy.today wrote:

#28

Because it's a fear-mongering angle that still sells. AI has been a vehicle for sci-fi for so long that trying to convince Boomers that it won't kill us all is the hard part.

I'm a moderate user for code and skeptic of LLM abilities, but 5 years from now when we are leveraging ML models for groundbreaking science and haven't been nuked by SkyNet, all of this will look quaint and silly.
1 reply

8
N nanook@lemm.ee

lol is this news? I mean we call it AI, but it’s just LLMs and variants; it doesn’t think.
johnedwa@sopuli.xyz wrote (last edited by johnedwa@sopuli.xyz):

#29

"It's part of the history of the field of artificial intelligence that every time somebody figured out how to make a computer do something—play good checkers, solve simple but relatively informal problems—there was a chorus of critics to say, 'that's not thinking'." - Pamela McCorduck
It's called the AI Effect.

As Larry Tesler puts it, "AI is whatever hasn't been done yet."
3 replies

26
A allah@lemm.ee

LOOK MAA I AM ON FRONT PAGE
brsrklf@jlai.lu wrote:

#30

You know, despite not really believing LLM "intelligence" works anywhere like real intelligence, I kind of thought maybe being good at recognizing patterns was a way to emulate it to a point...

But that study seems to prove they're still not even good at that. At first I was wondering how hard the puzzles must have been, and then there's a bit about LLMs finishing 100-move Towers of Hanoi (on which they were trained) and failing 4-move river crossings. Logically, those problems are very similar... Also, failing to apply a step-by-step solution they were given.
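To show how mechanical a step-by-step Tower of Hanoi solution is, here is a minimal sketch of the standard textbook recursion plus a checker that replays the moves. It is not the prompt or procedure used in the study; the function names, peg labels, and the choice of 7 disks (127 moves, in the range the comment mentions) are assumptions for illustration.

```python
def hanoi(n, source="A", target="C", spare="B"):
    """Yield the (from_peg, to_peg) moves that transfer n disks to target."""
    if n == 0:
        return
    yield from hanoi(n - 1, source, spare, target)  # clear the way
    yield (source, target)                          # move the largest disk
    yield from hanoi(n - 1, spare, target, source)  # restack on top of it

def is_valid(n, moves):
    """Replay moves; fail if a larger disk ever lands on a smaller one."""
    pegs = {"A": list(range(n, 0, -1)), "B": [], "C": []}
    for src, dst in moves:
        if not pegs[src]:
            return False
        disk = pegs[src].pop()
        if pegs[dst] and pegs[dst][-1] < disk:
            return False
        pegs[dst].append(disk)
    return pegs["C"] == list(range(n, 0, -1))

moves = list(hanoi(7))
print(len(moves))          # 127, i.e. 2**7 - 1
print(is_valid(7, moves))  # True
```

The recursion always produces 2**n - 1 moves, so following it requires no search at all, only bookkeeping.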
2 replies

38
A allah@lemm.ee

LOOK MAA I AM ON FRONT PAGE
naich@lemmings.world wrote:

#31

So they have worked out that LLMs do what they were programmed to do in the way that they were programmed? Shocking.
1 reply

2
1 1rre@discuss.tchncs.de

The difference between reasoning models and normal models is that reasoning models work in two steps. To oversimplify a little, they prompt "how would you go about responding to this?" and then prompt "write the response".

It's still predicting the most likely thing to come next, but the difference is that it gives the chance for the model to write the most likely instructions to follow for the task, then the most likely result of following the instructions - both of which are much more conformant to patterns than a single jump from prompt to response.
kescusay@lemmy.world wrote:

#32

But it still manages to fuck it up.

I've been experimenting with using Claude's Sonnet model in Copilot in agent mode for my job, and one of the things that's become abundantly clear is that it has certain types of behavior that are heavily represented in the model, so it assumes you want that behavior even if you explicitly tell it you don't.

Say you're working in a yarn workspaces project, and you instruct Copilot to build and test a new dashboard using an instruction file. You'll need to include explicit and repeated reminders all throughout the file to use yarn, not NPM, because even though yarn is very popular today, there are so many older examples of using NPM in its model that it's just going to assume that's what you actually want - thereby fucking up your codebase.

I've also had lots of cases where I tell it I don't want it to edit any code, just to analyze and explain something that's there and how to update it... and then I have to stop it from editing code anyway, because halfway through it forgot that I didn't want edits, just explanations.
2 replies

3
M mfed1122@discuss.tchncs.de

This sort of thing has been published a lot for a while now, but why is it assumed that this isn't what human reasoning consists of? Isn't all our reasoning ultimately a form of pattern memorization? I sure feel like it is. So to me all these studies that prove they're "just" memorizing patterns don't prove anything other than that, unless coupled with research on the human brain to prove we do something different.
endmaker@ani.social wrote:

#33

You've hit the nail on the head.

Personally, I wish there were more progress in our understanding of human intelligence.
1 reply

9
M mfed1122@discuss.tchncs.de

This sort of thing has been published a lot for a while now, but why is it assumed that this isn't what human reasoning consists of? Isn't all our reasoning ultimately a form of pattern memorization? I sure feel like it is. So to me all these studies that prove they're "just" memorizing patterns don't prove anything other than that, unless coupled with research on the human brain to prove we do something different.
lesserabe@lemmy.world wrote:

#34

Agreed. We don't seem to have a very cohesive idea of what human consciousness is or how it works.
1 reply

10
4 4am@lemm.ee

No, and to make that work using the current structures we use for creating AI models we’d probably need all the collective computing power on earth at once.
sarge@startrek.website wrote:

#35

...... So you're saying there's a chance?
1 reply

6
