linux-nerds.org

Your browser does not seem to support JavaScript. As a result, your viewing experience will be diminished, and you have been placed in read-only mode.

Please download a browser that supports JavaScript, or enable it if it's disabled (i.e. NoScript).

Apple just proved AI "reasoning" models like Claude, DeepSeek-R1, and o3-mini don't actually reason at all. They just memorize patterns really well.

Technology

356 Beiträge 149 Kommentatoren 1.7k Aufrufe

K kescusay@lemmy.world

I can envision a system where an LLM becomes one part of a reasoning AI, acting as a kind of fuzzy "dataset" that a proper neural network incorporates and reasons with, and the LLM could be kept real-time updated (sort of) with MCP servers that incorporate anything new it learns.

But I don't think we're anywhere near there yet.
R This user is from outside of this forum
R This user is from outside of this forum
riskable@programming.dev

schrieb am zuletzt editiert von

#72

The only reason we're not there yet is memory limitations.

Eventually some company will come out with AI hardware that lets you link up a petabyte of ultra fast memory to chips that contain a million parallel matrix math processors. Then we'll have an entirely new problem: AI that trains itself incorrectly too quickly.

Just you watch: The next big breakthrough in AI tech will come around 2032-2035 (when the hardware is available) and everyone will be bitching that "chain reasoning" (or whatever the term turns out to be) isn't as smart as everyone thinks it is.
1 Antwort Letzte Antwort

7
A allah@lemm.ee

did i do it here? also that's where i live, if i can't talk about womens struggle then i appologize
T This user is from outside of this forum
T This user is from outside of this forum
technocrit@lemmy.dbzer0.com

schrieb am zuletzt editiert von

#73

I don't think that person cares about women or anything else. They just said that they don't even want to hear about it.
1 Antwort Letzte Antwort

3
A allah@lemm.ee

LOOK MAA I AM ON FRONT PAGE
J This user is from outside of this forum
J This user is from outside of this forum
jhex@lemmy.world

schrieb am zuletzt editiert von

#74

this is so Apple, claiming to invent or discover something "first" 3 years later than the rest of the market
P 1 Antwort Letzte Antwort

53
S sarge@startrek.website

...... So you're saying there's a chance?
A This user is from outside of this forum
A This user is from outside of this forum
allah@lemm.ee

schrieb am zuletzt editiert von

#75

10^36 flops to be exact
R 1 Antwort Letzte Antwort

1
T technocrit@lemmy.dbzer0.com

Who is "you"?

Just because some dummies supposedly think that NPCs are "AI", that doesn't make it so. I don't consider checkers to be a litmus test for "intelligence".
G This user is from outside of this forum
G This user is from outside of this forum
grimy@lemmy.world

schrieb am zuletzt editiert von

#76

"You" applies to anyone that doesnt understand what AI means. It's a portmanteau word for a lot of things.

Npcs ARE AI. AI doesnt mean "human level intelligence" and never did. Read the wiki if you need help understanding.
1 Antwort Letzte Antwort

7
A auraithx@lemmy.dbzer0.com

Unlike Markov models, modern LLMs use transformers that attend to full contexts, enabling them to simulate structured, multi-step reasoning (albeit imperfectly). While they don’t initiate reasoning like humans, they can generate and refine internal chains of thought when prompted, and emerging frameworks (like ReAct or Toolformer) allow them to update working memory via external tools. Reasoning is limited, but not physically impossible, it’s evolving beyond simple pattern-matching toward more dynamic and compositional processing.
R This user is from outside of this forum
R This user is from outside of this forum
riskable@programming.dev

schrieb am zuletzt editiert von

#77

I'm not convinced that humans don't reason in a similar fashion. When I'm asked to produce pointless bullshit at work my brain puts in a similar level of reasoning to an LLM.

Think about "normal" programming: An experienced developer (that's self-trained on dozens of enterprise code bases) doesn't have to think much at all about 90% of what they're coding. It's all bog standard bullshit so they end up copying and pasting from previous work, Stack Overflow, etc because it's nothing special.

The remaining 10% is "the hard stuff". They have to read documentation, search the Internet, and then—after all that effort to avoid having to think—they sigh and start actually start thinking in order to program the thing they need.

LLMs go through similar motions behind the scenes! Probably because they were created by software developers but they still fail at that last 90%: The stuff that requires actual thinking.

Eventually someone is going to figure out how to auto-generate LoRAs based on test cases combined with trial and error that then get used by the AI model to improve itself and that is when people are going to be like, "Oh shit! Maybe AGI really is imminent!" But again, they'll be wrong.

AGI won't happen until AI models get good at retraining themselves with something better than basic reinforcement learning. In order for that to happen you need the working memory of the model to be nearly as big as the hardware that was used to train it. That, and loads and loads of spare matrix math processors ready to go for handing that retraining.
1 Antwort Letzte Antwort

5
A allah@lemm.ee

LOOK MAA I AM ON FRONT PAGE
S This user is from outside of this forum
S This user is from outside of this forum
splashjackson@lemmy.ca

schrieb am zuletzt editiert von

#78

Just like me
A 1 Antwort Letzte Antwort

21
C count_dongulus@lemmy.world

Humans apply judgment, because they have emotion. LLMs do not possess emotion. Mimicking emotion without ever actually having the capability of experiencing it is sociopathy. An LLM would at best apply patterns like a sociopath.
R This user is from outside of this forum
R This user is from outside of this forum
riskable@programming.dev

schrieb am zuletzt editiert von riskable@programming.dev

#79

That just means they'd be great CEOs!

According to Wall Street.
1 Antwort Letzte Antwort

0
T technocrit@lemmy.dbzer0.com

why is it assumed that this isn’t what human reasoning consists of?

Because science doesn't work work like that. Nobody should assume wild hypotheses without any evidence whatsoever.

Isn’t all our reasoning ultimately a form of pattern memorization? I sure feel like it is.

You should get a job in "AI". smh.
M This user is from outside of this forum
M This user is from outside of this forum
mfed1122@discuss.tchncs.de

schrieb am zuletzt editiert von

#80

Sorry, I can see why my original post was confusing, but I think you've misunderstood me. I'm not claiming that I know the way humans reason. In fact you and I are on total agreement that it is unscientific to assume hypotheses without evidence. This is exactly what I am saying is the mistake in the statement "AI doesn't actually reason, it just follows patterns". That is unscientific if we don't know whether or "actually reasoning" consists of following patterns, or something else. As far as I know, the jury is out on the fundamental nature of how human reasoning works. It's my personal, subjective feeling that human reasoning works by following patterns. But I'm not saying "AI does actually reason like humans because it follows patterns like we do". Again, I see how what I said could have come off that way. What I mean more precisely is:

It's not clear whether AI's pattern-following techniques are the same as human reasoning, because we aren't clear on how human reasoning works. My intuition tells me that humans doing pattern following seems equally as valid of an initial guess as humans not doing pattern following, so shouldn't we have studies to back up the direction we lean in one way or the other?

I think you and I are in agreement, we're upholding the same principle but in different directions.
1 Antwort Letzte Antwort

3
J johnedwa@sopuli.xyz

It is. And has always been. "Artificial Intelligence" doesn't mean a feeling thinking robot person (that would fall under AGI or artificial conciousness), it's a vast field of research in computer science with many, many things under it.
E This user is from outside of this forum
E This user is from outside of this forum
endmaker@ani.social

schrieb am zuletzt editiert von

#81

ITT: people who obviously did not study computer science or AI at at least an undergraduate level.

Y'all are too patient. I can't be bothered to spend the time to give people free lessons.
A C 2 Antworten Letzte Antwort

7
S sp3ctr4l@lemmy.dbzer0.com

This has been known for years, this is the default assumption of how these models work.

You would have to prove that some kind of actual reasoning capacity has arisen as... some kind of emergent complexity phenomenon.... not the other way around.

Corpos have just marketed/gaslit us/themselves so hard that they apparently forgot this.
R This user is from outside of this forum
R This user is from outside of this forum
riskable@programming.dev

schrieb am zuletzt editiert von

#82

Define, "reasoning". For decades software developers have been writing code with conditionals. That's "reasoning."

LLMs are "reasoning"... They're just not doing human-like reasoning.
S 1 Antwort Letzte Antwort

5
A atlien51@lemm.ee

Employers who are foaming at the mouth at the thought of replacing their workers with cheap AI:

🫢
M This user is from outside of this forum
M This user is from outside of this forum
monkeyslikebananas2@lemmy.world

schrieb am zuletzt editiert von

#83

Can’t really replace. At best, this tech will make employees more productive at the cost of the rainforests.
A 1 Antwort Letzte Antwort

0
T technocrit@lemmy.dbzer0.com

In fact, simple computer programs do a great job of solving these puzzles...

Yes, this shit is very basic. Not at all "intelligent."
M This user is from outside of this forum
M This user is from outside of this forum
mfed1122@discuss.tchncs.de

schrieb am zuletzt editiert von

#84

But reasoning about it is intelligent, and the point of this study is to determine the extent to which these models are reasoning or not. Which again, has nothing to do with emotions. And furthermore, my initial question about whether or not pattern following should automatically be disqualified as intelligence, as the person summarizing this study (and notably not the study itself) claims, is the real question here.
1 Antwort Letzte Antwort

0
K kescusay@lemmy.world

But it still manages to fuck it up.

I've been experimenting with using Claude's Sonnet model in Copilot in agent mode for my job, and one of the things that's become abundantly clear is that it has certain types of behavior that are heavily represented in the model, so it assumes you want that behavior even if you explicitly tell it you don't.

Say you're working in a yarn workspaces project, and you instruct Copilot to build and test a new dashboard using an instruction file. You'll need to include explicit and repeated reminders all throughout the file to use yarn, not NPM, because even though yarn is very popular today, there are so many older examples of using NPM in its model that it's just going to assume that's what you actually want - thereby fucking up your codebase.

I've also had lots of cases where I tell it I don't want it to edit any code, just to analyze and explain something that's there and how to update it... and then I have to stop it from editing code anyway, because halfway through it forgot that I didn't want edits, just explanations.
R This user is from outside of this forum
R This user is from outside of this forum
riskable@programming.dev

schrieb am zuletzt editiert von riskable@programming.dev

#85

To be fair, the world of JavaScript is such a clusterfuck... Can you really blame the LLM for needing constant reminders about the specifics of your project?

When a programming language has five hundred bazillion absolutely terrible ways of accomplishing a given thing—and endless absolutely awful code examples on the Internet to "learn from"—you're just asking for trouble. Not just from trying to get an LLM to produce what you want but also trying to get humans to do it.

This is why LLMs are so fucking good at writing rust and Python: There's only so many ways to do a thing and the larger community pretty much always uses the same solutions.

JavaScript? How can it even keep up? You're using yarn today but in a year you'll probably like, "fuuuuck this code is garbage... I need to convert this all to [new thing]."
K 1 Antwort Letzte Antwort

1
G grimy@lemmy.world

No, it shows how certain people misunderstand the meaning of the word.

You have called npcs in video games "AI" for a decade, yet you were never implying they were somehow intelligent. The whole argument is strangely inconsistent.
I This user is from outside of this forum
I This user is from outside of this forum
initiateofthevoid@lemmy.dbzer0.com

schrieb am zuletzt editiert von initiateofthevoid@lemmy.dbzer0.com

#86

"Artificial" has several meanings.

One is:

not being, showing, or resembling sincere or spontaneous behavior : fake

AI in video games literally means "fake intelligence"
1 Antwort Letzte Antwort

0
M melvin_ferd@lemmy.world

This is why I say these articles are so similar to how right wing media covers issues about immigrants.

There's some weird media push to convince the left to hate AI. Think of all the headlines for these issues. There are so many similarities. They're taking jobs. They are a threat to our way of life. The headlines talk about how they will sexual assault your wife, your children, you. Threats to the environment. There's articles like this where they take something known as twist it to make it sound nefarious to keep the story alive and avoid decay of interest.

Then when they pass laws, we're all primed to accept them removing whatever it is that advantageous them and disadvantageous us.
I This user is from outside of this forum
I This user is from outside of this forum
initiateofthevoid@lemmy.dbzer0.com

schrieb am zuletzt editiert von

#87
Unlike fear-mongering from the right about immigrants, current iterations of AI development:
- literally consume the environment (they are using electricity and water)
- are taking jobs and siphoning money from the economy towards centralized corporate revenue streams that don't pay a fair share of taxes
- I don't know of headlines claiming they will sexually assault you, but many headlines note that they can be used as part of sophisticated catfishing scams, which they are
All of these things aren't scare tactics. They're often overblown and exaggerated for clicks, but the fundamental nature of the technology and corporate implementation of it indisputable.

Open-source AI can change the world for the better. Corporate-controlled AI in some limited cases will improve the world, but without reasonable regulations they will severely harm it first.
1 Antwort Letzte Antwort

0
A auraithx@lemmy.dbzer0.com

Define reason.

Like humans? Of course not. They lack intent, awareness, and grounded meaning. They don’t “understand” problems, they generate token sequences.
R This user is from outside of this forum
R This user is from outside of this forum
reksas@sopuli.xyz

schrieb am zuletzt editiert von

#88

as it is defined in the article
1 Antwort Letzte Antwort

0
A auraithx@lemmy.dbzer0.com

Brother you better hope it does because even if emissions dropped to 0 tonight the planet wouldnt stop warming and it wouldn't stop what's coming for us.
M This user is from outside of this forum
M This user is from outside of this forum
mcasq_qsacj_234@lemmy.zip

schrieb am zuletzt editiert von

#89

If the situation gets dire, it's likely that the weather will be manipulated. Countries would then have to be convinced not to use this for military purposes.
A 1 Antwort Letzte Antwort

0
R riskable@programming.dev

Define, "reasoning". For decades software developers have been writing code with conditionals. That's "reasoning."

LLMs are "reasoning"... They're just not doing human-like reasoning.
S This user is from outside of this forum
S This user is from outside of this forum
sp3ctr4l@lemmy.dbzer0.com

schrieb am zuletzt editiert von sp3ctr4l@lemmy.dbzer0.com

#90

Howabout uh...

The ability to take a previously given set of knowledge, experiences and concepts, and combine or synthesize them in a consistent, non contradictory manner, to generate hitherto unrealized knowledge, or concepts, and then also be able to verify that those new knowledge and concepts are actually new, and actually valid, or at least be able to propose how one could test whether or not they are valid.

Arguably this is or involves meta-cognition, but that is what I would say... is the difference between what we typically think of as 'machine reasoning', and 'human reasoning'.

Now I will grant you that a large amount of humans essentially cannot do this, they suck at introspecting and maintaining logical consistency, that they are just told 'this is how things work', and they never question that untill decades later and their lives force them to address, or dismiss their own internally inconsisten beliefs.

But I would also say that this means they are bad at 'human reasoning'.

Basically, my definition of 'human reasoning' is perhaps more accurately described as 'critical thinking'.
1 Antwort Letzte Antwort

5
A allah@lemm.ee

10^36 flops to be exact
R This user is from outside of this forum
R This user is from outside of this forum
refurbishedrefurbisher@lemmy.sdf.org

schrieb am zuletzt editiert von

#91

That sounds really floppy.
1 Antwort Letzte Antwort

2

Anmelden zum Antworten

D

Elon Musk’s X platform investigated in France for alleged data tampering and fraud
Beobachtet Ignoriert Geplant Angeheftet Gesperrt Verschoben Technology technology
4

1

255 Stimmen

4 Beiträge

41 Aufrufe

T

isnt merz kinda right wing, but not AFD-CRAZY.
E

How social media became a storefront for deadly fake pills
Beobachtet Ignoriert Geplant Angeheftet Gesperrt Verschoben Technology technology
1

1

18 Stimmen

1 Beiträge

12 Aufrufe

Niemand hat geantwortet
T

Converting An E-Paper Photo Frame Into Weather Map
Beobachtet Ignoriert Geplant Angeheftet Gesperrt Verschoben Technology technology
2

1

113 Stimmen

2 Beiträge

21 Aufrufe

I

Looks like East Anglia has basically disappeared. At least nothing of value was lost
R

Streaming overtakes cable and broadcast as the most-watched form of TV
Beobachtet Ignoriert Geplant Angeheftet Gesperrt Verschoben Technology technology
17

1

68 Stimmen

17 Beiträge

82 Aufrufe

H

Set up arrs, you basically set it and forget it.
T

Atom-Thin Tech Replaces Silicon in the World’s First 2D Computer
Beobachtet Ignoriert Geplant Angeheftet Gesperrt Verschoben Technology technology
18

1

125 Stimmen

18 Beiträge

101 Aufrufe

L

The 'laptop' is s conceptual illustration. The image shown on the laptop screen is an actual SEM image.
A

My AI Skeptic Friends Are All Nuts
Beobachtet Ignoriert Geplant Angeheftet Gesperrt Verschoben Technology technology
31

1

13 Stimmen

31 Beiträge

162 Aufrufe

J

I did read it, and my comment is exactly referencing the attitude of the author which is "It's good enough, so you should use it". I disagree, and say it's another dumbass shortcut to cash grab on a less than stellar ecosystem and product. It's training wheels for failure.
A

Prototype of RTX 5090 Appears With Four 16-Pin Power Connectors, Capable of Delivering 2,400W
Beobachtet Ignoriert Geplant Angeheftet Gesperrt Verschoben Technology technology
67

1

131 Stimmen

67 Beiträge

301 Aufrufe

I

Arcing causes more fires, because over current caused all the fires until we tightened standards and dual-mode circuit breakers. Now fires are caused by loose connections arcing, and damaged wires arcing to flammable material. Breakers are specifically designed for a sustained current, but arcing is dangerous because it tends to cascade, light arcing damages contacts, leading to more arcing in a cycle. The real danger of arcing is that it can happen outside of view, and start fires that aren't caught till everything burns down.
E

@chrlschn - Beware the Complexity Merchants
Beobachtet Ignoriert Geplant Angeheftet Gesperrt Verschoben Technology technology
6

1

57 Stimmen

6 Beiträge

61 Aufrufe

S

I'm a big fan of the manta "Make your designs as simple as possible and no simpler". Pointless complexity drives me nuts, but others take it too far and remove functionality by making things too minimal. It doesn't help that a lot of businesses optimize for people who make changes, so the positive feedback loop is change for the sake of change rather than improving the product.