Apple just proved AI "reasoning" models like Claude, DeepSeek-R1, and o3-mini don't actually reason at all. They just memorize patterns really well.
-
No, it shows how certain people misunderstand the meaning of the word.
You have called NPCs in video games "AI" for a decade, yet you were never implying they were somehow intelligent. The whole argument is strangely inconsistent.
Strangely inconsistent + smoke & mirrors = profit!
-
But for something like solving a Towers of Hanoi puzzle, which is what this study is about, we're not looking for emotional judgements - we're trying to evaluate the logical reasoning capabilities. A sociopath would be just as capable of solving logic puzzles as a non-sociopath. In fact, simple computer programs do a great job of solving these puzzles, and they certainly have nothing like emotions. So I'm not sure that emotions have much relevance to the topic of AI or human reasoning and problem solving, at least not this particular aspect of it.
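For reference, a complete Towers of Hanoi solver is only a few lines of recursion; a minimal sketch (just an illustration of how little machinery the puzzle itself needs, not the program used in the study):

```python
def hanoi(n, source, target, spare, moves=None):
    """Return the optimal move list for n disks: 2**n - 1 moves."""
    if moves is None:
        moves = []
    if n == 0:
        return moves
    hanoi(n - 1, source, spare, target, moves)  # clear the way for the largest disk
    moves.append((source, target))              # move the largest disk directly
    hanoi(n - 1, spare, target, source, moves)  # restack the smaller disks on top
    return moves

print(len(hanoi(10, "A", "C", "B")))  # 1023 moves, i.e. 2**10 - 1
```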
As for analogizing LLMs to sociopaths, I think that's a bit odd too. The reason why we (stereotypically) find sociopathy concerning is that a person has their own desires which, in combination with a disinterest in others' feelings, incentivizes them to be deceitful or harmful in some scenarios. But LLMs are largely designed specifically as servile, having no will or desires of their own. If people find it concerning that LLMs imitate emotions, then I think we're giving them far too much credit as sentient autonomous beings - and this is coming from someone who thinks they think in the same way we do! They think like we do, IMO, but they lack a lot of the other subsystems that are necessary for an entity to function in a way that can be considered autonomous/having free will/desires of its own choosing, etc.
In fact, simple computer programs do a great job of solving these puzzles.....
If an AI is trained to do this, it will be very good at it, like when GPT-2 was trained to multiply numbers of up to 20 digits.
https://nitter.net/yuntiandeng/status/1836114419480166585#m
Here they run the same test on GPT-4o, o1-mini and o3-mini.
-
Why would they "prove" something that's completely obvious?
The burden of proof is on the grifters who have overwhelmingly been making false claims and distorting language for decades.
Not when large swaths of people are being told to use it every day. Upper management has bought in on it.
-
This post did not contain any content.
-
This post did not contain any content.
Yah of course they do they’re computers
-
Why would they "prove" something that's completely obvious?
The burden of proof is on the grifters who have overwhelmingly been making false claims and distorting language for decades.
Why would they "prove" something that's completely obvious?
I don’t want to be critical, but I think if you step back a bit and look at what you’re saying, you’re asking why we would bother to experiment and prove what we think we know.
That’s a perfectly normal and reasonable scientific pursuit. Yes, in a rational society the burden of proof would be on the grifters, but that’s never how it actually works. It’s always the doctors disproving the cure-all, not the snake oil salesmen failing to prove their own product.
There is value in this research, even if it fits what you already believe on the subject. I would think you would be thrilled to have your hypothesis confirmed.
-
Yah of course they do they’re computers
That's not really a valid argument for why, but yes the models which use training data to assemble statistical models are all bullshitting. TBH idk how people can convince themselves otherwise.
-
Like what?
I don’t think there’s any search engine better than Perplexity. And for scientific research Consensus is miles ahead.
Through the years I've bounced between different engines. I gave Bing a decent go some years back, mostly because I was interested in gauging the performance and wanted to just pit something against Google. After that I've swapped between Qwant and Startpage a bunch. I'm a big fan of Startpage's "Anonymous view" function.
Since then I've landed on Kagi, which I've used for almost a year now. It's the first search engine I've used that you can make work for you. I use the lens feature to focus on specific tasks, and de-prioritise pages that annoy me, sometimes outright omitting results from sites I find useless or unserious. For example when I'm doing web stuff and need to reference the MDN, I don't really care for w3schools polluting my results.
I'm a big fan of using my own agency and making my own decisions, and the recent trend of making LLMs think for us is something I find rather worrying; it allows for a much subtler manipulation than what Google does with its rankings and sponsor inserts.
Perplexity openly talking about wanting to buy Chrome and harvest basically all the private data is also terrifying, so I wouldn't touch that service with a stick. That said, I appreciate their candour; somehow, being open about being evil is a lot more palatable to me than all these companies pretending to be good.
-
If emissions dropped to 0 tonight, we would be substantially better off than if we maintain our current trajectory. Doomerism helps nobody.
It’s not doomerism, it’s just realistic. Deluding yourself won’t change that.
-
If the situation gets dire, it's likely that the weather will be manipulated. Countries would then have to be convinced not to use this for military purposes.
This isn’t a thing.
-
That's not really a valid argument for why, but yes the models which use training data to assemble statistical models are all bullshitting. TBH idk how people can convince themselves otherwise.
I think because it's language.
There's a famous quote from Charles Babbage: when he presented his difference engine (a gear-based calculator), someone asked "if you put in the wrong figures, will the correct ones be output?", and Babbage couldn't understand how anyone could so thoroughly misunderstand that the machine is just a machine.
People are people; the main thing that's changed since the cuneiform copper customer complaint is our materials science and networking ability. Most things that people interact with every day, they just assume work the way they appear to on the surface.
And nothing other than a person can do math problems or talk back to you. So people assume that means intelligence.
-
I think because it's language.
There's a famous quote from Charles Babbage: when he presented his difference engine (a gear-based calculator), someone asked "if you put in the wrong figures, will the correct ones be output?", and Babbage couldn't understand how anyone could so thoroughly misunderstand that the machine is just a machine.
People are people; the main thing that's changed since the cuneiform copper customer complaint is our materials science and networking ability. Most things that people interact with every day, they just assume work the way they appear to on the surface.
And nothing other than a person can do math problems or talk back to you. So people assume that means intelligence.
I often feel like I'm surrounded by idiots, but even I can't begin to imagine what it must have felt like to be Charles Babbage explaining computers to people in 1840.
-
That's not really a valid argument for why, but yes the models which use training data to assemble statistical models are all bullshitting. TBH idk how people can convince themselves otherwise.
TBH idk how people can convince themselves otherwise.
They don’t convince themselves. They’re convinced by the multi-billion-dollar corporations pouring unholy amounts of money into not only the development of AI, but its marketing. Marketing designed not only to convince them that AI is something it’s not, but also that anyone who says otherwise (like you) is just a luddite who is going to be “left behind”.
-
"It's part of the history of the field of artificial intelligence that every time somebody figured out how to make a computer do something—play good checkers, solve simple but relatively informal problems—there was a chorus of critics to say, 'that's not thinking'." -Pamela McCorduck´.
It's called the AI Effect.As Larry Tesler puts it, "AI is whatever hasn't been done yet.".
Yesterday I asked an LLM "how much energy is stored in a grand piano?" It responded by saying there is no energy stored in a grand piano because it doesn't have a battery.
Any reasoning human would have understood that question to be referring to the tension in the strings.
Another example is asking "does lime cause kidney stones?". It didn't assume I meant lime the mineral and went with lime the citrus fruit instead.
Once again a reasoning human would assume the question is about the mineral.
Ask these questions again in a slightly different way and you might get a correct answer, but it won't be because the LLM was thinking.
-
This post did not contain any content.
What a dumb title. I proved it by asking a series of questions. It’s not AI, stop calling it AI, it’s a dumb af language model. Can you get a ton of help from it, as a tool? Yes! Can it reason? NO! It never could, and for the foreseeable future it will not.
It’s phenomenal at patterns, much much better than us meat peeps. That’s why they’re accurate as hell when it comes to analyzing medical scans.
-
Yesterday I asked an LLM "how much energy is stored in a grand piano?" It responded by saying there is no energy stored in a grand piano because it doesn't have a battery.
Any reasoning human would have understood that question to be referring to the tension in the strings.
Another example is asking "does lime cause kidney stones?". It didn't assume I meant lime the mineral and went with lime the citrus fruit instead.
Once again a reasoning human would assume the question is about the mineral.
Ask these questions again in a slightly different way and you might get a correct answer, but it won't be because the LLM was thinking.
But 90% of "reasoning humans" would answer just the same. Your questions are based on some non-trivial knowledge of physics, chemistry and medicine that most people do not possess.
-
This post did not contain any content.
You assume humans do the opposite? We literally institutionalize humans who do not follow set patterns.
-
Unlike Markov models, modern LLMs use transformers that attend to full contexts, enabling them to simulate structured, multi-step reasoning (albeit imperfectly). While they don’t initiate reasoning like humans, they can generate and refine internal chains of thought when prompted, and emerging frameworks (like ReAct or Toolformer) allow them to update working memory via external tools. Reasoning is limited, but not physically impossible; it’s evolving beyond simple pattern-matching toward more dynamic and compositional processing.
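For a concrete picture of that tool-use idea, here is a rough ReAct-style sketch. `query_model` is a stand-in for whatever completion API you have, and the ACTION/ANSWER line format is just an illustrative convention, not the actual ReAct or Toolformer implementation:

```python
# Sketch of a ReAct-style loop: the model's own text plus tool observations are
# appended to the transcript, which acts as the "working memory" for the next call.
def query_model(prompt: str) -> str:
    raise NotImplementedError("plug in a real LLM call here")

TOOLS = {
    "calc": lambda expr: str(eval(expr, {"__builtins__": {}})),  # toy tool for the demo only
}

def react(question: str, max_steps: int = 5) -> str:
    transcript = f"Question: {question}\n"
    for _ in range(max_steps):
        step = query_model(transcript)            # model emits one line, e.g. "ACTION calc 12*7"
        transcript += step + "\n"
        if step.startswith("ANSWER"):
            return step.removeprefix("ANSWER").strip()
        if step.startswith("ACTION"):
            _, tool, arg = step.split(maxsplit=2)
            transcript += f"OBSERVATION {TOOLS[tool](arg)}\n"  # tool output re-enters the context
    return "no answer within the step budget"
```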
Previous input goes in; a completely static, prebuilt model processes it and comes up with a probability distribution.
There is no "unlike Markov chains". They are Markov chains, just ones with a long context (a Markov chain also makes use of all the context provided to it, so I don't know what you're on about there). LLMs are just a (very) lossy compression scheme for the state transition table: computed once, applied blindly to any context fed in.
-
Previous input goes in; a completely static, prebuilt model processes it and comes up with a probability distribution.
There is no "unlike Markov chains". They are Markov chains, just ones with a long context (a Markov chain also makes use of all the context provided to it, so I don't know what you're on about there). LLMs are just a (very) lossy compression scheme for the state transition table: computed once, applied blindly to any context fed in.
LLMs are not Markov chains, even extended ones. A Markov model, by definition, relies on a fixed-order history and treats transitions as independent of deeper structure. LLMs use transformer attention mechanisms that dynamically weigh relationships between all tokens in the input—not just recent ones. This enables global context modeling, hierarchical structure, and even emergent behaviors like in-context learning. Markov models can't reweight context dynamically or condition on abstract token relationships.
The idea that LLMs are "computed once" and then applied blindly ignores the fact that LLMs adapt their behavior based on input. They don’t change weights during inference, true—but they do adapt responses through soft prompting, chain-of-thought reasoning, or even emulated state machines via tokens alone. That’s a powerful form of contextual plasticity, not blind table lookup.
Calling them “lossy compressors of state transition tables” misses the fact that the “table” they’re compressing is not fixed—it’s context-sensitive and computed in real time using self-attention over high-dimensional embeddings. That’s not how Markov chains work, even with large windows.
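A bare-bones version of the self-attention step being described, to show what "context-sensitive weighting computed in real time" means mechanically: every token's output is a blend of all tokens, and the mixing weights themselves are recomputed from the input. (Single head, no positional encoding or layer norms; a sketch, not a full transformer layer.)

```python
import numpy as np

def attention(X, Wq, Wk, Wv):
    Q, K, V = X @ Wq, X @ Wk, X @ Wv                 # queries, keys, values from the same input
    scores = Q @ K.T / np.sqrt(K.shape[-1])          # every token scored against every other token
    weights = np.exp(scores - scores.max(axis=-1, keepdims=True))
    weights /= weights.sum(axis=-1, keepdims=True)   # softmax rows: the input-dependent mixing weights
    return weights @ V                               # output = context-dependent blend of the values

rng = np.random.default_rng(0)
X = rng.normal(size=(5, 8))                          # 5 tokens, 8-dimensional embeddings
Wq, Wk, Wv = (rng.normal(size=(8, 8)) for _ in range(3))
print(attention(X, Wq, Wk, Wv).shape)                # (5, 8)
```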
-
LLMs are not Markov chains, even extended ones. A Markov model, by definition, relies on a fixed-order history and treats transitions as independent of deeper structure. LLMs use transformer attention mechanisms that dynamically weigh relationships between all tokens in the input—not just recent ones. This enables global context modeling, hierarchical structure, and even emergent behaviors like in-context learning. Markov models can't reweight context dynamically or condition on abstract token relationships.
The idea that LLMs are "computed once" and then applied blindly ignores the fact that LLMs adapt their behavior based on input. They don’t change weights during inference, true—but they do adapt responses through soft prompting, chain-of-thought reasoning, or even emulated state machines via tokens alone. That’s a powerful form of contextual plasticity, not blind table lookup.
Calling them “lossy compressors of state transition tables” misses the fact that the “table” they’re compressing is not fixed—it’s context-sensitive and computed in real time using self-attention over high-dimensional embeddings. That’s not how Markov chains work, even with large windows.
Their input is the context window. Markov chains also use their whole context window. LLMs are a novel implementation that can work with much longer contexts, but as soon as something slides out of the window, it's forgotten, just like any other Markov chain. They don't adapt. You add their token to the context, slide the oldest one out, and then you have a different context, on which you run the same thing again. A normal Markov chain will also give you a different output if you give it a different context. Their biggest weakness is that they don't and can't adapt. You are confusing the encoding of the context with the model itself. Just to see how static the model is, try setting temperature to 0 and giving it the same context, i.e. only try to predict one token with the exact same context each time. As soon as you try to predict a second token, you've just changed the input and run the thing again. It's not adapting; you asked it something different, so it came up with a different answer.
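To illustrate the temperature-0 point with a toy stand-in: here the "model" is just a frozen function from context to scores, and greedy decoding always returns the same token for the same context; any apparent adaptation comes from the context growing, not from the function changing. (A toy, not a real LLM, but the determinism argument is the same.)

```python
import hashlib

VOCAB = ["the", "cat", "sat", "on", "mat"]

def frozen_scores(context):
    # Deterministic pseudo-scores derived only from the context; stands in for fixed weights.
    digest = hashlib.sha256(" ".join(context).encode()).digest()
    return [digest[i] for i in range(len(VOCAB))]

def greedy_next(context):
    scores = frozen_scores(context)
    return VOCAB[scores.index(max(scores))]   # temperature 0: always the argmax

ctx = ["the", "cat"]
print(greedy_next(ctx) == greedy_next(ctx))   # True: same context, same token, every time
ctx.append(greedy_next(ctx))                  # the "new" behaviour comes from a longer context
print(ctx)
```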