Apple just proved AI "reasoning" models like Claude, DeepSeek-R1, and o3-mini don't actually reason at all. They just memorize patterns really well.
-
Why would they "prove" something that's completely obvious?
The burden of proof is on the grifters who have overwhelmingly been making false claims and distorting language for decades.
They’re just using the terminology that’s widespread in the field. In a sense, the paper’s purpose is to prove that this terminology is unsuitable.
-
They’re just using the terminology that’s widespread in the field. In a sense, the paper’s purpose is to prove that this terminology is unsuitable.
I understand that people in this "field" regularly use pseudo-scientific language (I actually deleted that part of my comment).
But the terminology has never been suitable, so it shouldn't be used in the first place. It presupposes the very hypothesis they're supposedly "disproving". They're feeding into the grift because that's what the field is. That's how they all get paid the big bucks.
-
Apple is significantly behind and arrived late to the whole AI hype, so of course it's in their absolute best interest to keep showing how LLMs aren't special or amazingly revolutionary.
They're not wrong, but the motivation is also pretty clear.
They need to convince investors that this delay wasn't due to incompetence. That excuse will only be somewhat effective as long as there isn't an innovation that makes AI more effective.
If that happens, Apple shareholders will, at best, ask the company to increase investment in that area or, at worst, to restructure the company, which could also mean a change in CEO.
-
You know, despite not really believing LLM "intelligence" works anything like real intelligence, I kind of thought maybe being good at recognizing patterns was a way to emulate it to a point...
But that study seems to prove they're still not even good at that. At first I was wondering how hard the puzzles must have been, and then there's a bit about LLMs finishing 100-move Towers of Hanoi (on which they were trained) and failing 4-move river crossings. Logically, those problems are very similar... Also, failing to apply a step-by-step solution they were given.
Computers are awesome at "recognizing patterns" as long as the pattern is a statistical average of some possibly worthless data set. And it really helps if the computer is set up ahead of time to recognize predetermined patterns.
-
"It's part of the history of the field of artificial intelligence that every time somebody figured out how to make a computer do something—play good checkers, solve simple but relatively informal problems—there was a chorus of critics to say, 'that's not thinking'." -Pamela McCorduck´.
It's called the AI Effect.As Larry Tesler puts it, "AI is whatever hasn't been done yet.".
I'm going to write a program to play tic-tac-toe. If y'all don't think it's "AI", then you're just haters. Nothing will ever be good enough for y'all. You want scientific evidence of intelligence?!?! I can't even define intelligence so take that! \s
Seriously tho. This person is arguing that a checkers program is "AI". It kinda demonstrates the loooong history of this grift.
-
No, it shows how certain people misunderstand the meaning of the word.
You have called NPCs in video games "AI" for a decade, yet you were never implying they were somehow intelligent. The whole argument is strangely inconsistent.
Who is "you"?
Just because some dummies supposedly think that NPCs are "AI", that doesn't make it so. I don't consider checkers to be a litmus test for "intelligence".
-
This is why I say these articles are so similar to how right wing media covers issues about immigrants.
There's some weird media push to convince the left to hate AI. Think of all the headlines for these issues. There are so many similarities. They're taking jobs. They are a threat to our way of life. The headlines talk about how they will sexually assault your wife, your children, you. Threats to the environment. There are articles like this one where they take something known and twist it to make it sound nefarious, to keep the story alive and avoid decay of interest.
Then when they pass laws, we're all primed to accept them removing whatever it is that advantages them and disadvantages us.
This is why I say these articles are so similar to how right wing media covers issues about immigrants.
Maybe the actual problem is people who equate computer programs with people.
Then when they pass laws, we're all primed to accept them removing whatever it is that advantages them and disadvantages us.
You mean laws like this? jfc.
-
Because it's a fear-mongering angle that still sells. AI has been a vehicle for sci-fi for so long that trying to convince Boomers that it won't kill us all is the hard part.
I'm a moderate user of LLMs for code and a skeptic of their abilities, but 5 years from now, when we are leveraging ML models for groundbreaking science and haven't been nuked by Skynet, all of this will look quaint and silly.
5 years from now? Or was it supposed to be 5 years ago?
Pretty sure we already have skynet.
-
I'm going to write a program to play tic-tac-toe. If y'all don't think it's "AI", then you're just haters. Nothing will ever be good enough for y'all. You want scientific evidence of intelligence?!?! I can't even define intelligence so take that! \s
Seriously tho. This person is arguing that a checkers program is "AI". It kinda demonstrates the loooong history of this grift.
Yeah that’s exactly what I took from the above comment as well.
I have a pretty simple bar: until we're debating the ethics of turning it off or otherwise giving it rights, it isn't intelligent. No, it's not scientific, but it's a hell of a lot more consistent than what all the AI evangelists espouse. And frankly, if we're talking about the ethics of how to treat something we consider intelligent, we have to go beyond pure scientific benchmarks anyway. It becomes a philosophy/ethics discussion.
Like crypto, it has become a pseudo-religion. Challenges to dogma and orthodoxy are shouted down, and non-believers are not welcome to critique it.
-
The paper doesn't say LLMs can't reason; it shows that their reasoning abilities are limited and collapse under increasing complexity or novel structure.
The paper doesn’t say LLMs can’t reason
Authors gotta get paid. This article is full of pseudo-scientific jargon.
-
Performance eventually collapses due to architectural constraints, and this mirrors cognitive overload in humans: reasoning isn't just about adding compute; it requires mechanisms like abstraction, recursion, and memory. The models' collapse doesn't prove "only pattern matching"; it highlights that today's models simulate reasoning in narrow bands but lack the structure to scale it reliably. That is a limitation of implementation, not a disproof of emergent reasoning.
Performance collapses because luck runs out. Destroying more of the planet won't fix that.
-
This sort of thing has been published a lot for a while now, but why is it assumed that this isn't what human reasoning consists of? Isn't all our reasoning ultimately a form of pattern memorization? I sure feel like it is. So to me all these studies that prove they're "just" memorizing patterns don't prove anything other than that, unless coupled with research on the human brain to prove we do something different.
why is it assumed that this isn’t what human reasoning consists of?
Because science doesn't work like that. Nobody should assume wild hypotheses without any evidence whatsoever.
Isn’t all our reasoning ultimately a form of pattern memorization? I sure feel like it is.
You should get a job in "AI". smh.
-
But for something like solving a Towers of Hanoi puzzle, which is what this study is about, we're not looking for emotional judgements - we're trying to evaluate the logical reasoning capabilities. A sociopath would be equally capable of solving logic puzzles compared to a non-sociopath. In fact, simple computer programs do a great job of solving these puzzles, and they certainly have nothing like emotions. So I'm not sure that emotions have much relevance to the topic of AI or human reasoning and problem solving, at least not this particular aspect of it.
As for analogizing LLMs to sociopaths, I think that's a bit odd too. The reason why we (stereotypically) find sociopathy concerning is that a person has their own desires which, in combination with a disinterest in others' feelings, incentivizes them to be deceitful or harmful in some scenarios. But LLMs are largely designed specifically as servile, having no will or desires of their own. If people find it concerning that LLMs imitate emotions, then I think we're giving them far too much credit as sentient autonomous beings - and this is coming from someone who thinks they think in the same way we do! They think like we do, IMO, but they lack a lot of the other subsystems that are necessary for an entity to function in a way that can be considered autonomous/having free will/desires of its own choosing, etc.
In fact, simple computer programs do a great job of solving these puzzles...
Yes, this shit is very basic. Not at all "intelligent."
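To put "very basic" in perspective: the complete Towers of Hanoi solution is a textbook three-line recursion. A minimal Python sketch (disk count and peg names here are arbitrary, purely illustrative, not taken from the paper):

```python
def hanoi(n, source, target, spare, moves):
    """Append the moves that transfer n disks from source to target."""
    if n == 0:
        return
    hanoi(n - 1, source, spare, target, moves)  # move n-1 disks out of the way
    moves.append((source, target))              # move the largest disk
    hanoi(n - 1, spare, target, source, moves)  # stack the n-1 disks back on top

moves = []
hanoi(7, "A", "C", "B", moves)
print(len(moves))  # 2**7 - 1 = 127 moves
```

Since an n-disk puzzle takes exactly 2^n - 1 moves, the "hundred-move" sequences mentioned upthread correspond to only about seven disks.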
-
You've hit the nail on the head.
Personally, I wish there were more progress in our understanding of human intelligence.
Their argument is that we don't understand human intelligence so we should call computers intelligent.
That's not hitting any nail on the head.
-
Agreed. We don't seem to have a very cohesive idea of what human consciousness is or how it works.
... And so we should call machines "intelligent"? That's not how science works.
-
I'm going to write a program to play tic-tac-toe. If y'all don't think it's "AI", then you're just haters. Nothing will ever be good enough for y'all. You want scientific evidence of intelligence?!?! I can't even define intelligence so take that! \s
Seriously tho. This person is arguing that a checkers program is "AI". It kinda demonstrates the loooong history of this grift.
It is. And it always has been. "Artificial Intelligence" doesn't mean a feeling, thinking robot person (that would fall under AGI or artificial consciousness); it's a vast field of research in computer science with many, many things under it.
-
Performance collapses because luck runs out. Destroying more of the planet won't fix that.
Brother, you better hope it does, because even if emissions dropped to zero tonight the planet wouldn't stop warming, and it wouldn't stop what's coming for us.
-
I can envision a system where an LLM becomes one part of a reasoning AI, acting as a kind of fuzzy "dataset" that a proper neural network incorporates and reasons with, and the LLM could be kept real-time updated (sort of) with MCP servers that incorporate anything new it learns.
But I don't think we're anywhere near there yet.
The only reason we're not there yet is memory limitations.
Eventually some company will come out with AI hardware that lets you link up a petabyte of ultra fast memory to chips that contain a million parallel matrix math processors. Then we'll have an entirely new problem: AI that trains itself incorrectly too quickly.
Just you watch: The next big breakthrough in AI tech will come around 2032-2035 (when the hardware is available) and everyone will be bitching that "chain reasoning" (or whatever the term turns out to be) isn't as smart as everyone thinks it is.
-
did i do it here? also that's where i live, if i can't talk about women's struggle then i apologize
I don't think that person cares about women or anything else. They just said that they don't even want to hear about it.
-
this is so Apple, claiming to invent or discover something "first" 3 years later than the rest of the market