Apple just proved AI "reasoning" models like Claude, DeepSeek-R1, and o3-mini don't actually reason at all. They just memorize patterns really well.
-
Apple is significantly behind and arrived late to the whole AI hype, so of course it's in their absolute best interest to keep showing how LLMs aren't special or amazingly revolutionary.
They're not wrong, but the motivation is also pretty clear.
“Late to the hype” is actually a good thing. Gen AI is a scam wrapped in idiocy wrapped in a joke. That Apple is slow to ape the idiocy of Microsoft is just fine.
-
Brother, you better hope it does, because even if emissions dropped to 0 tonight, the planet wouldn't stop warming, and it wouldn't stop what's coming for us.
If emissions dropped to 0 tonight, we would be substantially better off than if we maintain our current trajectory. Doomerism helps nobody.
-
This is why I say these articles are so similar to how right-wing media covers issues about immigrants.
Maybe the actual problem is people who equate computer programs with people.
Then when they pass laws, we’re all primed to accept them removing whatever it is that advantages them and disadvantages us.
You mean laws like this? jfc.
Literally what I'm talking about. They have been pushing anti-AI propaganda to alienate the left from embracing it while the right embraces it. You have such a blind spot about this that you can't even see you're making my argument for me.
-
No, it shows how certain people misunderstand the meaning of the word.
You have called NPCs in video games "AI" for a decade, yet you were never implying they were somehow intelligent. The whole argument is strangely inconsistent.
Strangely inconsistent + smoke & mirrors = profit!
-
But for something like solving a Towers of Hanoi puzzle, which is what this study is about, we're not looking for emotional judgements - we're trying to evaluate logical reasoning capabilities. A sociopath would be just as capable of solving logic puzzles as a non-sociopath. In fact, simple computer programs do a great job of solving these puzzles, and they certainly have nothing like emotions. So I'm not sure that emotions have much relevance to the topic of AI or human reasoning and problem solving, at least not this particular aspect of it.
As for analogizing LLMs to sociopaths, I think that's a bit odd too. The reason why we (stereotypically) find sociopathy concerning is that a person has their own desires which, in combination with an indifference to others' feelings, incentivizes them to be deceitful or harmful in some scenarios. But LLMs are largely designed specifically to be servile, having no will or desires of their own. If people find it concerning that LLMs imitate emotions, then I think we're giving them far too much credit as sentient autonomous beings - and this is coming from someone who thinks they think in the same way we do! They think like we do, IMO, but they lack a lot of the other subsystems that are necessary for an entity to function in a way that can be considered autonomous/having free will/having desires of its own choosing.
In fact, simple computer programs do a great job of solving these puzzles.....
If an AI is trained to do this, it will be very good at it. For example, GPT-2 was trained to multiply numbers up to 20 digits:
https://nitter.net/yuntiandeng/status/1836114419480166585#m
Here they run the same test on GPT-4o, o1-mini, and o3-mini.
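To make the quoted point about simple programs concrete: the classic recursive Tower of Hanoi solver is only a few lines and is flawless for any number of disks (a generic Python sketch, not code from the study or the linked test):

```python
def hanoi(n, source, target, spare):
    """Print the moves that transfer n disks from source to target."""
    if n == 0:
        return
    hanoi(n - 1, source, spare, target)            # clear n-1 disks out of the way
    print(f"move disk {n}: {source} -> {target}")  # move the largest disk
    hanoi(n - 1, spare, target, source)            # stack the n-1 disks back on top

hanoi(3, "A", "C", "B")  # prints the optimal 2**3 - 1 = 7 moves
```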
-
Why would they "prove" something that's completely obvious?
The burden of proof is on the grifters who have overwhelmingly been making false claims and distorting language for decades.
Not when large swaths of people are being told to use it every day. Upper management has bought into it.
-
Yah, of course they do, they’re computers
-
Why would they "prove" something that's completely obvious?
The burden of proof is on the grifters who have overwhelmingly been making false claims and distorting language for decades.
Why would they "prove" something that's completely obvious?
I don’t want to be critical, but I think if you step back a bit and look at what you’re saying, you’re asking why we would bother to experiment and prove what we think we know.
That’s a perfectly normal and reasonable scientific pursuit. Yes, in a rational society the burden of proof would be on the grifters, but that’s never how it actually works. It’s always the doctors disproving the cure-all, not the snake oil salesmen failing to prove their own product.
There is value in this research, even if it fits what you already believe on the subject. I would think you would be thrilled to have your hypothesis confirmed.
-
Yah, of course they do, they’re computers
That's not really a valid argument for why, but yes the models which use training data to assemble statistical models are all bullshitting. TBH idk how people can convince themselves otherwise.
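To make "using training data to assemble a statistical model" concrete, here's a toy bigram sketch (pure illustration; real LLMs are neural networks and vastly more sophisticated, but the principle of predicting plausible continuations from training statistics is the same):

```python
import random
from collections import defaultdict

# "Train" by counting which word follows which in the training text,
# then generate by sampling from those counts. There is no understanding
# anywhere in this process, just statistics over the data.
training_text = "the cat sat on the mat and the dog sat on the rug"
words = training_text.split()

follows = defaultdict(list)
for prev, nxt in zip(words, words[1:]):
    follows[prev].append(nxt)

word = "the"
output = [word]
for _ in range(8):
    if word not in follows:  # no observed continuation, stop
        break
    word = random.choice(follows[word])
    output.append(word)
print(" ".join(output))  # e.g. "the cat sat on the dog sat on the" - fluent-ish, meaningless
```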
-
Like what?
I don’t think there’s any search engine better than Perplexity. And for scientific research, Consensus is miles ahead.
Through the years I've bounced between different engines. I gave Bing a decent go some years back, mostly because I was interested in gauging the performance and wanted to just pit something against Google. After that I've swapped between Qwant and Startpage a bunch. I'm a big fan of Startpage's "Anonymous view" function.
Since then I've landed on Kagi, which I've used for almost a year now. It's the first search engine I've used that you can make work for you. I use the lens feature to focus on specific tasks, and de-prioritise pages that annoy me, sometimes outright omitting results from sites I find useless or unserious. For example when I'm doing web stuff and need to reference the MDN, I don't really care for w3schools polluting my results.
I'm a big fan of using my own agency and making my own decisions, and the recent trend of having LLMs think for us is something I find rather worrying; it allows for a much subtler manipulation than what Google does with its rankings and sponsored inserts.
Perplexity openly talking about wanting to buy Chrome and harvest basically all your private data is also terrifying, so I wouldn't touch that service with a stick. That said, I appreciate their candour; somehow, being open about being evil is a lot more palatable to me than all these companies pretending to be good.
-
If emissions dropped to 0 tonight, we would be substantially better off than if we maintain our current trajectory. Doomerism helps nobody.
It’s not doomerism, it’s just realism. Deluding yourself won’t change that.
-
If the situation gets dire, it's likely that the weather will be manipulated. Countries would then have to be convinced not to use this for military purposes.
This isn’t a thing.
-
That's not really a valid argument for why, but yes the models which use training data to assemble statistical models are all bullshitting. TBH idk how people can convince themselves otherwise.
I think because it's language.
There's a famous quote from Charles Babbage, from when he presented his difference engine (a gear-based calculator): someone asked "if you put in the wrong figures, will the correct ones be output", and Babbage couldn't understand how someone could so thoroughly misunderstand that the machine is just a machine.
People are people; the main thing that's changed since the cuneiform copper customer complaint is our materials science and networking ability. Most things that people interact with every day, most people just assume work the way they appear to on the surface.
And until now, nothing other than a person could do math problems or talk back to you. So people assume that means intelligence.
-
I think because it's language.
There's a famous quote from Charles Babbage, from when he presented his difference engine (a gear-based calculator): someone asked "if you put in the wrong figures, will the correct ones be output", and Babbage couldn't understand how someone could so thoroughly misunderstand that the machine is just a machine.
People are people; the main thing that's changed since the cuneiform copper customer complaint is our materials science and networking ability. Most things that people interact with every day, most people just assume work the way they appear to on the surface.
And until now, nothing other than a person could do math problems or talk back to you. So people assume that means intelligence.
I often feel like I'm surrounded by idiots, but even I can't begin to imagine what it must have felt like to be Charles Babbage explaining computers to people in 1840.
-
That's not really a valid argument for why, but yes the models which use training data to assemble statistical models are all bullshitting. TBH idk how people can convince themselves otherwise.
TBH idk how people can convince themselves otherwise.
They don’t convince themselves. They’re convinced by the multi-billion-dollar corporations pouring unholy amounts of money into not only the development of AI, but its marketing. Marketing designed to convince them not only that AI is something it’s not, but also that anyone who says otherwise (like you) is just a luddite who is going to be “left behind”.
-
"It's part of the history of the field of artificial intelligence that every time somebody figured out how to make a computer do something—play good checkers, solve simple but relatively informal problems—there was a chorus of critics to say, 'that's not thinking'." -Pamela McCorduck´.
It's called the AI Effect.As Larry Tesler puts it, "AI is whatever hasn't been done yet.".
Yesterday I asked an LLM "how much energy is stored in a grand piano?" It responded by saying there is no energy stored in a grand piano because it doesn't have a battery.
Any reasoning human would have understood that question to be referring to the tension in the strings.
Another example is asking "does lime cause kidney stones?" It didn't assume I meant lime the mineral and went with lime the citrus fruit instead.
Once again, a reasoning human would assume the question is about the mineral.
Ask these questions again in a slightly different way and you might get a correct answer, but it won't be because the LLM was thinking.
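For a rough sense of the answer the model missed (a back-of-the-envelope sketch; every figure below is a loose assumption about a typical grand, not a measurement): the elastic energy stored in a stretched string is U = T²L / (2EA), which puts the whole instrument in the hundreds of joules.

```python
# All values are rough assumptions for a typical grand piano.
T = 750      # tension per string, N (commonly cited as ~700-900 N)
L = 1.0      # average speaking length of a string, m
E = 200e9    # Young's modulus of steel, Pa
A = 7.9e-7   # cross-section of a ~1 mm steel wire, m^2
n = 230      # typical number of strings

per_string = T**2 * L / (2 * E * A)  # elastic strain energy, U = T^2 L / (2 E A)
print(f"~{per_string:.1f} J per string, ~{n * per_string:.0f} J total")
# ~1.8 J per string, ~409 J total: a real, nonzero answer,
# nothing to do with batteries.
```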
-
What a dumb title. I proved it myself by asking a series of questions. It’s not AI, stop calling it AI, it’s a dumb af language model. Can you get a ton of help from it as a tool? Yes! Can it reason? NO! It never could, and for the foreseeable future it will not.
It’s phenomenal at patterns, much, much better than us meat peeps. That’s why these models are accurate as hell when it comes to analyzing medical scans.
-
Yesterday I asked an LLM "how much energy is stored in a grand piano?" It responded by saying there is no energy stored in a grand piano because it doesn't have a battery.
Any reasoning human would have understood that question to be referring to the tension in the strings.
Another example is asking "does lime cause kidney stones?" It didn't assume I meant lime the mineral and went with lime the citrus fruit instead.
Once again, a reasoning human would assume the question is about the mineral.
Ask these questions again in a slightly different way and you might get a correct answer, but it won't be because the LLM was thinking.
But 90% of "reasoning humans" would answer just the same. Your questions are based on some non-trivial knowledge of physics, chemistry and medicine that most people do not possess.
-
You assume humans do the opposite? We literally institutionalize humans who don't follow set patterns.