Apple just proved AI "reasoning" models like Claude, DeepSeek-R1, and o3-mini don't actually reason at all. They just memorize patterns really well.
-
This is why I say these articles are so similar to how right wing media covers issues about immigrants.
There's some weird media push to convince the left to hate AI. Think of the headlines around these issues; the similarities pile up. They're taking jobs. They're a threat to our way of life. The headlines talk about how they will sexually assault your wife, your children, you. They're threats to the environment. Then there are articles like this one, which take something already known and twist it to sound nefarious, keeping the story alive and staving off decay of interest.
Then when they pass laws, we're all primed to accept them removing whatever it is that advantages them and disadvantages us.
Unlike fear-mongering from the right about immigrants, current iterations of AI development:
- literally consume the environment (they consume electricity and water)
- are taking jobs and siphoning money from the economy towards centralized corporate revenue streams that don't pay a fair share of taxes
- I don't know of headlines claiming they will sexually assault you, but many headlines note that they can be used as part of sophisticated catfishing scams, which they are
None of these things are mere scare tactics. They're often overblown and exaggerated for clicks, but the fundamental nature of the technology, and the corporate implementation of it, is indisputable.
Open-source AI can change the world for the better. Corporate-controlled AI in some limited cases will improve the world, but without reasonable regulations they will severely harm it first.
-
Define reason.
Like humans? Of course not. They lack intent, awareness, and grounded meaning. They don’t “understand” problems, they generate token sequences.
as it is defined in the article
-
Brother, you better hope it does, because even if emissions dropped to 0 tonight the planet wouldn't stop warming, and it wouldn't stop what's coming for us.
If the situation gets dire, it's likely that the weather will be manipulated. Countries would then have to be convinced not to use this for military purposes.
-
Define, "reasoning". For decades software developers have been writing code with conditionals. That's "reasoning."
LLMs are "reasoning"... They're just not doing human-like reasoning.
How about, uh...
The ability to take a previously given set of knowledge, experiences, and concepts, and combine or synthesize them in a consistent, non-contradictory manner to generate hitherto unrealized knowledge or concepts, and then also be able to verify that those new knowledge and concepts are actually new and actually valid, or at least be able to propose how one could test whether or not they are valid.
Arguably this is or involves meta-cognition, but that is what I would say... is the difference between what we typically think of as 'machine reasoning', and 'human reasoning'.
Now I will grant you that a large number of humans essentially cannot do this. They suck at introspecting and maintaining logical consistency; they are just told 'this is how things work', and they never question it until, decades later, their lives force them to address or dismiss their own internally inconsistent beliefs.
But I would also say that this means they are bad at 'human reasoning'.
Basically, my definition of 'human reasoning' is perhaps more accurately described as 'critical thinking'.
-
10^36 flops to be exact
That sounds really floppy.
-
To be fair, the world of JavaScript is such a clusterfuck... Can you really blame the LLM for needing constant reminders about the specifics of your project?
When a programming language has five hundred bazillion absolutely terrible ways of accomplishing a given thing—and endless absolutely awful code examples on the Internet to "learn from"—you're just asking for trouble. Not just from trying to get an LLM to produce what you want but also trying to get humans to do it.
This is why LLMs are so fucking good at writing Rust and Python: there are only so many ways to do a thing, and the larger community pretty much always uses the same solutions.
JavaScript? How can it even keep up? You're using yarn today, but in a year you'll probably be like, "fuuuuck, this code is garbage... I need to convert this all to [new thing]."
That's only part of the problem. Yes, JavaScript is a fragmented clusterfuck. TypeScript is leagues better, but by no means perfect. Still, that doesn't explain why the LLM can't recall that I'm using Yarn while it's processing the instruction that specifically told it to use Yarn. Or why it tries to start editing code when I tell it not to. Those are issues that aren't specific to the language.
-
Just like me
python code for reversing the linked list.
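Since the prompt in question is the classic "reverse a linked list", here is a minimal iterative sketch of what that looks like (the `Node` class and names are my own illustration, not anything from the thread):

```python
class Node:
    """A node in a singly linked list."""
    def __init__(self, value, next=None):
        self.value = value
        self.next = next

def reverse(head):
    """Reverse a singly linked list by re-pointing each node's next link."""
    prev = None
    while head:
        # Tuple assignment evaluates the right side first,
        # so all three links update from the pre-step values.
        head.next, prev, head = prev, head, head.next
    return prev

# Build 1 -> 2 -> 3, reverse it, and walk the result
head = Node(1, Node(2, Node(3)))
rev = reverse(head)
values = []
while rev:
    values.append(rev.value)
    rev = rev.next
# values is now [3, 2, 1]
```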
-
Apple is significantly behind and arrived late to the whole AI hype, so of course it's in their absolute best interest to keep showing how LLMs aren't special or amazingly revolutionary.
They're not wrong, but the motivation is also pretty clear.
“Late to the hype” is actually a good thing. Gen AI is a scam wrapped in idiocy wrapped in a joke. That Apple is slow to ape the idiocy of Microsoft is just fine.
-
Brother, you better hope it does, because even if emissions dropped to 0 tonight the planet wouldn't stop warming, and it wouldn't stop what's coming for us.
If emissions dropped to 0 tonight, we would be substantially better off than if we maintain our current trajectory. Doomerism helps nobody.
-
This is why I say these articles are so similar to how right wing media covers issues about immigrants.
Maybe the actual problem is people who equate computer programs with people.
Then when they pass laws, we’re all primed to accept them removing whatever it is that advantages them and disadvantages us.
You mean laws like this? jfc.
Literally what I'm talking about. They have been pushing anti-AI propaganda to alienate the left from embracing it while the right embraces it. You have such a blind spot to this, you can't even see you're making my argument for me.
-
No, it shows how certain people misunderstand the meaning of the word.
You have called NPCs in video games "AI" for a decade, yet you were never implying they were somehow intelligent. The whole argument is strangely inconsistent.
Strangely inconsistent + smoke & mirrors = profit!
-
But for something like solving a Towers of Hanoi puzzle, which is what this study is about, we're not looking for emotional judgements - we're trying to evaluate the logical reasoning capabilities. A sociopath would be equally capable of solving logic puzzles compared to a non-sociopath. In fact, simple computer programs do a great job of solving these puzzles, and they certainly have nothing like emotions. So I'm not sure that emotions have much relevance to the topic of AI or human reasoning and problem solving, at least not this particular aspect of it.
As for analogizing LLMs to sociopaths, I think that's a bit odd too. The reason why we (stereotypically) find sociopathy concerning is that a person has their own desires which, in combination with a disinterest in others' feelings, incentivizes them to be deceitful or harmful in some scenarios. But LLMs are largely designed specifically as servile, having no will or desires of their own. If people find it concerning that LLMs imitate emotions, then I think we're giving them far too much credit as sentient autonomous beings - and this is coming from someone who thinks they think in the same way we do! They think like we do, IMO, but they lack a lot of the other subsystems that are necessary for an entity to function in a way that can be considered as autonomous/having free will/desires of its own choosing, etc.
In fact, simple computer programs do a great job of solving these puzzles.....
If an AI is trained to do this, it will be very good at it, as for example when GPT-2 was trained to multiply numbers of up to 20 digits.
https://nitter.net/yuntiandeng/status/1836114419480166585#m
Here they do the same test to GPT-4o, o1-mini and o3-mini
-
Why would they "prove" something that's completely obvious?
The burden of proof is on the grifters who have overwhelmingly been making false claims and distorting language for decades.
Not when large swaths of people are being told to use it everyday. Upper management has bought in on it.
-
This post did not contain any content.
-
This post did not contain any content.
Yah of course they do they’re computers
-
Why would they "prove" something that's completely obvious?
The burden of proof is on the grifters who have overwhelmingly been making false claims and distorting language for decades.
Why would they "prove" something that's completely obvious?
I don’t want to be critical, but I think if you step back a bit and look at what you’re saying, you’re asking why we would bother to experiment and prove what we think we know.
That’s a perfectly normal and reasonable scientific pursuit. Yes, in a rational society the burden of proof would be on the grifters, but that’s never how it actually works. It’s always the doctors disproving the cure-all, not the snake oil salesmen failing to prove their own product.
There is value in this research, even if it fits what you already believe on the subject. I would think you would be thrilled to have your hypothesis confirmed.
-
Yah of course they do they’re computers
That's not really a valid argument for why, but yes, the models which use training data to assemble statistical models are all bullshitting. TBH, idk how people can convince themselves otherwise.
-
Like what?
I don’t think there’s any search engine better than Perplexity. And for scientific research Consensus is miles ahead.
Through the years I've bounced between different engines. I gave Bing a decent go some years back, mostly because I was interested in gauging the performance and wanted to just pit something against Google. After that I've swapped between Qwant and Startpage a bunch. I'm a big fan of Startpage's "Anonymous view" function.
Since then I've landed on Kagi, which I've used for almost a year now. It's the first search engine I've used that you can make work for you. I use the lens feature to focus on specific tasks, and de-prioritise pages that annoy me, sometimes outright omitting results from sites I find useless or unserious. For example when I'm doing web stuff and need to reference the MDN, I don't really care for w3schools polluting my results.
I'm a big fan of using my own agency and making my own decisions, and the recent trend of making LLMs think for us is something I find rather worrying; it allows for a much subtler manipulation than what Google does with its rankings and sponsored inserts.
Perplexity openly talking about wanting to buy Chrome and harvesting basically all the private data is also terrifying, thus I wouldn't touch that service with a stick. That said, I appreciate their candour, somehow being open about being evil is a lot more palatable to me than all these companies pretending to be good.
-
If emissions dropped to 0 tonight, we would be substantially better off than if we maintain our current trajectory. Doomerism helps nobody.
It’s not doomerism, it’s just realistic. Deluding yourself won’t change that.
-
If the situation gets dire, it's likely that the weather will be manipulated. Countries would then have to be convinced not to use this for military purposes.
This isn’t a thing.
-
Gig Companies Violate Workers’ Rights: Amazon Flex, DoorDash, Favor, Instacart, Lyft, Shipt, and Uber claim to offer workers flexibility but end up paying them less than state or local minimum wages.
-
-
Mozilla is Introducing 'Terms of Use' to Firefox | Also about to go into effect is an updated privacy notice