linux-nerds.org

I'm looking for an article showing that LLMs don't know how they work internally

Technology

80 posts, 32 commenters, 1.5k views

lgsp@feddit.it

I found the article in a post on the fediverse, and I can't find it anymore.

The researchers asked an LLM a simple mathematical question (like 7+4) and could then see how it worked internally: it found similar paths, but did nothing like performing mathematical reasoning, even though the final answer was correct.

Then they asked the LLM to explain how it found the result, i.e. what its internal reasoning was. The answer was detailed, step-by-step mathematical logic, like a human explaining how to perform an addition.

This showed 2 things:
- LLMs don't "know" how they work
- the second answer was a rephrasing of original text used for training that explains how math works, so the LLM just used that as an explanation

I think it was a very interesting and meaningful analysis.

Can anyone help me find this?

EDIT: thanks to @theunknownmuncher@lemmy.world, it's this one: https://www.anthropic.com/research/tracing-thoughts-language-model

EDIT2: I'm aware LLMs don't "know" anything and don't reason, and that's exactly why I wanted to find the article. Some more details here: https://feddit.it/post/18191686/13815095

franzcoz@feddit.cl

#10

There was a study by Anthropic, the company behind Claude. They developed another AI that they used as a sort of "brain scanner" for the LLM, in the sense that it let them see a sort of model of how the LLM's "internal process" worked.

glizzyguzzler@lemmy.blahaj.zone

Can’t help, but here’s a rant on people asking LLMs to “explain their reasoning”, which is impossible because they can never reason (not meant to be attacking OP, just attacking the “LLMs think and reason” people and the companies that spout it):

LLMs are just matrix math to complete the most likely next word. They don’t know anything and can’t reason.

Anything you read or hear about LLMs or “AI” getting “asked questions” or asked to “explain its reasoning”, or talk about how they’re “thinking”, is just AI propaganda to make you think they’re doing something LLMs literally can’t do but people sure wish they could.

In this case it sounds like people who don’t understand how LLMs work are eating that propaganda up and approaching LLMs like there’s something to talk to or discern from.

If you waste egregiously high amounts of gigawatts to put everything that’s ever been typed into matrices you can operate on, you get a facsimile of the human knowledge that went into typing all of that stuff.

It’d be impressive if the environmental toll making the matrices and using them wasn’t critically bad.

TLDR; LLMs can never think or reason, and anyone talking about them thinking or reasoning is bullshitting. They utilize almost everything that’s ever been typed to give (occasionally) reasonably useful outputs that are the most basic bitch shit, because that’s the most likely next word, at the cost of environmental disaster.

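For anyone wondering what "matrix math to complete the most likely next word" looks like mechanically, here is a toy sketch of just the final step. Everything in it is an assumption made up for illustration: the five-entry vocabulary, the hand-picked weights, the `hidden_state` values and the function names. No real LLM is this small, and the sketch takes no side on whether the layers that produce the hidden state amount to reasoning.

```python
import math

# Hypothetical toy vocabulary; a real model has tens of thousands of tokens.
VOCAB = ["7", "+", "4", "=", "11"]


def matvec(matrix, vector):
    """Multiply a matrix (a list of rows) by a vector."""
    return [sum(w * x for w, x in zip(row, vector)) for row in matrix]


def softmax(logits):
    """Turn raw scores into probabilities that sum to 1."""
    peak = max(logits)  # subtract the max for numerical stability
    exps = [math.exp(v - peak) for v in logits]
    total = sum(exps)
    return [e / total for e in exps]


def next_token(hidden_state, output_weights):
    """Score every vocabulary entry and return the most likely one."""
    logits = matvec(output_weights, hidden_state)
    probs = softmax(logits)
    best = max(range(len(VOCAB)), key=lambda i: probs[i])
    return VOCAB[best], probs


# Made-up numbers standing in for the model's state after reading "7 + 4 =".
hidden_state = [0.5, -1.0, 2.0]

# Made-up output weights: one row per vocabulary entry.
output_weights = [
    [0.1, 0.0, 0.2],
    [0.0, 0.3, -0.1],
    [0.2, 0.1, 0.0],
    [-0.3, 0.2, 0.1],
    [0.4, -0.5, 1.0],
]

token, probs = next_token(hidden_state, output_weights)
print(token)  # the highest-scoring entry of VOCAB
```

Picking the single highest-probability entry (greedy decoding) is only one option; deployed systems commonly sample from the distribution instead. The open question in this thread is what happens in the many layers that compute the hidden state, which this sketch simply takes as given.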
treczoks@lemmy.world

#11

I've read that article. They used something they called an "MRI for AIs" and checked, for example, how an AI handled math questions. Then they asked the AI how it came to that answer, and the pathways actually differed: while the AI talked about using a textbook method, it actually took a different approach. That's what I remember of that article.

But yes, it exists, and it is science, not TikTok.

lgsp@feddit.it

I'm aware of this and agree, but:
- I see that asking an LLM how it got to its answer, as "proof" of sound reasoning, has become common
- this new trend of "reasoning" models, where an internal conversation is shown in all its steps, seems to be based on the assumption that the train of thought is trustworthy. Given the simple experiment I mentioned, that is extremely dangerous and misleading
- take a look at this video: https://youtube.com/watch?v=Xx4Tpsk_fnM : everything is based on observing and directing this internal reasoning, and these guys are computer scientists. How can they trust this?

So having a well-written article at hand is a good idea imho.

blue_morpho@lemmy.world

#12

I only follow some YouTubers like Digital Spaceport, but there has been a lot of progress since years ago, when LLMs were only predictive. They now have an inductive engine attached to the LLM to provide logic guard rails.

peoplebeproblems@midwest.social

#13

People don't understand what "model" means. That's the unfortunate reality.
theunknownmuncher@lemmy.world

It's true that LLMs aren't "aware" of what internal steps they are taking, so asking an LLM how it reasoned out an answer will just output text that statistically sounds right based on its training set, but to say something like "they can never reason" is provably false.

It's obvious that you have a bias and desperately want reality to confirm it, but there's been significant research and progress in tracing internals of LLMs, that show logic, planning, and reasoning.

EDIT: lol you can downvote me but it doesn't change evidence-based research

"It’d be impressive if the environmental toll making the matrices and using them wasn’t critically bad."

Developing a AAA video game has a higher carbon footprint than training an LLM, and running inference uses significantly less power than playing that same video game.

glizzyguzzler@lemmy.blahaj.zone

#14

Too deep on the AI propaganda there, it’s completing the next word. You can give the LLM base umpteen layers to make complicated connections, it still ain’t thinking.

The LLM corpos are trying to get nuclear plants to power their gigantic data centers while AAA devs aren’t trying to buy nuclear plants, which says that’s a straw man and that you are simultaneously also wrong.

Using a pre-trained and memory-crushed LLM that can run on a small device won’t take up too much power. But that’s not what you’re thinking of. You’re thinking of the LLM only accessible via ChatGPT’s API, which has a yuge context length and massive matrices that need hilariously large amounts of RAM and compute power to execute. And it’s still a facsimile of thought.

It’s okay that they suck and have very niche actual use cases; maybe it’ll get us to something better. But they ain’t gold, they ain’t smart, and they ain’t worth destroying the planet.

theunknownmuncher@lemmy.world

#15

"it's completing the next word."

Facts disagree, but you've decided to live in a reality that matches your biases despite real evidence, so whatever.

annebonny@lemmy.dbzer0.com

#16

How would you prove that someone or something is capable of reasoning or thinking?
annebonny@lemmy.dbzer0.com

#17

Who has claimed that LLMs have the capacity to reason?
adespoton@lemmy.ca

#18

The study being referenced explains in detail why they can’t. So I’d say it’s Anthropic who stated LLMs don’t have the capacity to reason, and that’s what we’re discussing.

The popular media tends to go on and on, conflating AI with AGI and synthetic reasoning.

adespoton@lemmy.ca

#19

They walk down runways and pose for magazines. Do they reason? Sometimes.
ohwhatfollyisman@lemmy.world

#20

"but there's been significant research and progress in tracing internals of LLMs, that show logic, planning, and reasoning."

would there be a source for such research?

bodilessgaze@sh.itjust.works

#21

I don't know how I work. I couldn't tell you much about neuroscience beyond "neurons are linked together and somehow that creates thoughts". And even when it comes to complex thoughts, I sometimes can't explain why. At my job, I often lean on intuition I've developed over a decade. I can look at a system and get an immediate sense if it's going to work well, but actually explaining why or why not takes a lot more time and energy. Am I an LLM?
theparadox@lemmy.world

#22

More than enough people who claim to know how it works think it might be "evolving" into a sentient being inside its little black box. Example from a conversation I gave up on...
https://sh.itjust.works/comment/18759960

markovs_gun@lemmy.world

#23

"Researchers" did a thing I did the first day I was actually able to use ChatGPT, and came to a conclusion that is in the disclaimers on the ChatGPT website. Can I get paid to do this kind of "research"? If you've read even a cursory article about how LLMs work, you'd know that asking them what their reasoning is for anything doesn't work, because the answer would always just be an explanation of how LLMs work generally.

glizzyguzzler@lemmy.blahaj.zone

#24

You can prove it’s not by doing some matrix multiplication and seeing that it’s matrix multiplication. Much easier way to go about it.

glizzyguzzler@lemmy.blahaj.zone

#25

It’s literally tokens. Doesn’t matter if it completes the next word or the next phrase, it’s still completing the next most likely token. Can’t think, can’t reason; can witch’s-brew a facsimile of something done before.

theunknownmuncher@lemmy.world

#26

https://www.anthropic.com/research/tracing-thoughts-language-model for one, the exact article OP was asking for
ohwhatfollyisman@lemmy.world

#27

but this article espouses that llms do the opposite of logic, planning, and reasoning?

quoting:

Claude, on occasion, will give a plausible-sounding argument designed to agree with the user rather than to follow logical steps. We show this by asking it for help on a hard math problem while giving it an incorrect hint. We are able to “catch it in the act” as it makes up its fake reasoning,

are there any sources which show that llms use logic, conduct planning, and reason (as was asserted in the 2nd level comment)?
incogcyberspaceuser@lemmy.world

#28

But why male models?
theunknownmuncher@lemmy.world

#29

No, you're misunderstanding the findings. It does show that LLMs do not explain their actual reasoning when asked, which makes sense and is expected. They do not have access to their inner workings, so they generate a response that "sounds" right, but tracing their internal logic shows they operate differently from what they claim when asked. You can't ask an LLM to explain its own reasoning. But the article shows how they've made progress with tracing under the hood, and the surprising results they found about how it is able to do things like plan ahead, which defeats the misconception that it is just "autocomplete".
