linux-nerds.org

Your browser does not seem to support JavaScript. As a result, your viewing experience will be diminished, and you have been placed in read-only mode.

Please download a browser that supports JavaScript, or enable it if it's disabled (i.e. NoScript).

I'm looking for an article showing that LLMs don't know how they work internally

Technology

80 Beiträge 32 Kommentatoren 5 Aufrufe

E excrubulent@slrpnk.net
If all you're saying is that neural networks could develop consciousness one day, sure, and nothing I said contradicts that. Our brains are neural networks, so it stands to reason they could do what our brains can do. But the technical hurdles are huge.

You need at least two things to get there:
1. Enough computing power to support it.
2. Insight into how consciousness is structured.
1 is hard because a single brain alone is about as powerful as a significant chunk of worldwide computing, the gulf between our current power and what we would need is about... 100% of what we would need. We are so woefully under resourced for that. You also need to solve how to power the computers without cooking the planet, which is not something we're even close to solving currently.

2 means that we can't just throw more power or training at the problem. Modern NN modules have an underlying theory that makes them work. They're essentially statistical curve-fitting machines. We don't currently have a good theoretical model that would allow us to structure the NN to create a consciousness. It's not even on the horizon yet.

Those are two enormous hurdles. I think saying modern NN design can create consciousness is like Jules Verne in 1867 saying we can get to the Moon with a cannon because of "what progress artillery science has made in the last few years".

Moon rockets are essentially artillery science in many ways, yes, but Jules Verne was still a century away in terms of supporting technologies, raw power, and essential insights into how to do it.
V This user is from outside of this forum
V This user is from outside of this forum
voldemort@lemmy.world

schrieb zuletzt editiert von

#69

We're on the same page about consciousness then. My original comment only pointed out that current AI have problems that we have because they replicate how we work and it seems that people don't like recognising that very obvious fact that we have the exact problems that LLMs have. LLMs aren't rational because we inherently are not rational. That was the only point I was originally trying to make.

For AGI or UGI to exist, massive hurdles will need to be made, likely an entire restructuring of it. I think LLMs will continue to get smarter and likely exceed us but it will not be perfect without a massive rework.

Personally and this is pure speculation, I wouldn't be surprised if AGI or UGI is only possible with the help of a highly advanced AI. Similar to how microbiologist are only now starting to unravel protein synthesis with the help of AI. I think the shear volume of data that needs processing requires something like a highly evolved AI to understand, and that current technology is purely a stepping stone for something more.
E 1 Antwort Letzte Antwort

0
V voldemort@lemmy.world

We're on the same page about consciousness then. My original comment only pointed out that current AI have problems that we have because they replicate how we work and it seems that people don't like recognising that very obvious fact that we have the exact problems that LLMs have. LLMs aren't rational because we inherently are not rational. That was the only point I was originally trying to make.

For AGI or UGI to exist, massive hurdles will need to be made, likely an entire restructuring of it. I think LLMs will continue to get smarter and likely exceed us but it will not be perfect without a massive rework.

Personally and this is pure speculation, I wouldn't be surprised if AGI or UGI is only possible with the help of a highly advanced AI. Similar to how microbiologist are only now starting to unravel protein synthesis with the help of AI. I think the shear volume of data that needs processing requires something like a highly evolved AI to understand, and that current technology is purely a stepping stone for something more.
E This user is from outside of this forum
E This user is from outside of this forum
excrubulent@slrpnk.net

schrieb zuletzt editiert von excrubulent@slrpnk.net

#70

We don't have the same problems LLMs have.

LLMs have zero fidelity. They have no - none - zero - model of the world to compare their output to.

Humans have biases and problems in our thinking, sure, but we're capable of at least making corrections and working with meaning in context. We can recognise our model of the world and how it relates to the things we are saying.

LLMs cannot do that job, at all, and they won't be able to until they have a model of the world. A model of the world would necessarily include themselves, which is self-awareness, which is AGI. That's a meaning-understander. Developing a world model is the same problem as consciousness.

What I'm saying is that you cannot develop fidelity at all without AGI, so no, LLMs don't have the same problems we do. That is an entirely different class of problem.

Some moon rockets fail, but they don't have that in common with moon cannons. One of those can in theory achieve a moon landing and the other cannot, ever, in any iteration.
1 Antwort Letzte Antwort

0
T theunknownmuncher@lemmy.world

LOL you didn't really make the point you thought you did. It isn't an "improper comparison" (it's called a false equivalency FYI), because there isn't a real distinction between information and this thing you just made up called "basic action on data", but anyway have it your way:

Your comment is still exactly like saying an audio pipeline isn't really playing music because it's actually just doing basic math.
G This user is from outside of this forum
G This user is from outside of this forum
glizzyguzzler@lemmy.blahaj.zone

schrieb zuletzt editiert von

#71

I was channeling the Interstellar docking computer (“improper contact” in such a sassy voice)

There is a distinction between data and an action you perform on data (matrix maths, codec algorithm, etc.). It’s literally completely different.

An audio codec (not a pipeline) is just actually doing math - just like the workings of an LLM. There’s plenty of work to be done after the audio codec decodes the m4a to get to tunes in your ears. Same for an LLM, sandwiching those matrix multiplications that make the magic happen are layers that crunch the prompts and assemble the tokens you see it spit out.

LLMs can’t think, that’s just the fact of how they work. The problem is that AI companies are happy to describe them in terms that make you think they can think to sell their product! I literally cannot be wrong that LLMs cannot think or reason, there’s no room for debate, it’s settled long ago. AI companies will string the LLMs together and let them chew for a while to try make themselves catch when they’re dropping bullshit. It’s still not thinking and reasoning though. They can be useful tools, but LLMs are just tools not sentient or verging on sentient
T 1 Antwort Letzte Antwort

0
D darkdarkhouse@lemmy.sdf.org

Do LLMs not exhibit emergent behaviour? But who am I, a simple skin-bag of chemicals, to really say.
G This user is from outside of this forum
G This user is from outside of this forum
glizzyguzzler@lemmy.blahaj.zone

schrieb zuletzt editiert von

#72

They do not, and I, a simple skin-bag of chemicals (mostly water tho) do say
1 Antwort Letzte Antwort

0
W whaleross@lemmy.world

People that can not do Matrix multiplication do not possess the basic concepts of intelligence now? Or is software that can do matrix multiplication intelligent?
G This user is from outside of this forum
G This user is from outside of this forum
glizzyguzzler@lemmy.blahaj.zone

schrieb zuletzt editiert von

#73

So close, LLMs work via matrix multiplication, which is well understood by many meat bags and matrix math can’t think. If a meat bag can’t do matrix math, that’s ok, because the meat bag doesn’t work via matrix multiplication. lol imagine forgetting how to do matrix multiplication and disappearing into a singularity or something
W 1 Antwort Letzte Antwort

1
T theunknownmuncher@lemmy.world

To write the second line, the model had to satisfy two constraints at the same time: the need to rhyme (with "grab it"), and the need to make sense (why did he grab the carrot?). Our guess was that Claude was writing word-by-word without much forethought until the end of the line, where it would make sure to pick a word that rhymes. We therefore expected to see a circuit with parallel paths, one for ensuring the final word made sense, and one for ensuring it rhymes.

Instead, we found that Claude plans ahead. Before starting the second line, it began "thinking" of potential on-topic words that would rhyme with "grab it". Then, with these plans in mind, it writes a line to end with the planned word.

actually read the research?
G This user is from outside of this forum
G This user is from outside of this forum
glizzyguzzler@lemmy.blahaj.zone

schrieb zuletzt editiert von

#74

No, they’re right. The “research” is biased by the company that sells the product and wants to hype it. Many layers don’t make think or reason, but they’re glad to put them in quotes that they hope peeps will forget were there.
1 Antwort Letzte Antwort

0
G glizzyguzzler@lemmy.blahaj.zone

So close, LLMs work via matrix multiplication, which is well understood by many meat bags and matrix math can’t think. If a meat bag can’t do matrix math, that’s ok, because the meat bag doesn’t work via matrix multiplication. lol imagine forgetting how to do matrix multiplication and disappearing into a singularity or something
W This user is from outside of this forum
W This user is from outside of this forum
whaleross@lemmy.world

schrieb zuletzt editiert von

#75

Well, on the other hand. Meat bags can't really do neuron stuff either, despite that is essential for any meat bag operation. Humans are still here though and so are dogs.
1 Antwort Letzte Antwort

0
W wolflink@sh.itjust.works

The environmental toll doesn’t have to be that bad. You can get decent results from single high-end gaming GPU.
G This user is from outside of this forum
G This user is from outside of this forum
glizzyguzzler@lemmy.blahaj.zone

schrieb zuletzt editiert von

#76

You can, but the stuff that’s really useful (very competent code completion) needs gigantic context lengths that even rich peeps with $2k GPUs can’t do. And that’s ignoring the training power and hardware costs to get the models.

Techbros chasing VC funding are pushing LLMs to the physical limit of what humanity can provide power and hardware-wise. Way less hype and letting them come to market organically in 5/10 years would give the LLMs a lot more power efficiency at the current context and depth limits. But that ain’t this timeline, we just got VC money looking to buy nuclear plants and fascists trying to subdue the US for the techbro oligarchs womp womp
1 Antwort Letzte Antwort

0
G glizzyguzzler@lemmy.blahaj.zone

I was channeling the Interstellar docking computer (“improper contact” in such a sassy voice)

There is a distinction between data and an action you perform on data (matrix maths, codec algorithm, etc.). It’s literally completely different.

An audio codec (not a pipeline) is just actually doing math - just like the workings of an LLM. There’s plenty of work to be done after the audio codec decodes the m4a to get to tunes in your ears. Same for an LLM, sandwiching those matrix multiplications that make the magic happen are layers that crunch the prompts and assemble the tokens you see it spit out.

LLMs can’t think, that’s just the fact of how they work. The problem is that AI companies are happy to describe them in terms that make you think they can think to sell their product! I literally cannot be wrong that LLMs cannot think or reason, there’s no room for debate, it’s settled long ago. AI companies will string the LLMs together and let them chew for a while to try make themselves catch when they’re dropping bullshit. It’s still not thinking and reasoning though. They can be useful tools, but LLMs are just tools not sentient or verging on sentient
T This user is from outside of this forum
T This user is from outside of this forum
theunknownmuncher@lemmy.world

schrieb zuletzt editiert von theunknownmuncher@lemmy.world

#77

There is a distinction between data and an action you perform on data (matrix maths, codec algorithm, etc.). It’s literally completely different.

Incorrect. You might want to take an information theory class before speaking on subjects like this.

I literally cannot be wrong that LLMs cannot think or reason, there’s no room for debate, it’s settled long ago.

Lmao yup totally, it's not like this type of research currently gets huge funding at universities and institutions or anything like that it's a dead research field because it's already "settled". (You're wrong 🤭)

LLMs are just tools not sentient or verging on sentient

Correct. No one claimed they are "sentient" (you actually mean "sapient", not "sentient", but it's fine because people commonly mix these terms up. Sentience is about the physical senses. If you can respond to stimuli from your environment, you're sentient, if you can "I think, therefore I am", you're sapient). And no, LLMs are not sapient either, and sapience has nothing to do with neural networks' ability to mathematically reason or use logic, you're just moving the goalpost. But at least you moved it far enough to be actually correct?
G 1 Antwort Letzte Antwort

0
T theunknownmuncher@lemmy.world

There is a distinction between data and an action you perform on data (matrix maths, codec algorithm, etc.). It’s literally completely different.

Incorrect. You might want to take an information theory class before speaking on subjects like this.

I literally cannot be wrong that LLMs cannot think or reason, there’s no room for debate, it’s settled long ago.

Lmao yup totally, it's not like this type of research currently gets huge funding at universities and institutions or anything like that it's a dead research field because it's already "settled". (You're wrong 🤭)

LLMs are just tools not sentient or verging on sentient

Correct. No one claimed they are "sentient" (you actually mean "sapient", not "sentient", but it's fine because people commonly mix these terms up. Sentience is about the physical senses. If you can respond to stimuli from your environment, you're sentient, if you can "I think, therefore I am", you're sapient). And no, LLMs are not sapient either, and sapience has nothing to do with neural networks' ability to mathematically reason or use logic, you're just moving the goalpost. But at least you moved it far enough to be actually correct?
G This user is from outside of this forum
G This user is from outside of this forum
glizzyguzzler@lemmy.blahaj.zone

schrieb zuletzt editiert von

#78

It’s wild, we’re just completely talking past each other at this point! I don’t think I’ve ever gotten to a point where I’m like “it’s blue” and someone’s like “it’s gold” so clearly. And like I know enough to know what I’m talking about and that I’m not wrong (unis are not getting tons of grants to see “if AI can think”, no one but fart sniffing AI bros would fund that (see OP’s requested source is from an AI company about their own model), research funding goes towards making useful things not if ChatGPT is really going through it like the rest of us), but you are very confident in yourself as well. Your mention of information theory leads me to believe you’ve got a degree in the computer science field. The basis of machine learning is not in computer science but in stats (math). So I won’t change my understanding based on your claims since I don’t think you deeply know the basis just the application. The focus on using the “right words” as a gotchya bolsters that vibe. I know you won’t change your thoughts based on my input, so we’re at the age-old internet stalemate! Anyway, just wanted you to know why I decided not to entertain what you’ve been saying - I’m sure I’m in the same boat from your perspective
T 1 Antwort Letzte Antwort

0
G glizzyguzzler@lemmy.blahaj.zone

It’s wild, we’re just completely talking past each other at this point! I don’t think I’ve ever gotten to a point where I’m like “it’s blue” and someone’s like “it’s gold” so clearly. And like I know enough to know what I’m talking about and that I’m not wrong (unis are not getting tons of grants to see “if AI can think”, no one but fart sniffing AI bros would fund that (see OP’s requested source is from an AI company about their own model), research funding goes towards making useful things not if ChatGPT is really going through it like the rest of us), but you are very confident in yourself as well. Your mention of information theory leads me to believe you’ve got a degree in the computer science field. The basis of machine learning is not in computer science but in stats (math). So I won’t change my understanding based on your claims since I don’t think you deeply know the basis just the application. The focus on using the “right words” as a gotchya bolsters that vibe. I know you won’t change your thoughts based on my input, so we’re at the age-old internet stalemate! Anyway, just wanted you to know why I decided not to entertain what you’ve been saying - I’m sure I’m in the same boat from your perspective
T This user is from outside of this forum
T This user is from outside of this forum
theunknownmuncher@lemmy.world

schrieb zuletzt editiert von

#79

loses the argument "we’re at the age-old internet stalemate!" LMAO
G 1 Antwort Letzte Antwort

0
T theunknownmuncher@lemmy.world

loses the argument "we’re at the age-old internet stalemate!" LMAO
G This user is from outside of this forum
G This user is from outside of this forum
glizzyguzzler@lemmy.blahaj.zone

schrieb zuletzt editiert von

#80

Indeed I did not, we’re at a stalemate because you and I do not believe what the other is saying! So we can’t move anywhere since it’s two walls. Buuuut Tim Apple got my back for once, just saw this now!: https://lemmy.blahaj.zone/post/27197259

I’ll leave it at that, as thanks to that white paper I win! Yay internet points!
1 Antwort Letzte Antwort

0

Anmelden zum Antworten

R

I made a terminal-based hacker simulation game for CLI nerds. It's free. Feedback?
Beobachtet Ignoriert Geplant Angeheftet Gesperrt Verschoben Technology technology
1

1

1 Stimmen

1 Beiträge

0 Aufrufe

Niemand hat geantwortet
N

Pro-AI Subreddit Bans 'Uptick' of Users Who Suffer from AI Delusions
Beobachtet Ignoriert Geplant Angeheftet Gesperrt Verschoben Technology technology
80

1

272 Stimmen

80 Beiträge

3 Aufrufe

S

that sub seems to be fully brigaded by bots from marketing team of closed-ai and preplexity
P

Silicon Valley cities hit with request for residents' emails to train AI
Beobachtet Ignoriert Geplant Angeheftet Gesperrt Verschoben Technology technology
12

1

124 Stimmen

12 Beiträge

0 Aufrufe

T

Premium supported. You get plenty with the free tier, but you get lots more with paid.
A

The New York Times Just Published Some Bizarre Race Science About Asian Women
Beobachtet Ignoriert Geplant Angeheftet Gesperrt Verschoben Technology technology
100

1

278 Stimmen

100 Beiträge

0 Aufrufe

F

It's not just skills, it's also capital investment.
P

This Printer company served you malware for months, called them false positives
Beobachtet Ignoriert Geplant Angeheftet Gesperrt Verschoben Technology technology
24

1

297 Stimmen

24 Beiträge

8 Aufrufe

S

This is not a typical home or office printer, very specialized.
V

China aims to recruit top US scientists as Trump tries to kill the CHIPS Act
Beobachtet Ignoriert Geplant Angeheftet Gesperrt Verschoben Technology technology
1

1

0 Stimmen

1 Beiträge

1 Aufrufe

Niemand hat geantwortet
F

Decentralized Social Media Is the Only Alternative to the Tech Oligarchy
Beobachtet Ignoriert Geplant Angeheftet Gesperrt Verschoben Technology technology
9

1

1 Stimmen

9 Beiträge

2 Aufrufe

G

So we need a documentary like Super Size Me but for social media. I think post that documentary coming out was the only time I've seen people's attitudes change in the general population about fast food.
I

Dear Brother Printers: Eat a [Sponsor friendly words here]
Beobachtet Ignoriert Geplant Angeheftet Gesperrt Verschoben Technology technology
2

1 Stimmen

2 Beiträge

2 Aufrufe

A

Why doesn't Amazon just sell a generic printer that works with generic toner or pigment or ink. I would buy.