linux-nerds.org

Your browser does not seem to support JavaScript. As a result, your viewing experience will be diminished, and you have been placed in read-only mode.

Please download a browser that supports JavaScript, or enable it if it's disabled (i.e. NoScript).

AI Chatbots Remain Overconfident — Even When They’re Wrong: Large Language Models appear to be unaware of their own mistakes, prompting concerns about common uses for AI chatbots.

Technology

68 Beiträge 42 Kommentatoren 856 Aufrufe

S snotflickerman@lemmy.blahaj.zone

That's because they aren't "aware" of anything.
N This user is from outside of this forum
N This user is from outside of this forum
nymnympseudonym@lemmy.world

schrieb zuletzt editiert von

#6

This Nobel Prize winner and subject matter expert takes the opposite view
O S G 3 Antworten Letzte Antwort

9
P pro@programming.dev

This post did not contain any content.
R This user is from outside of this forum
R This user is from outside of this forum
rc__buggy@sh.itjust.works

schrieb zuletzt editiert von

#7

However, when the participants and LLMs were asked retroactively how well they thought they did, only the humans appeared able to adjust expectations

This is what everyone with a fucking clue has been saying for the past 5, 6? years these stupid fucking chatbots have been around.
1 Antwort Letzte Antwort

18
L lodespawn@aussie.zone

Why is a researcher with a PhD in social sciences researching the accuracy confidence of predictive text, how has this person gotten to where they are without being able to understand that LLMs don't think? Surely that came up when he started even considering this brainfart of a research project?
R This user is from outside of this forum
R This user is from outside of this forum
rc__buggy@sh.itjust.works

schrieb zuletzt editiert von

#8

Someone has to prove it wrong before it's actually wrong. Maybe they set out to discredit the bots
L 1 Antwort Letzte Antwort

7
R rc__buggy@sh.itjust.works

Someone has to prove it wrong before it's actually wrong. Maybe they set out to discredit the bots
L This user is from outside of this forum
L This user is from outside of this forum
lodespawn@aussie.zone

schrieb zuletzt editiert von

#9

I guess, but it's like proving your phones predictive text has confidence in its suggestions regardless of accuracy. Confidence is not an attribute of a math function, they are attributing intelligence to a predictive model.
F 1 Antwort Letzte Antwort

6
W whitebrow@lemmy.world

Not even a good use case either, especially when it spews such bullshit like “there’s no recorded instance of trump ever having used the word enigma” and “there’s 1 r in strawberry”.

LLMs are a copy paste machine, not a rationalization engine of any sort (at least as far as all the slop that we get shoved in our face, I don’t include the specialized protein folding and reconstructive models that were purpose built for very niche applications)
Q This user is from outside of this forum
Q This user is from outside of this forum
quill7513@slrpnk.net

schrieb zuletzt editiert von

#10

they're solid starting point for shopping now that wirecutter, slant, and others are enshittified. i hate it and it makes me feel dirty to use, and you can't just do whatever the llm says. but asking it for a list of options to then explore is currently the best way i've found to jump into things like outdoor basketball shoe options
1 Antwort Letzte Antwort

2
P pro@programming.dev

This post did not contain any content.
E This user is from outside of this forum
E This user is from outside of this forum
etherphon@lemmy.world

schrieb zuletzt editiert von

#11

Sounds pretty human to me. /s
S 1 Antwort Letzte Antwort

2
P pro@programming.dev

This post did not contain any content.
E This user is from outside of this forum
E This user is from outside of this forum
el_guapazo@lemmy.world

schrieb zuletzt editiert von

#12

AI evolved their own form of the Dunning Kruger effect.
1 Antwort Letzte Antwort

1
P pro@programming.dev

This post did not contain any content.
M This user is from outside of this forum
M This user is from outside of this forum
modern_medicine_isnt@lemmy.world

schrieb zuletzt editiert von

#13

It's easy, just ask the AI "are you sure"? Until it stops changing it's answer.

But seriously, LLMs are just advanced autocomplete.
L C J 3 Antworten Letzte Antwort

17
P pro@programming.dev

This post did not contain any content.
P This user is from outside of this forum
P This user is from outside of this forum
perspectivist@feddit.uk

schrieb zuletzt editiert von

#14

Large language models aren’t designed to be knowledge machines - they’re designed to generate natural-sounding language, nothing more. The fact that they ever get things right is just a byproduct of their training data containing a lot of correct information. These systems aren’t generally intelligent, and people need to stop treating them as if they are. Complaining that an LLM gives out wrong information isn’t a failure of the model itself - it’s a mismatch of expectations.
S 1 Antwort Letzte Antwort

49
P pro@programming.dev

This post did not contain any content.
E This user is from outside of this forum
E This user is from outside of this forum
etterra@discuss.online

schrieb zuletzt editiert von

#15

Confidently incorrect.
1 Antwort Letzte Antwort

1
N nymnympseudonym@lemmy.world

This Nobel Prize winner and subject matter expert takes the opposite view
O This user is from outside of this forum
O This user is from outside of this forum
obinice@lemmy.world

schrieb zuletzt editiert von

#16

People really do not like seeing opposing viewpoints, eh? There's disagreeing, and then there's downvoting to oblivion without even engaging in a discussion, haha.

Even if they're probably right, in such murky uncertain waters where we're not experts, one should have at least a little open mind, or live and let live.
T F 2 Antworten Letzte Antwort

8
P perspectivist@feddit.uk

Large language models aren’t designed to be knowledge machines - they’re designed to generate natural-sounding language, nothing more. The fact that they ever get things right is just a byproduct of their training data containing a lot of correct information. These systems aren’t generally intelligent, and people need to stop treating them as if they are. Complaining that an LLM gives out wrong information isn’t a failure of the model itself - it’s a mismatch of expectations.
S This user is from outside of this forum
S This user is from outside of this forum
shalafi@lemmy.world

schrieb zuletzt editiert von

#17

Neither are our brains.

“Brains are survival engines, not truth detectors. If self-deception promotes fitness, the brain lies. Stops noticing—irrelevant things. Truth never matters. Only fitness. By now you don’t experience the world as it exists at all. You experience a simulation built from assumptions. Shortcuts. Lies. Whole species is agnosiac by default.”

― Peter Watts, Blindsight (fiction)

Starting to think we're really not much smarter. "But LLMs tell us what we want to hear!" Been on FaceBook lately, or lemmy?

If nothing else, LLMs have woke me to how stupid humans are vs. the machines.
P A S J 4 Antworten Letzte Antwort

6
E etherphon@lemmy.world

Sounds pretty human to me. /s
S This user is from outside of this forum
S This user is from outside of this forum
shalafi@lemmy.world

schrieb zuletzt editiert von

#18

Sounds pretty human to me. no /s
1 Antwort Letzte Antwort

2
L lodespawn@aussie.zone

I guess, but it's like proving your phones predictive text has confidence in its suggestions regardless of accuracy. Confidence is not an attribute of a math function, they are attributing intelligence to a predictive model.
F This user is from outside of this forum
F This user is from outside of this forum
fanciestpants@lemmy.world

schrieb zuletzt editiert von

#19

I work in risk management, but don't really have a strong understanding of LLM mechanics. "Confidence" is something that i quantify in my work, but it has different terms that are associated with it. In modeling outcomes, I may say that we have 60% confidence in achieving our budget objectives, while others would express the same result by saying our chances of achieving our budget objective are 60%. Again, I'm not sure if this is what the LLM is doing, but if it is producing a modeled prediction with a CDF of possible outcomes, then representing its result with 100% confindence means that the LLM didn't model any other possible outcomes other than the answer it is providing, which does seem troubling.
L 1 Antwort Letzte Antwort

1
O obinice@lemmy.world

People really do not like seeing opposing viewpoints, eh? There's disagreeing, and then there's downvoting to oblivion without even engaging in a discussion, haha.

Even if they're probably right, in such murky uncertain waters where we're not experts, one should have at least a little open mind, or live and let live.
T This user is from outside of this forum
T This user is from outside of this forum
thb@lemmy.world

schrieb zuletzt editiert von thb@lemmy.world

#20

It's like talking with someone who thinks the Earth is flat. There isn't anything to discuss. They're objectively wrong.

Humans like to anthropomorphize everything. It's why you can see a face on a car's front grille. LLMs are ultra advanced pattern matching algorithms. They do not think or reason or have any kind of opinion or sentience, yet they are being utilized as if they do. Let's see how it works out for the world, I guess.
S 1 Antwort Letzte Antwort

20
M modern_medicine_isnt@lemmy.world

It's easy, just ask the AI "are you sure"? Until it stops changing it's answer.

But seriously, LLMs are just advanced autocomplete.
L This user is from outside of this forum
L This user is from outside of this forum
lfrith@lemmy.ca

schrieb zuletzt editiert von

#21

They can even get math wrong. Which surprised me. Had to tell it the answer is wrong for them to recalculate and then get the correct answer. It was simple percentages of a list of numbers I had asked.
G S J 3 Antworten Letzte Antwort

7
S shalafi@lemmy.world

Neither are our brains.

“Brains are survival engines, not truth detectors. If self-deception promotes fitness, the brain lies. Stops noticing—irrelevant things. Truth never matters. Only fitness. By now you don’t experience the world as it exists at all. You experience a simulation built from assumptions. Shortcuts. Lies. Whole species is agnosiac by default.”

― Peter Watts, Blindsight (fiction)

Starting to think we're really not much smarter. "But LLMs tell us what we want to hear!" Been on FaceBook lately, or lemmy?

If nothing else, LLMs have woke me to how stupid humans are vs. the machines.
P This user is from outside of this forum
P This user is from outside of this forum
perspectivist@feddit.uk

schrieb zuletzt editiert von

#22

There are plenty of similarities in the output of both the human brain and LLMs, but overall they’re very different. Unlike LLMs, the human brain is generally intelligent - it can adapt to a huge variety of cognitive tasks. LLMs, on the other hand, can only do one thing: generate language. It’s tempting to anthropomorphize systems like ChatGPT because of how competent they seem, but there’s no actual thinking going on. It’s just generating language based on patterns and probabilities.
1 Antwort Letzte Antwort

7
F fanciestpants@lemmy.world

I work in risk management, but don't really have a strong understanding of LLM mechanics. "Confidence" is something that i quantify in my work, but it has different terms that are associated with it. In modeling outcomes, I may say that we have 60% confidence in achieving our budget objectives, while others would express the same result by saying our chances of achieving our budget objective are 60%. Again, I'm not sure if this is what the LLM is doing, but if it is producing a modeled prediction with a CDF of possible outcomes, then representing its result with 100% confindence means that the LLM didn't model any other possible outcomes other than the answer it is providing, which does seem troubling.
L This user is from outside of this forum
L This user is from outside of this forum
lodespawn@aussie.zone

schrieb zuletzt editiert von

#23

Nah so their definition is the classical "how confident are you that you got the answer right". If you read the article they asked a bunch of people and 4 LLMs a bunch of random questions, then asked the respondent whether they/it had confidence their answer was correct, and then checked the answer. The LLMs initially lined up with people (over confident) but then when they iterated, shared results and asked further questions the LLMs confidence increased while people's tends to decrease to mitigate the over confidence.

But the study still assumes intelligence enough to review past results and adjust accordingly, but disregards the fact that an AI isnt intelligence, it's a word prediction model based on a data set of written text tending to infinity. It's not assessing validity of results, it's predicting what the answer is based on all previous inputs. The whole study is irrelevant.
J 1 Antwort Letzte Antwort

1
N nymnympseudonym@lemmy.world

This Nobel Prize winner and subject matter expert takes the opposite view
S This user is from outside of this forum
S This user is from outside of this forum
snotflickerman@lemmy.blahaj.zone

schrieb zuletzt editiert von snotflickerman@lemmy.blahaj.zone

#24

Interesting talk but the number of times he completely dismisses the entire field of linguistics kind of makes me think he's being disingenuous about his familiarity with it.

For one, I think he is dismissing holotes, the concept of "wholeness." That when you cut something apart to it's individual parts, you lose something about the bigger picture. This deconstruction of language misses the larger picture of the human body as a whole, and how every part of us, from our assemblage of organs down to our DNA, impact how we interact with and understand the world. He may have a great definition of understanding but it still sounds (to me) like it's potentially missing aspects of human/animal biologically based understanding.

For example, I have cancer, and about six months before I was diagnosed, I had begun to get more chronically depressed than usual. I felt hopeless and I didn't know why. Surprisingly, that's actually a symptom of my cancer. What understanding did I have that changed how I felt inside and how I understood the things around me? Suddenly I felt different about words and ideas, but nothing had changed externally, something had change internally. The connections in my neural network had adjusted, the feelings and associations with words and ideas was different, but I hadn't done anything to make that adjustment. No learning or understanding had happened. I had a mutation in my DNA that made that adjustment for me.

Further, I think he's deeply misunderstanding (possibly intentionally?) what linguists like Chomsky are saying when they say humans are born with language. They mean that we are born with a genetic blueprint to understand language. Just like animals are born with a genetic blueprint to do things they were never trained to do. Many animals are born and almost immediately stand up to walk. This is the same principle. There are innate biologically ingrained understandings that help us along the path to understanding. It does not mean we are born understanding language as much as we are born with the building blocks of understanding the physical world in which we exist.

Anyway, interesting talk, but I immediately am skeptical of anyone who wholly dismisses an entire field of thought so casually.

For what it's worth, I didn't downvote you and I'm sorry people are doing so.
N 1 Antwort Letzte Antwort

17
L lfrith@lemmy.ca

They can even get math wrong. Which surprised me. Had to tell it the answer is wrong for them to recalculate and then get the correct answer. It was simple percentages of a list of numbers I had asked.
G This user is from outside of this forum
G This user is from outside of this forum
gissamittjobb@lemmy.ml

schrieb zuletzt editiert von

#25

Language models are unsuitable for math problems broadly speaking. We already have good technology solutions for that category of problems. Luckily, you can combine the two - prompt the model to write a program that solves your math problem, then execute it. You're likely to see a lot more success using this approach.
J 1 Antwort Letzte Antwort

8

Anmelden zum Antworten

T

Trump signs executive order targeting banking industry discrimination
Beobachtet Ignoriert Geplant Angeheftet Gesperrt Verschoben Technology technology
4

1

7 Stimmen

4 Beiträge

13 Aufrufe

T

His own network.
S

How Digital Press Releases Are Changing Brand Visibility in 2025
Beobachtet Ignoriert Geplant Angeheftet Gesperrt Verschoben Technology technology
1

0 Stimmen

1 Beiträge

4 Aufrufe

Niemand hat geantwortet
A

Europe Flowers and Ornamental Plants Market Trends: Growth, Share, Value, Size, and Analysis
Beobachtet Ignoriert Geplant Angeheftet Gesperrt Verschoben Technology technology
1

2

0 Stimmen

1 Beiträge

3 Aufrufe

Niemand hat geantwortet
A

datacenter liquid cooling solution
Beobachtet Ignoriert Geplant Angeheftet Gesperrt Verschoben Technology technology
35

34 Stimmen

35 Beiträge

227 Aufrufe

S

A legit exception might occur with a strict energy-optimization objective, where the point would be transporting heat outside of an HVAC envelope as efficiently as possible. The cost of the additional thermal load is often ignored by hobbyists in their energy calculations but it can be significant. In the context of fixed-capacity solar, for example, it might be cheaper to pipe waste heat from a telco closet to a space that isn’t climate controlled, like a garage, than it would be to expand the solar installation for increased HVAC draw.
Y

Navigating the Skies: Growth and Challenges in the UTM Market
Beobachtet Ignoriert Geplant Angeheftet Gesperrt Verschoben Technology technology
1

1

1 Stimmen

1 Beiträge

20 Aufrufe

Niemand hat geantwortet
I

Understanding the impacts of generative AI use on children
Beobachtet Ignoriert Geplant Angeheftet Gesperrt Verschoben Technology technology
4

50 Stimmen

4 Beiträge

61 Aufrufe

S

That's fine, just use ChatGPT...
D

Trump Media & Technology Group, the company owned by the President, said Tuesday that it would raise $2.5 billion to invest in Bitcoin
Beobachtet Ignoriert Geplant Angeheftet Gesperrt Verschoben Technology technology
15

1

50 Stimmen

15 Beiträge

146 Aufrufe

A

it's an insecurity.
E

Realtek's $10 tiny 10GbE network adapter is coming to motherboards later this year
Beobachtet Ignoriert Geplant Angeheftet Gesperrt Verschoben Technology technology
10

1

5 Stimmen

10 Beiträge

91 Aufrufe

S

You could look into automatic local caching for diles you're planning to seed, and stick that on an SSD. That way you don't hammer the HDDs in the NAS and still get the good feels of seeding. Then automatically delete files once they get to a certain seed rate or something and you're golden. How aggressive you go with this depends on your actual use case. Are you actually editing raw footage over the network while multiple other clients are streaming other stuff? Or are you just interested in having it be capable? What's the budget? But that sounds complicated. I'd personally rather just DIY it, that way you can put an SSD in there for cache and you get most of the benefits with a lot less cost, and you should be able to respond to issues with minimal changes (i.e. add more RAM or another caching drive).