AI Chatbots Remain Overconfident — Even When They’re Wrong: Large Language Models appear to be unaware of their own mistakes, prompting concerns about common uses for AI chatbots.
-
This post did not contain any content.
-
This post did not contain any content.
That's because they aren't "aware" of anything.
-
This post did not contain any content.
I’m pretty much done with them except for some search
-
I’m pretty much done with them except for some search
Not even a good use case either, especially when it spews bullshit like “there’s no recorded instance of Trump ever having used the word enigma” and “there’s 1 r in strawberry”.
LLMs are copy-paste machines, not rationalization engines of any sort (at least as far as all the slop that gets shoved in our faces; I don’t include the specialized protein-folding and reconstructive models that were purpose-built for very niche applications).
-
This post did not contain any content.
Why is a researcher with a PhD in social sciences researching the accuracy confidence of predictive text? How has this person gotten to where they are without being able to understand that LLMs don't think? Surely that came up when he even started considering this brainfart of a research project?
-
That's because they aren't "aware" of anything.
This Nobel Prize winner and subject matter expert takes the opposite view
-
This post did not contain any content.
However, when the participants and LLMs were asked retroactively how well they thought they did, only the humans appeared able to adjust expectations
This is what everyone with a fucking clue has been saying for the past 5, 6? years these stupid fucking chatbots have been around.
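For anyone skimming, a rough sketch of the comparison the article describes: each agent estimates how many answers it will get right, does the task, then re-estimates afterwards, and the interesting quantity is how much that self-estimate moves. The numbers below are invented placeholders, not figures from the study.

```python
# Minimal sketch of the calibration comparison described above.
# All numbers are illustrative placeholders, not data from the study.

def confidence_adjustment(predicted_correct, retrospective_correct, actual_correct):
    """How much the self-estimate moved after the task, and how far the
    revised estimate still sits from the real score."""
    revision = retrospective_correct - predicted_correct
    residual_overconfidence = retrospective_correct - actual_correct
    return revision, residual_overconfidence

# Hypothetical agent: expects 18/20, claims 16/20 afterwards, actually scored 10/20.
print(confidence_adjustment(predicted_correct=18, retrospective_correct=16, actual_correct=10))
# -> (-2, 6): barely revised downward, still overconfident by 6 answers.
```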
-
Why is a researcher with a PhD in social sciences researching the accuracy confidence of predictive text? How has this person gotten to where they are without being able to understand that LLMs don't think? Surely that came up when he even started considering this brainfart of a research project?
Someone has to prove it wrong before it's actually wrong. Maybe they set out to discredit the bots
-
Someone has to prove it wrong before it's actually wrong. Maybe they set out to discredit the bots
I guess, but it's like proving your phone's predictive text has confidence in its suggestions regardless of accuracy. Confidence is not an attribute of a math function; they are attributing intelligence to a predictive model.
-
Not even a good use case either, especially when it spews bullshit like “there’s no recorded instance of Trump ever having used the word enigma” and “there’s 1 r in strawberry”.
LLMs are copy-paste machines, not rationalization engines of any sort (at least as far as all the slop that gets shoved in our faces; I don’t include the specialized protein-folding and reconstructive models that were purpose-built for very niche applications).
They're a solid starting point for shopping now that Wirecutter, Slant, and others are enshittified. I hate it and it makes me feel dirty to use, and you can't just do whatever the LLM says. But asking it for a list of options to then explore is currently the best way I've found to jump into things like outdoor basketball shoe options.
-
This post did not contain any content.
Sounds pretty human to me. /s
-
This post did not contain any content.
AI has evolved its own form of the Dunning-Kruger effect.
-
This post did not contain any content.
It's easy: just ask the AI "are you sure?" until it stops changing its answer.
But seriously, LLMs are just advanced autocomplete.
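For what "advanced autocomplete" means in the crudest possible form, here's a toy next-word predictor in Python. It's an analogy, not how a transformer actually works: the only thing it can do is emit the statistically likely next word, with no notion of whether the result is true.

```python
from collections import Counter, defaultdict

# Toy "predictive text": count which word follows which in a sample text,
# then always suggest the most frequent follower.
corpus = "the cat sat on the mat the cat ate the fish".split()

followers = defaultdict(Counter)
for current_word, next_word in zip(corpus, corpus[1:]):
    followers[current_word][next_word] += 1

def suggest(word):
    counts = followers[word]
    best, n = counts.most_common(1)[0]
    # The score is just "how often this word followed", not a truth value.
    return best, n / sum(counts.values())

print(suggest("the"))  # ('cat', 0.5) -- a plausible continuation, stated flatly
```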
-
This post did not contain any content.
Large language models aren’t designed to be knowledge machines - they’re designed to generate natural-sounding language, nothing more. The fact that they ever get things right is just a byproduct of their training data containing a lot of correct information. These systems aren’t generally intelligent, and people need to stop treating them as if they are. Complaining that an LLM gives out wrong information isn’t a failure of the model itself - it’s a mismatch of expectations.
-
This post did not contain any content.
Confidently incorrect.
-
This Nobel Prize winner and subject matter expert takes the opposite view
People really do not like seeing opposing viewpoints, eh? There's disagreeing, and then there's downvoting to oblivion without even engaging in a discussion, haha.
Even if they're probably right, in such murky uncertain waters where we're not experts, one should have at least a little open mind, or live and let live.
-
Large language models aren’t designed to be knowledge machines - they’re designed to generate natural-sounding language, nothing more. The fact that they ever get things right is just a byproduct of their training data containing a lot of correct information. These systems aren’t generally intelligent, and people need to stop treating them as if they are. Complaining that an LLM gives out wrong information isn’t a failure of the model itself - it’s a mismatch of expectations.
Neither are our brains.
“Brains are survival engines, not truth detectors. If self-deception promotes fitness, the brain lies. Stops noticing—irrelevant things. Truth never matters. Only fitness. By now you don’t experience the world as it exists at all. You experience a simulation built from assumptions. Shortcuts. Lies. Whole species is agnosiac by default.”
― Peter Watts, Blindsight (fiction)
Starting to think we're really not much smarter. "But LLMs tell us what we want to hear!" Been on Facebook lately, or Lemmy?
If nothing else, LLMs have woken me up to how stupid humans are vs. the machines.
-
Sounds pretty human to me. /s
Sounds pretty human to me. no /s
-
I guess, but it's like proving your phone's predictive text has confidence in its suggestions regardless of accuracy. Confidence is not an attribute of a math function; they are attributing intelligence to a predictive model.
I work in risk management, but don't really have a strong understanding of LLM mechanics. "Confidence" is something that I quantify in my work, but it has different terms that are associated with it. In modeling outcomes, I may say that we have 60% confidence in achieving our budget objectives, while others would express the same result by saying our chances of achieving our budget objective are 60%. Again, I'm not sure if this is what the LLM is doing, but if it is producing a modeled prediction with a CDF of possible outcomes, then representing its result with 100% confidence means that the LLM didn't model any possible outcomes other than the answer it is providing, which does seem troubling.
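If it helps to map that framing onto code, here's a minimal sketch of the risk-management notion of confidence (all numbers invented for illustration; this is not what an LLM computes internally): simulate a spread of outcomes, then read the probability of meeting the target off the empirical CDF. A bare point answer is the degenerate case where nothing else was modeled, so its "confidence" can only ever be 0% or 100%.

```python
import random

# Sketch of "60% confidence in achieving our budget objective":
# model many possible outcomes, then count how often the target is met.
random.seed(0)
budget = 1_000_000
simulated_costs = [random.gauss(mu=980_000, sigma=60_000) for _ in range(10_000)]

confidence = sum(cost <= budget for cost in simulated_costs) / len(simulated_costs)
print(f"P(cost <= budget) ~ {confidence:.0%}")  # roughly 60-65% with these made-up inputs

# A bare point estimate considers exactly one outcome, so this calculation
# would always return 0% or 100% -- the troubling case described above.
```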
-
People really do not like seeing opposing viewpoints, eh? There's disagreeing, and then there's downvoting to oblivion without even engaging in a discussion, haha.
Even if they're probably right, in such murky uncertain waters where we're not experts, one should have at least a little open mind, or live and let live.
It's like talking with someone who thinks the Earth is flat. There isn't anything to discuss. They're objectively wrong.
Humans like to anthropomorphize everything. It's why you can see a face on a car's front grille. LLMs are ultra advanced pattern matching algorithms. They do not think or reason or have any kind of opinion or sentience, yet they are being utilized as if they do. Let's see how it works out for the world, I guess.