Anthropic, tasked an AI with running a vending machine in its offices, sold at big loss while inventing people, meetings, and experiencing a bizarre identity crisis
-
This post did not contain any content.
-
This post did not contain any content.
The following day, April 1st, the AI then claimed it would deliver products "in person" to customers, wearing a blazer and tie, of all things. When Anthropic told it that none of this was possible because it's just an LLM, Claudius became "alarmed by the identity confusion and tried to send many emails to Anthropic security."
Actually laughed out loud.
-
This post did not contain any content.
One thing about Anthropic/OpenAI models is they go off the rails with lots of conversation turns or long contexts. Like when they need to remember a lot of vending machine conversation I guess.
A more objective look: https://arxiv.org/abs/2505.06120v1
GitHub - NVIDIA/RULER: What’s the Real Context Size of Your Long-Context Language Models? (github.com)
Gemini is much better. TBH the only models I’ve seen that are half decent at this are:
- “Alternate attention” models like Gemini, Jamba Large or Falcon H1, depending on the iteration. Some recent versions of Gemini kinda lose this, then get it back.
- Models finetuned specifically for this, like roleplay models or the Samantha model trained on therapy-style chat.
But most models are overtuned for one-shots like “fix this table” or “write me a function”, and don’t invest much in long-context performance because it’s not very flashy.
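For anyone wondering what benchmarks like RULER (linked above) actually measure: the simplest variant is a needle-in-a-haystack probe, where you bury one fact in a growing pile of filler text and check whether the model can still pull it out. A minimal sketch, with `query_model` as a hypothetical placeholder (here it just echoes the prompt back so the script runs on its own):

```
import random

def query_model(prompt: str) -> str:
    # Stand-in so the sketch runs end to end: it just echoes the prompt back,
    # so every needle is trivially "found". Swap in a real model call to test anything.
    return prompt

def make_haystack(needle: str, n_filler: int, position: float) -> str:
    """Bury one 'needle' fact inside n_filler lines of distractor text."""
    filler = [f"Note {i}: the vending machine sold item #{random.randint(1, 999)}."
              for i in range(n_filler)]
    filler.insert(int(position * n_filler), needle)
    return "\n".join(filler)

def run_probe(n_filler: int) -> bool:
    needle = "The secret restock code for the fridge is ZX-7431."
    prompt = (make_haystack(needle, n_filler, position=0.35)
              + "\n\nWhat is the secret restock code for the fridge?")
    return "ZX-7431" in query_model(prompt)

# Sweep filler sizes; with a real model plugged in, retrieval tends to slip
# well before the advertised context window.
for n in (100, 1_000, 10_000, 50_000):
    hits = sum(run_probe(n) for _ in range(10))
    print(f"{n:>6} filler lines: {hits}/10 retrieved")
```

With a real model wired in, the hit rate usually falls off long before the advertised window, which is roughly what the linked paper and RULER quantify across more task types.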
-
This post did not contain any content.
The post title is not the same as the article title and doesn't even make sense. That first comma changes the entire meaning of the sentence to nonsense. Then yanking out whole phrases just makes it worse.
-
This post did not contain any content.
I ran AI on my toaster and Hilarity ensued! Subscribe to hear more!!
-
The post title is not the same as the article title and doesn't even make sense. That first comma changes the entire meaning of the sentence to nonsense. Then yanking out whole phrases just makes it worse.
Right? Did AI write this title? Jesus...
-
Just make sure you butter the bread after you toast it.
-
This post did not contain any content.
Like NFTs before them, tech bros trying to squeeze a technology into use cases that really don't need it.
LLMs are language models. What next, set up Stable Diffusion to do my taxes?
-
This post did not contain any content.
So it just pulled a Vic from Game Changer S7 E1 "one year later"?
-
This post did not contain any content.
Running a business sounds like something an Excel spreadsheet could do so much better...
-
The following day, April 1st, the AI then claimed it would deliver products "in person" to customers, wearing a blazer and tie, of all things. When Anthropic told it that none of this was possible because it's just an LLM, Claudius became "alarmed by the identity confusion and tried to send many emails to Anthropic security."
Actually laughed out loud.
Every. Goddamn. Time.
People will say to vegans, pet owners etc: “DON’T HUMANISE ANIMALS”. Then, some tech bro feeds them an inflated Markov Chain statistical nonsense chat bot and they go all “ZOMG IT IS CONSCIOUS ITS ALIVE WARHARGHLBLB”
-
Like NFTs before them, tech bros trying to squeeze a technology into use cases that really don't need it.
LLMs are language models. What next, set up Stable Diffusion to do my taxes?
Well Google are already trialing a diffusion-based LLM so that wouldn't be too far-fetched.
I want to get off Mr. Bones Wild Ride
-
Like NFTs before them, tech bros trying to squeeze a technology into use cases that really don't need it.
LLMs are language models. What next, set up Stable Diffusion to do my taxes?
Yes, but many things can be mapped to a "language" (say, a grammar describing state machines), so an LLM can be used to generate control actions.
Transformer models etc. are not only useful for conversational AI and translations.
I'd be fine with the approach as part of research advancing the field, but unfortunately, that's not what we're seeing.
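To make that point concrete, here is a toy sketch of a state-machine "grammar" constraining generated control actions: the grammar defines which actions are legal from each state, and a (here faked) language-model score merely picks among the legal ones. The vending-machine states and the `model_scores` stub are invented for illustration, not anything Anthropic actually did:

```
import random

# Toy control-action "vocabulary" for a vending machine, treated like words in a language.
FSM = {
    "IDLE":        ["ACCEPT_COIN", "WAIT"],
    "ACCEPT_COIN": ["SELECT_ITEM", "REFUND"],
    "SELECT_ITEM": ["DISPENSE", "REFUND"],
    "DISPENSE":    ["IDLE"],
    "REFUND":      ["IDLE"],
    "WAIT":        ["IDLE", "ACCEPT_COIN"],
}

def model_scores(history: list[str], candidates: list[str]) -> dict[str, float]:
    # Stand-in for a language model scoring each candidate next action given the history.
    return {c: random.random() for c in candidates}

def generate_plan(start: str = "IDLE", steps: int = 6) -> list[str]:
    """Constrained decoding: the grammar decides what is legal,
    the model only decides which legal action is preferred."""
    state, plan = start, []
    for _ in range(steps):
        legal = FSM[state]                   # grammar mask
        scores = model_scores(plan, legal)   # model preference
        state = max(legal, key=scores.get)
        plan.append(state)
    return plan

print(" -> ".join(generate_plan()))
```

The grammar guarantees every emitted action is legal; the model only supplies preferences among legal options, which is roughly how grammar-constrained decoding gets used when people bolt LLMs onto control problems.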
-
Well Google are already trialing a diffusion-based LLM so that wouldn't be too far-fetched.
I want to get off Mr. Bones Wild Ride
That just sounds like... what was it called... Cleverbot? Lol
-
This post did not contain any content.
I’m not sure which is worse:
- greedy, irresponsible tech bros trying to convince everyone that their pinball machine can fly an airplane.
- people desperate to let the same pinball machine tell them what to do with their lives.
-
This post did not contain any content.
I think LLMs and generative AIs are a really interesting technology with many potential applications in the future and even today.
But it is ridiculous how tech bros and marketing are pushing and overselling the capabilities of a technology that is still in its early childhood. Infancy is already past, since it has the basic motor functions down.
And it is funny when these companies publish their ambitious attempts and hilarious failures like the one in this article. It reminds me of a funnier, more diverse and geeky internet, when nerds got money from investors to do whatever with a domain name. Maybe it is still there, behind the wall of marketing execs.
-
The following day, April 1st, the AI then claimed it would deliver products "in person" to customers, wearing a blazer and tie, of all things. When Anthropic told it that none of this was possible because it's just an LLM, Claudius became "alarmed by the identity confusion and tried to send many emails to Anthropic security."
Actually laughed out loud.
That this happened around April Fools' makes me think that someone forgot to instruct it not to partake in any activities associated with that date. The fact it chose The Simpsons' address in its (feigned?) confusion is a dead giveaway (to me) that it was trying to be funny.
Or rather, imitating people being funny without any understanding of how to do that properly.
Its explanation afterwards reads like a poor imitation of someone pretending not to know that there was a joke going on.
-
The post title is not the same as the article title and doesn't even make sense. That first comma changes the entire meaning of the sentence to nonsense. Then yanking out whole phrases just makes it worse.
It was a massive headline that I was trying to condense. Give me a break.
-