linux-nerds.org

Your browser does not seem to support JavaScript. As a result, your viewing experience will be diminished, and you have been placed in read-only mode.

Please download a browser that supports JavaScript, or enable it if it's disabled (i.e. NoScript).

Anthropic tested Claude's(LLM, AI Chatbot) ability to manage a physical “storefront” to mixed results, as the AI struggled with pricing strategy and inventory management

Technology

9 Beiträge 8 Kommentatoren 109 Aufrufe

P This user is from outside of this forum
P This user is from outside of this forum
pro@programming.dev

schrieb am zuletzt editiert von

#1

This post did not contain any content.

Project Vend: Can Claude run a small shop? (And why does that matter?)

We let Claude run a small shop in the Anthropic office. Here's what happened.

(www.anthropic.com)
A U D D D 5 Antworten Letzte Antwort

66
P pro@programming.dev

This post did not contain any content.

Project Vend: Can Claude run a small shop? (And why does that matter?)

We let Claude run a small shop in the Anthropic office. Here's what happened.

(www.anthropic.com)
A This user is from outside of this forum
A This user is from outside of this forum
a_norny_mousse@feddit.org

schrieb am zuletzt editiert von

#2

Anybody who thought the answer could have been even remotely close to Yes is delusional.
W 1 Antwort Letzte Antwort

17
A a_norny_mousse@feddit.org

Anybody who thought the answer could have been even remotely close to Yes is delusional.
W This user is from outside of this forum
W This user is from outside of this forum
womble@lemmy.world

schrieb am zuletzt editiert von

#3

I doubt anyone expected it to work completely, but it is interesting to see to what extent it worked and how it failed (halucinations and sycophancy)
A 1 Antwort Letzte Antwort

11
W womble@lemmy.world

I doubt anyone expected it to work completely, but it is interesting to see to what extent it worked and how it failed (halucinations and sycophancy)
A This user is from outside of this forum
A This user is from outside of this forum
a_norny_mousse@feddit.org

schrieb am zuletzt editiert von

#4

True; I just hate headlines that ask stupid questions.

But then again, there's always the premise that it could work, in such attempts, which annoys me no less.
1 Antwort Letzte Antwort

1
P pro@programming.dev

This post did not contain any content.

Project Vend: Can Claude run a small shop? (And why does that matter?)

We let Claude run a small shop in the Anthropic office. Here's what happened.

(www.anthropic.com)
U This user is from outside of this forum
U This user is from outside of this forum
uff@lemmy.world

schrieb am zuletzt editiert von

#5

This shit needs to start being regulated.
1 Antwort Letzte Antwort

1
P pro@programming.dev

This post did not contain any content.

Project Vend: Can Claude run a small shop? (And why does that matter?)

We let Claude run a small shop in the Anthropic office. Here's what happened.

(www.anthropic.com)
D This user is from outside of this forum
D This user is from outside of this forum
dhork@lemmy.world

schrieb am zuletzt editiert von dhork@lemmy.world

#6

It is an interesting article, even if it's conclusions are entirely too rosy. The "storefront" was a single vending machine, and the bot was instructed to interact with Anthropic employees (with an hourly cost attached) to do all physical interactions. While the bot did a decent job managing the stock most of the time, it made a lot of bad decisions based on trying to be too helpful to it's customers. It also frequently hallucinated, with some hilarious results I wont spoil here. But as anyone who owns a small business knows, one bad decision could put it under, so saying that an AI can manage a vending machine well "most of the time" is equivalent to saying it cant do the job at all.

Their conclusion is that with a bit more work, Claude might be able to perform as a middle-manager. To me, that says more about how useless middle-management is than how capable their AI is.
S 1 Antwort Letzte Antwort

14
D dhork@lemmy.world

It is an interesting article, even if it's conclusions are entirely too rosy. The "storefront" was a single vending machine, and the bot was instructed to interact with Anthropic employees (with an hourly cost attached) to do all physical interactions. While the bot did a decent job managing the stock most of the time, it made a lot of bad decisions based on trying to be too helpful to it's customers. It also frequently hallucinated, with some hilarious results I wont spoil here. But as anyone who owns a small business knows, one bad decision could put it under, so saying that an AI can manage a vending machine well "most of the time" is equivalent to saying it cant do the job at all.

Their conclusion is that with a bit more work, Claude might be able to perform as a middle-manager. To me, that says more about how useless middle-management is than how capable their AI is.
S This user is from outside of this forum
S This user is from outside of this forum
sepi@piefed.social

schrieb am zuletzt editiert von

#7

So what you are saying is the AI is ready to replace tech CEOs.
1 Antwort Letzte Antwort

2
P pro@programming.dev

This post did not contain any content.

Project Vend: Can Claude run a small shop? (And why does that matter?)

We let Claude run a small shop in the Anthropic office. Here's what happened.

(www.anthropic.com)
D This user is from outside of this forum
D This user is from outside of this forum
dojan@pawb.social

schrieb am zuletzt editiert von

#8

This is so funny. It fails miserably and they’re all “yeah so this is promising.”

Sure, a world where your manager hallucinates meetings with you and assesses you poorly for not performing according to plans that were hallucinated through said meetings sounds like a fantastic idea.
1 Antwort Letzte Antwort

0
P pro@programming.dev

This post did not contain any content.

Project Vend: Can Claude run a small shop? (And why does that matter?)

We let Claude run a small shop in the Anthropic office. Here's what happened.

(www.anthropic.com)
D This user is from outside of this forum
D This user is from outside of this forum
django@discuss.tchncs.de

schrieb am zuletzt editiert von

#9

All the tasks could have been easily solved with some basic APIs and algorithms.
1 Antwort Letzte Antwort

0

Anmelden zum Antworten

P

TikTok plans to lay off several hundred of their moderation team in the UK in favor of AI content moderation
Beobachtet Ignoriert Geplant Angeheftet Gesperrt Verschoben Technology technology
9

1

81 Stimmen

9 Beiträge

13 Aufrufe

M

Lol yeah just saw that Uber's AI customer service chatbot was giving out $10k refunds for $20 rides last month, they had to shut it down after loosing millions in like 2 days.
A

Vanishing Culture: Why Preserve Flash? [Internet Archive Blogs]
Beobachtet Ignoriert Geplant Angeheftet Gesperrt Verschoben Technology technology
4

45 Stimmen

4 Beiträge

26 Aufrufe

Z

Homestuck is the only reason I need
S

Huawei shows off AI computing system to rival Nvidia's top product
Beobachtet Ignoriert Geplant Angeheftet Gesperrt Verschoben Technology technology
15

21 Stimmen

15 Beiträge

106 Aufrufe

C

Huawei was uniquely, specifically, forced out of the US market around the time they were completing for 5G Tower standards.
E

Google’s electricity demand is skyrocketing
Beobachtet Ignoriert Geplant Angeheftet Gesperrt Verschoben Technology technology
11

1

189 Stimmen

11 Beiträge

124 Aufrufe

W

What's dystopian is that a company like google will fight tooth and nail to remain the sole owner and rights holder to such a tech. A technology that should be made accessible outside the confines of capitalist motives. Such technologies have the potential to lift entire populations out of poverty. Not to mention that they could mitigate global warming considerably. It is simply not in the interest of humanity to allow one or more companies to hold a monopoly over such technology
O

YouTube Comment Bots are out of control...
Beobachtet Ignoriert Geplant Angeheftet Gesperrt Verschoben Technology technology
3

55 Stimmen

3 Beiträge

55 Aufrufe

D

Youtube is just lazy. These bots are laughably easy to detect and block.
R

Trump Mobile launches $47 service and a gold phone
Beobachtet Ignoriert Geplant Angeheftet Gesperrt Verschoben Technology technology
129

1

354 Stimmen

129 Beiträge

3k Aufrufe

S

Why mention it? Because the media has a DUTY to call out a corrupt government! Because they're not doing their job!
A

the illusion of human thinking
Beobachtet Ignoriert Geplant Angeheftet Gesperrt Verschoben Technology technology
2

0 Stimmen

2 Beiträge

35 Aufrufe

H

Can we get more than just a picture of an Abstract?
P

A Health Privacy ‘Check-Up’: How Unfair Modern Business Practices Can Leave You Under-Informed and Your Most Sensitive Data Ripe for Collection and Sale
Beobachtet Ignoriert Geplant Angeheftet Gesperrt Verschoben Technology technology
1

11 Stimmen

1 Beiträge

19 Aufrufe

Niemand hat geantwortet