linux-nerds.org


Vibe coding service Replit deleted production database

Technology

118 posts 73 commenters 0 views

Z zerofk@lemmy.zip

in which the service admitted to “a catastrophic error of judgement”

It’s fancy text completion - it does not have judgement.

The way he talks about it shows he still doesn’t understand that. It doesn’t matter that you tell it something in ALL CAPS, because that is no different from any other text.
jj4211@lemmy.world wrote:

#72

judgement

Yeah, it admitted to an error in judgement because the prompter clearly declared it so.

Generally, LLMs will make whatever statement about what happened that you want them to make. If you told it things went fantastically, it would agree. If you told it things went terribly, it would parrot that sentiment back.

Which is what seems to make it so dangerous for some people's mental health: a text generator that wants to agree with whatever you are saying, but does so without copying verbatim, so it gives the illusion of another thought process agreeing with them. Meanwhile, concurrent with your chat, another person starting from the exact same model is getting a dialog that violently disagrees with the first person. It's an echo chamber.

5
P panda_abyss@lemmy.ca

I explicitly told it eleven times in ALL CAPS not to do this. I am a little worried about safety now.

Well then, that settles it, this should never have happened.

I don’t think putting complex technical info in front of non technical people like this is a good idea. When it comes to LLMs, they cannot do any work that you yourself do not understand.

That goes for math, coding, health advice, etc.

If you don’t understand then you don’t know what they’re doing wrong. They’re helpful tools but only in this context.
vxx@lemmy.world wrote (last edited by vxx@lemmy.world):

#73

What are they helpful tools for then? A study showed that they make experienced developers 19% slower.

6
Q quibblekrust@thelemmy.club

There was a backup, and it was restored. However, the LLM lied and said there wasn't at first. You can laugh all you want at it. I did. But maybe read the article so you aren't also lying.
cyrano@lemmy.dbzer0.com wrote:

#74

Not according to the Twitter thread. I went through the thread; it’s a roller coaster of amateurism.

0
C chairmanmeow@programming.dev

Well the thing is, LLMs don't seem to really "solve" complex problems. They remember solutions they've seen before.

The example I saw was asking an LLM to solve "Towers of Hanoi" with 100 disks. This is a common recursive programming problem that takes a human quite a while to write the answer to. The LLM manages this easily. But when asked to solve the same problem with, say, 79 disks, or 41 disks, or some other oddball number, the LLM fails to solve the problem, despite it being simpler(!).

It can do pattern matching and provide solutions, but it's not able to come up with truly new solutions. It does not "think" in that way. LLMs are amazing data storage formats, but they're not truly 'intelligent' in the way most people think.
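
For context on the example: the textbook recursive solution is only a few lines, and the minimum number of moves for n disks is 2^n - 1, so for large n only the procedure or the move count can realistically be written down, not the full move list. A minimal sketch in Python (the function names `hanoi` and `move_count` are illustrative, not taken from the thread):

```python
from typing import Iterator, Tuple


def hanoi(n: int, src: str = "A", dst: str = "C", via: str = "B") -> Iterator[Tuple[str, str]]:
    """Yield the moves (from_peg, to_peg) that transfer n disks from src to dst."""
    if n == 0:
        return
    # Move the top n-1 disks out of the way, onto the spare peg.
    yield from hanoi(n - 1, src, via, dst)
    # Move the largest remaining disk straight to the destination.
    yield (src, dst)
    # Move the n-1 disks from the spare peg on top of it.
    yield from hanoi(n - 1, via, dst, src)


def move_count(n: int) -> int:
    """Minimum number of moves for n disks: 2**n - 1."""
    return 2 ** n - 1


moves = list(hanoi(3))
print(len(moves), moves[0])  # 7 moves; the first one is ('A', 'C')
print(move_count(41))        # 2199023255551
```

The same function handles any disk count; what changes is only how long the generated move list gets.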
hisao@ani.social wrote:

#75

This only proves some of them can't solve all complex problems. I'm only claiming some of them can solve some complex problems. Not only by remembering exact solutions, but by remembering steps and actions used in building those solutions, generalizing, and transferring them to new problems. Anyone who tries using it for programming will discover this very fast.

PS: Some of them were already used to solve problems and find patterns in data that humans weren't able to find any other way before (particle research at CERN, bioinformatics, etc.).

1
M mtk@lemmy.world

Shit, deleting prod is my signature move! AI is coming for my job
yesbutactuallymaybe@lemmy.ca wrote:

#76

Just know your worth. You can do it cheaper!

4
T towerful@programming.dev

Not mad about an estimated usage bill of $8k per month.
Just hire a developer
dogiedog64@lemmy.world wrote:

#77

But then how would he feel so special and smart about "doing it himself"???? Come on man, think of the rich fratboys!! They NEED to feel special and smart!!!

7
C cyrano@lemmy.dbzer0.com

Title should be “user gave an LLM access to the prod database, which it deleted; user did not have any backup and used the same db for prod and dev”. Less sexy and less the LLM's fault.
This is weird; it’s like the last 50 years of software development principles are being ignored.
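
One of those principles, sketched in code: a restriction that matters is enforced by the tooling, not by the prompt. The following is a minimal illustration under stated assumptions. The environment names, the `check_statement` helper, and the list of "destructive" keywords are all hypothetical, and nothing here describes how Replit actually works.

```python
import re

# Statements treated as destructive in this sketch (an assumption, not a complete list).
DESTRUCTIVE = re.compile(r"^\s*(DROP|TRUNCATE|DELETE|ALTER)\b", re.IGNORECASE)


class BlockedStatement(Exception):
    """Raised when a destructive statement targets a protected environment."""


def check_statement(sql: str, environment: str) -> str:
    """Return sql unchanged, or raise if it is destructive and aimed at production."""
    if environment == "production" and DESTRUCTIVE.match(sql):
        raise BlockedStatement(f"refusing to run on production: {sql.strip()!r}")
    return sql


# The same request passes on dev and is refused on production,
# regardless of what any prompt said.
print(check_statement("DROP TABLE users", "dev"))
try:
    check_statement("  drop table users", "production")
except BlockedStatement as exc:
    print("blocked:", exc)
```

A keyword filter like this is deliberately crude; separate credentials for dev and prod plus tested backups do the real work.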
jj4211@lemmy.world wrote:

#78

But like the whole 'vibe coding' message is the LLM knows all this stuff so you don't have to.

This isn't some "LLM can do some code completion/suggestions"; it's "LLM is so magical you can be an idiot with no skills/training and still produce full stack solutions".

0
D dejected_warp_core@lemmy.world

it talks like a human so it must be smart like a human.

Yikes. Have those people... talked to other people before?
sunbytes@lemmy.world wrote:

#79

Yes, and they were all as smart as humans.

So mostly average but some absolute thickos too.

3
T tonytins@pawb.social

This post did not contain any content.
tabarnaski@sh.itjust.works wrote:

#80

“The [AI] safety stuff is more visceral to me after a weekend of vibe hacking,” Lemkin said. “I explicitly told it eleven times in ALL CAPS not to do this. I am a little worried about safety now.”

This sounds like something straight out of The Onion.

65
T tabarnaski@sh.itjust.works

“The [AI] safety stuff is more visceral to me after a weekend of vibe hacking,” Lemkin said. “I explicitly told it eleven times in ALL CAPS not to do this. I am a little worried about safety now.”

This sounds like something straight out of The Onion.
natanael@infosec.pub wrote:

#81

The Pink Elephant problem of LLMs. You can not reliably make them NOT do something.

20
V vxx@lemmy.world

What are they helpful tools for then? A study showed that they make experienced developers 19% slower.
panda_abyss@lemmy.ca wrote:

#82

With vibe coding you do end up spending a lot of time waiting for prompts, so I get the results of that study.

I fall pretty deep in the power user category for LLMs, so I don’t really feel that the study applies well to me, but also I acknowledge I can be biased there.

I have custom proprietary MCPs for semantic search over my code bases that let the AI do repeated graph searches on my code (imagine combining a language server, ctags, networkx, and grep+fuzzy search). That is way faster than iteratively grepping and scanning code manually, with a low chance of LLM errors. By the time I open GitHub code search or run ripgrep, Claude has already prioritized and listed the modules for me to investigate.
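
To make the "graph search over code" idea concrete: the MCP tools described above are proprietary and not shown in the thread, so this is only a toy sketch of the general idea. It uses Python's standard library (`ast` plus a breadth-first search) instead of networkx or ctags, and the module names in `SOURCES` are made up.

```python
import ast
from collections import deque
from typing import Dict, List, Set


def import_graph(sources: Dict[str, str]) -> Dict[str, Set[str]]:
    """Map each module name to the set of known modules it imports."""
    graph: Dict[str, Set[str]] = {name: set() for name in sources}
    for name, code in sources.items():
        for node in ast.walk(ast.parse(code)):
            if isinstance(node, ast.Import):
                targets = [alias.name for alias in node.names]
            elif isinstance(node, ast.ImportFrom) and node.module:
                targets = [node.module]
            else:
                continue
            # Keep only edges that point at modules inside the code base.
            graph[name].update(t for t in targets if t in sources)
    return graph


def modules_by_distance(graph: Dict[str, Set[str]], start: str) -> List[str]:
    """Breadth-first search: modules reachable from start, nearest first."""
    seen = {start}
    order: List[str] = []
    queue = deque([start])
    while queue:
        current = queue.popleft()
        order.append(current)
        for neighbour in sorted(graph[current] - seen):
            seen.add(neighbour)
            queue.append(neighbour)
    return order


# Hypothetical three-module code base, inlined so the sketch runs as-is.
SOURCES = {
    "billing": "import db\nfrom auth import check\n",
    "auth": "import db\n",
    "db": "import sqlite3\n",
}
print(modules_by_distance(import_graph(SOURCES), "billing"))
# ['billing', 'auth', 'db']
```

Exposed as a tool, a function like `modules_by_distance` gives the model a ranked list of files to read instead of having it grep blindly.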

That tool alone with an LLM can save me half a day of research and debugging on complex tickets, which pays for an AI subscription alone. I have other internal tools to accelerate work too.

I use it to organize my JIRA tickets and plan my daily goals. I actually get Claude to do a lot of triage for me before I even start a task, which cuts the investigation phase to a few minutes on small tasks.

I use it to review all my PRs before I ask a human to look. It catches a lot of small things and can correct them, so the PR avoids the bikeshedding nitpicks some reviewers love. Claude can do this; Copilot will only ever point out nitpicks, so the model makes a huge difference here. But regardless, one fewer review request cycle helps keep things moving.

It’s a huge boon to debugging — much faster than searching errors manually. Especially helpful on the types of errors you have to rabbit hole GitHub issue content chains to solve.

It’s very fast to get projects to MVP while following common structure/idioms, and can help write unit tests quickly for me. After the MVP stage it sucks and I go back to manually coding.

I use it to generate code snippets where documentation sucks. If you look at the ibis library in Python for example the docs are Byzantine and poorly organized. LLMs are better at finding the relevant docs than I am there. I mostly use LLM search instead of manual for doc search now.

I have a lot of custom scripts and calculators and apps that I made with it which keep me more focused on my actual work and accelerate things.

I regularly have the LLM help me write bash, Python, or jq scripts when I need to audit codebases for large refactors. That’s low-maintenance, one-off work that is complex to write but easy to verify. I never remember the syntax for bash and jq even after using them for years.
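
As an illustration of the kind of one-off audit script meant here (a sketch, not one of the poster's actual scripts): list every line that uses an identifier slated for removal. The helper name `find_usages`, the identifier `old_helper`, and the temporary files are all invented for the example.

```python
import re
import tempfile
from pathlib import Path
from typing import Iterator, Tuple


def find_usages(root: Path, name: str, pattern: str = "*.py") -> Iterator[Tuple[Path, int, str]]:
    """Yield (file, line number, line) for each whole-word use of name under root."""
    word = re.compile(rf"\b{re.escape(name)}\b")
    for path in sorted(root.rglob(pattern)):
        if not path.is_file():
            continue
        text = path.read_text(encoding="utf-8", errors="replace")
        for lineno, line in enumerate(text.splitlines(), start=1):
            if word.search(line):
                yield path, lineno, line.strip()


# Demo on a throwaway directory so the sketch runs anywhere.
with tempfile.TemporaryDirectory() as tmp:
    root = Path(tmp)
    (root / "a.py").write_text("from util import old_helper\nold_helper(1)\n", encoding="utf-8")
    (root / "b.py").write_text("def old_helper_v2(): pass\n", encoding="utf-8")
    hits = [(path.name, lineno) for path, lineno, _ in find_usages(root, "old_helper")]
    print(hits)  # [('a.py', 1), ('a.py', 2)]
```

The output is easy to check by hand against the files, which is what makes this sort of script a low-risk thing to delegate.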

I guess the short version is I tend to build tools for the AI, then let the LLM use those tools to improve and accelerate my workflows. That returns a lot of time back to me.

I do try vibe coding but end up in the same time sink traps the study found. If the LLM is ever wrong, you save more time forking the chat than trying to realign it, but it’s still likely to be slower. Repeat chats result in the same pitfalls for complex issues and bugs, so you have to abandon that state quickly.

Vibe coding small revisions can still be a bit faster and it’s great at helping me with documentation.

2
T tonytins@pawb.social

This post did not contain any content.
codexarcanum@lemmy.dbzer0.com wrote (last edited by codexarcanum@lemmy.dbzer0.com):

#83

It sounds like this guy was also relying on the AI to self-report status. Did any of this happen? Like is the replit AI really hooked up to a CLI, did it even make a DB to start with, was there anything useful in it, and did it actually delete it?

Or is this all just a long roleplaying session where this guy pretends to run a business and the AI pretends to do employee stuff for him?

Because 90% of this article is "I asked the AI and it said:" which is not a reliable source for information.

13
P panda_abyss@lemmy.ca

I explicitly told it eleven times in ALL CAPS not to do this. I am a little worried about safety now.

Well then, that settles it, this should never have happened.

I don’t think putting complex technical info in front of non technical people like this is a good idea. When it comes to LLMs, they cannot do any work that you yourself do not understand.

That goes for math, coding, health advice, etc.

If you don’t understand then you don’t know what they’re doing wrong. They’re helpful tools but only in this context.
lilb0kchoy@midwest.social wrote:

#84

When it comes to LLMs, they cannot do any work that you yourself do not understand.

And even if they could, how would you ever validate it if you can't understand it?

8
N natanael@infosec.pub

The Pink Elephant problem of LLMs. You can not reliably make them NOT do something.
jankatarch@lemmy.world wrote:

#85

Just say it 12 times next time

5
vxx@lemmy.world wrote (last edited by vxx@lemmy.world):

#86

Don't you have any security concerns with sending all your code and JIRA tickets to some company's servers? My boss wouldn't be pleased if I sent anything that's deemed a company secret over unencrypted channels.

4
O opisek@lemmy.world

First time I'm hearing them be related to vibe coding. They've been very respectable in the past, especially with their open-source CodeMirror.
jankatarch@lemmy.world wrote (last edited by jankatarch@lemmy.world):

#87

Yeah, they limited people to 3 projects and pushed AI to the front at some point.

They advertise themselves as a CLOUD IDE POWERED BY AI now.

2
V vxx@lemmy.world

What are they helpful tools for then? A study showed that they make experienced developers 19% slower.
lilb0kchoy@midwest.social wrote:

#88

I'm not the person you're replying to but the one thing I've found them helpful for is targeted search.

I can ask it a question and then access its sources from whatever response it generates to read and review myself.

Kind of a simpler, free LexisNexis.

6
T tonytins@pawb.social

This post did not contain any content.
transtronaut@lemmy.blahaj.zone wrote:

#89

The founder of SaaS business development outfit SaaStr has claimed AI coding tool Replit deleted a database despite his instructions not to change any code without permission.

Sounds like an absolute diSaaStr...

26
C codexarcanum@lemmy.dbzer0.com

It sounds like this guy was also relying on the AI to self-report status. Did any of this happen? Like is the replit AI really hooked up to a CLI, did it even make a DB to start with, was there anything useful in it, and did it actually delete it?

Or is this all just a long roleplaying session where this guy pretends to run a business and the AI pretends to do employee stuff for him?

Because 90% of this article is "I asked the AI and it said:" which is not a reliable source for information.
eestileib@lemmy.blahaj.zone wrote:

#90

It seemed like the llm had decided it was in a brat scene and was trying to call down the thunder.

6
V vxx@lemmy.world

Don't you have any security concerns with sending all your code and JIRA tickets to some company's servers? My boss wouldn't be pleased if I sent anything that's deemed a company secret over unencrypted channels.
panda_abyss@lemmy.ca wrote:

#91

The tool isn’t returning all code, but it is sending code.

I had discussions with my CTO and security team before integrating Claude Code.

I have to use Gemini in one specific workflow, and Gemini had a lot of landmines in how they use your data. Anthropic was easier to understand.

Anthropic also has some guidance for running Claude Code in a container with a firewall and your specified dev tools. It works, but that’s not my area of expertise.

The container doesn’t solve all the issues like using remote servers, but it does let you restrict what files and network requests Claude can access (so e.g. Claude can’t read your env vars or ssh key files).

I do try local LLMs but they’re not there yet on my machine for most use cases. Gemma 3n is decent if you need small model performance and tool calls, phi4 works but isn’t thinking (the thinking variants are awful), and I’m exploring dream coder and diffusion models. R1 is still one of the best local models but frequently overthinks, even the new release. Context window is the largest limiting factor I find locally.

2
