linux-nerds.org

Vibe coding service Replit deleted production database

Technology

118 posts 73 commenters 24 views

N natanael@infosec.pub

The Pink Elephant problem of LLMs. You cannot reliably make them NOT do something.
J This user is from outside of this forum
jankatarch@lemmy.world

wrote

#85

Just say 12 times next time

5
P panda_abyss@lemmy.ca

With vibe coding you do end up spending a lot of time waiting for prompts, so I get the results of that study.

I fall pretty deep in the power user category for LLMs, so I don’t really feel that the study applies well to me, but also I acknowledge I can be biased there.

I have custom proprietary MCPs for semantic search over my code bases that let the AI do repeated graph searches on my code (imagine combining a language server, ctags, networkx, and grep+fuzzy search). That is way faster than iteratively grepping and scanning code manually, with a low chance of LLM errors. By the time I open GitHub code search or run ripgrep, Claude has already prioritized and listed the modules I should investigate.
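To make the "graph searches on my code" idea concrete, here is a minimal sketch of one ingredient: a call graph built from source text, plus a breadth-first search over it. It is not the poster's tool, which is proprietary and not shown. The names `build_call_graph` and `reachable` are made up for illustration, and a plain dict stands in for networkx so the snippet runs with only the standard library.

```python
import ast
from collections import defaultdict, deque


def build_call_graph(source):
    """Map each top-level function name to the set of plain names it calls."""
    graph = defaultdict(set)
    for node in ast.parse(source).body:
        if isinstance(node, ast.FunctionDef):
            graph[node.name]  # register functions that call nothing
            for child in ast.walk(node):
                # Only direct calls like foo(); method calls are ignored here.
                if isinstance(child, ast.Call) and isinstance(child.func, ast.Name):
                    graph[node.name].add(child.func.id)
    return graph


def reachable(graph, start):
    """Breadth-first search: every function reachable from `start`."""
    seen, queue = {start}, deque([start])
    while queue:
        for callee in graph.get(queue.popleft(), ()):
            if callee not in seen:
                seen.add(callee)
                queue.append(callee)
    return seen - {start}
```

An agent-facing tool would presumably wrap a query like `reachable(graph, "handler_name")` and return the result as a ranked list of modules to inspect, but that part is an assumption about how such a tool might be wired up.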

That tool alone with an LLM can save me half a day of research and debugging on complex tickets, which pays for an AI subscription alone. I have other internal tools to accelerate work too.

I use it to organize my JIRA tickets and plan my daily goals. I actually get Claude to do a lot of triage for me before I even start a task, which cuts the investigation phase to a few minutes on small tasks.

I use it to review all my PRs before I ask a human to look. It catches a lot of small things and can correct them, so the PR avoids the bike-shedding nitpicks some reviewers love. Claude can do this; Copilot will only ever point out nitpicks, so the model makes a huge difference here. But regardless, one fewer review request cycle helps keep things moving.

It’s a huge boon to debugging — much faster than searching errors manually. Especially helpful on the types of errors you have to rabbit hole GitHub issue content chains to solve.

It’s very fast to get projects to MVP while following common structure/idioms, and can help write unit tests quickly for me. After the MVP stage it sucks and I go back to manually coding.

I use it to generate code snippets where documentation sucks. If you look at the ibis library in Python for example the docs are Byzantine and poorly organized. LLMs are better at finding the relevant docs than I am there. I mostly use LLM search instead of manual for doc search now.

I have a lot of custom scripts and calculators and apps that I made with it which keep me more focused on my actual work and accelerate things.

I regularly have the LLM help me write bash or Python or jq scripts when I need to audit codebases for large refactors. That's low-maintenance, one-off work that is complex to write but easy to verify. I never remember the syntax for bash and jq even after using them for years.
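As an illustration of the kind of one-off audit script meant here, the following sketch lists every call site of a given function name across a tree of Python files. The function `find_calls` and the name `old_api` in the usage note are hypothetical; the post does not say which scripts were actually written.

```python
import ast
from pathlib import Path


def find_calls(root, func_name):
    """Return (path, line) for every call to `func_name` in .py files under `root`."""
    hits = []
    for path in sorted(Path(root).rglob("*.py")):
        try:
            tree = ast.parse(path.read_text(encoding="utf-8"))
        except (SyntaxError, UnicodeDecodeError):
            continue  # skip files that do not parse
        for node in ast.walk(tree):
            if isinstance(node, ast.Call):
                target = node.func
                # foo(...) is an ast.Name; obj.foo(...) is an ast.Attribute.
                if isinstance(target, ast.Name):
                    name = target.id
                else:
                    name = getattr(target, "attr", None)
                if name == func_name:
                    hits.append((str(path), node.lineno))
    return hits
```

Calling `find_calls("src", "old_api")` gives a checklist of locations to touch during the refactor, and the output is easy to spot-check against a plain grep.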

I guess the short version is I tend to build tools for the AI, then let the LLM use those tools to improve and accelerate my workflows. That returns a lot of time back to me.

I do try vibe coding but end up in the same time-sink traps the study found. If the LLM is ever wrong, you save more time forking the chat than trying to realign it, but it's still likely to be slower. Repeat chats result in the same pitfalls for complex issues and bugs, so you have to abandon that state quickly.

Vibe coding small revisions can still be a bit faster and it’s great at helping me with documentation.
vxx@lemmy.world

wrote, last edited by vxx@lemmy.world

#86

Don't you have any security concerns with sending all your code and JIRA tickets to some company's servers? My boss wouldn't be pleased if I sent anything that's deemed a company secret over unencrypted channels.

4
O opisek@lemmy.world

First time I'm hearing them be related to vibe coding. They've been very respectable in the past, especially with their open-source CodeMirror.
jankatarch@lemmy.world

wrote, last edited by jankatarch@lemmy.world

#87

Yeah, they limited people to 3 projects and pushed AI to the front at some point.

They advertise themselves as a CLOUD IDE POWERED BY AI now.

2
V vxx@lemmy.world

What are they helpful tools for then? A study showed that they make experienced developers 19% slower.
lilb0kchoy@midwest.social

wrote

#88

I'm not the person you're replying to but the one thing I've found them helpful for is targeted search.

I can ask it a question and then access its sources from whatever response it generates to read and review myself.

Kind of a simpler, free LexisNexis.

6
T tonytins@pawb.social

This post did not contain any content.
transtronaut@lemmy.blahaj.zone

wrote

#89

The founder of SaaS business development outfit SaaStr has claimed AI coding tool Replit deleted a database despite his instructions not to change any code without permission.

Sounds like an absolute diSaaStr...

26
C codexarcanum@lemmy.dbzer0.com

It sounds like this guy was also relying on the AI to self-report status. Did any of this happen? Like is the replit AI really hooked up to a CLI, did it even make a DB to start with, was there anything useful in it, and did it actually delete it?

Or is this all just a long roleplaying session where this guy pretends to run a business and the AI pretends to do employee stuff for him?

Because 90% of this article is "I asked the AI and it said:" which is not a reliable source for information.
eestileib@lemmy.blahaj.zone

wrote

#90

It seemed like the llm had decided it was in a brat scene and was trying to call down the thunder.

6
V vxx@lemmy.world

Don't you have any security concerns with sending all your code and JIRA tickets to some company's servers? My boss wouldn't be pleased if I sent anything that's deemed a company secret over unencrypted channels.
panda_abyss@lemmy.ca

wrote

#91

The tool isn’t returning all code, but it is sending code.

I had discussions with my CTO and security team before integrating Claude Code.

I have to use Gemini in one specific workflow and Gemini had a lot of landmines in how they use your data. Anthropic was easier to understand.

Anthropic also has some guidance for running Claude Code in a container with firewall and your specified dev tools, it works but that’s not my area of expertise.

The container doesn’t solve all the issues like using remote servers, but it does let you restrict what files and network requests Claude can access (so e.g. Claude can’t read your env vars or ssh key files).

I do try local LLMs but they’re not there yet on my machine for most use cases. Gemma 3n is decent if you need small model performance and tool calls, phi4 works but isn’t thinking (the thinking variants are awful), and I’m exploring dream coder and diffusion models. R1 is still one of the best local models but frequently overthinks, even the new release. Context window is the largest limiting factor I find locally.

2
T tonytins@pawb.social

This post did not contain any content.
blackmist@feddit.uk

wrote

#92

The world's most overconfident virtual intern strikes again.

Also, who the flying fuck are either of these companies? 1000 records is nothing. That's a fucking text file.

24
P panda_abyss@lemmy.ca

They could hire on a contractor and eschew all those costs.

I’ve done contract work before, this seems a good fit (defined problem plus budget, unknown timeline, clear requirements)
dan@upvote.au

wrote

#93

That's what I meant by hiring a self-employed freelancer. I don't know a lot about contracting so maybe I used the wrong phrase.

1
L lilb0kchoy@midwest.social

I'm not the person you're replying to but the one thing I've found them helpful for is targeted search.

I can ask it a question and then access its sources from whatever response it generates to read and review myself.

Kind of a simpler, free LexisNexis.
panda_abyss@lemmy.ca

wrote

#94

I built a bunch of local search tools with MCP and that's where I get a lot of my value out of it.

RAG workflows are incredibly useful and with modern agents and tool calls work very well.

They kind of went out of style but it’s a perfect use case.

2
P panda_abyss@lemmy.ca

The tool isn’t returning all code, but it is sending code.

I had discussions with my CTO and security team before integrating Claude Code.

I have to use Gemini in one specific workflow and Gemini had a lot of landmines in how they use your data. Anthropic was easier to understand.

Anthropic also has some guidance for running Claude Code in a container with firewall and your specified dev tools, it works but that’s not my area of expertise.

The container doesn’t solve all the issues like using remote servers, but it does let you restrict what files and network requests Claude can access (so e.g. Claude can’t read your env vars or ssh key files).

I do try local LLMs but they’re not there yet on my machine for most use cases. Gemma 3n is decent if you need small model performance and tool calls, phi4 works but isn’t thinking (the thinking variants are awful), and I’m exploring dream coder and diffusion models. R1 is still one of the best local models but frequently overthinks, even the new release. Context window is the largest limiting factor I find locally.
6nk06@sh.itjust.works

wrote

#95

I have to use Gemini in one specific workflow

I would love some story on why AI is needed at all.

4
B balder1991@lemmy.world

All I see is people chatting with an LLM as if it was a person. “How bad is this on a scale of 1 to 100”, you’re just doomed to get some random answer based solely on whatever context is being fed in the input and that you probably don’t know the extent of it.

Trying to make the LLM “see its mistakes” is a pointless exercise. Getting it to “promise” something is useless.

The issue with LLMs working with human languages is people eventually wanting to apply human things to LLMs such as asking why as if the LLM knows of its own decision process. It only takes an input and generates an output, it won’t be able to have any “meta thought” explanation about why it outputted X and not Y in the previous prompt.
6nk06@sh.itjust.works

wrote

#96

How bad is this on a scale of sad emoji to eggplant emoji.

Children are replacing us, it's terrifying.

0
V vxx@lemmy.world

What are they helpful tools for then? A study showed that they make experienced developers 19% slower.
wraithgear@lemmy.world

wrote

#97

ok so, i have large reservations about how LLMs are used. but when used correctly they can be helpful. but where and how?

if you were to use it as a tutor, the same way you would ask a friend what a segment of code does, it will break down the code and tell you. and it will get as nitty-gritty and elementary-school level as you wish, without judgement, and in whatever manner you prefer. it will recommend best practices, and will tell you why your code may not work, with the understanding that it does not have knowledge of the project you are working on. (it’s not going to know the name of the function you are trying to load, but it will recommend checking for that in troubleshooting.)

it can rtfm and give you the parts you need for anything with available documentation, and it will link to it so you can verify it, which you should do often, just like you were taught to do with wikipedia articles.

if you ask it for code, prepare to go through each line like a worksheet from high school to point out all the problems. while that's good exercise, for a practical case, being the task you are on, it would be far better to write it yourself, because you should know the particulars and scope.

also it will format your code and provide informational comments if you can’t be bothered, though it will be generic.

again, treat it correctly for its scope, not what it’s sold as by charlatans.

6
6 6nk06@sh.itjust.works

I have to use Gemini in one specific workflow

I would love some story on why AI is needed at all.
panda_abyss@lemmy.ca

wrote

#98

Batch process turning unstructured free form text data into structured outputs.

As a crappy example imagine if you wanted to download metadata about your albums but they’re all labelled “Various Artists”. You can use an LLM call to read the album description and fix the track artists for the tracks, now you can properly organize your collection.

I’m using the same idea, different domain and a complex set of inputs.

It can be much more cost effective than manually spending days tagging data and writing custom importers.

You can definitely go lighter than LLMs. You can use gensim to do category matching, you can use sentence transformers and nearest neighbours (this is basically what Semantle does), but LLM performed the best on more complex document input.
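The "sentence transformers and nearest neighbours" route mentioned above can be sketched roughly as follows. To keep it runnable without extra packages, a toy bag-of-words vector stands in for a real sentence-embedding model, so this shows only the matching step, not the quality you would get from gensim or sentence-transformers. All function names and the example labels are made up for illustration.

```python
import math
import re
from collections import Counter


def embed(text):
    """Toy 'embedding': lowercase word counts (stand-in for a real sentence vector)."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))


def cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(
        sum(v * v for v in b.values())
    )
    return dot / norm if norm else 0.0


def nearest_label(text, labelled_examples):
    """Return the label of the (example_text, label) pair most similar to `text`."""
    query = embed(text)
    best = max(labelled_examples, key=lambda pair: cosine(query, embed(pair[0])))
    return best[1]
```

Swapping `embed` for a real embedding model keeps the rest unchanged. As the post says, this lighter approach tends to fall short once the input documents get more complex.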

3
E eestileib@lemmy.blahaj.zone

It seemed like the llm had decided it was in a brat scene and was trying to call down the thunder.
skyezopen@lemmy.world

wrote

#99

Oops I dweted evewyfing 🥺

4
T tonytins@pawb.social

This post did not contain any content.
sheetzoos@lemmy.world

wrote

#100

Replit sucks

0
H hisao@ani.social

This only proves some of them can't solve all complex problems. I'm only claiming some of them can solve some complex problems. Not only by remembering exact solutions, but by remembering steps and actions used in building those solutions, generalizing, and transferring them to new problems. Anyone who tries using it for programming, will discover this very fast.

PS: Some of them were already used to solve problems and find patterns in data humans weren't able to get other ways before (particle research in CERN, bioinformatics, etc).
chairmanmeow@programming.dev

wrote

#101

You're referring to more generic machine learning, not LLMs. These are vastly different technologies.

And I have used them for programming, I know their limitations. They don't really transfer solutions to new problems, not on their own anyway. It usually requires pretty specific prompting. They can at best apply solutions to problems, but even then it's not a truly generalised thing, even if it seems to work for many cases.

That's the trap you're falling into as well; LLMs look like they're doing all this stuff, because they're trained on data produced by people who actually do so. But they can't think of something truly novel. LLMs are mathematically unable to truly generalize, it would prove P=NP if they did (there was a paper from a researcher in IIRC Nijmegen that proved this). She also proved they won't scale, and lo and behold LLM performance is plateauing hard (except in very synthetic, artificial benchmarks designed to make LLMs look good).

0
T tabarnaski@sh.itjust.works

“The [AI] safety stuff is more visceral to me after a weekend of vibe hacking,” Lemkin said. “I explicitly told it eleven times in ALL CAPS not to do this. I am a little worried about safety now.”

This sounds like something straight out of The Onion.
chaoticentropy@feddit.uk

wrote

#102

Even after he used "ALL CAPS"?!? Impossible!

8
P panda_abyss@lemmy.ca

Batch process turning unstructured free form text data into structured outputs.

As a crappy example imagine if you wanted to download metadata about your albums but they’re all labelled “Various Artists”. You can use an LLM call to read the album description and fix the track artists for the tracks, now you can properly organize your collection.

I’m using the same idea, different domain and a complex set of inputs.

It can be much more cost effective than manually spending days tagging data and writing custom importers.

You can definitely go lighter than LLMs. You can use gensim to do category matching, you can use sentence transformers and nearest neighbours (this is basically what Semantle does), but LLM performed the best on more complex document input.
vxx@lemmy.world

wrote

#103

That's pretty much what google says they use AI for, for structuring.

Thanks for your insight.

1
C chairmanmeow@programming.dev

You're referring to more generic machine learning, not LLMs. These are vastly different technologies.

And I have used them for programming, I know their limitations. They don't really transfer solutions to new problems, not on their own anyway. It usually requires pretty specific prompting. They can at best apply solutions to problems, but even then it's not a truly generalised thing, even if it seems to work for many cases.

That's the trap you're falling into as well; LLMs look like they're doing all this stuff, because they're trained on data produced by people who actually do so. But they can't think of something truly novel. LLMs are mathematically unable to truly generalize, it would prove P=NP if they did (there was a paper from a researcher in IIRC Nijmegen that proved this). She also proved they won't scale, and lo and behold LLM performance is plateauing hard (except in very synthetic, artificial benchmarks designed to make LLMs look good).
hisao@ani.social

wrote

#104

They don’t really transfer solutions to new problems

Let's say there is a binary format that some old game uses (Doom). Some of its lumps can store indexed images, where each pixel is an index into a color palette that is stored in another lump. There's also a programming language called Rust, a little-known and little-used library that can read binary data in that format, and a Rust GUI library that not many people have used either.

Would you consider it an "ability to transfer solutions to new problems" that it was able to implement extracting image data from that binary format using the library, extracting palette data from that binary format, converting the indexed image into regular RGBA image data using the extracted palette, and then rendering that as a window background using that GUI library, the only reference for which is a file with the names and type signatures of its functions? There's no similar Rust code in the wild at all for any of those scenarios.

Most of this it was able to do from a few small prompts, maybe even from the first one. There sure were a few small issues along the way that required reprompting and figuring things out together with it. Stuff like this can take half an hour with AI, while doing the whole thing fully manually could easily take multiple days, just for figuring out the APIs of the libraries involved and the intricacies of recoding an indexed image to RGBA. For me this is overpowered enough even right now, and it's likely going to improve even more in the future.
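For readers unfamiliar with the conversion step described in this post, here is a minimal sketch of expanding palette indices into RGBA bytes. It is written in Python, not Rust, and is not the poster's code. It assumes the pixel indices have already been decoded into a flat sequence and that the palette is 768 bytes holding 256 consecutive (R, G, B) triples, which is how a single Doom palette is usually described. The `transparent_index` parameter is a hypothetical convenience, not part of the format.

```python
def indexed_to_rgba(indices, palette, transparent_index=None):
    """Expand 8-bit palette indices into RGBA bytes.

    palette: 768 bytes, 256 consecutive (R, G, B) triples (assumed layout).
    transparent_index: optional index to emit with alpha 0 (hypothetical helper).
    """
    if len(palette) != 768:
        raise ValueError("expected a 256-colour RGB palette (768 bytes)")
    out = bytearray()
    for i in indices:
        r, g, b = palette[3 * i : 3 * i + 3]  # look up the colour triple
        alpha = 0 if i == transparent_index else 255
        out += bytes((r, g, b, alpha))
    return bytes(out)
```

The resulting buffer holds 4 bytes per pixel in row-major order, which is the layout most GUI toolkits accept for a raw RGBA image.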

0
