  • I could definitely write it, but probably not as fast, even with fighting it. The report I got in 25-30 minutes would normally take me closer to 45-60, what with having to research what to analyze, figure out how to parse the different formats of logs, break them up and collate them, and produce a pretty output. A rough sketch of that kind of collation is below.
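
    For a concrete sense of the kind of work meant here, a minimal sketch of multi-format log parsing and collation in Python. The formats, field names, and sample lines are hypothetical stand-ins, not the actual logs or report:

    ```python
    import re
    from datetime import datetime

    # Two hypothetical log formats standing in for the mixed inputs described above.
    FORMAT_A = re.compile(r"(?P<ts>\d{4}-\d{2}-\d{2} \d{2}:\d{2}:\d{2}) \[(?P<level>\w+)\] (?P<msg>.*)")
    FORMAT_B = re.compile(r"(?P<ts>\d{2}/\w{3}/\d{4}:\d{2}:\d{2}:\d{2}) (?P<msg>.*)")

    def parse(line):
        # Try each known format in turn; return (timestamp, message) or None.
        m = FORMAT_A.match(line)
        if m:
            return datetime.strptime(m["ts"], "%Y-%m-%d %H:%M:%S"), m["msg"]
        m = FORMAT_B.match(line)
        if m:
            return datetime.strptime(m["ts"], "%d/%b/%Y:%H:%M:%S"), m["msg"]
        return None

    def collate(lines):
        # Drop unparseable lines and merge the rest into one sorted timeline.
        events = [e for e in (parse(l) for l in lines) if e]
        return sorted(events, key=lambda e: e[0])

    if __name__ == "__main__":
        sample = [
            "2025-08-12 14:03:05 [ERROR] disk full",
            "12/Aug/2025:14:03:07 GET /health 200",
        ]
        for ts, msg in collate(sample):
            print(ts.isoformat(), msg)
    ```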

  • Software developer, here. (No, not a "vibe coder." I actually know how to read and write my own code and what it does.)

    Just had the opportunity to test GPT 5 as a coding assistant in Copilot for VS Code, which in my opinion is the only legitimately useful purpose for LLMs. (No, not to write everything for me, just to do some of the more tedious tasks faster.) The IDE itself can help keep them in line, because it detects when they screw up. Which is all the time, due to their nature. Even recent and relatively "good" models like Sonnet need constant babysitting.

    GPT 5 failed spectacularly. So badly, in fact, that I'm glad I only set it to analysis tasks and not to any write tasks. I will not be using it for anything else any time soon.

    Even when it gets it right, you have to then check it carefully. It feels like a net loss of speed most of the time. Reading and checking someone else's code is harder than writing your own.

  • Even when it gets it right, you have to then check it carefully. It feels like a net loss of speed most of the time. Reading and checking someone else's code is harder than writing your own.

    Have to agree on that. There's the variation: it's faster if you take its code verbatim, run it, and debug where there are obvious problems... but then you are vulnerable to unobvious problems, where a hacky way of doing it is weak to certain edge cases... and there's no real way to catch those.

    Reading its code, understanding it, and finding the problems at their core sounds as time-consuming as writing the code yourself.

  • Even when it gets it right, you have to then check it carefully. It feels like a net loss of speed most of the time. Reading and checking someone else's code is harder than writing your own.

    On the code completion side, I think it can do like 2 or 3 lines in particular scenarios. You have to have an instinct for "are the next three lines so blatantly obvious that it's actually worth reading the suggestion, or do I just ignore it because I know it's going to screw up without even looking?"

    Very, very, very rarely do I find prompt-driven coding to be useful: only for things that are very boilerplate but also very tedious. Like "let the user specify these three parameters in this CLI utility", and poof, you get reasonable argv handling pretty reliably. A minimal sketch of that kind of boilerplate follows below.
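
    For illustration, a minimal sketch of that three-parameter argv handling in Python. The utility and its parameter names are made up, not from the original post:

    ```python
    import argparse

    def main():
        # Hypothetical CLI with three parameters, the kind of argv
        # boilerplate an LLM tends to produce reliably.
        parser = argparse.ArgumentParser(description="Example utility")
        parser.add_argument("--input", required=True, help="path to the input file")
        parser.add_argument("--format", choices=["json", "csv"], default="json",
                            help="output format")
        parser.add_argument("--verbose", action="store_true",
                            help="enable verbose output")
        args = parser.parse_args()
        print(args.input, args.format, args.verbose)

    if __name__ == "__main__":
        main()
    ```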

    Rule of thumb: if a viable answer could be expected during an interview from a random junior applicant, it's worth giving the LLM a shot. If it's something that a junior developer could only get right after learning on the job a bit, then forget it; the LLM will be useless.

  • Despite the “official” coding score for GPT 5 being higher, Claude Sonnet still seems to blow it out of the water. That seems to suggest they are training to the test, and that the test must not be a very good test. Or they are lying.

    Problem with the "benchmarks" is Goodhart's Law: once a measure becomes a target, it ceases to be a good measure.

    The AI companies' obsession with these tests causes them to maniacally train on them, making them better at those tests, but that doesn't necessarily map to actual real-world usefulness. Occasionally you'll see a guy who interviews well but is pretty useless on the job. LLMs are basically that guy all the time, but at least they're useful because they're cheap and fast enough to be worth it for the super easy bits.