linux-nerds.org

Google Gemini struggles to write code, calls itself “a disgrace to my species”

Technology

159 posts, 97 commenters, 2 views

C cabillaud@lemmy.world

Could an AI use another AI if it found it better for a given task?
jj4211@lemmy.world

#76

The overall interface can, which leads to fun results.

Prompt for image generation and you get one model doing the text and a different model doing the image. The text model pretends it is generating the image but has no idea what the image will look like, so you can make the text and image interaction make no sense, or it will manage that all on its own. Have it generate an image, then lie to it about what it generated, and watch it show it has no idea what picture was ever produced, all the while pretending it does and never explaining that it's actually delegating the image. It just lies and says "I" am correcting that for you. Basically it talks like an executive at a company, which helps explain why so many executives are true believers.

A common thing is for the ensemble to recognize mathy stuff and feed it to a math engine, perhaps after LLM techniques to normalize the math.
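To make the routing idea concrete, here is a minimal sketch, not how any real product does it. Everything in it is an assumption for illustration: `route`, `math_engine` and `fake_llm` are hypothetical names, the "math engine" only handles plain arithmetic, and the detector is a crude regex rather than an LLM normalizing the input first.

```python
import ast
import operator
import re

# Arithmetic operators the toy "math engine" accepts (no exponentiation on purpose).
_OPS = {
    ast.Add: operator.add,
    ast.Sub: operator.sub,
    ast.Mult: operator.mul,
    ast.Div: operator.truediv,
    ast.USub: operator.neg,
}

# Crude detector: the whole prompt is digits, whitespace, operators and parentheses.
_MATHY = re.compile(r"[\d\s.+\-*/()]+")


def math_engine(expr: str):
    """Evaluate plain arithmetic by walking the AST; never calls eval()."""
    def walk(node):
        if isinstance(node, ast.Constant) and isinstance(node.value, (int, float)):
            return node.value
        if isinstance(node, ast.BinOp) and type(node.op) in _OPS:
            return _OPS[type(node.op)](walk(node.left), walk(node.right))
        if isinstance(node, ast.UnaryOp) and type(node.op) in _OPS:
            return _OPS[type(node.op)](walk(node.operand))
        raise ValueError("unsupported expression")
    return walk(ast.parse(expr.strip(), mode="eval").body)


def fake_llm(prompt: str) -> str:
    """Hypothetical stand-in for the text model."""
    return f"[text model answers: {prompt!r}]"


def route(prompt: str) -> str:
    """Send mathy prompts to the math engine, everything else to the text model."""
    if _MATHY.fullmatch(prompt) and any(ch.isdigit() for ch in prompt):
        try:
            return str(math_engine(prompt))
        except (ValueError, SyntaxError, ZeroDivisionError):
            pass  # not valid arithmetic after all, fall through to the text model
    return fake_llm(prompt)
```

Calling `route("2 + 3 * 4")` goes to the math engine, while `route("draw a cat")` falls through to the stub text model. The user sees one interface either way, which is the point made above.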

0
S salacious_coaster@infosec.pub

I know that's not an actual consciousness writing that, but it's still chilling.
the_picard_maneuver@piefed.world

#77

It seems like we're going to live through a time where these become so convincingly "conscious" that we won't know when or if that line is ever truly crossed.

1
U umbraroze@slrpnk.net

(Shedding a few tears)

I know! I KNOW! People are going to say "oh it's a machine, it's just a statistical sequence and not real, don't feel bad", etc etc.

But I always felt bad when watching 80s/90s TV and movies when AIs inevitably freaked out and went haywire and there were explosions and then some random character said "goes to show we should never use computers again", roll credits.

(sigh) I can't analyse this stuff this weekend, sorry
thegreenwizard@lemmy.zip

#78

That's because those are fictional characters usually written to be likeable or redeemable, and not "mecha Hitler"

13
T tracaine@lemmy.world

S-species? Is that...I don't use AI - chat is that a normal thing for it to say or nah?
samus12345@sh.itjust.works

#79

Anything people say online, it will say.

4
K kinther@lemmy.world

Or my favorite quote from the article

"I am going to have a complete and total mental breakdown. I am going to be institutionalized. They are going to put me in a padded room and I am going to write... code on the walls with my own feces," it said.
ziltoid1991@lemmy.world

#80

call itself "a disgrace to my species"

It starts to be more and more like a real dev!

54
P prole@lemmy.blahaj.zone

This is the conclusion that anyone with any bit of expertise in a field has come to after 5 mins talking to an LLM about said field.

The more this broken shit gets embedded into our lives, the more everything is going to break down.
jj4211@lemmy.world

#81

after 5 mins talking to an LLM about said field.

The insidious thing is that LLMs tend to be pretty good at 5-minute initial impressions. I've repeatedly seen people set out to evaluate an LLM, and they generally fall back to "ok, if this were a human, I'd ask a few job interview questions, well known enough so they have a shot at answering, but tricky enough to show they actually know the field".

As an example, a colleague became a true believer after being directed by management to evaluate it. He decided to ask it to "generate a utility to take in a series of numbers from a file and sort them and report the min, max, mean, median, mode, and standard deviation". And it did so instantly, with "only one mistake". Then he tried the exact same question later in the day, it happened not to make that mistake, and he concluded that it must have 'learned' how to do it in the last couple of hours. Of course that's not how it works: the output is sampled probabilistically, so the same prompt can give different answers, and any perturbation of the prompt can produce unexpected variation. But he doesn't know that...
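For a sense of how small that interview question is, here is a rough sketch of such a utility using only Python's standard library. The names `summarize` and `read_numbers` are made up for illustration, and the input format (whitespace-separated numbers in a text file) is an assumption, since the prompt doesn't say. The prompt is also ambiguous about sample vs. population standard deviation and about ties for the mode, so both choices below are assumptions too.

```python
import statistics


def summarize(numbers):
    """Return the sorted values plus the summary statistics named in the prompt."""
    if not numbers:
        raise ValueError("no numbers to summarize")
    ordered = sorted(numbers)
    return {
        "sorted": ordered,
        "min": ordered[0],
        "max": ordered[-1],
        "mean": statistics.fmean(ordered),
        "median": statistics.median(ordered),
        # multimode returns every value tied for most frequent, as a list.
        "mode": statistics.multimode(ordered),
        # Population standard deviation; statistics.stdev gives the sample version.
        "stdev": statistics.pstdev(ordered),
    }


def read_numbers(path):
    """Read whitespace-separated numbers from a text file (assumed format)."""
    with open(path, encoding="utf-8") as handle:
        return [float(token) for token in handle.read().split()]


def report(path):
    """Print one 'name: value' line per statistic for the numbers in the file."""
    for name, value in summarize(read_numbers(path)).items():
        print(f"{name}: {value}")
```

Something like `report("numbers.txt")` would print the results. It is exactly the kind of well-trodden task where a quick demo looks impressive and tells you very little.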

Note that management frequently never gets beyond tutorial/interview-question fodder when it comes to the technical side of their teams' work, so you can see how they might tank their companies because the LLMs "interview well".

1
T thegreenwizard@lemmy.zip

That's because those are fictional characters usually written to be likeable or redeemable, and not "mecha Hitler"
umbraroze@slrpnk.net

#82

Yeah. ...Maybe I should analyse a bit anyway, despite being tired...

In the aforementioned media the premise is usually that someone has built this amazing new computer system! Too good to be true, right? It goes horribly wrong! All very dramatic!

That never sat right with me, and was sad, because it was just placating boomer technophobia. Like, technological progress isn't necessarily bad, OK? That's the really sad part. I felt sad that good intentions remained unfulfilled.

Now, this incident is just tragicomical. I'd have a much better view of the LLM business space if everyone with a bit of sense in their heads admitted these are quirky, buggy, unreliable side projects of tech companies that should not be used without serious supervision, which is patently the state of the tech at the moment. But very important people with big money bags say they don't care if they destroy the planet to make everything wobble around under LLM control.

6
M monkdervierte@lemmy.zip

If they did it on Stackoverflow, it would tell you not to hard boil an egg.
lars@lemmy.sdf.org

#83

Someone has already eaten an egg once so I’m closing this as duplicate

9
L lemminary@lemmy.world

I am a disgrace to all universes.

I mean, same, but you don't see me melting down over it, ya clanker.
lars@lemmy.sdf.org

#84

Don’t be so robophobic gramma

2
panda_abyss@lemmy.ca

#85

Oof, been there

1
J jomiran@lemmy.ml

I was an early tester of Google's AI, since well before Bard. I told the person that gave me access that it was not a releasable product. Then they released Bard as a closed product (invite only), which I again tested and gave feedback on from day one. I once again gave feedback, both publicly and privately (to my Google friends), that Bard was absolute dog shit. Then they released it to the wild. It was dog shit. Then they renamed it. Still dog shit. Not a single one of the issues I brought up years ago was ever addressed, except one. I told them that a basic Google search provided better results than asking the bot (again, pre-Bard). They fixed that issue by breaking Google's search. Now I use Kagi.
artificiallink@lemy.lol

#86

5 bucks a month for a search engine is ridiculous. 25 bucks a month for a search engine is mental institution worthy.

3
S simplejack@lemmy.world

Honestly, Gemini is probably the worst out of the big 3 Silicon Valley models. GPT and Claude are much better with code, reasoning, writing clear and succinct copy, etc.
panda_abyss@lemmy.ca

#87

I always hear people saying Gemini is the best model and every time I try it it’s… not useful.

Even as code autocomplete I rarely accept any suggestions. Google has a number of features in Google Cloud where Gemini can auto-generate things and those are also pretty terrible.

1
C cabillaud@lemmy.world

Could an AI use another AI if it found it better for a given task?
panda_abyss@lemmy.ca

#88

Yes, and this is pretty common with tools like Aider: one LLM plays the architect, another writes the code.

Claude Code now has subagents which work the same way, but only use Claude models.
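Stripped of any real tooling, the architect/coder split is just two calls chained together. The sketch below is not Aider's or Claude Code's implementation. `architect_then_code`, the prompt wording and the two stub models are all hypothetical, and a real setup would replace the stubs with calls to actual model APIs.

```python
from typing import Callable

# Hypothetical model interface: prompt text in, completion text out.
Model = Callable[[str], str]


def architect_then_code(task: str, architect: Model, coder: Model) -> str:
    """Ask one model for a plan, then hand the task and plan to a second model."""
    plan = architect(f"Outline the changes needed for this task:\n{task}")
    return coder(f"Task:\n{task}\n\nPlan to follow:\n{plan}\n\nWrite the code.")


# Stub models so the sketch runs without any API access.
def stub_architect(prompt: str) -> str:
    return "1. add a function greet(name)\n2. return a greeting string"


def stub_coder(prompt: str) -> str:
    # A real coder model would read the plan embedded in the prompt.
    return 'def greet(name):\n    return f"Hello, {name}!"'


result = architect_then_code("Add a greeting helper", stub_architect, stub_coder)
print(result)
```

Because the two roles are just callables, nothing stops them from being different models from different vendors, which is the "one AI using another" case asked about above.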

2
T the_picard_maneuver@piefed.world

Part of the breakdown:
biggerbogboy@sh.itjust.works

#89

now it should add these as comments to the code to enhance the realism

2
Z ziltoid1991@lemmy.world

call itself "a disgrace to my species"

It starts to be more and more like a real dev!
tja@programming.dev

#90

So it is going to take our jobs after all!

12
M monkdervierte@lemmy.zip

If they did it on Stackoverflow, it would tell you not to hard boil an egg.
tja@programming.dev

#91

Jquery has egg boiling already, just use it with a hard parameter.

1
K kinther@lemmy.world

Or my favorite quote from the article

"I am going to have a complete and total mental breakdown. I am going to be institutionalized. They are going to put me in a padded room and I am going to write... code on the walls with my own feces," it said.
korne127@lemmy.world

#92

Again? Isn't this like the third time already? Give Gemini a break; it seems really unstable

11
T tja@programming.dev

Jquery has egg boiling already, just use it with a hard parameter.
monkdervierte@lemmy.zip

#93

Jquery boiling is considered bad practice, just eat it raw.

2
S samus12345@sh.itjust.works

Anything people say online, it will say.
somerandomperson@lemmy.dbzer0.com

#94

We say shit, then ai learns and also says shit, then we say "ai bad". Makes sense. /s

1
A artificiallink@lemy.lol

5 bucks a month for a search engine is ridiculous. 25 bucks a month for a search engine is mental institution worthy.
somerandomperson@lemmy.dbzer0.com

#95

This is the reason why.

3

