linux-nerds.org

Your browser does not seem to support JavaScript. As a result, your viewing experience will be diminished, and you have been placed in read-only mode.

Please download a browser that supports JavaScript, or enable it if it's disabled (i.e. NoScript).

OpenAI just launched its new ChatGPT Agent that can make as many as 1 complicated cupcake order per hour, but even Sam Altman says you probably shouldn't trust it for 'high-stakes uses'

Technology

56 Beiträge 34 Kommentatoren 329 Aufrufe

T tonytins@pawb.social

OpenAI launched ChatGPT Agent on Thursday, its latest effort in the industry-wide pursuit to turn AI into a profitable enterprise—not just one that eats investors' billions. In its announcement blog, OpenAI says its Agent "can now do work for you using its own computer," but CEO Sam Altman warns that the rollout presents unpredictable risks.

[...]

OpenAI research lead Lisa Fulford told Wired that she used Agent to order "a lot of cupcakes," which took the tool about an hour, because she was very specific about the cupcakes.
R This user is from outside of this forum
R This user is from outside of this forum
romantired@shibanu.app

schrieb zuletzt editiert von

#23

I need an agent who would set up DevOps for me. Then robots would definitely be the ones working hard, not humans.
1 Antwort Letzte Antwort

3
W wise_pancake@lemmy.ca

Okay but that’s not what easier means.

Easier would be to call the bakery or spending 10 minutes browsing their website, asking to cast, and checking out.

I don’t want to spend an hour on tasks that would normally take 10 minutes. My executive dysfunctions already make me good at doing that.

This might be a revolutionary idea, but what if they helped me do that take an hour in 10 minutes?

I’m just putting that idea out there totally for free in case any AI companies want to jump on that opportunity.
E This user is from outside of this forum
E This user is from outside of this forum
eyekaytee@aussie.zone

schrieb zuletzt editiert von

#24

I don’t want to spend an hour on tasks that would normally take 10 minutes.

I don't get it, do you think she spent an hour talking to ChatGPT to try and get it to order doughnuts?
1 Antwort Letzte Antwort

3
P paraphrand@lemmy.world

It really is a nightmare brewing. And they will hide behind excuses and keep it all opaque unless they are strongly regulated.
O This user is from outside of this forum
O This user is from outside of this forum
opavader@lemmy.world

schrieb zuletzt editiert von opavader@lemmy.world

#25

regulated by who ?
our senate and congress is filled by pimps who work for pedophiles like epstien and cheer genocider scum murdering children on daily basis. this include the “lesser evil” party. they had 4 years to release the pedo list or even try to slow down the genocide.
they are not gonna give a fck about us working 3 jobs just to pay rent and live on prison food.

sad reality is that after a certain threshold in a parasites-host dynamic, there is no other ending other than host dying because parasites has grown too big for it too feed. so unless another deadly parasite like cia or kgb luigi the 1%, the rest 99% are dead.
1 Antwort Letzte Antwort

2
E evotech@lemmy.world

It’s a starting point
W This user is from outside of this forum
W This user is from outside of this forum
wise_pancake@lemmy.ca

schrieb zuletzt editiert von

#26

I use agents a lot and have written several MCP servers now, the tasks I automate aren't things like order cupcakes, it's mainly the glue between complex things.

I still can't get Claude to nicely open a JIRA ticket for me, but I can get it to read through a sequence of connected documents and filter that into.

I don't think agents are ready for the main event and these are some poor examples of their power.

I'm not saying they won't improve, but using the right tool for the right job is critical. An hour to order cupcakes is silly even for an llm.
E E 2 Antworten Letzte Antwort

3
W wazowski@lemmy.world

I spent maybe 90 minutes trying to get ChatGPT to write me a fucking AppleScript or bash to copy all calendar events from a source calendar to a destination. That shit does not work.
E This user is from outside of this forum
E This user is from outside of this forum
eyekaytee@aussie.zone

schrieb zuletzt editiert von

#27

for coding you want to use claude

if you don’t want to pay for claude after so many messages what you can do is use mistral to code it up then use claude to proof check the code
W 1 Antwort Letzte Antwort

3
O opavader@lemmy.world

unfortunately any ai service is going to make things worse.
right now we can discover and choose. with search and browsing dead, ai provider will shove the product giving them the highest cut aka most garbage or snake oil products.

even today targeted advertising for poor people is filled with betting, lottery & poker game. similarly elder people are primarily shown ads of miracle cure for chronic illness and scammy religious crap.

edit: switch to kagi. its paid but well worth it.
searchXNG is also a good alternative if you have got time for hosting it urself.
T This user is from outside of this forum
T This user is from outside of this forum
theunknownmuncher@lemmy.world

schrieb zuletzt editiert von

#28

Kagi is all in on AI. Its the AI slop version of a search ranking algorithm
E 1 Antwort Letzte Antwort

1
M magicshel@lemmy.zip

It won't do that well. What you have to do is ask it to help you leverage your existing development skills in an unfamiliar domain. I used it to help me write a python program to authenticate, pull and filter data from a GCP firestore database and create an XLSX with summary and detail sheets.

I've never used Python before in my life. It took me about 4 hours. Of course I've been doing that sort of thing in Java for many years. Turned out I wrote that faster in Python than I could in Java. Configuring the connection to that database in Python was so simple compared to Java.

The stuff it wrote was sometimes incomplete or wrong in subtle ways, but I could see the bits that didn't make sense which helped me focus on those things and ask better questions to help me figure it out. I think the last hour was just me tweaking stuff by myself because I didn't need help with it by that point.
T This user is from outside of this forum
T This user is from outside of this forum
theunknownmuncher@lemmy.world

schrieb zuletzt editiert von

#29

Anyone who already knows another programming language but has never used python in their life can write a simple python app quickly, regardless
Z 1 Antwort Letzte Antwort

1
B brsrklf@jlai.lu

I needed about 30 minutes to do a python application from scratch that took linear JSON data files, merged them and presented them as a tree in a GUI.

Before that I had barely done anything in python, basically could do a basic function declaration with a simple operation and nothing else. I even didn't have a lot of experience with UI at all.

But like you I had experience with java and such, and those skills transfer. All it took was searching basic syntax/related code examples and required library imports. And I mean basic, search engine search, not AI answers.

All I'm saying is, I really don't think AI is providing anything a lot more efficient than doing a good old crawl through API docs and stack overflow. So the fact it's using tremendous amounts of resources to maybe achieve a 10% efficiency boost is bothering me a lot.
A This user is from outside of this forum
A This user is from outside of this forum
adespoton@lemmy.ca

schrieb zuletzt editiert von

#30
There’s also the fact that
1. It’s only really good at this if you want it to generate Python, PowerShell, bash, or C++ code. Try any other language and it quickly assumes you’re using outdated and often incompatible libraries or doesn’t really understand how the language functions.
2. at the end of it all, neither you nor the AI has learned anything new; you’ll have to put in the exact same amount of work the next time. If you do it yourself, then over time that 10% advantage goes away.
Now, these things could both change over time, but humans are much more efficient to train than current state of the art probability sieves we call GenAI.
Z 1 Antwort Letzte Antwort

2
T theunknownmuncher@lemmy.world

Anyone who already knows another programming language but has never used python in their life can write a simple python app quickly, regardless
Z This user is from outside of this forum
Z This user is from outside of this forum
zexks@lemmy.world

schrieb zuletzt editiert von

#31

No you can't if you don't know the libraries. Python is entirely dependent on what libraries you include. If you don't know what you need you can't do shit.
T 1 Antwort Letzte Antwort

2
A adespoton@lemmy.ca
There’s also the fact that
1. It’s only really good at this if you want it to generate Python, PowerShell, bash, or C++ code. Try any other language and it quickly assumes you’re using outdated and often incompatible libraries or doesn’t really understand how the language functions.
2. at the end of it all, neither you nor the AI has learned anything new; you’ll have to put in the exact same amount of work the next time. If you do it yourself, then over time that 10% advantage goes away.
Now, these things could both change over time, but humans are much more efficient to train than current state of the art probability sieves we call GenAI.
Z This user is from outside of this forum
Z This user is from outside of this forum
zexks@lemmy.world

schrieb zuletzt editiert von

#32

It's only assuming if you aren't specific enough. And you do know their training is usually a year or two or 3 old. So they don't know about whatever new shit your trying to work with.
1 Antwort Letzte Antwort

0
W wise_pancake@lemmy.ca

I use agents a lot and have written several MCP servers now, the tasks I automate aren't things like order cupcakes, it's mainly the glue between complex things.

I still can't get Claude to nicely open a JIRA ticket for me, but I can get it to read through a sequence of connected documents and filter that into.

I don't think agents are ready for the main event and these are some poor examples of their power.

I'm not saying they won't improve, but using the right tool for the right job is critical. An hour to order cupcakes is silly even for an llm.
E This user is from outside of this forum
E This user is from outside of this forum
evotech@lemmy.world

schrieb zuletzt editiert von

#33

It’s examples for the common guy in the streets who don’t know what an mcp server is.
1 Antwort Letzte Antwort

3
T tonytins@pawb.social

OpenAI launched ChatGPT Agent on Thursday, its latest effort in the industry-wide pursuit to turn AI into a profitable enterprise—not just one that eats investors' billions. In its announcement blog, OpenAI says its Agent "can now do work for you using its own computer," but CEO Sam Altman warns that the rollout presents unpredictable risks.

[...]

OpenAI research lead Lisa Fulford told Wired that she used Agent to order "a lot of cupcakes," which took the tool about an hour, because she was very specific about the cupcakes.
N This user is from outside of this forum
N This user is from outside of this forum
nthavoc@lemmy.today

schrieb zuletzt editiert von

#34

That's quite a bold statement to make since he now has US military contracts. What is he making cupcakes for the Pentagon?
A 1 Antwort Letzte Antwort

10
Z zexks@lemmy.world

No you can't if you don't know the libraries. Python is entirely dependent on what libraries you include. If you don't know what you need you can't do shit.
T This user is from outside of this forum
T This user is from outside of this forum
theunknownmuncher@lemmy.world

schrieb zuletzt editiert von theunknownmuncher@lemmy.world

#35

No you can't if you don't know the libraries

IDE.

Python is entirely dependent on what libraries you include

??

If you don't know what you need you can't do shit.

IDE.

The problems you propose in your comment are not only greatly exaggerated but already been solved for decades using conventional tools AND apply to literally all languages, having nothing at all to do with python. Good try! My statement holds true.

Maybe your assumption is that you're in a cave writing code in pencil on paper, but that's not a typical working condition. If you have access to Claude to use as a crutch, then you have access to search for an available python library and read some "Getting Started" paragraphs.

Seriously, if the only real value that AI provides is "you don't need to know the libraries you're using" that's not quite as strong of an argument as you think it is lmaooo "knowing the libraries" isn't exactly an existing challenge or software engineering problem that people struggle with...
M Z 2 Antworten Letzte Antwort

2
C cosmonova@lemmy.world

So much for the internet. We somehow managed to turn one of humanity’s greatest achievements into a hateful echo chamber we use for warfare first and then into a blackbox where inefficient AI agents communicate with each other in the most inefficient way so the planet can cook us alive even faster. God forbid just calling up a bakery to order some cupcakes.
W This user is from outside of this forum
W This user is from outside of this forum
webp@mander.xyz

schrieb zuletzt editiert von

#36

Companies will dump billions into AI to fuck everyone over but the transition to clean energy is always too expensive.
R 1 Antwort Letzte Antwort

18
T theunknownmuncher@lemmy.world

Kagi is all in on AI. Its the AI slop version of a search ranking algorithm
E This user is from outside of this forum
E This user is from outside of this forum
ebolapie@lemmy.world

schrieb zuletzt editiert von

#37

Kagi has AI tools but they don't shove it down your throat. I don't understand what "all in on AI" means in this context. The company has said that they want to use AI like they use JavaScript, ie they want to use it as a tool but their product should work well without it.
1 Antwort Letzte Antwort

0
E eyekaytee@aussie.zone

for coding you want to use claude

if you don’t want to pay for claude after so many messages what you can do is use mistral to code it up then use claude to proof check the code
W This user is from outside of this forum
W This user is from outside of this forum
wazowski@lemmy.world

schrieb zuletzt editiert von

#38

tx, will try it some time.
1 Antwort Letzte Antwort

3
T tonytins@pawb.social

OpenAI launched ChatGPT Agent on Thursday, its latest effort in the industry-wide pursuit to turn AI into a profitable enterprise—not just one that eats investors' billions. In its announcement blog, OpenAI says its Agent "can now do work for you using its own computer," but CEO Sam Altman warns that the rollout presents unpredictable risks.

[...]

OpenAI research lead Lisa Fulford told Wired that she used Agent to order "a lot of cupcakes," which took the tool about an hour, because she was very specific about the cupcakes.
R This user is from outside of this forum
R This user is from outside of this forum
rizzrustbolt@lemmy.world

schrieb zuletzt editiert von

#39

So now they're raiding We Bare Bears for ideas?
S 1 Antwort Letzte Antwort

2
T theunknownmuncher@lemmy.world

No you can't if you don't know the libraries

IDE.

Python is entirely dependent on what libraries you include

??

If you don't know what you need you can't do shit.

IDE.

The problems you propose in your comment are not only greatly exaggerated but already been solved for decades using conventional tools AND apply to literally all languages, having nothing at all to do with python. Good try! My statement holds true.

Maybe your assumption is that you're in a cave writing code in pencil on paper, but that's not a typical working condition. If you have access to Claude to use as a crutch, then you have access to search for an available python library and read some "Getting Started" paragraphs.

Seriously, if the only real value that AI provides is "you don't need to know the libraries you're using" that's not quite as strong of an argument as you think it is lmaooo "knowing the libraries" isn't exactly an existing challenge or software engineering problem that people struggle with...
M This user is from outside of this forum
M This user is from outside of this forum
magicshel@lemmy.zip

schrieb zuletzt editiert von magicshel@lemmy.zip

#40

It sounds like you are a much better developer than me, but to be fair I've had to teach myself everything using nothing but books and Google for thirty years. I've rarely had the luxury of working with someone who had the knowledge to mentor me, and never got a degree outside an AAS in electronics, so I've probably missed some critical skills along the way.

In a lot of ways, the AI fills that role because it's better at answering questions than it is writing code. Earlier today it was explaining to me how a DOM selector could return a stale element in some cases in a failing end to end test. It took a few back and forths with some code examples before I really understood why the selectors might not be working.

It also suggested some code changes that I had to push back on because, even though the code had errors, the errors weren't causing the problem. While building an array of validators I had awaited them, causing them to run serially instead of in parallel during Promise.all(). So you definitely have to know what you're doing to avoid having the AI waste your time (or at least more time than it takes to push back).

I'm still trying to debug it, but without the AI, I'd be googling the fuck out of typescript syntax, JavaScript idiosyncrasies, and a whole testing framework I've never seen before.

So...

if the only real value that AI provides is "you don't need to know the libraries you're using"

...returns false.
1 Antwort Letzte Antwort

1
B brsrklf@jlai.lu

Did the AI gave you a starting point that would be very different from a bit of code someone submitted 10 years ago on stack exchange? Because in my experience, everything has already been asked and answered. This includes the most basic and naive stuff, and often I am very grateful for it, because, yeah, sometimes I need someone to guide me through the most basic stuff.

In fact, the AI needed that exact knowledge base and a bunch more to exist in the first place. It's just vaguely competent at retrieving it.

Anyway, I didn't say I had no experience, just the most minimal python experience. There are definitely a few quirks I had to learn (the data structures mostly), but for the rest is mostly finding the right method in the reference library, like you would in java.
M This user is from outside of this forum
M This user is from outside of this forum
magicshel@lemmy.zip

schrieb zuletzt editiert von

#41

Logically, you would be right. My practical experience is I waste a lot less time trying to google multiple explanations something because one by itself isn't helping me figure it out, writing bugged PoC test code and thinking something is broken, sorting through a bunch of things that haven't been relevant for 3 versions, etc.

Of course the AI is trained on the same material we can an all find and read, but it does it orders of magnitude more quickly. The trade off is that it's not always right, but neither am I and neither are most sources on the internet right in all circumstances. But it's so fast and easy that I can iterate and evolve designs and understanding much more quickly than I could on my own.
1 Antwort Letzte Antwort

1
R rizzrustbolt@lemmy.world

So now they're raiding We Bare Bears for ideas?
S This user is from outside of this forum
S This user is from outside of this forum
somerandomperson@lemmy.dbzer0.com

schrieb zuletzt editiert von

#42

Explain.
K 1 Antwort Letzte Antwort

1

Anmelden zum Antworten

E

Microsoft finally bids farewell to PowerShell 2.0
Beobachtet Ignoriert Geplant Angeheftet Gesperrt Verschoben Technology technology
6

1

69 Stimmen

6 Beiträge

52 Aufrufe

B

Batch scripts run on my locked-down work laptop. Powershell requires administrator privileges that I don't have. I don't make the rules, I just evade them
R

Zero-day: Bluetooth gap turns millions of headphones into listening stations
Beobachtet Ignoriert Geplant Angeheftet Gesperrt Verschoben Technology technology
123

1

528 Stimmen

123 Beiträge

663 Aufrufe

B

I'm not saying to waste space... but when manufacturers start a pissing match among themselves and say that it's because it's what the customers want, we end up with shit. Why does anyone need a screen that curves around the edge of the phone? What purpose does this serve? Who actually asked for this? I would give up some of my screen area to have forward facing speakers. I want a thicker phone that has better battery life. I also want to be able to swap out my battery. Oh, and I don't want the entire thing encased in glass. If we're so concerned about phone size then they should stop designing them so that a case is required.
H

Could Windows and installed apps upload all my personal files?
Beobachtet Ignoriert Geplant Angeheftet Gesperrt Verschoben Technology technology
2

1 Stimmen

2 Beiträge

21 Aufrufe

R

Yes, every application has access to everything. The only exception are those weird apps that use the universal framework or whatever that thing is called, those need to ask for permissions. But most of the apps on your PC have full access to everything. And Windows does collect and upload a lot of personal information and they could easily upload everything on your system. The same of course applies for the apps as well, they have access to everything except privileged folders (those usually don't contain your personal data, but system files).
A

You Don't Need a Big Budget for Big Security: Secure Your App with a Free, Powerful WAF
Beobachtet Ignoriert Geplant Angeheftet Gesperrt Verschoben Technology technology
2

2

1 Stimmen

2 Beiträge

21 Aufrufe

A

If you're a developer, a startup founder, or part of a small team, you've poured countless hours into building your web application. You've perfected the UI, optimized the database, and shipped features your users love. But in the rush to build and deploy, a critical question often gets deferred: is your application secure? For many, the answer is a nervous "I hope so." The reality is that without a proper defense, your application is exposed to a barrage of automated attacks hitting the web every second. Threats like SQL Injection, Cross-Site Scripting (XSS), and Remote Code Execution are not just reserved for large enterprises; they are constant dangers for any application with a public IP address. The Security Barrier: When Cost and Complexity Get in the Way The standard recommendation is to place a Web Application Firewall (WAF) in front of your application. A WAF acts as a protective shield, inspecting incoming traffic and filtering out malicious requests before they can do any damage. It’s a foundational piece of modern web security. So, why doesn't everyone have one? Historically, robust WAFs have been complex and expensive. They required significant budgets, specialized knowledge to configure, and ongoing maintenance, putting them out of reach for students, solo developers, non-profits, and early-stage startups. This has created a dangerous security divide, leaving the most innovative and resource-constrained projects the most vulnerable. But that is changing. Democratizing Security: The Power of a Community WAF Security should be a right, not a privilege. Recognizing this, the landscape is shifting towards more accessible, community-driven tools. The goal is to provide powerful, enterprise-grade protection to everyone, for free. This is the principle behind the HaltDos Community WAF. It's a no-cost, perpetually free Web Application Firewall designed specifically for the community that has been underserved for too long. It’s not a stripped-down trial version; it’s a powerful security tool designed to give you immediate and effective protection against the OWASP Top 10 and other critical web threats. What Can You Actually Do with It? With a community WAF, you can deploy a security layer in minutes that: Blocks Malicious Payloads: Get instant, out-of-the-box protection against common attack patterns like SQLi, XSS, RCE, and more. Stops Bad Bots: Prevent malicious bots from scraping your content, attempting credential stuffing, or spamming your forms. Gives You Visibility: A real-time dashboard shows you exactly who is trying to attack your application and what methods they are using, providing invaluable security intelligence. Allows Customization: You can add your own custom security rules to tailor the protection specifically to your application's logic and technology stack. The best part? It can be deployed virtually anywhere—on-premises, in a private cloud, or with any major cloud provider like AWS, Azure, or Google Cloud. Get Started in Minutes You don't need to be a security guru to use it. The setup is straightforward, and the value is immediate. Protecting the project, you've worked so hard on is no longer a question of budget. Download: Get the free Community WAF from the HaltDos site. Deploy: Follow the simple instructions to set it up with your web server (it’s compatible with Nginx, Apache, and others). Secure: Watch the dashboard as it begins to inspect your traffic and block threats in real-time. Security is a journey, but it must start somewhere. For developers, startups, and anyone running a web application on a tight budget, a community WAF is the perfect first step. It's powerful, it's easy, and it's completely free.
P

A Health Privacy ‘Check-Up’: How Unfair Modern Business Practices Can Leave You Under-Informed and Your Most Sensitive Data Ripe for Collection and Sale
Beobachtet Ignoriert Geplant Angeheftet Gesperrt Verschoben Technology technology
1

11 Stimmen

1 Beiträge

11 Aufrufe

Niemand hat geantwortet
A

My AI Skeptic Friends Are All Nuts
Beobachtet Ignoriert Geplant Angeheftet Gesperrt Verschoben Technology technology
31

1

13 Stimmen

31 Beiträge

162 Aufrufe

J

I did read it, and my comment is exactly referencing the attitude of the author which is "It's good enough, so you should use it". I disagree, and say it's another dumbass shortcut to cash grab on a less than stellar ecosystem and product. It's training wheels for failure.
N

First Look at Google’s Unfinished DeX-Like Desktop Mode for Android
Beobachtet Ignoriert Geplant Angeheftet Gesperrt Verschoben Technology technology
4

1

27 Stimmen

4 Beiträge

25 Aufrufe

C

I really wish their whole lap-dock concept had succeeded. Or at least ran a few more generations, so I could get an upgraded model with USBc
F

*deleted by creator*
Beobachtet Ignoriert Geplant Angeheftet Gesperrt Verschoben Technology technology
1

1

0 Stimmen

1 Beiträge

14 Aufrufe

Niemand hat geantwortet