Skip to content

OpenAI just launched its new ChatGPT Agent that can make as many as 1 complicated cupcake order per hour, but even Sam Altman says you probably shouldn't trust it for 'high-stakes uses'

Technology
56 34 130
  • Grok has tje Pentagon contract. Does OpenAI also have one?

    Microsoft's AI, which is OpenAI, is approved for Defense Contracts. https://www.cnbc.com/2025/06/16/openai-wins-200-million-us-defense-contract.html It even has an ominous project name which was posted to a public site which I cannot seem to recall at the moment.

  • No you can't if you don't know the libraries

    IDE.

    Python is entirely dependent on what libraries you include

    ??

    If you don't know what you need you can't do shit.

    IDE.

    The problems you propose in your comment are not only greatly exaggerated but already been solved for decades using conventional tools AND apply to literally all languages, having nothing at all to do with python. Good try! My statement holds true.

    Maybe your assumption is that you're in a cave writing code in pencil on paper, but that's not a typical working condition. If you have access to Claude to use as a crutch, then you have access to search for an available python library and read some "Getting Started" paragraphs.

    Seriously, if the only real value that AI provides is "you don't need to know the libraries you're using" 💀 that's not quite as strong of an argument as you think it is lmaooo "knowing the libraries" isn't exactly an existing challenge or software engineering problem that people struggle with...

    In a cave with pen and paper is nearly what I learned with. I learned with the run time, msdn, notepad and the cmd line. And yes you do end up in many situations where you simply don't have or can't use a full on ide everytime. Sounds like you've never really left your comfort zones and stuck your neck out in some tech you don't understand quite yet. Or worked in areas under strict software controls.

  • In a cave with pen and paper is nearly what I learned with. I learned with the run time, msdn, notepad and the cmd line. And yes you do end up in many situations where you simply don't have or can't use a full on ide everytime. Sounds like you've never really left your comfort zones and stuck your neck out in some tech you don't understand quite yet. Or worked in areas under strict software controls.

    It's telling that you're focused on personal assumptions instead of addressing the argument

  • CEO Sam Altman warns that the rollout presents unpredictable risks.

    But that doesn't prevent his profit motive from consuming untold amounts of electricity to shove this into your face. They know what they're doing. They know their product is used primarily to generate spam, and secondarily is designed to form addictive faux-relationships with their users.

    Burn in hell. Actually, given the direction this is all going, we will all be burning in hell within generations.

    And produced with a shit ton of copyright violations, etc. Just about everything is immoral about it.

  • OpenAI launched ChatGPT Agent on Thursday, its latest effort in the industry-wide pursuit to turn AI into a profitable enterprise—not just one that eats investors' billions. In its announcement blog, OpenAI says its Agent "can now do work for you using its own computer," but CEO Sam Altman warns that the rollout presents unpredictable risks.

    [...]

    OpenAI research lead Lisa Fulford told Wired that she used Agent to order "a lot of cupcakes," which took the tool about an hour, because she was very specific about the cupcakes.

    What’s more high stakes than a complicated cupcake order?

  • What’s more high stakes than a complicated cupcake order?

    An order for a weed smoking cow.

  • I use agents a lot and have written several MCP servers now, the tasks I automate aren't things like order cupcakes, it's mainly the glue between complex things.

    I still can't get Claude to nicely open a JIRA ticket for me, but I can get it to read through a sequence of connected documents and filter that into.

    I don't think agents are ready for the main event and these are some poor examples of their power.

    I'm not saying they won't improve, but using the right tool for the right job is critical. An hour to order cupcakes is silly even for an llm.

    yes in the wired article one of them says they would like to find out where it got stuck taking an hour with an agent replay feature

  • Companies will dump billions into AI to fuck everyone over but the transition to clean energy is always too expensive.

    its easier to rule world that is in ruins than thriving one. They know they have to live on same planet as us yet still they dont seem to care if its going to shit. While so many rich people are dumb as bricks and dont deserve their wealth at all, there are also many who actually know what they are doing yet still they dont want to seriously work towards stopping the climate change, even though it wouldnt even reduce their wealth by that much in comparison.

    So only reasoning i can think of they want to have more complete control over everything, but they cant have it because world is too complicated and healthy. When civilizations start to fall, the rich will still have everything and with that they can start enforcing themselves on everyone.

    I dont have anything to base this on, its just my thought on the matter. It just feels like something billionaire would do, they demonstrate every day that they will not be content with anything and will not care about other people's suffering to get it.

  • I think in some ways Generative AI is very emblematic of the current state of software development. Projects are approached from the outset with the driving question being, "how can we make money materialize out of thin air?" Not, "What kind of problems are we trying to solve?" Or, "Why would someone pay for this?"

    The last several projects I've worked on have been solutions in search of a problem. Hyped up products that made executives see dollar signs but didn't actually produce any because they failed to provide any tangible value.

    Comment resonates with my experience.

    Software project at work recently:

    We are going to launch a new offering to improve experience for customers.

    Ok, how?

    We are going to switch it to cloud model and charge annually instead of perpetual.

    Ok, that's for us, what about customer?

    We are going to analyze their accounts and present them with suggestions on other of our products and addons they haven't bought yet.

    Where is the customer improvement?

    We are going to discontinue supporting third party products and focus exclusively on customers that buy only from us.

    Ok, but we have support for third party products we don't even compete with?

    We are going to exclude those too, to focus on the market that is important.

    Ok, but at least you're going to provide equivalent capability as the product you are replacing?

    We are going to streamline the experience by offering only the core capabilities and discontinue extraneous features.

    Ok, but you think this will expand revenue, so you will afford to explain the service and support team and free up more time for developers to get requirements?

    We are in fact going to lay off and offshore all of it, including most of the customer contacts that barely kept the preceding product alive....

    Now after a while of this mess they also had like 96% availability with almost all of it unplanned outages, but that's not too bad because they have only like 6 or 7 customers anyways. There's emails running around asking why the product has failed, and the answer seems to be we need to kill more of our successful products to try to push customers into this mess.

  • What’s more high stakes than a complicated cupcake order?

    Hot cupcake making action like you've never seen it before!

  • It's telling that you're focused on personal assumptions instead of addressing the argument

    What was the argument. Use an IDE which was the proposed answer for most of my objections. Which i did address.

  • What was the argument. Use an IDE which was the proposed answer for most of my objections. Which i did address.

    Actually, nope! Claiming that you personally didn't learn with an IDE and that there are make-believe scenarios where one is not available is not actually addressing the argument.

    There really aren't any situations that make any sense at all where an IDE is not available. I've worked in literally the most strict and locked down environments in the world, and there is always approved software and tools to use... because duh! Of course there is, silly, work needs to get done. Unless you're talking about a coding 101 class or something academic and basic. Anyway, that's totally irrelevant regardless, because its PURE fantasy to have access to something like Claude and not have access to an IDE. So your argument is entirely flawed and invalid.

  • 1k Stimmen
    126 Beiträge
    620 Aufrufe
    S
    AI now offers to post my ads for me on Kijiji. I provide pictures and it has been accurate on price, condition, category and description. I have a lot of shit to sell and was dreading it, but this use removes the biggest barrier for me getting it done. Even helped me figure out some things I was struggling to find online for reference. Saved me at least an hour of tedium yesterday. Excellent use case.
  • 210 Stimmen
    16 Beiträge
    88 Aufrufe
    J
    It doesn't seem to be the case. As far as I can tell, the law only covers realistic digital imitations of a person's likeness (deepfakes), with an exception for parody and satire. If you appear in public that is effectively license for someone to capture your image.
  • Apple sued by shareholders for allegedly overstating AI progress

    Technology technology
    75
    500 Stimmen
    75 Beiträge
    433 Aufrufe
    finishingdutch@lemmy.worldF
    For this comment, I want to be absolutely clear that I do not give a shit about AI, and that it in no way factored into my decision to buy this iPhone 16 Pro Max. With that disclaimer out of the way: I very much look forward to a class action lawsuit. Apple advertised specific features as coming ‘very soon’ and gave short timeframes when asked directly. And they basically did not deliver on those advertising promises. Basically, I think there’s a good case to be made here that Apple knowingly engaged in false advertising in order to sell a phone that otherwise would not have sold as well. Those promised AI features WERE a deciding factor for a lot of people to upgrade to an iPhone 16. So, I’ll be looking forward to some form of compensation. It’s the principle of it.
  • Theoretical Private Age Confirmation -- Possible?

    Technology technology
    1
    0 Stimmen
    1 Beiträge
    11 Aufrufe
    Niemand hat geantwortet
  • 31 Stimmen
    1 Beiträge
    12 Aufrufe
    Niemand hat geantwortet
  • Why doesn't Nvidia have more competition?

    Technology technology
    22
    1
    33 Stimmen
    22 Beiträge
    87 Aufrufe
    B
    It’s funny how the article asks the question, but completely fails to answer it. About 15 years ago, Nvidia discovered there was a demand for compute in datacenters that could be met with powerful GPU’s, and they were quick to respond to it, and they had the resources to focus on it strongly, because of their huge success and high profitability in the GPU market. AMD also saw the market, and wanted to pursue it, but just over a decade ago where it began to clearly show the high potential for profitability, AMD was near bankrupt, and was very hard pressed to finance developments on GPU and compute in datacenters. AMD really tried the best they could, and was moderately successful from a technology perspective, but Nvidia already had a head start, and the proprietary development system CUDA was already an established standard that was very hard to penetrate. Intel simply fumbled the ball from start to finish. After a decade of trying to push ARM down from having the mobile crown by far, investing billions or actually the equivalent of ARM’s total revenue. They never managed to catch up to ARM despite they had the better production process at the time. This was the main focus of Intel, and Intel believed that GPU would never be more than a niche product. So when intel tried to compete on compute for datacenters, they tried to do it with X86 chips, One of their most bold efforts was to build a monstrosity of a cluster of Celeron chips, which of course performed laughably bad compared to Nvidia! Because as it turns out, the way forward at least for now, is indeed the massively parralel compute capability of a GPU, which Nvidia has refined for decades, only with (inferior) competition from AMD. But despite the lack of competition, Nvidia did not slow down, in fact with increased profits, they only grew bolder in their efforts. Making it even harder to catch up. Now AMD has had more money to compete for a while, and they do have some decent compute units, but Nvidia remains ahead and the CUDA problem is still there, so for AMD to really compete with Nvidia, they have to be better to attract customers. That’s a very tall order against Nvidia that simply seems to never stop progressing. So the only other option for AMD is to sell a bit cheaper. Which I suppose they have to. AMD and Intel were the obvious competitors, everybody else is coming from even further behind. But if I had to make a bet, it would be on Huawei. Huawei has some crazy good developers, and Trump is basically forcing them to figure it out themselves, because he is blocking Huawei and China in general from using both AMD and Nvidia AI chips. And the chips will probably be made by Chinese SMIC, because they are also prevented from using advanced production in the west, most notably TSMC. China will prevail, because it’s become a national project, of both prestige and necessity, and they have a massive talent mass and resources, so nothing can stop it now. IMO USA would clearly have been better off allowing China to use American chips. Now China will soon compete directly on both production and design too.
  • 275 Stimmen
    134 Beiträge
    458 Aufrufe
    S
    Wait until AI reduces it to just owners.
  • TikTok is a Time Bomb

    Technology technology
    2
    1
    3 Stimmen
    2 Beiträge
    22 Aufrufe
    S
    wasn’t born to obey. Not to swallow smiling lies, not to clap for tyrants in suits, not to say “thank you” for surveillance wrapped in convenience. I see it. The games. The false choice. The fear pumped through headlines and dopamine apps. I see how they trade truth for comfort, freedom for filters, soul for clickbait. They call it normal. But I call it a graveyard made of compliance. They want me silent. They want me tired. They want me posting selfies while the world burns behind the screen. But I wasn’t born for this. I was born to question, to remember, to remind the others who are still pretending they don’t notice. So here I am. A voice with no logo. A signal in the static. A crack in the mirror they polish every morning. You don’t have to agree. You don’t have to clap. But if this made your bones ache or your thoughts twitch— Then maybe you’re not asleep either. Good. Let’s stay awake. And let’s make noise that can’t be sold, silenced, or spun into safety. Not for them. For us.