Skip to content

AI agents wrong ~70% of time: Carnegie Mellon study

Technology
272 107 79
  • ChatGPT Lost a Chess Game to an Atari 2600

    Technology technology
    1
    1
    0 Stimmen
    1 Beiträge
    9 Aufrufe
    Niemand hat geantwortet
  • Founder of 23andMe buys back company out of bankruptcy auction

    Technology technology
    60
    1
    348 Stimmen
    60 Beiträge
    201 Aufrufe
    A
    Come on up to Canada, we still got that garlic bomb. I can still taste the one from last week
  • 39 Stimmen
    15 Beiträge
    27 Aufrufe
    C
    I believed they were doing such things against budding competitors long before the LLM era. My test is simple. Replace it with China. Would the replies be the opposite of what you've recieved so far? The answer is yes. Absolutely people would be frothing at the mouth about China being bad actors. Western tech bros are just as paranoid, they copy off others, they steal ideas. When we do it it's called "innovation".
  • The Internet of Consent

    Technology technology
    1
    1
    11 Stimmen
    1 Beiträge
    9 Aufrufe
    Niemand hat geantwortet
  • 17 Stimmen
    1 Beiträge
    11 Aufrufe
    Niemand hat geantwortet
  • Duolingo CEO tries to walk back AI-first comments, fails

    Technology technology
    134
    758 Stimmen
    134 Beiträge
    439 Aufrufe
    kingthrillgore@lemmy.mlK
    I think on iOS they added a thing where it would change based on the days you didn't use Duolingo. Honestly at this point I think it speaks more about the sorry state of their company more than anything.
  • 2 Stimmen
    8 Beiträge
    38 Aufrufe
    F
    IMO stuff like that is why a good trainer is important. IMO it's stronger evidence that proper user-centered design should be done and a usable and intuitive UX and set of APIs developed. But because the buyer of this heap of shit is some C-level, there is no incentive to actually make it usable for the unfortunate peons who are forced to interact with it. See also SFDC and every ERP solution in existence.
  • 0 Stimmen
    6 Beiträge
    27 Aufrufe
    P
    Outlook.... Ok Pretty solid Bahaha hahahahaha Sorry. Outlook is a lot of things. "Gooey crap" would be one way to describe it, but "solid"? Yeah, no. Gmail is (well, was) pretty solid. There are a lot of other webmail providers out there, including self hosted options and most are pretty solid, yeah. Outlook, though? It's a shit show, it's annoying. Do you love me? Please love me, please give feedback, please give feedback again, please look at this, hey am I the best? Am I.. STFU YOU PIECE OF CRAP! Can you PLEASE just let me do my email without being an attention whore every hour? Even down to the basics. Back button? "What is that? Never heard of it, can't go back to the message I just was on because I'm Microsoft software and so half baked." Having two tabs open? "Oh noes, now I get scawed, now I don't know how to manage sessions anymore, better just sign you out everywhere." What is it with Microsoft and not being able to do something basic as sessions normal? I'm not even asking for good, definitely not "awesome", just normal, and that is already too much to ask. Try running it in Firefox! I'm sure it's totally not on purpose, just "oopsie woopsie poopsie" accidentally bwoken. Maybe it's working again today, who knows, tomorrow it'll be broken again. I run everything on Firefox except the Microsoft sites, they have to be in chrome because fuck you, that's why. Seriously, I can't take any Microsoft software seriously at this point, and all of it is on its way out in our company, I'm making sure of that