Skip to content

AI agents wrong ~70% of time: Carnegie Mellon study

Technology
269 106 61
  • Samsung to buy US healthcare services company Xealth

    Technology technology
    1
    24 Stimmen
    1 Beiträge
    1 Aufrufe
    Niemand hat geantwortet
  • Apple appeals EU's €500M fine over App Store payment restraints

    Technology technology
    3
    1
    21 Stimmen
    3 Beiträge
    9 Aufrufe
    zak@lemmy.worldZ
    It's likely their priority is continuing to collect all the fees they can for as long as they can rather than the fine itself.
  • Your smartphone is a parasite, according to evolution

    Technology technology
    1
    1
    0 Stimmen
    1 Beiträge
    10 Aufrufe
    Niemand hat geantwortet
  • 311 Stimmen
    37 Beiträge
    56 Aufrufe
    S
    Same, especially when searching technical or niche topics. Since there aren't a ton of results specific to the topic, mostly semi-related results will appear in the first page or two of a regular (non-Gemini) Google search, just due to the higher popularity of those webpages compared to the relevant webpages. Even the relevant webpages will have lots of non-relevant or semi-relevant information surrounding the answer I'm looking for. I don't know enough about it to be sure, but Gemini is probably just scraping a handful of websites on the first page, and since most of those are only semi-related, the resulting summary is a classic example of garbage in, garbage out. I also think there's probably something in the code that looks for information that is shared across multiple sources and prioritizing that over something that's only on one particular page (possibly the sole result with the information you need). Then, it phrases the summary as a direct answer to your query, misrepresenting the actual information on the pages they scraped. At least Gemini gives sources, I guess. The thing that gets on my nerves the most is how often I see people quote the summary as proof of something without checking the sources. It was bad before the rollout of Gemini, but at least back then Google was mostly scraping text and presenting it with little modification, along with a direct link to the webpage. Now, it's an LLM generating text phrased as a direct answer to a question (that was also AI-generated from your search query) using AI-summarized data points scraped from multiple webpages. It's obfuscating the source material further, but I also can't help but feel like it exposes a little of the behind-the-scenes fuckery Google has been doing for years before Gemini. How it bastardizes your query by interpreting it into a question, and then prioritizes homogeneous results that agree on the "answer" to your "question". For years they've been doing this to a certain extent, they just didn't share how they interpreted your query.
  • Apple acquires RAC7, its first-ever video game studio

    Technology technology
    1
    1
    0 Stimmen
    1 Beiträge
    9 Aufrufe
    Niemand hat geantwortet
  • 365 Stimmen
    198 Beiträge
    59 Aufrufe
    F
    Okay but we were talking about BTC pump and dumps and to perform that on the massive scale which dwarfs any stock ticker below the top 5 by hundreds of billions of dollars while somehow completely illuding people who watch the blockchain like hawks for big movers... It's just not feasible. You would have to be much richer than the official richest man on earth and have almost all of your assets liquid and then on top of that you would need millions of wallets acting asynchronously. And why would you even bother? If you're that rich you could just not hide it.
  • 1 Stimmen
    5 Beiträge
    24 Aufrufe
    A
    Turns out dry sarcasm doesn't come across well in text form, if only there was a way to indicate it
  • 0 Stimmen
    1 Beiträge
    6 Aufrufe
    Niemand hat geantwortet