Skip to content

AI agents wrong ~70% of time: Carnegie Mellon study

Technology
277 108 90
  • Get Your Filthy ChatGPT Away From My Liberal Arts

    Technology technology
    12
    1
    144 Stimmen
    12 Beiträge
    39 Aufrufe
    N
    Indeed—semicolons are usually associated wirh LLMs! But that’s not all! Always remember: use your tools! An LLM „uses“ all types of quotation marks.
  • 586 Stimmen
    100 Beiträge
    324 Aufrufe
    B
    No, LCOE is an aggregated sum of all the cash flows, with the proper discount rates applied based on when that cash flow happens, complete with the cost of borrowing (that is, interest) and the changes in prices (that is, inflation). The rates charged to the ratepayers (approved by state PUCs) are going to go up over time, with inflation, but the effect of that on the overall economics will also be blunted by the time value of money and the interest paid on the up-front costs in the meantime. When you have to pay up front for the construction of a power plant, you have to pay interest on those borrowed funds for the entire life cycle, so that steadily increasing prices over time is part of the overall cost modeling.
  • 28 Stimmen
    16 Beiträge
    6 Aufrufe
    D
    The writing in this story is not accurate. Iran isn't turning it off for the country. They are talking about switching government services to use receivers that use Beidou as primary source of timing and maybe selectively turn off using GPS on those devices.
  • 37 Stimmen
    2 Beiträge
    17 Aufrufe
    P
    Idk if it’s content blocking on my end but I can’t tell you how upset I am that the article had no pictures of the contraption or a video of it in action.
  • 311 Stimmen
    37 Beiträge
    66 Aufrufe
    S
    Same, especially when searching technical or niche topics. Since there aren't a ton of results specific to the topic, mostly semi-related results will appear in the first page or two of a regular (non-Gemini) Google search, just due to the higher popularity of those webpages compared to the relevant webpages. Even the relevant webpages will have lots of non-relevant or semi-relevant information surrounding the answer I'm looking for. I don't know enough about it to be sure, but Gemini is probably just scraping a handful of websites on the first page, and since most of those are only semi-related, the resulting summary is a classic example of garbage in, garbage out. I also think there's probably something in the code that looks for information that is shared across multiple sources and prioritizing that over something that's only on one particular page (possibly the sole result with the information you need). Then, it phrases the summary as a direct answer to your query, misrepresenting the actual information on the pages they scraped. At least Gemini gives sources, I guess. The thing that gets on my nerves the most is how often I see people quote the summary as proof of something without checking the sources. It was bad before the rollout of Gemini, but at least back then Google was mostly scraping text and presenting it with little modification, along with a direct link to the webpage. Now, it's an LLM generating text phrased as a direct answer to a question (that was also AI-generated from your search query) using AI-summarized data points scraped from multiple webpages. It's obfuscating the source material further, but I also can't help but feel like it exposes a little of the behind-the-scenes fuckery Google has been doing for years before Gemini. How it bastardizes your query by interpreting it into a question, and then prioritizes homogeneous results that agree on the "answer" to your "question". For years they've been doing this to a certain extent, they just didn't share how they interpreted your query.
  • 95 Stimmen
    1 Beiträge
    12 Aufrufe
    Niemand hat geantwortet
  • Looking elsewhere

    Technology technology
    3
    1
    7 Stimmen
    3 Beiträge
    19 Aufrufe
    J
    That's a valid point! I've been searching for places to hangout for a while, sometimes called "campfires". Found a cool Discord with generous front-end folks (that's a broad spectrum!), on frontend.horse.
  • 37 Stimmen
    1 Beiträge
    6 Aufrufe
    Niemand hat geantwortet