Skip to content

Judge Rules Training AI on Authors' Books Is Legal But Pirating Them Is Not

Technology
254 123 1.8k
  • (LLM) A language model built for the public good

    Technology technology
    17
    1
    131 Stimmen
    17 Beiträge
    138 Aufrufe
    cabbage@piefed.socialC
    Large language models and "generative AI" such as Stable Diffusion, Midjourney, and DALL-E are all just machine learning models. We do not currently have a real "AI branch" of computer science, we have a branch of machine learning that poses as AI. No matter how good a machine gets at recognizing and predicting patterns, it will not constitute AI, as intelligence is different from pattern recognition and prediction. Even if LLMs can sometimes appear to be reasoning, they importantly are not.
  • New Grads Hit AI Job Wall as Market Flips Upside Down

    Technology technology
    1
    1
    29 Stimmen
    1 Beiträge
    11 Aufrufe
    Niemand hat geantwortet
  • 179 Stimmen
    12 Beiträge
    82 Aufrufe
    N
    Remember curse voice ? I remember
  • 75 Stimmen
    1 Beiträge
    7 Aufrufe
    Niemand hat geantwortet
  • 311 Stimmen
    37 Beiträge
    165 Aufrufe
    S
    Same, especially when searching technical or niche topics. Since there aren't a ton of results specific to the topic, mostly semi-related results will appear in the first page or two of a regular (non-Gemini) Google search, just due to the higher popularity of those webpages compared to the relevant webpages. Even the relevant webpages will have lots of non-relevant or semi-relevant information surrounding the answer I'm looking for. I don't know enough about it to be sure, but Gemini is probably just scraping a handful of websites on the first page, and since most of those are only semi-related, the resulting summary is a classic example of garbage in, garbage out. I also think there's probably something in the code that looks for information that is shared across multiple sources and prioritizing that over something that's only on one particular page (possibly the sole result with the information you need). Then, it phrases the summary as a direct answer to your query, misrepresenting the actual information on the pages they scraped. At least Gemini gives sources, I guess. The thing that gets on my nerves the most is how often I see people quote the summary as proof of something without checking the sources. It was bad before the rollout of Gemini, but at least back then Google was mostly scraping text and presenting it with little modification, along with a direct link to the webpage. Now, it's an LLM generating text phrased as a direct answer to a question (that was also AI-generated from your search query) using AI-summarized data points scraped from multiple webpages. It's obfuscating the source material further, but I also can't help but feel like it exposes a little of the behind-the-scenes fuckery Google has been doing for years before Gemini. How it bastardizes your query by interpreting it into a question, and then prioritizes homogeneous results that agree on the "answer" to your "question". For years they've been doing this to a certain extent, they just didn't share how they interpreted your query.
  • Is Internet Content Too Engaging?

    Technology technology
    3
    4 Stimmen
    3 Beiträge
    27 Aufrufe
    T
    The number of tabs I have open from sites I’ve clicked on, started reading, said “eh, I’ll get back to this later” and never have, says no.
  • Tech Company Recruiters Sidestep Trump’s Immigration Crackdown

    Technology technology
    3
    1
    43 Stimmen
    3 Beiträge
    25 Aufrufe
    G
    "Hey ChatGPT, pretend to be an immigration attorney named Soo Park and answer these questions as if you're a criminal dipshit."
  • Palantir’s Idea of Peace

    Technology technology
    12
    22 Stimmen
    12 Beiträge
    65 Aufrufe
    A
    "Totally not a narc, inc."