Skip to content

Here’s how to spot AI writing, according to Wikipedia

Technology
16 15 0
  • This post did not contain any content.

    I'd gotten really good at discerning a chatGPT bot from a human account just from years of catching bots on Reddit.

    There's a lot of red flags and tells that would be very hard to completely eradicate. There will always be an uncanny valley.

  • Mental note: add a step to my Wikipedia article generator agentic flow that has the LLM check this page and remove these giveaways.

    Pretty much just have the LLM reduce the size of its answer, proofread it that it makes sense and didn't screw punctuation and boom no one will no.

  • This post did not contain any content.

    Really helpful page thank you for sharing.

  • I'd gotten really good at discerning a chatGPT bot from a human account just from years of catching bots on Reddit.

    There's a lot of red flags and tells that would be very hard to completely eradicate. There will always be an uncanny valley.

    You think you have - but there’s really no way of knowing.

    Just because someone writes like a bot doesn’t mean they actually are one. Feeling like "you’ve caught one" doesn’t mean you did - it just means you think you did. You might have been wrong, but you never got confirmation to know for sure, so you have no real basis for judging how good your detection rate actually is. It’s effectively begging the question - using your original assumption as "proof" without actual verification.

    And then there’s the classic toupee fallacy: "All toupees look fake - I’ve never seen one that didn’t." That just means you’re good at spotting bad toupees. You can’t generalize from that and claim you’re good at detecting toupees in general, because all the good ones slip right past you unnoticed.

  • You think you have - but there’s really no way of knowing.

    Just because someone writes like a bot doesn’t mean they actually are one. Feeling like "you’ve caught one" doesn’t mean you did - it just means you think you did. You might have been wrong, but you never got confirmation to know for sure, so you have no real basis for judging how good your detection rate actually is. It’s effectively begging the question - using your original assumption as "proof" without actual verification.

    And then there’s the classic toupee fallacy: "All toupees look fake - I’ve never seen one that didn’t." That just means you’re good at spotting bad toupees. You can’t generalize from that and claim you’re good at detecting toupees in general, because all the good ones slip right past you unnoticed.

    Studies have shown we're pretty bad at detecting good AI stuff, regardless of how skilled we think we are. It's the crappy AI slop that makes everybody think they're Sherlock Holmes.

  • This post did not contain any content.

    I was beginning to think I was smarter than the internet.
    In some facets we all are much smarter than AI.
    However, we are not all clever enough how to explain and express ourselves to the fullest.
    Slight variations in nuance are crucial to the humour of AI.
    Otherwise a giant entity resembling human consciousness is taking form.
    The last F’ng thing I’d like to see is a “Lawnmower Man” type scenario that takes every word ever said or googled by anyone for its literal translation be it completely metaphorical and without the understanding of underlying context.
    Sometimes those thoughts creep into my dreams.

  • You think you have - but there’s really no way of knowing.

    Just because someone writes like a bot doesn’t mean they actually are one. Feeling like "you’ve caught one" doesn’t mean you did - it just means you think you did. You might have been wrong, but you never got confirmation to know for sure, so you have no real basis for judging how good your detection rate actually is. It’s effectively begging the question - using your original assumption as "proof" without actual verification.

    And then there’s the classic toupee fallacy: "All toupees look fake - I’ve never seen one that didn’t." That just means you’re good at spotting bad toupees. You can’t generalize from that and claim you’re good at detecting toupees in general, because all the good ones slip right past you unnoticed.

    Just to add to this. I've noticed some of my co workers who have developing English skills sound like LLMs sometimes.
    I imagine this is probably because they've only wrote English in a school or work setting and never for personal communication.

    ...or maybe they're all just using LLMs idk

  • Something that popped up when I was looking up information for Eyes of Wakanda:

    Which seems copy pasted from another AI generated site:

    FWIW - NONE of this is true. Okoye does not appear in any of the four episodes, voiced by Danai Gurira or anyone else. Given the times the episodes are set in, 1260 BC, 1200 BC, 1400 AD and 1896 AD, that would have been impossible.

    Episode 4 does have a future Black Panther, but she's from 500 years in the future (1896 + 500 = 2396?) and voiced by Anika Noni Rose.

    Yup, there seems to be misinformation there. Even perplexity gets it wrong, but does say there's some inconsistency between websites.

  • Pretty much just have the LLM reduce the size of its answer, proofread it that it makes sense and didn't screw punctuation and boom no one will no.

    and boom no one will no.

    You're not an LLM, for sure.

  • You think you have - but there’s really no way of knowing.

    Just because someone writes like a bot doesn’t mean they actually are one. Feeling like "you’ve caught one" doesn’t mean you did - it just means you think you did. You might have been wrong, but you never got confirmation to know for sure, so you have no real basis for judging how good your detection rate actually is. It’s effectively begging the question - using your original assumption as "proof" without actual verification.

    And then there’s the classic toupee fallacy: "All toupees look fake - I’ve never seen one that didn’t." That just means you’re good at spotting bad toupees. You can’t generalize from that and claim you’re good at detecting toupees in general, because all the good ones slip right past you unnoticed.

    I mean I wasn't going around accusing everyone of being a bot or thinking that I was right all the time. I did have a few false positives and owned up to it. But once you see the pattern of behavior (big gap on joined date vs first active date, only posting in karma farming subs or subs known to have high bot populations) and accounts literally keeping the "as a large language model..." Or "Okay, here's a supportive Reddit-style comment with some minor spelling mistakes..." in some of their comments, ads at the end of their comments, and posting all hours of the day without any gaps for sleep or work, or posting fragments comments identical to ones posted months or years earlier by someone else, you start to realize maybe these accounts might not be genuine.

    I have screenshots to prove it but if you really believe I don't know what I'm talking about then there's really nothing I can say to dissuade that.

  • This post did not contain any content.

    That's why I like make basic grammatical mistakes, speling erors, and include a few fucks in my internet writing. Nobody's not gona mistake me for no got dagned robot.

  • I mean I wasn't going around accusing everyone of being a bot or thinking that I was right all the time. I did have a few false positives and owned up to it. But once you see the pattern of behavior (big gap on joined date vs first active date, only posting in karma farming subs or subs known to have high bot populations) and accounts literally keeping the "as a large language model..." Or "Okay, here's a supportive Reddit-style comment with some minor spelling mistakes..." in some of their comments, ads at the end of their comments, and posting all hours of the day without any gaps for sleep or work, or posting fragments comments identical to ones posted months or years earlier by someone else, you start to realize maybe these accounts might not be genuine.

    I have screenshots to prove it but if you really believe I don't know what I'm talking about then there's really nothing I can say to dissuade that.

    I don't think anyone is questioning your ability to follow a hunch and get evidence to prove it.

    The point is about how often you're correct on your first guess.

  • 44 Stimmen
    10 Beiträge
    61 Aufrufe
    muusemuuse@sh.itjust.worksM
    Hospitals would likely be fine with it. The health insurance industry would not though and would pressure the hospital to cut you off. It’s illegal. But they would do it anyway. You would need serious fuck you money to change this. And even then, probably a lot of Luigis too.
  • 31 Stimmen
    6 Beiträge
    80 Aufrufe
    moseschrute@piefed.socialM
    While I agree, everyone constantly restating this is not helpful. We should instead ask ourselves what’s about BlueSky is working and what can we learn? For example, I think the threadiverse could benefit from block lists, which auto update with new filter keywords. I’ve seen Lemmy users talk about how much time they spend crafting their filters to get the feed of content they want. It would be much nicer if you could choose and even combine block lists (e.g. US politics).
  • 14 Stimmen
    2 Beiträge
    31 Aufrufe
    lupusblackfur@lemmy.worldL
    Welp, queue up some more multi-million dollar "donations" to have these cases dropped... Not like the TechBros don't have the funds. ‍️ ‍️
  • How can websites verify unique (IRL) identities?

    Technology technology
    6
    8 Stimmen
    6 Beiträge
    56 Aufrufe
    H
    Safe, yeah. Private, no. If you want to verify whether a user is a real person, you need very personally identifiable information. That’s not ever going to be private. The best you could do, in theory, is have a government service that takes that PII and gives the user a signed cryptographic certificate they can use to verify their identity. Most people would either lose their private key or have it stolen, so even that system would have problems. The closest to reality you could do right now is use Apple’s FaceID, and that’s anything but private. Pretty safe though. It’s super illegal and quite hard to steal someone’s face.
  • 138 Stimmen
    31 Beiträge
    316 Aufrufe
    S
    Nobody fucking cares.
  • Is Matrix cooked?

    Technology technology
    54
    101 Stimmen
    54 Beiträge
    467 Aufrufe
    W
    Didn't know it only applied to UWP apps on Windows. That does seem like a pretty big problem then. it is mostly for compatibility reasons. no win32 programs are equipped to handle such granular permissions and sandboxing, they are all made with the assumption that they have access to whatever they need (other than other users' resources and things that require elevation). if Microsoft would have made that limitation to every kind of software, that Windows version would have probably been a failure in popularity because lots of software would have broken. I think S editions of windows is how they tried to go in that direction, with a more drastic way of simply just dropping support for 3rd party win32 programs. I don't still have a Mac readily available to test with but afaik it is any application that uses Apple's packaging format. ok, so if you run linux or windows utils in a compatibility layer, they still have less of a limited access? by which I mean graphical utilities. just tried with firefox, for macos it wanted to give me an .iso file (???) if so, it seems apple is doing roughly the same as microsoft with uwp and the appx format, and linux with flatpak: it's a choice for the user
  • A World Without iPhones?

    Technology technology
    7
    34 Stimmen
    7 Beiträge
    77 Aufrufe
    S
    I believe the world was a better place before smartphones started dominating everyone's attention. It has had a profound impact on how people are socializing, and not in a positive way if you ask me.
  • 92 Stimmen
    42 Beiträge
    364 Aufrufe
    G
    You don’t understand. The tracking and spying is the entire point of the maneuver. The ‘children are accessing porn’ thing is just a Trojan horse to justify the spying. I understand what are you saying, I simply don't consider to check if a law is applied as a Trojan horse in itself. I would agree if the EU had said to these sites "give us all the the access log, a list of your subscriber, every data you gather and a list of every IP it ever connected to your site", and even this way does not imply that with only the IP you could know who the user is without even asking the telecom company for help. So, is it a Trojan horse ? Maybe, it heavily depend on how the EU want to do it. If they just ask "show me how you try to avoid that a minor access your material", which normally is the fist step, I don't see how it could be a Trojan horse. It could become, I agree on that. As you pointed out, it’s already illegal for them to access it, and parents are legally required to prevent their children from accessing it. No, parents are not legally required to prevent it. The seller (or provider) is legally required. It is a subtle but important difference. But you don’t lock down the entire population, or institute pre-crime surveillance policies, just because some parents are not going to follow the law. True. You simply impose laws that make mandatories for the provider to check if he can sell/serve something to someone. I mean asking that the cashier of mall check if I am an adult when I buy a bottle of wine is no different than asking to Pornhub to check if the viewer is an adult. I agree that in one case is really simple and in the other is really hard (and it is becoming harder by the day). You then charge the guilty parents after the offense. Ok, it would work, but then how do you caught the offendind parents if not checking what everyone do ? Is it not simpler to try to prevent it instead ?