Wikipedia editors adopt a policy giving admins the authority to quickly delete AI-generated articles that meet certain criteria, like incorrect citations
-
It really is crazy how predictable it is.
-
It really is crazy how predictable it is.
Even saying "fair question" set off alarms. At this point, saying anything good about a response at the start is an immediate red flag.
-
This post did not contain any content.
If anyone has specific questions about this, let me know, and I can probably answer them. Hopefully I can be to Lemmy and Wikimedia what Unidan was to Reddit and ecology before he crashed out over jackdaws and got exposed for vote fraud.
-
Ha, fair question! But no irony here—I actually wrote it myself. That said, it's kind of funny how quickly we've reached the point where any well-written, balanced take sounds like it could be AI-generated. Maybe that's part of the problem we're trying to solve!
-
Do you think these people surreptitiously submitting articles written by AI are gonna be capable of validating that what they're submitting is even true? Particularly if the (presumably effective) Wikipedia defense for this is detecting made-up citations?
This kind of thing makes something valuable to everyone, like Wikipedia, ultimately a less valuable resource, and it should be resisted and rejected by anyone with their head screwed on.
Oh, I think this is a good move by Wikipedia. I just hate to imagine the disaster that ouroboros of AI citing AI generated Wikipedia articles would come up with.
-
Did you generate this comment with an LLM for irony?
It always feels weird when people write an essay as if it's their final quarter project for high school. Too neat, thoughts too organized, too much flowery prose.
-
If anyone has specific questions about this, let me know, and I can probably answer them. Hopefully I can be to Lemmy and Wikimedia what Unidan was to Reddit and ecology before he crashed out over jackdaws and got exposed for vote fraud.
Well now I want to know about jackdaws and voter fraud
-
I've stopped using em dashes because AI ruined them--bastards.
-
This post did not contain any content.
It's a step. Why wouldn't they default to not accepting any AI-generated content, and maybe have a manual approval process? It would both protect the content and discourage LLM use in the places where LLMs suck.
-
Well now I want to know about jackdaws and voter fraud
unzips
-
If anyone has specific questions about this, let me know, and I can probably answer them. Hopefully I can be to Lemmy and Wikimedia what Unidan was to Reddit and ecology before he crashed out over jackdaws and got exposed for vote fraud.
How frequently are images generated/modified by diffusion models uploaded to Wikimedia Commons? I can wrap my head around evaluating cited sources for notability, but I don't know where to start determining the repute of photographs. So many images Wikipedia articles use are taken by seemingly random people not associated with any organization.
-
If anyone has specific questions about this, let me know, and I can probably answer them. Hopefully I can be to Lemmy and Wikimedia what Unidan was to Reddit and ecology before he crashed out over jackdaws and got exposed for vote fraud.
Is there a danger that unscrupulous actors will try and build out a Wikipedia edit history with this and try to mass skew articles with propaganda using their "trusted" accounts?
Or what might be the goal here? Is it just stupid and bored people?
-
How frequently are images generated/modified by diffusion models uploaded to Wikimedia Commons? I can wrap my head around evaluating cited sources for notability, but I don't know where to start determining the repute of photographs. So many images Wikipedia articles use are taken by seemingly random people not associated with any organization.
So far, I haven't seen all that many, and the ones that are there are very obvious, like a very glossy crab at the beach wearing a Santa Claus hat. I definitely have yet to see one that's undisclosed, let alone one actively disguising itself. I also have yet to see someone try using an AI-generated image on Wikipedia. Disclosing generative AI usage is made trivial in the upload process by an obvious checkbox, so the only incentive not to is straight-up lying.
I can't say how much of an issue this will be in the future, or what good steps toward finding and eliminating it would look like should it become one.
-
Is there a danger that unscrupulous actors will try and build out a Wikipedia edit history with this and try to mass skew articles with propaganda using their "trusted" accounts?
Or what might be the goal here? Is it just stupid and bored people?
So Wikipedia has three methods for deleting an article:
- Proposed deletion (PROD): An editor tags an article explaining why they think it should be uncontroversially deleted. After seven days, an administrator will take a look and decide if they agree. Proposed deletion of an article can only be done once, and even then the tag can be removed by anyone passing by who disagrees with it; an article deleted via PROD can be recreated at any time.
- Articles for deletion (AfD): A discussion is held to delete an article. Pretty much always, this is about the subject's notability. After the discussion (a week by default), a closer (almost always an administrator, especially for contentious discussions) will evaluate the merits of the arguments made and see if a consensus has been reached to e.g. delete, keep, redirect, or merge. Articles deleted via discussion cannot be recreated until they've satisfied the concerns of said discussion, else they can be summarily re-deleted.
- Speedy deletion: An article is so fundamentally flawed that it should be summarily deleted at best or needs to be deleted as soon as possible at worst. The nominating editor will choose one or more of the criteria for speedy deletion (CSD), and an administrator will delete the article if they agree. Like a PROD, articles deleted this way can be recreated at any time.
This new criterion has nothing to do with preempting the kind of trust building you described. The editor who made the article will not be treated any differently than they would be without this criterion. It's there so editors don't have to deal with the bullshit asymmetry principle and comb through everything to make sure it's verifiable. Sometimes editors will make these LLM-generated articles because they think they're helping but don't know how to do it themselves, sometimes it's for some bizarre agenda (e.g. there's a sockpuppet editor who's been occasionally popping up trying to push articles generated by an LLM about the Afghan–Mughal Wars), but whatever the reason, it does nothing but waste other editors' time, and its content can effectively be considered unverified. All this criterion does is expedite the process of purging their bullshit.
I'd argue meticulously building trust to push an agenda isn't a prevalent problem on Wikipedia, but that's a very different discussion.
-
Well now I want to know about jackdaws and voter fraud
-
Even saying "fair question" set off alarms. At this point, saying anything good about a response at the start is an immediate red flag.
These lists of red flags make me feel like I must be a replicant. I wrote a comment just like that one, em dash and all, on a different site just the other day, with my own organic brain!
My first instinct was to use an em dash instead of that last comma, but it seemed too on the nose.
-
This post did not contain any content.
I downloaded the entirety of Wikipedia as of 2024 to use as a reference for "truth" in the post-slop world. Maybe I should grab the 2022 version as well, just in case...
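For anyone wanting to do the same, here's a rough sketch of how one might pull a dump, assuming the standard dumps.wikimedia.org layout and the pages-articles-multistream bundle; the dump date below is a placeholder, not a specific snapshot:

```python
# Rough sketch: stream an English Wikipedia dump to disk for offline use.
# Assumes the standard dumps.wikimedia.org layout; DUMP_DATE is a
# placeholder, so pick a real date from https://dumps.wikimedia.org/enwiki/.
import urllib.request

DUMP_DATE = "20240101"  # placeholder, not a specific snapshot
URL = (
    "https://dumps.wikimedia.org/enwiki/"
    f"{DUMP_DATE}/enwiki-{DUMP_DATE}-pages-articles-multistream.xml.bz2"
)

def download(url: str, dest: str) -> None:
    """Stream the file to disk in 1 MiB chunks so it never sits in memory."""
    with urllib.request.urlopen(url) as resp, open(dest, "wb") as out:
        while chunk := resp.read(1 << 20):
            out.write(chunk)

if __name__ == "__main__":
    download(URL, f"enwiki-{DUMP_DATE}-pages-articles-multistream.xml.bz2")
```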
-
Reddit allows Google to scrape it for its AI because Google allows them to use its reCAPTCHA v3 for their moderation and banning purposes.
-
If anyone has specific questions about this, let me know, and I can probably answer them. Hopefully I can be to Lemmy and Wikimedia what Unidan was to Reddit and ecology before he crashed out over jackdaws and got exposed for vote fraud.
Unidan was a legend; he will be missed.
-
So Wikipedia has three methods for deleting an article:
- Proposed deletion (PROD): An editor tags an article explaining why they think it should be uncontroversially deleted. After seven days, an administrator will take a look and decide if they agree. Proposed deletion of an article can only be done once, and even then the tag can be removed by anyone passing by who disagrees with it; an article deleted via PROD can be recreated at any time.
- Articles for deletion (AfD): A discussion is held to delete an article. Pretty much always, this is about the subject's notability. After the discussion (a week by default), a closer (almost always an administrator, especially for contentious discussions) will evaluate the merits of the arguments made and see if a consensus has been reached to e.g. delete, keep, redirect, or merge. Articles deleted via discussion cannot be recreated until they've satisfied the concerns of said discussion, else they can be summarily re-deleted.
- Speedy deletion: An article is so fundamentally flawed that it should be summarily deleted at best or needs to be deleted as soon as possible at worst. The nominating editor will choose one or more of the criteria for speedy deletion (CSD), and an administrator will delete the article if they agree. Like a PROD, articles deleted this way can be recreated at any time.
This new criterion has nothing to do with preempting the kind of trust building you described. The editor who made the article will not be treated any differently than they would be without this criterion. It's there so editors don't have to deal with the bullshit asymmetry principle and comb through everything to make sure it's verifiable. Sometimes editors will make these LLM-generated articles because they think they're helping but don't know how to do it themselves, sometimes it's for some bizarre agenda (e.g. there's a sockpuppet editor who's been occasionally popping up trying to push articles generated by an LLM about the Afghan–Mughal Wars), but whatever the reason, it does nothing but waste other editors' time, and its content can effectively be considered unverified. All this criterion does is expedite the process of purging their bullshit.
I'd argue meticulously building trust to push an agenda isn't a prevalent problem on Wikipedia, but that's a very different discussion.
Thank you for your answer; I'm really happy that Wikipedia is safe, then. The stuff happening nowadays always makes me assume the worst.
Do you think your problem is similar to open-source developers fighting AI pull requests? There, it was theorised that some people try to train their models by making them submit code changes, abusing the maintainers' time and effort to get training data.
Is it possible that this is an effort to steal work from Wikipedia editors and get you to train their AI models?