linux-nerds.org

An analysis of 15M+ biomedical abstracts from 2010 to 2024 finds researchers using AI to write abstracts use certain words far more often than those who don't

Technology

4 posts, 4 commenters, 59 views

pro@programming.dev wrote:

#1

Large language models (LLMs) like ChatGPT can generate and revise text with human-level performance. These models come with clear limitations, can produce inaccurate information, and reinforce existing biases. Yet, many scientists use them for their scholarly writing. But how widespread is such LLM usage in the academic literature? To answer this question for the field of biomedical research, we present an unbiased, large-scale approach: We study vocabulary changes in more than 15 million biomedical abstracts from 2010 to 2024 indexed by PubMed and show how the appearance of LLMs led to an abrupt increase in the frequency of certain style words. This excess word analysis suggests that at least 13.5% of 2024 abstracts were processed with LLMs. This lower bound differed across disciplines, countries, and journals, reaching 40% for some subcorpora. We show that LLMs have had an unprecedented impact on scientific writing in biomedical research, surpassing the effect of major world events such as the COVID pandemic.
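
To make the idea of "excess word analysis" from the abstract concrete, here is a minimal sketch in Python. It is not the authors' code. The function names, the toy corpus, and the choice of a straight-line extrapolation from two baseline years are all assumptions made for illustration. The sketch measures the share of abstracts containing a word in each year, projects the pre-LLM trend forward, and reports how far the observed share sits above that projection.

```python
import re


def doc_frequency(abstracts, word):
    """Fraction of abstracts that contain `word` at least once (case-insensitive)."""
    if not abstracts:
        return 0.0
    hits = 0
    for text in abstracts:
        # Crude tokenizer (assumption): lowercase runs of ASCII letters only.
        tokens = set(re.findall(r"[a-z]+", text.lower()))
        if word in tokens:
            hits += 1
    return hits / len(abstracts)


def excess_gap(freq_by_year, baseline_years, target_year):
    """Observed frequency minus the frequency expected from the baseline trend.

    The expected value is a straight line through the two baseline years,
    extended to the target year (an illustrative assumption).
    """
    first, last = baseline_years
    slope = (freq_by_year[last] - freq_by_year[first]) / (last - first)
    expected = max(freq_by_year[last] + slope * (target_year - last), 0.0)
    return freq_by_year[target_year] - expected


# Hypothetical toy corpus, four invented "abstracts" per year.
toy_corpus = {
    2021: ["We study cells.", "We delve into mice.", "Results are shown.", "Data were collected."],
    2022: ["We study mice.", "Here we delve deeper.", "Results are shown.", "Data were collected."],
    2024: ["We delve into cells.", "We delve into mice.", "Here we delve deeper.", "Data were collected."],
}

freq = {year: doc_frequency(texts, "delve") for year, texts in toy_corpus.items()}
gap = excess_gap(freq, baseline_years=(2021, 2022), target_year=2024)
print(f"excess share of abstracts containing 'delve' in 2024: {gap:.2f}")
```

Under these assumptions, the gap for a single marker word is a floor on the share of abstracts that changed style: every abstract above the projected trend contains the word, but an LLM-edited abstract need not contain it. That is the sense in which the abstract speaks of a lower bound.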

(www.science.org)
renzhexiangjiao@piefed.blahaj.zone wrote:

#2

tbh I don't see anything wrong with using AI just to write the abstract, assuming the author revises it afterwards. It becomes much more problematic if AI is used in the main body of the paper, where it is crucial to present information as accurately as possible.
plebcouncilman@sh.itjust.works wrote:

#3

Analysis of over 15M+ bodies of water finds that water is wet.
trailee@sh.itjust.works wrote:

#4

Very interesting paper, and grade A irony to begin the title with “delving” while finding that “delve” is one of the top excess words/markers of LLM writing.

Moreover, the authors highlight a few excerpts that “illustrate the LLM-style flowery language” including

“By meticulously delving into the intricate web connecting […] and […], this comprehensive chapter takes a deep dive into their involvement as significant risk factors for […].”

…and then they clearly intentionally conclude the discussion section thus

“We hope that future work will meticulously delve into tracking LLM usage more accurately and assess which policy changes are crucial to tackle the intricate challenges posed by the rise of LLMs in scientific publishing.”

Great work.