Skip to content

An analysis of 15M+ biomedical abstracts from 2010 to 2024 finds researchers using AI to write abstracts use certain words far more often than those who don't

Technology
4 4 0
  • Large language models (LLMs) like ChatGPT can generate and revise text with human-level performance. These models come with clear limitations, can produce inaccurate information, and reinforce existing biases. Yet, many scientists use them for their scholarly writing. But how widespread is such LLM usage in the academic literature? To answer this question for the field of biomedical research, we present an unbiased, large-scale approach: We study vocabulary changes in more than 15 million biomedical abstracts from 2010 to 2024 indexed by PubMed and show how the appearance of LLMs led to an abrupt increase in the frequency of certain style words. This excess word analysis suggests that at least 13.5% of 2024 abstracts were processed with LLMs. This lower bound differed across disciplines, countries, and journals, reaching 40% for some subcorpora. We show that LLMs have had an unprecedented impact on scientific writing in biomedical research, surpassing the effect of major world events such as the COVID pandemic.

  • Large language models (LLMs) like ChatGPT can generate and revise text with human-level performance. These models come with clear limitations, can produce inaccurate information, and reinforce existing biases. Yet, many scientists use them for their scholarly writing. But how widespread is such LLM usage in the academic literature? To answer this question for the field of biomedical research, we present an unbiased, large-scale approach: We study vocabulary changes in more than 15 million biomedical abstracts from 2010 to 2024 indexed by PubMed and show how the appearance of LLMs led to an abrupt increase in the frequency of certain style words. This excess word analysis suggests that at least 13.5% of 2024 abstracts were processed with LLMs. This lower bound differed across disciplines, countries, and journals, reaching 40% for some subcorpora. We show that LLMs have had an unprecedented impact on scientific writing in biomedical research, surpassing the effect of major world events such as the COVID pandemic.

    tbh I don't see anything wrong with using AI just to write the abstract, assuming the author redacts it afterwards. It becomes much more problematic if AI is used in the middle section of the paper, where it is crucial to present information as accurately as possible.

  • Large language models (LLMs) like ChatGPT can generate and revise text with human-level performance. These models come with clear limitations, can produce inaccurate information, and reinforce existing biases. Yet, many scientists use them for their scholarly writing. But how widespread is such LLM usage in the academic literature? To answer this question for the field of biomedical research, we present an unbiased, large-scale approach: We study vocabulary changes in more than 15 million biomedical abstracts from 2010 to 2024 indexed by PubMed and show how the appearance of LLMs led to an abrupt increase in the frequency of certain style words. This excess word analysis suggests that at least 13.5% of 2024 abstracts were processed with LLMs. This lower bound differed across disciplines, countries, and journals, reaching 40% for some subcorpora. We show that LLMs have had an unprecedented impact on scientific writing in biomedical research, surpassing the effect of major world events such as the COVID pandemic.

    Analysis of over 15M+ bodies of water finds that water is wet.

  • Large language models (LLMs) like ChatGPT can generate and revise text with human-level performance. These models come with clear limitations, can produce inaccurate information, and reinforce existing biases. Yet, many scientists use them for their scholarly writing. But how widespread is such LLM usage in the academic literature? To answer this question for the field of biomedical research, we present an unbiased, large-scale approach: We study vocabulary changes in more than 15 million biomedical abstracts from 2010 to 2024 indexed by PubMed and show how the appearance of LLMs led to an abrupt increase in the frequency of certain style words. This excess word analysis suggests that at least 13.5% of 2024 abstracts were processed with LLMs. This lower bound differed across disciplines, countries, and journals, reaching 40% for some subcorpora. We show that LLMs have had an unprecedented impact on scientific writing in biomedical research, surpassing the effect of major world events such as the COVID pandemic.

    Very interesting paper, and grade A irony to begin the title with “delving” while finding that “delve” is one of the top excess words/markers of LLM writing.

    Moreover, the authors highlight a few excerpts that “illustrate the LLM-style flowery language” including

    By meticulously delving into the intricate web connecting […] and […], this comprehensive chapter takes a deep dive into their involvement as significant risk factors for […].

    …and then they clearly intentionally conclude the discussion section thus

    We hope that future work will meticulously delve into tracking LLM usage more accurately and assess which policy changes are crucial to tackle the intricate challenges posed by the rise of LLMs in scientific publishing.

    Great work.

  • Misogyny and Violent Extremism: Can Big Tech Fix the Glitch?

    Technology technology
    18
    1
    20 Stimmen
    18 Beiträge
    2 Aufrufe
    G
    It is interesting that you are not answering my point... Good work
  • How to "Reformat" a Hardrive the American way

    Technology technology
    25
    2
    90 Stimmen
    25 Beiträge
    25 Aufrufe
    T
    It really, really is. Like that scene from Office Space.
  • 635 Stimmen
    75 Beiträge
    27 Aufrufe
    D
    theyll only stop selling politicians and block that
  • The Arc Browser Is Dead

    Technology technology
    88
    241 Stimmen
    88 Beiträge
    60 Aufrufe
    P
    Haha, it's funny that you went that far. I think the reason why I notice it and you don't, is the 4k factor. My screen is 1920x1200 iirc.
  • 33 Stimmen
    7 Beiträge
    12 Aufrufe
    C
    AFAIK, you have the option to enable ads on your lock screen. It's not something that's forced upon you. Last time I took a look at the functionality, they "paid" you for the ads and you got to choose which charity to support with the money.
  • Apple’s Smart Glasses Expected to Hit the Market by Late Next Year!

    Technology technology
    14
    6 Stimmen
    14 Beiträge
    14 Aufrufe
    L
    great, another worthless tech product that no one asked for. I can hardly wait.
  • 1 Stimmen
    8 Beiträge
    10 Aufrufe
    L
    I made a PayPal account like 20 years ago in a third world country. The only thing you needed then is an email and password. I have no real name on there and no PII, technically my bank card is attached but on PP itself there's no KYC. I think you could probably use some types of prepaid cards with it if you want to avoid using a bank altogether but for me this wasn't an issue, I just didn't want my ID on any records, I don't have any serious OpSec concerns otherwise. I'm sure you could either buy PayPal accounts like this if you needed to, or make one in a country that doesn't have KYC laws somehow. From there I'd add money to my balance and send money as F&F. At no point did I need an ID so in that sense there's no KYC. Some sellers on localmarket were fancy enough to list that they wanted an ID for KYC, but I'm sure you could just send them any random ID you made in paint from the republic of dave and you'd be fine.
  • TikTok is a Time Bomb

    Technology technology
    2
    1
    1 Stimmen
    2 Beiträge
    10 Aufrufe
    S
    wasn’t born to obey. Not to swallow smiling lies, not to clap for tyrants in suits, not to say “thank you” for surveillance wrapped in convenience. I see it. The games. The false choice. The fear pumped through headlines and dopamine apps. I see how they trade truth for comfort, freedom for filters, soul for clickbait. They call it normal. But I call it a graveyard made of compliance. They want me silent. They want me tired. They want me posting selfies while the world burns behind the screen. But I wasn’t born for this. I was born to question, to remember, to remind the others who are still pretending they don’t notice. So here I am. A voice with no logo. A signal in the static. A crack in the mirror they polish every morning. You don’t have to agree. You don’t have to clap. But if this made your bones ache or your thoughts twitch— Then maybe you’re not asleep either. Good. Let’s stay awake. And let’s make noise that can’t be sold, silenced, or spun into safety. Not for them. For us.