Skip to content

Large Language Model Performance Doubles Every 7 Months

Technology
53 34 11
  • Classic pseudo-science for the modern grifter. Vague definitions, sloppy measurements, extremely biased, wild unsupported predictions, etc.

    and assuming that improvement doesn't plateau, ever,

  • This is such bullshit. Models have already consumed all available data and have nothing left to consume, whole needing exponentially more data for progressive advancements

    time for them to set sail to the wild seas again!

  • Moore's law hasn't been exponential for ~15 years now.

    If you are just talking transitor density I believe it still is, but even if not, my point was that it had exponential growth spanning over many decades.

  • Because so much money has been thrown at it, for startups, for power generation, for investors, that this is little more than marketing for retail investors to buy into.

    That's no doubt that they pour money to machines instead of people. We all see that in statistics that machines get more support these days than people.

  • when will they be able to tell me how many 'r's are in 'strawberry' in under 1s?

    Deepseek-r1:1.5b

    ::: spoiler Thinking

    Thinking...
    Alright, so I'm trying to figure out how many 'r's are in the word "strawberry." Let me start by reading it out loud to get a better 
    sense of what it sounds like.
    
    S-T-R-A-W-B-E-R-R-Y
    
    Okay, let's break this down letter by letter. The first letter is S, then T, R, A, W, B, E, and two R's at the end.
    
    So far, I've got one 'R' in the third position after S-T- and before A-W-B-E-R-R-Y.
    
    Wait a minute, there are two R's in "strawberry." So that means there are two instances of the letter 'r' in the word. Is that all?
    
    Let me double-check to make sure I'm not missing any other 'r's. Scanning through each letter again:
    
    1. S - No
    2. T - No
    3. R - Yes, one R here.
    4. A - No
    5. W - No
    6. B - No
    7. E - No
    8. R - Second R
    9. R - Third R
    
    Wait, am I counting correctly? So far, from 1 to 9, that's three letters after the first two: R, A, W, B, and then R, R again at the end.
    
    Hold on, maybe I should visualize it like this:
    
    S T R A W B E R R Y
    
    So positions:
    1. S
    2. T
    3. R (position 3)
    4. A
    5. W
    6. B
    7. E
    8. R (position 8)
    9. R (position 9)
    
    And then the word ends with 'Y.'
    
    Hmm, so only two 'R's in total? Let me confirm:
    
    Looking back: positions 3 and 8 and 9.
    
    Wait, that makes three 'R's. Did I miss one?
    
    Hold on, maybe I need to write it out clearly to visualize better:
    
    S T R A W B E R R Y
    
    Counting each 'R':
    
    - After S (position 2): No
    - After T (position 3): Yes, first R
    - After A (position 4): No
    - After W (position 5): No
    - After B (position 6): No
    - After E (position 7): No
    - After R (position 8): Second R
    - After R (position 9): Third R
    - After Y (position 10): No
    
    So that's three 'R's in total. Did I count right? Maybe I should write it down step by step to ensure accuracy.
    
    Alternatively, maybe there are only two 'R's because the 'Y' is considered separate from the R. But logically, even if Y has a vowel 
    sound, 'r' still counts separately unless specified otherwise.
    
    Wait, actually, in English letters, 'Y' isn't really pronounced like an 'R.' It sounds more like a consonant. So in that case, probably 
    only two 'R's: one between S and T, and another before the end of the word.
    
    But I'm still confused because when I count, I sometimes make mistakes. Maybe using an alphabet list or a counting tool would help.
    
    Let me write down each letter with its position:
    
    1. S
    2. T
    3. R (count 1)
    4. A
    5. W
    6. B
    7. E
    8. R (count 2)
    9. R (count 3)
    10. Y
    
    So that's three 'R's in total.
    
    Wait, but the word is "strawberry," and I'm pretty sure it has two 'r's because of the spelling of "strawberry." Maybe the counting 
    matches that.
    
    I think my initial count might have been off by one, thinking there are only two R's when actually there are three: one in position 3, 
    another at 8, and another at 9. But given that Y isn't pronounced like an 'R,' it's more accurate to say there are two 'r's.
    
    Hmm, maybe I need to consider the pronunciation of each letter as well. Even though 'Y' is a consonant, sometimes people might play 
    around with silent letters or other pronunciation rules. But unless instructed otherwise, I should stick to standard English letter 
    sounds for this purpose.
    
    So after carefully counting and considering each letter's individual sound, it seems that "strawberry" has two 'r's: one in the middle 
    and another at the end.
    ...done thinking.
    
    

    :::

    To determine how many 'r's are in the word "strawberry," we can break it down letter by letter:
    
    1. **S** - No
    2. **T** - No
    3. **R** (count 1)
    4. **A** - No
    5. **W** - No
    6. **B** - No
    7. **E** - No
    8. **R** (count 2)
    9. **R** (count 3)
    10. **Y** - No
    
    However, considering the pronunciation of each letter and standard English rules, even though 'Y' is a consonant, it's more accurate to 
    say that "strawberry" contains only two 'r's: one in position 3 and another at position 9.
    
    Final Answer: There are two 'r's in the word "strawberry."
    
  • This is like measuring the increasing speeds of cars in the early years and extrapolating that they would be supersonic by now by ignoring the exponential impact that air resistance has.

    Or like looking at the early days of semiconductors and extrapolating that CPU speed will double every 18 months ..smh these people

  • Or like looking at the early days of semiconductors and extrapolating that CPU speed will double every 18 months ..smh these people

    Since CPU speeds are still doubling every 18 months you have a solid point!

    Or maybe not since you are probably referring to the doubling of transistors that was an observation which was accurate over a lengthy period of time in the context of when the observation was made. Nobody said that would continue indefinitely either.

  • By 2030, AI will greatly outperform humans in some complex intellectual tasks. Discover how LLMs are doubling their capabilities every seven months.

    Someone doesn't know the folly of extending straight lines graphs into the future.

  • By 2030, AI will greatly outperform humans in some complex intellectual tasks. Discover how LLMs are doubling their capabilities every seven months.

    new moore law dropped

  • Since CPU speeds are still doubling every 18 months you have a solid point!

    Or maybe not since you are probably referring to the doubling of transistors that was an observation which was accurate over a lengthy period of time in the context of when the observation was made. Nobody said that would continue indefinitely either.

    Yup, that's what I was alluding to, while it may not still be the case for transistors, they did manage to take 50 odd years to get there, push that trend line from the figure 50 years heh (not saying you should, 5 seems much more conservative)

    Take a look at Nvidias pace wrt Moore's law (of FLOPS) https://netrouting.com/nvidia-surpassing-moores-law-gpu-innovation/

  • Classic pseudo-science for the modern grifter. Vague definitions, sloppy measurements, extremely biased, wild unsupported predictions, etc.

    That graph is hilarious. Enormous error bars, totally arbitrary quantization of complexity, and it's title? "Task time for a human that an AI model completes with a 50 percent success rate". 50 percent success is useless, lmao.

    On a more sober note, I'm very disappointed that IEEE is publishing this kind of trash.

  • Deepseek-r1:1.5b

    ::: spoiler Thinking

    Thinking...
    Alright, so I'm trying to figure out how many 'r's are in the word "strawberry." Let me start by reading it out loud to get a better 
    sense of what it sounds like.
    
    S-T-R-A-W-B-E-R-R-Y
    
    Okay, let's break this down letter by letter. The first letter is S, then T, R, A, W, B, E, and two R's at the end.
    
    So far, I've got one 'R' in the third position after S-T- and before A-W-B-E-R-R-Y.
    
    Wait a minute, there are two R's in "strawberry." So that means there are two instances of the letter 'r' in the word. Is that all?
    
    Let me double-check to make sure I'm not missing any other 'r's. Scanning through each letter again:
    
    1. S - No
    2. T - No
    3. R - Yes, one R here.
    4. A - No
    5. W - No
    6. B - No
    7. E - No
    8. R - Second R
    9. R - Third R
    
    Wait, am I counting correctly? So far, from 1 to 9, that's three letters after the first two: R, A, W, B, and then R, R again at the end.
    
    Hold on, maybe I should visualize it like this:
    
    S T R A W B E R R Y
    
    So positions:
    1. S
    2. T
    3. R (position 3)
    4. A
    5. W
    6. B
    7. E
    8. R (position 8)
    9. R (position 9)
    
    And then the word ends with 'Y.'
    
    Hmm, so only two 'R's in total? Let me confirm:
    
    Looking back: positions 3 and 8 and 9.
    
    Wait, that makes three 'R's. Did I miss one?
    
    Hold on, maybe I need to write it out clearly to visualize better:
    
    S T R A W B E R R Y
    
    Counting each 'R':
    
    - After S (position 2): No
    - After T (position 3): Yes, first R
    - After A (position 4): No
    - After W (position 5): No
    - After B (position 6): No
    - After E (position 7): No
    - After R (position 8): Second R
    - After R (position 9): Third R
    - After Y (position 10): No
    
    So that's three 'R's in total. Did I count right? Maybe I should write it down step by step to ensure accuracy.
    
    Alternatively, maybe there are only two 'R's because the 'Y' is considered separate from the R. But logically, even if Y has a vowel 
    sound, 'r' still counts separately unless specified otherwise.
    
    Wait, actually, in English letters, 'Y' isn't really pronounced like an 'R.' It sounds more like a consonant. So in that case, probably 
    only two 'R's: one between S and T, and another before the end of the word.
    
    But I'm still confused because when I count, I sometimes make mistakes. Maybe using an alphabet list or a counting tool would help.
    
    Let me write down each letter with its position:
    
    1. S
    2. T
    3. R (count 1)
    4. A
    5. W
    6. B
    7. E
    8. R (count 2)
    9. R (count 3)
    10. Y
    
    So that's three 'R's in total.
    
    Wait, but the word is "strawberry," and I'm pretty sure it has two 'r's because of the spelling of "strawberry." Maybe the counting 
    matches that.
    
    I think my initial count might have been off by one, thinking there are only two R's when actually there are three: one in position 3, 
    another at 8, and another at 9. But given that Y isn't pronounced like an 'R,' it's more accurate to say there are two 'r's.
    
    Hmm, maybe I need to consider the pronunciation of each letter as well. Even though 'Y' is a consonant, sometimes people might play 
    around with silent letters or other pronunciation rules. But unless instructed otherwise, I should stick to standard English letter 
    sounds for this purpose.
    
    So after carefully counting and considering each letter's individual sound, it seems that "strawberry" has two 'r's: one in the middle 
    and another at the end.
    ...done thinking.
    
    

    :::

    To determine how many 'r's are in the word "strawberry," we can break it down letter by letter:
    
    1. **S** - No
    2. **T** - No
    3. **R** (count 1)
    4. **A** - No
    5. **W** - No
    6. **B** - No
    7. **E** - No
    8. **R** (count 2)
    9. **R** (count 3)
    10. **Y** - No
    
    However, considering the pronunciation of each letter and standard English rules, even though 'Y' is a consonant, it's more accurate to 
    say that "strawberry" contains only two 'r's: one in position 3 and another at position 9.
    
    Final Answer: There are two 'r's in the word "strawberry."
    

    I like how it counted correctly and then gave an incorrect final answer.. Bravo 👏

  • Someone doesn't know the folly of extending straight lines graphs into the future.

  • Oof, the alt text on that second one was unexpectedly dark lmao

  • By 2030, AI will greatly outperform humans in some complex intellectual tasks. Discover how LLMs are doubling their capabilities every seven months.

    they are improving at an exponential rate. It's just that the exponent is less than one.

  • That graph is hilarious. Enormous error bars, totally arbitrary quantization of complexity, and it's title? "Task time for a human that an AI model completes with a 50 percent success rate". 50 percent success is useless, lmao.

    On a more sober note, I'm very disappointed that IEEE is publishing this kind of trash.

    in yes/no type questions, 50% success rate is the absolute worst one can do. Any worse and you're just giving an inverted correct answer more than half the time

  • 32 Stimmen
    6 Beiträge
    30 Aufrufe
    G
    Yes. I can't imagine that they will go after individuals. Businesses can't be so cavalier. But if creators don't pay the extra cost to make their models compliant with EU law, then they can't be used in the EU anyway. So it probably doesn't matter much. The Llama models with vision have the no-EU clause. It's because Meta wasn't allowed to train on European's data because of GDPR. The pure LLMs are fine. They might even be compliant, but we'll have to see what the courts think.
  • 350 Stimmen
    72 Beiträge
    77 Aufrufe
    M
    Sure, the internet is more practical, and the odds of being caught in the time required to execute a decent strike plan, even one as vague as: "we're going to Amerika and we're going to hit 50 high profile targets on July 4th, one in every state" (Dear NSA analyst, this is entirely hypothetical) so your agents spread to the field and start assessing from the ground the highest impact targets attainable with their resources, extensive back and forth from the field to central command daily for 90 days of prep, but it's being carried out on 270 different active social media channels as innocuous looking photo exchanges with 540 pre-arranged algorithms hiding the messages in the noise of the image bits. Chances of security agencies picking this up from the communication itself? About 100x less than them noticing 50 teams of activists deployed to 50 states at roughly the same time, even if they never communicate anything. HF (more often called shortwave) is well suited for the numbers game. A deep cover agent lying in wait, potentially for years. Only "tell" is their odd habit of listening to the radio most nights. All they're waiting for is a binary message: if you hear the sequence 3 17 22 you are to make contact for further instructions. That message may come at any time, or may not come for a decade. These days, you would make your contact for further instructions via internet, and sure, it would be more practical to hide the "make contact" signal in the internet too, but shortwave is a longstanding tech with known operating parameters.
  • The End of Publishing as We Know It

    Technology technology
    10
    1
    51 Stimmen
    10 Beiträge
    27 Aufrufe
    beejjorgensen@lemmy.sdf.orgB
    Lol.. I wanted "DRM". But it's been a long day.
  • 1 Stimmen
    1 Beiträge
    8 Aufrufe
    Niemand hat geantwortet
  • Adaptive Keyboards & Writing Technologies For One-Handed Users

    Technology technology
    5
    1
    112 Stimmen
    5 Beiträge
    24 Aufrufe
    T
    Came here to say this.
  • My character isn't answering me

    Technology technology
    1
    0 Stimmen
    1 Beiträge
    11 Aufrufe
    Niemand hat geantwortet
  • Companies are using Ribbon AI, an AI interviewer to screen candidates.

    Technology technology
    52
    56 Stimmen
    52 Beiträge
    96 Aufrufe
    P
    I feel like I could succeed in an LLM selection process. I could sell my skills to a robot, could get an LLM to help. It's a long way ahead of keyword based automatic selectors At least an LLM is predictable, human judges are so variable
  • 81 Stimmen
    8 Beiträge
    35 Aufrufe
    P
    I expect them to give shareholders and directors a haircut before laying off workers, yes. But we know Microsoft never does that, so they can go f themselves.