linux-nerds.org

DeepSeek's distilled new R1 AI model can run on a single GPU | TechCrunch

Technology

21 posts 15 commenters 0 views

D double_quack@lemm.ee

Excuse me... what? Ok, that's something...
T This user is from outside of this forum
tropicaldingdong@lemmy.world

wrote

#12

Here, I'm DM'ing you something. It's very personal, but I want to share it with you, and I made it using Deepsite (in part).
1 reply Last reply

0
B blarth@thelemmy.club

7b trash model?
L This user is from outside of this forum
laintrain@lemmy.dbzer0.com

wrote

#13

I'm genuinely curious: what do you do that makes a 7B model "trash" to you? Like, yeah, sure, a gippity now tends to beat out a Mistral 7B, but I'm pretty happy with my Mistral most of the time, if I ever even need AI at all.
1 reply Last reply

5
S schizoidman@lemm.ee

This post did not contain any content.
F This user is from outside of this forum
fogetaboutit@programming.dev

wrote

#14

ew probably still censored.
M T 2 replies Last reply

2
F fogetaboutit@programming.dev

ew probably still censored.
M This user is from outside of this forum
mwa@lemm.ee

wrote

#15

You can self host it right??
F J 2 replies Last reply

5
F fogetaboutit@programming.dev

ew probably still censored.
T This user is from outside of this forum
t156@lemmy.world

wrote

#16

The censorship only exists on the version they host, which is fair enough. If they're running it themselves in China, they can't just break the law.

If you run it yourself, the censorship isn't there.
M J 2 replies Last reply

7
T t156@lemmy.world

The censorship only exists on the version they host, which is fair enough. If they're running it themselves in China, they can't just break the law.

If you run it yourself, the censorship isn't there.
M This user is from outside of this forum
monkdervierte@lemmy.ml

wrote, last edited by monkdervierte@lemmy.ml

#17

Yeah, I think the censoring in the LLM data itself would be pretty vulnerable to circumvention.
1 reply Last reply

1
V vhstape@lemmy.sdf.org

the Chinese AI lab also released a smaller, “distilled” version of its new R1, DeepSeek-R1-0528-Qwen3-8B, that DeepSeek claims beats comparably sized models on certain benchmarks

Most models come in 1B, 7-8B, 12-14B, and 27+B parameter variants. According to the docs, they benchmarked the 8B model using an NVIDIA H20 (96 GB VRAM) and got between 144-1198 tokens/sec. Most consumer GPUs probably aren’t going to be able to keep up with
B This user is from outside of this forum
brucethemoose@lemmy.world

wrote, last edited by brucethemoose@lemmy.world

#18

Depends on the quantization.

7B is small enough to run in FP8 or as a Marlin quant with SGLang/vLLM/TensorRT, so you can probably get very close to the H20 on a 3090 or 4090 (or even a 3060) if you know a little Docker.
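
To put rough numbers on "depends on the quantization", here is a minimal back-of-the-envelope sketch. It only estimates the size of the weights for an 8B model and compares that against VRAM. Everything beyond the 8B size and the 96 GB H20 figure mentioned in the thread is an assumption: the bytes-per-parameter table (with "4-bit" standing in for a Marlin-style quant), the consumer VRAM sizes (common retail variants), and the 80% headroom factor. It ignores KV cache, activations, and runtime overhead, so treat it as a lower bound rather than a guarantee that a given setup works.

```python
# Rough weight-only memory estimate for an LLM.
# Ignores KV cache, activations, and framework overhead (assumption: lower bound only).

# Assumed storage cost per parameter for each format.
BYTES_PER_PARAM = {"FP16": 2.0, "FP8": 1.0, "4-bit": 0.5}

# VRAM in GB. The H20 figure comes from the thread; the consumer cards are
# assumed to be the common 24 GB / 24 GB / 12 GB variants.
GPU_VRAM_GB = {"H20": 96, "RTX 4090": 24, "RTX 3090": 24, "RTX 3060": 12}


def weight_gb(params_billion: float, bytes_per_param: float) -> float:
    """Approximate weight size in GB (1 GB = 1e9 bytes)."""
    return params_billion * 1e9 * bytes_per_param / 1e9


def fits(params_billion: float, fmt: str, gpu: str, headroom: float = 0.8) -> bool:
    """True if the weights fit within `headroom` of the GPU's VRAM.

    The 0.8 default is an arbitrary allowance for everything this sketch ignores.
    """
    needed = weight_gb(params_billion, BYTES_PER_PARAM[fmt])
    return needed <= GPU_VRAM_GB[gpu] * headroom


if __name__ == "__main__":
    for fmt, cost in BYTES_PER_PARAM.items():
        size = weight_gb(8, cost)
        ok = [gpu for gpu in GPU_VRAM_GB if fits(8, fmt, gpu)]
        print(f"8B @ {fmt}: ~{size:.0f} GB of weights, fits on: {', '.join(ok)}")
```

Under these assumptions, FP16 weights (about 16 GB) are too large for a 12 GB card, while FP8 (about 8 GB) or a 4-bit quant (about 4 GB) leave room to spare. Whether throughput then gets "very close to the H20" is a separate question this sketch does not answer.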
1 reply Last reply

1
M mwa@lemm.ee

You can self host it right??
F This user is from outside of this forum
fogetaboutit@programming.dev

wrote

#19

if the model is censored... then what, retraining it? Or doing it from scratch like what open-r1 is doing?
1 reply Last reply

0
T t156@lemmy.world

The censorship only exists on the version they host, which is fair enough. If they're running it themselves in China, they can't just break the law.

If you run it yourself, the censorship isn't there.
J This user is from outside of this forum
jaschen@lemm.ee

wrote

#20

Untrue, I downloaded the vanilla version and it's hardcoded in.
1 reply Last reply

2
M mwa@lemm.ee

You can self host it right??
J This user is from outside of this forum
jaschen@lemm.ee

wrote

#21

The self hosted model has hard coded censored content.
1 reply Last reply

1
