DeepSeek's distilled new R1 AI model can run on a single GPU | TechCrunch
-
> Save me the search, please. What's deepsite?

Above is what I can do with deepsite by pasting in the first page of your Lemmy profile and the prompt:

"This is double_quack, a Lemmy user on Lemmy, a new social media platform. Create a cool profile page in a style that they'll like based on the front page of their Lemmy account (pasted in a ctrl + a, ctrl + c, ctrl + v of your profile)."

It's not perfect by any stretch of the imagination, but it's not a bad starting point.

If you want to try it: https://huggingface.co/spaces/enzostvs/deepsite
-
> Above is what I can do with deepsite by pasting in the first page of your Lemmy profile and the prompt:
>
> "This is double_quack, a Lemmy user on Lemmy, a new social media platform. Create a cool profile page in a style that they'll like based on the front page of their Lemmy account (pasted in a ctrl + a, ctrl + c, ctrl + v of your profile)."
>
> It's not perfect by any stretch of the imagination, but it's not a bad starting point.
>
> If you want to try it: https://huggingface.co/spaces/enzostvs/deepsite

Excuse me... what? Ok, that's something...
-
> Excuse me... what? Ok, that's something...

Here, I'm DM'ing you something. It's very personal, but I want to share it with you, and I made it using Deepsite (in part).
-
> 7b trash model?

I'm genuinely curious what you do that a 7B model is "trash" to you. Like, yeah, sure, a gippity now tends to beat out a Mistral 7B, but I'm pretty happy with my Mistral most of the time, if I ever even need AI at all.
-
Ew, probably still censored.
-
> ew probably still censored.

You can self-host it, right?
-
> ew probably still censored.

The censorship only exists on the version they host, which is fair enough. If they're running it themselves in China, they can't just break the law.
If you run it yourself, the censorship isn't there.
-
> The censorship only exists on the version they host, which is fair enough. If they're running it themselves in China, they can't just break the law.
>
> If you run it yourself, the censorship isn't there.

Yeah, I think the censoring in the LLM data itself would be pretty vulnerable to circumvention.
-
> the Chinese AI lab also released a smaller, “distilled” version of its new R1, DeepSeek-R1-0528-Qwen3-8B, that DeepSeek claims beats comparably sized models on certain benchmarks

> Most models come in 1B, 7-8B, 12-14B, and 27+B parameter variants. According to the docs, they benchmarked the 8B model on an NVIDIA H20 (96 GB VRAM) and got between 144 and 1,198 tokens/sec. Most consumer GPUs probably aren't going to be able to keep up with that.

Depends on the quantization.

8B is small enough to run in FP8 or a Marlin quant with SGLang/vLLM/TensorRT, so you can probably get very close to the H20 on a 3090 or 4090 (or even a 3060) if you know a little Docker.
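To see why quantization matters here, a rough back-of-the-envelope sketch (the helper name is mine; it estimates weight memory only and ignores KV cache, activations, and framework overhead, which add a few GB on top):

```python
def weight_memory_gb(params_billion: float, bits_per_weight: float) -> float:
    """Rough weight-only VRAM estimate in GB: (params * bits) / 8 bytes."""
    return params_billion * bits_per_weight / 8

# An 8B-parameter model at common precisions:
for label, bits in [("FP16", 16), ("FP8", 8), ("4-bit quant", 4)]:
    print(f"{label}: ~{weight_memory_gb(8, bits):.0f} GB of weights")
```

FP16 weights alone are ~16 GB, FP8 halves that to ~8 GB, and a 4-bit quant is ~4 GB, which is why an 8B model in FP8 sits comfortably in a 24 GB 3090/4090 and a 4-bit quant can even squeeze onto a 12 GB 3060.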
-
> You can self host it right??

If the model is censored... then what? Retraining it? Or doing it from scratch, like what open-r1 is doing?
-
> The censorship only exists on the version they host, which is fair enough. If they're running it themselves in China, they can't just break the law.
>
> If you run it yourself, the censorship isn't there.

Untrue, I downloaded the vanilla version and it's hardcoded in.
-
> You can self host it right??

The self-hosted model has hardcoded censored content.