linux-nerds.org

Your browser does not seem to support JavaScript. As a result, your viewing experience will be diminished, and you have been placed in read-only mode.

Please download a browser that supports JavaScript, or enable it if it's disabled (i.e. NoScript).

Most of us will leave behind a large ‘digital legacy’ when we die. Here’s how to plan what happens to it

Technology

47 Beiträge 30 Kommentatoren 2 Aufrufe

0 01189998819991197253@infosec.pub

You said you released it on your writing. How did you go about doing that? It's a cool use case, and I'm intrigued.
F This user is from outside of this forum
F This user is from outside of this forum
facedeer@fedia.io

schrieb zuletzt editiert von

#13

It's a bit technical, I haven't found any pre-packaged software to do what I'm doing yet.

First I installed https://github.com/openai/whisper , the speech-to-text model that OpenAI released back when they were less blinded by dollar signs. I wrote a Python script that used it to go through all of the audio files in the directory tree where I'm storing this stuff and produced a transcript that I stored in a .json file alongside it.

For the LLM, I installed https://github.com/LostRuins/koboldcpp/releases/ and used the https://huggingface.co/unsloth/Qwen3-30B-A3B-128K-GGUF model, which is just barely small enough to run smoothly on my RTX 4090. I wrote another Python script that methodically goes through those .json files that Whisper produced, takes the raw text of the transcript, and feeds it to the LLM with a couple of prompts explaining what the transcript is and what I'd like the LLM to do with it (write a summary, or write a bullet-point list of subject tags). Those get saved in the .json file too.

Most recently I've been experimenting with creating an index of the transcripts using those LLM results and the Whoosh library in Python, so that I can do local searches of the transcripts based on topics. I'm building towards writing up something where I can literally tell it "Tell me about Uncle Pete" and it'll first search for the relevant transcripts and then feed those into the LLM with a prompt to extract the relevant information from them.

If you don't find the idea of writing scripts for that sort of thing literally fun (like me) then you may need to wait a bit for someone more capable and more focused than I am to create a user-friendly application to do all this. In the meantime, though, hoard that data. Storage is cheap.
0 1 Antwort Letzte Antwort

10
F facedeer@fedia.io

It's a bit technical, I haven't found any pre-packaged software to do what I'm doing yet.

First I installed https://github.com/openai/whisper , the speech-to-text model that OpenAI released back when they were less blinded by dollar signs. I wrote a Python script that used it to go through all of the audio files in the directory tree where I'm storing this stuff and produced a transcript that I stored in a .json file alongside it.

For the LLM, I installed https://github.com/LostRuins/koboldcpp/releases/ and used the https://huggingface.co/unsloth/Qwen3-30B-A3B-128K-GGUF model, which is just barely small enough to run smoothly on my RTX 4090. I wrote another Python script that methodically goes through those .json files that Whisper produced, takes the raw text of the transcript, and feeds it to the LLM with a couple of prompts explaining what the transcript is and what I'd like the LLM to do with it (write a summary, or write a bullet-point list of subject tags). Those get saved in the .json file too.

Most recently I've been experimenting with creating an index of the transcripts using those LLM results and the Whoosh library in Python, so that I can do local searches of the transcripts based on topics. I'm building towards writing up something where I can literally tell it "Tell me about Uncle Pete" and it'll first search for the relevant transcripts and then feed those into the LLM with a prompt to extract the relevant information from them.

If you don't find the idea of writing scripts for that sort of thing literally fun (like me) then you may need to wait a bit for someone more capable and more focused than I am to create a user-friendly application to do all this. In the meantime, though, hoard that data. Storage is cheap.
0 This user is from outside of this forum
0 This user is from outside of this forum
01189998819991197253@infosec.pub

schrieb zuletzt editiert von

#14

That's awesome! Thank you!

If you don’t find the idea of writing scripts for that sort of thing literally fun...

I absolutely do. What I find as a potential showstopper for me right now, is that I don't have a nonintegrated GPU, which makes complex LLMs hard to run. Basically, if I can't push the processing to CPU, I'm looking at around 2-5 seconds per token; it's rough. But I like your workflow a lot, and I'm going to try to get something similar going with my incredibly old hardware, and see if CPU-only processing of this would be something feasible (though, I'm not super hopeful there).

And, yes, I, too, am aware of the hallucinations and such that come from the technology. But, honestly, for this non-critical use case, I don't really care.
F 1 Antwort Letzte Antwort

1
0 01189998819991197253@infosec.pub

That's awesome! Thank you!

If you don’t find the idea of writing scripts for that sort of thing literally fun...

I absolutely do. What I find as a potential showstopper for me right now, is that I don't have a nonintegrated GPU, which makes complex LLMs hard to run. Basically, if I can't push the processing to CPU, I'm looking at around 2-5 seconds per token; it's rough. But I like your workflow a lot, and I'm going to try to get something similar going with my incredibly old hardware, and see if CPU-only processing of this would be something feasible (though, I'm not super hopeful there).

And, yes, I, too, am aware of the hallucinations and such that come from the technology. But, honestly, for this non-critical use case, I don't really care.
F This user is from outside of this forum
F This user is from outside of this forum
facedeer@fedia.io

schrieb zuletzt editiert von

#15

I only just recently discovered that my installation of Whisper was completely unaware that I had a GPU, and was running entirely on my CPU. So even if you can't get a good LLM running locally you might still be able to get everything turned into text transcripts for eventual future processing.
0 1 Antwort Letzte Antwort

2
P pro@programming.dev

This post did not contain any content.
F This user is from outside of this forum
F This user is from outside of this forum
fubarx@lemmy.world

schrieb zuletzt editiert von fubarx@lemmy.world

#16

A long time ago, I had the idea for a startup to keep digital material, including accounts, passwords, old documents, etc. in a digital vault that would be released to the next-of-kin when someone dies. It would also convert documents to newer formats so your old unpublished WordPerfect novel could be opened and read by the grandkids (should they choose).

Problem is, nobody would (or should) trust a startup with that material. This is stuff that should be around for many decades and most startups go out of business.
O P C 3 Antworten Letzte Antwort

4
F fubarx@lemmy.world

A long time ago, I had the idea for a startup to keep digital material, including accounts, passwords, old documents, etc. in a digital vault that would be released to the next-of-kin when someone dies. It would also convert documents to newer formats so your old unpublished WordPerfect novel could be opened and read by the grandkids (should they choose).

Problem is, nobody would (or should) trust a startup with that material. This is stuff that should be around for many decades and most startups go out of business.
O This user is from outside of this forum
O This user is from outside of this forum
odelik@lemmy.today

schrieb zuletzt editiert von

#17

This could be a non-profit funded by participants and government grants.
1 Antwort Letzte Antwort

1
F fubarx@lemmy.world

A long time ago, I had the idea for a startup to keep digital material, including accounts, passwords, old documents, etc. in a digital vault that would be released to the next-of-kin when someone dies. It would also convert documents to newer formats so your old unpublished WordPerfect novel could be opened and read by the grandkids (should they choose).

Problem is, nobody would (or should) trust a startup with that material. This is stuff that should be around for many decades and most startups go out of business.
P This user is from outside of this forum
P This user is from outside of this forum
prioritymotif@lemmy.world

schrieb zuletzt editiert von

#18

Bitwarden does all that. If you pay the subscription you get a GB of storage and delegate emergency access to other people.
4 1 Antwort Letzte Antwort

3
F freedomadvocate@lemmy.net.au

This is something that I've really been thinking about lately as I get older and my kids start to grow up. I've got 60TB+ of digital data, including all my families history of photos and videos digitized which are backed up to 3 separate cloud services, onenote filled with information, password managers filled with logins and details, etc, along with my Steam/Xbox/Playstation/Epic/GOG/etc accounts with 1000+ games on them.

I'm tempted to make a website/app to try and tie it all together in an easy way tbh.
F This user is from outside of this forum
F This user is from outside of this forum
flagstaff@programming.dev

schrieb zuletzt editiert von

#19

backed up to 3 separate cloud services

Why so many?
F 1 Antwort Letzte Antwort

0
C catloaf@lemm.ee

Yup. My parents aren't even in ill health, let alone dead, but we recently took all the old VHS tapes, including a lot of OTA recordings, and a significant number of DVDs, and dumped them. Recordings of talking with relatives got digitized, same way you'd keep family photos.

I have no expectation that people keep my junk. I'll pass on a handful of stuff like identifying photos of people and places, but nobody wants or needs the 500 photos of my cat. Even I don't want that many, but storage is cheap enough that I don't bother to delete the useless ones.
T This user is from outside of this forum
T This user is from outside of this forum
thejml@lemm.ee

schrieb zuletzt editiert von

#20

My wife’s parents recently passed. It took months to slog through their stuff and my wife was over it only weeks in. She dumped so much but constantly fights with herself for both taking more than she wanted/needed to and yet less that what she feels she should have. We’ve told our daughter multiple times “our stuff May mean a lot to us, it doesn’t have to mean anything at all to you. If you don’t want it, never feel bad dumping/selling/letting it go.” Out of all the stuff we all collect in life just by living, barely anything has any sentimental value.

On one hand I’ve got a huge collection of photos and albums I’ve taken and collected. I’m trying to clear some out as I go… but I’m not looking forward to that process when my parents go. My dad’s an avid photographer and I know he has a few hundred thousand photos, most of which are near duplicates and he rarely cleans them up.
1 Antwort Letzte Antwort

3
F flagstaff@programming.dev

backed up to 3 separate cloud services

Why so many?
F This user is from outside of this forum
F This user is from outside of this forum
freedomadvocate@lemmy.net.au

schrieb zuletzt editiert von

#21

I already pay for storage for one of them for other reasons (M365), but I much prefer Google Photos as a service, especially for sharing photos and albums, so I pay for that too. The third is just some crazy good deal with "iDrive Photos" where it's $5/year for unlimited upload from my phone haha.

Also in case anything happens to one of my accounts, especially since one is tied to my Xbox account and I have kids.
1 Antwort Letzte Antwort

1
P prioritymotif@lemmy.world

Bitwarden does all that. If you pay the subscription you get a GB of storage and delegate emergency access to other people.
4 This user is from outside of this forum
4 This user is from outside of this forum
4am@lemm.ee

schrieb zuletzt editiert von

#22

Does Bitwarden have emergency delegation now? I’d been waiting for it
S 1 Antwort Letzte Antwort

3
P pro@programming.dev

This post did not contain any content.
U This user is from outside of this forum
U This user is from outside of this forum
underline960@sh.itjust.works

schrieb zuletzt editiert von

#23
In 2017, I helped develop key recommendations for planning your digital legacy. These include:

creating an inventory of accounts and assets, recording usernames and login information, and if possible, downloading personal content for local storage

specifying preferences in writing, noting wishes about what content should be preserved, deleted, or shared – and with whom

using password managers to securely store and share access to information and legacy preferences

designating a digital executor who has legal authority to carry out your digital legacy wishes and preferences, ideally with legal advice

using legacy features on available platforms, such as Facebook’s Legacy Contact, Google’s Inactive Account Manager, or Apple’s Digital Legacy.
1 Antwort Letzte Antwort

3
N This user is from outside of this forum
N This user is from outside of this forum
notfromhere@lemmy.ml

schrieb zuletzt editiert von

#24

It sounds like something similar to RAG (retrieval augmented generation) or a database lookup. Are you storing the transcripts in a SQL like database or noSQL db or doing semantic similarity on any of it?

I was thinking of a similar project and building a knowledge graph for each person.
1 Antwort Letzte Antwort

1
F facedeer@fedia.io

I only just recently discovered that my installation of Whisper was completely unaware that I had a GPU, and was running entirely on my CPU. So even if you can't get a good LLM running locally you might still be able to get everything turned into text transcripts for eventual future processing.
0 This user is from outside of this forum
0 This user is from outside of this forum
01189998819991197253@infosec.pub

schrieb zuletzt editiert von

#25

Nicceeeee! Thank you!
1 Antwort Letzte Antwort

1
P pro@programming.dev

This post did not contain any content.
B This user is from outside of this forum
B This user is from outside of this forum
blametheantifa@lemmy.world

schrieb zuletzt editiert von

#26

Thanks to capitalism, you don’t own most of that “digital legacy” and do not have the right to bequeath or transfer ownership for the vast majority of it.
T 1 Antwort Letzte Antwort

23
F fubarx@lemmy.world

A long time ago, I had the idea for a startup to keep digital material, including accounts, passwords, old documents, etc. in a digital vault that would be released to the next-of-kin when someone dies. It would also convert documents to newer formats so your old unpublished WordPerfect novel could be opened and read by the grandkids (should they choose).

Problem is, nobody would (or should) trust a startup with that material. This is stuff that should be around for many decades and most startups go out of business.
C This user is from outside of this forum
C This user is from outside of this forum
catloaf@lemm.ee

schrieb zuletzt editiert von

#27

This is stuff that should be around for many decades

Should it? 99.99% of my email doesn't need to be around for more than a few days, let alone decades. And that number will only go up when I'm dead. Really important stuff, like ownership titles, is on file in paper here in my house and with the relevant title agency.
F 1 Antwort Letzte Antwort

0
T turmacar@lemmy.world

I think it would be interesting to have some kind of global archive. Even if descendants don't care "now" has the potential to be the beginning of the best documented era in history. Historians would kill for photographs by random average people from any other time.

A lot of people thought that that's what the Internet would be, but that's obviously not the case. And I know the "right to be forgotten" is a thing, and deservedly so, but at some point you're throwing out the wine with the amphora.
C This user is from outside of this forum
C This user is from outside of this forum
catloaf@lemm.ee

schrieb zuletzt editiert von

#28

No, we do have that. Social media is a gold mine for analysis, both for modern sociology and for future archaeology.
1 Antwort Letzte Antwort

0
4 4am@lemm.ee

Does Bitwarden have emergency delegation now? I’d been waiting for it
S This user is from outside of this forum
S This user is from outside of this forum
skaverat@discuss.tchncs.de

schrieb zuletzt editiert von

#29

At least for the 2 years I'm using it
4 1 Antwort Letzte Antwort

0
C catloaf@lemm.ee

This is stuff that should be around for many decades

Should it? 99.99% of my email doesn't need to be around for more than a few days, let alone decades. And that number will only go up when I'm dead. Really important stuff, like ownership titles, is on file in paper here in my house and with the relevant title agency.
F This user is from outside of this forum
F This user is from outside of this forum
fubarx@lemmy.world

schrieb zuletzt editiert von

#30

A couple years ago, I would have agreed. Most of our email is junk. But nowadays, you can have an LLM digest and summarize it for you. That could also be a service the legacy system offers.
Grandkids can just ask for a free-form search term without having to wade through everything.
1 Antwort Letzte Antwort

0
B blametheantifa@lemmy.world

Thanks to capitalism, you don’t own most of that “digital legacy” and do not have the right to bequeath or transfer ownership for the vast majority of it.
T This user is from outside of this forum
T This user is from outside of this forum
tibi@lemmy.world

schrieb zuletzt editiert von

#31

You can take ownership of a lot of it. Thanks to GDPR, major platforms offer ways to export data like photos, videos, activity on their platforms, messages etc. Store locally first, avoid over reliance on online platforms for safekeeping your data.

Also, we need to fight to keep ownership of digital media while we still can. Buy movies and music on physical media so they keep making them. Buy physical books. Buy from DRM free platforms like GoG. As convenient as it may be, avoid over reliance on streaming services.

And of course, make backups of anything you care about. Only you can keep your data safe. Online services will only keep your data as long as they can exploit it to make money.
1 Antwort Letzte Antwort

2
0 01189998819991197253@infosec.pub

You said you released it on your writing. How did you go about doing that? It's a cool use case, and I'm intrigued.
P This user is from outside of this forum
P This user is from outside of this forum
plebcouncilman@sh.itjust.works

schrieb zuletzt editiert von plebcouncilman@sh.itjust.works

#32

If you’re interested in “chatting” with your writing there’s a couple of out of the box solutions right now, like Kortex or Reflect Notes. They are AI first note taking apps. I don’t use them out of privacy concerns but if you don’t care that much they might allow you to do what you want. They claim to be E2E encrypted and the AI unable to phone home but these are companies that sprung out of nowhere so I don’t trust they necessarily have done all their homework to actually provide full privacy.

Alternatively there’s an Obsidian plugin that I believe allows you to do such a thing as well with local LLMs if you wanted to which is the privacy first way to this. I’ve just moved to Obsidian from Capacities so I have yet to try it out as I’m still setting up my vault.
0 1 Antwort Letzte Antwort

2

Anmelden zum Antworten

P

A World Without iPhones?
Beobachtet Ignoriert Geplant Angeheftet Gesperrt Verschoben Technology technology
7

34 Stimmen

7 Beiträge

0 Aufrufe

S

I believe the world was a better place before smartphones started dominating everyone's attention. It has had a profound impact on how people are socializing, and not in a positive way if you ask me.
P

Stepping outside the algorithm
Beobachtet Ignoriert Geplant Angeheftet Gesperrt Verschoben Technology technology
1

1

19 Stimmen

1 Beiträge

0 Aufrufe

Niemand hat geantwortet
T

Catbox.moe got screwed 😿
Beobachtet Ignoriert Geplant Angeheftet Gesperrt Verschoben Technology technology
40

55 Stimmen

40 Beiträge

2 Aufrufe

A

I'll gladly give you a reason. I'm actually happy to articulate my stance on this, considering how much I tend to care about digital rights. Services that host files should not be held responsible for what users upload, unless: The service explicitly caters to illegal content by definition or practice (i.e. the if the website is literally titled uploadyourcsamhere[.]com then it's safe to assume they deliberately want to host illegal content) The service has a very easy mechanism to remove illegal content, either when asked, or through simple monitoring systems, but chooses not to do so (catbox does this, and quite quickly too) Because holding services responsible creates a whole host of negative effects. Here's some examples: Someone starts a CDN and some users upload CSAM. The creator of the CDN goes to jail now. Nobody ever wants to create a CDN because of the legal risk, and thus the only providers of CDNs become shady, expensive, anonymously-run services with no compliance mechanisms. You run a site that hosts images, and someone decides they want to harm you. They upload CSAM, then report the site to law enforcement. You go to jail. Anybody in the future who wants to run an image sharing site must now self-censor to try and not upset any human being that could be willing to harm them via their site. A social media site is hosting the posts and content of users. In order to be compliant and not go to jail, they must engage in extremely strict filtering, otherwise even one mistake could land them in jail. All users of the site are prohibited from posting any NSFW or even suggestive content, (including newsworthy media, such as an image of bodies in a warzone) and any violation leads to an instant ban, because any of those things could lead to a chance of actually illegal content being attached. This isn't just my opinion either. Digital rights organizations such as the Electronic Frontier Foundation have talked at length about similar policies before. To quote them: "When social media platforms adopt heavy-handed moderation policies, the unintended consequences can be hard to predict. For example, Twitter’s policies on sexual material have resulted in posts on sexual health and condoms being taken down. YouTube’s bans on violent content have resulted in journalism on the Syrian war being pulled from the site. It can be tempting to attempt to “fix” certain attitudes and behaviors online by placing increased restrictions on users’ speech, but in practice, web platforms have had more success at silencing innocent people than at making online communities healthier." Now, to address the rest of your comment, since I don't just want to focus on the beginning: I think you have to actively moderate what is uploaded Catbox does, and as previously mentioned, often at a much higher rate than other services, and at a comparable rate to many services that have millions, if not billions of dollars in annual profits that could otherwise be spent on further moderation. there has to be swifter and stricter punishment for those that do upload things that are against TOS and/or illegal. The problem isn't necessarily the speed at which people can be reported and punished, but rather that the internet is fundamentally harder to track people on than real life. It's easy for cops to sit around at a spot they know someone will be physically distributing illegal content at in real life, but digitally, even if you can see the feed of all the information passing through the service, a VPN or Tor connection will anonymize your IP address in a manner that most police departments won't be able to track, and most three-letter agencies will simply have a relatively low success rate with. There's no good solution to this problem of identifying perpetrators, which is why platforms often focus on moderation over legal enforcement actions against users so frequently. It accomplishes the goal of preventing and removing the content without having to, for example, require every single user of the internet to scan an ID (and also magically prevent people from just stealing other people's access tokens and impersonating their ID) I do agree, however, that we should probably provide larger amounts of funding, training, and resources, to divisions who's sole goal is to go after online distribution of various illegal content, primarily that which harms children, because it's certainly still an issue of there being too many reports to go through, even if many of them will still lead to dead ends. I hope that explains why making file hosting services liable for user uploaded content probably isn't the best strategy. I hate to see people with good intentions support ideas that sound good in practice, but in the end just cause more untold harms, and I hope you can understand why I believe this to be the case.
A

Korea’s POSTECH develops sub-millimeter waveguide to shrink AR glasses
Beobachtet Ignoriert Geplant Angeheftet Gesperrt Verschoben Technology technology
4

6 Stimmen

4 Beiträge

0 Aufrufe

T

Oh I agree. I just think is part of the equation perhaps the thinner and lighter will enable for better processor? Not an AR guy , although I lived my oculus until FB got hold of it. Didn't use it ever again after that day.
P

Meta plans to replace humans with AI to automate up to 90% of its privacy and integrity risk assessments, including in sensitive areas like violent content
Beobachtet Ignoriert Geplant Angeheftet Gesperrt Verschoben Technology technology
44

81 Stimmen

44 Beiträge

0 Aufrufe

L

Hear me out, Eliza. It'll be equally useless and for orders of magnitude less cost. And no one will mistakenly or fraudulently call it AI.
M

Why Japan's animation industry has embraced AI
Beobachtet Ignoriert Geplant Angeheftet Gesperrt Verschoben Technology technology
12

1

1 Stimmen

12 Beiträge

2 Aufrufe

R

The genre itself has become neutered, too. A lot of anime series have the usual "anime elements" and a couple custom ideas. And similar style, too glossy for my taste. OK, what I think is old and boring libertarian stuff, I'll still spell it out. The reason people are having such problems is because groups and businesses are de facto legally enshrined in their fields, it's almost like feudal Europe's system of privileges and treaties. At some point I thought this is good, I hope no evil god decided to fulfill my wish. There's no movement, and a faction (like Disney with Star Wars) that buys a place (a brand) can make any garbage, and people will still try to find the depth in it and justify it (that complaint has been made about Star Wars prequels, but no, they are full of garbage AND have consistent arcs, goals and ideas, which is why they revitalized the Expanded Universe for almost a decade, despite Lucas-<companies> having sort of an internal social collapse in year 2005 right after Revenge of the Sith being premiered ; I love the prequels, despite all the pretense and cringe, but their verbal parts are almost fillers, their cinematographic language and matching music are flawless, the dialogue just disrupts it all while not adding much, - I think Lucas should have been more decisive, a bit like Tartakovsky with the Clone Wars cartoon, just more serious, because non-verbal doesn't equal stupid). OK, my thought wandered away. Why were the legal means they use to keep such positions created? To make the economy nicer to the majority, to writers, to actors, to producers. Do they still fulfill that role? When keeping monopolies, even producing garbage or, lately, AI slop, - no. Do we know a solution? Not yet, because pressing for deregulation means the opponent doing a judo movement and using that energy for deregulating the way everything becomes worse. Is that solution in minimizing and rebuilding the system? I believe still yes, nothing is perfect, so everything should be easy to quickly replace, because errors and mistakes plaguing future generations will inevitably continue to be made. The laws of the 60s were simple enough for that in most countries. The current laws are not. So the general direction to be taken is still libertarian. Is this text useful? Of course not. I just think that in the feudal Europe metaphor I'd want to be a Hussite or a Cossack or at worst a Venetian trader.
A

Duolingo CEO says AI is a better teacher than humans—but schools will exist ‘because you still need childcare’
Beobachtet Ignoriert Geplant Angeheftet Gesperrt Verschoben Technology technology
19

1

1 Stimmen

19 Beiträge

2 Aufrufe

L

Where and what is texas?
F

*deleted by creator*
Beobachtet Ignoriert Geplant Angeheftet Gesperrt Verschoben Technology technology
1

1

0 Stimmen

1 Beiträge

0 Aufrufe

Niemand hat geantwortet