this post was submitted on 13 Dec 2023
484 points (100.0% liked)

196

16459 readers
2470 users here now

Be sure to follow the rule before you head out.

Rule: You must post before you leave.

^other^ ^rules^

founded 1 year ago
MODERATORS
 
all 43 comments
sorted by: hot top controversial new old
[–] morrowind@lemmy.ml 68 points 11 months ago (2 children)

I hope he has indexed everything he saved and can search it efficiently

[–] Bonsoir@lemmy.ca 106 points 11 months ago (1 children)

It says he's a saver, not a retriever!

[–] Viking_Hippie@lemmy.world 19 points 11 months ago

Hell, he's not even golden!

[–] Honytawk@lemmy.zip 18 points 11 months ago (1 children)

Those things are lost in the great mines of Morediskspaceia

[–] ichmagrum@feddit.de 1 points 11 months ago

Better hope none of it is encrypted, he's definitely going to forget that password.

[–] don@lemm.ee 51 points 11 months ago (2 children)

It’s the meticulous savers you should worry about. The savers smart enough to automate what they save, and fastidious enough to know every sector of what they’ve saved. Those savers may/may not save the whole galaxy.

[–] Honytawk@lemmy.zip 23 points 11 months ago

Save the cheerleader, save the world.

ctrl+s

[–] TacoNissan@lemmy.zip 9 points 11 months ago* (last edited 11 months ago) (1 children)

I save lots of 2000s kid's shows, for when my future kids grow up. No telling when they'll become lost media. I use filebot to automatically rename the files to TVDB standards, and so far I've collected 8tb. Do I have a problem?

[–] don@lemm.ee 7 points 11 months ago

Do I have a problem?

As long as you can afford to maintain your repository, no.

[–] Waluigis_Talking_Buttplug@lemmy.world 51 points 11 months ago (1 children)

I'm gonna gift that man a really strong magnet

[–] ProgrammingSocks@pawb.social 10 points 11 months ago (1 children)

Too late. It's all SSDs already. They're more robust long term.

[–] vsh@lemm.ee 4 points 11 months ago (1 children)

Blud made RAID ∞ out of SSDs

[–] deur@feddit.nl 3 points 11 months ago (1 children)

What the fuck even is this comment.

[–] festnt@sh.itjust.works 1 points 1 month ago

removed i guess

[–] Microplasticbrain@lemm.ee 34 points 11 months ago (1 children)

Im in this picture and I dont like it. Saved for later.

[–] Rekonok@sh.itjust.works 12 points 11 months ago

Same

I saved your comment too

[–] atocci@kbin.social 29 points 11 months ago (1 children)
[–] Ultragramps@lemmy.blahaj.zone 12 points 11 months ago (1 children)

The only one who has the lost episodes of You Can’t Do That On Television.

[–] mindbleach@sh.itjust.works 2 points 11 months ago

And Adventures In Wonderland, somehow.

[–] ZILtoid1991@kbin.social 17 points 11 months ago (1 children)

Where's the large towers of 3 layer M-Disk Blu-rays?

[–] ichmagrum@feddit.de 1 points 11 months ago

In the room with all the old PCs they ever owned.

[–] jsh@sh.itjust.works 17 points 11 months ago

The data hoooorder

[–] Belgdore@lemm.ee 17 points 11 months ago

People like this are the reason we will have records of this period of history in a thousand years.

[–] problematicPanther@lemmy.world 14 points 11 months ago (1 children)

is that the archive.org guy?

[–] fluxion@lemmy.world 9 points 11 months ago* (last edited 11 months ago) (1 children)

No archive.org guy uses his powers for good. This guy is an internet hoarder.

[–] shikogo@pawb.social 5 points 11 months ago

This guy has solved the entire lost media wiki and is keeping it all to himself.

[–] bumblebeebeard@reddthat.com 11 points 11 months ago

hello yes, I've archived this.

[–] LoamImprovement@beehaw.org 8 points 11 months ago (4 children)

Hold up, does someone know how to save an entire site? I would really like to get the 5e wikidot archived in case Hasbro or whoever wants to shut it down for good.

[–] Kolanaki@yiffit.net 7 points 11 months ago* (last edited 11 months ago)

Probably a browser extension these days. I had one back in the late 90's or early 2000's that would simply download the page you were on, as well as every page, image, audio file, etc. on every recursive link on that page.

This was back when most websites had a table of contents link somewhere, though. There are plenty of sites now that don't link to every page contained on the domain and are only accessible if you manually enter the URL or use dynamically created pages that only exist upon request.

[–] anton@lemmy.blahaj.zone 5 points 11 months ago

It won't save everything, but if a script follows every link recursively, most content should be reached that way. That's kind of what Google does but for one site instead of the internet.

If there is a search function try very simple queries.

The alternative of brute forcing links would be unfeasible, even if you are not rate limited by the site, due to the exponential complexity.

If you want to do something please look into api/scraping etikette like exponential back off.

[–] jherazob@beehaw.org 5 points 11 months ago

There's software that browses to the homepage of a site and starts traversing it all, saving it all in the process

[–] zzz@feddit.de 3 points 11 months ago (1 children)

Link? And where can I upload a PDF* of the site to share with you? tmpfiles.org’s short duration probably won’t cut it…

*Although I’m certain The Saver™️ would only do full webarchive zips, for us casuals, the PDF export shall do (and be easier in day to day use)

[–] LoamImprovement@beehaw.org 2 points 11 months ago

http://dnd5e.wikidot.com/

Honestly it's not the information so much as the way it's organized that I'd like to save. It is the best resource for putting together characters, currently.

[–] yessikg@lemmy.blahaj.zone 8 points 11 months ago

It's me, but I don't have a beard

[–] mindbleach@sh.itjust.works 7 points 11 months ago (1 children)

The internet is forever, whether it wants to be or not.

[–] ichmagrum@feddit.de 2 points 11 months ago

It's actually pretty selective. Recently tried reading an old webcomic, lots of dead links and the various web archive pages were very incomplete. I'm sure SOMEONE has it saved somewhere, but it doesn't look like they made it easily available to the general public.

[–] averyminya@beehaw.org 5 points 11 months ago

Somebody saaaaaave meeee

i have a deleted webpage in ecosia app

[–] vsh@lemm.ee 4 points 11 months ago

r/datahoarder folks can relate

[–] Vonneks@lemmings.world 2 points 10 months ago

Trazyn the infinite type of fella

[–] Cheskaz@beehaw.org 1 points 11 months ago

I will die thinking about how I didn't save Globvids Plague Doctors video.