en.osm.town is one of the many independent Mastodon servers you can use to participate in the fediverse.
An independent community of OpenStreetMap people on the Fediverse/Mastodon. Funding graciously provided by the OpenStreetMap Foundation.

#datahoarding

I was curious if the closure of The Bay meant we would lose archives about the company history.

Turns out there is a Hudson’s Bay Company Archives, which is a partnership between the Government of Manitoba and the Hudson’s Bay Company History Foundation (HBCHF). The foundation is funded separately from The Bay itself and is apparently not about to close.

They published a memo to confirm that the archives are safe:
librarianship.ca/news/hbc-arch

does someone have a file server hosted at home (or privately enough that it is YOURS) and wants to archive all Linus Tech Tips Floatplane Exclusives??? (in a way I could still download/access them if I want to)

I really need some storage space :'D and I want to archive this forever if possible

it's 154GB of 1080p30fps videos, all the Floatplane exclusives from when they started being a thing, until 20th February (I really could only afford one month so some recent vids are already missing)

(please boost for maximum reach 💛)

Just discovered ArchiveBox — FOSS, self-hosted internet archiving.

The way the web is going, with the US government redacting and outright erasing historic content, publishers segmenting content by region (and also sometimes redacting/censoring it), and CloudFlare shitting all over everything, I think it's time for me to start my #archiving and #DataHoarding journey.

#SelfHosting #SelfHosted #DataHoarder

github.com/ArchiveBox/ArchiveB

GitHub - ArchiveBox/ArchiveBox: 🗃 Open source self-hosted web archiving. Takes URLs/browser history/bookmarks/Pocket/Pinboard/etc., saves HTML, JS, PDFs, media, and more...

Since Amazon will remove the ability to download ebooks by the 28th of Feb, why not back up your Audible books too?

Resources and guides: github.com/rmcrackan/Audiobook

iOS app (Bound) to listen to them: apps.apple.com/us/app/bound-au

Audiobooks for Plex: prologue.audio/

Libation - import Audible books and remove DRM: github.com/rmcrackan/Libation

- I've set up a VNET thick jail on my FreeBSD NAS.
- The jail has its own IP address on my LAN.
- I declared a devfs ruleset to unhide /dev/tun* for the VNET jail.
- I installed WireGuard in the jail.
- I enabled WireGuard with a ProtonVPN configuration.
- I installed qbittorrent-nox and configured it to use the WireGuard interface.

I now have a home ISP-proof qBittorrent setup with which to torrent Anna's Archive.

Hopefully there is no way that my ISP can get in; otherwise I'll get legal scare letters that threaten to put me in a jail myself.

Honestly I feel like this was more straightforward to do than with LXC containers.

Amazon will remove the ability to download ebooks for Kindle at the end of the month. So if you ever close your Amazon account, you'll no longer be able to access the books you had bought.

Let's fix that

1. Bulk Exporter: github.com/treetrum/amazon-kin

2. Calibre to manage books: calibre-ebook.com/download

3. Calibre plugin to remove DRM: github.com/noDRM/DeDRM_tools/r

Source: bsky.app/profile/remysharp.com

This affects numerous sites with climate information and research, including NOAA (National Oceanic and Atmospheric Administration), NASA, and the Department of Energy.

epa.gov
noaa.gov
research.noaa.gov
ncei.noaa.gov

These are all URLs where a lot of information is still up, but they are definitely in their sights.

NOAA has a lot of well-made educational materials, videos, and podcasts also worth saving.

#USDataPurge #USPol #SafeguardingResearch #DataHoarding #Archive

uk.news.yahoo.com/trump-orders

Yahoo News · Trump orders USDA to take down websites referencing climate crisis · By Gabrielle Canon and agencies

Is there some kind of file #storage that does delta compression? I'd love to have a file #archive with

• revisions (i.e. changes to files are tracked)
• deduplication (identical files only take up space once)
• deltas (small changes to large files only take up the size of the difference)

Git would cover this, but doesn't work well with TBs of data. Backup software with delta support, like Borg or restic, can do it too, but has bad UX for tracking file revisions.
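To make the three wishes above concrete, here is a minimal Python sketch of a chunk-based, content-addressed store. It is a toy under stated assumptions, not how any particular tool works: `ChunkStore`, `put`, `get`, and `stored_bytes` are hypothetical names, everything lives in memory, and it uses fixed-size chunks, so it only saves space for in-place changes. An insertion that shifts the following bytes defeats it; content-defined chunking, as used by tools like Borg and restic, is the usual way around that.

```python
import hashlib


class ChunkStore:
    """Toy in-memory store: files are split into fixed-size chunks keyed by SHA-256."""

    def __init__(self, chunk_size=4 * 1024 * 1024):
        self.chunk_size = chunk_size
        self.chunks = {}     # digest -> chunk bytes (each unique chunk stored once)
        self.revisions = {}  # file name -> list of revisions (each a list of digests)

    def put(self, name, data):
        """Store a new revision of `name` and return its revision index."""
        digests = []
        for start in range(0, len(data), self.chunk_size):
            chunk = data[start:start + self.chunk_size]
            digest = hashlib.sha256(chunk).hexdigest()
            # Deduplication: a chunk that already exists is not stored again.
            self.chunks.setdefault(digest, chunk)
            digests.append(digest)
        self.revisions.setdefault(name, []).append(digests)
        return len(self.revisions[name]) - 1

    def get(self, name, revision=-1):
        """Reassemble a revision of `name` (latest by default)."""
        return b"".join(self.chunks[d] for d in self.revisions[name][revision])

    def stored_bytes(self):
        """Total size of all unique chunks actually kept."""
        return sum(len(chunk) for chunk in self.chunks.values())


if __name__ == "__main__":
    store = ChunkStore(chunk_size=4)        # tiny chunks just for the demo
    store.put("a.bin", b"AAAABBBBCCCC")     # three new chunks
    store.put("a.bin", b"AAAAXXXXCCCC")     # revision: only one chunk changed
    store.put("copy.bin", b"AAAABBBBCCCC")  # identical file: nothing new stored
    print(store.stored_bytes(), "bytes stored for 36 bytes of input")
```

Revisions are just lists of chunk digests, so an old version stays readable for as long as its chunks are kept, and a small in-place change to a large file costs roughly one chunk rather than a full copy.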

@jonny I'm interested in this too. I agree that climate data, and probably data pertaining to underprivileged communities, are the most at risk. I don't know where to even look for such data.

If you use Lemmy and/or Reddit, I'd recommend asking in the /c/datahoarder and /r/datahoarders communities, respectively. I'd also recommend using the #archiving and #datahoarding hashtags in the Fediverse. In the meantime, I'll boost this and see if anyone else knows.

I want to have an array of 4 drives, using ZFS through TrueNAS SCALE, and each drive should have at least 8 TB.
Which RAID(Z) level should I use, or should I go with two mirrored vdevs (I am aiming for resilience and capacity)? And what drive size would be a good compromise between price and capacity?
If everything works out, this planned NAS will primarily be an archive and storage for my media library. It will also serve as backup storage for my PC and maybe as a server for e.g. Jellyfin.

:boostRequest:

@askfedi
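For the 4-drive question above, the rough capacity arithmetic can be sketched in Python. This is a back-of-the-envelope estimate only: `pool_summary` is a hypothetical helper, it assumes equal-size drives, and it ignores ZFS metadata, padding, and the TB/TiB difference, so real usable space will be somewhat lower.

```python
def pool_summary(layout: str, drives: int, size_tb: float) -> dict:
    """Rough usable capacity and guaranteed failure tolerance for one pool."""
    if layout == "raidz1":
        # One drive's worth of parity; survives any single drive failure.
        data_drives, guaranteed = drives - 1, 1
    elif layout == "raidz2":
        # Two drives' worth of parity; survives any two drive failures.
        data_drives, guaranteed = drives - 2, 2
    elif layout == "mirrors":
        # Striped 2-way mirrors; only one failure is guaranteed survivable,
        # because a second failure in the same pair loses the pool.
        if drives % 2:
            raise ValueError("2-way mirrors need an even number of drives")
        data_drives, guaranteed = drives // 2, 1
    else:
        raise ValueError(f"unknown layout: {layout}")
    return {"usable_tb": data_drives * size_tb, "guaranteed_failures": guaranteed}


if __name__ == "__main__":
    for layout in ("raidz1", "raidz2", "mirrors"):
        print(layout, pool_summary(layout, drives=4, size_tb=8))
```

With 4 x 8 TB this gives about 24 TB for RAIDZ1 and about 16 TB for both RAIDZ2 and two mirrored vdevs. The difference between the last two is that RAIDZ2 survives any two failures, while the mirrors survive a second failure only if it hits the other pair.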