Does lemmy have any communities dedicated to archiving/hoarding data?

  • Gerowen@lemmy.world
    link
    fedilink
    English
    arrow-up
    0
    ·
    edit-2
    3 months ago

    Neither are that bad honestly. I have jigdo scripts I run with every point release of Debian and have a copy of English Wikipedia on a Kiwix mirror I also host. Wikipedia is a tad over 100 GB. The source, arm64 and amd64 complete repos (DVD images) for Debian Trixie, including the network installer and a couple live boot images, are 353 GB.

    Kiwix has copies of a LOT of stuff, including Wikipedia on their website. You can view their zim files with a desktop application or host your own web version. Their website is: https://kiwix.org/

    If you want (or if Wikipedia is censored for you) you can also look at my mirror to see what a web hosted version looks like: https://kiwix.marcusadams.me/

    Note: I use Anubis to help block scrapers. You should have no issues as a human other than you may see a little anime girl for a second on first load, but every once and a while Brave has a disagreement with her and a page won’t load correctly. I’ve only seen it in Brave, and only rarely, but I’ve seen it once or twice so thought I’d mention it.

    • Fmstrat@lemmy.world
      link
      fedilink
      arrow-up
      0
      ·
      edit-2
      3 months ago

      120GB not including Wikimedia 😉

      Also, I wish they included OSM maps, not just the wiki.

    • Gigasser@lemmy.world
      link
      fedilink
      arrow-up
      0
      ·
      3 months ago

      I wonder if there’s anyways to edit these files afterwards? They tend to be read only, right? I must confess, I don’t have too much experience with this myself.

      • Prathas@lemmy.zip
        link
        fedilink
        arrow-up
        0
        ·
        3 months ago

        It’s probably hundreds of thousands of HTML files, no? What is the fear about being able to edit or not?

          • Prathas@lemmy.zip
            link
            fedilink
            arrow-up
            0
            ·
            3 months ago

            Okay, I’m unfamiliar with both. Well, I still don’t understand why read-only state matters; are you concerned about tampering?

            • Gigasser@lemmy.world
              link
              fedilink
              arrow-up
              0
              ·
              2 months ago

              Well I think it would be cool to be able to fix/edit any inaccurate articles, or pages that may have been messed with by trolls, or to update with more up to date info.

    • mistermodal@lemmy.mlBanned
      link
      fedilink
      arrow-up
      0
      ·
      3 months ago

      Yeah also if you make a Zim wiki or convert a website into Zim then you can run that stuff too. If you use Emacs it’s easy to convert some pages to wikitext for Zim too

    • clif@lemmy.world
      link
      fedilink
      arrow-up
      0
      ·
      edit-2
      3 months ago

      Last time I updated it was closer to 120GB but if you’re not sweating 100 GB then an extra 20 isn’t going to bother anyone these days.

      Also, thanks for reminding me that I need to check my dates and update.

      EDIT: you can also easily configure a SBC like a Raspberry Pi (or any of the clones) that will boot, set the Wi-Fi to access point mode, and serve kiwix as a website that anyone (on the local AP wifi network) can connect to and query… And it’ll run off a USB battery pack. I have one kicking around the house somewhere

      • techwithjake@sh.itjust.works
        link
        fedilink
        arrow-up
        0
        ·
        3 months ago

        Just built one of those using Dietpi as the OS and NVME M.2 for the storage. I have many different ZIMs and running different services and only using about 270GB.

        Works great for offline use. Probably should add an ISO or 2 as well.

        • clif@lemmy.world
          link
          fedilink
          arrow-up
          0
          ·
          edit-2
          3 months ago

          What other services are you running?

          @fmstrat@lemmy.world asked what else I was running in a sibling comment to yours and I didn’t have an answer because I’m not… yet : )

          • techwithjake@sh.itjust.works
            link
            fedilink
            arrow-up
            0
            ·
            3 months ago

            DietPi makes it dead simple to run most of these things as their “software suite” is pretty robust and simple to setup.

            For “user facing” applications:

            • Homer Dashboard as the landing page when going to the .local address in a browser
            • Kiwix for the ZIMs
            • Hedgedoc for personal note taking/wiki
            • Lychee photos for a very lightweight photo album maker/viewer for keepsake photos.

            For “admin side” stuff:

            • Portainer to manage the containers/stacks
            • Watchtower to auto-update the containers while they’re still network connected
            • Transmission daemonized to download and seed the ZIMs or anything else non-pirate related
            • Use jojo2357’s ZIM updater to auto-update ZIMs via cron job while they’re still network connected
            • DietPi-Dashboard as an all-in-one dashboard to monitor and control the RPi from a web interface. (Yeah I know I can do everything SSH’ing in but I’m lazy.)
            • File Browser just in case I want other people to have access to files but since it’s in maintenance mode and I’m unsure I want others to have access, might strip it out

            I try to use containers from LinuxServer.io whenever possible. Mostly just cause it’s what I do on my main server.

            I’m still looking at adding/removing things as I get more time to sit down but I’m pretty happy with it’s current state.

      • Fmstrat@lemmy.world
        link
        fedilink
        arrow-up
        0
        ·
        edit-2
        3 months ago

        Do you recommend adding anything else to it?

        For instance, OSM maps?

        I’ve been thinking about running the Kiwix app + OSMAnd on an old Android phone and auto updating it once a year.

        • clif@lemmy.world
          link
          fedilink
          arrow-up
          0
          ·
          3 months ago

          That’s a good question (and good idea) that I hadn’t really thought about past a collection of ZIMs. The one I built advertises it’s own AP SSID that anyone can connect to and then access the ZIMs that are served via kiwix-serve on HTTP/80. That is, I wanted a single, low power, headless device that multiple people could use simultaneously via wifi and browser rather than a personal device.

          I hadn’t really thought about other helpful services past that. I mean, we’ve got a (wee) server so why not use it? I like the idea of OSM and their website is open source but has a lot of dependencies :

          openstreetmap-website is a Ruby on Rails application that uses PostgreSQL as its database, and has a large number of dependencies for installation

          A fully-functional openstreetmap-website installation depends on other services, including map tile servers and geocoding services, that are provided by other software. The default installation uses publicly-available services to help with development and testing.

          I wonder how hard it would be to host everything it needs locally/offline… and what that would do to power consumption : )

          Thanks for the idea - something to look into, for sure.

            • clif@lemmy.world
              link
              fedilink
              arrow-up
              0
              ·
              3 months ago

              Yeah, I feel the same in that it’s assuredly doable, but how hard is it?

              If you’re able to dig into and make some progress, please tag me because I’m interested but don’t have much time these days.

  • Pumpkin Escobar@lemmy.world
    link
    fedilink
    English
    arrow-up
    0
    ·
    3 months ago

    I stumbled across this sort of fascinating area of doomsday prepping a few weeks back.

    https://prepperpress.com/usb/

    A nice addition to that, don’t just make it a USB, but a raspberry pi. So you’d have a reasonably low-powered computer you could easily take with you.

    Not suggesting this one as it seems a bit expensive to me, but https://www.prepperdisk.com/products/prepper-disk-premium-over-512gb-of-survival-content?view=sl-8978CA41

    • techwithjake@sh.itjust.works
      link
      fedilink
      arrow-up
      0
      ·
      edit-2
      3 months ago

      Just built one of these myself. I went NVME M.2 instead of SD Card to avoid data corruption. I know SD Cards are fine if you don’t write to them a lot but if you wanna update or add your own stuff, scares me. Plus NVME is just so much faster.

        • techwithjake@sh.itjust.works
          link
          fedilink
          arrow-up
          0
          ·
          3 months ago

          Pretty much what Sinthesis said; USB power brick and/or solar panels. Both at the ready and tested. Also got a big ass battery backup that will charge off solar panels.

        • Bob Robertson IX @discuss.tchncs.de
          link
          fedilink
          English
          arrow-up
          0
          ·
          3 months ago

          You find a generator, or solar panels, or wind mill, or water turbine, or a bicycle hooked up to a generator.

          If electricity permanently goes out then we’re in a scavenger situation and it is time to start taking apart things that are no longer necessary to build the things that are.

  • Dr. Bluefall@toast.ooo
    link
    fedilink
    English
    arrow-up
    0
    ·
    3 months ago

    The English Language Wikipedia probably wouldn’t be hard, or Debian Stable.

    All of Debian’s packages might be a tad more expensive, though.

  • Retro_unlimited@lemmy.world
    link
    fedilink
    arrow-up
    0
    ·
    3 months ago

    I also recommend downloading “Flashpoint archive” to have flash games and animations to stay entertained.

    There is a 4gb version and a 2.3TB version.

  • Maroon@lemmy.world
    link
    fedilink
    arrow-up
    0
    ·
    3 months ago

    I thought the whole point of torrenting was to decentralise distribution. I use torrents to get my distros.

    In my own little bubble, I thought that’s how most people got their distro.

    • prole@lemmy.blahaj.zone
      link
      fedilink
      arrow-up
      0
      ·
      edit-2
      3 months ago

      What happens when they just cut the underwater cables? Torrent over carrier pigeon for a linux distro would take ages

      • hayvan@feddit.nl
        link
        fedilink
        arrow-up
        0
        ·
        3 months ago

        Sneakernet to the rescue. Some of you are too young to know about walking around with boxes full of disks.

        • oppy1984@lemdro.id
          link
          fedilink
          English
          arrow-up
          0
          ·
          3 months ago

          It was trading CD-R’s during my high school days… good times. Napster was just starting to take off by the time we had a CD-R trading network set up, Napster just increased the amount of CD’s that got passed around.

        • Zink@programming.dev
          link
          fedilink
          arrow-up
          0
          ·
          3 months ago

          A wise man once said

          Never underestimate the bandwidth of a station wagon full of tapes hurtling down the highway.

      • DarkAri@lemmy.blahaj.zoneBanned
        link
        fedilink
        arrow-up
        0
        ·
        3 months ago

        A good way to see what the future of places like the U.S are is to look at places like North Korea, where they do exactly this, move files around on flash media to avoid the state censors.

      • Sestren@lemmy.world
        link
        fedilink
        arrow-up
        0
        ·
        3 months ago

        Pigeon latency is horrible, but the bandwidth is pretty great. You could probably load up an adult pigeon with at least 12TB of media.

      • stinerman@midwest.social
        link
        fedilink
        English
        arrow-up
        0
        ·
        3 months ago

        Torrents are often used for installers, but for packages it tends to be more trouble than what it’s worth. Is creating a torrent for a 4k library worth it?

  • Dagamant@lemmy.world
    link
    fedilink
    arrow-up
    0
    ·
    3 months ago

    I would add in some rom collections and book repositories as well. The whole library of Nintendo games is under a gig and would go a long way for entertaining people.