fosstodon.org is one of the many independent Mastodon servers you can use to participate in the fediverse.
Fosstodon is an invite only Mastodon instance that is open to those who are interested in technology; particularly free & open source software. If you wish to join, contact us for an invite.

Administered by:

Server stats:

8.7K
active users

#ArchiveBox

2 posts1 participant0 posts today
Klaus Frank<p>"Anything unsaved will be lost" - Nintendo Wii</p><p>So archive the world! And do it on your own media without DRM in a way that no 3rd party can tamper, suppress, or even know about its existance to begin with.<br><a href="https://chaos.social/tags/ArchiveBox" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>ArchiveBox</span></a> <a href="https://chaos.social/tags/Archiving" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Archiving</span></a> <a href="https://chaos.social/tags/Preservation" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Preservation</span></a></p>
Rad Web Hosting<p>How to Install and Run <a href="https://mastodon.social/tags/ArchiveBox" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>ArchiveBox</span></a> on <a href="https://mastodon.social/tags/Ubuntu" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Ubuntu</span></a> <a href="https://mastodon.social/tags/VPS" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>VPS</span></a> Server in 5 Minutes (Quick Start Guide) </p><p>This article provides a guide for how to install and run ArchiveBox on Ubuntu VPS server.<br>What is ArchiveBox?<br>ArchiveBox&nbsp;is&nbsp;a powerful, self-hosted internet archiving solution to collect, save, and view websites offline. Without active preservation effort, everything on the internet eventually ...<br>Continued 👉 <a href="https://blog.radwebhosting.com/how-to-install-and-run-archivebox-on-ubuntu-vps-server/?utm_source=mastodon&amp;utm_medium=social&amp;utm_campaign=mastodon.social" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://</span><span class="ellipsis">blog.radwebhosting.com/how-to-</span><span class="invisible">install-and-run-archivebox-on-ubuntu-vps-server/?utm_source=mastodon&amp;utm_medium=social&amp;utm_campaign=mastodon.social</span></a> <a href="https://mastodon.social/tags/selfhosted" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>selfhosted</span></a> <a href="https://mastodon.social/tags/opensource" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>opensource</span></a> <a href="https://mastodon.social/tags/vpsguide" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>vpsguide</span></a> <a href="https://mastodon.social/tags/selfhosting" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>selfhosting</span></a> <a href="https://mastodon.social/tags/installguide" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>installguide</span></a></p>
Rad Web Hosting<p>How to Install and Run <a href="https://mastodon.social/tags/ArchiveBox" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>ArchiveBox</span></a> on <a href="https://mastodon.social/tags/Ubuntu" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Ubuntu</span></a> <a href="https://mastodon.social/tags/VPS" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>VPS</span></a> Server in 5 Minutes (Quick Start Guide) </p><p>This article provides a guide for how to install and run ArchiveBox on Ubuntu VPS server.<br>What is ArchiveBox?<br>ArchiveBox&nbsp;is&nbsp;a powerful, self-hosted internet archiving solution to collect, save, and view websites offline. Without active preservation effort, everything on the internet eventually ...<br>Continued 👉 <a href="https://blog.radwebhosting.com/how-to-install-and-run-archivebox-on-ubuntu-vps-server/?utm_source=mastodon&amp;utm_medium=social&amp;utm_campaign=mastodon.social" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://</span><span class="ellipsis">blog.radwebhosting.com/how-to-</span><span class="invisible">install-and-run-archivebox-on-ubuntu-vps-server/?utm_source=mastodon&amp;utm_medium=social&amp;utm_campaign=mastodon.social</span></a> <a href="https://mastodon.social/tags/opensource" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>opensource</span></a> <a href="https://mastodon.social/tags/installguide" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>installguide</span></a> <a href="https://mastodon.social/tags/selfhosted" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>selfhosted</span></a> <a href="https://mastodon.social/tags/vpsguide" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>vpsguide</span></a> <a href="https://mastodon.social/tags/selfhosting" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>selfhosting</span></a></p>
Santiago Lema :amiga:Continuing my <a href="https://go.lema.org?t=proxmox" class="mention hashtag" rel="nofollow noopener" target="_blank">#proxmox</a> adventures I setup an instance of <a href="https://go.lema.org?t=archivebox" class="mention hashtag" rel="nofollow noopener" target="_blank">#archivebox</a>. There's a Firefox / Chrome plugin you can right click to keep a snapshot of a given site with a set number of levels to get.<br><br>The result for one entry is something like this:<br><a href="https://archive.lema.org/archive/1750662377.873811/index.html" rel="nofollow noopener" target="_blank">https://archive.lema.org/archive/1750662377.873811/index.html</a><br>
くゎ<p>んんんん……?<br>やっぱり <a href="https://fedibird.com/tags/ArchiveBox" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>ArchiveBox</span></a> のupdateがなんか暴走しとるな……<br>編集したタグがいつのまにか元に戻されるのも厄介だけど、先方のサイトに何度もリクエスト飛ぶのも大問題……原因なんやコレ……?</p>
くゎ<p><a href="https://fedibird.com/tags/ArchiveBox" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>ArchiveBox</span></a> のWeb UIもAPIもブラウザプラグインも使いづらすぎたので、Web UIを無理やり操作してどうにかするWebページを作成した<br>フロントに置いてる <a href="https://fedibird.com/tags/nginx" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>nginx</span></a> の設定でまぜまぜして同じドメイン傘下のpath違いということにして、iframe内のArchiveBoxをDOMでごにょごにょしつつ、操作時はPOSTをエミュレーションする力業<br>これでだいぶ便利になったぞー</p><p><a href="https://fedibird.com/tags/fedibird" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>fedibird</span></a></p>
Caleb 🦈<p>Working with <a href="https://mast.hpc.social/tags/archivebox" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>archivebox</span></a> and <a href="https://mast.hpc.social/tags/ytdlp" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>ytdlp</span></a> arguments has been way more frustrating than one would hope and expect. Keep getting errors for the arguments I'm using but they work fine on the command line... Yay open source tools :) <a href="https://mast.hpc.social/tags/debian" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>debian</span></a> <a href="https://mast.hpc.social/tags/linux" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>linux</span></a> <a href="https://mast.hpc.social/tags/archiveallthethings" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>archiveallthethings</span></a></p>
くゎ<p>ラズパイの空き容量が枯渇する問題、原因判明<br>dockerで動かしている <a href="https://fedibird.com/tags/ArchiveBox" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>ArchiveBox</span></a> のschedulerが暴走していて、chromiumのprofileを無限に作っていた<br>とりあえずschedulerのcontainerをstopしてコンテナを解放し、それから CHROME_USER_DATA_DIR オプションを設定して(たぶん)解決</p>
Rad Web Hosting<p>How to Install and Run <a href="https://mastodon.social/tags/ArchiveBox" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>ArchiveBox</span></a> on <a href="https://mastodon.social/tags/Ubuntu" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Ubuntu</span></a> <a href="https://mastodon.social/tags/VPS" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>VPS</span></a> Server </p><p>This article provides a guide for how to install and run ArchiveBox on Ubuntu VPS server.<br>What is ArchiveBox?<br>ArchiveBox&nbsp;is&nbsp;a powerful, self-hosted internet archiving solution to collect, save, and view websites offline. Without active preservation effort, everything on the internet eventually disappears or degrades.</p><p>In this blog post, we ...<br>Continued 👉 <a href="https://blog.radwebhosting.com/how-to-install-and-run-archivebox-on-ubuntu-vps-server/?utm_source=mastodon&amp;utm_medium=social&amp;utm_campaign=mastodon.social" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://</span><span class="ellipsis">blog.radwebhosting.com/how-to-</span><span class="invisible">install-and-run-archivebox-on-ubuntu-vps-server/?utm_source=mastodon&amp;utm_medium=social&amp;utm_campaign=mastodon.social</span></a> <a href="https://mastodon.social/tags/installguide" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>installguide</span></a> <a href="https://mastodon.social/tags/selfhosting" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>selfhosting</span></a> <a href="https://mastodon.social/tags/vpsguide" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>vpsguide</span></a> <a href="https://mastodon.social/tags/opensource" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>opensource</span></a></p>
<p>a little tool I built to fight linkrot and save our sources from the memory hole → <a href="https://sij.law/deepciter" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://</span><span class="">sij.law/deepciter</span><span class="invisible"></span></a></p><p><a href="https://earth.law/tags/digitalpreservation" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>digitalpreservation</span></a> <a href="https://earth.law/tags/selfhosting" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>selfhosting</span></a> <a href="https://earth.law/tags/archivebox" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>archivebox</span></a> <a href="https://earth.law/tags/opensource" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>opensource</span></a> <a href="https://earth.law/tags/foss" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>foss</span></a> <a href="https://earth.law/tags/textfragments" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>textfragments</span></a> <a href="https://earth.law/tags/waybackmachine" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>waybackmachine</span></a> <a href="https://earth.law/tags/linkrot" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>linkrot</span></a> <a href="https://earth.law/tags/memoryhole" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>memoryhole</span></a> <a href="https://earth.law/tags/legaltech" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>legaltech</span></a> <a href="https://earth.law/tags/permalink" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>permalink</span></a> <a href="https://earth.law/tags/deepcite" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>deepcite</span></a></p>
Rad Web Hosting<p>How to Install and Run <a href="https://mastodon.social/tags/ArchiveBox" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>ArchiveBox</span></a> on <a href="https://mastodon.social/tags/Ubuntu" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Ubuntu</span></a> <a href="https://mastodon.social/tags/VPS" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>VPS</span></a> Server </p><p>This article provides a guide for how to install and run ArchiveBox on Ubuntu VPS server.<br>What is ArchiveBox?<br>ArchiveBox&nbsp;is&nbsp;a powerful, self-hosted internet archiving solution to collect, save, and view websites offline. Without active preservation effort, everything on the internet eventually disappears or degrades.</p><p>In this blog post, we ...<br>Continued 👉 <a href="https://blog.radwebhosting.com/how-to-install-and-run-archivebox-on-ubuntu-vps-server/?utm_source=mastodon&amp;utm_medium=social&amp;utm_campaign=ReviveOldPost" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://</span><span class="ellipsis">blog.radwebhosting.com/how-to-</span><span class="invisible">install-and-run-archivebox-on-ubuntu-vps-server/?utm_source=mastodon&amp;utm_medium=social&amp;utm_campaign=ReviveOldPost</span></a> <a href="https://mastodon.social/tags/opensource" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>opensource</span></a> <a href="https://mastodon.social/tags/selfhosting" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>selfhosting</span></a> <a href="https://mastodon.social/tags/vpsguide" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>vpsguide</span></a> <a href="https://mastodon.social/tags/installguide" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>installguide</span></a></p>
ResearchBuzz: Firehose<p>ZDNet: How to set up your own article archiving service – and why I did (RIP, Pocket). “Mozilla killed Pocket, but your bookmarks don’t have to die. Here’s how to self-host ArchiveBox – with a little help from ChatGPT – and take ownership of your reading archive.”</p><p><a href="https://rbfirehose.com/2025/05/29/zdnet-how-to-set-up-your-own-article-archiving-service-and-why-i-did-rip-pocket/" class="" rel="nofollow noopener" target="_blank">https://rbfirehose.com/2025/05/29/zdnet-how-to-set-up-your-own-article-archiving-service-and-why-i-did-rip-pocket/</a></p>
veroandi<p>Hearing about ArchiveBox and hosting it on my own server was one of the best things I discovered this year.</p><p>I now have my own internet archive :)</p><p>I imported all my data from <a href="https://mastodon.social/tags/Pocket" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Pocket</span></a> and will do the same with many articles saved in my browser's favourites lists.</p><p>ArchiveBox allows you to take snapshots of a web page and save them in PDF, text and many other formats.</p><p>And on my own server, enough with relying on others.</p><p> <a href="https://archivebox.io" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://</span><span class="">archivebox.io</span><span class="invisible"></span></a></p><p><a href="https://mastodon.social/tags/selfhosting" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>selfhosting</span></a> <a href="https://mastodon.social/tags/archivebox" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>archivebox</span></a> <a href="https://mastodon.social/tags/internetarchive" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>internetarchive</span></a></p>
Tom MacWright<p>been trying to archive all outlinks from macwright.com with <a href="https://mastodon.social/tags/archivebox" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>archivebox</span></a> and results are decidedly mixed: tasks keep getting stuck in a 'pending' state with no feedback as to whether anything is working or not.</p>
Rad Web Hosting<p>How to Install and Run <a href="https://mastodon.social/tags/ArchiveBox" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>ArchiveBox</span></a> on <a href="https://mastodon.social/tags/Ubuntu" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Ubuntu</span></a> <a href="https://mastodon.social/tags/VPS" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>VPS</span></a> Server </p><p>This article provides a guide for how to install and run ArchiveBox on Ubuntu VPS server.<br>What is ArchiveBox?<br>ArchiveBox&nbsp;is&nbsp;a powerful, self-hosted internet archiving solution to collect, save, and view websites offline. Without active preservation effort, everything on the internet eventually disappears or degrades.</p><p>In this blog post, we ...<br>Continued 👉 <a href="https://blog.radwebhosting.com/how-to-install-and-run-archivebox-on-ubuntu-vps-server/?utm_source=mastodon&amp;utm_medium=social&amp;utm_campaign=ReviveOldPost" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://</span><span class="ellipsis">blog.radwebhosting.com/how-to-</span><span class="invisible">install-and-run-archivebox-on-ubuntu-vps-server/?utm_source=mastodon&amp;utm_medium=social&amp;utm_campaign=ReviveOldPost</span></a> <a href="https://mastodon.social/tags/installguide" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>installguide</span></a> <a href="https://mastodon.social/tags/opensource" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>opensource</span></a> <a href="https://mastodon.social/tags/vpsguide" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>vpsguide</span></a> <a href="https://mastodon.social/tags/selfhosting" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>selfhosting</span></a></p>
Rad Web Hosting<p>How to Install and Run <a href="https://mastodon.social/tags/ArchiveBox" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>ArchiveBox</span></a> on <a href="https://mastodon.social/tags/Ubuntu" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Ubuntu</span></a> <a href="https://mastodon.social/tags/VPS" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>VPS</span></a> Server </p><p>This article provides a guide for how to install and run ArchiveBox on Ubuntu VPS server.<br>What is ArchiveBox?<br>ArchiveBox&nbsp;is&nbsp;a powerful, self-hosted internet archiving solution to collect, save, and view websites offline. Without active preservation effort, everything on the internet eventually disappears or degrades.</p><p>In this blog post, we ...<br>Continued 👉 <a href="https://blog.radwebhosting.com/how-to-install-and-run-archivebox-on-ubuntu-vps-server/?utm_source=mastodon&amp;utm_medium=social&amp;utm_campaign=ReviveOldPost" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://</span><span class="ellipsis">blog.radwebhosting.com/how-to-</span><span class="invisible">install-and-run-archivebox-on-ubuntu-vps-server/?utm_source=mastodon&amp;utm_medium=social&amp;utm_campaign=ReviveOldPost</span></a> <a href="https://mastodon.social/tags/installguide" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>installguide</span></a> <a href="https://mastodon.social/tags/opensource" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>opensource</span></a> <a href="https://mastodon.social/tags/vpsguide" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>vpsguide</span></a> <a href="https://mastodon.social/tags/selfhosting" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>selfhosting</span></a></p>
Bastianoso<p>Kann jemand von <a href="https://mastodon.social/tags/ArchiveBox" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>ArchiveBox</span></a> berichten? Ich habe heute mal ein bisschen damit herumgespielt, mache mir aber Sorgen um den Speicherverbrauch und vor allem: braucht man das wirklich oder ist es nur ein Linkdump, den man dann irgendwann wegwirft? Wie nutzt ihr es?</p>
Rad Web Hosting<p>How to Install and Run <a href="https://mastodon.social/tags/ArchiveBox" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>ArchiveBox</span></a> on <a href="https://mastodon.social/tags/Ubuntu" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Ubuntu</span></a> <a href="https://mastodon.social/tags/VPS" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>VPS</span></a> Server </p><p>This article provides a guide for how to install and run ArchiveBox on Ubuntu VPS server.<br>What is ArchiveBox?<br>ArchiveBox&nbsp;is&nbsp;a powerful, self-hosted internet archiving solution to collect, save, and view websites offline. Without active preservation effort, everything on the internet eventually disappears or degrades.</p><p>In this blog post, we ...<br>Continued 👉 <a href="https://blog.radwebhosting.com/how-to-install-and-run-archivebox-on-ubuntu-vps-server/?utm_source=mastodon&amp;utm_medium=social&amp;utm_campaign=ReviveOldPost" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://</span><span class="ellipsis">blog.radwebhosting.com/how-to-</span><span class="invisible">install-and-run-archivebox-on-ubuntu-vps-server/?utm_source=mastodon&amp;utm_medium=social&amp;utm_campaign=ReviveOldPost</span></a> <a href="https://mastodon.social/tags/opensource" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>opensource</span></a> <a href="https://mastodon.social/tags/vpsguide" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>vpsguide</span></a> <a href="https://mastodon.social/tags/installguide" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>installguide</span></a> <a href="https://mastodon.social/tags/selfhosting" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>selfhosting</span></a></p>
Rad Web Hosting<p>How to Install and Run <a href="https://mastodon.social/tags/ArchiveBox" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>ArchiveBox</span></a> on <a href="https://mastodon.social/tags/Ubuntu" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Ubuntu</span></a> <a href="https://mastodon.social/tags/VPS" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>VPS</span></a> Server </p><p>This article provides a guide for how to install and run ArchiveBox on Ubuntu VPS server.<br>What is ArchiveBox?<br>ArchiveBox&nbsp;is&nbsp;a powerful, self-hosted internet archiving solution to collect, save, and view websites offline. Without active preservation effort, everything on the internet eventually disappears or degrades.</p><p>In this blog post, we ...<br>Continued 👉 <a href="https://blog.radwebhosting.com/how-to-install-and-run-archivebox-on-ubuntu-vps-server/?utm_source=mastodon&amp;utm_medium=social&amp;utm_campaign=ReviveOldPost" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://</span><span class="ellipsis">blog.radwebhosting.com/how-to-</span><span class="invisible">install-and-run-archivebox-on-ubuntu-vps-server/?utm_source=mastodon&amp;utm_medium=social&amp;utm_campaign=ReviveOldPost</span></a> <a href="https://mastodon.social/tags/opensource" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>opensource</span></a> <a href="https://mastodon.social/tags/installguide" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>installguide</span></a> <a href="https://mastodon.social/tags/vpsguide" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>vpsguide</span></a> <a href="https://mastodon.social/tags/selfhosting" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>selfhosting</span></a></p>
Preston Maness ☭<p>I've mirrored a relatively simple website (redsails.org; it's mostly text, some images) for posterity via <a href="https://tenforward.social/tags/wget" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>wget</span></a>. However, I also wanted to grab snapshots of any outlinks (of which there are many, as citations/references). By default, I couldn't figure out a configuration where wget would do that out of the box, without endlessly, recursively spidering the whole internet. I ended up making a kind-of poor man's <a href="https://tenforward.social/tags/ArchiveBox" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>ArchiveBox</span></a> instead:</p><p>for i in $(cat others.txt) ; do dirname=$(echo "$i" | sha256sum | cut -d' ' -f 1) ; mkdir -p $dirname ; wget --span-hosts --page-requisites --convert-links --backup-converted --adjust-extension --tries=5 --warc-file="$dirname/$dirname" --execute robots=off --wait 1 --waitretry 5 --timeout 60 -o "$dirname/wget-$dirname.log" --directory-prefix="$dirname/" $i ; done</p><p>Basically, there's a list of bookmarks^W URLs in others.txt that I grabbed from the initial mirror of the website with some <a href="https://tenforward.social/tags/grep" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>grep</span></a> foo. I want to do as good of a mirror/snapshot of each specific URL as I can, without spidering/mirroring endlessly all over. So, I hash the URL, and kick off a specific wget job for it that will span hosts, but only for the purposes of making the specific URL as usable locally/offline as possible. I know from experience that this isn't perfect. But... it'll be good enough for my purposes. I'm also stashing a WARC file. Probably a bit overkill, but I figure it might be nice to have.</p><p><a href="https://tenforward.social/tags/RedSails" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>RedSails</span></a> <a href="https://tenforward.social/tags/archive" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>archive</span></a> <a href="https://tenforward.social/tags/archival" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>archival</span></a> <a href="https://tenforward.social/tags/archiving" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>archiving</span></a> <a href="https://tenforward.social/tags/warc" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>warc</span></a></p>