@Ninjatrappeur First draft of a cli plugin to extract all external links: github.com/michitux/dokuwiki-p

Install into lib/plugins/linkextractor and run as

bin/plugin.php linkextractor

Output should be suitable for piping into archivebox, but I haven't tested this yet. It will be slow if you have a lot of pages. With some more code I could use DokuWiki's index to store external links, then extracting them would be much faster. However, as a daily cron job this might be good enough already.

My approach would be to write a plugin that extracts all links and then feed that into something like archivebox.io/. Depending on how easy it is to get the URL to the archived version, one could also write a plugin that adds a link to the archived version to every link. As I am interested in something like this, too, and I know DokuWiki quite well, I might try to write a quick draft if that approach would be suitable for you.


Fosstodon is an English speaking Mastodon instance that is open to anyone who is interested in technology; particularly free & open source software.