Sync metadata from a feed to archive.org
Find a file
2025-12-10 10:55:21 +01:00
.gitignore initial commit 2025-12-10 10:55:21 +01:00
conf.example.toml initial commit 2025-12-10 10:55:21 +01:00
feed2exec.ini initial commit 2025-12-10 10:55:21 +01:00
metadata.py initial commit 2025-12-10 10:55:21 +01:00
README.md initial commit 2025-12-10 10:55:21 +01:00

Updates metadata on archive.org based on what is displayed in a feed.

Concept

Here's the idea. Let's assume you're a content creator. You regularly upload your content on archive.org, then you embed the archive.org files into your blog. Which means you'd need to input the same metadata, which is so boring.

Using this tool, you can just put dumb metadata on archive.org, making your upload process much faster. The metadata will then be copied back from your blog to archive.org.

Installation

Dependencies

This tool relies on the excellent feed2exec.

You also need to install some dependencies:

apt install feed2exec python3-internetarchive python3-extruct python3-html2text python3-markdown python3-requests python3-requests-cache

Configuration

The first thing you need to configure is your feed2exec.ini: change the name and the url.

Then, you need to configure your archive.org accounts:

cp conf.example.toml conf.toml

edit conf.toml and insert your accounts credentials.

Run

env PYTHONPATH=. feed2exec --database $PWD/feed2exec.db --config feed2exec.ini fetch

This will do a single run. If you want this to run periodically, just use crond, or systemd timers, or whatever you please.