|
|
1 dag geleden | |
|---|---|---|
| .gitignore | 4 jaren geleden | |
| .make-and-exec | 10 maanden geleden | |
| .urldecode-test | 3 jaren geleden | |
| .warc-dump-responses-test | 10 maanden geleden | |
| .youtube-extract-rapid-test | 3 jaren geleden | |
| LICENSE | 7 jaren geleden | |
| README.md | 7 jaren geleden | |
| alphabetseq | 1 jaar geleden | |
| archivebot-blogspot | 10 maanden geleden | |
| archivebot-compress-db | 10 maanden geleden | |
| archivebot-db-edit | 3 maanden geleden | |
| archivebot-fix-queue-counters | 1 jaar geleden | |
| archivebot-high-resources | 10 maanden geleden | |
| archivebot-irccloud-paste | 10 maanden geleden | |
| archivebot-jobid-calculation | 7 jaren geleden | |
| archivebot-jobs | 2 weken geleden | |
| archivebot-list-stuck-requests | 10 maanden geleden | |
| archivebot-log-extract-ignores | 10 maanden geleden | |
| archivebot-monitor-job-queue | 10 maanden geleden | |
| archivebot-pipelines-count-jobs | 1 dag geleden | |
| archivebot-requeue-refused | 10 maanden geleden | |
| archivebot-tmp-log-check-and-delete | 5 maanden geleden | |
| archivebot-youtube | 10 maanden geleden | |
| at-tracker-sample-user-item-size | 10 maanden geleden | |
| azure-storage-list | 4 jaren geleden | |
| b64grep | 10 maanden geleden | |
| base64url | 10 maanden geleden | |
| bencode2json | 3 jaren geleden | |
| bing-scrape | 10 maanden geleden | |
| bugzilla-url-list | 10 maanden geleden | |
| cdx-chunk | 10 maanden geleden | |
| cidr-merge | 1 jaar geleden | |
| cloudflare-email-decode | 4 jaren geleden | |
| combine-by-prefix | 10 maanden geleden | |
| curl-ab | 10 maanden geleden | |
| curl-ia | 5 maanden geleden | |
| curl-irc | 2 weken geleden | |
| curl-ua | 6 maanden geleden | |
| deb-repo-urls | 10 maanden geleden | |
| dedupe | 10 maanden geleden | |
| dir-to-ia | 1 maand geleden | |
| europarl-meps-collect | 10 maanden geleden | |
| extract-urls-for-archiveteam-projects | 10 maanden geleden | |
| foolfuuka-search | 10 maanden geleden | |
| format-size | 10 maanden geleden | |
| fos-ftp-upload | 10 maanden geleden | |
| get-crx4chrome-urls | 10 maanden geleden | |
| gitea-list-repos | 6 maanden geleden | |
| github-list-repos | 6 maanden geleden | |
| gitlab-list-repos | 6 maanden geleden | |
| gofile.io-dl | 10 maanden geleden | |
| grab-site-pack | 6 maanden geleden | |
| html-extract-stupid | 10 maanden geleden | |
| http-response-bodies | 3 jaren geleden | |
| http-response-bodies.c | 2 jaren geleden | |
| ia-cdx-search | 1 maand geleden | |
| ia-cdx-search-subdomains | 10 maanden geleden | |
| ia-derive | 1 maand geleden | |
| ia-download | 1 maand geleden | |
| ia-files-xml-to-jsonl | 5 jaren geleden | |
| ia-metadata | 1 maand geleden | |
| ia-s3-auth | 1 maand geleden | |
| ia-task-log | 1 maand geleden | |
| ia-tasks | 1 maand geleden | |
| ia-upload-progress | 10 maanden geleden | |
| ia-upload-stream | 1 maand geleden | |
| ia-verify-file | 1 maand geleden | |
| ia-wait-item-tasks | 1 maand geleden | |
| iasha1check | 3 maanden geleden | |
| ix.io-upload | 10 maanden geleden | |
| kill-connections | 10 maanden geleden | |
| kill-wpull-connections | 10 maanden geleden | |
| killcx-all-https | 10 maanden geleden | |
| mastodon-enumerate-users | 7 jaren geleden | |
| mastodon-outdated | 10 maanden geleden | |
| moinmoin-url-list | 7 maanden geleden | |
| parent-urls | 10 maanden geleden | |
| pipelines-launch-in-tmux-windows | 10 maanden geleden | |
| pipelines-monitor-tmux-wget-outcomes | 10 maanden geleden | |
| pipelines-stop-gracefully | 10 maanden geleden | |
| reddit-pushshift-search | 10 maanden geleden | |
| run-every-five-minutes | 10 maanden geleden | |
| s3-bucket-find-direct-url | 2 jaren geleden | |
| s3-bucket-list | 1 jaar geleden | |
| s3-bucket-list-qwarc | 10 maanden geleden | |
| sample | 10 maanden geleden | |
| snscrape-extract | 10 maanden geleden | |
| snscrape-facebook-user | 10 maanden geleden | |
| snscrape-instagram-user | 10 maanden geleden | |
| snscrape-prepare-commands | 10 maanden geleden | |
| snscrape-tmux | 10 maanden geleden | |
| snscrape-twitter-filter | 10 maanden geleden | |
| snscrape-twitter-hashtag | 10 maanden geleden | |
| snscrape-twitter-user | 10 maanden geleden | |
| snscrape-upload | 10 maanden geleden | |
| snscrape-vk-user | 10 maanden geleden | |
| snscrape-wiki-transfer-merge | 10 maanden geleden | |
| social-media-extract-profile-link | 10 maanden geleden | |
| sum-sizes | 2 jaren geleden | |
| tar-many-files-progress | 10 maanden geleden | |
| tcp-closer | 7 jaren geleden | |
| torrent-tiny | 3 jaren geleden | |
| transfer.archivete.am-upload | 5 maanden geleden | |
| transfer.notkiska.pw-check-ia | 10 maanden geleden | |
| uniqify | 10 maanden geleden | |
| uniqify-recent | 10 maanden geleden | |
| url-normalise | 10 maanden geleden | |
| url-prefix-count | 10 maanden geleden | |
| urldecode | 4 jaren geleden | |
| urldecode.c | 3 jaren geleden | |
| urlsort | 10 maanden geleden | |
| warc-dump-responses | 3 jaren geleden | |
| warc-dump-responses.c | 2 jaren geleden | |
| warc-get-rubbish-offset | 6 maanden geleden | |
| warc-peek | 3 jaren geleden | |
| warc-size | 10 maanden geleden | |
| warc-tiny | 5 maanden geleden | |
| website-extract-social-media | 10 maanden geleden | |
| wget-spider-estimate-size | 10 maanden geleden | |
| wiki-list-to-main | 7 jaren geleden | |
| wiki-recursive-extract-normalise | 10 maanden geleden | |
| wiki-sections-sort | 10 maanden geleden | |
| wiki-website-extract-social-media | 10 maanden geleden | |
| wpull1-parallel-progress-monitor | 10 maanden geleden | |
| wpull1-progress-monitor | 10 maanden geleden | |
| wpull2-children | 10 maanden geleden | |
| wpull2-db-edit | 8 maanden geleden | |
| wpull2-extract-ignored | 10 maanden geleden | |
| wpull2-extract-remaining | 10 maanden geleden | |
| wpull2-html-scrape | 9 maanden geleden | |
| wpull2-log-colourise | 10 maanden geleden | |
| wpull2-log-extract-errors | 10 maanden geleden | |
| wpull2-requeue | 1 jaar geleden | |
| wpull2-url-origin | 10 maanden geleden | |
| youtube-channel-list.py | 3 jaren geleden | |
| youtube-extract | 3 jaren geleden | |
| youtube-extract-rapid | 4 jaren geleden | |
| youtube-extract-rapid.c | 4 jaren geleden | |
| youtube-filter-autogen-channels | 10 maanden geleden | |
| zstdwarccat | 1 jaar geleden | |
Over the past few years, I’ve written and accumulated a number of useful little things to help with archival-related tasks. This repository collects them. I hope someone finds some of them useful.
This program is free software: you can redistribute it and/or modify it under the terms of the GNU General Public License as published by the Free Software Foundation, either version 3 of the License, or (at your option) any later version.
This program is distributed in the hope that it will be useful, but WITHOUT ANY WARRANTY; without even the implied warranty of MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU General Public License for more details.
You should have received a copy of the GNU General Public License along with this program. If not, see https://www.gnu.org/licenses/.