348 Commits (34a3c9d0f3d790b7ebecb4e21610093d50de053a)
 

Author SHA1 Message Date
  JustAnotherArchivist 9ce4653094 Document colouring and usage 5 years ago
  JustAnotherArchivist e7c5d82254 Coloured WARCs?! 5 years ago
  JustAnotherArchivist 70b413f5c1 Better events: include raw WARC header data and separate HTTP requests into headers and body 5 years ago
  JustAnotherArchivist 641bc7a207 Fix infinite loop at end of WARC 5 years ago
  JustAnotherArchivist a700e8e2fe Add tcp-closer command 5 years ago
  JustAnotherArchivist 859c75a591 Add tool for WARC verification and extraction 5 years ago
  JustAnotherArchivist e867a2327f Replace urlencoded @ symbol 5 years ago
  JustAnotherArchivist cbd952024b Workaround for hash no longer needed with current transfer.sh code 5 years ago
  JustAnotherArchivist 61431c2054 Add VK scraping helper 5 years ago
  JustAnotherArchivist d6ff566c4d Instagram always uses lower-case usernames 5 years ago
  JustAnotherArchivist 138c2a2d39 Get rid of post-processing now that snscrape (dev version) has clean URLs 5 years ago
  JustAnotherArchivist 27b0d2da75 Better username capitalisation extraction method 5 years ago
  JustAnotherArchivist 3aa828a0ac transfer.kiska.pw -> transfer.notkiska.pw 5 years ago
  JustAnotherArchivist 63f4a8b3d3 transfer.sh -> transfer.kiska.pw 5 years ago
  JustAnotherArchivist 0168d50f62 Automatically fix capitalisation of Facebook and Twitter usernames 5 years ago
  JustAnotherArchivist db0104b3c8 Get correct capitalisation for a Facebook username 5 years ago
  JustAnotherArchivist 4a1a9a10e0 Allow overriding the "remote filename" 5 years ago
  JustAnotherArchivist 769f95808e Add ix.io upload script 5 years ago
  JustAnotherArchivist c79721337b +x 5 years ago
  JustAnotherArchivist c30dcf5985 Finding outdated Mastodon instances 5 years ago
  JustAnotherArchivist 1748a6b607 Better workaround for the 5000 results limit; works for FoolFuuka 2.0.1 and up 5 years ago
  JustAnotherArchivist fd680551df Add Bing, Reddit/Pushshift, and FoolFuuka scrapers 5 years ago
  JustAnotherArchivist ede77ad142 Filter Twitter hashtag scrapes based on account scrapes 5 years ago
  JustAnotherArchivist 57ef544c6c Fix line endings 5 years ago
  JustAnotherArchivist 07c3e7baaa Add snscrape helpers 5 years ago
  JustAnotherArchivist b7e3a703d8 Monitor how a pipeline's wget processes are faring 5 years ago
  JustAnotherArchivist 168f61b39a Quote filename so it works with any weird characters in the paths 5 years ago
  JustAnotherArchivist 8f77c8c72a xargs -r flag to not run the second find if the first produces no results (GNU extension) 5 years ago
  JustAnotherArchivist 9d7a4096f9 Pipe into second find directly 5 years ago
  JustAnotherArchivist e3a4bf6a47 Replace slow lsof with procfs access 5 years ago
  JustAnotherArchivist 4a83a54616 Print host for each stuck request 5 years ago
  JustAnotherArchivist 2b2c65f034 Print PID 5 years ago
  JustAnotherArchivist fadb70e297 Fixed version which handles multiple roots correctly 5 years ago
  JustAnotherArchivist d10a1d3675 First set of little things 5 years ago
  JustAnotherArchivist a00607f28e Initial commit 5 years ago
  JustAnotherArchivist 2a41f169c5 Add -c option to cast the return value of shutdown(2) to int explicitly on broken machines 6 years ago
  JustAnotherArchivist 8ffb48fb1b Remove set -e/errexit, which causes the script to silently fail when no process is found with -j 6 years ago
  JustAnotherArchivist 632fbcb4d0 Replace kill with ps in process existence check 6 years ago
  JustAnotherArchivist 4f3cfc6e56 Add check for ptrace scope 6 years ago
  JustAnotherArchivist 96a329578e Refactor 6 years ago
  JustAnotherArchivist 1e7ec4a56e Executable bit 6 years ago
  JustAnotherArchivist 73877ecb96 Initial commit 6 years ago
  JustAnotherArchivist 10715f1d3a Rewrite GDB command to stop on the first error, e.g. if lsof is broken. 6 years ago
  JustAnotherArchivist 103640a311 Make kill-wpull-connections executable 6 years ago
  JustAnotherArchivist f7dc46991c Check whether lsof and gdb are available 6 years ago
  JustAnotherArchivist 64e815b9a5 Better way of finding the PID for ArchiveBot jobs 6 years ago
  JustAnotherArchivist 290a4bf518 Filter out the script from the PID list when using -j 6 years ago
  JustAnotherArchivist 2787d9cd51 Initial commit 6 years ago