217 Commits (788b25707d9aad1f76da1922f4a48927ce205875)
 

Author SHA1 Message Date
  JustAnotherArchivist 788b25707d Handle more domains and case variations 3 years ago
  JustAnotherArchivist 36aa2e8259 Add archivebot-log-extract-ignores 3 years ago
  JustAnotherArchivist 5b731fbde1 Fix compatibility with wpull 2.x 3 years ago
  JustAnotherArchivist 743e0582ba Fix confusing error message when lxml is not installed 3 years ago
  JustAnotherArchivist 491a80a04b Add warc-tiny scrape command for parsing HTTP responses using wpull and extracting links 3 years ago
  JustAnotherArchivist fd2728f1b8 Add archivebot-irccloud-paste 3 years ago
  JustAnotherArchivist 4eff3c3eb3 Refactor, strip query/fragment 3 years ago
  JustAnotherArchivist 821cacf626 Add --help 3 years ago
  JustAnotherArchivist caffebab2e Add parent-urls 3 years ago
  JustAnotherArchivist 77ec76bc04 Add --urls and --nodl options 3 years ago
  JustAnotherArchivist 06cf71f73d Fix gofile.io download: getServer is not used by the website anymore, and getUpload no longer returns the MD5 3 years ago
  JustAnotherArchivist bff1490871 Add github-list-repos 3 years ago
  JustAnotherArchivist bf695d63a3 Fix channel URLs 3 years ago
  JustAnotherArchivist dde4464555 Cover two more rare URLs 3 years ago
  JustAnotherArchivist bbf2d2c315 Be more lenient regarding slashes to catch things with collapsed URLs in paths etc. 3 years ago
  JustAnotherArchivist 362f66eb26 Handle youtube-nocookie.com and fix removenonyt mode not recognising CC domains 3 years ago
  JustAnotherArchivist 81e2b4b999 Refine patterns 3 years ago
  JustAnotherArchivist 9974d4613c Stop trying to rewrite patterns for percent encoding 3 years ago
  JustAnotherArchivist 0ee83bc0f2 Refactor 3 years ago
  JustAnotherArchivist b66260ca94 Add youtube-extract 3 years ago
  JustAnotherArchivist d82dff8b71 Add ETA column 3 years ago
  JustAnotherArchivist 01274e461a Prevent constantly moving bytes around for better performance on large chunked records 3 years ago
  JustAnotherArchivist 77d9f61de0 Colourise output 3 years ago
  JustAnotherArchivist 6512669cfd Refactor and compare file list as well 3 years ago
  JustAnotherArchivist 8e0cb30d0a Add atdash mode 3 years ago
  JustAnotherArchivist 5fe595d71c Record wrapper script in meta WARC as well 3 years ago
  JustAnotherArchivist c1def0e7a8 Fix S3_WITH_LIST_URLS being defined (but empty) when --with-list-urls is not used 3 years ago
  JustAnotherArchivist 398cbfdcda Add s3-bucket-list-qwarc, rewritten s3-bucket-list on top of qwarc 3 years ago
  JustAnotherArchivist 80084e0d35 Another alternative and performance/memory comparison 3 years ago
  JustAnotherArchivist 6a288a6338 Use grep instead, which is faster but uses more memory 3 years ago
  JustAnotherArchivist 4d274e64e0 Add dedupe 3 years ago
  JustAnotherArchivist a4af8e6ca6 Add IE6 UA 3 years ago
  JustAnotherArchivist ac277437a3 Add Googlebot UA 3 years ago
  JustAnotherArchivist 0181e53f01 Treat NXDOMAIN and no A/AAAA record errors as ok 3 years ago
  JustAnotherArchivist 41c2a9d2d4 Add support for alternative xmlns 3 years ago
  JustAnotherArchivist 830e9dbc43 Treat redirects as successful retrievals 3 years ago
  JustAnotherArchivist 7a999c9b0a Ignore redirects 3 years ago
  JustAnotherArchivist 579d589853 Add a script to extract errors from wpull 2.x logs 3 years ago
  JustAnotherArchivist d60948e90f Verbosity 3 years ago
  JustAnotherArchivist a9a4792854 Fix server validation 3 years ago
  JustAnotherArchivist 57e2e26d80 Support multi-file uploads 3 years ago
  JustAnotherArchivist 02c967f608 Add gofile.io download script 3 years ago
  JustAnotherArchivist a83d28d08e Add WARC/1.1 support 3 years ago
  JustAnotherArchivist ba2f7db380 Merge warc-peek repository into little-things 3 years ago
  JustAnotherArchivist 79fc113467 Merge kill-wpull-connections repository into little-things 3 years ago
  JustAnotherArchivist b4bb9babac Switch to HTTPS 3 years ago
  JustAnotherArchivist 9f3c7b3ca8 Support negative filter values for date columns as relative to the current datetime 3 years ago
  JustAnotherArchivist c7151efc3e Add script for checking whether a file on transfer.notkiska.pw was archived correctly with AB 3 years ago
  JustAnotherArchivist 4c90bacaed Shield values in colons with angled brackets 3 years ago
  JustAnotherArchivist f51adccd3f Add --meta mode for dump-responses which prefixes each line with information about the file and record 3 years ago