295 Commits (c50a8fd796a26b8c69076bce2e1758b0a0c70b0f)
 

Auteur SHA1 Bericht Datum
  JustAnotherArchivist c50a8fd796 Fix 'Dictionary mismatch' error when very small dicts are used because the temporary file isn't written to disk before zstdcat gets executed 2 jaren geleden
  JustAnotherArchivist 5bc3d4b020 Fix crash on an empty response 2 jaren geleden
  JustAnotherArchivist 7f25c092d1 Catch other connection errors 2 jaren geleden
  JustAnotherArchivist f8352809f3 Handle ConnectionResetError 2 jaren geleden
  JustAnotherArchivist 0b34268210 Catch socket.timeout, which is a separate exception class from TimeoutError before Python 3.10 2 jaren geleden
  JustAnotherArchivist 0f7a2b32a3 Log number of results on a page 2 jaren geleden
  JustAnotherArchivist 628aeb052f Handle rate limiting 2 jaren geleden
  JustAnotherArchivist d3ea3ce8a0 Switch from urllib to http.client to reuse connections 2 jaren geleden
  JustAnotherArchivist 8f7619ff3a Add retries 2 jaren geleden
  JustAnotherArchivist f98fdd5f01 Fix printing HTTP response line to stdout instead of stderr 2 jaren geleden
  JustAnotherArchivist c9400ac46f Fix recognition of command without optional parts 2 jaren geleden
  JustAnotherArchivist 5ca15a7c94 Add concurrency support 2 jaren geleden
  JustAnotherArchivist 191948cf9d Print number of modified records on requeueing 2 jaren geleden
  JustAnotherArchivist 5121524f83 Log retrieval of showNumPages 2 jaren geleden
  JustAnotherArchivist aba7a1b0b8 Replace resumeKey pagination with page number pagination 2 jaren geleden
  JustAnotherArchivist d57324a26c Add --where for arbitrary conditions 2 jaren geleden
  JustAnotherArchivist fed64387bd Invert count/write logic 2 jaren geleden
  JustAnotherArchivist f914b6afbe Also reset the status_code on requeueing 2 jaren geleden
  JustAnotherArchivist 303bb69c37 Add ia-cdx-search 2 jaren geleden
  JustAnotherArchivist 0b45f7b2ba Swap syntaxes 2 jaren geleden
  JustAnotherArchivist b2c9ea2fa4 Refactor 2 jaren geleden
  JustAnotherArchivist eaf53e1a44 Add alphabetseq 2 jaren geleden
  JustAnotherArchivist c9c8b7e1f7 Add ia-wait-item-tasks 2 jaren geleden
  JustAnotherArchivist b440b35c2f Handle ancient /?v= URLs 2 jaren geleden
  JustAnotherArchivist 0044281b9d Add YouTube channel listing script 2 jaren geleden
  JustAnotherArchivist 1686e04cbe Add a timeout to prevent potentially indefinite blocking 2 jaren geleden
  JustAnotherArchivist 2fc9652ee9 Add support for other instances and full-instance listing 2 jaren geleden
  JustAnotherArchivist b72da478b2 Fix org repo listing on new design/site structure 2 jaren geleden
  JustAnotherArchivist ce7a069af5 Add --jsonl option 2 jaren geleden
  JustAnotherArchivist 9412f0c81c Add azure-storage-list 2 jaren geleden
  JustAnotherArchivist 696e221fc1 Add support for password-protected folders 2 jaren geleden
  JustAnotherArchivist 158c1f1fe0 Fix usage error 2 jaren geleden
  JustAnotherArchivist 53bfe468bf Basic error checks 2 jaren geleden
  JustAnotherArchivist 8c612082b6 Restore MD5 check as the API returns it again 2 jaren geleden
  JustAnotherArchivist 8554c01a84 Fix gofile.io download to the new getFolder endpoint and download server structure 2 jaren geleden
  JustAnotherArchivist a246bad957 Add support for Shorts 2 jaren geleden
  JustAnotherArchivist 6d019e63fc Fix removenonyt performance by using simpler fixed-string patterns instead of a PCRE 2 jaren geleden
  JustAnotherArchivist b27a428787 Fix usage notes from URLs to lines on stdin 2 jaren geleden
  JustAnotherArchivist c4b62c2fea Fix piping when reads return less data than expected 2 jaren geleden
  JustAnotherArchivist dba6d1fb0e Fix stderr printing 2 jaren geleden
  JustAnotherArchivist 6e5a019d9e Always decode stdin with surrogateescape to avoid breaking on binary input 2 jaren geleden
  JustAnotherArchivist e48fb9d1b6 Tighten patterns for user and custom channel URLs so they can handle HTML input more easily 2 jaren geleden
  JustAnotherArchivist 9cbc3f7968 Extract playlist and channel IDs from watch URLs 2 jaren geleden
  JustAnotherArchivist 80bf010433 Percent-decode each line only once 2 jaren geleden
  JustAnotherArchivist f1fcfabafa Add support for reading warc.zst from stdin 2 jaren geleden
  JustAnotherArchivist d5f646f995 Add zstdwarccat 2 jaren geleden
  JustAnotherArchivist 4415c8d5dd Add support for img.youtube.com (old thumbnails) 2 jaren geleden
  JustAnotherArchivist 50a0fcc7b0 Fix performance regression due to 479c2684 2 jaren geleden
  JustAnotherArchivist 479c268441 Fix whitespace handling 2 jaren geleden
  JustAnotherArchivist 56f21d1fc0 Add aggressive video ID v parameter extraction 2 jaren geleden