313 Commits (142a5a9c493ceed477bfc1625f718e2c47198d38)
 

Author SHA1 Message Date
  JustAnotherArchivist 142a5a9c49 Get rid of asyncio 2 years ago
  JustAnotherArchivist b6663ae731 Add concurrency 2 years ago
  JustAnotherArchivist 22f2e68356 Add JSONL output option for S3 listing 2 years ago
  JustAnotherArchivist bfebe9a2a5 Fix only sending partial file contents on retries 2 years ago
  JustAnotherArchivist 39b3b7793a Add support for IA_CONFIG_FILE environment variable 2 years ago
  JustAnotherArchivist 7ed2906dd2 Add progress bar 2 years ago
  JustAnotherArchivist 58f0f0f8d0 Fix being unable to resume an upload that crashed in the first part 2 years ago
  JustAnotherArchivist 74485c399b Require decompressed WARCs with warc-tiny 2 years ago
  JustAnotherArchivist e24790132e Add at-tracker-sample-user-item-size 2 years ago
  JustAnotherArchivist a14939b069 Add base64url 2 years ago
  JustAnotherArchivist 5c2ce7ec10 Add cdx-chunk 2 years ago
  JustAnotherArchivist fe0b020352 Add support for reading from stdin 2 years ago
  JustAnotherArchivist 1010769c3c Handle connection errors 2 years ago
  JustAnotherArchivist 1acdc88c81 Add ia-upload-stream 2 years ago
  JustAnotherArchivist 360c4d9371 Add youtube-extract-rapid 2 years ago
  JustAnotherArchivist d07b5a7d09 Remove debugging prints 2 years ago
  JustAnotherArchivist bf5e065a0f Add URL/percent decoding tool 2 years ago
  JustAnotherArchivist 11485d9404 Add infrastructure for simple C-based tools 2 years ago
  JustAnotherArchivist c50a8fd796 Fix 'Dictionary mismatch' error when very small dicts are used because the temporary file isn't written to disk before zstdcat gets executed 2 years ago
  JustAnotherArchivist 5bc3d4b020 Fix crash on an empty response 2 years ago
  JustAnotherArchivist 7f25c092d1 Catch other connection errors 2 years ago
  JustAnotherArchivist f8352809f3 Handle ConnectionResetError 2 years ago
  JustAnotherArchivist 0b34268210 Catch socket.timeout, which is a separate exception class from TimeoutError before Python 3.10 2 years ago
  JustAnotherArchivist 0f7a2b32a3 Log number of results on a page 2 years ago
  JustAnotherArchivist 628aeb052f Handle rate limiting 2 years ago
  JustAnotherArchivist d3ea3ce8a0 Switch from urllib to http.client to reuse connections 2 years ago
  JustAnotherArchivist 8f7619ff3a Add retries 2 years ago
  JustAnotherArchivist f98fdd5f01 Fix printing HTTP response line to stdout instead of stderr 2 years ago
  JustAnotherArchivist c9400ac46f Fix recognition of command without optional parts 2 years ago
  JustAnotherArchivist 5ca15a7c94 Add concurrency support 2 years ago
  JustAnotherArchivist 191948cf9d Print number of modified records on requeueing 2 years ago
  JustAnotherArchivist 5121524f83 Log retrieval of showNumPages 2 years ago
  JustAnotherArchivist aba7a1b0b8 Replace resumeKey pagination with page number pagination 2 years ago
  JustAnotherArchivist d57324a26c Add --where for arbitrary conditions 2 years ago
  JustAnotherArchivist fed64387bd Invert count/write logic 2 years ago
  JustAnotherArchivist f914b6afbe Also reset the status_code on requeueing 2 years ago
  JustAnotherArchivist 303bb69c37 Add ia-cdx-search 2 years ago
  JustAnotherArchivist 0b45f7b2ba Swap syntaxes 2 years ago
  JustAnotherArchivist b2c9ea2fa4 Refactor 2 years ago
  JustAnotherArchivist eaf53e1a44 Add alphabetseq 2 years ago
  JustAnotherArchivist c9c8b7e1f7 Add ia-wait-item-tasks 2 years ago
  JustAnotherArchivist b440b35c2f Handle ancient /?v= URLs 2 years ago
  JustAnotherArchivist 0044281b9d Add YouTube channel listing script 2 years ago
  JustAnotherArchivist 1686e04cbe Add a timeout to prevent potentially indefinite blocking 2 years ago
  JustAnotherArchivist 2fc9652ee9 Add support for other instances and full-instance listing 2 years ago
  JustAnotherArchivist b72da478b2 Fix org repo listing on new design/site structure 2 years ago
  JustAnotherArchivist ce7a069af5 Add --jsonl option 2 years ago
  JustAnotherArchivist 9412f0c81c Add azure-storage-list 2 years ago
  JustAnotherArchivist 696e221fc1 Add support for password-protected folders 2 years ago
  JustAnotherArchivist 158c1f1fe0 Fix usage error 2 years ago