JustAnotherArchivist
acd2fab899
Add warc-dump-responses
il y a 1 an
JustAnotherArchivist
512ced5ebd
Make test script optional
il y a 1 an
JustAnotherArchivist
67b12f645f
Fix exit statuses of ia-upload-stream and ia-wait-item-tasks
il y a 1 an
JustAnotherArchivist
6a76814ec5
Add crude in-progress upload listing
il y a 1 an
JustAnotherArchivist
34a3c9d0f3
Use _type instead of key check hack
il y a 1 an
JustAnotherArchivist
ec20f38c82
Handle nested playlists
il y a 1 an
JustAnotherArchivist
8386d33323
Add wpull2-log-colourise
il y a 1 an
JustAnotherArchivist
a4e05d8932
Fix TypeError
il y a 1 an
JustAnotherArchivist
0435954e65
Print net queue size
il y a 1 an
JustAnotherArchivist
9f31ba8828
Add archivebot-fix-queue-counters
il y a 1 an
JustAnotherArchivist
8d267c7f46
Add bencode2json
il y a 1 an
JustAnotherArchivist
98adc6cfac
Exclude backslashes in channel patterns
il y a 1 an
JustAnotherArchivist
a07c2b2374
Fix handling of invalid UTF-8 input
il y a 1 an
JustAnotherArchivist
725db7d05d
Fix confusing output for skipped lines
il y a 1 an
JustAnotherArchivist
3fca23c0a0
Fix pagination on users
il y a 1 an
JustAnotherArchivist
c2f6f5054c
Handle actual 429
il y a 1 an
JustAnotherArchivist
ccf4d678fb
Allow negative offsets to peek near the end of the file
il y a 1 an
JustAnotherArchivist
4798154e98
Fix URLs without a path
il y a 2 ans
JustAnotherArchivist
1830d67283
Add ia-cdx-search-subdomains
il y a 2 ans
JustAnotherArchivist
565be7bf1b
Fix
il y a 2 ans
JustAnotherArchivist
e2085e6c81
Add cloudflare-email-decode
il y a 2 ans
JustAnotherArchivist
73f35f5591
Fix infinite loop when file ends with something that is not a WARC record
il y a 2 ans
JustAnotherArchivist
06d60a798c
Bump read size
il y a 2 ans
JustAnotherArchivist
3e0b70be6b
Handle processes with too many open connections
il y a 2 ans
JustAnotherArchivist
df7b25c2db
Error on unknown options
il y a 2 ans
JustAnotherArchivist
4bd4f5a30c
Fix 'Argument list too long' error when using --urls-from-stdin with many URLs
il y a 2 ans
JustAnotherArchivist
e20d35a553
Fix crash on 429
il y a 2 ans
JustAnotherArchivist
cef61434a0
Add --urls-from-stdin
il y a 2 ans
JustAnotherArchivist
b5cf04947b
Add Wasabi
il y a 2 ans
JustAnotherArchivist
d2afd1309d
Add s3-bucket-find-direct-url
il y a 2 ans
JustAnotherArchivist
95988466ec
Make S3 response pattern matching more flexible (so it also works on Scaleway)
il y a 2 ans
JustAnotherArchivist
a9a03d3a00
Add urlsort
il y a 2 ans
JustAnotherArchivist
9798cc1188
Typo
il y a 2 ans
JustAnotherArchivist
d193637e5e
Add kill-connections
il y a 2 ans
JustAnotherArchivist
6cfe8e51ba
Make job a global variable in --pyfilter expressions so it can be used in genexps
il y a 2 ans
JustAnotherArchivist
a4627fa1c6
Queue derives with `ia tasks` instead of this manual curl rubbish
il y a 2 ans
JustAnotherArchivist
c68b310afc
Always print the parts value if there is an upload ID
Previously, parts wouldn't be printed if it was an empty list. This made resuming uploads that crashed in the first part harder than necessary.
il y a 2 ans
JustAnotherArchivist
fdc3c3d69e
Support float values for --partsize with M or G suffix
il y a 2 ans
JustAnotherArchivist
002c1eb7ae
Wait until item exists
IA doesn't immediately create the item on CreateMultipartUpload, so if it didn't already exist, UploadPart would fail for a while and we'd waste bandwidth.
il y a 2 ans
JustAnotherArchivist
142a5a9c49
Get rid of asyncio
No point in using it when it only delegates to a ThreadPoolExecutor anyway.
il y a 2 ans
JustAnotherArchivist
b6663ae731
Add concurrency
il y a 2 ans
JustAnotherArchivist
22f2e68356
Add JSONL output option for S3 listing
il y a 2 ans
JustAnotherArchivist
bfebe9a2a5
Fix only sending partial file contents on retries
il y a 2 ans
JustAnotherArchivist
39b3b7793a
Add support for IA_CONFIG_FILE environment variable
il y a 2 ans
JustAnotherArchivist
7ed2906dd2
Add progress bar
il y a 2 ans
JustAnotherArchivist
58f0f0f8d0
Fix being unable to resume an upload that crashed in the first part
il y a 2 ans
JustAnotherArchivist
74485c399b
Require decompressed WARCs with warc-tiny
il y a 2 ans
JustAnotherArchivist
e24790132e
Add at-tracker-sample-user-item-size
il y a 2 ans
JustAnotherArchivist
a14939b069
Add base64url
il y a 2 ans
JustAnotherArchivist
5c2ce7ec10
Add cdx-chunk
il y a 2 ans