JustAnotherArchivist
e24790132e
Add at-tracker-sample-user-item-size
2 years ago
JustAnotherArchivist
a14939b069
Add base64url
2 years ago
JustAnotherArchivist
5c2ce7ec10
Add cdx-chunk
2 years ago
JustAnotherArchivist
fe0b020352
Add support for reading from stdin
2 years ago
JustAnotherArchivist
1010769c3c
Handle connection errors
2 years ago
JustAnotherArchivist
1acdc88c81
Add ia-upload-stream
2 years ago
JustAnotherArchivist
360c4d9371
Add youtube-extract-rapid
2 years ago
JustAnotherArchivist
d07b5a7d09
Remove debugging prints
2 years ago
JustAnotherArchivist
bf5e065a0f
Add URL/percent decoding tool
urldecode.c is entirely written by OrIdow6 except for one bug fix (char → uint8_t in the mallocs) and whitespace changes. The test suite is by JAA.
Co-authored-by: OrIdow6 <68304414+OrIdow6@users.noreply.github.com>
2 years ago
JustAnotherArchivist
11485d9404
Add infrastructure for simple C-based tools
2 years ago
JustAnotherArchivist
c50a8fd796
Fix 'Dictionary mismatch' error when very small dicts are used because the temporary file isn't written to disk before zstdcat gets executed
2 years ago
JustAnotherArchivist
5bc3d4b020
Fix crash on an empty response
This check was a leftover from the resumeKey pagination, where empty responses are supposed to be impossible. With the page pagination, they are possible.
2 years ago
JustAnotherArchivist
7f25c092d1
Catch other connection errors
2 years ago
JustAnotherArchivist
f8352809f3
Handle ConnectionResetError
2 years ago
JustAnotherArchivist
0b34268210
Catch socket.timeout, which is a separate exception class from TimeoutError before Python 3.10
2 years ago
JustAnotherArchivist
0f7a2b32a3
Log number of results on a page
2 years ago
JustAnotherArchivist
628aeb052f
Handle rate limiting
2 years ago
JustAnotherArchivist
d3ea3ce8a0
Switch from urllib to http.client to reuse connections
2 years ago
JustAnotherArchivist
8f7619ff3a
Add retries
2 years ago
JustAnotherArchivist
f98fdd5f01
Fix printing HTTP response line to stdout instead of stderr
2 years ago
JustAnotherArchivist
c9400ac46f
Fix recognition of command without optional parts
2 years ago
JustAnotherArchivist
5ca15a7c94
Add concurrency support
The proper way to do that (with asyncio) is of course aiohttp. A major drawback of the implemented approach is that running tasks can't be cancelled in case of an error. However, it works with just the standard library, and that advantage outweighs the awkward error handling for now.
2 years ago
JustAnotherArchivist
191948cf9d
Print number of modified records on requeueing
2 years ago
JustAnotherArchivist
5121524f83
Log retrieval of showNumPages
2 years ago
JustAnotherArchivist
aba7a1b0b8
Replace resumeKey pagination with page number pagination
resumeKey pagination is horribly broken. It may return incomplete results or loop infinitely.
2 years ago
JustAnotherArchivist
d57324a26c
Add --where for arbitrary conditions
2 years ago
JustAnotherArchivist
fed64387bd
Invert count/write logic
Previously, write was the actual default action, and in some forms of the command, the action value isn't actually checked against the possible values, so on a typo, it would write instead of count.
2 years ago
JustAnotherArchivist
f914b6afbe
Also reset the status_code on requeueing
2 years ago
JustAnotherArchivist
303bb69c37
Add ia-cdx-search
2 years ago
JustAnotherArchivist
0b45f7b2ba
Swap syntaxes
2 years ago
JustAnotherArchivist
b2c9ea2fa4
Refactor
2 years ago
JustAnotherArchivist
eaf53e1a44
Add alphabetseq
2 years ago
JustAnotherArchivist
c9c8b7e1f7
Add ia-wait-item-tasks
2 years ago
JustAnotherArchivist
b440b35c2f
Handle ancient /?v= URLs
2 years ago
JustAnotherArchivist
0044281b9d
Add YouTube channel listing script
2 years ago
JustAnotherArchivist
1686e04cbe
Add a timeout to prevent potentially indefinite blocking
2 years ago
JustAnotherArchivist
2fc9652ee9
Add support for other instances and full-instance listing
2 years ago
JustAnotherArchivist
b72da478b2
Fix org repo listing on new design/site structure
2 years ago
JustAnotherArchivist
ce7a069af5
Add --jsonl option
2 years ago
JustAnotherArchivist
9412f0c81c
Add azure-storage-list
2 years ago
JustAnotherArchivist
696e221fc1
Add support for password-protected folders
2 years ago
JustAnotherArchivist
158c1f1fe0
Fix usage error
2 years ago
JustAnotherArchivist
53bfe468bf
Basic error checks
2 years ago
JustAnotherArchivist
8c612082b6
Restore MD5 check as the API returns it again
Effectively partially reverts 06cf71f7
2 years ago
JustAnotherArchivist
8554c01a84
Fix gofile.io download to the new getFolder endpoint and download server structure
2 years ago
JustAnotherArchivist
a246bad957
Add support for Shorts
2 years ago
JustAnotherArchivist
6d019e63fc
Fix removenonyt performance by using simpler fixed-string patterns instead of a PCRE
2 years ago
JustAnotherArchivist
b27a428787
Fix usage notes from URLs to lines on stdin
2 years ago
JustAnotherArchivist
c4b62c2fea
Fix piping when reads return less data than expected
2 years ago
JustAnotherArchivist
dba6d1fb0e
Fix stderr printing
2 years ago