29 Commits (73f35f55913e2af31b8859f53bbdb724a7760831)

Author SHA1 Message Date
  JustAnotherArchivist 73f35f5591 Fix infinite loop when file ends with something that is not a WARC record 2 years ago
  JustAnotherArchivist 06d60a798c Bump read size 2 years ago
  JustAnotherArchivist 74485c399b Require decompressed WARCs with warc-tiny 2 years ago
  JustAnotherArchivist fe0b020352 Add support for reading from stdin 2 years ago
  JustAnotherArchivist 5b731fbde1 Fix compatibility with wpull 2.x 3 years ago
  JustAnotherArchivist 743e0582ba Fix confusing error message when lxml is not installed 3 years ago
  JustAnotherArchivist 491a80a04b Add warc-tiny scrape command for parsing HTTP responses using wpull and extracting links 3 years ago
  JustAnotherArchivist 01274e461a Prevent constantly moving bytes around for better performance on large chunked records 3 years ago
  JustAnotherArchivist 4c90bacaed Shield values in colons with angled brackets 3 years ago
  JustAnotherArchivist f51adccd3f Add --meta mode for dump-responses which prefixes each line with information about the file and record 3 years ago
  JustAnotherArchivist 9cc1f41917 Pass the filename in NewFile events 3 years ago
  JustAnotherArchivist a38efc31b6 Introduce a way to provide additional arguments to processors 3 years ago
  JustAnotherArchivist 49376db51b Decode HTTP request bodies 4 years ago
  JustAnotherArchivist 34c1a58034 Fix detection of multiple transfer encodings 4 years ago
  JustAnotherArchivist 5982e131a4 Stop gracefully when encountering a SIGPIPE 4 years ago
  JustAnotherArchivist c13a1150df Add support for WARC/1.1 4 years ago
  JustAnotherArchivist 376cde7b8c Fix broken block digest calculation on malformed HTTP responses 4 years ago
  JustAnotherArchivist b121cbd958 Write all log messages to stderr 4 years ago
  JustAnotherArchivist ed1270d988 Add support for upper-cased chunk lengths 4 years ago
  JustAnotherArchivist d4826abde2 Add record ID to log messages 4 years ago
  JustAnotherArchivist 552a4147c2 Fix not returning complete body for non-chunked responses 4 years ago
  JustAnotherArchivist f2e836d2e9 Add support for differently formatted digests 5 years ago
  JustAnotherArchivist 94c4f76570 Fix crash when a digest is missing from a record 5 years ago
  JustAnotherArchivist ef78a3318c Colour only the header field names but not the values 5 years ago
  JustAnotherArchivist 9ce4653094 Document colouring and usage 5 years ago
  JustAnotherArchivist e7c5d82254 Coloured WARCs?! 5 years ago
  JustAnotherArchivist 70b413f5c1 Better events: include raw WARC header data and separate HTTP requests into headers and body 5 years ago
  JustAnotherArchivist 641bc7a207 Fix infinite loop at end of WARC 5 years ago
  JustAnotherArchivist 859c75a591 Add tool for WARC verification and extraction 5 years ago