31 коммитов (master)

Автор SHA1 Сообщение Дата
  JustAnotherArchivist 4ff212eb20 Fix empty files being considered valid WARCs 10 месяцев назад
  JustAnotherArchivist 828dae2597 Raise an error when verification fails 10 месяцев назад
  JustAnotherArchivist 73f35f5591 Fix infinite loop when file ends with something that is not a WARC record 2 лет назад
  JustAnotherArchivist 06d60a798c Bump read size 2 лет назад
  JustAnotherArchivist 74485c399b Require decompressed WARCs with warc-tiny 2 лет назад
  JustAnotherArchivist fe0b020352 Add support for reading from stdin 2 лет назад
  JustAnotherArchivist 5b731fbde1 Fix compatibility with wpull 2.x 3 лет назад
  JustAnotherArchivist 743e0582ba Fix confusing error message when lxml is not installed 3 лет назад
  JustAnotherArchivist 491a80a04b Add warc-tiny scrape command for parsing HTTP responses using wpull and extracting links 3 лет назад
  JustAnotherArchivist 01274e461a Prevent constantly moving bytes around for better performance on large chunked records 3 лет назад
  JustAnotherArchivist 4c90bacaed Shield values in colons with angled brackets 3 лет назад
  JustAnotherArchivist f51adccd3f Add --meta mode for dump-responses which prefixes each line with information about the file and record 3 лет назад
  JustAnotherArchivist 9cc1f41917 Pass the filename in NewFile events 3 лет назад
  JustAnotherArchivist a38efc31b6 Introduce a way to provide additional arguments to processors 3 лет назад
  JustAnotherArchivist 49376db51b Decode HTTP request bodies 4 лет назад
  JustAnotherArchivist 34c1a58034 Fix detection of multiple transfer encodings 4 лет назад
  JustAnotherArchivist 5982e131a4 Stop gracefully when encountering a SIGPIPE 4 лет назад
  JustAnotherArchivist c13a1150df Add support for WARC/1.1 4 лет назад
  JustAnotherArchivist 376cde7b8c Fix broken block digest calculation on malformed HTTP responses 4 лет назад
  JustAnotherArchivist b121cbd958 Write all log messages to stderr 4 лет назад
  JustAnotherArchivist ed1270d988 Add support for upper-cased chunk lengths 4 лет назад
  JustAnotherArchivist d4826abde2 Add record ID to log messages 4 лет назад
  JustAnotherArchivist 552a4147c2 Fix not returning complete body for non-chunked responses 4 лет назад
  JustAnotherArchivist f2e836d2e9 Add support for differently formatted digests 5 лет назад
  JustAnotherArchivist 94c4f76570 Fix crash when a digest is missing from a record 5 лет назад
  JustAnotherArchivist ef78a3318c Colour only the header field names but not the values 5 лет назад
  JustAnotherArchivist 9ce4653094 Document colouring and usage 5 лет назад
  JustAnotherArchivist e7c5d82254 Coloured WARCs?! 5 лет назад
  JustAnotherArchivist 70b413f5c1 Better events: include raw WARC header data and separate HTTP requests into headers and body 5 лет назад
  JustAnotherArchivist 641bc7a207 Fix infinite loop at end of WARC 5 лет назад
  JustAnotherArchivist 859c75a591 Add tool for WARC verification and extraction 5 лет назад