32 Commit (5579129b11f4acc9d0dc019cb4cb8b6533371cbb)

Autore SHA1 Messaggio Data
  JustAnotherArchivist 5579129b11 Support overriding the total fetch timeout 2 anni fa
  JustAnotherArchivist 215ac03221 Support HEAD requests 2 anni fa
  JustAnotherArchivist a7d7852c6d Fix ISO-8859-1-encoded Location header handling 3 anni fa
  JustAnotherArchivist 1678075a89 Log traceback on exceptions raised from an item 4 anni fa
  JustAnotherArchivist b1a1c03f7e Handle STOP file and high memory usage before full disk to allow stopping while the disk is above the limit 4 anni fa
  JustAnotherArchivist dd44d9b174 Adjust logging levels: log individual request failures only at WARNING and cancelled tasks at ERROR level 4 anni fa
  JustAnotherArchivist 91035d769c Catch exceptions in Item.process and mark the items as errors instead of crashing 4 anni fa
  JustAnotherArchivist 69984765b3 Fix taskType typo silencing cancellation warnings 4 anni fa
  JustAnotherArchivist c263ad0b03 Return ClientResponse object from fetch only if the retrieval was successful 4 anni fa
  JustAnotherArchivist cb0d11284e Write only successful retrievals (i.e. ones that don't cause an exception) to WARC 4 anni fa
  JustAnotherArchivist 1214409a0b Flush big responses to a temporary file instead of trying to keep everything in-memory 4 anni fa
  JustAnotherArchivist 08d96b37c5 Support deep/multiple inheritance from Item 4 anni fa
  JustAnotherArchivist 9d8de13775 Add Item.flush_subitems to flush the new subitems to the database while the item is still being processed 4 anni fa
  JustAnotherArchivist 50b936b18c Refactor QWARC class to keep relevant variables in instance attributes instead of local variables 4 anni fa
  JustAnotherArchivist c5d8d93166 Remove stray whitespace 4 anni fa
  JustAnotherArchivist 7d53577522 Add parameter for disabling SSL/TLS certificate validation 4 anni fa
  JustAnotherArchivist 50d46ad51c Use log filename in the target URI of the log resource record 4 anni fa
  JustAnotherArchivist a5dfd5c805 Write spec file + its dependencies and command line to meta WARC 4 anni fa
  JustAnotherArchivist d751844626 Fix starting another item before stopping on STOP file or memory limit exceedance 4 anni fa
  JustAnotherArchivist 2b0778f9b5 Remove leftovers from initial code rewrite 4 anni fa
  JustAnotherArchivist ab22966fef Add to log which item a message is coming from 4 anni fa
  JustAnotherArchivist 6fafd32685 Error when the retries are exceeded 4 anni fa
  JustAnotherArchivist 8647d6b396 Use f-strings instead of str.format 4 anni fa
  JustAnotherArchivist 5008e6e8cd Deduplicate items 4 anni fa
  JustAnotherArchivist 46c95e2157 Disable decoding the response content 4 anni fa
  JustAnotherArchivist ad22a2327a Support adding headers to individual requests 5 anni fa
  JustAnotherArchivist 67076f964c Add support for POST requests 5 anni fa
  JustAnotherArchivist c1574a06c9 Fix sleep task type 5 anni fa
  JustAnotherArchivist e0ca88c807 Fix reference to get_rss 5 anni fa
  JustAnotherArchivist 8a8935810d Fix references to memory and disk space check methods 5 anni fa
  JustAnotherArchivist be5673cfbf Add record deduplication within a process 5 anni fa
  JustAnotherArchivist e892a6b6a7 Initial commit 5 anni fa