105 コミット (master)
 

作成者 SHA1 メッセージ 日付
  JustAnotherArchivist 2e1dc59e9d Fix log level of one message 3年前
  JustAnotherArchivist f025c4e9f3 Add extensive debug logging 3年前
  JustAnotherArchivist ce7f8fdc92 Make optional arguments to fetch kwarg-only 3年前
  JustAnotherArchivist b29db245fb Configurable verbosity for log file and stderr 3年前
  JustAnotherArchivist dbe1ed71ab "Freeze" log file object before writing to WARC to ensure that further log messages aren't picked up 3年前
  JustAnotherArchivist 8ca2a6bde5 Fix exceptions on journal errors 3年前
  JustAnotherArchivist 3c8b45b3a6 Refactor cleanup code 3年前
  JustAnotherArchivist dcd5455388 Fix crash on starting a run while the DB is locked 3年前
  JustAnotherArchivist 168fa78736 Avoid locking the DB when there are no subitems to insert 3年前
  JustAnotherArchivist 4484d6c588 Add Item representation 3年前
  JustAnotherArchivist 5675118877 Rename id to id_ to avoid clash with builtin 3年前
  JustAnotherArchivist a1e693739e Replace DB locking with an async context manager 3年前
  JustAnotherArchivist cbcef2f173 Add Linux classifier 3年前
  JustAnotherArchivist 733506aed7 Remove obsolete TODO 3年前
  JustAnotherArchivist c7fac0ec3f Add WARC journalling with rollback on errors 3年前
  JustAnotherArchivist a4cf1a4225 Fix str_get_all_between yielding half-overlapping matches 3年前
  JustAnotherArchivist 15203bd991 Handle redirect traps/loops 3年前
  JustAnotherArchivist f8f5258197 Track redirect depth 3年前
  JustAnotherArchivist a3d6fb35f8 Turn response handlers into kwarg-only functions for easier extendability without breaking existing code 3年前
  JustAnotherArchivist a91cc23d47 Simplify get_software_info's signature to just the extra dependency packages 3年前
  JustAnotherArchivist 6cc4adb901 Remove stray TODO 3年前
  JustAnotherArchivist c5604ef965 Simplify header merging 3年前
  JustAnotherArchivist 59ae1183d2 Add fromResponse parameter for URL completion and automatic Referer header 3年前
  JustAnotherArchivist 2324216016 Add baseUrl and evaluate incomplete URLs relative to it 3年前
  JustAnotherArchivist b30ccf8bf8 Move response/exception history to ClientResponse.qhistory 3年前
  JustAnotherArchivist e69527c715 Add defaultResponseHandler on the Item level 3年前
  JustAnotherArchivist 03336e4988 Add item to response handler arguments (e.g. for logging) 3年前
  JustAnotherArchivist 005999fcb9 Disable aiohttp's Content-Type checking on JSON parsing by default 3年前
  JustAnotherArchivist 6bdcfe71f0 Refactor database creation and item generation: call `Item.generate()` on every qwarc run and dedupe its output, allowing the addition of further items by modifying the spec file 3年前
  JustAnotherArchivist c878241f24 Switch from concurrent.futures.CancelledError to asyncio.CancelledError 3年前
  JustAnotherArchivist 749158b97a Use the Future's result directly rather than awaiting again 3年前
  JustAnotherArchivist 5c6169ee4d Bump Python version classifiers 3年前
  JustAnotherArchivist a85e80ffa2 Configurable request timeout 3年前
  JustAnotherArchivist 429ac94689 Make it possible to override and remove headers 3年前
  JustAnotherArchivist e40be54578 Document verify_ssl parameter 3年前
  JustAnotherArchivist d3437bde19 Move default headers to qwarc.const 3年前
  JustAnotherArchivist 17fc3499ff Fix infinite loop in workaround for aiohttp issue 4630 3年前
  JustAnotherArchivist b6003af1e5 Work around aiohttp bug on parsing chunked transfer encoding responses when the buffer ends in an unfortunate spot 4年前
  JustAnotherArchivist 1678075a89 Log traceback on exceptions raised from an item 4年前
  JustAnotherArchivist 4ff8b260a1 Don't close raw data tempfiles until the response gets GC'd 4年前
  JustAnotherArchivist 4d9e4d8fe8 Fix ClientResponse._read returning more than nbytes if the entire response fits into the first block fed into the parser 4年前
  JustAnotherArchivist 2895f4bfdf Catch TypeError in Content-Length parsing 4年前
  JustAnotherArchivist 8358ba9131 Add support for only reading part of the response into memory 4年前
  JustAnotherArchivist 939978beec Handle EOF from the HTTP payload parser correctly 4年前
  JustAnotherArchivist b1a1c03f7e Handle STOP file and high memory usage before full disk to allow stopping while the disk is above the limit 4年前
  JustAnotherArchivist dd44d9b174 Adjust logging levels: log individual request failures only at WARNING and cancelled tasks at ERROR level 4年前
  JustAnotherArchivist 820384fe1e Stop deduping small responses 4年前
  JustAnotherArchivist 91035d769c Catch exceptions in Item.process and mark the items as errors instead of crashing 4年前
  JustAnotherArchivist 69984765b3 Fix taskType typo silencing cancellation warnings 4年前
  JustAnotherArchivist 461cedbbde Avoid temporary files created by warcio due to not knowing the record payload length 4年前