5 Commits (34a3c9d0f3d790b7ebecb4e21610093d50de053a)

Author SHA1 Message Date
  JustAnotherArchivist 256a94443e Fix deduplication within each section processing 4 years ago
  JustAnotherArchivist 98d77ecc96 Deduplicate output 4 years ago
  JustAnotherArchivist 6ce64baf87 Remove redundant url-normalise after the extraction 4 years ago
  JustAnotherArchivist 869ade27eb Separate names in stderr annotations for the various url-normalise processes 4 years ago
  JustAnotherArchivist 79f0bd4332 Normalise URLs everywhere to reduce duplicates 4 years ago
  JustAnotherArchivist 0f13a1fadd Add verbosity options, and annotate stderr on wiki-recursive-extract 4 years ago
  JustAnotherArchivist 5285c406d9 Add script for recursive website and social media discovery 4 years ago