5 Commits (master)

Author SHA1 Message Date
  JustAnotherArchivist 256a94443e Fix deduplication within each section processing 4 years ago
  JustAnotherArchivist 98d77ecc96 Deduplicate output 4 years ago
  JustAnotherArchivist 6ce64baf87 Remove redundant url-normalise after the extraction 4 years ago
  JustAnotherArchivist 869ade27eb Separate names in stderr annotations for the various url-normalise processes 4 years ago
  JustAnotherArchivist 79f0bd4332 Normalise URLs everywhere to reduce duplicates 4 years ago
  JustAnotherArchivist 0f13a1fadd Add verbosity options, and annotate stderr on wiki-recursive-extract 4 years ago
  JustAnotherArchivist 5285c406d9 Add script for recursive website and social media discovery 4 years ago