archiving community contributions on YouTube: unpublished captions, title and description translations and caption credits
You can not select more than 25 topics Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.
 
 
tech234a edba494c75 Exception fix 3 jaren geleden
.gitignore Update gitignore 3 jaren geleden
README.md Update README.md 3 jaren geleden
config.json Add files via upload 3 jaren geleden
discovery.py Reduce exceptions, limit threads 3 jaren geleden
export.py Fix duplicate files 3 jaren geleden
requirements.txt Implement TRACKER_USERNAME support 3 jaren geleden
tracker.py Implement TRACKER_USERNAME support 3 jaren geleden
worker.py Exception fix 3 jaren geleden

README.md

YouTube Community Contributions Archiving Worker

Export YouTube community-contributed captioning drafts to SBV files. Export YouTube community-contributed titles and descriptions to JSON. Export published caption credits to JSON.

Setup

Install the requirements in the requirements.txt file (pip install -r requirements.txt). Because the captioning editor is only available to logged-in users, you must specify the values of three session cookies for any Google account (HSID, SSID, and SID). You can get these cookie values by opening the developer tools on any youtube.com webpage, going to the “Application” (Chrome) or “Storage” (Firefox) tab, selecting “Cookies”, and copying the required values.

Usage

Export Captions

Simply run python3 ytcc-exporter.py followed by a list of space-separated YouTube video IDs, and all community-contributed captioning drafts in all languages will be exported.

Discover videos

Simply run python3 discovery.py followed by a list of space-separated YouTube video IDs and a list of discovered video, channel and playlist IDs will be printed, as well as whether caption contributions are enabled.