archiving community contributions on YouTube: unpublished captions, title and description translations and caption credits
tech234a dac703aafa Export retries on exception 3 years ago
.gitignore Update gitignore 3 years ago
README.md Update README.md 3 years ago
config.json Allow specifying TRACKER_USERNAME in config.json 3 years ago
discovery.py Captcha detection on video page 3 years ago
export.py Export retries on exception 3 years ago
requirements.txt Implement TRACKER_USERNAME support 3 years ago
tracker.py Export retries on exception 3 years ago
worker.py youtube-dl error logging 3 years ago

README.md

YouTube Community Contributions Archiving Worker

Export YouTube community-contributed captioning to SBV files. Export YouTube community-contributed titles and descriptions to JSON. Export published caption credits to JSON.
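
For readers unfamiliar with SBV, the sketch below shows the general layout of such a file: each cue is a `start,end` timestamp line followed by the caption text, with a blank line between cues. It is an illustration only, not the code this project uses; the function names and the `(start_ms, end_ms, text)` input shape are assumptions made for the example.

```python
def format_sbv_timestamp(milliseconds: int) -> str:
    """Render a millisecond offset as H:MM:SS.mmm, the SBV timestamp layout."""
    hours, remainder = divmod(milliseconds, 3_600_000)
    minutes, remainder = divmod(remainder, 60_000)
    seconds, millis = divmod(remainder, 1_000)
    return f"{hours}:{minutes:02d}:{seconds:02d}.{millis:03d}"


def captions_to_sbv(captions):
    """Join (start_ms, end_ms, text) tuples into SBV text.

    The tuple shape is a hypothetical input format chosen for this sketch.
    Each cue becomes a "start,end" line followed by its text, and cues
    are separated by a blank line.
    """
    cues = []
    for start_ms, end_ms, text in captions:
        header = f"{format_sbv_timestamp(start_ms)},{format_sbv_timestamp(end_ms)}"
        cues.append(f"{header}\n{text}")
    return "\n\n".join(cues) + "\n"
```

Calling `captions_to_sbv([(0, 1500, "Hello")])` would yield a single cue whose first line is `0:00:00.000,0:00:01.500`.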

Setup

Ensure that Python 3.8.5, curl, and rsync are installed on your system. Install the Python module requirements listed in requirements.txt (pip install -r requirements.txt).

Because the captioning editor is only available to logged-in users, you must supply the values of three session cookies (HSID, SSID, and SID) from any Google account. To get them, open the developer tools on any youtube.com page, go to the “Application” (Chrome) or “Storage” (Firefox) tab, select “Cookies”, and copy the required values. Set these values in config.json or as environment variables (HSID, SSID, SID).

TRACKER_USERNAME, the name shown for you on the dashboard, can also be set in config.json or as an environment variable.
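
As a rough illustration of the two configuration sources described above, the following sketch reads the four values from environment variables and falls back to config.json. It is not the worker's actual loading code: the `load_settings` helper is hypothetical, the precedence (environment first) is an assumption, and so is the idea that config.json uses the same key names as the environment variables.

```python
import json
import os

REQUIRED_COOKIES = ("HSID", "SSID", "SID")


def load_settings(config_path="config.json", environ=None):
    """Collect cookie values and TRACKER_USERNAME from env vars or a JSON file.

    Assumption for this sketch: an environment variable, when set and
    non-empty, overrides the value of the same name in the JSON file.
    """
    environ = os.environ if environ is None else environ

    file_values = {}
    if os.path.isfile(config_path):
        with open(config_path, encoding="utf-8") as config_file:
            file_values = json.load(config_file)

    settings = {}
    for key in REQUIRED_COOKIES + ("TRACKER_USERNAME",):
        value = environ.get(key) or file_values.get(key)
        if value:
            settings[key] = value

    # The three session cookies are mandatory; the username is optional here.
    missing = [key for key in REQUIRED_COOKIES if key not in settings]
    if missing:
        raise ValueError("Missing session cookie values: " + ", ".join(missing))
    return settings
```

Failing early with the list of missing cookie names makes a misconfigured deployment easier to diagnose than a later authentication error would.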

Primary Usage

Archiving Worker:

After completing the above setup steps, simply run python3 worker.py.

Heroku

A wrapper repo is available for free and easy deployment and environment configuration, as well as automatic updates every 24-27.6 hours. You can deploy up to 5 instances of it to a free Heroku account (total maximum runtime 550 hours), with no credit card verification, by clicking the button below.

Deploy

Bonus Features

Export Captions and Titles/Descriptions Manually

Run python3 export.py followed by a space-separated list of YouTube video IDs. All community-contributed captioning and titles/descriptions in all languages will be exported.
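
When scripting around the exporter, it can help to check the ID list before passing it on. The sketch below is a hypothetical pre-check, not part of this repository: `split_video_ids` is an invented helper, and the pattern assumes the commonly observed form of YouTube video IDs (11 characters drawn from letters, digits, `-`, and `_`), which is a convention rather than a documented guarantee.

```python
import re

# Assumed shape of a video ID: exactly 11 URL-safe characters.
VIDEO_ID_PATTERN = re.compile(r"[A-Za-z0-9_-]{11}")


def split_video_ids(arguments):
    """Separate arguments that look like video IDs from those that do not.

    Returns (valid, rejected). Duplicate IDs are dropped, and the order
    of first appearance is preserved.
    """
    valid, rejected = [], []
    for argument in arguments:
        if VIDEO_ID_PATTERN.fullmatch(argument):
            if argument not in valid:
                valid.append(argument)
        else:
            rejected.append(argument)
    return valid, rejected
```

For example, `split_video_ids(sys.argv[1:])` in a wrapper script would give a clean list to hand to the exporter, plus a list of arguments worth reporting back to the user.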

Discover Videos Manually

Run python3 discovery.py followed by a space-separated list of YouTube video IDs. The script prints the discovered video, channel, and playlist IDs, as well as whether caption contributions are enabled.