archiving community contributions on YouTube: unpublished captions, title and description translations and caption credits
Ви не можете вибрати більше 25 тем Теми мають розпочинатися з літери або цифри, можуть містити дефіси (-) і не повинні перевищувати 35 символів.
tech234a a4b250f002 Merge branch 'master' of github.com:Data-Horde/ytcc-archive 3 роки тому
.gitignore Update gitignore 3 роки тому
README.md Update README.md 3 роки тому
config.json Add files via upload 3 роки тому
discovery.py Reduce exceptions, limit threads 3 роки тому
export.py Get published captions, titles, descriptions 3 роки тому
requirements.txt Add files via upload 3 роки тому
tracker.py Implement initial tracker API 3 роки тому
worker.py Allow specifying cookies as environment variables, use requests session 3 роки тому

README.md

YouTube Community Contributions Archiving Worker

Export YouTube community-contributed captioning drafts to SBV files. Export YouTube community-contributed titles and descriptions to JSON. Export published caption credits to JSON.

Setup

Install the requirements in the requirements.txt file (pip install -r requirements.txt). Because the captioning editor is only available to logged-in users, you must specify the values of three session cookies for any Google account (HSID, SSID, and SID). You can get these cookie values by opening the developer tools on any youtube.com webpage, going to the “Application” (Chrome) or “Storage” (Firefox) tab, selecting “Cookies”, and copying the required values.

Usage

Export Captions

Simply run python3 ytcc-exporter.py followed by a list of space-separated YouTube video IDs, and all community-contributed captioning drafts in all languages will be exported.

Discover videos

Simply run python3 discovery.py followed by a list of space-separated YouTube video IDs and a list of discovered video, channel and playlist IDs will be printed, as well as whether caption contributions are enabled.