archiving community contributions on YouTube: unpublished captions, title and description translations and caption credits
You can not select more than 25 topics Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.
 
 
tech234a 2115c9ccce Fix Requests 3 years ago
.gitignore Update gitignore 3 years ago
Dockerfile Added Dockerfile written by Fusl 3 years ago
README.md Update README.md 3 years ago
config.json Allow specifying TRACKER_USERNAME in config.json 3 years ago
discovery.py Switch from Requests to http3 after 30 failures 3 years ago
export.py Switch from Requests to http3 after 30 failures 3 years ago
http3.py Workaround for Heroku 3 years ago
http3_base.py Credit source 3 years ago
requirements.txt Initial HTTP3 Support 3 years ago
switchable_request.py Fix Requests 3 years ago
tracker.py Redirect support 3 years ago
worker.py Fix loop Heroku 3 years ago

README.md

YouTube Community Contributions Archiving Worker

Worker for the Save Community Captions project: Archiving unpublished YouTube community-contributions.

Setup

To run these tools you will need to supply session cookies (SSID,HSID,SID) see the tutorial for more details.

Primary Usage

Heroku️ (Minimal Setup! Minimal Maintence!)

A wrapper repo for free and easy deployment and environment configuration, as well automatic updates every 24-27.6 hours is available. Deploy up to 5 instances of it to a free Heroku account (total max monthly runtime 550 hours) with no need for credit card verification by clicking the button below.

Deploy

Archiving Worker

After completing the above setup steps, simply run

python3 worker.py

Docker image

Stable Docker Image:

docker pull fusl/ytcc-archive

Run:

docker container run --restart=unless-stopped --network=host -d --tmpfs /grab/out --name=grab_ext-yt-communitycontribs -e HSID=XXX-e SID=XXX -e SSID=XXX -e TRACKER_USERNAME=Fusl -e PYTHONUNBUFFERED=1 fusl/ytcc-archive

Bonus Features

Export Captions and Titles/Descriptions Manually

Simply run python3 exporter.py followed by a list of space-separated YouTube video IDs, and all community-contributed captioning and titles/descriptions in all languages will be exported.

Discover Videos Manually

Simply run python3 discovery.py followed by a list of space-separated YouTube video IDs and a list of discovered video, channel and playlist IDs will be printed, as well as whether caption contributions are enabled.

Stats

See how much has been archived so far.