![]() ![]() py - sheet SHEET the name of the google sheets document - header HEADER 1 - based index for the header row - check - if - exists when possible checks if the URL has been archived before and does not archive the same URL twice - save - logs creates or appends execution logs to files logs / LEVEL. Here is the current result from running the python auto_archive.py -help: python auto_archive.py -help In addition to screenshots you will need to install Docker.Ĭonfiguration is done via a config.yaml file (see ) and some properties of that file can be overwritten via command line arguments. If you would like to take archival WACZ snapshots using browsertrix-crawler.Internet Archive credentials can be retrieved from.fonts-noto to deal with multiple unicode characters during selenium/geckodriver's screenshots: sudo apt install fonts-noto -y.firefox and geckodriver on a path folder like /usr/local/bin.ffmpeg must also be installed locally for this tool to work.Credentials for this account should be stored in service_account.json, in the same directory as the script. A Google Service account is necessary for use with gspread.If you are using pipenv (recommended), pipenv install is sufficient to install Python prerequisites. ![]() It can be run manually or on an automated basis. The Google Sheets where the links come from is updated with information about the archived content. Uses different archivers depending on the platform, and can save content to local storage, S3 bucket (Digital Ocean Spaces, AWS. Python script to automatically archive social media posts, videos, and images from a Google Sheets document. Read the article about Auto Archiver on. ![]()
0 Comments
Leave a Reply. |