browsertrix-crawler/.gitignore
Tessa Walsh e02058f001 Add ad blocking via request interception (#173)
* Ad blocking via request interception, extending the block rules system and adding new AdBlockRules
* Load list of hosts to block from https://raw.githubusercontent.com/StevenBlack/hosts/master/hosts, added as JSON on image build
* Enabled via --blockAds, with a custom message set via --adBlockMessage
* New test to check for ad blocking
* Add test-crawls dir to .gitignore and .dockerignore
2022-11-15 18:30:27 -08:00

*.pyc
__pycache__
*.egg-info/
collections/
node_modules/
crawls/
test-crawls/