(from github.com/abolotnov)
If I want to crawl only *.example.com (both http and https) and want to exclude images, js files, css files - what should my crawler setup look like?
I tried many combinations but I either get external sites crawled or only http and not https and looks like excluded urls are overwritten by included so I managed to keep crawler to stick with the domain more or less but can’t make it ignore unwanted files.