(from github.com/erbouchard)
I’m trying to figure out. My HTML pages are indexed properly but none of PDF, Word or Excel files.
Using version 12.6.
Crawling parameters
URLs
http://host/NPG/
Included URL For Crawling
http://host/NPG/.*
Excluded URLs For Crawling
.(?i).*(css|js|jpeg|jpg|gif|png|bmp|wmv|exe|mp4)
Included URLs For Indexing
http://host/NPG/.*
Excluded URLs For Indexing
(empty)
Questions
-
Is this supposed to index those documents (references in
<a href="...">...</a>
) by default? -
Or do I have to configure something?
Thanks