SMB path with space does not work

Hello,

file crawling with SMB path with spaces do not work… no documents are indexed.


Fess is doing a encoding with a plus sign and does not crawl the folder… :frowning:

2021-11-24 11:07:31,827 [WebFsCrawler] INFO  Target Path: smb://dcsrv01.em.pri/Firmendaten/Logistik Test/
2021-11-24 11:07:31,830 [WebFsCrawler] INFO  Included Path: smb://dcsrv01.em.pri/Firmendaten/Logistik+Test/.*
2021-11-24 11:07:41,926 [IndexUpdater] INFO  Processing no docs in indexing queue (Doc:{access 40ms}, Mem:{used 148MB, heap 415MB, max 512MB})
2021-11-24 11:07:51,898 [IndexUpdater] INFO  Processing no docs in indexing queue (Doc:{access 31ms}, Mem:{used 152MB, heap 415MB, max 512MB})
2021-11-24 11:08:01,906 [IndexUpdater] INFO  Processing no docs in indexing queue (Doc:{access 35ms}, Mem:{used 154MB, heap 415MB, max 512MB})
2021-11-24 11:08:04,036 [WebFsCrawler] INFO  [EXEC TIME] crawling time: 32651ms
2021-11-24 11:08:11,917 [IndexUpdater] INFO  Processing no docs in indexing queue (Doc:{access 41ms}, Mem:{used 155MB, heap 415MB, max 512MB})
2021-11-24 11:08:11,918 [IndexUpdater] INFO  [EXEC TIME] index update time: 176ms
2021-11-24 11:08:12,366 [main] INFO  Finished Crawler
2021-11-24 11:08:12,494 [main] INFO  [CRAWL INFO] CrawlerEndTime=2021-11-24T11:08:12.366+0100,WebFsCrawlExecTime=32651,CrawlerStatus=true,CrawlerStartTime=2021-11-24T11:07:30.928+0100,WebFsCrawlEndTime=2021-11-24T11:08:12.365+0100,WebFsIndexExecTime=176,WebFsIndexSize=0,CrawlerExecTime=41438,WebFsCrawlStartTime=2021-11-24T11:07:31.358+0100
2021-11-24 11:08:12,501 [main] INFO  Disconnected to http://localhost:9200
2021-11-24 11:08:12,510 [main] INFO  Destroyed LaContainer.

And yes, the folder exists, and they are files inside. The location is correct.

Please see this post.

Thank you a lot.

It crawls the specified location after setting:

#DISABLE_URL_ENCODE

to Included Paths for crawling with space in SMB path.