Could not access smb://

Hello,

I followed the following documentation: Part 9: Fess Active Directory integration (codelibs.org)

Part 15: File server crawls that require authentication (codelibs.org)

Fess can recognize the first layer of the folders, which initially means that it can be reached.

Log: Could not access smb://..de/8_Büro/

The user stored under File Authentication has access to the directory. Tested on Windows and via CIFS on Linux.

I tested the user with @****.de and without.

Where could the problem lie?

Please check the fess-crawler.log file with the debug level enabled.

If I understand correctly it is already on debug

debug level “/usr/share/fess/app/WEB-INF/env/crawler/resources/log4j2.xml”:

<?xml version="1.0" encoding="UTF-8"?>
<Properties>
        <Property name="domain.name" value="${sys:fess.log.name:-fess}" />
        <Property name="log.level" value="${sys:fess.log.level:-debug}" />

log:
2024-07-04 00:00:03,918 [WebFsCrawler] INFO Target Path: smb://-----.-----.de/DFSBiopark
2024-07-04 00:00:03,965 [Crawler-20240704000000-1-4] INFO Crawling URL: smb://-----.-----.de/-----
2024-07-04 00:00:05,475 [Crawler-20240704000000-1-3] INFO Crawling URL: smb://-----.-----.de/-----/
2024-07-04 00:00:05,475 [Crawler-20240704000000-1-2] INFO Crawling URL: smb://-----.-----.de/-----/
2024-07-04 00:00:05,475 [Crawler-20240704000000-1-1] INFO Crawling URL: smb://-----.-----.de/-----/
2024-07-04 00:00:05,475 [Crawler-20240704000000-1-5] INFO Crawling URL: smb://-----.-----.de/-----
2024-07-04 00:00:05,627 [Crawler-20240704000000-1-2] INFO Could not access smb://-----.-----.de/-----/
2024-07-04 00:00:05,756 [Crawler-20240704000000-1-1] INFO Could not access smb://-----.-----.de/-----/
2024-07-04 00:00:05,893 [Crawler-20240704000000-1-3] INFO Could not access smb://-----.-----.de/-----/
2024-07-04 00:00:06,033 [Crawler-20240704000000-1-5] INFO Could not access smb://-----.-----.de/-----
2024-07-04 00:00:06,357 [Crawler-20240704000000-1-4] INFO Crawling URL: smb://-----.-----.de/-----/
2024-07-04 00:00:06,522 [Crawler-20240704000000-1-4] INFO Could not access smb://-----.-----.de/-----/
2024-07-04 00:00:06,628 [Crawler-20240704000000-1-2] INFO Crawling URL: smb://-----.-----.de/-----/
2024-07-04 00:00:06,756 [Crawler-20240704000000-1-1] INFO Crawling URL: smb://-----.-----.de/-----/
2024-07-04 00:00:06,797 [Crawler-20240704000000-1-2] INFO Could not access smb://-----.-----.de/-----/
2024-07-04 00:00:06,894 [Crawler-20240704000000-1-3] INFO Crawling URL: smb://-----.-----.de/-----/
2024-07-04 00:00:06,952 [Crawler-20240704000000-1-1] INFO Could not access smb://-----.-----.de/-----/
2024-07-04 00:00:07,034 [Crawler-20240704000000-1-5] INFO Crawling URL: smb://-----.-----.de/-----/
2024-07-04 00:00:07,114 [Crawler-20240704000000-1-3] INFO Could not access smb://-----.-----.de/-----/
2024-07-04 00:00:07,241 [Crawler-20240704000000-1-5] INFO Could not access smb://-----.-----.de/-----/
2024-07-04 00:00:07,523 [Crawler-20240704000000-1-4] INFO Crawling URL: smb://-----.-----.de/-----/
2024-07-04 00:00:07,617 [Crawler-20240704000000-1-4] INFO Could not access smb://-----.-----.de/-----/
2024-07-04 00:00:13,932 [IndexUpdater] INFO Processing no docs in indexing queue (Doc:{access 3ms}, Mem:{used 160.722MB, heap 400.384MB, max 524.288MB})
2024-07-04 00:00:23,929 [IndexUpdater] INFO Processing no docs in indexing queue (Doc:{access 3ms}, Mem:{used 149.778MB, heap 400.384MB, max 524.288MB})
2024-07-04 00:00:33,929 [IndexUpdater] INFO Processing no docs in indexing queue (Doc:{access 3ms}, Mem:{used 152.526MB, heap 400.384MB, max 524.288MB})
2024-07-04 00:00:38,890 [WebFsCrawler] INFO [EXEC TIME] crawling time: 35025ms
2024-07-04 00:00:43,938 [IndexUpdater] INFO Processing no docs in indexing queue (Doc:{access 11ms}, Mem:{used 153.947MB, heap 400.384MB, max 524.288MB})
2024-07-04 00:00:43,938 [IndexUpdater] INFO [EXEC TIME] index update time: 24ms
2024-07-04 00:00:43,960 [main] INFO Finished Crawler
2024-07-04 00:00:43,999 [main] INFO [CRAWL INFO] DataCrawlEndTime=2024-07-04T00:00:03.856+0200,CrawlerEndTime=2024-07-04T00:00:43.960+0200,WebFsCrawlExecTime=35025,CrawlerStatus=true,CrawlerStartTime=2024-07-04T00:00:03.827+0200,WebFsC>
2024-07-04 00:00:44,001 [main] INFO Disconnected to https://localhost:----
2024-07-04 00:00:44,003 [main] INFO Destroyed LaContainer.

The log level setting is in the default crawler settings of the scheduler.

2024-07-05 08:39:06,613 [Crawler-20240705083901-1-2] DEBUG Failed to connect to domain controller for -----

The domain controller does not seem to be accessible from Fess.