Confluence crawling

Hello,

Confluence crawling process is limited to crawling individual pages, and is not crawling entire spaces.

Can you help me in this topic ?

Thanks,

Did you check fess.log?

I have reviewed the fess.log and fess-crawler.log files, but I didn’t come across any errors except for this message after some time of processing:

2023-03-10 04:16:57,442 [NMvHxoYBQzW1VzppkiEP-1] ERROR Failed to process a data crawling: DOMAIN - WIKI
org.codelibs.fess.ds.atlassian.AtlassianDataStoreException: Failed to access org.codelibs.fess.ds.atlassian.api.confluence.content.GetContentsRequest@763ee2e0
at org.codelibs.fess.ds.atlassian.api.confluence.content.GetContentsRequest.execute(GetContentsRequest.java:92) ~[fess-ds-atlassian-14.6.0.jar:?]
at org.codelibs.fess.ds.atlassian.api.confluence.ConfluenceClient.getContents(ConfluenceClient.java:100) ~[fess-ds-atlassian-14.6.0.jar:?]
at org.codelibs.fess.ds.atlassian.ConfluenceDataStore.storeData(ConfluenceDataStore.java:72) ~[fess-ds-atlassian-14.6.0.jar:?]
at org.codelibs.fess.ds.AbstractDataStore.store(AbstractDataStore.java:122) ~[classes/:?]
at org.codelibs.fess.helper.DataIndexHelper$DataCrawlingThread.process(DataIndexHelper.java:218) ~[classes/:?]
at org.codelibs.fess.helper.DataIndexHelper$DataCrawlingThread.run(DataIndexHelper.java:204) ~[classes/:?]
Caused by: org.codelibs.curl.CurlException: HTTP Status : 500, error : {“statusCode”:500,“message”:“”,“reason”:“Internal Server Error”}
at org.codelibs.fess.ds.atlassian.api.confluence.content.GetContentsRequest.execute(GetContentsRequest.java:88) ~[fess-ds-atlassian-14.6.0.jar:?]
… 5 more
2023-03-10 04:16:58,343 [NMvHxoYBQzW1VzppkiEP-1] INFO Sent 369 docs (Doc:{process 1854ms, send 815ms, size 1021KB}, Mem:{used 387MB, heap 512MB, max 512MB})
2023-03-10 04:16:58,421 [NMvHxoYBQzW1VzppkiEP-1] INFO Deleted 0 old docs.
2023-03-10 04:16:58,421 [DataStoreCrawler] INFO [EXEC TIME] crawling time: 2168067ms
2023-03-10 04:16:58,422 [main] INFO Finished Crawler
2023-03-10 04:16:58,461 [main] INFO [CRAWL INFO] DataCrawlExecTime=2168067,DataCrawlEndTime=2023-03-10T04:16:58.422+0100,CrawlerEndTime=2023-03-10T04:16:58.422+0100,DataIndexExecTime=115850,CrawlerStatus=true,CrawlerStartTime=2023-03-10T03:40:50.315+0100,DataIndexSize=16825,CrawlerExecTime=2168107,DataCrawlStartTime=2023-03-10T03:40:50.341+0100
2023-03-10 04:16:58,469 [main] INFO Disconnected to http://localhost:9201
2023-03-10 04:16:58,471 [main] INFO Destroyed LaContainer.