Apach POIについて

◆質問
fess.logで以下のようなexceptionに主なものが2件出力されておりました。
以下のエラーはApachePOIによるものでインデックス作成には影響がない認識でよろしいでしょうか。(以前も同様な質問をさせていただきました)
後学のためどういった観点で以下のexceptionが問題ないと判断できるのか、教えていただけますと幸いです。

※情報に不備あればご指摘ください。

◆経緯
fess上のインデックスを確認したところ、以下のexceptionはインデックス上は問題ないと考えています。(fessのインデックスと実ファイルに差分があったがファイルサイズ原因のものと把握しています)

◆1件目

2023-07-26 10:34:43,661 [Crawler-20230726092141-1-3] WARN Command offset 22 past end of data at 3
2023-07-26 10:34:43,661 [Crawler-20230726092141-1-3] WARN Failed to create chunk at 710, ignoring rest of data.
java.lang.IllegalArgumentException: Found a chunk with a negative length, which isn’t allowed
at org.apache.poi.hdgf.chunks.ChunkFactory.createChunk(ChunkFactory.java:149) ~[poi-scratchpad-5.2.3.jar:5.2.3]
at org.apache.poi.hdgf.streams.ChunkStream.findChunks(ChunkStream.java:66) ~[poi-scratchpad-5.2.3.jar:5.2.3]
at org.apache.poi.hdgf.streams.PointerContainingStream.findChildren(PointerContainingStream.java:70) ~[poi-scratchpad-5.2.3.jar:5.2.3]
at org.apache.poi.hdgf.streams.PointerContainingStream.findChildren(PointerContainingStream.java:77) ~[poi-scratchpad-5.2.3.jar:5.2.3]
at org.apache.poi.hdgf.streams.PointerContainingStream.findChildren(PointerContainingStream.java:77) ~[poi-scratchpad-5.2.3.jar:5.2.3]
at org.apache.poi.hdgf.HDGFDiagram.(HDGFDiagram.java:89) ~[poi-scratchpad-5.2.3.jar:5.2.3]
at org.apache.poi.hdgf.extractor.VisioTextExtractor.(VisioTextExtractor.java:52) ~[poi-scratchpad-5.2.3.jar:5.2.3]
at org.apache.tika.parser.microsoft.OfficeParser.parse(OfficeParser.java:232) ~[tika-parser-microsoft-module-2.6.0.jar:2.6.0]
at org.apache.tika.parser.microsoft.OfficeParser.parse(OfficeParser.java:175) ~[tika-parser-microsoft-module-2.6.0.jar:2.6.0]
at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:298) ~[tika-core-2.6.0.jar:2.6.0]
at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:298) ~[tika-core-2.6.0.jar:2.6.0]
at org.codelibs.fess.crawler.extractor.impl.TikaExtractor$TikaDetectParser.parse(TikaExtractor.java:507) ~[fess-crawler-14.6.1.jar:?]
at org.codelibs.fess.crawler.extractor.impl.TikaExtractor.lambda$getText$0(TikaExtractor.java:194) ~[fess-crawler-14.6.1.jar:?]
at org.codelibs.fess.crawler.extractor.impl.TikaExtractor.getContent(TikaExtractor.java:404) ~[fess-crawler-14.6.1.jar:?]
at org.codelibs.fess.crawler.extractor.impl.TikaExtractor.getText(TikaExtractor.java:183) ~[fess-crawler-14.6.1.jar:?]
at org.codelibs.fess.crawler.transformer.AbstractFessFileTransformer.getExtractData(AbstractFessFileTransformer.java:387) ~[classes/:?]
at org.codelibs.fess.crawler.transformer.AbstractFessFileTransformer.generateData(AbstractFessFileTransformer.java:100) ~[classes/:?]
at org.codelibs.fess.crawler.transformer.AbstractFessFileTransformer.transform(AbstractFessFileTransformer.java:81) ~[classes/:?]
at org.codelibs.fess.crawler.processor.impl.DefaultResponseProcessor.process(DefaultResponseProcessor.java:74) ~[fess-crawler-14.6.1.jar:?]
at org.codelibs.fess.crawler.CrawlerThread.processResponse(CrawlerThread.java:291) ~[fess-crawler-14.6.1.jar:?]
at org.codelibs.fess.crawler.FessCrawlerThread.processResponse(FessCrawlerThread.java:249) ~[classes/:?]
at org.codelibs.fess.crawler.CrawlerThread.run(CrawlerThread.java:162) ~[fess-crawler-14.6.1.jar:?]
at java.lang.Thread.run(Thread.java:833) ~[?:?]
2023-07-26 10:34:43,661 [Crawler-20230726092141-1-3] WARN Header called for 1050034239 bytes, but that would take us past the end of the data!
2023-07-26 10:34:43,661 [Crawler-20230726092141-1-3] WARN Header called for 1056964608 bytes, but that would take us past the end of the data!

◆2件目

2023-07-26 10:34:43,677 [Crawler-20230726092141-1-3] WARN Command offset 104 past end of data at 20
2023-07-26 10:34:43,677 [Crawler-20230726092141-1-3] WARN Command offset 112 past end of data at 20
2023-07-26 10:34:43,677 [Crawler-20230726092141-1-3] WARN Command offset 120 past end of data at 20
2023-07-26 10:34:43,677 [Crawler-20230726092141-1-3] WARN Unexpected error processing command, ignoring and continuing. Command: org.apache.poi.hdgf.chunks.Chunk$Command@418d7179
java.lang.ArrayIndexOutOfBoundsException: Index 23 out of bounds for length 20
at org.apache.poi.util.LittleEndian.getLong(LittleEndian.java:155) ~[poi-5.2.3.jar:5.2.3]
at org.apache.poi.util.LittleEndian.getDouble(LittleEndian.java:74) ~[poi-5.2.3.jar:5.2.3]
at org.apache.poi.hdgf.chunks.Chunk.processCommands(Chunk.java:199) ~[poi-scratchpad-5.2.3.jar:5.2.3]
at org.apache.poi.hdgf.chunks.ChunkFactory.createChunk(ChunkFactory.java:207) ~[poi-scratchpad-5.2.3.jar:5.2.3]
at org.apache.poi.hdgf.streams.ChunkStream.findChunks(ChunkStream.java:66) ~[poi-scratchpad-5.2.3.jar:5.2.3]
at org.apache.poi.hdgf.streams.PointerContainingStream.findChildren(PointerContainingStream.java:70) ~[poi-scratchpad-5.2.3.jar:5.2.3]
at org.apache.poi.hdgf.streams.PointerContainingStream.findChildren(PointerContainingStream.java:77) ~[poi-scratchpad-5.2.3.jar:5.2.3]
at org.apache.poi.hdgf.HDGFDiagram.(HDGFDiagram.java:89) ~[poi-scratchpad-5.2.3.jar:5.2.3]
at org.apache.poi.hdgf.extractor.VisioTextExtractor.(VisioTextExtractor.java:52) ~[poi-scratchpad-5.2.3.jar:5.2.3]
at org.apache.tika.parser.microsoft.OfficeParser.parse(OfficeParser.java:232) ~[tika-parser-microsoft-module-2.6.0.jar:2.6.0]
at org.apache.tika.parser.microsoft.OfficeParser.parse(OfficeParser.java:175) ~[tika-parser-microsoft-module-2.6.0.jar:2.6.0]
at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:298) ~[tika-core-2.6.0.jar:2.6.0]
at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:298) ~[tika-core-2.6.0.jar:2.6.0]
at org.codelibs.fess.crawler.extractor.impl.TikaExtractor$TikaDetectParser.parse(TikaExtractor.java:507) ~[fess-crawler-14.6.1.jar:?]
at org.codelibs.fess.crawler.extractor.impl.TikaExtractor.lambda$getText$0(TikaExtractor.java:194) ~[fess-crawler-14.6.1.jar:?]
at org.codelibs.fess.crawler.extractor.impl.TikaExtractor.getContent(TikaExtractor.java:404) ~[fess-crawler-14.6.1.jar:?]
at org.codelibs.fess.crawler.extractor.impl.TikaExtractor.getText(TikaExtractor.java:183) ~[fess-crawler-14.6.1.jar:?]
at org.codelibs.fess.crawler.transformer.AbstractFessFileTransformer.getExtractData(AbstractFessFileTransformer.java:387) ~[classes/:?]
at org.codelibs.fess.crawler.transformer.AbstractFessFileTransformer.generateData(AbstractFessFileTransformer.java:100) ~[classes/:?]
at org.codelibs.fess.crawler.transformer.AbstractFessFileTransformer.transform(AbstractFessFileTransformer.java:81) ~[classes/:?]
at org.codelibs.fess.crawler.processor.impl.DefaultResponseProcessor.process(DefaultResponseProcessor.java:74) ~[fess-crawler-14.6.1.jar:?]
at org.codelibs.fess.crawler.CrawlerThread.processResponse(CrawlerThread.java:291) ~[fess-crawler-14.6.1.jar:?]
at org.codelibs.fess.crawler.FessCrawlerThread.processResponse(FessCrawlerThread.java:249) ~[classes/:?]
at org.codelibs.fess.crawler.CrawlerThread.run(CrawlerThread.java:162) ~[fess-crawler-14.6.1.jar:?]
at java.lang.Thread.run(Thread.java:833) ~[?:?]
2023-07-26 10:34:43,677 [Crawler-20230726092141-1-3] WARN Command offset 24 past end of data at 20
2023-07-26 10:34:43,677 [Crawler-20230726092141-1-3] WARN Command offset 32 past end of data at 20

ログレベルがERRORで出力されるものがFessとしては、処理できなかったものになります。なので、WARNであれば、普通に処理されていると思います。

各ライブラリがそれぞれの基準で出力しているものなので、Fessとしては参考情報レベルの出力としか見ていませんが、気になる例外などがあるのであれば、各ライブラリに問い合わせるのが良いと思います。