"failed to load bitset" exception thrown repeatedly for several days #103840
Labels
>bug
:Distributed/Engine
Anything around managing Lucene and the Translog in an open shard.
Team:Distributed
Meta label for distributed team
Elasticsearch Version
7.10.1
Java Version
jdk11
OS Version
4.14.81.bm.29-amd64 #1 SMP Debian 4.14.81.bm.29
Problem Description
In our product, we have seen two cases where the logs point to file-system corruption but the shard stays green, which seems abnormal.
Case1
Server log: the following exception was thrown repeatedly for several days:
The code only logs a warning, no matter what the exception is:
https://github.com/elastic/elasticsearch/blob/1c34507e66d7db1211f66f3513706fdf548736aa/server/src/main/java/org/elasticsearch/index/cache/bitset/BitsetFilterCache.java#L270C5-L270C5
Should we distinguish between exception types here? If the failure is an IOException, the shard should be failed.
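To make the proposal concrete, here is a minimal sketch of the distinction I have in mind. It is not the actual Elasticsearch code: `ShardFailer`, `hasIoCause` and `handleFailure` are hypothetical names invented for illustration, and how a shard is really failed from this listener is an open question for the maintainers.

```java
import java.io.IOException;
import java.util.ArrayList;
import java.util.List;

class WarmFailureSketch {

    /** Hypothetical callback standing in for whatever mechanism fails a shard. */
    public interface ShardFailer {
        void failShard(String reason, Throwable cause);
    }

    /** Walks the cause chain (bounded, to survive cyclic causes) looking for an IOException. */
    public static boolean hasIoCause(Throwable t) {
        int depth = 0;
        while (t != null && depth++ < 32) {
            if (t instanceof IOException) {
                return true;
            }
            t = t.getCause();
        }
        return false;
    }

    /**
     * Fails the shard when the failure is I/O related; otherwise only records a warning.
     * Returns true if the shard was failed.
     */
    public static boolean handleFailure(Throwable t, ShardFailer failer, List<String> warnings) {
        if (hasIoCause(t)) {
            failer.failShard("failed to load bitset", t);
            return true;
        }
        warnings.add("failed to load bitset: " + t);
        return false;
    }

    public static void main(String[] args) {
        List<String> warnings = new ArrayList<>();
        ShardFailer failer = (reason, cause) -> System.out.println("failing shard: " + reason);
        // An IOException wrapped in another exception is still treated as an I/O failure.
        handleFailure(new RuntimeException("wrapped", new IOException("read error")), failer, warnings);
        // Any other exception keeps the current behaviour: a warning only.
        handleFailure(new IllegalStateException("not I/O"), failer, warnings);
        System.out.println("warnings recorded: " + warnings.size());
    }
}
```

Checking the whole cause chain matters because the IOException may arrive wrapped in another exception by the time the listener sees it.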
Case2
Client log:
An EngineException is thrown internally, but the calling code does not handle it:
elasticsearch/server/src/main/java/org/elasticsearch/index/engine/InternalEngine.java
Line 724 in 1c34507
Should we fail the shard here when the EngineException is thrown?