Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[BUG] org.opensearch.recovery.RecoveryWhileUnderLoadIT.testRecoverWhileUnderLoadAllocateReplicasRelocatePrimariesTest {p0={"cluster.indices.replication.strategy":"DOCUMENT"}} is flaky #13820

Open
reta opened this issue May 24, 2024 · 1 comment
Labels
bug Something isn't working flaky-test Random test failure that succeeds on second run Storage Issues and PRs relating to data and metadata storage

Comments

@reta
Copy link
Collaborator

reta commented May 24, 2024

Describe the bug

The test case org.opensearch.recovery.RecoveryWhileUnderLoadIT.testRecoverWhileUnderLoadAllocateReplicasRelocatePrimariesTest {p0={"cluster.indices.replication.strategy":"DOCUMENT"}} is flaky:

java.lang.AssertionError: All incoming requests on node [node_s4] should have finished. Expected 0 but got 368; pending tasks [[{
  "node" : "node_s4",
  "id" : 76,
  "type" : "direct",
  "action" : "indices:admin/seq_no/retention_lease_sync[p]",
  "status" : {
    "phase" : "primary"
  },
  "description" : "retention_lease_sync shardId=[test][3]",
  "start_time" : "2024-05-24T17:25:25.617Z",
  "start_time_in_millis" : 1716571525617,
  "running_time" : "1m",
  "running_time_in_nanos" : 65752135745,
  "cancellable" : false,
  "cancelled" : false,
  "parent_task_id" : "MWBqDHQmSkefpKFbNwRdHg:75",
  "headers" : { },
  "resource_stats" : {
    "average" : {
      "cpu_time_in_nanos" : 0,
      "memory_in_bytes" : 0
    },
    "total" : {
      "cpu_time_in_nanos" : 0,
      "memory_in_bytes" : 0
    },
    "min" : {
      "cpu_time_in_nanos" : 0,
      "memory_in_bytes" : 0
    },
    "max" : {
      "cpu_time_in_nanos" : 0,
      "memory_in_bytes" : 0
    },
    "thread_info" : {
      "thread_executions" : 0,
      "active_threads" : 0
    }
  }
},{
  "node" : "node_s4",
  "id" : 57,
  "type" : "transport",
  "action" : "internal:index/shard/recovery/start_recovery",
  "description" : "",
  "start_time" : "2024-05-24T17:25:25.549Z",
  "start_time_in_millis" : 1716571525549,
  "running_time" : "1m",
  "running_time_in_nanos" : 65820120649,
  "cancellable" : false,
  "cancelled" : false,
  "headers" : { },
  "resource_stats" : {
    "average" : {
      "cpu_time_in_nanos" : 0,
      "memory_in_bytes" : 0
    },
    "total" : {
      "cpu_time_in_nanos" : 0,
      "memory_in_bytes" : 0
    },
    "min" : {
      "cpu_time_in_nanos" : 0,
      "memory_in_bytes" : 0
    },
    "max" : {
      "cpu_time_in_nanos" : 0,
      "memory_in_bytes" : 0
    },
    "thread_info" : {
      "thread_executions" : 0,
      "active_threads" : 0
    }
  }
},{
  "node" : "node_s4",
  "id" : 75,
  "type" : "transport",
  "action" : "retention_lease_sync",
  "status" : {
    "phase" : "starting"
  },
  "description" : "retention_lease_sync shardId=[test][3]",
  "start_time" : "2024-05-24T17:25:25.617Z",
  "start_time_in_millis" : 1716571525617,
  "running_time" : "1m",
  "running_time_in_nanos" : 65752504066,
  "cancellable" : false,
  "cancelled" : false,
  "headers" : { },
  "resource_stats" : {
    "average" : {
      "cpu_time_in_nanos" : 0,
      "memory_in_bytes" : 0
    },
    "total" : {
      "cpu_time_in_nanos" : 0,
      "memory_in_bytes" : 0
    },
    "min" : {
      "cpu_time_in_nanos" : 0,
      "memory_in_bytes" : 0
    },
    "max" : {
      "cpu_time_in_nanos" : 0,
      "memory_in_bytes" : 0
    },
    "thread_info" : {
      "thread_executions" : 0,
      "active_threads" : 0
    }
  }
}]]

Related component

Storage

To Reproduce

 ./gradlew ':server:internalClusterTest' --tests "org.opensearch.recovery.RecoveryWhileUnderLoadIT" -Dtests.method="testRecoverWhileUnderLoadAllocateReplicasRelocatePrimariesTest {p0={"cluster.indices.replication.strategy":"DOCUMENT"}}" -Dtests.seed=FE4B39551EA78350

Expected behavior

The test must always pass

Additional Details

Plugins
Please list all plugins currently enabled.

Screenshots
Standard

Host/Environment (please complete the following information):

  • CI

Additional context

@reta reta added bug Something isn't working untriaged flaky-test Random test failure that succeeds on second run labels May 24, 2024
@github-actions github-actions bot added the Storage Issues and PRs relating to data and metadata storage label May 24, 2024
@sachinpkale
Copy link
Member

[Storage Triage - attendees 1 2 3 4 5 6 7 8 9 10 ]

Moving to This Quarter

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working flaky-test Random test failure that succeeds on second run Storage Issues and PRs relating to data and metadata storage
Projects
Status: Now(This Quarter)
Development

No branches or pull requests

3 participants