Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Stack Monitoring] [Test Scenario] Out of the box alerting #98765

Closed
9 of 24 tasks
simianhacker opened this issue Apr 29, 2021 · 7 comments
Closed
9 of 24 tasks

[Stack Monitoring] [Test Scenario] Out of the box alerting #98765

simianhacker opened this issue Apr 29, 2021 · 7 comments
Assignees
Labels
Feature:Stack Monitoring Team:Infra Monitoring UI - DEPRECATED DEPRECATED - Label for the Infra Monitoring UI team. Use Team:obs-ux-infra_services test-plan

Comments

@simianhacker
Copy link
Member

simianhacker commented Apr 29, 2021

Summary

Stack Monitoring provides a set of out-of-the-box alerts, created by simply loading the Stack Monitoring UI within Kibana. The default action for each alert is a server log and the action messaging is controlled by the Stack Monitoring UI code directly.

PRs

Original, and CPU alert: #68805
Disk usage alert: #75419
JVM memory usage alert: #79039
Missing monitoring data alert: #78208
Threadpool rejections alert: #79433

Testing

Creation

  • Ensure alerts are created once visiting the Stack Monitoring UI
  • Ensure a user with the minimum set of monitoring permissions is able to create and manage alerts
    • ⚠️ this requires extra permissions, that should be documented (and reflected in the test scenario)

Management

UX

Specific alerts

  • Ensure you can properly trigger and see the server log for the CPU usage alert
  • Ensure you can properly trigger and see the server log for the disk usage alert
  • Ensure you can properly trigger and see the server log for the jvm memory alert
  • Ensure you can properly trigger and see the server log for the missing monitoring data alert (Note: This alert is only concerned with Elasticsearch now and no longer looks at other stack products [Monitoring] Missing monitoring data alert firing for version upgrade and configuration changes for Kibana in Cloud #83309)
  • Ensure you can properly trigger and see the server log for both threadpool rejection alerts
    • 🤔 I was unable to create the conditions for this.

Information in reproducting legacy alerts -> #87377

  • Ensure you can properly trigger and see the server log for the legacy cluster health alert
  • Ensure you can properly trigger and see the server log for the legacy nodes change alert
  • Ensure you can properly trigger and see the server log for the legacy Elasticsearch version mismatch alert
  • Ensure you can properly trigger and see the server log for the legacy Kibana version mismatch alert
  • Ensure you can properly trigger and see the server log for the legacy Logstash version mismatch alert
  • Ensure you can properly trigger and see the server log for the legacy license expiration alert (This cannot be tested on cloud)
    • 🤔 I was unable to create the conditions for this.

Edge cases

Previous issue: #85841

@simianhacker simianhacker added Team:Infra Monitoring UI - DEPRECATED DEPRECATED - Label for the Infra Monitoring UI team. Use Team:obs-ux-infra_services test-plan Feature:Stack Monitoring labels Apr 29, 2021
@elasticmachine
Copy link
Contributor

Pinging @elastic/logs-metrics-ui (Team:logs-metrics-ui)

@phillipb phillipb self-assigned this Apr 29, 2021
@neptunian neptunian assigned neptunian and unassigned phillipb May 11, 2021
@neptunian
Copy link
Contributor

neptunian commented May 11, 2021

@igoristic

Ensure alerts are created once visiting the Stack Monitoring UI

Can you advise on the best way to ensure the alerts are created? After visiting Stack Monitoring i went to Stack Management -> Rules and Connectors and saw the alerts in the summary. Though there were man others as well. For "Threadpool rejections" there were "Thread pool search rejections" and "Thread pool write rejections". What is meant by the "Original" alert as described in the summary?

Ensure a user with the minimum set of monitoring permissions is able to create and manage alerts

Is creating a user and assigning it to the monitoring role fulfill this requirement? I'm not sure what qualifies as the "minimum set of monitoring permissions".

@neptunian
Copy link
Contributor

I'm not sure this is related, but after the default alerts were created I started getting this in the logs:
Screen Shot 2021-05-11 at 10 28 53 AM
which appears to be coming from:
Screen Shot 2021-05-11 at 10 28 13 AM
I'm using the release cluster

@neptunian
Copy link
Contributor

neptunian commented May 11, 2021

Ensure you can properly trigger and see the server log for both threadpool rejection alerts

Is there any guidance for how to create the conditions for this?

@neptunian
Copy link
Contributor

neptunian commented May 11, 2021

I was not able to get any legacy alerts to fire using the gists. I'm using the release cluster (enterprise license) with the elastic user.

@neptunian
Copy link
Contributor

Ensure Stack Monitoring alerts are not editable or createable in the Alerts & Management UI

Can this be removed? There is a comment below it that says it no longer applies. Can the other comments that were copied over also be addressed?

@simianhacker
Copy link
Member Author

Already shipped

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Feature:Stack Monitoring Team:Infra Monitoring UI - DEPRECATED DEPRECATED - Label for the Infra Monitoring UI team. Use Team:obs-ux-infra_services test-plan
Projects
None yet
Development

No branches or pull requests

4 participants