Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Create monitoring alerts #10

Open
lovinscari opened this issue Aug 9, 2022 · 1 comment
Open

Create monitoring alerts #10

lovinscari opened this issue Aug 9, 2022 · 1 comment
Labels
documentation Improvements or additions to documentation enhancement New feature or request

Comments

@lovinscari
Copy link

Alerts need:

Response time
Errors
CPU runtime / storage / memory
Apdex (an amalgamated metric approximating user satisfaction)
Alerts should:

Be actionable
Send the permalink to the metric or a dashboard displaying all relevant metrics via runbook and/or a guide to troubleshooting
Be checked mostly after codebase changes but alert us to issues we can't troubleshoot so that they can be ticketed to Acquia
Alerts can:

Monitor modules/hooks
Monitor SQL DB
Other things that aren't directly relevant (?) but may be useful for troubleshooting
A reasonable metric here is if things are 10x the time they'd normally take or 10% the quality they'd normally have or some threshold undefined for errors over 5-15 minutes period, then we should have an alert.

@lovinscari lovinscari added documentation Improvements or additions to documentation enhancement New feature or request labels Aug 9, 2022
@maxdmayhew
Copy link

maxdmayhew commented Oct 31, 2023

did we already talk about this? is this what new relec/thousand eyes? Is this a duplicate, can we close out?

@lovinscari @tmill29 @rotated8

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
documentation Improvements or additions to documentation enhancement New feature or request
Projects
Development

No branches or pull requests

2 participants