mirror of
https://github.com/healthchecks/healthchecks.git
synced 2026-04-25 23:15:49 +03:00
[GH-ISSUE #144] Reverse healthcheck - Signal #98
Labels
No labels
bug
bug
bug
feature
good-first-issue
new integration
pull-request
question
No milestone
No project
No assignees
1 participant
Notifications
Due date
No due date set.
Dependencies
No dependencies set.
Reference
starred/healthchecks#98
Loading…
Add table
Add a link
Reference in a new issue
No description provided.
Delete branch "%!s()"
Deleting a branch is permanent. Although the deleted branch may continue to exist for a short time before it actually gets removed, it CANNOT be undone in most cases. Continue?
Originally created by @Braintelligence on GitHub (Dec 27, 2017).
Original GitHub issue: https://github.com/healthchecks/healthchecks/issues/144
Hi everyone!
Is there a way to set up kind of a "reverse" healthcheck? Like, not watch a regularly incoming ping but rather a single ping that is meant as a warning-trigger?
For example I have a machine that I am monitoring by pinging every 5 minutes but I also want it to be able to ping me when it rebooted for some reason only once, and by doing so, triggering a warning.
So instead of a "ping regularly for relaxation" I want to be able to "ping once for danger".
Is this possible?
@cuu508 commented on GitHub (Dec 27, 2017):
This is an interesting use case.
There is no specific support for it, but you can use the regular checks and it sort of works: set up a new check, and set its period and grace time both to 1 minute. Its normal, default state will be "down". And it will flip to "up" for a short time each time it is pinged.
When you ping this check, you will immediately get a "[name of the check] is UP". Two minutes later you will get "[name of the check] is DOWN". Each time you see this pair of messages you will know the machine had rebooted.
If you send a ping during a shutdown, you could name the check "Machine Shutdown Process". And if you call it during boot, name the check "Machine Boot-up" so the wording in notifications makes at least some sense...
It's not ideal that you would be getting two notifications, and the wording in the notifications also would not be the clearest, but this is something you can use today.
@Braintelligence commented on GitHub (Dec 27, 2017):
Actually the machine doesn't shutdown properly when this happens.
To be more precise: I have machines running unstable applications, that MAY just crash at some point. When this happens they just reboot but they don't restart the processes by themselves (which is wanted this way).
So I want to know if they run properly after their processes were started by hand and I want to know ASAP when they rebooted by themselves.
Your hint was great, I'll incorporate it!
@Braintelligence commented on GitHub (Dec 27, 2017):
(Though it would be really cool if we had the option to create such "signals" as well in the future 😸)
@Braintelligence commented on GitHub (Dec 27, 2017):
Oh and as another possible enhancement:
If there are green checks for a tag and also red checks for a tag, then I'd like for the check-tag to be yellow instead of red, so I'd know that SOME stuff is working, while others is not.
@cuu508 commented on GitHub (Dec 27, 2017):
Are you thinking of the tags in "My Checks" screen or the badges in Account > Badges?
For the latter, I've had an idea about optionally showing the badges like this:
instead of just
or
@Braintelligence commented on GitHub (Dec 27, 2017):
On the "My Checks" part, where below the checks are listed.
Showing the tags like you proposed would be even better. This way I could have something like:
Then if I have 5 machines, I know none have rebooted in the last X timeframe.
@cuu508 commented on GitHub (Aug 20, 2018):
I've been playing around with the badge styling, trying to incorporate the "5 down" bit in them. It's not easy to add the extra information without making the tags area look busy and noisy. Especially for accounts that use dozens of tags. So for now I'll leave the tag area as-is.
PS. There have been a couple related changes & improvements though:
PPS. On the original issue about triggering failures, that's been implemented, check out "Signalling a Failure" in https://healthchecks.io/docs/