What we actually monitor
HTTP/HTTPS, TCP, UDP, and multi-region truth. We don’t guess from one location and call it global.
- HTTP / HTTPS: We hit a URL (optionally require a keyword in the body). We record latency, HTTP code. If we get 4xx/5xx or the body doesn’t match, it’s DOWN.
- TCP ports: We attempt a TCP connect to any port (443, 25565, 22, etc). If we can’t handshake, it’s DOWN. Connect time = latency.
- UDP reachability: For DNS, game servers, lightweight daemons. We send a datagram and confirm reachability. “Couldn’t even open socket” is DOWN.
- Multi-region truth: Each monitor can be checked from multiple probes (for example: Canada, US West, US East). If only one fails, status is PARTIAL; if all fail, it’s DOWN. You get regional context, not just “red light.”
Current probe regions
These are the actual locations hitting your service every cycle, right now. We’ll keep adding more geography (and you’ll see them here automatically).
BH_Canada
Beauharnois, Canada
LasVegas
LasVegas
Vinthill
Vinthill, Virginia
Alert policy & routing
You choose how loud it should be, and who gets hit.
- Per-monitor alerting policy: “page me if any probe is down,” “page me if most probes are down,” or “notify only if all probes are down.”
- We include context in the alert: which region broke first, HTTP/socket failure, and latency.
- We route to the destinations you define: Discord channel / webhook, browser push (installable PWA), email, shared escalation contacts. SMS / voice rollout is in progress.
Incident history & audit trail
Clean evidence for exec briefings, postmortems, and compliance.
- We snapshot every flip: start time, which region failed first, exact error, and when it recovered.
- You get per-probe latency + HTTP codes over time, not just “0/1 up/down.”
- We keep a full incident timeline you can hand to leadership or attach to a root cause analysis.
Example alerts
The actual messages your on-call team receives.
Recovery notice (green)
Service restored. Includes which regions cleared.
Regional degradation (yellow)
Canada good, Vegas bad. We call that “regional,” not “global outage.”
Full outage (red)
All assigned probes failed. Everyone you’ve listed will get hit.
Plan comparison
Limits, intervals, and regional coverage. You can change plans any time.