summaryrefslogtreecommitdiff
path: root/genesishosting/infra/infra-monitoring-setup.md
blob: e0f6c16dc359bfee3cfad865ad1ac298c735bdd7 (plain)
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
# Monitoring Setup

We use a layered monitoring approach to ensure full visibility and rapid response.

## Stack

- **Prometheus** for metrics collection
- **Grafana** for visualization dashboards
- **Fail2Ban** for intrusion attempts
- **Genesis Shield** for aggregated alerts (Telegram + Mastodon)

## What We Monitor

| System         | Metric Examples                           |
|----------------|--------------------------------------------|
| PostgreSQL     | Replication lag, disk usage, active queries |
| Web Servers    | HTTP response time, TLS errors             |
| MinIO / Assets | Cache hit ratio, sync status               |
| Docker Hosts   | Container uptime, memory pressure          |

## Alerting

- Telegram: Real-time infra alerts
- Mastodon bot: Daily summaries and status posts
- Fallback email alerts for critical failures