blob: e0f6c16dc359bfee3cfad865ad1ac298c735bdd7 (
plain)
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
|
# Monitoring Setup
We use a layered monitoring approach to ensure full visibility and rapid response.
## Stack
- **Prometheus** for metrics collection
- **Grafana** for visualization dashboards
- **Fail2Ban** for intrusion attempts
- **Genesis Shield** for aggregated alerts (Telegram + Mastodon)
## What We Monitor
| System | Metric Examples |
|----------------|--------------------------------------------|
| PostgreSQL | Replication lag, disk usage, active queries |
| Web Servers | HTTP response time, TLS errors |
| MinIO / Assets | Cache hit ratio, sync status |
| Docker Hosts | Container uptime, memory pressure |
## Alerting
- Telegram: Real-time infra alerts
- Mastodon bot: Daily summaries and status posts
- Fallback email alerts for critical failures
|