summaryrefslogtreecommitdiff
path: root/genesishosting/infra/infra-monitoring-setup.md
diff options
context:
space:
mode:
authordoc <doc@filenotfound.org>2025-06-30 20:06:28 +0000
committerdoc <doc@filenotfound.org>2025-06-30 20:06:28 +0000
commit717fcb9c81d2bc3cc7a84a3ebea6572d7ff0f5cf (patch)
tree7cbd6a8d5046409a82b22d34b01aac93b3e24818 /genesishosting/infra/infra-monitoring-setup.md
parent8368ff389ec596dee6212ebeb85e01c638364fb3 (diff)
uploading documentationHEADmaster
Diffstat (limited to 'genesishosting/infra/infra-monitoring-setup.md')
-rw-r--r--genesishosting/infra/infra-monitoring-setup.md25
1 files changed, 25 insertions, 0 deletions
diff --git a/genesishosting/infra/infra-monitoring-setup.md b/genesishosting/infra/infra-monitoring-setup.md
new file mode 100644
index 0000000..e0f6c16
--- /dev/null
+++ b/genesishosting/infra/infra-monitoring-setup.md
@@ -0,0 +1,25 @@
+# Monitoring Setup
+
+We use a layered monitoring approach to ensure full visibility and rapid response.
+
+## Stack
+
+- **Prometheus** for metrics collection
+- **Grafana** for visualization dashboards
+- **Fail2Ban** for intrusion attempts
+- **Genesis Shield** for aggregated alerts (Telegram + Mastodon)
+
+## What We Monitor
+
+| System | Metric Examples |
+|----------------|--------------------------------------------|
+| PostgreSQL | Replication lag, disk usage, active queries |
+| Web Servers | HTTP response time, TLS errors |
+| MinIO / Assets | Cache hit ratio, sync status |
+| Docker Hosts | Container uptime, memory pressure |
+
+## Alerting
+
+- Telegram: Real-time infra alerts
+- Mastodon bot: Daily summaries and status posts
+- Fallback email alerts for critical failures