Better alerting and monitoring

Description

We want to know when something is wrong with k8s, so we have to:

We should create dashboards, and alerts for these.

We also want to export, visualise and alert on data from:

[ ] Traefik (Provides HTTP(s) Ingress) [ ] Kyverno (Validates k8s objects according to custom rules) [ ] Rook (Connects Ceph storage to kubernetes)

The grafana agent will export based on any Prometheus CRD in the namespace grafana-agent with label instance=sysmans
Prometheus CRD definitions (only ServiceMonitor, PodMonitor, Probe)
Existing observability stuff
Grafana Instance
Each thing will probably have its own docs

Edited Apr 10, 2023 by Aria Shrimpton