Building the Stack — Plugin, StatsD, Sweeper, and Grafana
Step-by-step implementation of an Airflow metrics plugin using Gauges, configuring StatsD exporter, and deploying a Sweeper DAG to prevent Pushgateway OOMs.
May 29, 20266 min read

Search for a command to run...

Series
Stop treating batch jobs like long-running services. This series provides a Staff-level blueprint for batch workload observability—covering push-based telemetry, metric granularity, and the architectural divide between monitoring state vs. auditing history.
Step-by-step implementation of an Airflow metrics plugin using Gauges, configuring StatsD exporter, and deploying a Sweeper DAG to prevent Pushgateway OOMs.

What to Track, Where to Store It, and Why Counters Break

Master the Airflow Prometheus StatsD architecture for batch workloads. Eliminate race conditions and cardinality explosions with push-based telemetry.

How Your Pipelines Fail While Your Dashboards Stay Green
