niteshchauhan.xyz

Platform Engineer and SRE with 11+ years of experience building and operating large-scale distributed infrastructure.

Currently leading the platform engineering function at Sportserve, where the platform is a multi-cluster Kubernetes estate serving 200+ engineers across 15 teams, with observability infrastructure processing 1M+ Prometheus samples/sec and 15M active time series.

Before that, 7+ years across AWS, GCP, and OCI — leading infrastructure for Saudi Arabia’s largest government e-invoicing platform, a data science platform, and enterprise CI/CD at scale.

I care about platform engineering done right: self-service developer tooling, SLO-driven reliability culture, and infrastructure that gets out of engineers’ way.

Here, I write about platform engineering, Kubernetes, automation, and building infrastructure at scale.