I keep production systems alive and make sure they stay that way.
5+ years in DevOps and SRE, mostly in FinTech, IoT, and regulated enterprise environments where things breaking at 2am actually costs money. I don't just set up pipelines and walk away. I stick around for the on-call, the incident reviews, and the "why did this alert fire 47 times last night?" conversations.
What I've shipped that I'm proud of:
⚙️ Cut MTTR by 35% by writing incident runbooks that people actually follow, cleaning up alert noise, and running blameless postmortems 💰 Saved 20% on infrastructure spend through capacity planning, autoscaling tuning, and reserved instance strategies 🏗️ Standardized production AWS + Kubernetes environments with Terraform across 5+ enterprise programs in FinTech, IoT, and Litigation 🔐 Designed SOC 2-aligned architectures with proper access controls, centralized logging, and DR readiness 🤖 Built MLOps pipelines for model deployment and inference monitoring in production 📊 Set up observability stacks (CloudWatch, Prometheus, Grafana, Datadog) that caught problems before customers did 🚀 Governed CI/CD across multiple teams with compliance gates, security scans, and change traceability baked in
👥 Mentored 4 junior engineers on production readiness and on-call practices
Stack: AWS · Kubernetes · Docker · Terraform · GitHub Actions · GitLab CI · Jenkins · Prometheus · Grafana · Datadog · Python · Bash · Linux
Domains I've worked in: FinTech, IoT, Healthcare, Regulated Enterprise, Early-stage Startups
Currently looking at Senior DevOps, SRE, and Platform Engineering roles at tech companies, banks, and consulting firms where reliability actually matters.
Reach me at [email protected] | +91-7019261553/9457296121