Summary
Senior DevOps / Platform Engineer with 5+ years of experience designing, scaling, and operating high-availability cloud infrastructure for OTT platforms and AI workloads. Proven ownership of GPU infrastructure, Kubernetes-based platforms, and production systems serving 10M+ users. Known for 50-70% cloud cost reduction, reliable CI/CD automation, and handling 50-100x traffic spikes. Strong focus on reliability, scalability, and pragmatic automation.
Experience
Senior DevOps Engineer · Spyne (Eventila Pvt. Ltd.)
Feb 2024 - Present- Designed and operated GPU-based AI infrastructure for large-scale image and video processing.
- Built custom GPU auto-scaling, cutting infrastructure costs by 50-70% while sustaining 50-100x spikes.
- Implemented CI/CD with Jenkins and Terraform for GPU, ECS, and containerized workloads.
- Owned Kafka (MSK), MongoDB, Debezium, Metabase, with monitoring via Prometheus and CloudWatch.
- Coordinated infra across AI, backend, and frontend teams for stable production releases.
- Improved reliability with proactive alerting, automation, and infrastructure-as-code practices.
Senior DevOps Engineer · To The New
Apr 2020 - Apr 2023- Operated OTT infrastructure serving 10M+ users with sustained 10K+ TPS.
- Automated deployments using EKS, Jenkins, and AWS Lambda.
- Led EKS upgrades, zero-downtime releases, and large-scale migrations.
- Implemented monitoring and logging with Prometheus, Grafana, and ELK.
- Supported high-traffic live events with cost-efficient scaling strategies.
- Mentored junior DevOps engineers and contributed to long-term stability.
DevOps Trainee · To The New
Oct 2019 - Mar 2020- Supported ECS deployments and backend services including MongoDB, RabbitMQ, and Elasticsearch.
- Built Jenkins pipelines and AWS Lambda automations for operational workflows.
- Implemented logging and monitoring using CloudWatch and ELK.
Earlier Role · Company Name
YYYY - YYYY- Placeholder: Add 2-3 impact bullets with metrics.
Key Achievements
- Migrated workloads from amd64 to AWS Graviton, saving ~40% in costs.
- Built an in-house GPU cluster for AI inference, reducing processing costs by 5x.
- Segregated production and non-production AWS accounts with zero downtime.
- Delivered scalable infrastructure for national live events (IPL matches, PM speeches).
Technical Skills
- AWS (EKS, ECS, EC2, GPU, VPC, IAM)
- Terraform
- Jenkins
- Docker
- Kubernetes
- Prometheus
- Grafana
- ELK
- CloudWatch
- MongoDB
- MySQL
- Kafka
- RabbitMQ
- Redis
- Python
- Shell
- Java (working knowledge)
Certifications
- Placeholder: AWS Certified Solutions Architect - Associate (YYYY)
- Placeholder: CKAD or CKA (YYYY)
Education
Bachelor of Technology · Computer Science Engineering
2016 - 2020GLA University, Uttar Pradesh