Resume

Site Reliability Engineer with experience in cloud-native migrations, distributed systems automation, and platform reliability. Proven track record of using Kubernetes, AWS, and observability tooling to keep systems resilient and performant at scale.

Experience

Site Reliability Engineer, Apple Pay

Apple (Contractor through TCS) | Jul 2022 - Present

Cupertino, CA

In this role, I oversee the migration of Apple Pay services to the cloud, ensuring seamless integration with existing Apple services. I implement observability best practices by designing dashboards, alerts, and automated workflows that minimize service disruptions and operational overhead. I partner with cross-functional teams to develop standardized onboarding processes that accelerate future migration initiatives. I also led operational readiness and reliability for Apple Pay's expansion to third-party browsers, broadening user access.

Site Reliability Engineer, Apple Maps

Apple (Contractor through TCS) | Jan 2020 - Jul 2022

Sunnyvale, CA

As an SRE for Apple Maps, I built and managed large-scale distributed data platforms using Kafka, Zookeeper, Elasticsearch, and Redis. I focused on maintaining high platform availability through proactive monitoring, automated failover systems, and zero-downtime platform upgrades. I also automated infrastructure provisioning and configuration with Ansible, reducing manual effort and configuration errors in large-scale Linux environments.

Application Support Lead

AT&T (Contractor through Tech Mahindra) | Mar 2019 - Jan 2020

St. Louis, MO

As Application Support Lead, I led a support team that maintained more than 40 critical business applications, implementing comprehensive monitoring and alerting to ensure high availability. I partnered with development teams to drive incident response and root cause analysis, and streamlined release cycles to improve service reliability. I also developed disaster recovery procedures and conducted failover testing to ensure business continuity and minimize service downtime.

Sr. Linux System Administrator

Tech Mahindra Ltd. (Client AT&T) | Jan 2011 - Mar 2019

Noida, India

In this position, I deployed and maintained Linux-based systems across AT&T data centers, prioritizing security, availability, and performance. My responsibilities included managing bare-metal servers and virtual machines in private cloud environments, coordinating OS patching and firmware upgrades, and specializing in high-availability setups with Veritas InfoScale. This work kept the operating environment robust and scalable.

Education

Sikkim Manipal University

Master of Computer Applications (MCA)

Kuvempu University

Bachelor of Science (BSc) in Information Technology

NIIT, New Delhi

GNIIT program in Software Engineering

Technical Skills

Cloud & Infrastructure

Kubernetes, AWS, Istio, Terraform, Linux, Docker

Programming & Scripting

Python, Bash

Monitoring & Observability

OpenTelemetry, Prometheus, Grafana, Splunk, Datadog

Data Platforms

Kafka, Redis, Elasticsearch, Zookeeper

DevOps & CI/CD

Git, GitHub, Spinnaker, FluxCD, Flagger

AI Development Tools

Claude Code, Gemini Code Assist

Professional Certifications

  • AWS Certified Solutions Architect - Associate
  • Red Hat Certified Engineer (RHCE)
  • Cisco Certified Network Associate (CCNA) Routing & Switching
  • ITIL v3 Foundation