Open to Senior SRE · DevOps Roles

Sahil Kumar Sahu

Role

Senior Site Reliability Engineer

Companies

Bosch · Zeta · Oracle

2021 – Present

I build systems that don't wake you up at 3am. Production-grade Kubernetes, GitOps and observability at scale.

LOCATION

Bengaluru, Karnataka

STATUS

Open to Work

About

Reliability is easy to claim.
Hard to hold under load.

Who I am

Senior Site Reliability Engineer with 5+ years designing cloud-native platforms at Oracle, Zeta and Robert Bosch. I specialise in Kubernetes, GitOps and observability — building systems that are highly available, secure and operationally quiet so engineering teams can ship without fear. The goal is always the same: make reliability invisible.

5+ Years SRE
$10.5K Monthly Savings
40% MTTR Reduced
100+ Microservices
3,000+ Alerts Optimised
50+ Nodes Migrated

Quick facts

Current Focus: Kubernetes Platform Engineering · GitOps · Cloud Security
Location: Bengaluru, Karnataka, India
Open To: Senior SRE · Staff DevOps · Platform Engineering
Education: BTech CSE · CET Bhubaneswar · CGPA 8.86
Publication: IEEE 2018 · TCA Task Scheduling Algorithm
Mentorship: topmate.io/sahilsahu246

What I don't do

I don't ship infrastructure that only works in demos.
I don't treat observability as a post-launch feature.
I don't build pipelines that teams are afraid to touch.
I don't hand off systems without runbooks and postmortems.

Experience & Education

The trajectory.

BTech Computer Science Engineering

CET Bhubaneswar

2017 – 2021 · Education

CGPA 8.86 · IEEE publication 2018 · Active IEEE student chapter member.

  • Graduated with CGPA 8.86 — consistently in top percentile of the batch

  • Published TCA Task Scheduling Algorithm paper at IEEE 2018 — peer-reviewed international conference proceedings

  • Proposed TCA (Task-Cluster Algorithm) benchmarked against FCFS and Round Robin — improved throughput and resource utilisation

  • Active IEEE student chapter member — organised 75+ technical events and workshops

  • Final year capstone: developed and benchmarked novel scheduling heuristics for distributed systems
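
The two baselines named above, FCFS and Round Robin, can be compared on average waiting time with a short sketch. It is illustrative only: the workload and the time quantum are hypothetical, every task is assumed to arrive at time zero, and TCA itself is not shown because its details are not given here.

```python
from collections import deque


def fcfs_waiting_times(bursts):
    """Waiting time per task when tasks run to completion in arrival order."""
    waits, clock = [], 0
    for burst in bursts:
        waits.append(clock)  # a task waits for everything queued ahead of it
        clock += burst
    return waits


def round_robin_waiting_times(bursts, quantum):
    """Waiting time per task when each turn lasts at most `quantum` time units."""
    if quantum <= 0:
        raise ValueError("quantum must be positive")
    remaining = list(bursts)
    finish = [0] * len(bursts)
    queue = deque(range(len(bursts)))
    clock = 0
    while queue:
        i = queue.popleft()
        run = min(quantum, remaining[i])
        clock += run
        remaining[i] -= run
        if remaining[i] > 0:
            queue.append(i)  # unfinished tasks go to the back of the queue
        else:
            finish[i] = clock
    # All tasks arrive at t=0, so waiting time = completion time - burst time.
    return [finish[i] - bursts[i] for i in range(len(bursts))]


def mean(values):
    return sum(values) / len(values)


bursts = [24, 3, 3]  # hypothetical workload: one long task ahead of two short ones
fcfs_avg = mean(fcfs_waiting_times(bursts))
rr_avg = mean(round_robin_waiting_times(bursts, quantum=4))
print(f"FCFS average wait: {fcfs_avg:.2f}")
print(f"Round Robin average wait: {rr_avg:.2f}")
```

With this workload the long first task delays everything behind it under FCFS, while Round Robin lets the short tasks finish early. Any new scheduler would be measured against that trade-off.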

Associate Software Engineer

Robert Bosch

2021 – 2022

Owned infra for 100+ apps including 20+ critical services ensuring high availability.

  • Spearheaded migration of infrastructure to CI/CD across build and production environments

  • Built Ansible automation for monthly release patch fixes across applications

  • Owned end-to-end infrastructure for 100+ apps — 20+ critical with HA guarantees

  • Configured error rate tracking and distributed tracing (APM) in Datadog and New Relic

CI/CD · Ansible · Datadog · New Relic · APM · Linux

Site Reliability Engineer 1

Zeta

2022 – 2024

Saved $10,500/month in cloud costs. Reduced p95 latency 3 s → 1 s.

  • Orchestrated cloud cost-saving measures — $10,500+ monthly savings

  • Led observability for 100+ microservices — SLIs, SLOs and error budgets defined

  • Removed 3,000+ redundant Prometheus alert rules — noise reduced significantly

  • Reduced p95 latency 3 s→1 s for Sodexo transactions via private endpoint migration

  • Created Jenkins seed jobs for deployment pipelines and payment-service token rotation

  • Migrated centralised logging ELK→OpenSearch for lower cost and better scalability
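
As a rough illustration of the SLI, SLO and error-budget terms used above, the arithmetic for a request-based availability SLO can be sketched as follows. The 99.9% target and the request counts are hypothetical, not figures from any of these services.

```python
def error_budget(slo_target, total_requests, failed_requests):
    """Summarise a request-based availability SLO over one window.

    slo_target is a fraction such as 0.999 (hypothetical example value).
    """
    if not 0 < slo_target < 1:
        raise ValueError("slo_target must be strictly between 0 and 1")
    if total_requests <= 0:
        raise ValueError("total_requests must be positive")

    # SLI: fraction of requests that succeeded in the window.
    sli = (total_requests - failed_requests) / total_requests
    # Error budget: how many failures the SLO tolerates in the window.
    allowed_failures = (1 - slo_target) * total_requests
    # Share of the budget still unspent; negative means the SLO is breached.
    budget_remaining = 1 - failed_requests / allowed_failures

    return {
        "sli": sli,
        "allowed_failures": allowed_failures,
        "budget_remaining": budget_remaining,
    }


report = error_budget(slo_target=0.999, total_requests=1_000_000, failed_requests=250)
print(f"SLI: {report['sli']:.4%}")
print(f"Allowed failures: {report['allowed_failures']:.0f}")
print(f"Budget remaining: {report['budget_remaining']:.0%}")
```

A negative `budget_remaining` is the usual signal to slow feature rollouts and spend time on reliability work instead.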

Kubernetes · Prometheus · Jenkins · ArgoCD · AWS · OpenSearch · Helm · S3

Site Reliability Developer 2

Oracle

2024 – Present

Raised security score 64%→85%. Reduced recurring P1 incidents by 40%.

  • Migrated 50+ on-premise CPDI nodes to OKE, OCI Compute and OCI Block Volume

  • Improved OCI IAM, Cloud Guard and Security Zones — security score 64%→85%

  • Configured OCI Monitoring, Alarms and Notifications via email, Slack and PagerDuty

  • Built Ansible automation to self-heal Kubernetes Operations Framework — toil ↓20%

  • Led blameless postmortems and RCA for P1 incidents — recurring issues ↓40%

OKE · OCI · Kubernetes · Ansible · IAM · Cloud Guard · Python · Monitoring

Impact

Proof, not promises.

Every number delivered, not projected.

Work

What the arc produced.

Personal Project

DevOps CI/CD Pipeline on AWS

Personal · 2024

Context

Needed a real-world CI/CD setup that mirrors what enterprise teams run — not tutorials, but production patterns with actual failure modes: memory constraints, ASG kill-loops, Java version mismatches.

AWS · Jenkins · Flask · EC2 · ASG · ALB · SSM · VPC

Approach

Built from scratch on a custom VPC. Jenkins master on EC2 with GitHub webhook triggers. ALB handles routing, ASG manages scaling. SSM Run Command replaces SSH for zero-touch deployments.
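
The SSH-free deployment step can be sketched roughly like this. It is a minimal sketch under assumptions, not the project's actual code: the tag key and value, the shell commands and the helper names are hypothetical, and `boto3` is a third-party dependency imported only when the command is actually sent.

```python
def build_run_command_request(tag_key, tag_value, commands, timeout_seconds=600):
    """Build keyword arguments for the SSM SendCommand API.

    Targeting by tag instead of instance ID means instances launched by the
    ASG are picked up without maintaining a host list.
    """
    return {
        "DocumentName": "AWS-RunShellScript",  # AWS-managed shell document
        "Targets": [{"Key": f"tag:{tag_key}", "Values": [tag_value]}],
        "Parameters": {"commands": list(commands)},
        "TimeoutSeconds": timeout_seconds,
        "Comment": "Deploy triggered from CI",
    }


def send_deploy(request):
    """Send the command through SSM; needs AWS credentials and boto3 installed."""
    import boto3  # third-party; imported lazily so the builder has no dependencies

    ssm = boto3.client("ssm")
    response = ssm.send_command(**request)
    return response["Command"]["CommandId"]


# Hypothetical tag and deploy commands, for illustration only.
request = build_run_command_request(
    tag_key="Role",
    tag_value="flask-app",
    commands=[
        "cd /opt/app",
        "git pull --ff-only",
        "sudo systemctl restart app",
    ],
)
print(request["Targets"])
```

Because the request targets a tag rather than fixed hosts, no SSH keys or open port 22 are needed on the instances. They only need the SSM agent and an instance profile that permits Systems Manager.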

View on GitHub

Personal Project

DevSecOps Pipeline · TravellerHub

Personal · 2024

Context

Most DevSecOps demos stop at scanning. This project layers every stage — build, scan, push, deploy, observe — on a real MERN app with real data, running on production-grade EKS.

EKS · ArgoCD · Jenkins · SonarQube · Trivy · MongoDB · Prometheus · Grafana

Approach

Jenkins CI master+worker on EC2. SonarQube for code quality, OWASP for dependency vulnerabilities, Trivy for container scanning. ArgoCD GitOps for deployment. Prometheus and Grafana for observability.

View on GitHub

Live Product

pylance.in — AI Tools Platform

Self-Built · 2025 – Present

Context

Career tools built by someone who actually went through SRE interviews and job hunting. The tools reflect what I wish I had — real ATS feedback, not generic advice.

Next.js · TypeScript · Claude API · Vercel · Tailwind · Razorpay

Approach

Next.js App Router with TypeScript. Claude API for AI generation. Mammoth and pdfjs for file parsing. Vercel for deployment. Razorpay for payment integration on premium tools.

View on GitHub

Stack

What I run in production.

Battle-tested under load.
Not just imported.

Container and Orchestration

Kubernetes (EKS, OKE) · Docker · Helm · ArgoCD (GitOps)

Cloud Platforms

AWS (EC2, S3, ALB) · Oracle Cloud (OCI) · Terraform · Ansible

CI/CD and GitOps

Jenkins · GitHub Actions · GitLab CI

Observability

Prometheus · Grafana · Datadog

Languages

Python · Go · Bash · Linux

Databases and Tooling

PostgreSQL · MySQL · Redis · MongoDB · Nginx

Contact

Hard problems welcome.

Open · SRE · Mentorship · DevOps

Book a 1:1 for resume reviews, SRE mentorship or DevOps guidance — or just to talk shop about Kubernetes, GitOps and keeping production quiet at 3am. I reply to every message.

© 2026 Sahil Kumar Sahu · pylance.in · Built with Next.js and Claude AI