Engineering
reliability at the
edge of scale.
SRE Technical Lead with 7+ years architecting, building and operating highly available, secure and scalable Kubernetes platforms across enterprise production. Deep expertise in DevSecOps, software supply chain security, edge security (F5 / WAF / DMZ), SLO governance and AI-assisted incident response — reducing MTTD and MTTR through disciplined automation and root-cause analysis.
A builder at the intersection of craft & systems.
I'm Viswa — an SRE Technical Lead operating large Kubernetes platforms in production. I own availability and scalability for ecosystems of 300+ cloud-native microservices powering mission-critical Telecom BSS, and I care about the texture of an on-call rotation as much as the architecture behind it.
Lately I've been deep in multi-cluster EKS operations, Istio service mesh patterns, DevSecOps supply-chain hardening, and AI-assisted runbook reviews that turn reliability into something measurable — error budgets engineering teams actually respect.
Things I've brought to life.
Kubernetes Reliability Platform
↗End-to-end SRE platform owning availability, scalability and reliability for an ecosystem of 300+ cloud-native microservices across multiple EKS clusters. Standardised Helm packaging, hardened RBAC, and instituted SLO error-budget governance across B2B, B2C and ISP domains.
DevSecOps Supply-Chain Pipeline
↗Integrated SonarQube, JFrog Xray and Guardrails into GitOps pipelines to automate code quality gates, vulnerability scanning and compliance validation. Secured the edge with F5 LTM, WAF and DMZ architecture, and executed continuous vulnerability remediation from internal and external scan reports.
Observability & AIOps Stack
↗Built enterprise observability with unified dashboards, alerting frameworks and logging pipelines across Prometheus, Grafana, Qryn, Kibana, Splunk and Datadog — improving end-to-end visibility and enabling AI-assisted runbook and playbook reviews that raised incident-response maturity.
Disaster Recovery Migration
↗Planned and executed DR migrations and failover validation for business-critical services, owning Change Management, CAB approvals, release governance and infrastructure risk assessments to deliver safe, auditable change with minimal business impact.
Where I've shipped.
- 01 · Apr 2021 — Present
Rakuten India Pvt Ltd
SRE Technical Lead · Bangalore, IndiaLead end-to-end SRE and platform operations for 300+ cloud-native microservices across multiple Kubernetes production clusters powering mission-critical Telecom BSS platforms (B2B, B2C, ISP). Administer AWS EKS end-to-end — provisioning, upgrades, capacity planning, zero/low-downtime migrations. Operate Istio service mesh, harden Kubernetes RBAC, govern SLA/SLI/SLO with error-budget discipline, and lead Major Incident bridges with structured RCA driving real MTTD/MTTR reductions.
AWS EKSKubernetesIstioHelmTerraformJenkinsJFrog XraySonarQubeF5 / WAFPrometheusGrafanaDatadogSplunk - 02 · Oct 2018 — Apr 2021
Photon Interactive Pvt Ltd
DevOps Engineer · Bangalore, IndiaDesigned enterprise CI/CD pipelines with Git, Jenkins, Maven, Ansible and Docker automating builds across Dev, QA, UAT, Pre-Prod and Production. Containerised Java, Python and Angular workloads and migrated applications onto Docker + Kubernetes. Authored Kubernetes manifests for Deployments, Services, Ingress, PVs and autoscaling, and provisioned AWS infra (EC2, S3, IAM, EFS, ECS/Fargate, VPC) with Terraform and Ansible.
AWSDockerKubernetesJenkinsTerraformAnsibleMavenSonarQubeNexusTomcat
Notes from the build.
Running 300+ Microservices on EKS Without Losing Sleep
What it actually takes to own reliability for an ecosystem this size — capacity planning, upgrade cadence, multi-cluster patterns and the error budgets that keep humans sane.
Istio in Production: Virtual Services, Ingress and the Failure Modes Nobody Mentions
Hard-won lessons from operating Istio across multi-cluster EKS — routing, mTLS, sidecar pitfalls, and how to debug a mesh at 2am.
Shifting Security Left: A DevSecOps Pipeline That Actually Blocks Bad Builds
Wiring SonarQube, JFrog Xray and Guardrails into GitOps so vulnerable code, leaked secrets and policy violations never reach production.