Senior Site Reliability Engineer (SRE) – Observability & Kubernetes (8–12 yrs)

Full Timesenior

Bengaluru, Karnataka, INPosted 8 weeks ago

Role Overview

Cisco is hiring a Senior Site Reliability Engineer (SRE) – Observability & Kubernetes (8–12 yrs). This is a full-time role in Bengaluru. Part of Cisco's Devops hiring. Full responsibilities, required qualifications, and the apply link are listed in the description below.

Resume Keywords to Include

Make sure these keywords appear in your resume to improve ATS scoring

PythonBashAWSDockerKubernetesTerraformLinuxUnix

Job Description

Meet the Team

Cisco’s Cloud Collaboration Technology Group (CCTG) builds and operates large-scale, cloud-native collaboration platforms including Webex. The team focuses on delivering highly reliable, observable, and scalable infrastructure powering millions of users globally. You will work at the intersection of platform engineering, SRE, and observability, enabling engineering teams to operate resilient distributed systems.

Your Impact

As a Cloud Engineer (Grade 10), you will design and operate observability and infrastructure platforms supporting Webex microservices at scale. This role combines deep hands-on engineering with production ownership, where you will independently drive reliability, automation, and performance improvements across distributed systems.

Design, build, and operate observability platforms (logging, metrics, tracing) for microservices
Manage and optimize Kubernetes clusters across multi-region production environments
Own and enhance CI/CD pipelines using Argo CD, Helm, and GitOps workflows
Implement and manage infrastructure-as-code using Terraform on AWS
Operate and scale monitoring ecosystems (OpenSearch/ELK, Prometheus, Grafana, Splunk, Kafka)
Build automation for proactive detection and remediation of production issues
Lead incident response, participate in on-call rotations, and drive post-incident improvements
Ensure system security and compliance through patching and vulnerability management
Collaborate with cross-functional teams to improve system reliability and scalability
Contribute to distributed system design and platform engineering initiatives

Core Technical Skills:

Kubernetes administration
CI/CD with Argo CD and Helm
Docker and container ecosystems
Terraform or IaC tools
Kafka or streaming systems
Linux/Unix expertise
Monitoring and alerting systems

Minimum Qualifications

As a part of core tech 90% of our work in based out of these skills.

8+ years of experience in DevOps, SRE, or platform engineering roles in production environments
Hands-on experience operating Kubernetes at scale (multi-cluster, thousands of pods, Helm-based deployments)
Strong expertise in observability tools (at least two): Prometheus, Grafana, OpenSearch/Elasticsearch, Splunk, Loki, or Logstash
Proven experience with Infrastructure-as-Code (Terraform or equivalent) on AWS
Proficiency in scripting or programming (Python, Golang, or Bash) for automation and CI/CD integration

Preferred Qualifications

Experience managing Kafka / AWS MSK clusters and high-throughput streaming systems
Hands-on experience with OpenTelemetry and distributed tracing frameworks
Familiarity with security standards (ISO 27001, SOC 2, FedRAMP) and container hardening tools
Experience with GitOps workflows (Argo CD), Helm bundles, and progressive delivery (canary/blue-green)
Experience using AI tools (Copilot, Claude, LLM-based agents) for code generation, troubleshooting, or incident automation

#WeAreCisco

#WeAreCisco where every individual brings their unique skills and perspectives together to pursue our purpose of powering an inclusive future for all.

Our passion is connection—we celebrate our employees’ diverse set of backgrounds and focus on unlocking potential. Cisconians often experience one company, many careers where learning and development are encouraged and supported at every stage. Our technology, tools, and culture pioneered hybrid work trends, allowing all to not only give their best, but be their best.

We understand our outstanding opportunity to bring communities together and at the heart of that is our people. One-third of Cisconians collaborate in our 30 employee resource organizations, called Inclusive Communities, to connect, foster belonging, learn to be informed allies, and make a difference. Dedicated paid time off to volunteer—80 hours each year—allows us to give back to causes we are passionate about, and nearly 86% do!

Our purpose, driven by our people, is what makes us the worldwide leader in technology that powers the internet. Helping our customers reimagine their applications, secure their enterprise, transform their infrastructure, and meet their sustainability goals is what we do best. We ensure that every step we take is a step towards a more inclusive future for all. Take your next step and be you, with us!

Why Cisco?

At Cisco, we’re revolutionizing how data and infrastructure connect and protect organizations in the AI era – and beyond. We’ve been innovating fearlessly for 40 years to create solutions that power how humans and technology work together across the physical and digital worlds. These solutions provide customers with unparalleled security, visibility, and insights across the entire digital footprint.

Fueled by the depth and breadth of our technology, we experiment and create meaningful solutions. Add to that our worldwide network of doers and experts, and you’ll see that the opportunities to grow and build are limitless. We work as a team, collaborating with empathy to make really big things happen on a global scale. Because our solutions are everywhere, our impact is everywhere.

We are Cisco, and our power starts with you.

Frequently Asked Questions

How do I apply for the Senior Site Reliability Engineer (SRE) – Observability & Kubernetes (8–12 yrs) position at Cisco?

Use the Apply button above to submit your application directly to Cisco. Most applications take less than 5 minutes if your resume and contact details are ready, and you'll be routed to the employer's official application system to finish.

Where is the Senior Site Reliability Engineer (SRE) – Observability & Kubernetes (8–12 yrs) position at Cisco located?

This position is based in Bengaluru. Cisco has not indicated remote or hybrid options for this role, so candidates should plan for on-site work.

What does a Senior Site Reliability Engineer (SRE) – Observability & Kubernetes (8–12 yrs) at Cisco earn?

Cisco has not disclosed a salary range in this posting. Many employers share specifics later in the interview process; you can also ask during a recruiter screen if compensation transparency is important to you.

When was the Senior Site Reliability Engineer (SRE) – Observability & Kubernetes (8–12 yrs) role at Cisco posted?

This role was posted on April 29, 2026 (56 days ago). It's still listed as actively hiring; we re-confirm openings against the source system multiple times per day and remove closed roles.

How much experience does the Senior Site Reliability Engineer (SRE) – Observability & Kubernetes (8–12 yrs) role at Cisco require?

This is a senior-level position. Most senior roles call for 5+ years of directly relevant experience. Cisco lists their specific requirements in the description below, so review the must-have qualifications closely before applying.

Browse Remote DevOps Engineer Jobs →

AI-powered job search

Get every job scored to your resume

Upload your resume and get jobs ranked, your resume tailored, and employee contacts found automatically.

Get Started Free

No credit card to start