Skip to main content
Cisco logo

Senior Site Reliability Engineer (SRE) – Observability & Kubernetes (8–12 yrs)

Cisco
Full Timesenior
Bengaluru, Karnataka, INPosted April 29, 2026

Resume Keywords to Include

Make sure these keywords appear in your resume to improve ATS scoring

PythonBashAWSDockerKubernetesTerraformLinuxUnixElasticsearchKafkaCI/CDDevOpsMicroservices

Sign up free to auto-tailor your resume with all these keywords and get a higher ATS score

Job Description

Meet the Team

Cisco’s Cloud Collaboration Technology Group (CCTG) builds and operates large-scale, cloud-native collaboration platforms including Webex. The team focuses on delivering highly reliable, observable, and scalable infrastructure powering millions of users globally. You will work at the intersection of platform engineering, SRE, and observability, enabling engineering teams to operate resilient distributed systems.

Your Impact

As a Cloud Engineer (Grade 10), you will design and operate observability and infrastructure platforms supporting Webex microservices at scale. This role combines deep hands-on engineering with production ownership, where you will independently drive reliability, automation, and performance improvements across distributed systems.

  • Design, build, and operate observability platforms (logging, metrics, tracing) for microservices
  • Manage and optimize Kubernetes clusters across multi-region production environments
  • Own and enhance CI/CD pipelines using Argo CD, Helm, and GitOps workflows
  • Implement and manage infrastructure-as-code using Terraform on AWS
  • Operate and scale monitoring ecosystems (OpenSearch/ELK, Prometheus, Grafana, Splunk, Kafka)
  • Build automation for proactive detection and remediation of production issues
  • Lead incident response, participate in on-call rotations, and drive post-incident improvements
  • Ensure system security and compliance through patching and vulnerability management
  • Collaborate with cross-functional teams to improve system reliability and scalability
  • Contribute to distributed system design and platform engineering initiatives

Core Technical Skills:

  • Kubernetes administration
  • CI/CD with Argo CD and Helm
  • Docker and container ecosystems
  • Terraform or IaC tools
  • Kafka or streaming systems
  • Linux/Unix expertise
  • Monitoring and alerting systems

Minimum Qualifications

As a part of core tech 90% of our work in based out of these skills.

  • 8+ years of experience in DevOps, SRE, or platform engineering roles in production environments
  • Hands-on experience operating Kubernetes at scale (multi-cluster, thousands of pods, Helm-based deployments)
  • Strong expertise in observability tools (at least two): Prometheus, Grafana, OpenSearch/Elasticsearch, Splunk, Loki, or Logstash
  • Proven experience with Infrastructure-as-Code (Terraform or equivalent) on AWS
  • Proficiency in scripting or programming (Python, Golang, or Bash) for automation and CI/CD integration

Preferred Qualifications

  • Experience managing Kafka / AWS MSK clusters and high-throughput streaming systems
  • Hands-on experience with OpenTelemetry and distributed tracing frameworks
  • Familiarity with security standards (ISO 27001, SOC 2, FedRAMP) and container hardening tools
  • Experience with GitOps workflows (Argo CD), Helm bundles, and progressive delivery (canary/blue-green)
  • Experience using AI tools (Copilot, Claude, LLM-based agents) for code generation, troubleshooting, or incident automation

#WeAreCisco

#WeAreCisco where every individual brings their unique skills and perspectives together to pursue our purpose of powering an inclusive future for all.

Our passion is connection—we celebrate our employees’ diverse set of backgrounds and focus on unlocking potential. Cisconians often experience one company, many careers where learning and development are encouraged and supported at every stage. Our technology, tools, and culture pioneered hybrid work trends, allowing all to not only give their best, but be their best.

We understand our outstanding opportunity to bring communities together and at the heart of that is our people. One-third of Cisconians collaborate in our 30 employee resource organizations, called Inclusive Communities, to connect, foster belonging, learn to be informed allies, and make a difference. Dedicated paid time off to volunteer—80 hours each year—allows us to give back to causes we are passionate about, and nearly 86% do!

Our purpose, driven by our people, is what makes us the worldwide leader in technology that powers the internet. Helping our customers reimagine their applications, secure their enterprise, transform their infrastructure, and meet their sustainability goals is what we do best. We ensure that every step we take is a step towards a more inclusive future for all. Take your next step and be you, with us!

Why Cisco?

At Cisco, we’re revolutionizing how data and infrastructure connect and protect organizations in the AI era – and beyond. We’ve been innovating fearlessly for 40 years to create solutions that power how humans and technology work together across the physical and digital worlds. These solutions provide customers with unparalleled security, visibility, and insights across the entire digital footprint.

Fueled by the depth and breadth of our technology, we experiment and create meaningful solutions. Add to that our worldwide network of doers and experts, and you’ll see that the opportunities to grow and build are limitless. We work as a team, collaborating with empathy to make really big things happen on a global scale. Because our solutions are everywhere, our impact is everywhere.

We are Cisco, and our power starts with you.

Want AI-powered job matching?

Upload your resume and get every job scored, your resume tailored, and hiring manager emails found - automatically.

Get Started Free