Skip to main content
Five9 logo

Senior Site Reliability Engineer (SRE)

Five9
Full TimeseniorRemote
India (Remote)RemotePosted 21 days ago

Resume Keywords to Include

Make sure these keywords appear in your resume to improve ATS scoring

PythonJavaShellAWSGCPAzureDockerKubernetesTerraformAnsibleGitHub ActionsLinuxGitGitHubGitLabCI/CDDevOpsMicroservices

Sign up free to auto-tailor your resume with all these keywords and get a higher ATS score

Job Description

<div class="content-intro"><p><img src="https://www.five9.com/sites/default/files/2025-02/five9-logo.svg" alt="" width="100" style="max-width: 100%;"></p> <p>Join us in bringing joy to customer experience. &nbsp;Five9 is a leading provider of cloud contact center software, bringing the power of cloud innovation to customers worldwide. &nbsp;&nbsp;</p> <p>Living our values everyday results in our team-first culture and enables us to innovate, grow, and thrive while enjoying the journey together. We celebrate diversity and foster an inclusive environment, empowering our employees to be their authentic selves.&nbsp;</p></div><p data-start="619" data-end="936"><strong>In this SRE role,</strong> you will focus on the foundational work required to modernize our application deployments. The immediate priority is not deep application code integration, but rather tackling technical debt and enhancing <strong>our legacy Linux-based systems</strong>. This requires<strong> strong Linux system administration</strong> and problem-solving skills to ensure stability during our transition to cloud-native workflows. The software development portion of this role is centered on creating internal tools to improve system management, automate operational tasks, and build out our observability stack. Your success in this area is critical for establishing meaningful SLIs and achieving our reliability targets.</p> <hr data-start="1254" data-end="1257"> <h4 data-start="1259" data-end="1289"><strong data-start="1263" data-end="1287">Key Responsibilities</strong></h4> <h4 data-start="1291" data-end="1328"><strong data-start="1296" data-end="1326">Observability &amp; Monitoring</strong></h4> <ul data-start="1329" data-end="2008"> <li data-start="1329" data-end="1523"> <p data-start="1331" data-end="1523"><strong data-start="1331" data-end="1356">Dashboards &amp; Metrics:</strong> Design and implement comprehensive dashboards covering OS/platform-level and application-level monitoring, broken into primary (RED) and secondary indicators (USE).</p> </li> <li data-start="1524" data-end="1629"> <p data-start="1526" data-end="1629"><strong data-start="1526" data-end="1557">Availability &amp; Reliability:</strong> Establish and maintain SLIs, SLOs, and error budgets for the service.</p> </li> <li data-start="1630" data-end="1780"> <p data-start="1632" data-end="1780"><strong data-start="1632" data-end="1659">Performance Monitoring:</strong> Build alerting systems and performance monitoring to proactively identify and resolve issues before they impact users.</p> </li> <li data-start="1781" data-end="2008"> <p data-start="1783" data-end="2008"><strong data-start="1783" data-end="1805">Incident Response:</strong> Participate in on-call rotations, lead incident response efforts (including post-mortem analysis and remediation), maintain on-call routing, and assign application-level problems to engineering teams.</p> </li> </ul> <h4 data-start="2010" data-end="2059"><strong data-start="2015" data-end="2057">Infrastructure Automation &amp; Deployment</strong></h4> <ul data-start="2060" data-end="2446"> <li data-start="2060" data-end="2155"> <p data-start="2062" data-end="2155"><strong data-start="2062" data-end="2092">CI/CD Pipeline Management:</strong> Build and optimize CI/CD pipelines for speed and resilience.</p> </li> <li data-start="2156" data-end="2272"> <p data-start="2158" data-end="2272"><strong data-start="2158" data-end="2185">Infrastructure as Code:</strong> Develop and maintain infrastructure using tools like Terraform, Ansible, or similar.</p> </li> <li data-start="2273" data-end="2446"> <p data-start="2275" data-end="2446"><strong data-start="2275" data-end="2304">Configuration Management:</strong> Automate system configuration and ensure consistency across environments. Implement and recommend best practices for configuration control.</p> </li> </ul> <h4 data-start="2448" data-end="2480"><strong data-start="2453" data-end="2478">Security &amp; Compliance</strong></h4> <ul data-start="2481" data-end="2879"> <li data-start="2481" data-end="2593"> <p data-start="2483" data-end="2593"><strong data-start="2483" data-end="2507">Security Automation:</strong> Ensure security scanning systems are in place and review escalated vulnerabilities.</p> </li> <li data-start="2594" data-end="2691"> <p data-start="2596" data-end="2691"><strong data-start="2596" data-end="2615">Access Control:</strong> Maintain proper authentication, authorization, and audit logging systems.</p> </li> <li data-start="2692" data-end="2776"> <p data-start="2694" data-end="2776"><strong data-start="2694" data-end="2719">Compliance Reporting:</strong> Ensure systems meet regulatory and industry standards.</p> </li> <li data-start="2777" data-end="2879"> <p data-start="2779" data-end="2879"><strong data-start="2779" data-end="2810">Security Incident Response:</strong> Participate in security incident response and remediation efforts.</p> </li> </ul> <h4 data-start="2881" data-end="2909"><strong data-start="2886" data-end="2907">Cost Optimization</strong></h4> <ul data-start="2910" data-end="3269"> <li data-start="2910" data-end="2991"> <p data-start="2912" data-end="2991"><strong data-start="2912" data-end="2936">Resource Management:</strong> Monitor and optimize cloud resource usage and costs.</p> </li> <li data-start="2992" data-end="3077"> <p data-start="2994" data-end="3077"><strong data-start="2994" data-end="3016">Capacity Planning:</strong> Analyze usage patterns and plan for future capacity needs.</p> </li> <li data-start="3078" data-end="3181"> <p data-start="3080" data-end="3181"><strong data-start="3080" data-end="3098">Cost Analysis:</strong> Provide recommendations for cost-effective architecture and resource allocation.</p> </li> <li data-start="3182" data-end="3269"> <p data-start="3184" data-end="3269"><strong data-start="3184" data-end="3201">Right-sizing:</strong> Implement automated scaling and resource optimization strategies.</p> </li> </ul> <h4 data-start="3271" data-end="3320"><strong data-start="3276" data-end="3318">Common Services &amp; Platform Engineering</strong></h4> <ul data-start="3321" data-end="3814"> <li data-start="3321" data-end="3465"> <p data-start="3323" data-end="3465"><strong data-start="3323" data-end="3349">Shared Infrastructure:</strong> Build and maintain common services (notification systems, caching layers, message queues, or third-party stacks).</p> </li> <li data-start="3466" data-end="3581"> <p data-start="3468" data-end="3581"><strong data-start="3468" data-end="3492">Database Operations:</strong> Manage database reliability, performance, and scaling (where not handled by DB teams).</p> </li> <li data-start="3582" data-end="3696"> <p data-start="3584" data-end="3696"><strong data-start="3584" data-end="3614">Service Mesh &amp; Networking:</strong> Implement and maintain service discovery, load balancing, and network policies.</p> </li> <li data-start="3697" data-end="3814"> <p data-start="3699" data-end="3814"><strong data-start="3699" data-end="3719">Developer Tools:</strong> Create and maintain tools and platforms that improve developer productivity and reliability.</p> </li> </ul> <hr data-start="3816" data-end="3819"> <h4 data-start="3821" data-end="3854"><strong data-start="3825" data-end="3852">Required Qualifications</strong></h4> <h4 data-start="3856" data-end="3883"><strong data-start="3861" data-end="3881">Technical Skills: <br></strong>Experience level: 7+ years</h4> <p>- Demonstrated experience managing Production Linux virtual machine deployments of 50+ virtual machines.</p> <p>- Programming Languages: Proficiency in at least two of Python, Shell, Java, NodeJS, or similar.</p> <p>- Cloud Platforms: Experience with AWS, GCP, or Azure.</p> <p>- Containerization: Hands-on experience with Docker, Kubernetes, and container orchestration.</p> <p>- Monitoring &amp; Observability: Experience with Prometheus, Grafana, ELK stack, or similar tools</p> <p>- Infrastructure as Code: Proficiency with Ansible, Terraform, Helm, or similar</p> <p>- Version Control: Expert-level Git usage and collaborative development practices.</p> <p>- CI/CD Pipelines: Hands-on experience with GitLab CI/CD, GitHub Actions, or similar.</p> <h4 data-start="4519" data-end="4552"><strong data-start="4524" data-end="4550">SRE-Specific Knowledge</strong></h4> <ul data-start="4553" data-end="4791"> <li data-start="4553" data-end="4607"> <p data-start="4555" data-end="4607">Experience defining and maintaining SLOs and SLIs.</p> </li> <li data-start="4608" data-end="4670"> <p data-start="4610" data-end="4670">Understanding and implementation of error budget policies.</p> </li> <li data-start="4671" data-end="4728"> <p data-start="4673" data-end="4728">Proven track record in toil reduction and automation.</p> </li> <li data-start="4729" data-end="4791"> <p data-start="4731" data-end="4791">Experience with capacity planning and performance testing.</p> </li> </ul> <hr data-start="4793" data-end="4796"> <h4 data-start="4798" data-end="4832"><strong data-start="4802" data-end="4830">Preferred Qualifications</strong></h4> <ul data-start="4833" data-end="5237"> <li data-start="4833" data-end="4914"> <p data-start="4835" data-end="4914">Bachelor’s degree in Computer Science, Engineering, or equivalent experience.</p> </li> <li data-start="4915" data-end="4973"> <p data-start="4917" data-end="4973">Experience with microservices and distributed systems.</p> </li> <li data-start="4974" data-end="5041"> <p data-start="4976" data-end="5041">Knowledge of security best practices and compliance frameworks.</p> </li> <li data-start="5042" data-end="5104"> <p data-start="5044" data-end="5104">Experience with chaos engineering and reliability testing.</p> </li> <li data-start="5105" data-end="5169"> <p data-start="5107" data-end="5169">Prior experience in an SRE or DevOps role at a tech company.</p> </li> <li data-start="5170" data-end="5237"> <p data-start="5172" data-end="5237">Contributions to open-source projects or technical communities.</p> </li> </ul> <hr data-start="5239" data-end="5242"> <h4 data-start="5244" data-end="5269"><strong data-start="5248" data-end="5267">Success Metrics</strong></h4> <ul data-start="5270" data-end="5607"> <li data-start="5270" data-end="5339"> <p data-start="5272" data-end="5339">Maintain or improve service availability and reliability metrics.</p> </li> <li data-start="5340" data-end="5413"> <p data-start="5342" data-end="5413">Demonstrated reduction in manual operational work through automation.</p> </li> <li data-start="5414" data-end="5478"> <p data-start="5416" data-end="5478">Effective participation in incident response and prevention.</p> </li> <li data-start="5479" data-end="5528"> <p data-start="5481" data-end="5528">High-quality, well-tested code contributions.</p> </li> <li data-start="5529" data-end="5607"> <p data-start="5531" data-end="5607">Strong collaboration with development teams to improve system reliability.</p> </li> </ul> <hr data-start="5609" data-end="5612"> <h4 data-start="5614" data-end="5645"><strong data-start="5618" data-end="5643">Team Culture &amp; Values</strong></h4> <ul data-start="5646" data-end="6001"> <li data-start="5646" data-end="5712"> <p data-start="5648" data-end="5712"><strong data-start="5648" data-end="5675">Blameless Post-Mortems:</strong> Learn from failures without blame.</p> </li> <li data-start="5713" data-end="5788"> <p data-start="5715" data-end="5788"><strong data-start="5715" data-end="5736">Automation First:</strong> Prefer automated solutions over manual processes.</p> </li> <li data-start="5789" data-end="5866"> <p data-start="5791" data-end="5866"><strong data-start="5791" data-end="5814">Measure Everything:</strong> Data-driven decisions and continuous improvement.</p> </li> <li data-start="5867" data-end="5923"> <p data-start="5869" data-end="5923"><strong data-start="5869" data-end="5891">Knowledge Sharing:</strong> Document and share expertise.</p> </li> <li data-start="5924" data-end="6001"> <p data-start="5926" data-end="6001"><strong data-start="5926" data-end="5948">Work-Life Balance:</strong> Sustainable on-call practices and reasonable load.</p> </li> </ul> <hr data-start="6003" data-end="6006"> <h4 data-start="6008" data-end="6038"><strong data-start="6012" data-end="6036">Growth Opportunities</strong></h4> <ul data-start="6039" data-end="6329"> <li data-start="6039" data-end="6106"> <p data-start="6041" data-end="6106">Work on cutting-edge infrastructure and reliability challenges.</p> </li> <li data-start="6107" data-end="6185"> <p data-start="6109" data-end="6185">Exposure to large-scale distributed systems and modern cloud technologies.</p> </li> <li data-start="6186" data-end="6263"> <p data-start="6188" data-end="6263">Clear career path toward Senior SRE, Staff Engineer, or Management roles.</p> </li> <li data-start="6264" data-end="6329"> <p data-start="6266" data-end="6329">Collaboration with engineering teams across the organization.</p> </li> </ul> <p>&nbsp;</p><div class="content-conclusion"><p><span data-contrast="auto">Five9 embraces diversity and is committed to building a team that represents a variety of backgrounds, perspectives, and skills.  The more inclusive we are, the better we are.  Five9 is an equal opportunity employer.</span><span data-ccp-props="{&quot;201341983&quot;:0,&quot;335559739&quot;:160,&quot;335559740&quot;:240}">&nbsp;</span></p> <hr> <p><span data-ccp-props="{&quot;201341983&quot;:0,&quot;335559739&quot;:160,&quot;335559740&quot;:240}">View our privacy policy, including our privacy notice to California residents here:&nbsp;<a href="https://www.five9.com/pt-pt/legal" target="_blank">https://www.five9.com/pt-pt/legal</a>.&nbsp;&nbsp;<br></span></p> <p><span data-ccp-props="{&quot;201341983&quot;:0,&quot;335559739&quot;:160,&quot;335559740&quot;:240}">Note: Five9 will never request that an applicant send money as a prerequisite for commencing employment with Five9.</span></p></div>

About Five9

Five9 logo

Five9

five9.com

DevopsHires remote

Want AI-powered job matching?

Upload your resume and get every job scored, your resume tailored, and hiring manager emails found - automatically.

Get Started Free