Job Description
Senior Site Reliability Engineer (Full Time, Contract; Hybrid in Glendale, CA)
Optomi, in partnership with one of our clients, is seeking a Senior Site Reliability Engineer to design, build, and support large-scale AWS infrastructure powering both internal and customer-facing applications. This role focuses on maintaining highly available production environments, troubleshooting across the full infrastructure and application stack, and strengthening automation, CI/CD, and monitoring capabilities. The ideal candidate brings deep AWS expertise, strong Linux systems experience, and a proven ability to support complex environments while partnering across engineering teams to improve reliability, scalability, and operational performance.
Responsibilities
- Design, implement, and support scalable AWS infrastructure for internal and customer-facing applications
- Maintain and optimize highly available production environments supporting large-scale workloads
- Troubleshoot complex issues across the full infrastructure and application stack (network, OS, application, database, storage, IAM)
- Build, maintain, and enhance CI/CD pipelines to support reliable and efficient deployments
- Implement and manage monitoring, alerting, and observability solutions to ensure system performance and uptime
- Automate infrastructure and operational processes using scripting and Infrastructure as Code
- Partner with engineering and cross-functional teams while providing technical guidance to improve system reliability, scalability, and performance
Apply today if your background includes:
- 10+ years of experience in SRE, DevOps, or senior systems engineering roles supporting large-scale production environments
- Expert-level AWS experience building and supporting environments using VPC, EC2, S3, Fargate, Lambda, CloudFront, ALB/ELB, IAM, and RDS
- Proven ability to troubleshoot across the full infrastructure and application stack (network, OS, application, database, storage, IAM)
- Strong Linux server/OS administration experience in high-availability environments
- Hands-on experience building and supporting CI/CD pipelines (GitLab CI, Jenkins, or similar)
- Experience with Infrastructure as Code and automation tools (Terraform preferred)
- Monitoring and observability experience using tools such as Datadog, New Relic, or CloudWatch