Senior DevOps / SRE Engineer
Vaco by HighspringResume Keywords to Include
Make sure these keywords appear in your resume to improve ATS scoring
Sign up free to auto-tailor your resume with all these keywords and get a higher ATS score
Job Description
We are looking for a DevOps / Site Reliability Engineer to help design, automate, and maintain scalable cloud infrastructure and applications. The role focuses on automation, reliability, observability, and CI/CD pipelines, primarily within an Azure cloud environment.
You will work closely with engineering teams to improve platform reliability, reduce operational toil, and build efficient automation frameworks that support modern cloud-native applications.
Key Responsibilities
- Design and develop automation tools and applications using Python
- Build and enhance the cloud automation framework, starting with Azure infrastructure
- Integrate automation processes into CI/CD pipelines (GitHub Actions, Jenkins)
- Develop proof-of-concepts (POCs) to test and introduce new technologies or solutions
- Troubleshoot and resolve production issues across cloud and on-premise environments
- Participate in the full Software Development Life Cycle (SDLC): analysis, design, development, testing, and deployment
- Evaluate and implement new DevOps tools and best practices
- Implement observability and monitoring solutions for cloud platforms
- Improve system efficiency by automating manual tasks and reducing operational toil
Required Skills
- Strong Python development experience
- Hands-on experience with Infrastructure as Code (Terraform, Ansible)
- Experience building and maintaining CI/CD pipelines (GitHub Actions, Jenkins)
- Solid understanding of object-oriented programming and software development principles
- Strong experience working in Linux/Unix environments
- Experience with NoSQL databases, including data modeling and performance tuning
- Ability to write clean, reusable, well-documented, and maintainable code
- Experience implementing observability tools such as Prometheus, Grafana, or OpenTelemetry
Technology Stack
Cloud: Azure
Programming: Python
Infrastructure as Code: Terraform, Ansible
CI/CD: GitHub Actions, Jenkins
Observability: Prometheus, Grafana, OpenTelemetry
Systems: Linux
Databases: NoSQL
Key Focus Areas
- Cloud automation
- Platform reliability
- CI/CD pipelines
- Observability and monitoring
Similar Jobs
Site Reliability Engineer – Cloud, CI/CD
Braintrust
DevOps Engineer
freelance.ca
Sr Machine Learning Engineer
The Walt Disney Company (Corporate)
Entry Level Software Engineer w/ Java at Onyx Point, Inc. Hanover, MD
Itlearn360
ETL Developer (SSIS & Healthcare Domain) (Delhi)
Blutic
More Jobs at Vaco by Highspring
View all →Want AI-powered job matching?
Upload your resume and get every job scored, your resume tailored, and hiring manager emails found - automatically.
Get Started Free