Resume Keywords to Include
Make sure these keywords appear in your resume to improve ATS scoring
Sign up free to auto-tailor your resume with all these keywords and get a higher ATS score
Job Description
Job Description
At Boeing, we innovate and collaborate to make the world a better place. We’re committed to fostering an environment for every teammate that’s welcoming, respectful and inclusive, with great opportunity for professional growth. Find your future with us.
Boeing Vancouver is embarking on an exciting journey to modernize and migrate our systems to the cloud. We are seeking a skilled Site Reliability Engineer to join our Defence & Government Services team.
This position will focus on supporting the Boeing Global Services (BGS) business organization. This new SRE role will bridge the gap between traditional software engineering and operations to create highly scalable and fault-tolerant systems. As a result, you will ensure the reliable and efficient operation of Boeing Vancouver’s systems and services within the Defence & Government Services Portfolio. The position will be based out of our Richmond BC office, offering a flexible hybrid work style that allows for both virtual and in-office work.
As a Site Reliability Engineer at Boeing, you will play a pivotal role in streamlining our development and operations processes to ensure seamless software delivery and infrastructure management. You will collaborate closely with our development, architecture, and analytics to build and maintain robust systems, automate deployment pipelines, and optimize performance, reliability, and scalability of our applications.
Position Responsibilities:
- Design, build, and maintain scalable and highly available infrastructure and processes using modern DevOps practices.
- Deploy and support customer installations, ensuring a smooth setup and integration of our hybrid multi-tenant SaaS solutions into their environments
- Provide both reactive and proactive support to customers, addressing issues as they arise and implementing strategies to prevent future incidents.
- Lead incident response efforts, perform root cause analysis, and implement preventive measures to minimize downtime and service disruptions.
- Develop and enhance automation tools and scripts to streamline operations, reduce manual intervention, and improve efficiency.
- Set up and manage monitoring and alerting systems to proactively identify and resolve performance issues.
- Analyze system capacity and performance metrics to forecast future needs and ensure scalability of services.
- Collaborate with cross-functional teams to identify and implement new tools, technologies, and processes to enhance DevOps practices.
- Implement and advocate for “security best practices” to protect our applications and customer data.
- Pioneer and support special projects.
- Serve as a go-to resource for tools development and process improvement, with the ability to write small custom web-based tools for internal use.
- Create and maintain comprehensive documentation for systems, processes, and incident response procedures.
- Effectively contribute to building the overall knowledge and expertise of the technical team.
- Conduct training sessions and provide mentorship to team members and other departments. Focus on teaching others to enable them to take on newer and bigger tasks, fostering a culture of continuous learning and improvement.
- Be available to support emergencies via a Boeing provided mobile device.
- Monitoring and improving the availability and reliability of applications.
- Optimizing infrastructure and enhancing performance.
- Developing tools and scripts to automate repetitive tasks, such as deployment, monitoring, and scaling.
- Quickly addressing failures or outages and implementing solutions to prevent recurrence.
- Proactively analyzing and improving system performance to meet service level targets.
- Working closely with development and operations teams to ensure seamless integration.
Basic Qualifications (Required Skills/Experience):
- 7+ years in software development or advanced technical support role.
- 5+ years of experience in site reliability engineering, DevOps, or a related role.
- Proven experience in site reliability engineering, DevOps, or a related role, with a track record of successfully implementing and managing infrastructure and deployment pipelines.
- Candidate must be eligible for authorization under the Canadian Government Controlled Goods Program (CGP) assessment.
- Must be able to obtain Canadian Secret Level II Security Clearance.
- Must be legally able to work in Canada.
- Individuals must not pose a risk for safeguarding of controlled goods.
- Must be eligible to handle US export-controlled data.
- Fluency in English language.
Preferred Qualifications (Education/Experience):
- Strong proficiency in programming/scripting languages such as Python, Go, or Bash.
- Extensive experience with cloud platforms (e.g., AWS, Azure) and container orchestration technologies (e.g., Kubernetes, Docker).
- Familiarity with monitoring and observability tools (e.g., Prometheus, Grafana, AWS CloudWatch).
•
Similar Jobs
Palantir Data Engineer - 4+ Years - Pan India
Crescendo Global
DevOps Engineer - TS/SCI
Leidos
Azure DevOps Automated Manual Tester New York NY
AHU Technologies Inc
Salary $150K - Azure Build Engineer (.NET Azure DevOps) - WA
Bellatrix Systems
Zoom AI DevOps Engineer
Zoom
More Jobs at Boeing
View all →Senior Human Resources Data Scientist
Boeing
Experienced Software Engineer - AI-ML (GenAI)
Boeing
Senior Human Resources Data Scientist
Boeing
Embedded Software Engineers (Associate/Experienced/Senior)
Boeing
Senior Human Resources Data Scientist
Boeing
Want AI-powered job matching?
Upload your resume and get every job scored, your resume tailored, and hiring manager emails found - automatically.
Get Started Free