Site Reliability Engineer (SRE) – AI / GenAI Infrastructure
Ardent SoftSol Inc.Resume Keywords to Include
Make sure these keywords appear in your resume to improve ATS scoring
Sign up free to auto-tailor your resume with all these keywords and get a higher ATS score
Job Description
🚀 Hiring: Site Reliability Engineer (SRE) – AI / GenAI Infrastructure
📍 Montreal, Quebec, Canada
🏢 Hybrid – 3 Days In Office
🤝 Contract Role
🧑 💻 Experience: 8+ Years
🗓 Face-to-Face Interview Required
We’re looking for a Senior SRE with AI/GenAI platform experience to operate and scale infrastructure supporting large-scale AI workloads (training, inference, model serving, GPU clusters).
🔑 Key Skills (Must-Have):
- Production SRE / Infrastructure Operations (Large-Scale Systems)
- Kubernetes & Docker (Containerization & Orchestration)
- Infrastructure as Code (Terraform, Helm, Ansible, CloudFormation)
- Strong Programming (Python / Go / Java)
- Monitoring & Observability (Prometheus, Grafana, ELK, Datadog)
- GPU / AI Compute Clusters & Distributed Systems
- Networking & Systems Engineering (TCP/IP, DNS, Load Balancing)
- Incident Response, RCA & Reliability Engineering
🛠️ What You’ll Do:
✔ Operate & maintain infrastructure for GenAI platforms
✔ Build automation to reduce operational toil
✔ Manage Kubernetes clusters, GPU compute & distributed storage
✔ Define SLOs/SLIs, error budgets & dashboards
✔ Lead incident response & postmortems
✔ Drive cost optimization, scaling & capacity planning
✔ Ensure security, compliance & disaster recovery readiness
⭐ Strong Plus: Experience in regulated environments (Financial Services / Compliance-heavy domains)
If you have strong SRE + Kubernetes + AI Infrastructure + Automation experience and are open to a Hybrid Contract role in Montreal, let’s connect!
📩 DM me to apply.
If you're interested please share your resume to sagar@ardentsoftsol.com
Similar Jobs
Sr DevOps Engineer
Temple University Health System
Dotnet Developer(.net Server)
People Prime Worldwide
SQL Developer
LanceSoft Inc
SQL Server Developer – Azure
Finance Professionals
SQL Server Developer, Data Solutions
Procom
More Jobs at Ardent SoftSol Inc.
View all →Want AI-powered job matching?
Upload your resume and get every job scored, your resume tailored, and hiring manager emails found - automatically.
Get Started Free