Resume Keywords to Include
Make sure these keywords appear in your resume to improve ATS scoring
Sign up free to auto-tailor your resume with all these keywords and get a higher ATS score
Job Description
Who We Are:
The Machine Learning Platform team at Reddit is a high-impact team that owns the infrastructure that powers recommendations, content discovery, user and content quantification, while directly impacting other teams such as Growth, Ads, Feeds, and Core Machine Learning teams.
What You’ll Do:
As a Staff ML Infrastructure Engineer, you will lead development of a platform for large scale ML models at Reddit.
- Design end-to-end model lifecycle patterns (MLOps) to boost velocity of development for ML engineers, including data preparation, model management, experiment tracking, and more
- Zero-to-one development and support of a graph ML codebase and platform that abstracts away common patterns and enables greater model scalability and iteration
- Collaborate with ML engineers on performance tuning, including improving model training time, efficiency, and GPU training costs in a large, distributed ML training environment
- Optimize batch data processing within a data warehouse and with tools such as Apache Beam, Apache Spark, Ray Data, and more
- Architect pipelines to build and maintain massive graph data structures on the order of billions of nodes and tens of billions of edges
Who You Might Be:
- 7+ years of experience in ML infrastructure, including model training and model deployments
- Hands-on experience with ML optimization, including memory and GPU profiling
- Deep experience with cloud-based technologies for supporting an ML platform, including tools like GCP BigQuery, Google Cloud Storage, infrastructure-as-code (Terraform), and more
- Hands-on experience administering and integrating MLOps tools for experiment tracking, model serving, and model registries (e.g. MLflow or Wandb)
- Proficiency with the common programming languages and frameworks of ML, such as Python, PyTorch, Tensorflow, etc.
- Deep experience working with distributed training frameworks, including Ray and Kubernetes
- Strong focus on scalability, reliability, performance, and ease of use. You are an undying advocate for platform users and have a deep intuition for the machine learning development lifecycle.
- Strong organizational & communication skills
- Experience working with graph databases (Neo4j, JanusGraph, TigerGraph) is a big plus
- Experience working with graph neural networks (GNNs) and associated graph ML frameworks (PyTorch Geometric, Deep Graph Library) is a big plus
Benefits:
- Comprehensive Healthcare Benefits and Income Replacement Programs
- 401k with Employer Match
- Global Benefit programs that fit your lifestyle, from workspace to professional development to caregiving support
- Family Planning Support
- Gender-Affirming Care
- Mental Health & Coaching Benefits
- Flexible Vacation & Paid Volunteer Time Off
- Generous Paid Parental Leave
Similar Jobs
Associate Web Developer - Fresher
Trivora Systems
Salesforce Developer - 100% Remote
VXForward
Senior Database Administrator (PostgreSQL / AWS RDS)
ASYVA INFOTECH
Technical Consultant - Linux and Azure Cloud Engineer job at AHEAD, Inc. in Gurgaon, HR, India
AHEAD, Inc.
Senior BI Testing Analyst
Trilogy Federal
More Jobs at Reddit
View all →Manager, Mid-Market Sales (Client Account Executives)
Machine Learning Engineer, Search and Answers
Machine Learning Engineer, Ads
Machine Learning Engineer, Ads
iOS Software Engineer, i18n: Grow Global and Local Communities
Want AI-powered job matching?
Upload your resume and get every job scored, your resume tailored, and hiring manager emails found - automatically.
Get Started Free