Resume Keywords to Include
Make sure these keywords appear in your resume to improve ATS scoring
Sign up free to auto-tailor your resume with all these keywords and get a higher ATS score
Job Description
What you ’ll do
- Own the day-to-day operational health of compliance approved AI and Data platforms in production, ensuring high availability, performance, and reliability
- Monitor AI and Data services, model inference layers, APIs, and data dependencies using logs, metrics, dashboards, and alerts
- Provide production-focused user support for AI tools and Data platforms, prioritizing issue resolution
- Lead incident triage, coordination, and resolution for platform outages or service degradations, partnering with development and infrastructure teams.
- Perform deep technical troubleshooting across applications, data, and system layers.
- Enhance observability, alerting, and operational runbooks to reduce mean time to detect. (MTTD) and mean time to resolve (MTTR) incidents
- Conduct post-incident root cause analysis and drive corrective and preventive improvements
- Support production deployments, configuration changes, and platform upgrades with a strong focus on risk mitigation and stability
- Automate repetitive operational tasks and support workflows using Python and other scripting tools
- Collaborate closely with AI and Data engineering teams to improve platform resilience, scalability, and overall supportability
What’s required
- Bachelor’s degree in Computer Science, Engineering, Mathematics, Physics, or a related technical discipline
- 3–6 years of experience in application support, production engineering, SRE, or platform operations roles
- Solid proficiency in Python for debugging, automation, and operational tooling, as well as SQL for data validation, issue investigation, and platform troubleshooting.
- Working knowledge of cloud operations preferably AWS and Azure
- Windows environments, .NET applications, and SQL Server, as well as Databricks. Prior experience in supporting Reference/Alternate Data applications will be an added advantage
- Good understanding of production systems, APIs, and distributed services
- Experience supporting or operating AI/ML platforms, with knowledge of model serving, inference pipelines, and dependency management
- Excellent analytical, troubleshooting, and incident management skills
- Commitment to the highest ethical standards
We take care of our people
We invest in our people, their careers, their health, and their well-being. When you work here, we provide:
- Health care benefits
- Maternity, Adoption & related leave policies
- Generous paternity and family care leave policies
- Employee Assistance Program & Mental wellness programs
- Transportation support
- Tuition assistance
About Point72
Point72 is a leading global alternative investment firm led by Steven A. Cohen. Building on more than 30 years of investing experience, Point72 seeks to deliver superior returns for its investors through fundamental and systematic investing strategies across asset classes and geographies. We aim to attract and retain the industry’s brightest talent by cultivating an investor-led culture and committing to our people’s long-term growth. t .
Want AI-powered job matching?
Upload your resume and get every job scored, your resume tailored, and hiring manager emails found - automatically.
Get Started Free