Role Overview
EXL is hiring a mid-level Data Scientist-Data Science-Gen AI Engineer. This is a full-time role in IN. Part of EXL's Lifecycle hiring. Full responsibilities, required qualifications, and the apply link are listed in the description below.
Resume Keywords to Include
Make sure these keywords appear in your resume to improve ATS scoring
Sign up free to auto-tailor your resume with all these keywords and get a higher ATS score
Job Description
As a Machine Learning Systems Architect, your role will involve leading the architecture, development, and deployment of scalable machine learning systems with a focus on real-time inference for Large Language Models (LLMs) to serve multiple concurrent users. You will be optimizing inference pipelines using high-performance frameworks such as vLLM, Groq, ONNX Runtime, Triton Inference Server, and TensorRT to minimize latency and cost.
Your key responsibilities will include:
- Designing and implementing agentic AI systems by utilizing frameworks like LangChain, AutoGPT, and ReAct for autonomous task orchestration.
- Fine-tuning, integrating, and deploying foundation models like GPT, LLaMA, Claude, Mistral, Falcon, and others into intelligent applications.
- Developing and maintaining robust MLOps workflows to manage the full model lifecycle, including training, deployment, monitoring, and versioning.
- Collaborating with DevOps teams to implement scalable serving infrastructure leveraging containerization (Docker), orchestration (Kubernetes), and cloud platforms (AWS, GCP, Azure).
- Implementing retrieval-augmented generation (RAG) pipelines by integrating vector databases such as FAISS, Pinecone, or Weaviate.
- Building observability systems for LLMs to track prompt performance, latency, and user feedback.
- Working cross-functionally with research, product, and operations teams to deliver production-grade AI systems that can handle real-world traffic patterns.
- Staying updated on emerging AI trends, hardware acceleration techniques, and contributing to open-source or research initiatives whenever possible.
Your qualifications for this role should include:
- Strong experience in leading the development and deployment of scalable machine learning systems.
- Proficiency in optimizing inference pipelines using high-performance frameworks.
- Hands-on experience with designing and implementing AI systems utilizing various frameworks.
- Expertise in fine-tuning, integrating, and deploying foundation models into intelligent applications.
- Solid understanding of MLOps workflows and managing the full model lifecycle.
- Previous experience in collaborating with DevOps teams and implementing scalable serving infrastructure.
- Familiarity with retrieval-augmented generation (RAG) pipelines and integrating vector databases.
- Excellent skills in building observability systems and tracking performance metrics.
- Ability to work effectively in a cross-functional team environment and deliver production-grade AI systems.
- Continuous learning and staying updated on emerging AI trends and technologies.
If there are any additional details about the company provided in the job description, please share that information with me for inclusion in the final output. As a Machine Learning Systems Architect, your role will involve leading the architecture, development, and deployment of scalable machine learning systems with a focus on real-time inference for Large Language Models (LLMs) to serve multiple concurrent users. You will be optimizing inference pipelines using high-performance frameworks such as vLLM, Groq, ONNX Runtime, Triton Inference Server, and TensorRT to minimize latency and cost.
Your key responsibilities will include:
- Designing and implementing agentic AI systems by utilizing frameworks like LangChain, AutoGPT, and ReAct for autonomous task orchestration.
- Fine-tuning, integrating, and deploying foundation models like GPT, LLaMA, Claude, Mistral, Falcon, and others into intelligent applications.
- Developing and maintaining robust MLOps workflows to manage the full model lifecycle, including training, deployment, monitoring, and versioning.
- Collaborating with DevOps teams to implement scalable serving infrastructure leveraging containerization (Docker), orchestration (Kubernetes), and cloud platforms (AWS, GCP, Azure).
- Implementing retrieval-augmented generation (RAG) pipelines by integrating vector databases such as FAISS, Pinecone, or Weaviate.
- Building observability systems for LLMs to track prompt performance, latency, and user feedback.
- Working cross-functionally with research, product, and operations teams to deliver production-grade AI systems that can handle real-world traffic patterns.
- Staying updated on emerging AI trends, hardware acceleration techniques, and contributing to open-source or research initiatives whenever possible.
Your qualifications for this role should include:
- Strong experience in leading the development and deployment of scalable machine learning systems.
- Proficiency in optimizing inference pipelines using high-performance frameworks.
- Hands-on experience with designing and implementing AI systems utilizing various frameworks.
- Expertise in fine-tuning, integrating, and deploying foundation models into intelligent applications.
- Solid understanding of MLOps workflows and managing the full model lifecycle.
- Previous experience in collaborating with DevOp
Frequently Asked Questions
How do I apply for the Data Scientist-Data Science-Gen AI Engineer position at EXL?
Use the Apply button above to submit your application directly to EXL. Most applications take less than 5 minutes if your resume and contact details are ready, and you'll be routed to the employer's official application system to finish.
Where is the Data Scientist-Data Science-Gen AI Engineer position at EXL located?
This position is based in IN. EXL has not indicated remote or hybrid options for this role, so candidates should plan for on-site work.
What does a Data Scientist-Data Science-Gen AI Engineer at EXL earn?
EXL has not disclosed a salary range in this posting. Many employers share specifics later in the interview process; you can also ask during a recruiter screen if compensation transparency is important to you.
When was the Data Scientist-Data Science-Gen AI Engineer role at EXL posted?
This role was posted on April 24, 2026 (50 days ago). It's still listed as actively hiring; we re-confirm openings against the source system multiple times per day and remove closed roles.
AI-powered job search
Get every job scored to your resume
Upload your resume and get jobs ranked, your resume tailored, and employee contacts found automatically.
Get Started FreeNo credit card to start