Skip to main content
Hexagon India Pvt. Ltd. logo

Healthcare Data Scientist (AI/ML)

Hexagon India Pvt. Ltd.
Full Timemid
INPosted March 10, 2026

Resume Keywords to Include

Make sure these keywords appear in your resume to improve ATS scoring

PythonSQLAzurePandasNumPyTensorFlowPyTorchscikit-learn

Sign up free to auto-tailor your resume with all these keywords and get a higher ATS score

Job Description

You will be working as a Data Scientist at DenuoSource India Pvt Ltd, a forward-thinking organization specializing in data-driven solutions for various industries, particularly focusing on healthcare. Your role will involve designing and implementing data science models to address complex healthcare challenges, creating scalable data pipelines, and utilizing Large Language Models (LLMs) and Small Language Models (SLMs) to extract insights from clinical and claims data. Collaboration with cross-functional teams to translate data insights into strategic recommendations will be a key aspect of your responsibilities.

  • *Key Responsibilities:**
  • Design, develop, and deploy machine learning and deep learning models for healthcare use cases like claims processing, clinical coding, pricing analytics, and provider profiling
  • Build and optimize end-to-end data pipelines using PySpark and Azure Databricks for large-scale healthcare data processing
  • Implement Generative AI solutions utilizing LLMs and SLMs for tasks like schema code predictions and medical entity extractions
  • Fine-tune SLMs on domain-specific healthcare data to enhance accuracy and reduce inference costs
  • Develop Named Entity Recognition (NER) models for structured information extraction from unstructured clinical text
  • Conduct advanced statistical analysis and predictive modeling to identify trends, anomalies, and actionable patterns in healthcare datasets
  • Create data visualizations and dashboards to effectively communicate findings to technical and non-technical stakeholders
  • Stay updated on emerging AI/ML research and assess its relevance to healthcare analytics problems
  • *Required Qualifications:**
  • 3+ years of hands-on experience in Data Science, with a focus on healthcare or related domains
  • Proficiency in Python and its data science ecosystem (Pandas, NumPy, Scikit-learn, PyTorch/TensorFlow)
  • Experience with PySpark and distributed data processing frameworks
  • Working knowledge of Azure cloud services (Azure Databricks, Azure ML, or similar)
  • Demonstrated experience with Generative AI, including working with LLMs (GPT, Claude, LLaMA, etc.) and prompt engineering
  • Hands-on experience with SLM fine-tuning for domain-specific tasks
  • Familiarity with deep learning architectures such as Transformers, BERT, and attention-based models
  • Experience in building NER pipelines using frameworks like spaCy, Hugging Face Transformers, or BERT-based models
  • Strong statistical foundation and proficiency in SQL for data extraction, manipulation, and analysis
  • Familiarity with data visualization tools
  • *Education:**
  • Bachelor's or Master's degree in Computer Science, Data Science, Statistics, Mathematics, or a related quantitative field
  • Advanced certifications in AI/ML, cloud computing (Azure), or healthcare informatics are advantageous

(Note: Company details have been omitted as there were no additional company details provided in the job description) You will be working as a Data Scientist at DenuoSource India Pvt Ltd, a forward-thinking organization specializing in data-driven solutions for various industries, particularly focusing on healthcare. Your role will involve designing and implementing data science models to address complex healthcare challenges, creating scalable data pipelines, and utilizing Large Language Models (LLMs) and Small Language Models (SLMs) to extract insights from clinical and claims data. Collaboration with cross-functional teams to translate data insights into strategic recommendations will be a key aspect of your responsibilities.

  • *Key Responsibilities:**
  • Design, develop, and deploy machine learning and deep learning models for healthcare use cases like claims processing, clinical coding, pricing analytics, and provider profiling
  • Build and optimize end-to-end data pipelines using PySpark and Azure Databricks for large-scale healthcare data processing
  • Implement Generative AI solutions utilizing LLMs and SLMs for tasks like schema code predictions and medical entity extractions
  • Fine-tune SLMs on domain-specific healthcare data to enhance accuracy and reduce inference costs
  • Develop Named Entity Recognition (NER) models for structured information extraction from unstructured clinical text
  • Conduct advanced statistical analysis and predictive modeling to identify trends, anomalies, and actionable patterns in healthcare datasets
  • Create data visualizations and dashboards to effectively communicate findings to technical and non-technical stakeholders
  • Stay updated on emerging AI/ML research and assess its relevance to healthcare analytics problems
  • *Required Qualifications:**
  • 3+ years of hands-on experience in Data Science, with a focus on healthcare or related domains
  • Proficiency in Python and its data science ecosystem (Pandas, NumPy, Scikit-learn, PyTorch/TensorFlow)
  • Experience with PySpark and distributed data processing frameworks
  • Working knowledge of Azure cloud services (Azure Databrick

Want AI-powered job matching?

Upload your resume and get every job scored, your resume tailored, and hiring manager emails found - automatically.

Get Started Free