Python Developer for Data Engineering

MSCI
Full Time | Mid-level
IN | Posted April 15, 2026

Resume Keywords to Include

Make sure these keywords appear in your resume to improve ATS scoring

Python, SQL, Snowflake, Git, GitHub, Airflow, Pandas, API

Job Description

As part of the Data Engineering group at MSCI, you will play a crucial role in delivering data products to MSCI's product lines globally. You will be part of a talented software development team in Mumbai, working on a next-generation metadata-driven data platform that leverages AI to automate and scale data onboarding. This greenfield opportunity will allow you to architect systems that significantly reduce manual effort and accelerate vendor onboarding.

Key Responsibilities:
  • Design and develop AI-powered automation capabilities for data onboarding, including vendor file classification, metadata auto-suggestion, data profiling engines, and automated quality control frameworks
  • Build LLM-integrated code generation systems for ingestion pipelines, PySpark transformations, and Airflow DAG orchestration
  • Implement metadata management platforms that serve as the control plane for data lifecycle automation
  • Develop RESTful APIs and integration layers connecting AI services, data platforms (Snowflake, Databricks), and orchestration frameworks
  • Create human-in-the-loop workflows for validation, exception handling, and continuous model improvement
  • Collaborate with cross-functional teams across data engineering, governance, and product to deliver end-to-end automation solutions
  • Build scalable, fault-tolerant systems for high-volume metadata processing

Qualifications Required:
  • 5-8 years of software development experience with strong Python programming expertise
  • Knowledge of data manipulation libraries (Pandas, Polars) and analysis workflows
  • Proficiency in SQL and data querying across modern data platforms
  • Understanding of columnar storage formats and time-series analytics (ClickHouse, Parquet, Iceberg)
  • Experience with AI-assisted development tools (GitHub Copilot, Cursor, or similar)
  • Strong understanding of RESTful API design and implementation
  • Experience with Git version control and collaborative development workflows
  • Demonstrated ability to take ownership of complex technical solutions end-to-end
  • Strong analytical and problem-solving skills with attention to data quality and reliability

In addition, MSCI offers a culture of high performance and innovation, flexible working arrangements, advanced technology, collaborative workspaces, and a global network of talented colleagues. As part of MSCI, you will have access to transparent compensation schemes, comprehensive employee benefits, and ongoing learning opportunities to support your professional growth and development.
