Data Engineer (Data Science / Data extraction)

Full Timemid

Noida, Uttar Pradesh, INPosted 11 weeks ago

Role Overview

Luxoft is hiring a mid-level Data Engineer (Data Science / Data extraction). This is a full-time role in Noida. Part of Luxoft's Devops hiring. Full responsibilities, required qualifications, and the apply link are listed in the description below.

Resume Keywords to Include

Make sure these keywords appear in your resume to improve ATS scoring

PythonGraphQLAWSDockerKubernetesTerraformApacheElasticsearch

Job Description

Project description

We're looking for a Data Engineer with hands on experience in graph databases to design, build, and optimize data pipelines and knowledge graph solutions that power advanced analytics and discovery. You'll collaborate with data scientists, platform engineers, and product teams to model complex domains, integrate heterogeneous sources, and deliver queryable, scalable graph data products.

Responsibilities

Graph Data Modeling & Design

Design and implement property graphs and RDF/OWL-based knowledge graphs.

Develop schemas/ontologies, entity resolution and lineage strategies; define best practices for graph modeling, naming, and versioning.

Data Engineering & Integration

Build and maintain ETL/ELT pipelines to ingest, cleanse, transform, and load data into graph stores from APIs, files, RDBMS, event streams.

Implement batch and streaming integrations using tools such as Airflow, dbt, Kafka/Kinesis, Spark/Flink.

Optimize data quality, deduplication, key management, and incremental upserts into graphs.

Querying & APIs

Write advanced queries in Cypher, Gremlin, and/or SPARQL; tune queries and indexes for performance.

Expose graph capabilities via APIs/services (REST/GraphQL/GRANDstack) with robust governance, observability and caching.

Performance, Reliability & Security

Capacity planning, clustering, backups, and high availability for graph databases.

Monitoring/alerting (e.g., Prometheus/Grafana, CloudWatch), profiling and query plan analysis.

Apply security best practices: encryption, RBAC/ABAC, least privilege, secrets management, and data masking/Pii handling.

MLOps/Analytics Enablement (nice if applicable)

Support downstream analytics and graph algorithms (PageRank, community detection, embeddings) and integrate with ML pipelines.

DevOps & SDLC

Infrastructure-as-Code (Terraform, Bicep, CloudFormation), containerization (Docker, Kubernetes), and CI/CD for data/infra.

Documentation, code reviews, and contribution to data governance (catalogs, lineage, metadata).

Skills

Must have

Experience: 6 years in Data Engineering (or similar) with 2+ years focused on graph databases (property graph and/or RDF).

Graph DBs: Hands-on with at least one of:

Property Graph: Neo4j, AWS Neptune (Gremlin/Cypher).

RDF Triple Stores: Ontotext GraphDB, Apache Jena/Fuseki, Blazegraph, Stardog, Neptune (RDF).

Query Languages: Strong in Cypher and/or Gremlin; SPARQL if working with RDF/OWL.

Data Pipelines: Proficient with Airflow (or similar), Kafka/Kinesis, Spark or Flink; building robust ETL/ELT at scale.

Programming: Python (dataframes, APIs, CLI tooling); solid testing practices (pytest/pytest-bdd).

Cloud: Experience with AWS managed graph/datastores, storage, compute, and networking basics.

Performance & Ops: Indexing, memory/GC tuning, query plan analysis, partitioning/sharding concepts, HA/DR, backup/restore.

Security & Governance: Secrets management, IAM, network isolation, PII compliance; familiarity with data catalog/lineage tools.

Communication: Ability to translate domain knowledge into graph models and explain trade-offs to non technical stakeholders.

Nice to have

Knowledge Graphs & Semantics: RDFS, SHACL, ontology engineering, reasoning/inference, vocabulary alignment (SKOS).

Graph Algorithms & Embeddings: Neo4j Graph Data Science, NetworkX, PyTorch Geometric, vector DB integration.

Graph + Search: Integration with Elasticsearch/OpenSearch, hybrid search (BM25 + embeddings).

Data Modeling: Experience migrating from relational to graph; CDC patterns (Debezium), event-driven architectures.

Observability: OpenTelemetry, tracing for data services; data quality frameworks (Great Expectations).

Delivery: Experience with productizing graph APIs, caching layers, SLA/SLO management.

Regulatory: Familiarity with GDPR/CCPA, data retention, sovereignty considerations.

Other

Languages

English: C1 Advanced

Seniority

Senior

Noida, India

Req. VR-121769

Data Science

BCM Industry

20/03/2026

Req. VR-121769

Frequently Asked Questions

How do I apply for the Data Engineer (Data Science / Data extraction) position at Luxoft?

Use the Apply button above to submit your application directly to Luxoft. Most applications take less than 5 minutes if your resume and contact details are ready, and you'll be routed to the employer's official application system to finish.

Where is the Data Engineer (Data Science / Data extraction) position at Luxoft located?

This position is based in Noida. Luxoft has not indicated remote or hybrid options for this role, so candidates should plan for on-site work.

What does a Data Engineer (Data Science / Data extraction) at Luxoft earn?

Luxoft has not disclosed a salary range in this posting. Many employers share specifics later in the interview process; you can also ask during a recruiter screen if compensation transparency is important to you.

When was the Data Engineer (Data Science / Data extraction) role at Luxoft posted?

This role was posted on March 17, 2026 (83 days ago). It's still listed as actively hiring; we re-confirm openings against the source system multiple times per day and remove closed roles.

Browse Remote DevOps Engineer Jobs →

AI-powered job search

Get every job scored to your resume

Upload your resume and get jobs ranked, your resume tailored, and employee contacts found automatically.

Get Started Free

No credit card to start

Data Engineer (Data Science / Data extraction)

Luxoft