Make sure these keywords appear in your resume to improve ATS scoring
PythonJavaSQLAWSKubernetesApacheKafka
Sign up free to auto-tailor your resume with all these keywords and get a higher ATS score
Job Description
<p><strong>Department/Team: </strong>Engineering <br><strong>Clearance Requirement:</strong> Not Required - Must be eligible<br><strong>Location:</strong> Remote <br><strong>Employment Type:</strong> Full-time, Salaried</p>
<h3><span style="font-size: 14pt;"><strong>Role Summary</strong></span></h3>
<p> </p>
<p>We are seeking a Senior Data Engineer to design and build high-performance real-time data platforms that power analytics, machine learning, and operational intelligence. This role focuses on streaming data pipelines, distributed processing, and large-scale event data systems.</p>
<p>You will work on building and operating low-latency data pipelines using technologies such as Apache Flink, Apache Druid, Kafka, and modern data infrastructure, enabling real-time insights across large volumes of structured and unstructured data.</p>
<p>This role requires strong experience in stream processing architectures, distributed systems, and scalable data infrastructure.</p>
<h3><strong>Key Responsibilities:</strong></h3>
<ul>
<li>Design and implement real-time streaming data pipelines for high-volume event data. </li>
<li>Develop and operate distributed data processing systems using technologies such as:</li>
<ul>
<li>Apache Flink</li>
<li>Apache Kafka</li>
<li>Apache Druid</li>
</ul>
</ul>
<ul>
<li>Build scalable ingestion pipelines capable of handling millions of events per second.</li>
<li>Design low-latency analytical data stores for operational dashboards and real-time analytics.</li>
<li>Optimize data pipelines for performance, scalability, and fault tolerance.</li>
<li>Work with product and analytics teams to translate business needs into real-time data models.</li>
<li>Build and maintain data observability, monitoring, and reliability frameworks.</li>
<li>Implement schema evolution and data quality controls across streaming pipelines.</li>
<li>Contribute to data platform architecture decisions and infrastructure design.</li>
<li>Mentor junior engineers and promote best practices in data engineering and distributed systems.</li>
</ul>
<p> </p>
<h3><strong>Required Qualifications</strong></h3>
<ul>
<li>7+ years of experience in data engineering or distributed systems development</li>
<li>Strong experience building streaming data pipelines</li>
<li>Hands-on experience with at least one major streaming framework</li>
<li>Experience with real-time analytical databases </li>
<li>Experience with large-scale distributed systems</li>
<li>Strong SQL skills and experience designing analytical data models</li>
<li>Experience building fault-tolerant, highly scalable pipelines</li>
<li>Proficiency in one or more programming languages:</li>
<ul>
<li>Java</li>
<li>Python</li>
</ul>
<li>Experience with AWS </li>
</ul>
<h3><strong>Preferred Qualifications</strong></h3>
<ul>
<li>Experience operating Apache Flink clusters in production</li>
<li>Experience with Apache Druid real-time ingestion</li>
<li>Experience building low-latency OLAP analytics systems</li>
<li>Experience with Kubernetes-based data infrastructure</li>
<li>Experience with Iceberg / Hudi / Delta Lake</li>
<li>Experience with real-time ML feature pipelines</li>
<li>Experience building observability for data platforms</li>
<li>Experience with high-volume event streams (billions of events/day)</li>
</ul>
<p> </p>