Senior Data Engineer – Real-Time Streaming

Privateer LTD

Full Time, Senior, Remote

Posted 8 weeks ago

Resume Keywords to Include

Make sure these keywords appear in your resume to improve ATS scoring

Python, Java, SQL, AWS, Kubernetes, Apache Kafka

Job Description

<p><strong>Department/Team:</strong> Engineering<br><strong>Clearance Requirement:</strong> Not Required - Must be eligible<br><strong>Location:</strong> Remote<br><strong>Employment Type:</strong> Full-time, Salaried</p>
<h3><span style="font-size: 14pt;"><strong>Role Summary</strong></span></h3>
<p>We are seeking a Senior Data Engineer to design and build high-performance real-time data platforms that power analytics, machine learning, and operational intelligence. This role focuses on streaming data pipelines, distributed processing, and large-scale event data systems.</p>
<p>You will work on building and operating low-latency data pipelines using technologies such as Apache Flink, Apache Druid, Kafka, and modern data infrastructure, enabling real-time insights across large volumes of structured and unstructured data.</p>
<p>This role requires strong experience in stream processing architectures, distributed systems, and scalable data infrastructure.</p>
<h3><strong>Key Responsibilities:</strong></h3>
<ul>
<li>Design and implement real-time streaming data pipelines for high-volume event data.</li>
<li>Develop and operate distributed data processing systems using technologies such as:
<ul>
<li>Apache Flink</li>
<li>Apache Kafka</li>
<li>Apache Druid</li>
</ul>
</li>
<li>Build scalable ingestion pipelines capable of handling millions of events per second.</li>
<li>Design low-latency analytical data stores for operational dashboards and real-time analytics.</li>
<li>Optimize data pipelines for performance, scalability, and fault tolerance.</li>
<li>Work with product and analytics teams to translate business needs into real-time data models.</li>
<li>Build and maintain data observability, monitoring, and reliability frameworks.</li>
<li>Implement schema evolution and data quality controls across streaming pipelines.</li>
<li>Contribute to data platform architecture decisions and infrastructure design.</li>
<li>Mentor junior engineers and promote best practices in data engineering and distributed systems.</li>
</ul>
<h3><strong>Required Qualifications</strong></h3>
<ul>
<li>7+ years of experience in data engineering or distributed systems development</li>
<li>Strong experience building streaming data pipelines</li>
<li>Hands-on experience with at least one major streaming framework</li>
<li>Experience with real-time analytical databases</li>
<li>Experience with large-scale distributed systems</li>
<li>Strong SQL skills and experience designing analytical data models</li>
<li>Experience building fault-tolerant, highly scalable pipelines</li>
<li>Proficiency in one or more programming languages:
<ul>
<li>Java</li>
<li>Python</li>
</ul>
</li>
<li>Experience with AWS</li>
</ul>
<h3><strong>Preferred Qualifications</strong></h3>
<ul>
<li>Experience operating Apache Flink clusters in production</li>
<li>Experience with Apache Druid real-time ingestion</li>
<li>Experience building low-latency OLAP analytics systems</li>
<li>Experience with Kubernetes-based data infrastructure</li>
<li>Experience with Iceberg / Hudi / Delta Lake</li>
<li>Experience with real-time ML feature pipelines</li>
<li>Experience building observability for data platforms</li>
<li>Experience with high-volume event streams (billions of events/day)</li>
</ul>