Skip to main content
Robert Half logo

Data Engineer - Lead

Robert Half
Full Timemid
Houston, Texas, USPosted February 16, 2026

Resume Keywords to Include

Make sure these keywords appear in your resume to improve ATS scoring

PythonRAWSAzureApacheKafkaSparkAirflow

Sign up free to auto-tailor your resume with all these keywords and get a higher ATS score

Job Description

We are looking for an experienced Lead Data Engineer to oversee the design, implementation, and management of advanced data infrastructure in Houston, Texas. This role requires expertise in architecting scalable solutions, optimizing data pipelines, and ensuring data quality to support analytics, machine learning, and real-time processing. The ideal candidate will have a deep understanding of Lakehouse architecture and Medallion design principles to deliver robust and governed data solutions.Responsibilities:

  • Develop and implement scalable data pipelines to ingest, process, and store large datasets using tools such as Apache Spark, Hadoop, and Kafka.
  • Utilize cloud platforms like AWS or Azure to manage data storage and processing, leveraging services such as S3, Lambda, and Azure Data Lake.
  • Design and operationalize data architecture following Medallion patterns to ensure data usability and quality across Bronze, Silver, and Gold layers.
  • Build and optimize data models and storage solutions, including Databricks Lakehouses, to support analytical and operational needs.
  • Automate data workflows using tools like Apache Airflow and Fivetran to streamline integration and improve efficiency.
  • Lead initiatives to establish best practices in data management, facilitating knowledge sharing and collaboration across technical and business teams.
  • Collaborate with data scientists to provide infrastructure and tools for complex analytical models, using programming languages like Python or R.
  • Implement and enforce data governance policies, including encryption, masking, and access controls, within cloud environments.
  • Monitor and troubleshoot data pipelines for performance issues, applying tuning techniques to enhance throughput and reliability.
  • Stay updated with emerging technologies in data engineering and advocate for improvements to the organization's data systems.

Want AI-powered job matching?

Upload your resume and get every job scored, your resume tailored, and hiring manager emails found - automatically.

Get Started Free