Lead Data Engineer Job at WorkHQ, Los Angeles, CA

eHd3TERSUG5YUTJML0xZTGpOS0FjK1AyWVE9PQ==
  • WorkHQ
  • Los Angeles, CA

Job Description

Company Context

Series A, well-funded US startup in HRTech developing WorkHQ.com and an AI Recruiter product.

This is a US-only, Remote role (Mainland).

Role Overview

Lead data infrastructure architect managing billions of data points across 250M+ professional profiles.

Hire data engineers to aid you in that journey.

Core Responsibilities

  • Design scalable data pipelines processing massive record volumes

  • Architect ETL processes using PySpark on Amazon EMR (Open to shifting to other solutions like Data Bricks / Snowflake)

  • Distribute enriched data through medallion architecture across Postgres, Athena, OpenSearch

  • Integrate new data sources into the main pipeline

  • Implement advanced data matching using Splink

Technical Requirements

  • 5-8 years professional data engineering experience

  • Good proficiency in:

    • PySpark and distributed computing

    • AWS data services (EMR, Glue, Athena)

    • Docker

    • Pandas and DataFrame manipulation

    • Complex data format handling (JSONL, Parquet)

  • Strong background in:

    • Big data processing architectures

    • Data warehouse design

    • Performance optimization

  • Advanced Python, SQL skills

Nice to Have

  • Probabilistic record linking expertise

  • OpenSearch/elasticsearch technologies

  • Machine learning data pipeline design

  • Recruitment tech ecosystem knowledge

Technical Stack

  • Big Data: PySpark, EMR

  • Databases: Postgres, OpenSearch

  • Cloud: AWS

  • Containerization: Docker

  • Data Formats: JSONL, Parquet

  • Analytics: Metabase, Athena, Glue

  • Data Processing: Pandas, Splink

Other Considerations

While this role has specific requirements - if you lack a few technical skills, but motivated to learn and lead the platform, please apply for consideration.

If you are coming from Director/Head of/VP levels that is relevant to this job, you can apply as well.

You will need to apply directly on our platform.

Thank you for your time.

Job Tags

Permanent employment, Remote work, Shift work,

Similar Jobs

BD Capital

Loan Processor Job at BD Capital

 ...verify third party services (e.g., credit, flood, appraisals, environmental reports, zoning reports) Collect and verify documentation related...  ..., ensure timely funding of loans, and meet expected service levels Communicate with external parties to the loan including... 

The Ohio State University

Biostatistician II - Center for Biostatistics Job at The Ohio State University

 ...relevant experience required. 4-8 years of relevant experience preferred. Additional Education Desired Additional coursework in epidemiology, public health, biology, or medicine is desired. Required Qualifications A minimum of 4 years of relevant experience as a... 

AFA Sports Performance

Performance Coach Job at AFA Sports Performance

 ...Company Description AFA Sports Performance is dedicated to advancing athletes to their highest potential through tailored, 1:1 coaching in speed, strength, agility, and sports rehab. Our programs are customized based on the individual needs of each athlete,... 

Aloft Tulsa

Night Security Guard Job at Aloft Tulsa

Job Summary:The Security Guard is responsible for maintaining the safety and security of hotel premises, guests, staff, and visitors. This includes regular patrols, responding to alarms or disturbances, enforcing safety regulations, and providing assistance when needed... 

YU & ASSOCIATES INC

Entry Level Environmental Engineer Job at YU & ASSOCIATES INC

Our firm is seeking an Environmental Engineer to join our Northern New Jersey office. The job will consist of a variety of tasks, but candidates can expect to perform soil and groundwater sampling, field inspections and design of solid waste facilities. Will also be providing...