10850 - Sr. Platform Engineer (Hadoop Admin) Job at Hyundai AutoEver America, Fountain Valley, CA

  • Hyundai AutoEver America
  • Fountain Valley, CA

Job Description

Purpose:

Hyundai AutoEver America is seeking a highly experienced Senior or Lead Platform Engineer/Site Reliability Engineer (SRE)/Hadoop Admin to manage and enhance our petabyte-scale, on-premises data platform, which is built on the open-source Hadoop ecosystem. The ideal candidate brings deep technical expertise, a strong understanding of distributed systems, and extensive experience operating and optimizing large-scale data infrastructure. This role requires a hands-on technical leader who can drive platform innovation, ensure high availability and reliability, and mentor team members in best practices for performance, automation, and resiliency.

 

Essential Functions:

  • Own and operate the end-to-end infrastructure of a large-scale, on-prem Hadoop-based data platform, ensuring high availability and reliability.
  • Design, implement, and maintain core platform components, including Hadoop, Hive, Spark, NiFi, Iceberg, ELK, OpenSearch, and Ambari.
  • Automate infrastructure management, monitoring, and deployments using CI/CD pipelines (GitLab) and scripting.
  • Implement and enforce security controls, access management, and compliance standards.
  • Perform system upgrades, patching, performance tuning, and troubleshooting across platform components.
  • Optimize observability and telemetry using tools like Prometheus, Grafana, and OpenTelemetry for real-time performance monitoring and alerting.
  • Proactively monitor system health, resolve incidents, and conduct root-cause analyses to prevent recurrence.
  • Collaborate with data engineering, analytics, and infrastructure teams to align platform capabilities with evolving needs.
  • Lead technical discussions, mentor junior engineers, and advocate for DevSecOps and SRE best practices.
  • Champion a culture of operational excellence by continuously improving reliability, automation, and performance.

 

Please note that this job description is not a comprehensive listing of the activities, duties, or responsibilities required of the employee in this job. Duties, responsibilities, and activities may change at any time, with or without notice.

 

Basic Requirements:

  • Bachelor’s degree in Computer Science, Engineering, or a related field.
  • 10+ years of experience in Platform Engineering, Site Reliability Engineering, or similar roles, with proven success managing large-scale, distributed Hadoop infrastructure.
  • Deep expertise in the Hadoop ecosystem, including HDFS, YARN, Hive, Spark, NiFi, Ambari, and Iceberg.
  • Strong Linux system administration skills (CentOS/Rocky preferred), including system tuning, performance optimization, and troubleshooting.
  • Proficiency in containerization and orchestration using Docker and Kubernetes.
  • Solid experience with automation and Infrastructure as Code, using tools like GitLab CI/CD and scripting in Python and Bash.
  • Practical knowledge of monitoring and observability tools (e.g., Prometheus, Grafana, OpenTelemetry) and understanding of system health, alerting, and telemetry.
  • Familiarity with networking concepts, security protocols, and data compliance requirements.
  • Experience managing petabyte-scale data platforms and implementing disaster recovery strategies.
  • Understanding of data governance, metadata management, and operational best practices.
  • Demonstrated ability to lead technical projects, mentor engineers, and collaborate effectively with cross-functional teams.
  • Excellent problem-solving, communication, and leadership skills.

 

Certification:

Relevant certifications (e.g., Cloudera/Hortonworks) are a plus.

 

Salary Range: $103,170 - $158,873

 

 

Job Tags

Full time
