The ideal candidate will develop and maintain advanced data pipelines and solutions on the Cloudera/Hadoop stack, ensuring seamless integration and automation across environments through CI/CD pipelines
Job Summary
The ideal candidate will develop and maintain advanced data pipelines and solutions on the Cloudera/Hadoop stack, ensuring seamless integration and automation across environments through CI/CD pipelines.
You will work closely with architects, data engineers, and DevOps teams to build reliable, scalable, and efficient data processing systems that empower analytics and data-driven decision-making.
DXC Technology prioritizes in-person collaboration while offering flexibility to support wellbeing, productivity, individual work styles, and life circumstances, fostering an inclusive environment.
Matching Summary
The ideal candidate will develop and maintain advanced data pipelines and solutions on the Cloudera/Hadoop stack, ensuring seamless integration and automation across environments through CI/CD pipelines.
Skills & Requirements
Must-have
Cloudera/Hadoop stack
Spark (Scala or PySpark)
CI/CD pipelines
Linux-based environments and scripting
scalable, distributed data processing systems
Nice-to-have
cloud integration experience
containerization technologies
data governance and security frameworks
data quality, lineage, and metadata management
Cloudera or Spark certifications
Key Requirements
Strong experience in Spark (Scala or PySpark) and Hadoop ecosystem
Solid hands-on experience with Cloudera Data Platform (CDP)
Proficiency with GitHub and CI/CD tools
Familiarity with Linux-based environments and scripting
Good communication skills and ability to work collaboratively