Develop and maintain advanced data pipelines and solutions on the Cloudera/Hadoop stack, ensuring seamless integration and automation across environments through CI/CD pipelines
Job Summary
Develop and maintain advanced data pipelines and solutions on the Cloudera/Hadoop stack, ensuring seamless integration and automation across environments through CI/CD pipelines.
Work closely with architects, data engineers, and DevOps teams to build reliable, scalable, and efficient data processing systems that empower analytics and data-driven decision-making.
At DXC Technology, we believe strong connections and community are key to our success. Our work model prioritizes in-person collaboration while offering flexibility to support wellbeing, productivity, individual work styles, and life circumstances.
Matching Summary
Develop and maintain advanced data pipelines and solutions on the Cloudera/Hadoop stack, ensuring seamless integration and automation across environments through CI/CD pipelines.
Skills & Requirements
Must-have
Cloudera/Hadoop stack development
Spark (Scala or PySpark) experience
Cloudera Data Platform (CDP)
GitHub and CI/CD tools
Linux environments and scripting
Scalable, distributed data processing systems
Nice-to-have
Cloud integration experience
Containerization technologies knowledge
Data governance and security frameworks
Data quality, lineage, metadata management
Cloudera or Spark certifications
Key Requirements
Strong experience in Spark (Scala or PySpark)
Solid hands-on experience with Cloudera Data Platform (CDP)
Proficiency with GitHub and CI/CD tools
Familiarity with Linux-based environments and scripting