Data Engineer

Lexmark

Not specified; not specified; robust benefits pyck...
Apache spark and airflow experience
Python, sql, and scala proficiency
Aws or gcp cloud platform expertise
This role combines traditional data engineering with the technical requirements for supporting Machine Learning systems and artificial intelligence applications

Job Summary

  • This role combines traditional data engineering with the technical requirements for supporting Machine Learning systems and artificial intelligence applications.
  • The position involves designing robust data pipelines optimized for high-volume data required by ML models and building feature stores for efficient retrieval.
  • Candidates will collaborate with data scientists to integrate models into production while ensuring high data quality and compliance with privacy laws like GDPR.

Matching Summary

This role combines traditional data engineering with the technical requirements for supporting Machine Learning systems and artificial intelligence applications.

Salary

Not specified; Not specified; Robust benefits package designed to support growth and well-being

Skills & Requirements

Must-have

  • Apache Spark and Airflow experience
  • Python, SQL, and Scala proficiency
  • AWS or GCP cloud platform expertise
  • Docker and Kubernetes containerization
  • Feature store implementation skills
  • ETL/ELT pipeline development
  • Model versioning with MLflow

Nice-to-have

  • Experience mentoring junior engineers
  • Knowledge of GenAI and LangChain
  • Vector database familiarity (FAISS)
  • Strong cross-functional collaboration
  • Leadership in technical decision making

Key Requirements

  • Bachelor's or Master's degree in Computer Science
  • 5+ years of experience in data engineering
  • Strong understanding of distributed systems

Work Rights

Not specified

Tailored Resume

Cover Letter