Data Engineer - Python, Ai

Publix Serving (Civica)

Python programming experience
Nlp pipelines using flair, bert
Large-scale data processing with pyspark
Build NLP pipelines using libraries such as Flair, BERT, and LLM frameworks

Job Summary

  • Build NLP pipelines using libraries such as Flair, BERT, and LLM frameworks.
  • Work on large-scale data processing using PySpark, Pandas, and related data tools.
  • Support CI/CD deployments using GitHub and LightSpeed Enterprise.

Matching Summary

Build NLP pipelines using libraries such as Flair, BERT, and LLM frameworks.

Skills & Requirements

Must-have

  • Python programming experience
  • NLP pipelines using Flair, BERT
  • Large-scale data processing with PySpark
  • Develop scalable ingestion and transformation pipelines
  • Build Flask-based APIs
  • CI/CD workflows using GitHub
  • Autosys JILs for job scheduling
  • Linux command line and shell scripting
  • Monitor application and system health using ITRS Geneos
  • Unit tests with PyTest/unittest
  • Work with REST APIs and microservices
  • Cloud services (ECS) and boto3

Nice-to-have

  • Polars or Dask for high-performance data processing
  • PyTorch or TensorFlow for model training
  • Docker, Kubernetes, or containerized deployments
  • FastAPI, Airflow, or Prefect

Key Requirements

  • 10–12 years of hands-on Python programming experience
  • Experience with NLP libraries
  • Solid experience with PySpark, Pandas, PyArrow
  • Experience building APIs using Flask
  • Experience with MLflow
  • Good understanding of CI/CD practices and Git workflows
  • Experience working with Redis
  • Experience with Autosys JILs
  • Comfortable with Linux command line
  • Exposure to cloud services; AWS boto3 experience

Work Rights

Not specified

Tailored Resume

Cover Letter