Data Engineer - Python, Ai

Workforcity

Not specified
Python programming experience
Nlp libraries (flair, bert)
Pyspark, pandas, pyarrow
The job posting is for a mid-level Data Engineer specializing in Python, AI, and NLP at Workforcity. The role involves building NLP pipelines, managing data processing with tools like PySpark and Pandas, and developing APIs while supporting CI/CD deployments

Job Summary

  • The candidate will build NLP pipelines using libraries such as Flair, BERT, and LLM frameworks, and will also work on large-scale data processing using PySpark, Pandas, and related data tools.
  • The role includes developing APIs, integrating with platform services, and supporting CI/CD deployments using GitHub and LightSpeed Enterprise.
  • Key responsibilities include developing and optimizing ETL/data processing jobs, building and maintaining NLP pipelines, and developing scalable ingestion and data transformation pipelines for AI and analytics use cases.

Matching Summary

Match Score: 85

The job posting is for a mid-level Data Engineer specializing in Python, AI, and NLP at Workforcity. The role involves building NLP pipelines, managing data processing with tools like PySpark and Pandas, and developing APIs while supporting CI/CD deployments.

Skills & Requirements

Must-have

  • Python programming experience
  • NLP libraries (Flair, BERT)
  • PySpark, Pandas, PyArrow
  • Flask-based APIs
  • CI/CD practices and Git workflows
  • Redis in-memory stores
  • Linux command line and shell scripting

Nice-to-have

  • Polars or Dask for data processing
  • PyTorch or TensorFlow for model training
  • Docker, Kubernetes, containerized deployments
  • FastAPI, Airflow, or Prefect

Key Requirements

  • 3–5 years of hands-on Python programming experience
  • Experience with Autosys JILs for job scheduling
  • Exposure to cloud services; AWS boto3 experience is an asset

Work Rights

Not specified

Tailored Resume

Cover Letter