We are looking for a skilled Software Engineer with 3+ years of experience to join our team building scalable data extraction pipelines
Job Summary
We are looking for a skilled Software Engineer with 3+ years of experience to join our team building scalable data extraction pipelines.
You will work on extracting, processing, and structuring data from complex sources such as PDFs and HTML documents using modern ML, NLP, and LLM-based approaches.
Our benefits include Health & Wellness, Flexible Downtime, Continuous Learning, Invest in Your Future, and Family Friendly Perks.
Matching Summary
We are looking for a skilled Software Engineer with 3+ years of experience to join our team building scalable data extraction pipelines.
Skills & Requirements
Must-have
Python programming
Data processing pipelines
ML/NLP techniques
LLMs and prompt engineering
PDF/HTML data extraction
REST APIs and backend development
SQL/NoSQL databases
Software engineering fundamentals
Nice-to-have
Document AI / OCR tools
FastAPI framework
React or frontend development
Distributed systems / streaming pipelines
Vector databases / embeddings / semantic search
Deploying ML models to production
Cloud platforms (AWS / GCP / Azure)
Key Requirements
3+ years of experience
Strong programming skills in Python
Hands-on experience with NLP / ML techniques
Familiarity with LLMs
Experience parsing data from PDFs / HTML
Knowledge of REST APIs
Experience with databases
Good understanding of software engineering fundamentals