Python Backend Developer

271

Hybrid
Expert-level python skills
Experience with opencv and pymupdf
Deep familiarity with tesseract and paddleocr
TransPerfect is seeking a Python Backend Developer to join its AI team, focusing on innovative solutions in document processing, particularly converting complex PDFs into editable formats. The role requires strong technical expertise in Python, OCR, and document AI, along with a strategic mindset to enhance AI applications

Job Summary

  • The role involves leading the research and implementation of a document conversion pipeline to solve the 'last mile' of converting complex PDFs to editable .docx files.
  • You will perform comparative analysis between commercial solutions like ABBYY and open-source AI-native tools such as Mistral OCR and Docling.
  • This is a hybrid role requiring both strategic decision-making on tool selection and hands-on development of scalable AI-driven workflows.

Matching Summary

Match Score: 85

TransPerfect is seeking a Python Backend Developer to join its AI team, focusing on innovative solutions in document processing, particularly converting complex PDFs into editable formats. The role requires strong technical expertise in Python, OCR, and document AI, along with a strategic mindset to enhance AI applications.

Skills & Requirements

Must-have

  • Expert-level Python skills
  • Experience with OpenCV and PyMuPDF
  • Deep familiarity with Tesseract and PaddleOCR
  • Knowledge of LayoutLMv3, Donut, or Nougat
  • Understanding of OOXML document formats
  • Experience with GPT or Claude LLM integration

Nice-to-have

  • Experience with Pandoc AST for format conversion
  • Background in DTP, Typography or Graphic Design
  • Contributions to open-source OCR projects

Key Requirements

  • Python Mastery
  • OCR/Document AI expertise
  • Format Expertise in XML-based documents
  • LLM Integration experience
  • Architectural Vision for API vs custom pipelines

Work Rights

Not specified

Tailored Resume

Cover Letter