Applied Scientist, Document Understanding

Thomson Reuters

New York City, NY, US
Hybrid

Job Summary

  • This role focuses on designing, building, and deploying production-grade document understanding systems that power Westlaw, PracticalLaw, and CoCounsel.
  • The position requires hands-on experience with model development, distillation, evaluation, and deployment of large language models into latency-constrained environments.
  • Thomson Reuters offers a comprehensive benefits package, including market-competitive health insurance, a 401(k) plan with company match, and flexible work arrangements.

Salary

Base: $136,000 - $253,000 USD; Bonus: eligible for an annual performance-based bonus; Benefits: comprehensive health, dental, and vision coverage, 401(k) match, tuition reimbursement

Skills & Requirements

Must-have

  • PhD or Master's in Computer Science, AI, or NLP
  • 3+ years shipping document understanding systems
  • Production experience with Python, PyTorch, Hugging Face Transformers, and DeepSpeed
  • Document layout analysis and semantic chunking
  • Knowledge graph construction from unstructured text
  • LLM-based information extraction and distillation

Nice-to-have

  • Publications at ACL, EMNLP, ICLR, NeurIPS, SIGIR, or KDD
  • Experience with AzureML or AWS SageMaker
  • Legal domain knowledge and experience with complex document structures
  • RAG and agentic workflows for enterprise systems

Key Requirements

  • PhD or Master's degree required
  • 3+ years post-degree industry experience
  • Proven track record shipping to production
  • Expertise in synthetic data generation

Work Rights

Not specified
