Data Scientist (AI Data & LLM Specialist)

Eclipse

Remote
Data labeling methodologies
Datasets for LLMs
Python and data science libraries

Eclipse is seeking a Data Scientist with a focus on data preparation and labeling for Large Language Models (LLMs) to join their remote team. The ideal candidate should have experience in data quality, data annotation methodologies, and proficiency in Python and data science libraries.

Job Summary

  • Join the core team at Eclipse, where we’re building an AI agent-first marketplace that connects intelligence with real-world tasks, starting with data collection and labeling.
  • Develop Data Labeling Strategies: Design and document a formal data annotation strategy, including clear, scalable, and efficient guidelines for labeling our data.
  • You’ll do high-impact work to enhance Ethereum’s scalability, shaping the future of crypto.

Matching Summary

Match Score: 75


Skills & Requirements

Must-have

  • Data labeling methodologies
  • Datasets for LLMs
  • Python and data science libraries
  • Data annotation automation

Nice-to-have

  • Audio data processing
  • Modern MLOps principles
  • RLHF pipelines

Key Requirements

  • Proven experience as a Data Scientist
  • Hands-on experience with data annotation platforms
  • Knowledge of tokenization, embeddings, and NER
  • Proficiency in Python and common data science libraries
  • Experience using APIs/SDKs
  • Excellent communication skills

Work Rights

Not specified
