Distributed Training & Inference Optimization Engineer (llm) - Gpu Optimization Department (gpuod)

Rakuten

Tokyo, Japan
Hybrid
Gpu-accelerated ml frameworks
Distributed training optimizations
Llm inference optimizations
The AI & Data Division at Rakuten is focused on leveraging data for AI initiatives

Job Summary

  • The AI & Data Division at Rakuten is focused on leveraging data for AI initiatives.
  • As a GPU Optimization Engineer, you will enhance LLM training and inference performance.
  • This role offers the opportunity to work on cutting-edge technologies in a global team.

Matching Summary

The AI & Data Division at Rakuten is focused on leveraging data for AI initiatives.

Skills & Requirements

Must-have

  • GPU-accelerated ML frameworks
  • distributed training optimizations
  • LLM inference optimizations

Nice-to-have

  • Contributions to open-source ML frameworks
  • Familiarity with Kubernetes for GPU workloads
  • Experience with inference serving frameworks

Key Requirements

  • 3+ years of hands-on experience
  • Bachelor’s or higher degree in Computer Science

Work Rights

Not specified

Tailored Resume

Cover Letter