Sr. Ai Inference Systems Engineer

Tencent

Palo Alto, California, US
Base: $120,100.00 to $225,700.00 py; bonus/equity:...
Not specified
End-to-end inference optimization
Heterogeneous computing research
High-performance inference frameworks
Tencent is seeking a Senior AI Inference Systems Engineer to lead optimization efforts for large model inference pipelines, focusing on enhancing throughput and minimizing latency. The ideal candidate should possess advanced degrees and significant experience in AI inference optimization, alongside expertise in AI accelerator architectures and parallel computing

Job Summary

  • The role involves leading the optimization of the full inference pipeline for large models.
  • Candidates will conduct in-depth research into various hardware accelerators for real-time inference.
  • The position offers a competitive salary and comprehensive benefits package.

Matching Summary

Match Score: 85

Tencent is seeking a Senior AI Inference Systems Engineer to lead optimization efforts for large model inference pipelines, focusing on enhancing throughput and minimizing latency. The ideal candidate should possess advanced degrees and significant experience in AI inference optimization, alongside expertise in AI accelerator architectures and parallel computing.

Salary

Base: $120,100.00 to $225,700.00 per year; Bonus/Equity: Not specified; Benefits: Medical, dental, vision, life and disability benefits

Skills & Requirements

Must-have

  • End-to-End Inference Optimization
  • Heterogeneous Computing Research
  • High-performance inference frameworks

Nice-to-have

  • Strong analytical skills
  • Cross-team collaboration
  • Experience in tuning ultra-large-scale inference clusters

Key Requirements

  • Master’s or Ph.D. in Computer Science
  • Significant professional experience in AI inference optimization
  • Proficient in at least one AI accelerator architecture

Work Rights

Not specified

Tailored Resume

Cover Letter