Base: $120,100.00 to $225,700.00 py; bonus/equity:...
Not specified
End-to-end inference optimization
Heterogeneous computing research
High-performance inference frameworks
Tencent is seeking a Senior AI Inference Systems Engineer to lead optimization efforts for large model inference pipelines, focusing on enhancing throughput and minimizing latency. The ideal candidate should possess advanced degrees and significant experience in AI inference optimization, alongside expertise in AI accelerator architectures and parallel computing
Job Summary
The role involves leading the optimization of the full inference pipeline for large models.
Candidates will conduct in-depth research into various hardware accelerators for real-time inference.
The position offers a competitive salary and comprehensive benefits package.
Matching Summary
Match Score: 85
Tencent is seeking a Senior AI Inference Systems Engineer to lead optimization efforts for large model inference pipelines, focusing on enhancing throughput and minimizing latency. The ideal candidate should possess advanced degrees and significant experience in AI inference optimization, alongside expertise in AI accelerator architectures and parallel computing.
Salary
Base: $120,100.00 to $225,700.00 per year; Bonus/Equity: Not specified; Benefits: Medical, dental, vision, life and disability benefits
Skills & Requirements
Must-have
End-to-End Inference Optimization
Heterogeneous Computing Research
High-performance inference frameworks
Nice-to-have
Strong analytical skills
Cross-team collaboration
Experience in tuning ultra-large-scale inference clusters
Key Requirements
Master’s or Ph.D. in Computer Science
Significant professional experience in AI inference optimization
Proficient in at least one AI accelerator architecture