Full Stack LLM Engineer

Cerebras Systems

Toronto, Canada
On-site

Job Summary

  • Contribute to the end-to-end bring-up of ML models on Cerebras CSX systems.
  • Work across the stack: model architecture translation, graph lowering, compiler optimizations, runtime integration, and performance tuning.
  • Propose and prototype improvements to tools, APIs, and automation flows to accelerate future bring-ups.

Skills & Requirements

Must-have

  • end-to-end ML model bring-up
  • model architecture translation
  • compiler optimizations
  • runtime integration
  • performance tuning
  • deep learning frameworks
  • LLVM and/or MLIR

Nice-to-have

  • system-minded generalist
  • fast-paced bring-up environments
  • cutting-edge AI research

Key Requirements

  • Bachelor’s, Master’s, or PhD in Computer Science, Engineering, or related field
  • Proficiency in C/C++ programming
  • Strong background in optimization techniques

Work Rights

Not specified
