Systems Research Engineer, GPU Programming

Together AI

San Francisco, California, United States
Base: $160,000 - $230,000; Equity: startup equity ...
On-site
Strong background in GPU programming
Experience with CUDA or Triton frameworks
Knowledge of ML/AI applications and models
The role involves co-designing GPU kernels and model architecture to significantly enhance the performance and efficiency of AI systems

Job Summary

  • The role involves co-designing GPU kernels and model architecture to significantly enhance the performance and efficiency of AI systems.
  • Together AI is a research-driven company on a mission to lower the cost of modern AI systems through software and hardware co-design.
  • Candidates will contribute to leading open-source research and technologies such as FlashAttention, Hyena, FlexGen, and RedPajama.

Salary

Base: $160,000 - $230,000; Equity: Startup equity included; Benefits: Health insurance and remote work flexibility

Skills & Requirements

Must-have

  • Strong background in GPU programming
  • Experience with CUDA or Triton frameworks
  • Knowledge of ML/AI applications and models
  • Proficiency in performance profiling tools
  • Expertise in parallel computing techniques

Nice-to-have

  • Ability to research the latest GPU advancements
  • Experience collaborating with cross-functional teams
  • Interest in open-source AI research

Key Requirements

  • Bachelor's, Master's, or Ph.D. in Computer Science or Electrical Engineering
  • Or equivalent practical experience in GPU programming

Work Rights

Not specified
