Machine Learning Systems Research Intern, Phd, Summer 2026

Red Hat

Fully remote
Llm inference and optimizations
Pytorch tensor math libraries
Quantization, pruning, knowledge distillation
Red Hat is seeking a Machine Learning Systems Research Intern for the summer of 2026 to work remotely on AI inference and model optimization techniques within their Machine Learning Research Team. The ideal candidate is currently pursuing a Ph.D. in a related field, has strong programming skills, and experience with large language models

Job Summary

  • As an intern, you will work on cutting-edge AI inference and model optimization techniques, and contribute to research and engineering efforts that make LLMs faster and more efficient.
  • Conduct experiments to evaluate the impact of optimization methods on model accuracy, latency, and throughput.
  • Opportunity to contribute to research papers, patents, or open-source projects.

Matching Summary

Match Score: 85

Red Hat is seeking a Machine Learning Systems Research Intern for the summer of 2026 to work remotely on AI inference and model optimization techniques within their Machine Learning Research Team. The ideal candidate is currently pursuing a Ph.D. in a related field, has strong programming skills, and experience with large language models.

Skills & Requirements

Must-have

  • LLM inference and optimizations
  • PyTorch tensor math libraries
  • quantization, pruning, knowledge distillation
  • GPU performance optimizations
  • large language model architectures

Nice-to-have

  • open-source ML frameworks
  • top tier conference publications
  • community-powered approach
  • creative, passionate people

Key Requirements

  • Currently pursuing Ph.D. degree
  • Strong programming skills in C++, CUDA, Python
  • Familiarity with AI model optimization techniques
  • Deep understanding of GPU performance
  • Background in efficient inference techniques

Work Rights

Not specified

Tailored Resume

Cover Letter