AI Frameworks Software Engineer – Model Compression Algorithm

Intel Retiree Medical Plan Trust

Shanghai, PRC
Intel Neural Compressor product development
Quantization and compression techniques
LLM and text-to-image/video models

Job Summary

  • Develop Intel Neural Compressor product and related tools, optimizing for Intel AI platform including CPU, GPU and AI Accelerator.
  • Research and implement quantization and compression techniques for large language models (LLMs) and text-to-image/video generation models.
  • Track and explore cutting-edge directions in efficient model deployment and inference/fine-tuning acceleration.

Skills & Requirements

Must-have

  • Intel Neural Compressor product development
  • quantization and compression techniques
  • LLM and text-to-image/video models
  • Python/C++ programming proficiency
  • deep learning framework fundamentals

Nice-to-have

  • strong self-motivation
  • problem-solving skills
  • passion for technological innovation
  • practical engineering drive
  • continuous exploration and improvement

Key Requirements

  • Master’s or PhD degree
  • Computer science or related major
  • Familiarity with model compression techniques
  • Good oral and written English skills

Work Rights

Not specified
