Ai Frameworks Software Engineer – Model Compression Algorithm

Intel Corporation

Shanghai, China
Intel neural compressor product development
Quantization and compression techniques
Llms and text-to-image/video models
Responsibilities include developing the Intel Neural Compressor product and related tools, optimizing for Intel AI platforms (CPU, GPU, AI Accelerator)

Job Summary

  • Responsibilities include developing the Intel Neural Compressor product and related tools, optimizing for Intel AI platforms (CPU, GPU, AI Accelerator).
  • Research and implement quantization and compression techniques for large language models (LLMs) and text-to-image/video generation models.
  • Track and explore cutting-edge directions in efficient model deployment and inference/finetuning acceleration.

Matching Summary

Responsibilities include developing the Intel Neural Compressor product and related tools, optimizing for Intel AI platforms (CPU, GPU, AI Accelerator).

Skills & Requirements

Must-have

  • Intel Neural Compressor product development
  • quantization and compression techniques
  • LLMs and text-to-image/video models
  • Python/C++ programming proficiency
  • deep learning framework fundamentals

Nice-to-have

  • strong sense of teamwork
  • good English oral and written skill
  • self-motivation and problem-solving skills
  • passion for technological innovation

Key Requirements

  • Master's or PhD degree
  • Computer Science major
  • familiarity with model compression techniques
  • experience in model fine-tuning

Work Rights

Not specified

Tailored Resume

Cover Letter