Ai Frameworks Software Engineer – Model Compression Algorithm
Intel Corporation
Shanghai, China
Intel neural compressor product development
Quantization and compression techniques
Llms and text-to-image/video models
Responsibilities include developing the Intel Neural Compressor product and related tools, optimizing for Intel AI platforms (CPU, GPU, AI Accelerator)
Job Summary
Responsibilities include developing the Intel Neural Compressor product and related tools, optimizing for Intel AI platforms (CPU, GPU, AI Accelerator).
Research and implement quantization and compression techniques for large language models (LLMs) and text-to-image/video generation models.
Track and explore cutting-edge directions in efficient model deployment and inference/finetuning acceleration.
Matching Summary
Responsibilities include developing the Intel Neural Compressor product and related tools, optimizing for Intel AI platforms (CPU, GPU, AI Accelerator).