Ai Specialist (ai Engineering)

Hyphen Connect

Oregon, United States
On-site
Model distillation and pruning expertise
4-bit/8-bit quantization techniques
Tensorrt and onnx runtime experience
Hyphen Connect is seeking an AI Specialist Engineer to enhance the performance of large language and vision models for on-device inference. The ideal candidate will have expertise in model optimization techniques and hands-on experience with various AI frameworks

Job Summary

  • The role focuses on enhancing the performance of large language and vision models specifically for on-device inference.
  • Candidates will develop pipelines for model distillation and handle hardware-specific compilation tasks.
  • The position requires benchmarking performance across various NPU and GPU architectures to ensure optimal efficiency.

Matching Summary

Match Score: 85

Hyphen Connect is seeking an AI Specialist Engineer to enhance the performance of large language and vision models for on-device inference. The ideal candidate will have expertise in model optimization techniques and hands-on experience with various AI frameworks.

Skills & Requirements

Must-have

  • Model distillation and pruning expertise
  • 4-bit/8-bit quantization techniques
  • TensorRT and ONNX Runtime experience
  • Edge deployment and NPU/GPU benchmarking
  • Strong C++ and Python programming skills

Nice-to-have

  • Diverse hardware architecture optimization
  • Cutting-edge AI solution development
  • Performance efficiency across architectures

Key Requirements

  • Expertise in model compression and quantization
  • Hands-on experience with TensorRT and ONNX Runtime
  • Strong proficiency in C++ and Python

Work Rights

Not specified

Tailored Resume

Cover Letter