Ai Specialist (ai Engineering)

Hyphen Connect

Singapore, Singapore
On-site
Large language and vision models
Model distillation and pruning techniques
4-bit/8-bit quantization expertise
Hyphen Connect is seeking an AI Specialist Engineer to optimize large language and vision models for on-device inference. The role requires expertise in model compression and deployment across various hardware architectures

Job Summary

  • The role focuses on enhancing the performance of large language and vision models specifically for on-device inference.
  • Candidates will develop pipelines for model distillation and handle hardware-specific compilation tasks.
  • The position requires benchmarking performance across various NPU and GPU architectures to ensure optimal efficiency.

Matching Summary

Match Score: 85

Hyphen Connect is seeking an AI Specialist Engineer to optimize large language and vision models for on-device inference. The role requires expertise in model compression and deployment across various hardware architectures.

Skills & Requirements

Must-have

  • Large language and vision models
  • Model distillation and pruning techniques
  • 4-bit/8-bit quantization expertise
  • TensorRT and ONNX Runtime experience
  • NPU/GPU architecture benchmarking

Nice-to-have

  • Edge deployment proficiency
  • Hardware-specific compilation skills
  • Diverse hardware architecture knowledge

Key Requirements

  • Strong C++ and Python programming skills
  • Hands-on experience with edge deployment tools
  • Expertise in model compression techniques

Work Rights

Not specified

Tailored Resume

Cover Letter