Software Development Engineer - AI/ML, AWS Neuron, Multimodal Inference
Amazon
Seattle, WA, US
143,700.00 - 194,400.00 usd annually py
On-site
Python programming
System level programming
Pytorch or jax frameworks
This role involves architecting and implementing business-critical features for distributed inference support on AWS custom ML accelerators
Job Summary
This role involves architecting and implementing business-critical features for distributed inference support on AWS custom ML accelerators.
Engineers will tune large language models like Llama and DeepSeek to ensure highest performance and maximize efficiency on Trainium and Inferentia silicon.
The team fosters a builder's culture where experimentation is encouraged, emphasizing technical ownership and continuous learning.
Matching Summary
This role involves architecting and implementing business-critical features for distributed inference support on AWS custom ML accelerators.