Hyphen Connect is looking for an experienced LLM Pre-training & Distributed Systems Engineer to manage large-scale machine learning training operations and optimize distributed infrastructure. The ideal candidate should possess deep expertise in GPU clusters and systems engineering, particularly with tools like PyTorch and Kubernetes.
Must-have
Nice-to-have
Not specified