Speechify is seeking a skilled Software Engineer for their Data Infrastructure & Acquisition team to enhance data collection for AI model training. The role focuses on building and operating cloud infrastructure, particularly in Google Cloud, to support the ingestion pipeline
Job Summary
The role is responsible for all aspects of data collection to support model training operations at petabyte-scale.
Candidates will operate and extend the cloud infrastructure running on GCP using Terraform to ingest audio data.
Speechify offers a competitive salary range of $140,000-$200,000 plus bonus and equity in a fully distributed setting.
Matching Summary
Match Score: 85
Speechify is seeking a skilled Software Engineer for their Data Infrastructure & Acquisition team to enhance data collection for AI model training. The role focuses on building and operating cloud infrastructure, particularly in Google Cloud, to support the ingestion pipeline.
Salary
Base: $140,000-$200,000; Bonus/Equity: + bonus + equity; Benefits: Not specified
Skills & Requirements
Must-have
5+ years software development experience
Proficiency in bash and Python scripting
Experience with Docker and Infrastructure-as-Code
Professional experience with GCP cloud provider
Nice-to-have
Experience with web crawlers
Large-scale data processing workflows
Ability to handle multiple tasks
Strong written and verbal communication skills
Key Requirements
BS/MS/PhD in Computer Science or related field
5+ years industry experience in software development