Software Engineer, Data Infrastructure & Acquisition - Chennai, India
Speechify
Chennai, India
Competitive salaries; not specified; not specified
On-site
Proficiency with bash/python scripting in linux environments
Experience with docker and infrastructure-as-code concepts
Professional experience with gcp cloud provider
The role is responsible for all aspects of data collection to support model training operations at petabyte-scale
Job Summary
The role is responsible for all aspects of data collection to support model training operations at petabyte-scale.
Candidates will operate and extend the cloud infrastructure for the ingestion pipeline, currently running on GCP and managed with Terraform.
Speechify offers a competitive salary, a friendly atmosphere, and an opportunity to build products that directly impact people with learning differences.
Matching Summary
The role is responsible for all aspects of data collection to support model training operations at petabyte-scale.
Salary
Competitive salaries; Not specified; Not specified
Skills & Requirements
Must-have
Proficiency with bash/Python scripting in Linux environments
Experience with Docker and Infrastructure-as-Code concepts
Professional experience with GCP cloud provider
Nice-to-have
Experience with web crawlers and large-scale data processing
Ability to handle multiple tasks and adapt to changing priorities
Strong written and verbal communication skills
Key Requirements
BS/MS/PhD in Computer Science or related field
5+ years of industry experience in software development