Software Engineer, Data Infrastructure & Acquisition - Hyderabad, India
Speechify
Hyderabad, India
Not specified; not specified; competitive salaries...
On-site
Proficiency with bash/python scripting in linux
Experience with docker and infrastructure-as-code
Professional experience with gcp cloud provider
The role is responsible for all aspects of data collection to support model training operations at petabyte-scale
Job Summary
The role is responsible for all aspects of data collection to support model training operations at petabyte-scale.
Candidates will operate and extend the cloud infrastructure for the ingestion pipeline, currently running on GCP and managed with Terraform.
Speechify offers a competitive salary, a friendly atmosphere, and an opportunity to build products that directly impact people with learning differences.
Matching Summary
The role is responsible for all aspects of data collection to support model training operations at petabyte-scale.
Salary
Not specified; Not specified; Competitive salaries mentioned
Skills & Requirements
Must-have
Proficiency with bash/Python scripting in Linux
Experience with Docker and Infrastructure-as-Code
Professional experience with GCP cloud provider
Nice-to-have
Experience with web crawlers and large-scale data processing
Ability to handle multiple tasks and adapt to changing priorities
Strong written and verbal communication skills
Key Requirements
BS/MS/PhD in Computer Science or related field
5+ years of industry experience in software development