The Sr. SDE will own the LLM-as-a-Judge evaluation pipeline — designing, building, and scaling automated evaluation systems that leverage large language models to assess data quality
Job Summary
The Sr. SDE will own the LLM-as-a-Judge evaluation pipeline — designing, building, and scaling automated evaluation systems that leverage large language models to assess data quality.
The Sr. SDE will design and build GenAI-powered diagnostic and workflow tools — including conversational troubleshooting agents, automated quality assessment tools, guided remediation systems, and workflow copilots.
The Sr. SDE will design and implement robust backend services, APIs, and data pipelines on AWS leveraging Amazon Bedrock, SageMaker, Lambda, ECS/EKS, Step Functions, DynamoDB, OpenSearch, and S3.
Matching Summary
The Sr. SDE will own the LLM-as-a-Judge evaluation pipeline — designing, building, and scaling automated evaluation systems that leverage large language models to assess data quality.