Competitive compensation; meaningful equity; salary not specified
On-site
AI/LLM evaluation frameworks
Python and testing frameworks
Automated testing methodologies
Job Summary
The role focuses on designing evaluation frameworks to ensure the accuracy, safety, and performance of Large Language Model-based systems in the construction industry.
Candidates must have 5-8 years of QA experience with specific expertise in AI/ML systems and the ability to build automated test pipelines.
The company offers high ownership from day one, a small team environment with zero bureaucracy, and competitive compensation including meaningful equity.
Matching Summary
The role focuses on designing evaluation frameworks to ensure the accuracy, safety, and performance of Large Language Model-based systems in the construction industry.
Salary
Competitive compensation; meaningful equity; salary not specified
Skills & Requirements
Must-have
AI/LLM evaluation frameworks
Python and testing frameworks
Automated testing methodologies
Prompt engineering and testing
API testing tools such as Postman
Understanding of NLP concepts
Nice-to-have
Construction or project site experience
Startup experience
Generative AI product experience
CI/CD pipeline knowledge
Cloud platform familiarity (AWS/GCP/Azure)
AI safety and bias detection exposure
Key Requirements
Bachelor's degree in Computer Science or Engineering