Remote Senior Software Engineer – LLM Evaluation (US-based)
- Hiring Organisation
- Turing
- Location
- Oxford, England, United Kingdom
tools in Python that can verify the quality of code and identify error patterns. Hypothesize on steps in the software engineering cycle (prototyping, architecture design, API design, production implementation, launch, experiments, monitoring, operational maintenance) and evaluate model capabilities on them. Design verification mechanisms that can automatically … building production-grade software. Experience building full-stack applications and deploying scalable software using modern languages and tools. Deep understanding of software architecture, design, development, debugging, and code quality/review assessment. Excellent oral and written communication skills for clear, structured evaluation rationales. Engagement Details: Commitment: flexible engagement, minimum ...