Research Engineer
# About the role
Science Machine is building agentic AI software for automating bioanalysis workflows. Our platform helps scientists and bioanalysis teams turn complex analytical processes into reliable, repeatable, AI-assisted workflows across data ingestion, analysis, reporting, review, and compliance.
The Research Engineer executes our research projects. The work spans agent systems, how we quantify their performance, and fine-tuning open models. Research direction comes from product feedback; you turn it into shipped results.
You will work close to the science: formulating problems, designing experiments, shipping implementations, curating data, and interpreting results. You will work with biologists, scientific contractors, and the rest of the team, and act as a technical sparring partner on what to build next.
## What you'll do
- Turn research directions into projects: scoping, experiment design, implementation, data, and results.
- Define how we measure system performance: which metrics matter, which proxies are honest, which signals to trust.
- Build the evaluation infrastructure to put those metrics into practice: benchmarks, harnesses, and the tooling around them.
- Develop the research agent stack: memory and in-context learning, test time compute, and models.
- Fine-tune and post-train open models to improve agent performance.
- Work with biologists, scientific contractors, and annotators to build reproducible training and evaluation data pipelines.
- Track what frontier labs are shipping and bring back what's relevant.
## Essential experience
- Experience building ML/AI systems in a research-adjacent context (industry, lab, or PhD).
- Experience building LLM-powered systems: prompts, context engineering, agent architectures.
- Experience working with evaluations and benchmarks, including in tasks where "correct" is ambiguous.
- Familiarity with model training generally, including the data, optimisation, and evaluation work around it.
- Strong engineering fundamentals. Fluent in Python and comfortable across the AI/ML stack.
- Experience running experiments rigorously, in academia or industry. You think about confounds. You can defend your results.
- Experience with training and evaluation data pipelines, including reproducibility and observability.
## Nice to have
- Experience in a life-science domain (biology, chemistry, medicine, bioinformatics).
- Post-training experience on LLMs.
- Peer-reviewed publications, or other settings where your ideas were stress-tested.
- Open source contributions to scientific or AI tooling.
## Essential qualities
- High ownership: you notice what needs doing and carry it through.
- Skeptical of your own results: you assume a good-looking number is wrong until you understand why it isn't.
- Strong opinions, weakly held: you push back, defend a position, and change your mind when evidence moves.
- Hands-on with the unglamorous parts: data cleaning, contractor coordination, eval annotation, whatever the project needs.
- Reliable under ambiguity: you make progress when problems aren't yet well-defined.
- Curious about science: you want to learn the bioanalysis domain.
- Clear communicator: you can explain technical decisions and results to engineers, scientists, and founders.