Research Engineer

# About the role

Science Machine is building agentic AI software for automating bioanalysis workflows. Our platform helps scientists and bioanalysis teams turn complex analytical processes into reliable, repeatable, AI-assisted workflows across data ingestion, analysis, reporting, review, and compliance.

The Research Engineer executes our research projects. The work spans agent systems, how we quantify their performance, and fine-tuning open models. Research direction comes from product feedback; you turn it into shipped results.

You will work close to the science: formulating problems, designing experiments, shipping implementations, curating data, and interpreting results. You will work with biologists, scientific contractors, and the rest of the team, and act as a technical sparring partner on what to build next.

## What you'll do

- Turn research directions into projects: scoping, experiment design, implementation, data, and results.

- Define how we measure system performance: which metrics matter, which proxies are honest, which signals to trust.

- Build the evaluation infrastructure to put those metrics into practice: benchmarks, harnesses, and the tooling around them.

- Develop the research agent stack: memory and in-context learning, test time compute, and models.

- Fine-tune and post-train open models to improve agent performance.

- Work with biologists, scientific contractors, and annotators to build reproducible training and evaluation data pipelines.

- Track what frontier labs are shipping and bring back what's relevant.

## Essential experience

- Experience building ML/AI systems in a research-adjacent context (industry, lab, or PhD).

- Experience building LLM-powered systems: prompts, context engineering, agent architectures.

- Experience working with evaluations and benchmarks, including in tasks where "correct" is ambiguous.

- Familiarity with model training generally, including the data, optimisation, and evaluation work around it.

- Strong engineering fundamentals. Fluent in Python and comfortable across the AI/ML stack.

- Experience running experiments rigorously, in academia or industry. You think about confounds. You can defend your results.

- Experience with training and evaluation data pipelines, including reproducibility and observability.

## Nice to have

- Experience in a life-science domain (biology, chemistry, medicine, bioinformatics).

- Post-training experience on LLMs.

- Peer-reviewed publications, or other settings where your ideas were stress-tested.

- Open source contributions to scientific or AI tooling.

## Essential qualities

- High ownership: you notice what needs doing and carry it through.

- Skeptical of your own results: you assume a good-looking number is wrong until you understand why it isn't.

- Strong opinions, weakly held: you push back, defend a position, and change your mind when evidence moves.

- Hands-on with the unglamorous parts: data cleaning, contractor coordination, eval annotation, whatever the project needs.

- Reliable under ambiguity: you make progress when problems aren't yet well-defined.

- Curious about science: you want to learn the bioanalysis domain.

- Clear communicator: you can explain technical decisions and results to engineers, scientists, and founders.

Job Details

Company
ScienceMachine
Location
Greater London, England, United Kingdom
Posted