AI Data Engineer

Data Engineer AI Startup Real-Time Data & ML Pipelines

 

I'm currently partnered with a next-generation AI startup that's building real-time intelligence systems capable of understanding and adapting to the world as it happens. They're at the forefront of streaming data, continuous learning, and large-scale inference - and they're now looking for a Data Engineer to help power that mission.

 

This isn't a maintenance role. It's a build-it-from-the-ground-up opportunity in a company that's solving genuinely hard, technical problems around real-time data pipelines and ML infrastructure.

 

️ The Opportunity

The team is looking for a Data Engineer who can design and scale real-time data architectures and machine learning pipelines that drive continuous model training and experimentation.

 

You'll be working in close collaboration with ML engineers, product leads, and platform teams to make sure the company's AI models are learning, and improving, in production.

 

What You'll Be Doing

  • Architect and maintain real-time streaming data systems (Kafka, Kinesis, or Flink)
  • Build robust feature pipelines using Airflow, Prefect, or Dagster
  • Manage and optimise data storage solutions (Snowflake, BigQuery, Redshift, or Delta Lake)
  • Automate and scale model training pipelines in close partnership with ML engineers
  • Deploy, observe, and improve pipelines using Docker, Kubernetes, Terraform, or dbt
  • Champion data reliability, scalability, and performance across the platform

 

The Tech Environment

 

You'll likely be working with some combination of:

  • Languages: Python, Scala, Go
  • Streaming: Kafka / Flink / Spark Structured Streaming
  • Workflow orchestration: Airflow / Prefect / Dagster
  • Data storage & processing: Snowflake / Detabricks / BigQuery / Redshift
  • Infrastructure: Docker / Kubernetes / Terraform / dbt
  • Monitoring: Prometheus / Grafana / OpenTelemetry
  • Cloud: AWS / GCP / Azure

 

What They're Looking For

  • Proven experience building scalable data pipelines in real-time or near real-time environments
  • Strong background in data architecture, performance tuning, and distributed systems
  • Comfort working end-to-end - from data ingestion to model-ready outputs
  • An interest in AI, ML ops, and data-driven product development
  • Someone who thrives in fast-moving, high-ownership start-up environments

 

Why This Role?

  • Join a well-funded AI startup with massive growth potential
  • Be the first Data Engineer, shaping the entire data backbone
  • Work with exceptionally talented engineers and researchers solving real-world AI problems
  • Competitive package, plus meaningful equity
  • A culture that values autonomy, creativity, and continuous learning

 

Interested?

If you've built or scaled data pipelines in production and want to be part of something genuinely cutting-edge, I'd love to chat.

 

Drop me a message here on LinkedIn or send over your details - happy to share more about the team, the vision, and what they're building.

Company
SR2 | Socially Responsible Recruitment | Certified B Corporation
Location
South East, United Kingdom
Employment Type
Permanent
Salary
GBP Annual
Posted
Company
SR2 | Socially Responsible Recruitment | Certified B Corporation
Location
South East, United Kingdom
Employment Type
Permanent
Salary
GBP Annual
Posted