City of London, London, United Kingdom Hybrid/Remote Options
ARC IT Recruitment Ltd
/MTTR via automation, clear SLAs, and robust RCAs/post-mortems. Safer, faster releases (blue/green, canary, feature flags) in partnership with Trading, Quant, and Engineering. Mature observability (logs/metrics/traces), capacity planning, and performance tuning for low-latency flows. Strong production hygiene and controls aligned to MiFID II/MAR/best-ex. Leadership of More ❯
SR2 | Socially Responsible Recruitment | Certified B Corporation™
ideally Python , Rust is a bonus Experience with distributed systems, REST APIs, and microservices Knowledge of Kafka (or similar), PostgreSQL , and time-series data Familiar with Docker, monitoring, and observability tools ✅ Experience in a startup or scale-up , collaborating closely with engineers in a fast-moving environment Bonus points if you’ve worked in energy markets, trading systems, industrial control More ❯
City of London, London, United Kingdom Hybrid/Remote Options
SR2 | Socially Responsible Recruitment | Certified B Corporation™
ideally Python , Rust is a bonus Experience with distributed systems, REST APIs, and microservices Knowledge of Kafka (or similar), PostgreSQL , and time-series data Familiar with Docker, monitoring, and observability tools ✅ Experience in a startup or scale-up , collaborating closely with engineers in a fast-moving environment Bonus points if you’ve worked in energy markets, trading systems, industrial control More ❯
of investment into the latest tech & AWS tools What they're looking for... Strong experience within AWS & AWS services within networking and security Proficient within Terraform, CloudFormation or Ansible Observability tools like Cloud Watch, CloudTrail, OpenSearch Grafana/Kinesis Have a background within core infrastructure services like networking, security, patching and has transitioned to a Platform/Cloud focused Engineer More ❯
of investment into the latest tech & AWS tools What they're looking for... Strong experience within AWS & AWS services within networking and security Proficient within Terraform, CloudFormation or Ansible Observability tools like Cloud Watch, CloudTrail, OpenSearch Grafana/Kinesis Have a background within core infrastructure services like networking, security, patching and has transitioned to a Platform/Cloud focused Engineer More ❯
of investment into the latest tech & AWS tools What they're looking for... Strong experience within AWS & AWS services within networking and security Proficient within Terraform, CloudFormation or Ansible Observability tools like Cloud Watch, CloudTrail, OpenSearch Grafana/Kinesis Have a background within core infrastructure services like networking, security, patching and has transitioned to a Platform/Cloud focused Engineer More ❯
hands-on experience in Microsoft Azure ML Studio * Experience using business intelligence tools, preferably Power BI * Experience applying Generative AI and prompting techniques * Strong understanding of data governance, model observability, and compliance frameworks * Proven ability to deliver secure, scalable, and responsible data science solutions If this sounds like you and you are available on short notice, apply now More ❯
Continuous Deployment and Integration Extensive experience in technical delivery and agile methodologies A strong advocate for test-driven development and a mentor to others in this practice Experience with Observability as a practice (logging, App Insights, metrics and auditing). An expert of DevOps (CI/CD) & Cloud Services (Azure preferred) Expert of several Design Patterns. A master of good More ❯
City of London, London, United Kingdom Hybrid/Remote Options
Amber Labs
environments. Excellent communication skills and a strong interest in the application of AI in public services. Desirable: Experience with multi-agent orchestration (LangGraph, AutoGen, CrewAI). Familiarity with AI observability tools (TruLens, Helicone). Awareness of AI safety and reliability frameworks (Guardrails AI). Experience working in government or public sector digital projects . More ❯
environments. Excellent communication skills and a strong interest in the application of AI in public services. Desirable: Experience with multi-agent orchestration (LangGraph, AutoGen, CrewAI). Familiarity with AI observability tools (TruLens, Helicone). Awareness of AI safety and reliability frameworks (Guardrails AI). Experience working in government or public sector digital projects . More ❯
Continuous Deployment and Integration Extensive experience in technical delivery and agile methodologies A strong advocate for test-driven development and a mentor to others in this practice Experience with Observability as a practice (logging, App Insights, metrics and auditing). An expert of DevOps (CI/CD) & Cloud Services (Azure preferred) Expert of several Design Patterns. A master of good More ❯
United Kingdom, Birmingham, West Midlands (County)
Uniting Ambition
with MLOps practices and AI development frameworks (e.g., Azure AI, LangChain, Hugging Face). Relevant certifications in Azure Architecture, Data, or AI disciplines. Knowledge of automation tools, monitoring, and observability platforms. If you have these skills and would like to find out more, please apply now. More ❯
a subject matter expert internally and externally, including with key customer stakeholders. Translating business requirements into robust technical solutions, ensuring alignment across teams. Promoting strong data governance, documentation, and observability across systems. Staying ahead of industry trends and introducing new technologies and methods into the team. What We’re Looking For Proven experience managing and mentoring data science teams. Strong More ❯
Leading and scaling a technically advanced team responsible for building and productionising a mission-critical backend platform. Architecting and maintaining high-availability, data-intensive systems across AWS with strong observability and monitoring foundations. Collaborating with cross-functional teams to integrate APIs and services, maintaining clean architecture principles. Driving technical quality through mentorship, test-driven development, and modern CI/CD More ❯
Leading and scaling a technically advanced team responsible for building and productionising a mission-critical backend platform. Architecting and maintaining high-availability, data-intensive systems across AWS with strong observability and monitoring foundations. Collaborating with cross-functional teams to integrate APIs and services, maintaining clean architecture principles. Driving technical quality through mentorship, test-driven development, and modern CI/CD More ❯
practices for automation tools such as Power Automate Desktop. * Build out robust ALM processes using Azure DevOps or GitHub - including pipelines, solution management, environment variables, and connection references. * Implement observability and monitoring through Application Insights, Azure Monitor, and alerting frameworks. * Design secure integration layers using Azure services such as API Management, Service Bus, Functions, Logic Apps, and Key Vault. * Lead More ❯
practices for automation tools such as Power Automate Desktop.* Build out robust ALM processes using Azure DevOps or GitHub - including pipelines, solution management, environment variables, and connection references.* Implement observability and monitoring through Application Insights, Azure Monitor, and alerting frameworks.* Design secure integration layers using Azure services such as API Management, Service Bus, Functions, Logic Apps, and Key Vault.* Lead More ❯
world delivery. Partner with Product and Design to align tech decisions with business impact, bringing empathy, pragmatism, and curiosity to every discussion. Build a culture of continuous improvement, from observability and resilience to team rituals and engineering craft. Coach engineers to grow, empowering them to make technical decisions, own outcomes, and share accountability. What are we looking for? We’re More ❯
London, England, United Kingdom Hybrid/Remote Options
Client Server
computing environment with both on-premise and cloud (AWS) based services. You'll collaborate with internal teams to optimise HPC platforms, with a focus on improving performance, resilience and observability of the compute infrastructure. You'll also contribute to continuous improvements initiatives, automating wherever possible and sharing your expertise with the team. Location/WFH: You'll join colleagues based More ❯
South West London, London, United Kingdom Hybrid/Remote Options
Client Server
computing environment with both on-premise and cloud (AWS) based services. You'll collaborate with internal teams to optimise HPC platforms, with a focus on improving performance, resilience and observability of the compute infrastructure. You'll also contribute to continuous improvements initiatives, automating wherever possible and sharing your expertise with the team. Location/WFH: You'll join colleagues based More ❯
services/message buses and other architectural elements Deploy these applications using features such as containers to cloud leveraging CI/CD to support this process backed with good observability when running these in production Ensure quality through the creation of documentation and use of unit/integration/contract testing with a consideration of security/performance requirements More ❯
London, England, United Kingdom Hybrid/Remote Options
Client Server
crypto offering and split your time between hands-on development with people management (70/30). You'll set the technical direction, mentor engineers and ensure code quality, observability, scalability and security are embedded into high-quality, high-impact releases. You'll be working with a modern, cloud native tech stack using Java, Spring Boot, AWS, Kafka and CI More ❯
London, South East, England, United Kingdom Hybrid/Remote Options
Client Server Ltd
crypto offering and split your time between hands-on development with people management (70/30). You'll set the technical direction, mentor engineers and ensure code quality, observability, scalability and security are embedded into high-quality, high-impact releases. You'll be working with a modern, cloud native tech stack using Java, Spring Boot, AWS, Kafka and CI More ❯
and performance bottlenecks, dispatching product-related issues to immediate stakeholders where applicable. Support client, employment, installation and configuration processes Work within a culture of operational excellence, focusing on appropriate observability tooling (monitoring, logging, tracing, alerting, self-healing). Ensure compliance and alignment with wider governance policies and standards by implementing security measures and best practices. Taking part in training and More ❯
their core software products. Expect a collaborative engineering culture, modern cloud-native stack, and plenty of freedom to influence tooling, architecture, and reliability practices. If youre passionate about automation, observability, and designing systems that just dont fail , this is the perfect environment for you. Tech Stack Cloud: AWS (EC2, RDS, S3, IAM, Lambda, CloudWatch) Containerisation & Orchestration: Docker, Kubernetes (EKS) Infrastructure … as Code: Terraform Configuration Management: Ansible Monitoring & Observability: Prometheus, Grafana, ELK Stack CI/CD: GitHub Actions Scripting & Automation: Python, Bash, or Go What Youll Be Doing Designing and maintaining reliable, scalable, and secure infrastructure for production systems. Automating operational tasks and improving system efficiency. Implementing observability tooling to monitor system health, performance, and capacity. Working closely with development teams … how reliability and performance are engineered at scale. Work with talented developers and DevOps engineers in a collaborative environment. AWS | Site Reliability | SRE | Cloud | Kubernetes | Terraform | CI/CD | Observability | Python | Go | Automation Click APPLY NOW to be considered for this position! Follow ReVybe IT Recruitment to stay up to date with the latest Cloud, Platform & SRE opportunities. More ❯