City of London, London, United Kingdom Hybrid / WFH Options
ARC IT Recruitment Ltd
/MTTR via automation, clear SLAs, and robust RCAs/post-mortems. Safer, faster releases (blue/green, canary, feature flags) in partnership with Trading, Quant, and Engineering. Mature observability (logs/metrics/traces), capacity planning, and performance tuning for low-latency flows. Strong production hygiene and controls aligned to MiFID II/MAR/best-ex. Leadership of More ❯
City of London, London, United Kingdom Hybrid / WFH Options
SR2 | Socially Responsible Recruitment | Certified B Corporation™
ideally Python , Rust is a bonus Experience with distributed systems, REST APIs, and microservices Knowledge of Kafka (or similar), PostgreSQL , and time-series data Familiar with Docker, monitoring, and observability tools ✅ Experience in a startup or scale-up , collaborating closely with engineers in a fast-moving environment Bonus points if you’ve worked in energy markets, trading systems, industrial control More ❯
production-grade AI/ML applications, including LLMs and anomaly detection models. Familiarity with cloud infrastructure (AWS preferred), container orchestration (Kubernetes), and workflow tools (Airflow, Argo). Experience with observability tools (e.g., Grafana, CloudWatch) and RESTful API development. More ❯
SR2 | Socially Responsible Recruitment | Certified B Corporation™
ideally Python , Rust is a bonus Experience with distributed systems, REST APIs, and microservices Knowledge of Kafka (or similar), PostgreSQL , and time-series data Familiar with Docker, monitoring, and observability tools ✅ Experience in a startup or scale-up , collaborating closely with engineers in a fast-moving environment Bonus points if you’ve worked in energy markets, trading systems, industrial control More ❯
of investment into the latest tech & AWS tools What they're looking for... Strong experience within AWS & AWS services within networking and security Proficient within Terraform, CloudFormation or Ansible Observability tools like Cloud Watch, CloudTrail, OpenSearch Grafana/Kinesis Have a background within core infrastructure services like networking, security, patching and has transitioned to a Platform/Cloud focused Engineer More ❯
of investment into the latest tech & AWS tools What they're looking for... Strong experience within AWS & AWS services within networking and security Proficient within Terraform, CloudFormation or Ansible Observability tools like Cloud Watch, CloudTrail, OpenSearch Grafana/Kinesis Have a background within core infrastructure services like networking, security, patching and has transitioned to a Platform/Cloud focused Engineer More ❯
london (croydon), south east england, united kingdom
Morson Edge
of investment into the latest tech & AWS tools What they're looking for... Strong experience within AWS & AWS services within networking and security Proficient within Terraform, CloudFormation or Ansible Observability tools like Cloud Watch, CloudTrail, OpenSearch Grafana/Kinesis Have a background within core infrastructure services like networking, security, patching and has transitioned to a Platform/Cloud focused Engineer More ❯
of investment into the latest tech & AWS tools What they're looking for... Strong experience within AWS & AWS services within networking and security Proficient within Terraform, CloudFormation or Ansible Observability tools like Cloud Watch, CloudTrail, OpenSearch Grafana/Kinesis Have a background within core infrastructure services like networking, security, patching and has transitioned to a Platform/Cloud focused Engineer More ❯
hands-on experience in Microsoft Azure ML Studio * Experience using business intelligence tools, preferably Power BI * Experience applying Generative AI and prompting techniques * Strong understanding of data governance, model observability, and compliance frameworks * Proven ability to deliver secure, scalable, and responsible data science solutions If this sounds like you and you are available on short notice, apply now More ❯
environments. Excellent communication skills and a strong interest in the application of AI in public services. Desirable: Experience with multi-agent orchestration (LangGraph, AutoGen, CrewAI). Familiarity with AI observability tools (TruLens, Helicone). Awareness of AI safety and reliability frameworks (Guardrails AI). Experience working in government or public sector digital projects . More ❯
City of London, London, United Kingdom Hybrid / WFH Options
Amber Labs
environments. Excellent communication skills and a strong interest in the application of AI in public services. Desirable: Experience with multi-agent orchestration (LangGraph, AutoGen, CrewAI). Familiarity with AI observability tools (TruLens, Helicone). Awareness of AI safety and reliability frameworks (Guardrails AI). Experience working in government or public sector digital projects . More ❯
london, south east england, united kingdom Hybrid / WFH Options
Amber Labs
environments. Excellent communication skills and a strong interest in the application of AI in public services. Desirable: Experience with multi-agent orchestration (LangGraph, AutoGen, CrewAI). Familiarity with AI observability tools (TruLens, Helicone). Awareness of AI safety and reliability frameworks (Guardrails AI). Experience working in government or public sector digital projects . More ❯
london (city of london), south east england, united kingdom Hybrid / WFH Options
Amber Labs
environments. Excellent communication skills and a strong interest in the application of AI in public services. Desirable: Experience with multi-agent orchestration (LangGraph, AutoGen, CrewAI). Familiarity with AI observability tools (TruLens, Helicone). Awareness of AI safety and reliability frameworks (Guardrails AI). Experience working in government or public sector digital projects . More ❯
company's customer experience (CX) vision. You will collaborate closely with other software engineers, product teams, and AI specialists to develop LLM AI-powered applications, ensuring their scalability, security, observability and performance. This role is hands-on, with a primary focus on coding, testing, and deploying AI solutions in a fast-paced, agile environment. Responsibilities: Code Development and Testing Write More ❯
practices for automation tools such as Power Automate Desktop. * Build out robust ALM processes using Azure DevOps or GitHub - including pipelines, solution management, environment variables, and connection references. * Implement observability and monitoring through Application Insights, Azure Monitor, and alerting frameworks. * Design secure integration layers using Azure services such as API Management, Service Bus, Functions, Logic Apps, and Key Vault. * Lead More ❯
practices for automation tools such as Power Automate Desktop.* Build out robust ALM processes using Azure DevOps or GitHub - including pipelines, solution management, environment variables, and connection references.* Implement observability and monitoring through Application Insights, Azure Monitor, and alerting frameworks.* Design secure integration layers using Azure services such as API Management, Service Bus, Functions, Logic Apps, and Key Vault.* Lead More ❯
London, England, United Kingdom Hybrid / WFH Options
Client Server
crypto offering and split your time between hands-on development with people management (70/30). You'll set the technical direction, mentor engineers and ensure code quality, observability, scalability and security are embedded into high-quality, high-impact releases. You'll be working with a modern, cloud native tech stack using Java, Spring Boot, AWS, Kafka and CI More ❯
London, South East, England, United Kingdom Hybrid / WFH Options
Client Server Ltd
crypto offering and split your time between hands-on development with people management (70/30). You'll set the technical direction, mentor engineers and ensure code quality, observability, scalability and security are embedded into high-quality, high-impact releases. You'll be working with a modern, cloud native tech stack using Java, Spring Boot, AWS, Kafka and CI More ❯
City of London, London, United Kingdom Hybrid / WFH Options
Immersum
Hands-on experience with cloud platforms (AWS preferred) . Excellent problem-solving skills and a “get stuff done” attitude. Nice to Have Experience with Grafana or similar monitoring/observability tools. API endpoint design & maintenance. Prior experience in fast-scaling startups or international data systems. Why Join Us Work directly with leadership (CEO and core team) to influence the company More ❯
Hands-on experience with cloud platforms (AWS preferred) . Excellent problem-solving skills and a “get stuff done” attitude. Nice to Have Experience with Grafana or similar monitoring/observability tools. API endpoint design & maintenance. Prior experience in fast-scaling startups or international data systems. Why Join Us Work directly with leadership (CEO and core team) to influence the company More ❯
london, south east england, united kingdom Hybrid / WFH Options
Immersum
Hands-on experience with cloud platforms (AWS preferred) . Excellent problem-solving skills and a “get stuff done” attitude. Nice to Have Experience with Grafana or similar monitoring/observability tools. API endpoint design & maintenance. Prior experience in fast-scaling startups or international data systems. Why Join Us Work directly with leadership (CEO and core team) to influence the company More ❯
london (city of london), south east england, united kingdom Hybrid / WFH Options
Immersum
Hands-on experience with cloud platforms (AWS preferred) . Excellent problem-solving skills and a “get stuff done” attitude. Nice to Have Experience with Grafana or similar monitoring/observability tools. API endpoint design & maintenance. Prior experience in fast-scaling startups or international data systems. Why Join Us Work directly with leadership (CEO and core team) to influence the company More ❯
Mentor and coach junior and transitioning data engineers to accelerate their development and strengthen the teams overall capabilities. Lead production operations by enforcing standards around testing, CI/CD, observability, and documentation to ensure platform reliability and regulatory compliance. Collaborate effectively with business clients and cross-functional teams to translate requirements into technical solutions and drive innovation across BNY. To More ❯
the architecture of our platform: modular, secure, scalable, and maintainable from day one Define integration patterns across internal services and third-party providers Own key infrastructure choices (messaging systems, observability, deployment strategies, etc.) Collaborate closely with Product Managers, Designers, and Mobile Engineers to shape end-to-end journeys Be hands-on in code when needed, but primarily act as a More ❯
their core software products. Expect a collaborative engineering culture, modern cloud-native stack, and plenty of freedom to influence tooling, architecture, and reliability practices. If youre passionate about automation, observability, and designing systems that just dont fail , this is the perfect environment for you. Tech Stack Cloud: AWS (EC2, RDS, S3, IAM, Lambda, CloudWatch) Containerisation & Orchestration: Docker, Kubernetes (EKS) Infrastructure … as Code: Terraform Configuration Management: Ansible Monitoring & Observability: Prometheus, Grafana, ELK Stack CI/CD: GitHub Actions Scripting & Automation: Python, Bash, or Go What Youll Be Doing Designing and maintaining reliable, scalable, and secure infrastructure for production systems. Automating operational tasks and improving system efficiency. Implementing observability tooling to monitor system health, performance, and capacity. Working closely with development teams … how reliability and performance are engineered at scale. Work with talented developers and DevOps engineers in a collaborative environment. AWS | Site Reliability | SRE | Cloud | Kubernetes | Terraform | CI/CD | Observability | Python | Go | Automation Click APPLY NOW to be considered for this position! Follow ReVybe IT Recruitment to stay up to date with the latest Cloud, Platform & SRE opportunities. More ❯