Databases (Mongo). Test automation following Test Driven Development practices, including unit, integration and end-to-end testing. Supporting a highly available production system: diagnosing issues raised from logs and observability tooling (Dynatrace), triage and resolution. Company Benefits: a competitive salary, pension scheme and life assurance, along with 25 days annual leave plus an additional day on us for your birthday.
release strategy balancing risk and speed of delivery. Assisting the team with support process and incident management. Pairing with other team members and encouraging a focus on quality and observability in the team. Working closely with the Product Owner and Delivery Lead on prioritizing product features and uncovering edge cases. Managing the issue backlog and facilitating bug triage based on …
Advisor, and NLP-driven analytics. Background with BI tools such as Tableau, Domo, or Sisense. Web development experience using HTML, CSS, JavaScript, Python. Understanding of data governance, lineage, and observability tools. Familiarity with AI model monitoring, interpretability, and ethical AI. Knowledge of operational resilience and compliance frameworks such as DORA. Salary Range = 160000 - 220000 USD Annually + Benefits + Bonus More ❯
mesh, API gateways, and commercial vs. open source software. Approaches to managing architectural debt, and architecture governance and evolution in practice. Microservices topologies, including operational concerns such as resiliency, observability, discovery and routing, security, etc. Have experience with, and understand how to lead, legacy integration and remediation (facades, strangler approaches, et al.). Deep understanding of different integration patterns and …
frameworks, sharing insights, and experimenting with cutting-edge technologies. Preferred Requirements Experience integrating Salesforce with external agents via APIs and open standards (MCP, A2A). Familiarity with prompt governance, observability, and monitoring frameworks. Background in cross-platform integrations (e.g., Hyperscaler SDKs to Salesforce Flows). Prior exposure to conversational voice pipelines or multimodal integrations via hyperscaler services. Advanced AI/ More ❯
They will be a strong communicator, and may have previously worked in an SRE role, a software engineering role or a systems engineering role. Key Responsibilities: Participate in building observability, monitoring and alerting for key services - continuously improving our SLIs & SLOs and observability data, enabling faster issue detection and incident resolution. Collaborate with senior engineers and product teams to ensure …
engineering role. This role is a Sunday-Thursday shift. Key Responsibilities: Support the migration of Clearing applications to Google Cloud Platform (GCP) and 7-day trading. Participate in building observability, monitoring and alerting for key services - continuously improving our SLIs & SLOs and observability data, enabling faster issue detection and incident resolution. Collaborate with senior engineers and product teams to ensure …
and help shape how platform engineering is done as the team continues to scale. Tech stack: AWS (core services: EC2, RDS, S3, IAM, etc.); Configuration Management: Ansible; Monitoring and Observability: Grafana, Prometheus; Kubernetes (building and managing production clusters); Terraform (IaC provisioning); GitHub Actions (CI/CD pipelines). What They’re Looking For: Experience in AWS cloud infrastructure (ideally in a … regulated or high-traffic environment). Previous experience working with monitoring and observability tools. Hands-on Kubernetes know-how, specifically with EKS. Solid IaC experience with Terraform. Experience with containerisation (Docker, Helm) and CI/CD (GitHub Actions or similar). A good communicator who enjoys working collaboratively across product and engineering. The client is willing to take someone that doesn't …
fund administrators, and institutional investors around the world. Working closely with our engineering teams, you'll design and maintain cloud infrastructure, improve our CI/CD pipelines, and enhance observability so we can ship high-quality features quickly and confidently. You'll bring your initiative as well as your technical skills to solve real operational challenges, ensuring our systems are … production and staging environments Build and improve CI/CD pipelines to support rapid, high-quality deployments Monitor and improve system availability, performance, and cost-efficiency Implement and manage observability tools (logging, metrics, tracing) Enhance infrastructure-as-code using AWS CDK and related tools Collaborate with engineers to streamline development workflows and deployment strategies Champion DevOps best practices across the More ❯
Architect for Scale & Resilience: Make critical decisions on system design and performance to support a growing platform with increasing complexity and scale. Elevate Operational Maturity: Lead improvements to monitoring, observability, and developer workflows - ensuring backend systems are resilient and teams can ship confidently. Embed Security by Design: Take responsibility for backend security posture, ensuring systems meet best practices and compliance … and SQS. Infrastructure as Code: Experience with Terraform or similar tools for infrastructure automation. High-Throughput Systems: Strong experience in real production projects handling large-scale data flows. Monitoring & Observability: Proficiency in tools like Datadog, Prometheus, and Grafana. Security & Networking: Solid understanding of networking principles, security best practices, and cloud security. Agile & Fast-Paced Environments: Experience in agile teams, working More ❯
to ship quickly, securely, and reliably at scale. You'll guide the evolution of our document processing pipelines, data stores, and AI integrations, while championing DevOps/SRE practices, observability, security, and cost efficiency. You'll influence engineering culture, mentor engineers, and help define architectural direction across the company. This is an opportunity to build the foundations of a global … Finance to align platform priorities with business goals. Infrastructure & Operations Lead adoption of containers, Kubernetes, and Infrastructure as Code (Terraform/CDK). Build and enforce best practices in observability, monitoring, and incident response. Set and maintain SLOs/SLAs for platform reliability and performance. Drive FinOps practices: forecasting, monitoring, and optimizing infrastructure spend. Developer Experience Build internal tooling, templates More ❯
more than 10 individuals) Career coaching Skills & Mentoring I can mentor in: Technical communication, cross-functional collaboration, performance reviews, managing up Python, data structures, machine learning, LLM fine-tuning, observability, large-scale ML deployments Availability times are shown in your local timezone. Reviews Recent reviews highlight the effectiveness of coaching sessions in ML system design, behavioral interview preparation, career guidance More ❯
AWS Bedrock, LangGraph, and MCP. Partner with business leaders to identify opportunities for AI-powered transformation across customer and employee journeys. Ensure AI systems are designed with security, scalability, observability, and compliance in mind. Guide technical teams on best practices in MLOps, data integration, and lifecycle management. Present AI strategies and solutions to senior stakeholders and clients, aligning technology with More ❯
performing team of ML engineers. Combine ML with physics-based risk models (flooding, tropical cyclones, wildfires) to deliver grounded, high-impact solutions. Establish gold-standard practices for evaluation, deployment, observability, and maintainability in ML model development. Turn complex technical challenges into clear business outcomes for colleagues and customers. Requirements: MSc or PhD Degree in Computer Science, Artificial Intelligence, Mathematics, Statistics More ❯
london (city of london), south east england, united kingdom
Harnham
performing team of ML engineers. Combine ML with physics-based risk models (flooding, tropical cyclones, wildfires) to deliver grounded, high-impact solutions. Establish gold-standard practices for evaluation, deployment, observability, and maintainability in ML model development. Turn complex technical challenges into clear business outcomes for colleagues and customers. Requirements: MSc or PhD Degree in Computer Science, Artificial Intelligence, Mathematics, Statistics More ❯
TDD (NUnit). Expertise in RESTful & GraphQL APIs, Git, and SOLID principles. Strategic thinking, strong communication, and a love for collaboration. Bonus: Experience with Azure, DevOps, Entity Framework, and observability practices. Why You'll Love It Here: Developer-led culture with hack days, and open access to leadership. Transparent progression and tailored development plans. Great perks: profit share, training budget More ❯
models for real-time inference. Responsibilities Develop APIs for AI inference used by internal and external clients Benchmark and optimize bottlenecks in our inference stack Enhance system reliability and observability, and manage system outages Research and implement optimizations for LLM inference Qualifications Experience with ML systems and deep learning frameworks such as PyTorch, TensorFlow, ONNX Knowledge of LLM architectures and More ❯
in biotech, pharma, or AI-driven drug discovery Experience in both large organisations (with structured processes and metrics) and smaller/startup environments (delivering with limited resources) Knowledge of observability and reliability practices for product platforms Security or compliance experience Why Join? Be part of a world-class AI-first research environment shaping the future of drug discovery Work on More ❯