Observability Jobs in the Thames Valley

51 to 72 of 72 Observability Jobs in the Thames Valley

Senior Software Engineer

slough, south east england, united kingdom
Ripple
and evolving the platform to meet new business and compliance requirements. By advancing an already established production system, you will play a key role in increasing scale, efficiency, and observability, ensuring Ripple products interact seamlessly with global banks and payment providers in a consistent and trustworthy way. We are looking for a software engineer passionate about learning and understanding the More ❯
Posted:

Linux Production Engineer

slough, south east england, united kingdom
Autonomai Recruitment
distributed systems Contribute to ongoing improvements in reliability, latency, and scalability Qualifications: Linux expertise with a solid understanding of networking and containerisation Proficiency in at least Python Experience with observability tooling Proven track record in designing and maintaining highly distributed systems Apply now for a confidential chat More ❯
Posted:

Crypto Production Engineer

slough, south east england, united kingdom
Autonomai Recruitment
environments Skills & Experience Passion for blockchain and appetite to learn and innovate Experience managing proof-of-stake blockchain infrastructure Linux expertise with hands-on experience in Kubernetes, networking, and observability tools Familiarity with infrastructure-as-code and configuration management tools Proven ability to design, build, and maintain complex, high-availability systems Exposure to DeFi, on-chain trading Comfortable working in More ❯
Posted:

Cloud Engineer

slough, south east england, united kingdom
algo1
how to manage workloads at scale. Proficient with Infrastructure as Code tools and practices. Comfortable writing automation, configuration, and tooling to simplify operations and reduce manual effort. Knowledgeable about observability tools & best practices. Ability to collaborate across teams with excellent written and verbal communication skills. Nice to Have Qualifications: Experience with multi-cloud and/or hybrid deployments. Knowledge of More ❯
Posted:

GenAI Solution Architect

slough, south east england, united kingdom
Capgemini
Establish best practices for prompt engineering, model safety, bias mitigation, and responsible AI. Ensure compliance with data privacy regulations (GDPR, HIPAA, etc.) and internal governance policies. Define monitoring and observability strategies for GenAI systems in production. Stakeholder Engagement Translate business requirements into technical specifications and solution blueprints. Present architectural decisions and trade-offs to technical and non-technical stakeholders. Support More ❯
Posted:

Solutions Architect – Payment Platforms & POS Integration

slough, south east england, united kingdom
Hybrid / WFH Options
YQN Pay
and post-launch operations. Develop and maintain reference architectures, documentation, and governance processes for ongoing platform enhancements. Guide adoption of modern infrastructure approaches, including cloud-native deployments, microservices, and observability frameworks. Contribute directly to business growth through hands-on architecture while mentoring junior engineers as the team scales. Align technology designs with compliance, regulatory, and security requirements (e.g., PCI DSS More ❯
Posted:

Product Engineer (Backend)

slough, south east england, united kingdom
Granola
of our backend infrastructure Design and implement performant APIs and services Build infrastructure to support cutting-edge AI capabilities Optimise database performance and query efficiency Continuously improve reliability and observability through enhanced monitoring and alerting Collaborate cross-functionally to ensure our infrastructure supports continuous product innovation Your background looks something like: Engineering experience in tech and product-driven environments Strong More ❯
Posted:

Cloud Engineer DV Cleared

slough, south east england, united kingdom
Damia Group
automation, and container orchestration. You will be instrumental in shaping enterprise-ready cloud solutions by applying deep technical expertise in AWS alongside knowledge of multi-cloud environments, identity management, observability, and cost optimisation. Key Responsibilities Design and implement secure, scalable AWS cloud architectures Drive Infrastructure as Code (IaC) adoption using Terraform and CloudFormation Build, optimise, and automate CI/CD … GitHub Actions, and related tools Deploy and manage containerised solutions with Docker, Kubernetes, and Helm Implement strong security and access controls using IAM, Vault, and Secrets Manager Enhance platform observability using Prometheus, Grafana, and ELK Stack Collaborate with cross-functional teams to deliver robust, high-availability solutions Key Skills & Experience Extensive hands-on experience with AWS (Azure knowledge beneficial) Expertise … in Terraform, CloudFormation, and automation tooling Strong containerisation skills with Kubernetes, Docker, and related platforms Proven background in cloud security, IAM, and governance Solid understanding of monitoring and observability stacks Ability to influence architecture decisions and align solutions to best practices Desired Certifications AWS Certified Solutions Architect – Associate/Professional AWS Certified Security – Specialty HashiCorp Certified: Terraform Associate Kubernetes Certified More ❯
Posted:

Platform Engineer

slough, south east england, united kingdom
Reelables
Lead infrastructure-as-code initiatives using Terraform and CloudFormation Build and maintain CI/CD pipelines that enable our team to deploy confidently multiple times per day Implement comprehensive observability across our entire technology stack Drive initiatives to improve developer productivity and system reliability Innovation & Technical Leadership Research and implement cutting-edge technologies for IoT data processing and real-time … real-time analytics, and time-series databases DevOps & Reliability Engineering Proven track record building and maintaining CI/CD pipelines in production environments Deep understanding of monitoring, logging, and observability tools and practices Experience with container orchestration (Docker, Kubernetes) and microservices architectures Knowledge of security best practices for cloud-native applications Scale & Performance Focus Experience building systems that handle millions More ❯
Posted:

Staff Software Engineer

slough, south east england, united kingdom
Hybrid / WFH Options
Albany Growth
and system architecture across multiple teams. Tackle complex engineering challenges end-to-end, delivering the most critical components yourself. Champion engineering best practices—from design and implementation to testing, observability, and scaling. Collaborate with product, data, and business teams to align priorities and ensure technical solutions meet real business goals. Mentor and coach engineers and technical leads, helping them grow More ❯
Posted:

Senior Full Stack Engineer

slough, south east england, united kingdom
Xapien
code across the stack. Participating in architectural discussions and helping shape engineering best practices. Troubleshooting and resolving production issues across services and systems. Contributing to CI/CD pipelines, observability, and automation alongside platform engineers. Your Skills & Experience: Must-haves to be successful in this role: Strong experience writing backend services in Go. Proficiency in React and modern JavaScript/… and code styles. Nobody can do everything, but here are a few related things we’re interested in: Experience working lower in the stack, e.g., databases, infrastructure, Kubernetes, or observability tooling. Exposure to CI/CD tooling Interest in natural language processing, AI, or distributed systems. Here’s our promise to you: We are going to work with you – to More ❯
Posted:

Product Analytics Lead - AI + SaaS Scale-up - London (Hybrid)

slough, south east england, united kingdom
Hybrid / WFH Options
Stax - Deeptech Talent
supports embedded dashboards, cross-source joins, and both human and AI-native querying (e.g. Text-to-SQL). Leverage in-app instrumentation (session replays, logs, engagement events) to enable observability, anomaly detection, and customer experience analytics. Build AI-Enabled Interactions Lead the development of agentic capabilities—such as natural language querying, AI-generated dashboards, and real-time recommendations—integrating structured More ❯
Posted:

Senior DevOps Engineer

slough, south east england, united kingdom
TechShack
DevOps initiatives, improving CI/CD pipelines and automation Promoting a security-first mindset (ISO/SOC2 standards) Working closely with developers to enhance developer experience and efficiency Driving observability and reliability improvements (SRE-style) Helping shape infrastructure standards, tooling, and reusable code modules Tech Environment AWS (EKS, Lambda, Step Functions, Batch, API Gateway) Terraform (core IaC tool) Kubernetes (EKS … and Helm charts Python (used for Lambdas and testing) Prometheus + Loki for monitoring and observability Serverless-first architecture approach What They’re Looking For Hands-on AWS experience (not just certifications) Strong Terraform and Kubernetes (EKS) skills Solid understanding of cloud security and compliance Confidence to lead discussions and challenge technical decisions constructively Bonus: Python experience and exposure to More ❯
Posted:

Principal Software Engineer

slough, south east england, united kingdom
Orbis Group
evolution of real-time APIs and data platforms handling billions of requests Stay deeply hands-on with Golang while influencing design and long-term technical strategy Drive improvements in observability, testing, and performance across all services Mentor senior engineers and play a key role in shaping engineering culture What They’re Looking For 8+ years’ backend engineering experience, ideally at … designing distributed systems and event-driven architectures (Kafka, RabbitMQ, WebSockets) Deep understanding of PostgreSQL , Redis , and high-performance data systems Strong DevOps mindset — CI/CD, infrastructure as code, observability (Grafana, Prometheus, OpenTelemetry) Exceptional communicator, able to influence architecture and direction across teams Nice to Have Experience with AWS , Kubernetes , or other cloud-native environments Exposure to financial data systems More ❯
Posted:

Platform Engineer

Caldecotte, Milton Keynes, Buckinghamshire, England, United Kingdom
Connells Group HQ
mindset, working directly with development teams to understand their needs and deliver solutions. You will work across multiple technical domains including orchestration, automation, CI/CD pipelines, cloud services, observability, and security, developing deeper expertise in areas that align with platform priorities and your interests. Experience with Microsoft Azure is essential.You will play your part in operating the platform aligned … with Docker and basic Kubernetes concepts Understanding of cloud networking concepts (VNets, subnets, NSGs) Awareness of cloud security best practices and compliance requirements Basic knowledge of monitoring, logging, and observability tools Understanding of cloud cost management and resource optimisation principles Comfort with troubleshooting and supporting development teams Understanding of service reliability and incident response practices Connells Group UK is an More ❯
Employment Type: Full-Time
Salary: Competitive salary
Posted:

Staff Quality Engineer (Waitrose Apps)

Bracknell, Berkshire, South East, United Kingdom
Hybrid / WFH Options
John Lewis Head Office
the teams checks, your role in the team will be to mentor others in testing practice; coach them to adopt and improve their quality approaches including deployment approaches and observability; review and contribute to the teams codebase and pipeline configuration; help the team with their system of work from first business need to monitoring services in production. At all times … performance, resource usage, variable bandwidth, device compatibility, accessibility etc.) and advising on how these risks should be mitigated. Understanding operational and non-functional requirements (such as resilience, performance and observability) and how solutions are implemented and tested. Desirable skills/experience you may have Bitrise/Gitlab CI GraphQL Backend for Frontend (BFF) patterns Microservice Architectures Experience of cloud infrastructure More ❯
Employment Type: Permanent, Work From Home
Salary: £90,000
Posted:

Staff Quality Engineer (Waitrose Apps)

reading, south east england, united kingdom
Hybrid / WFH Options
John Lewis Head Office
the teams checks, your role in the team will be to mentor others in testing practice; coach them to adopt and improve their quality approaches including deployment approaches and observability; review and contribute to the teams codebase and pipeline configuration; help the team with their system of work from first business need to monitoring services in production. At all times … performance, resource usage, variable bandwidth, device compatibility, accessibility etc.) and advising on how these risks should be mitigated. Understanding operational and non-functional requirements (such as resilience, performance and observability) and how solutions are implemented and tested. Desirable skills/experience you may have Bitrise/Gitlab CI GraphQL Backend for Frontend (BFF) patterns Microservice Architectures Experience of cloud infrastructure More ❯
Posted:

Senior Software Engineer ML Platform & GenAI [Dubai Based]

slough, south east england, united kingdom
oryxsearch.io
Senior Software Engineer II – ML Platform & GenAI (relocation to dubai) Location: Dubai, United Arab Emirates As part of the Machine Learning Platform & Generative AI Applications team, this role sits at the intersection of engineering, data science, and product development. The More ❯
Posted:

Machine Learning Engineer

buckinghamshire, south east england, united kingdom
Hybrid / WFH Options
Rightmove
scientists to take models from development to production-grade systems, ensuring scalability, reproducibility, and robustness. Automating feature engineering and data pipeline processes, ensuring reproducibility and auditability. Implementing monitoring and observability to detect drift, bias, and performance degradation, and setting up rollback/recovery processes. Using MLOps tools (e.g., Vertex Pipelines, Kubeflow, Weights & Biases) for experiment tracking, model registry, and automated … distributed systems). 3+ years of experience as an ML Engineer, MLOps Engineer, Data Engineer, or similar, in a larger-scale, production-focused environment. Hands-on with model monitoring, observability, and retraining pipelines. Exposure to feature stores, registries, and experimentation frameworks. Familiarity with business-driven metrics and experience balancing ML performance with commercial goals. Experience with generative AI and LLM More ❯
Posted:

Azure AI Engineer - Contract

slough, south east england, united kingdom
Adeptis Group
is adopted across a national-level environment. Key Responsibilities Design, develop, test, and deploy Agentic AI solutions on Microsoft Azure AI Foundry. Build and optimise AI pipelines , orchestration, and observability frameworks. Ensure solutions meet security, governance, and compliance standards. Collaborate with cross-functional teams to align AI projects with business outcomes. Develop proof-of-concepts and evaluate their business impact. … ML Studio . Strong understanding of data science, machine learning , and agentic AI orchestration . Experience with Generative AI, NLP, and prompt engineering . Solid grasp of AI governance, observability , and responsible AI frameworks. Excellent communication skills and stakeholder collaboration experience. Desirable Microsoft Certified: Azure AI Engineer Associate . For more information, apply via the link or contact Hubert Colvin More ❯
Posted:

Application Test Manager - Platform Migration (Contract)

slough, south east england, united kingdom
Alexander Ash Consulting
Alexander Ash are partnering with a leading markets firm undertaking a major transformation of its legacy Market Data infrastructure, a platform supporting thousands of mission-critical applications and workloads into a modern, cloud-native, performance-optimised environment with enhanced scalability More ❯
Posted:

Enterprise Sales Development Representative

slough, south east england, united kingdom
Venatrix
Enterprise Sales Development Representative – SaaS Sales Development Representative (SDR) – SaaS | £50 Base + £85k OTE |/Tech Sales/DevOps/Observability Are you an ambitious SDR ready to take on bigger deals, better coaching, and a clearer path to promotion? We’re hiring a Sales Development Representative for one of the fastest-growing SaaS scale-ups in the DevOps … and observability space. This company is transforming how engineering and security teams understand and act on their data—offering real-time insights while reducing observability costs by up to 70%. With strong funding and a leadership team from top-tier SaaS companies, they’re now expanding their UK footprint and building a high-performance sales team in London. The More ❯
Posted:
Observability
the Thames Valley
10th Percentile
£61,625
25th Percentile
£72,500
Median
£75,000
75th Percentile
£84,688
90th Percentile
£85,000