|
1 to 25 of 59 Permanent Observability Jobs in Slough
slough, south east england, united kingdom BGC Group
built on Solace PubSub+, ensuring high availability, optimal performance, and reliability across production and non-production environments. This includes working on incident response, capacity planning, WAN optimization, and system observability using tools like Prometheus and Grafana . Key Responsibilities: Administer and maintain Solace PubSub+ appliances and software brokers across environments (on-prem and cloud). Provide production support for messaging More ❯
slough, south east england, united kingdom Humanoid
and core infrastructure - from development and deployment to monitoring and continuous improvement. Build and maintain robust CI/CD pipelines for both software and ML workflows. Ensure reliability, scalability, observability, and security of production systems and ML infrastructure. Automate deployment, orchestration, and environment management using modern DevOps tooling. Collaborate closely with software engineers, data scientists, and product teams to bring More ❯
slough, south east england, united kingdom Bourne Search Ltd
build/test/release Write Python tools to remove toil Containerise services and environments Start shaping Infra as Code (Terraform/Helm/Ansible a plus) Level up observability: logs, metrics, alerts Join incident drills and learn production best practice Qualifications Solid Python for automation Linux, Git, CI/CD basics (GitHub Actions or Jenkins) 2.1 or higher CS More ❯
slough, south east england, united kingdom Arrows
strong problem-solving skills and attention to detail Great communication and a collaborative mindset Bonus Points: Experience with Node.js or frontend technologies like React Familiarity with Grafana , TeamCity , or observability tooling Interest or experience in financial services , compliance, or digital banking If you're excited to work in a fast-paced environment where engineers lead innovation - we’d love to More ❯
slough, south east england, united kingdom Damia Group
team development Unblock delivery challenges with pragmatic technical decision-making Champion engineering best practices, including TDD, CI/CD, code reviews, pair programming, and GitOps Promote sustainable approaches to observability, SRE, and DevSecOps in production systems Act as a trusted point of contact for both internal and client-facing technical discussions Key Skills & Experience Proficiency in Java, Python, Go, JavaScript More ❯
slough, south east england, united kingdom Inferity AI
reasoning Design scalable systems and APIs for data-intensive, low-latency workflows Optimize database interactions for high-speed visual search and reasoning Manage cloud infrastructure (compute, networking, storage, security, observability) Set up CI/CD pipelines for fast, reliable deployments Deploy and integrate AI/ML models into production pipelines Take a product-first approach: asking “why,” not just “how More ❯
slough, south east england, united kingdom Experis UK
BDD approaches (e.g., Cucumber, Gherkin) for test automation Containerisation & Microservices Container Technologies: Practical understanding of Docker or equivalent solutions Microservice Patterns: Experience architecting microservice-based systems with built-in observability and security Cloud Services & Environments Cloud Providers: Demonstrable experience with AWS or Azure Security & Configuration: Ability to build, configure, and secure cloud environments effectively Security & CI/CD Security Integration More ❯
slough, south east england, united kingdom Hybrid / WFH Options rmg digital
with cloud migrations or large-scale infrastructure modernisation projects Proficiency in at least one major cloud platform ( AWS , Azure , or GCP ) Experience with automation, CI/CD, and infrastructure observability Scripting experience in Python Excellent communication skills and a collaborative, delivery-focused mindset Contract Details 📅 Start Date: ASAP 💰 Day Rate: £500+ per day (depending on experience) ⏳ Duration: Initial 6 months More ❯
slough, south east england, united kingdom NETbuilder
with DevOps teams to integrate Elastic into CI/CD, automation, and cloud environments. Manage client expectations and ensure effective stakeholder communication. Stay up to date with Elastic and observability best practices. Tech Skills: Extensive hands-on experience with the Elastic Stack (Elasticsearch, Kibana, Logstash, Beats, etc.) . Familiarity with DevOps practices and tools (CI/CD, automation, infrastructure-as More ❯
slough, south east england, united kingdom algo1
CD best practices, and cloud platforms (AWS, GCP, or Azure). Experience with relational databases and data processing and query engines (Spark, Trino, or similar). Familiarity with monitoring, observability, and alerting systems for production ML (Prometheus, Grafana, Datadog, or equivalent). Understanding of ML concepts. You don't need to train models, but you should speak the language of More ❯
slough, south east england, united kingdom HelmGuard AI
applications to handle growing user bases and increasing traffic loads. API Design : Strong knowledge of RESTful API design principles and experience building developer-friendly APIs. Operational Excellence Monitoring and Observability for AI Models : Some experience of implementing logging, metrics, and alerting systems to maintain visibility into application performance and health specifically where those services rely upon third party model providers More ❯
slough, south east england, united kingdom Hybrid / WFH Options Arrows
architecture and development of backend services using C#, ASP.NET, .NET Core Automate infrastructure, CI/CD pipelines, and cloud operations (AWS/Azure) Promote engineering best practices, security, and observability Mentor engineers and foster a culture of continuous improvement Contribute to technology direction, including adoption of tools like Go and Python What We’re Looking For Deep expertise in C# More ❯
slough, south east england, united kingdom Hybrid / WFH Options Montash
Build, deploy, and maintain ML models as services, streaming applications, or batch jobs across real-time and offline platforms. Develop scalable model APIs with strong CI/CD and observability practices. Implement model testing, monitoring, and rollback capabilities in production environments. Collaborate with Data Scientists to translate prototypes into reliable, maintainable ML applications. Identify opportunities to develop new ML solutions More ❯
slough, south east england, united kingdom Retelligence
best practices. Engineering Delivery Work collaboratively within an Agile team to plan, estimate, and deliver engineering initiatives on time. Implement and maintain CI/CD pipelines , automated testing, and observability tools. Contribute to continuous improvement across systems, processes, and tooling. Technical Excellence Gain hands-on experience with messaging and event-driven architectures such as Azure Service Bus or Kafka . More ❯
slough, south east england, united kingdom Hybrid / WFH Options DRC Search
to define and deliver an ambitious roadmap Build a high-performance culture based on collaboration, ownership, and continuous improvement Drive modern engineering practices – CI/CD, cloud-native design, observability, and automation Support hiring, mentoring, and career development across the engineering team About You Proven experience leading engineering teams in a SaaS environment Strong background as a hands-on engineer More ❯
slough, south east england, united kingdom Prism Digital
Kubernetes: Workload orchestration and container management CI/CD: GitHub Actions or Azure DevOps pipelines with end-to-end automation Event-Driven Architecture: Kafka or similar messaging systems Monitoring & Observability: Azure Monitor, Open Telemetry, Prometheus etc. Secure-by-Design Practices: Policy as Code, automated validation, compliance controls Nice to Haves Experience in regulated environments (banking, fintech, healthcare) Background in Site More ❯
slough, south east england, united kingdom Retelligence
ensuring solutions are scalable, secure, and maintainable . Translate complex business requirements into actionable engineering plans and achievable milestones. Embed strong engineering practices — CI/CD pipelines, automated testing, observability, and version control discipline . Technology Strategy & Innovation Champion the adoption of modern technologies and architectural patterns, including Azure cloud-native services , messaging infrastructures (e.g. Service Bus, Kafka), and microservices More ❯
slough, south east england, united kingdom Hybrid / WFH Options mkodo
legal, product owner, designer) Leads projects and features to good outcomes, ensuring appropriate engineering decisions are made to factor in technical debt, systems design, stability/reliability, monitoring/ observability and business need. Hands-On Guidance Contribute to key backend systems when your expertise is needed. Review and refine critical code, ensuring alignment with architectural goals and best practices. Provide More ❯
slough, south east england, united kingdom Duffel
us to silently drop spans. An enthusiasm for both software development and systems engineering. A high bar for code and configuration quality and readability. A good understanding of current observability and reliability practices. Experienced and comfortable in running incident response. Big picture thinking - you can make trade offs on technical work streams against business impact. Fantastic communication skills. You're More ❯
slough, south east england, united kingdom Hybrid / WFH Options RED Global
objectives. Proven experience as a technical leader within SRE, DevOps, or large-scale engineering environments. Hands-on expertise in cloud infrastructure (e.g., AWS, Azure), containerisation (Kubernetes, Docker), and modern observability stacks. Strong track record of driving developer productivity improvements through tooling, automation, and process refinement. Deep understanding of reliability engineering principles, including SLIs, SLOs, and error budgets. Excellent communication and More ❯
slough, south east england, united kingdom Motive Group
lead the next phase of platform maturity. This is your opportunity to: 🔧 Build and scale high-performance, secure, cloud-native infrastructure 📦 Lead platform architecture, CI/CD, DevOps tooling, observability, and security 🌍 Impact product lines used across 9+ international markets 💥 Own platform reliability, developer experience, and operational excellence 🧠 Drive a forward-thinking engineering culture focused on velocity and resilience This More ❯
slough, south east england, united kingdom Hybrid / WFH Options Arrows
Kafka CI/CD pipelines with fully automated deployments and testing TDD, BDD, and modern coding standards Microservices architecture – understanding both its power and its trade-offs Scaling, reliability, observability, and all things non-functional 🧠 We’re Looking For Someone Who: Has worked on OTT (Over-the-top) technologies Knows how to implement lean/agile practices like Scrum, Kanban More ❯
slough, south east england, united kingdom algo1
and semi-structured data across storage layers Integrate AI-driven personalisation and real-time insights into user flows Contribute to overall system design: service boundaries, data ownership, scaling, and observability Essential Qualifications: Built modern full-stack applications w/focus on backend (Python/Java/Go) Worked with a variety of storage technologies (eg. Postgres, Mongo, Redis, Object) Experience More ❯
slough, south east england, united kingdom algo1
and semi-structured data across storage layers Integrate AI-driven personalisation and real-time insights into user flows Contribute to overall system design: service boundaries, data ownership, scaling, and observability Essential Qualifications: Built modern full-stack applications (Python/Java/Go + React) Worked with a variety of storage technologies (eg. Postgres, Mongo, Redis, Object) Experience with async and More ❯
slough, south east england, united kingdom WALT Labs
Google Cloud Platform (GCP) services. Familiarity with incident.io for incident tracking and management (of equivalent) Proficiency in using JIRA for task management and support workflows. Strong experience working with observability tools (Grafana) Strong troubleshooting and problem-solving skills in cloud environments. Understanding of cloud security and performance optimisation best practices. Knowledge of scripting or automation tools (e.g., Python, Terraform) is More ❯
|
|