Permanent Observability Job Vacancies

276 to 300 of 734 Permanent Observability Jobs

Data Engineer

London, United Kingdom, Vauxhall
Citywire
often, and embrace hands-on problem-solving; maturing projects as they become foundational parts of the company's infrastructure, whether that means writing resilient, test-driven code, designing for observability, or building systems that can scale and recover gracefully. You’ll have the space to experiment and the responsibility to stabilise when it counts. You’ll work across AWS and … and CI/CD pipelines in a cloud-native environment. -Database Familiarity: Skilled in both SQL and NoSQL (PostgreSQL, DynamoDB, OpenSearch, or equivalents), using ORMs like Django or SQLAlchemy. -Observability & Monitoring: Comfortable using tools like CloudWatch, X-Ray, and structured logging to keep systems running smoothly. -Mindset: Curious, Collaborative, and Proactive - you enjoy solving problems hands-on and aren’t More ❯
Employment Type: Permanent
Posted:

Site Reliability Engineer

Watford, Hertfordshire, United Kingdom
Hybrid / WFH Options
Wickes
Responsibilities: You'll define and manage Service Level Objectives (SLOs) and Service Level Indicators (SLIs), driving initiatives to enhance reliability, performance, and scalability. You will design, implement, and manage observability solutions, including monitoring, logging, and tracing, with strong expertise in Datadog for proactive dashboards and alerts. Automate manual operational tasks to reduce toil and improve system resilience. Collaboration is key … and best practices for building resilient architectures. Strong experience with Infrastructure as Code (IaC), particularly with Terraform and Ansible. Experience with Packer is also required. Proven experience with modern observability stacks, with specific expertise in Datadog. Proficient in using JIRA and Confluence. Solid understanding of CI/CD pipelines and their role in maintaining a stable production environment. What's More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Cloud Platform Data Engineer

England, United Kingdom
BMC Software, Inc
scale event-driven workflows using EventBridge and Lambda. Work with DynamoDB for fast, scalable key-value storage. Develop and maintain Java Spring Boot microservices deployed on EC2 instances. Ensure observability, monitoring, and fault-tolerance across the system. Collaborate with DevOps, Data Engineering, and Product teams to design scalable, cost-effective cloud solutions. Maintain security best practices in a cloud-native … performance tuning, and cost-optimization in cloud environments with Kafka for data streaming. Familiarity with CI/CD and infrastructure-as-code tools (e.g., Terraform, CloudFormation). Experience with observability tools (e.g., CloudWatch, OpenTelemetry). Experience working in a global enterprise software company. Our commitment to you! BMC's culture is built around its people. We have 6000+ brilliant minds More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Engineering Excellence Lead

London, United Kingdom
Hybrid / WFH Options
Trili
Collaborate with People/HR and engineering leadership on career pathing, training, and coaching for engineering staff. Technology Enablement: Evaluate and deploy tools - especially AI - that support engineering productivity, observability, and collaboration. Work closely with DevOps, QA, and SRE teams to align infrastructure and operational excellence with engineering needs. Own key vendor relationships, evaluation of partnerships and represent technology on … scaling engineering orgs across multiple geographies or domains (e.g., front-end, back-end, infrastructure). Familiarity with tools like Linear, Asana, GitHub, Datadog, DORA metrics, or similar performance/observability platforms. Background in organisational change management or engineering program management. What you can expect from us Competitive salary with substantial incentive schemes Generous long-term incentive plan (LTIP) tez token More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Restaurant Technology Problem Manager

London, United Kingdom
Hybrid / WFH Options
McDonald's Corporation
as follows: Own ITIL Problem & Change Management Take ownership of ITIL Problem Management activities, proactively identifying, addressing and fixing root causes of incidents and recurring issues within the system. Observability lead, promoting stability across the estate by collaborating with cross-functional teams to implement preventive measures. Actively take part in ITIL Change Management processes, ensuring that changes to the system … efficiently. Experience in implementing changes while following ITIL change management processes. Understanding of basic security principles and best practices for securing infrastructure. Optional but advantageous technical skills: Proficient using observability tools (NewRelic and Thousand Eyes), BI platform and data visualisation tools (such as Tableau and Power BI) and technology tools (Jira, Confluence). System Administration: Proficiency in Linux/Unix More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Senior Site Reliability Engineer

Edinburgh, United Kingdom
Hybrid / WFH Options
Dayshape
improve management of our cloud estate in Azure Improve our ability to scale the BAU team through automation of manual tasks, and delivery of self-service tooling. Improve the observability of Dayshape through reporting and tool creation Drive improvements to our Cost of Giving Service (COGS) Diagnose, triage, investigate and manage escalated issues Drive improvements to our incident management and … within an established SRE team and be able to talk through the key principles of managing a large infrastructure estate. Monitoring infrastructure and applications hosted using taking into consideration: Observability, Alerting, Uptime SLA's and SLO's Azure Devops pipeline management. Strong collaboration with both engineering teams and colleagues in customer-facing teams. Excellent communicator both in written and verbal More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Elasticsearch Platform Engineer

City of London, London, United Kingdom
Hybrid / WFH Options
Tec Partners
focus on security, resilience, and continuous improvement. Key Responsibilities: Manage and maintain Elastic Cloud Enterprise (ECE) environments, ensuring high availability and performance. Design and deploy scalable Elasticsearch solutions for Observability and Search use cases. Implement robust security, privacy, and compliance controls across Elasticsearch systems. Optimise system configurations and queries to enhance performance and reduce latency. Collaborate with cross-functional teams More ❯
Employment Type: Permanent
Salary: £77000 - £116000/annum
Posted:

Lead Developer

Manchester, North West, United Kingdom
Hybrid / WFH Options
Daniel James Resourcing Ltd
Design and implement robust, scalable, and secure backend services Contribute to strategic technical decisions around architecture and platform direction Embed engineering best practices across code quality, testing, deployment and observability Mentor and support the growth of team members, promoting a culture of continuous learning Tech Stack The role will suit someone who is confident in modern cloud-native development and More ❯
Employment Type: Permanent
Salary: £85,000
Posted:

Senior Data Engineer

London, United Kingdom
Story Terrace Inc
of networking and cloud infrastructure (e.g., AWS, Azure). Experience with modern data processing frameworks (e.g., dbt, Apache Airflow, Spark, or similar). Requirements A strong focus on system observability and data quality. Emphasis on rapid scalability of solutions ( consider market ramp up when entering a new market ) Relentless pursuit of system security. Adaptable mindset - open to using different tools More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Senior Full Stack Engineer

City, London, United Kingdom
Comoro
Fi authentication systems, CRMs and partnered PropTech tools Continually hone and perfect our homegrown DevOps and CI/CD processes by further developing GitHub Actions pipelines, Terraform definitions and observability integrations. Ensure quality & reliability: establish testing best practices (unit, integration, end-to-end), conduct code reviews and demand high quality standards Shape and refine our cloud-native platform to optimise More ❯
Employment Type: Permanent
Salary: GBP 80,000 - 85,000 Annual
Posted:

Senior Full Stack Engineer

Aldgate, Greater London, United Kingdom
Comoro
Fi authentication systems, CRMs and partnered PropTech tools Continually hone and perfect our homegrown DevOps and CI/CD processes by further developing GitHub Actions pipelines, Terraform definitions and observability integrations. Ensure quality & reliability: establish testing best practices (unit, integration, end-to-end), conduct code reviews and demand high quality standards Shape and refine our cloud-native platform to optimise More ❯
Employment Type: Permanent
Salary: £80000 - £85000/annum Plus Bonus and Benefits
Posted:

Senior Customer Support Engineer

United Kingdom
NetBox Labs, Inc
automation community. NetBox Labs is the commercial steward of open source NetBox, the world's most popular network source of truth, and Orb, the next-generation open source network observability platform. Our products include NetBox Enterprise, a fully supported self-managed NetBox with advanced features, and NetBox Cloud, a secure, scalable, and reliable SaaS edition of NetBox. NetBox powers thousands More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Senior Software Engineer

London, United Kingdom
Hybrid / WFH Options
Our Future Health Limited
using modern, agile development practices like code review, TDD, CI/CD and pairing using tools like Git and GitHub. Experience of operationally managing software components once live, including; observability, logging, metrics, error reporting, debugging and live incident management. Experience of working with sensitive personal data. Competitive salary starting from £85,000 Generous Pension Scheme - We invest in your future More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Senior Software Engineer (Core Data Services)

London, United Kingdom
Hybrid / WFH Options
Our Future Health
using modern, agile development practices like code review, TDD, CI/CD and pairing using tools like Git and GitHub. Experience of operationally managing software components once live, including; observability, logging, metrics, error reporting, debugging and live incident management. Experience of working with sensitive personal data. Competitive salary starting from £85,000 Generous Pension Scheme - We invest in your future More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Software Engineer

London, United Kingdom
P2P
of the following a bonus: Java experience Python experience Ruby experience Big data technologies: Spark, Trino, Kafka Financial Markets experience SQL: Postgres, Oracle Cloud-native deployments: AWS, Docker, Kubernetes Observability: Splunk, Prometheus, Grafana For more information about DRW's processing activities and our use of job applicants' data, please view our Privacy Notice at . California residents, please review the More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Senior Staff Software Engineer

London, United Kingdom
JDA Software
across complex systems. Solid knowledge of database systems, data modeling, and query optimization. Experience with Maven artifact deployment, Android XML, and Compose layout systems. Familiarity with monitoring, logging, and observability tools. Experience with performance optimization and security best practices. Understanding of agile development methodologies. History of mentoring junior developers and providing technical leadership. Knowledge of Dagger, Retrofit 2, RxJava, Room More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Software Engineer

Edinburgh, United Kingdom
Hybrid / WFH Options
Aveni UK
execute solutions. Work with AWS cloud-native services (Lambda, Step Functions, DynamoDB) to develop efficient cloud-basedapplications. Ensure CI/CD best practices ,contributing to GitLab pipelines ,automation, and observability improvements. Integrate AI-powered tools (e.g., GitHubCopilot) to enhance development workflows. Drive continuous improvement in performance, security, andmaintainability . Support cross-squad collaboration ,ensuring architectural consistency and code reusability. Requirements More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Site Reliability Engineer II London, England, United Kingdom London, England, United Kingdom

London, United Kingdom
Axon Enterprise
or similar. Experience of code collaboration such as GitHub, ArgoCD, or similar. Experience utilizing CI/CD platforms to automate provisioning infrastructure, software builds, tests, and releases. Experience using observability tools such as APM, logging, and metrics to assist with debugging issues. Experience using Infrastructure as Code tools for provisioning infrastructure such as Terraform, Cloudformation, or similar. Experience designing tooling More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Site Reliability Engineer (DV Security Clearance)

London
CGI
automation scripts, infrastructure as code, creating tooling or frameworks and feature development, ideally using Java and/or python. • Experience of engineering enablement products such as CI/CD, Observability and Alerting • Experience creating designs and documentation, including 'how to user guides' • Experience of investigating and resolving incidents and problems aligned to the SLAs • Continuously seeking opportunities for system performance More ❯
Employment Type: Permanent
Posted:

Senior Backend Engineer (MLOps)

Dundee, Angus, United Kingdom
Optimove
control systems (Git) and CI/CD pipelines. Understanding of database systems and data pipeline architectures. Ability to troubleshoot and optimize complex production systems under load. Experience with monitoring, observability, and performance profiling tools. Strong communication and collaboration skills for working with ML researchers and data scientists. Nice to have: Python experience for interfacing with ML frameworks Experience with real More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Machine Learning Engineering Manager

London, United Kingdom
Hybrid / WFH Options
BGL Group
prototypes into high-quality production systems Platform & Engineering Standards • Contribute to the design and evolution of our internal ML platform and tooling • Champion best practices in CI/CD, observability, reproducibility, and infrastructure-as-code for ML • Ensure all deployed systems meet requirements for resilience, testing, security, and performance • Influence and contribute to shared frameworks, libraries, and deployment pipelines Strategy More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Machine Learning Engineering Manager (London)

London, UK
Hybrid / WFH Options
BGL Group
prototypes into high-quality production systems Platform & Engineering Standards • Contribute to the design and evolution of our internal ML platform and tooling • Champion best practices in CI/CD, observability, reproducibility, and infrastructure-as-code for ML • Ensure all deployed systems meet requirements for resilience, testing, security, and performance • Influence and contribute to shared frameworks, libraries, and deployment pipelines Strategy More ❯
Employment Type: Full-time
Posted:

Senior Site Reliability Engineer - Networking

United Kingdom
Hybrid / WFH Options
Lambda Inc
configuration and deployments Contribute to the implementation and operations of Software Defined Networks Help to deploy and manage Spine and Leaf networks Ensure high availability of our network through observability, failover, and redundancy Ensure clients have predictable networking performance through the use of network engineering and other applicable technologies Help with deploying and maintaining network monitoring and management tools You More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Platform Engineer - Edinburgh / Glasgow

Edinburgh, United Kingdom
WeAreTechWomen
captivating customer experiences. We are looking for experienced DevOps/Platform Engineers to join our vibrant community of Platform Engineering professionals, encompassing knowledge and experience in DevOps, DevSecOps, SRE, Observability, and Internal Developer Platforms/Portals, based at our Scotland offices on a full-time, permanent basis. In our team you will learn: Working within a high performing delivery team More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Staff Engineer (APIs)

Cardiff, South Glamorgan, United Kingdom
Admiral Group Plc
of high-impact integration solutions across services and platforms. Collaborate on reusable API assets such as SDKs, templates, shared schemas, and common middleware. Implement robust error handling, logging, and observability across services and endpoints. Promote automation of API tests, documentation, contract validation, and pipeline integration. Collaboration & Engineering Maturity Act as a subject matter expert for APIs across squads and tribes More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:
Observability
10th Percentile
£57,500
25th Percentile
£65,000
Median
£80,000
75th Percentile
£97,500
90th Percentile
£120,000