Observability Job Vacancies

276 to 300 of 2,262 Observability Jobs

Senior IaC Software Engineer

Basingstoke, England, United Kingdom
Hybrid / WFH Options
JR United Kingdom
native Infrastructure-as-Code (IaC) solutions from the ground up? Our client is seeking a talented and motivated Senior Software Engineer to lead the development of our next-generation observability platform. THIS IS NOT A DEVOPS ROLE. Responsibilities Collaborate within a dynamic software engineering team to architect and build a new cloud-native IaC platform. Develop software using technologies such More ❯
Posted:

Senior IaC Software Engineer

Hull, England, United Kingdom
Hybrid / WFH Options
JR United Kingdom
native Infrastructure-as-Code (IaC) solutions from the ground up? Our client is seeking a talented and motivated Senior Software Engineer to lead the development of our next-generation observability platform. THIS IS NOT A DEVOPS ROLE. Responsibilities Collaborate within a dynamic software engineering team to architect and build a new cloud-native IaC platform. Develop software using technologies such More ❯
Posted:

Site Reliability Engineer

Bristol, Gloucestershire, United Kingdom
Hybrid / WFH Options
Twinstream Limited
Socials & Events Cycle to Work Scheme & Life Assurance Key Responsibilities of the Site Reliability Engineer: Work closely with engineers and sysadmins to increase performance and reduce toil Advance system observability, monitoring and alerting Automate, troubleshoot, and proactively resolve issues before they escalate Improve development environments to meet delivery and quality targets Research and evaluate tools and platforms to support scale More ❯
Employment Type: Permanent
Salary: GBP 80,000 - 110,000 Annual
Posted:

Site Reliability Engineer

BS1, Bristol, City of Bristol, United Kingdom
Hybrid / WFH Options
Twinstream Limited
Socials & Events Cycle to Work Scheme & Life Assurance Key Responsibilities of the Site Reliability Engineer: Work closely with engineers and sysadmins to increase performance and reduce toil Advance system observability, monitoring and alerting Automate, troubleshoot, and proactively resolve issues before they escalate Improve development environments to meet delivery and quality targets Research and evaluate tools and platforms to support scale More ❯
Employment Type: Permanent
Salary: £80000 - £110000/annum Hybrid, Great Benefits
Posted:

Site Reliability Engineer

Bristol, Avon, South West, United Kingdom
Hybrid / WFH Options
Twinstream Limited
Socials & Events Cycle to Work Scheme & Life Assurance Key Responsibilities of the Site Reliability Engineer: Work closely with engineers and sysadmins to increase performance and reduce toil Advance system observability, monitoring and alerting Automate, troubleshoot, and proactively resolve issues before they escalate Improve development environments to meet delivery and quality targets Research and evaluate tools and platforms to support scale More ❯
Employment Type: Permanent, Work From Home
Posted:

Solutions Architect [UAE Based]

City of London, London, United Kingdom
AI71
multi-tenant SaaS or large enterprise application. Certifications: AWS Certified Solutions Architect, Google Professional Cloud Architect, Azure Solutions Architect Expert. Experience in data architecture, AI/ML integration, and observability frameworks . More ❯
Posted:

Solutions Architect [UAE Based]

London Area, United Kingdom
AI71
multi-tenant SaaS or large enterprise application. Certifications: AWS Certified Solutions Architect, Google Professional Cloud Architect, Azure Solutions Architect Expert. Experience in data architecture, AI/ML integration, and observability frameworks . More ❯
Posted:

Manager, SRE

London, England, United Kingdom
GroupM
some experience in a leadership or managerial position. Strong knowledge of cloud platforms (AWS, GCP, Azure) and modern infrastructure technologies (Kubernetes, Docker, Terraform). Expertise in monitoring, logging, and observability tools (e.g., Prometheus, Grafana, Datadog, Splunk). Proficiency in at least one programming or scripting language (e.g., Python, Go, Bash). Deep understanding of networking, databases, and distributed systems. Strong More ❯
Posted:

Senior DevOps Engineer

Belfast, United Kingdom
Menlo Ventures
Terraform). Experience in software development in general, with skills in a high-level language (e.g., Python, JavaScript, TypeScript, Java) and familiarity with modern development practices Understanding of Cloud Observability, Monitoring, and Tracing tools (Datadog, CloudWatch, Jaeger, ELK) and how best to leverage to support effective MTTR and mitigate high CFR Our UK benefits: Stock Options Annual Performance Bonus or More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Lead Site Reliability Engineer

Glasgow, Scotland, United Kingdom
J.P. MORGAN-1
such as Python, Java Spring Boot, Unix Shell. Deep knowledge of software applications and technical processes with emerging depth in one or more technical disciplines Proficiency and experience in observability such as white and black box monitoring, SLO alerting, and telemetry collection using tools such as Grafana, Geneos, Dynatrace, Prometheus, Datadog, Splunk, etc. Proficiency in continuous integration and continuous delivery More ❯
Posted:

Senior Technical Delivery Manager

London Area, United Kingdom
Ownera
optimize data flow, connectivity, and interoperability Help to implement best practices and process improvements to enhance delivery efficiency and team performance Work with various internal teams to continuously improve observability and supportability capabilities of the company platform Key Requirements A highly motivated, technical and detail-oriented support engineer, able to work autonomously with minimal direction, passionate about learning new things More ❯
Posted:

Senior Technical Delivery Manager

City of London, London, United Kingdom
Ownera
optimize data flow, connectivity, and interoperability Help to implement best practices and process improvements to enhance delivery efficiency and team performance Work with various internal teams to continuously improve observability and supportability capabilities of the company platform Key Requirements A highly motivated, technical and detail-oriented support engineer, able to work autonomously with minimal direction, passionate about learning new things More ❯
Posted:

Senior AI Engineer

London, England, United Kingdom
Hybrid / WFH Options
Ten Lifestyle Group
cost optimisation). Experience with cloud platforms (AWS, GCP, Azure) and infrastructure-as-code (Terraform). Familiarity and hands-on with DevOps practices (CI/CD, Docker, K8s) and observability tools (Prometheus, Grafana, Datadog). Experience in distributed systems and scaling. Knowledge and hands-on experience with multiple data stores (both SQL and NoSQL). Desired experience in building agentic More ❯
Posted:

Azure - Practice Architect

England, United Kingdom
TEKsystems
Enterprise level Cloud & DevOps standards and best practices in the areas of cloud infrastructure, infrastructure as code, and DevOps toolchain Provide thought leadership, design and implementation roadmap for Continuous Observability platform to maintain overall services & infrastructure health along with automated remediation and disaster recovery failovers capabilities Team Leadership: Leadership & Mentoring: As an Azure Sr Architect, you will be responsible to More ❯
Posted:

Senior Site Reliability Engineer

Glasgow, Scotland, United Kingdom
Morgan Stanley
an on-call rota Working with the Vault squad & wider Technology stakeholders, you will support the continuous improvement of our services through the development of automation scripting and effective observability solutions. Enforce adherence to architectural standards/principles, global product-specific guidelines, usability design standards, etc. Working with Technology teams, you will onboard new tools to support business needs and More ❯
Posted:

Software Engineer

Leeds, Yorkshire, United Kingdom
Lloyds Banking Group
or GCP): Migration and operation of cloud environments, including compute and storage scalability Containerisation & Virtualisation: Familiarity with virtual and physical server provisioning, especially in strategic data centres Platform Resilience & Observability: Designing for uptime, performance, and root cause analysis. Web Services & APIs: Used for Integration with 24+ LBGI systems Batch Processing: Understanding of batch suite performance and scheduling constraints RPA & Automation More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

DevOps Engineer Infrastructure - GammaLabs

United Kingdom
Hybrid / WFH Options
Gamma Communications plc
position will align to a discipline where you will be expected to build and support solutions aligned with SDLC principles, providing technical excellence with a focus on scripting and observability coupled with a security mindset. What will you be doing day-to-day? Automation and Orchestration: Streamline the delivery and support processes by leveraging automation and IaC principles. Support and More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Senior Technical Delivery Manager

London, England, United Kingdom
Ownera
optimize data flow, connectivity, and interoperability Help to implement best practices and process improvements to enhance delivery efficiency and team performance Work with various internal teams to continuously improve observability and supportability capabilities of the company platform Key Requirements A highly motivated, technical and detail-oriented support engineer, able to work autonomously with minimal direction, passionate about learning new things More ❯
Posted:

Principal GenAI Infrastructure Engineer

Cardiff, Wales, United Kingdom
Hybrid / WFH Options
ZipRecruiter
/CD pipelines using GitHub Actions, AWS CodePipeline, Jenkins, and other tools, with an emphasis on reliability, reusability, and performance. Contribute to the design and integration of monitoring and observability solutions (CloudWatch, Prometheus, Grafana) to ensure infrastructure and model health. Champion software engineering excellence through Test-Driven Development (TDD), rigorous test automation, and continuous quality assurance practices. Support architectural decisions More ❯
Posted:

Cloud Platform Engineer (DV Security Clearance)

London
CGI
a bias for Infrastructure (Python, Go, C#) • IAM Policy and Authentication/Authorization schemes • Web Services and REST API • Databases and Storage Systems • Development Build, Test, and Deployment Pipelines • Observability and Monitoring (Open Telemetry, TIG and ELK stacks) #LI-JS2 Together, as owners, let's turn meaningful insights into action. Life at CGI is rooted in ownership, teamwork, respect and More ❯
Employment Type: Permanent
Posted:

Cloud Engineer/Architect (DevOps)

London, England, United Kingdom
ION
of reliability and automation. Help the team provide guidelines and blueprints on the DevOps lifecycle of applications. Maintain our internal tooling and automation to improve the reliability, scalability, and observability of our services. Proactively identify and solve issues across the whole stack, together with the rest of the infrastructure and engineering teams. Contribute to raising awareness in the security and More ❯
Posted:

Principal Network Engineer - London Stock Exchange Group

London, England, United Kingdom
Jobs via eFinancialCareers
and the ability to balance multiple priorities in a fast-paced environment. DESIRED SKILLS AND EXPERIENCE: Financial Services experience Real-time and low latency market data experience Service orchestration, observability and monitoring platform experience Solid understanding of a Programming Language (preferably Python) Agile tools (Jira, GIT among other DevOps principles) AWS, Azure, GCP Certification LSEG is a leading global financial More ❯
Posted:

Junior Platform Engineer

Belfast, United Kingdom
Proofpoint
adaptability and a commitment to continuous learning. Understanding of Continuous Integration and Continuous Deployment (CI/CD) principles and their role in efficient software development. Awareness of monitoring and observability practices to support system reliability and performance. A proactive mindset, eagerness to learn, and a collaborative approach to solving engineering challenges. Why Proofpoint Protecting people is at the heart of More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Engineering Tech Lead - Distributed Systems

London, England, United Kingdom
Hybrid / WFH Options
Smarkets
designing, developing, and implementing distributed systems Can demonstrate deep knowledge in running services in cloud microservice environments and hands-on experience with Kubernetes Familiarity with AWS cloud Familiarity with observability principles and tools (Grafana, Prometheus, Sentry Elasticsearch, Jaeger) Excellent planning and communications skills and able to lead conversations with development and product teams Preferred Skills and Experience 6-8+ More ❯
Posted:

Head of Data Engineering & Analytics

London, England, United Kingdom
ZipRecruiter
transactional operations and columnar formats for efficient large-scale analytical querying. Support DevOps practices including CI/CD, infrastructure-as-code, automated testing, release and version control and system observability for data pipelines. Establish metrics and KPIs and identify and deploy tools to measure data pipeline health, data quality, timeliness and accuracy, team performance, cost-effectiveness, and business impact. Actively More ❯
Posted:
Observability
10th Percentile
£57,500
25th Percentile
£65,000
Median
£80,000
75th Percentile
£97,500
90th Percentile
£120,000