Observability Job Vacancies

226 to 250 of 485 Observability Jobs

Staff Software Engineer

London, United Kingdom
Optimizely
will: Design and evolve the architecture of highly scalable, reliable, and secure distributed systems. Drive technical excellence across the engineering organization by setting standards for code quality, system design, observability, and operational best practices. Collaborate closely with Product, UX, and Application Engineering teams to deliver impactful features while ensuring architectural soundness and scalability. Mentor and guide senior and mid-level More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Service Operations Manager

London, United Kingdom
Saab UK
concerns and driving service excellence. Communicate effectively with internal and external stakeholders, providing insights and updates on service health and operational performance. Continuous Improvement Lead initiatives to increase automation, observability, and operational resilience. Stay abreast of industry trends, emerging technologies, and best practices, fostering a culture of continuous learning within the team. Requirements Proven experience in IT Service Operations, ideally More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Service Operations Manager

London, South East, England, United Kingdom
Saab UK
concerns and driving service excellence. Communicate effectively with internal and external stakeholders, providing insights and updates on service health and operational performance. Continuous Improvement Lead initiatives to increase automation, observability, and operational resilience. Stay abreast of industry trends, emerging technologies, and best practices, fostering a culture of continuous learning within the team. Requirements Proven experience in IT Service Operations, ideally More ❯
Employment Type: Full-Time
Salary: Salary negotiable
Posted:

Senior Director, Technical Product Management

United Kingdom
Smarsh
An Engineer's Product Leader: Your technical credibility is non-negotiable. You have a deep, hands-on command of the modern cloud-native landscape (Kubernetes, AWS, CI/CD, Observability) and a background in software engineering or architecture. You don't just talk the talk; you can hold your own in complex architectural debates, gain the respect of top-tier More ❯
Posted:

Site Reliability Engineer - London

London, United Kingdom
Hybrid / WFH Options
Valarian Technologies Limited
you thrive in a fast-paced environment where you can make a real difference, we want to hear from you! Required skills/expertise: Develop and implement a comprehensive observability strategy for self-hosted deployments, including infrastructure and tooling for monitoring, alerting, and troubleshooting. This will involve designing and implementing robust metrics and logging systems. Engineer the ACRA platform for More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Senior Staff, Software Engineer

Ireland
Hybrid / WFH Options
Fanatics Inc
require both strategic foresight and technical precision. Set engineering standards by developing modular, performant, and maintainable code that leads by example. Own the full product lifecycle-including design, deployment, observability, and long-term maintenance-ensuring platform reliability at scale. Collaborate cross-functionally with Product, Quant, and Engineering leadership to align technical execution with business goals. Apply advanced software design methodologies More ❯
Employment Type: Permanent
Salary: EUR 125,000 - 150,000 Annual
Posted:

Intergration Engineer

Edinburgh, Midlothian, United Kingdom
Hybrid / WFH Options
Aberdeen
Implement automated deployment and testing of integration components using Azure DevOps or GitHub Actions. Contribute to Infrastructure as Code (IaC) practices using Bicep or Terraform. Set up and maintain observability for integration components using Azure Monitor, Application Insights, and Log Analytics. Support incident response and root cause analysis for integration-related issues. Apply security best practices across integration solutions, including More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Platform Engineer

London, United Kingdom
Hybrid / WFH Options
Ebury
Contribute to the design and implementation of new systems and services, meeting reliability and scalability standards. Develop and maintain infrastructure and application monitoring, incident management, and troubleshooting procedures. Utilize observability tools to gain insights into system performance and health, guiding improvement decisions. Design and implement automation tools and processes to boost efficiency and minimize downtime. Participate in on-call rotation More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Senior Software Engineer -Llama Stack Operator- (Ireland)

Ireland
Hybrid / WFH Options
Red Hat
E2E) test cases and automation Ability to quickly learn and use new tools and technologies. The following will be considered a plus: Experience with Security (FIPS, FedRAMP, CVE Management), Observability, Performance or Scale Understanding of DevOps methodology, scrum, and/or Jira. Experience with AI and Machine Learning platforms, tools, and frameworks, such as LlamaStack, LangChain, PyTorch, LLaMA.cpp, vLLM, LangGraph More ❯
Employment Type: Permanent
Salary: EUR 125,000 - 150,000 Annual
Posted:

Machine Learning Engineer, Senior Manager

united states
Hybrid / WFH Options
Credit Acceptance
the domain(s) and the business. Requirements: Hands-on expertise in scaling and maintaining production-grade ML services, with a strong focus on ML/LLM Operations (versioning, automation, observability, automated training and monitoring, etc.) and ability to balance ML model complexity with production requirements Passion for identifying new business opportunities and experience of using a test and learn approach More ❯
Posted:

Site Reliability Engineer Manager

Manchester, Lancashire, England, United Kingdom
FDM Group
contributor to the stability, performance, and scalability of services, supporting the organisations digital transformation and long-term technology vision. You’ll work actively with container platforms, VMware infrastructure, and observability tooling, ensuring their services are resilient and efficient. You’ll also lead and participate in post-mortems, drive automation, and continuously improve the platform through engineering-led solutions. This role … of platform technologies, including VMware infrastructure, container platforms and orchestration (e.g., Kubernetes, OpenShift), databases, and applications Manage environments and support CI/CD pipelines using Infrastructure as Code Improve observability using tools such as Dynatrace, ensuring proactive monitoring and alerting Lead and contribute to post-mortems to identify and implement long-term fixes aligning with organisations long term objectives Troubleshoot … Code and CI/CD Experience with container platforms and orchestration such as Docker, Kubernetes and OpenShift Hands-on experience with VMware technologies in a production environment Familiarity with observability platforms, such as Dynatrace and experience with either Linux or Windows operating systems Proven ability to troubleshoot across a broad range of platform technologies A mindset focused on continuous improvement More ❯
Employment Type: Contractor
Rate: £50,000 - £70,000 per annum
Posted:

Manager II, Software Engineering

Dublin, Ireland
Kaseya Limited
data science teams to deliver AI-enhanced features and intelligent automation. Guide the integration of AI/ML into both engineering workflows and customer-facing capabilities. Establish and evolve observability practices including structured logging, distributed tracing, and real-time alerting. Promote a culture of automation across testing, deployment, infrastructure, and compliance. Partner with QA and DevOps to implement shift-left … CI/CD pipelines, and infrastructure as code (IaC). Demonstrated experience with AI/ML technologies and their practical application in product development or engineering efficiency. Familiarity with observability stacks and SRE practices. Proficiency in TDD, BDD, and integrating quality gates into the development lifecycle. Extensive experience with multi-tenant SaaS architectures and managing performance at scale. Experience with … on multiple concurrent initiatives. Ability to balance technical depth with strategic thinking and business alignment. Tools Development & Deployment:GitHub, Docker, Kubernetes AI/ML:Azure AI, OpenAI, and similar Observability:Dynatrace, New Relic, Grafana, or similar QA & Testing:Selenium, Playwright, Postman, Cucumber, or similar Automation & IaC:Terraform, Ansible, Bicep, or similar Incident Management:PagerDuty, Opsgenie, or similar Security & Compliance:Snyk More ❯
Employment Type: Permanent
Salary: EUR 150,000 - 200,000 Annual
Posted:

Senior DevOps Platform Engineer

London, United Kingdom
CDW LLC
including Salesforce-specific pipelines. Build and maintain Infrastructure as Code (IaC) using Terraform and Ansible. Design highly reliable, scalable, and secure infrastructure supporting performance-critical workloads. Build proactive monitoring, observability, and alerting with Prometheus, Grafana, Azure Monitor, DataDog, and Dynatrace. Troubleshoot complex system issues spanning applications, networks, and infrastructure. Define platform SLAs, SLOs, and governance standards for self-service use. … Infrastructure as Code with Terraform and Ansible, along with scripting in PowerShell, Python, or Bash Experience implementing GitOps workflows and managing platform SLAs, SLOs, and governance standards Familiarity with observability and monitoring tools including Prometheus, Grafana, Azure Monitor, DataDog, or Dynatrace Preferred experience supporting Salesforce DevOps pipelines and working with Java, .NET, or Node.js application environments Exposure to AI/ More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Senior Developer

Warrington, Cheshire, United Kingdom
Hybrid / WFH Options
ECS Resource Group Ltd
transition from project-based delivery to product-focused development. Embed disciplined code promotion processes and improve CI/CD practices. Drive improvements in code quality and maintainability. Enhance application observability, including logging and monitoring. Provide technical guidance and advocate for best development practices. Essential Skills: Strong knowledge of JavaScript & TypeScript . Experience with Next.js and Node.js . Familiarity with Git More ❯
Employment Type: Contract
Rate: £495 - £500/day inside ir35
Posted:

DATA SOLUTION ARCHITECT

Baltimore, Maryland, United States
US Main
data management. • Strong understanding of database design, performance tuning, data governance, and security best practices. • Proficiency in data modeling, ETL processes, and data integration techniques. • Experience with monitoring and observability tools like Splunk, Datadog, or New Relic. • Knowledge of cloud computing platforms and containerization technologies (e.g., AWS, Azure, Kubernetes). Specialized Experience: • At least five (5) years of the required More ❯
Employment Type: Permanent
Salary: USD Annual
Posted:

AI/ML Engineer

London, United Kingdom
Hiring Group
secure handling of sensitive operational data and compliance with relevant standards Developed and maintained robust APIs for system integration Drove operational excellence and continuous improvement Implemented and managed monitoring, observability, and troubleshooting tools for deployed systems Designed and handled containerised applications (e.g., Docker, Kubernetes) Qualifications Bachelor's degree in Computer Science, Engineering, or a related technical field Relevant experience as More ❯
Employment Type: Permanent
Salary: £50000 - £100000/annum
Posted:

Founding Full Stack Engineer

Nationwide, United Kingdom
Hybrid / WFH Options
W Talent
PostgreSQL Architect scalable document processing pipelines for large datasets Build AI-native user experiences and intelligent agent workflows using state-of-the-art LLMs Improve system performance, stability, and observability Deploy to Azure using infrastructure-as-code (Bicep) and CI/CD via GitHub Actions Collaborate directly with users to deeply understand workflows and pain points Influence engineering best practices More ❯
Employment Type: Permanent
Salary: £80000 - £130000/annum
Posted:

Senior Software Engineer, iCloud Platform

London, United Kingdom
Apple Inc
innovation cycles. You will have the opportunity to take ambiguity and refine it into valuable outcomes, taking risks where justified by the reward.You will understand how CI/CD, observability, and SLOs form part of a mature product offering and push for best practices. Use your insight to prevent production issues before they happen. When issues do occur you will More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Senior Software Engineer - C++ & Python

Bristol, Avon, South West, United Kingdom
Connexa
leaks, and performance bottlenecks Turn research prototypes into robust, production-ready software modules Lead architecture discussions and enforce clean, scalable design patterns Drive engineering standards across CI/CD, observability, and system modularisation Mentor developers through code reviews, pair programming, and design walkthroughs Bridge the gap between research and deployable robotics software-across embedded and cloud platforms What we're More ❯
Employment Type: Permanent
Posted:

Graduate Platform Engineer

London, United Kingdom
BAE Systems (New)
technologies: Logical reasoning, scripting ability, security concepts (light) Infrastructure as Code (Terraform) AWS infrastructure (VPC, EC2, IAM) Linux tooling and system admin CI/CD pipelines from infra perspective Observability, logging, monitoring GitOps, container orchestration (K8s) Benefits As well as a competitive pension scheme, BAE Systems also offers employee share plans, an extensive range of flexible discounted health, wellbeing & lifestyle More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Principal ML Engineer

London, South East, England, United Kingdom
Hybrid / WFH Options
Method Resourcing
teams to operationalize models and ship ML-powered features into production. Continuously assess and iterate on production models, balancing long-term ML strategy with tactical improvements. Champion code quality, observability, and resilience within their ML systems through reviews and hands-on contributions. Help shape their internal ML standards and practices, ensuring they stay ahead of industry advancements. Offer technical mentorship More ❯
Employment Type: Full-Time
Salary: £150,000 - £180,000 per annum
Posted:

Snowflake Centre of Excellence Lead

London, United Kingdom
Hybrid / WFH Options
Kubrick
colleagues and clients across the Snowflake ecosystemExperience in design and delivering business solutions on other modern data platforms (e.g. Databricks, Azure, AWS or GCP native stacks)Experience with platform observability and CI/CD for data platformsHands-on experience with modern data engineering tools such as dbt, Fivetran, Matillion or AirflowHistory of supporting pre-sales activities in a product or More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Low Latency Network Engineer

London, United Kingdom
Millennium Management LLC
optimization, anomaly detection, and predictive analytics. Understanding of AI frameworks and libraries (e.g., TensorFlow, PyTorch, Scikit-learn) and their application in network automation and monitoring. Experience with telemetry and observability frameworks (e.g., Prometheus, Grafana) for real-time network monitoring and troubleshooting. Experience : Minimum of 7 years' of experience in network engineering, operations, and support. Proven ability to work hands-on More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

System Engineer (Annapolis Junction, MD) with Security Clearance

Annapolis Junction, Maryland, United States
Set of X
Python Experience with IaC principles and automation tools such as Ansible, Puppet and SaltStack General HPC technical knowledge regarding compute, network, memory, and storage components Experience with monitoring and observability tools such as Grafana Clearance: TS/SCI clearance with polygraph is required. Total Compensation Package We offer a comprehensive compensation package designed to support your well-being and professional More ❯
Employment Type: Permanent
Salary: USD Annual
Posted:

Senior Linux Systems Engineer (Kernel)

New York, United States
Bloomberg
back to the open-source community; it is a rewarding experience you can explore with us. We'll expect you to: Build and evolve eBPF-based tools to enhance observability of the network and other operating system layers Improve Bloomberg's internal Linux kernel regression testing framework Contribute to upstream Linux kernel development and enhancement requests Investigate and resolve complex More ❯
Employment Type: Permanent
Salary: USD Annual
Posted:
Observability
10th Percentile
£57,500
25th Percentile
£67,500
Median
£80,000
75th Percentile
£99,875
90th Percentile
£126,625