Permanent Observability Job Vacancies

376 to 400 of 450 Permanent Observability Jobs

Data Integration Manager

City of London, London, United Kingdom
Intelix.AI
ETL, events, iPaaS). Define data contracts, lineage, and data quality SLAs; implement RBAC and security controls. Lead data migration, cutover, and legacy decommissioning/archival. Stand up observability (SLO/SLI, monitoring, runbooks) and support run operations. Enable visualisation frameworks for decision-making. Partner with Enterprise Architecture to align to target state and standards. Support agentic/AI …

Data Integration Manager

London, South East England, United Kingdom
Intelix.AI
ETL, events, iPaaS). Define data contracts, lineage, and data quality SLAs; implement RBAC and security controls. Lead data migration, cutover, and legacy decommissioning/archival. Stand up observability (SLO/SLI, monitoring, runbooks) and support run operations. Enable visualisation frameworks for decision-making. Partner with Enterprise Architecture to align to target state and standards. Support agentic/AI …

Data Integration Manager

Slough, South East England, United Kingdom
Intelix.AI
ETL, events, iPaaS). Define data contracts, lineage, and data quality SLAs; implement RBAC and security controls. Lead data migration, cutover, and legacy decommissioning/archival. Stand up observability (SLO/SLI, monitoring, runbooks) and support run operations. Enable visualisation frameworks for decision-making. Partner with Enterprise Architecture to align to target state and standards. Support agentic/AI …

Data Integration Manager

London (City of London), South East England, United Kingdom
Intelix.AI
ETL, events, iPaaS). Define data contracts, lineage, and data quality SLAs; implement RBAC and security controls. Lead data migration, cutover, and legacy decommissioning/archival. Stand up observability (SLO/SLI, monitoring, runbooks) and support run operations. Enable visualisation frameworks for decision-making. Partner with Enterprise Architecture to align to target state and standards. Support agentic/AI …

ETL Data Quality Developer with Alation, BigEye Exp.

San Antonio, Texas, United States
Robotics technology LLC
Ensuring Data Quality: They use Bigeye's data observability features to monitor data quality and identify potential issues. Understanding Data Relationships: They use Alation's data catalog to understand the relationships between different data assets. Data Lineage: They use data lineage to identify the impact of changes to data assets. Data Profiling: They use data profiling to determine the usability of data assets. Trust Flags: They …
Employment Type: Any
Salary: USD Annual

Global Head of Technical Account Management (TAM)

London, United Kingdom
Coralogix, inc
success across all regions. Partner closely with R&D, Customer Success, Product, Sales, and Support to drive holistic customer outcomes. Hands-On Technical Expertise Maintain hands-on fluency in observability tooling, logging infrastructure, and cloud environments. Act as a senior technical escalation point for complex deployments or architectural challenges. Provide in-depth technical guidance on customer environments, use cases, and … performance analytics. Collaborate on the development of tools and dashboards to ensure visibility and impact tracking. Requirements Technical Experience 10+ years of technical experience in Cloud DevOps, SaaS, or observability, with 5+ years in leadership roles. Strong hands-on experience with AWS, GCP, Azure, K8S, Terraform and observability tools: Prometheus, Grafana, OpenTelemetry, ELK, Splunk, Datadog, and similar. Proficiency with metrics … team members are encouraged to challenge the status quo and contribute to our shared mission. If you thrive in dynamic environments and are eager to shape the future of observability solutions, we'd love to hear from you. Coralogix is an equal opportunity employer and encourages applicants from all backgrounds to apply.
Employment Type: Permanent
Salary: GBP Annual

Security Engineer

Glasgow, United Kingdom
Experis - ManpowerGroup
Role Overview We are seeking a highly capable Security Engineer to join a focused team developing a telemetry pipeline MVP. This role requires deep technical expertise in containerised environments, observability tooling, and secure infrastructure design. The ideal candidate will ensure that security is embedded across the pipeline architecture, from deployment to data flow, while collaborating closely with DevOps and development … risk analysis for the telemetry pipeline Collaborate with DevOps engineers to embed security into infrastructure-as-code and deployment workflows Monitor and respond to security events and alerts from observability platforms Maintain documentation of security architecture, policies, and incident response procedures Required Skills & Experience Strong hands-on experience with Kubernetes and OpenShift in secure production environments Proficiency in GitLab and …
Employment Type: Permanent
Salary: GBP Annual

Data Platform Engineer

London, United Kingdom
Hybrid / WFH Options
Lyst
able to: Contribute to every part of our system, ranging from code and tests to infrastructure changes. Ensure the stability of our system by implementing and improving monitoring and observability tools. Write resilient code that is well tested. Be curious - not just the code, but the architecture of our platforms and everything that enables the business to thrive. Gain expertise … the rest of the organisation, and almost all of Lyst engineering engages with us on a regular basis. We care about robustness and integrity in our pipelines and use observability tools to monitor. Experience in developing robust and secure software solutions and data pipelines. Effective communication skills, comfortable working with technical and non-technical individuals and teams. Proficiency in developing … within public cloud technologies and architecture (preferably AWS experience). Experience with containers (Docker) and container orchestration. Experience with Infrastructure as Code (we use Terraform). Experience utilising monitoring, observability and logging tools. Experience with Git, GitOps, GitHub Actions. Exposure or experience with cloud data warehouse/data platforms (we use Snowflake). Things that matter to us: You have a …
Employment Type: Permanent
Salary: GBP Annual

Director of Product Engineering

Oxford, England, United Kingdom
Hlx Life Sciences
of a secure, cloud-native SaaS platform Partner with Product, UX, and scientific teams to translate genomic needs into scalable software features Oversee full engineering lifecycle – infra, DevOps, QA, observability, and application layer Build and mentor a high-performing engineering team, setting standards and best practices Maintain regulatory alignment and readiness for healthcare/genomics SaaS products Drive innovation by … evolution What You Bring Proven software engineering leadership, including strategy, hiring, delivery, and technical oversight Deep experience building and scaling SaaS platforms (cloud-native, Kubernetes, Terraform, CI/CD, observability) Expertise in modern stacks (Python, TypeScript/Node.js, React) and major clouds (AWS, GCP, Azure, Oracle) Knowledge of security and privacy frameworks: RBAC, encryption, secure API design, identity/auth …

Director of Product Engineering

Banbury, South East England, United Kingdom
Hlx Life Sciences
of a secure, cloud-native SaaS platform Partner with Product, UX, and scientific teams to translate genomic needs into scalable software features Oversee full engineering lifecycle – infra, DevOps, QA, observability, and application layer Build and mentor a high-performing engineering team, setting standards and best practices Maintain regulatory alignment and readiness for healthcare/genomics SaaS products Drive innovation by … evolution What You Bring Proven software engineering leadership, including strategy, hiring, delivery, and technical oversight Deep experience building and scaling SaaS platforms (cloud-native, Kubernetes, Terraform, CI/CD, observability) Expertise in modern stacks (Python, TypeScript/Node.js, React) and major clouds (AWS, GCP, Azure, Oracle) Knowledge of security and privacy frameworks: RBAC, encryption, secure API design, identity/auth …

Global Platform Team Lead and Senior Director - IT Network

London, United Kingdom
The Boston Consulting Group GmbH
networking (SDN), and AI-driven automation. Ensure end-to-end network automation to improve operational efficiency, agility, and reliability. Drive zero-trust network security principles, ensuring compliance and proactive threat mitigation. Establish a global observability and telemetry framework for real-time network insights. Align network strategies with business growth, cloud-first initiatives, and digital transformation. Network Infrastructure & Cloud Networking: Oversee global network architecture, spanning data centers, cloud environments, and enterprise … Implement real-time incident detection and response using AI-driven network analytics. Ensure high availability, network resilience, and 24x7 operational support. Develop a follow-the-sun support model, ensuring global network performance optimization. Implement network observability and predictive analytics to proactively prevent outages. Security, Compliance & Risk Management: Drive zero-trust security frameworks, ensuring secure and resilient network access. Ensure adherence to ISO 27001, NIST, SOC 2, GDPR, and industry best practices. … a senior leadership role, managing large-scale global network environments. Deep expertise in cloud networking (AWS, Azure, GCP), SD-WAN, and network automation. Proven track record in end-to-end network automation, observability, and self-healing networks. Experience in AI-driven networking, predictive analytics, and network telemetry. Strong understanding of zero-trust networking, compliance frameworks, and security policies. Excellent leadership, communication, and stakeholder management skills.
Employment Type: Permanent
Salary: GBP Annual

Senior Platform & Backend Engineer Engineering London

London, United Kingdom
Elder HQ
with event sourcing. All of our systems are on Kubernetes and using the Google Cloud Platform. This role comes with the opportunity to take ownership of our GCP infrastructure, observability, and platform reliability, with a focus on ensuring our systems remain secure, scalable, and well maintained. We encourage collaboration and our engineers are involved in the full development lifecycle, from … engineers in building new APIs and data contracts to support new functionality Maintaining and evolving our cloud infrastructure (GCP, Kubernetes) to ensure high availability, security, and performance Managing service observability and reliability, including logging, metrics and alerting (we use Prometheus and Grafana) Handling database and service upgrades (e.g. MySQL, Kubernetes), secrets management and security best practices Taking ownership of platform … Solid understanding of security best practices across infrastructure and applications, including secrets management and credential rotation. Familiarity with infrastructure-as-code or automation tools is a plus Experience with observability tools (such as Prometheus and Grafana), service monitoring, and debugging in production environments A demonstrated interest in staying up-to-date with new technology, new frameworks, new languages and other …
Employment Type: Permanent
Salary: GBP Annual

Senior Python Engineer £100k benefits

Manchester, Lancashire, England, United Kingdom
Hybrid / WFH Options
Interquest
Great opportunity for Senior Python Engineers to work remotely for a UK-based AI scale-up. You'd join a large engineering department and would work within a cross-functional product-based team responsible for building cloud-native, event-driven …
Employment Type: Full-Time
Salary: £80,000 - £100,000 per annum

Senior Python Engineer (£100k + benefits)

Manchester, North West, United Kingdom
Hybrid / WFH Options
InterQuest Group (UK) Limited
Great opportunity for Senior Python Engineers to work remotely for a UK-based AI scale-up. You'd join a large engineering department and would work within a cross-functional product-based team responsible for building cloud-native, event-driven …
Employment Type: Permanent, Work From Home

DevSecOps Engineer with Security Clearance

Reston, Virginia, United States
Echelon Services, LLC
Job Title: DevSecOps Engineer Location: Reston, VA or Charleston, SC Clearance Required: TS/SCI Employment Type: Full-Time Company Overview Echelon Services LLC is a Native Hawaiian-Owned 8(a) small business that delivers mission-critical IT, cybersecurity …
Employment Type: Permanent
Salary: USD 200,000 Annual

Senior Software Engineer - Network Production Engineer

London, United Kingdom
Bloomberg L.P
tools to manage a large-scale, multi-vendor network with an emphasis on automation, telemetry, and model-driven infrastructure as code. Automate the full network lifecycle, including provisioning, configuration, observability, testing, troubleshooting, and capacity planning. Collaborate with architecture and design teams and the CTO office to implement new technologies that ensure scalability, efficiency, and operational resilience. Develop tools and platforms … that enhance the observability, reliability, and performance of the production network. Enhance existing monitoring and observability frameworks, integrating intelligent alerting and self-remediation capabilities to reduce manual intervention and improve incident response. Define and measure service-level objectives (SLOs) to track infrastructure performance and reliability. Write software utilizing orchestration systems to automate tasks and interact with other systems. Provide mentorship …
Employment Type: Permanent
Salary: GBP Annual

Senior Software Engineer - NSPE Firewall

London, United Kingdom
Bloomberg L.P
tools to manage a large-scale, multi-vendor network with an emphasis on automation, telemetry, and model-driven infrastructure as code. Automate the full network lifecycle, including provisioning, configuration, observability, testing, troubleshooting, and capacity planning. Collaborate with architecture and design teams and the CTO office to implement new technologies that ensure scalability, efficiency, and operational resilience. Develop tools and platforms … that enhance the observability, reliability, and performance of the production network. Enhance existing monitoring and observability frameworks, integrating intelligent alerting and self-remediation capabilities to reduce manual intervention and improve incident response. Define and measure service-level objectives (SLOs) to track infrastructure performance and reliability. Write software utilizing orchestration systems to automate tasks and interact with other systems. Provide mentorship …
Employment Type: Permanent
Salary: GBP Annual

Splunk Enterprise Monitoring Engineer

Decatur, Georgia, United States
SpiceOrb
CD Summary: We are looking for a highly skilled Splunk Subject Matter Expert (SME) and Enterprise Monitoring Engineer to lead the design, implementation, and optimization of our monitoring and observability ecosystem. The ideal candidate will be an expert in Splunk, with a strong background in enterprise IT infrastructure, system performance monitoring, and log analytics. You will play a pivotal role … Strong understanding of network protocols, system logs, and application telemetry. Preferred Qualifications: Splunk certifications (e.g., Splunk Certified Power User, Admin, Architect). Experience with Splunk ITSI, Enterprise Security, or Observability Suite. Knowledge of cloud-native environments (AWS, Azure, or GCP) and cloud monitoring integrations. Experience with log aggregation, security event monitoring, or compliance (e.g., PCI, HIPAA, SOX). Familiarity with …
Employment Type: Permanent
Salary: USD Annual

Senior DevOps Engineer (AWS)

London, United Kingdom
Hybrid / WFH Options
La Fosse Associates
and help define repeatable deployment patterns. Collaborate with SREs, platform engineers, and architects to ensure smooth delivery. Provide hands-on expertise in Kong API Gateway migration. Contribute to observability using Datadog. Use Jira, Confluence, and SonarQube for tracking, documentation, and quality control. Required Skills & Experience 7+ years' experience as a DevOps or Platform Engineer, with deep knowledge of … ability to work with GitLab CI/CD for build and release pipelines. Experience with containerised applications and modern cloud architectures. Familiarity with deploying ACS platforms. Understanding of observability tools - Datadog preferred. Comfortable in collaborative, agile teams, with strong documentation and delivery discipline. If you're a highly skilled AWS engineer looking to lead on a cutting-edge API …
Employment Type: Permanent
Salary: GBP Annual

Director of AWS Platforms

London, United Kingdom
Boston Consulting Group
for the creation, implementation, and continuous improvement of BCG's modern, fully automated SACM function. As the beating heart of IT, this system will serve as the backbone for observability, service reliability, release and change management, and infrastructure management. The leader will drive the automation and governance of BCG's configuration management database (CMDB), integrating it with SRE, ITSM, and … Establish the CMDB as a real-time, trusted system of record for configuration items across cloud, on-prem, and hybrid environments. Embed SACM capabilities into core IT processes including observability, incident response, service management, and architecture governance. Champion automation, transparency, and traceability of all infrastructure, software, and asset relationships. Automation & Integration: Build and operate a fully automated CMDB with bi … reduce risk and accelerate safe deployments. Operational Excellence & SRE Alignment: Apply SRE principles to ensure reliability, performance, and resilience of the SACM platform. Embed SACM into 24x7 operations and observability platforms to support real-time decision-making. Support incident prevention, root cause analysis, and continuous improvement through data-driven insights. Define and enforce service level objectives (SLOs) and key performance …
Employment Type: Permanent
Salary: GBP Annual

AWS Senior Platform Engineer

London, United Kingdom
CACI Limited
at scale, leveraging AWS Organizations, Landing Zones, and multi-account best practices. Develop and maintain Infrastructure as Code solutions using Terraform, CloudFormation, and AWS CDK. Champion security, compliance, and observability by integrating services like AWS Security Hub, GuardDuty, and Inspector. Design CI/CD pipelines to enable seamless deployments and self-service models for customers. Innovate with AWS Networking, KMS … Proficiency in Python, Go, or similar languages for automation and scripting. Expert-level knowledge of AWS Networking, TLS, and security best practices. Experience with container orchestration (Kubernetes, EKS) and observability tools (Grafana, ELK). A passion for innovation, problem-solving, and delivering high-impact solutions. Why Work For Us? 25 days holiday + bank holidays Up to 5% employer pension …
Employment Type: Permanent
Salary: GBP Annual

Platform Engineer

Caldecotte, Milton Keynes, Buckinghamshire, England, United Kingdom
Connells Group HQ
mindset, working directly with development teams to understand their needs and deliver solutions. You will work across multiple technical domains including orchestration, automation, CI/CD pipelines, cloud services, observability, and security, developing deeper expertise in areas that align with platform priorities and your interests. Experience with Microsoft Azure is essential. You will play your part in operating the platform aligned … with Docker and basic Kubernetes concepts Understanding of cloud networking concepts (VNets, subnets, NSGs) Awareness of cloud security best practices and compliance requirements Basic knowledge of monitoring, logging, and observability tools Understanding of cloud cost management and resource optimisation principles Comfort with troubleshooting and supporting development teams Understanding of service reliability and incident response practices Connells Group UK is an …
Employment Type: Full-Time
Salary: Competitive salary

Systems Development Engineer, Kuiper Enterprise Technology-Low Earth Orbit Satellites

Bellevue, Washington, United States
Amazon Kuiper Manufacturing Enterprises LLC
on AWS. Key job responsibilities Manage and maintain Kuiper's SAP infrastructure. Collaborate directly with customers to understand their unique use cases and implement tailored solutions Implement and improve observability measures across the team's infrastructure Implement and maintain Infrastructure as Code (IaC) practices for all managed systems Apply DevOps best practices to improve system reliability, scalability, and security Troubleshoot … you will function as a DevOps Engineer. You will operate and support the team's services, develop automation for upgrades/patching/testing, and enhance observability of the systems. You will work with stakeholders and senior engineers to design and develop custom solutions and integrations between tools and other services in a secure manner. About the …
Employment Type: Permanent
Salary: USD Annual

Senior Software Engineer

Dublin, Ireland
General Motors
credentialing, key exchange via serial/USB) aligned to enterprise security standards. Develop robust Wi-Fi networking and enterprise service integration (REST, message queues) with resilient error handling. Enable observability with structured logging, metrics, and diagnostics; participate in on-call rotations supporting global plant operations. Collaborate on API contracts, device state models, and secure endpoints; influence architecture for scalability and … using Java Spring Boot (REST APIs, data persistence, messaging/streaming integration). Build and maintain Angular front-end applications (TypeScript, RxJS) with responsive, accessible, and performant UIs. Establish observability across services and UIs (logging, metrics, tracing, SLOs, dashboards). Apply security best practices (OWASP, OAuth2/OIDC, secrets management). Drive coding standards, testing strategies, and design reviews; mentor …
Employment Type: Permanent
Salary: EUR 125,000 - 150,000 Annual

Staff Platform Engineer with Security Clearance

Lexington, Massachusetts, United States
Hybrid / WFH Options
Raft
for disconnected operations and must ensure a smooth software deployment process for applications developed on IL4 and delivering them to IL6/SIPR. You will be responsible for ensuring observability, monitoring, and alerting operate as engineered by client application teams. These processes will be documented and executed with the assistance of run books, checklists, and rely on you to keep … applications Highly preferred: - Background within DoD/Air Force AOC Weapon System and operating standards within cleared facilities (SIPR, IL6) - Familiarity with AWS and cloud technologies - Skill in operating observability tooling and alerting (Prometheus, Grafana, etc.) - Knowledge of Platform One Big Bang Clearance Requirements: Active Secret security clearance Work Type: Hybrid - Hanscom AFB, MA highly preferred (or local to Reston …
Employment Type: Permanent
Salary: USD 190,000 Annual
Observability
10th Percentile: £57,500
25th Percentile: £67,500
Median: £80,000
75th Percentile: £100,000
90th Percentile: £130,000