Observability Job Vacancies

401 to 425 of 516 Observability Jobs

Cloud Engineer DV Cleared

london, south east england, united kingdom
Damia Group
automation, and container orchestration. You will be instrumental in shaping enterprise-ready cloud solutions by applying deep technical expertise in AWS alongside knowledge of multi-cloud environments, identity management, observability, and cost optimisation. Key Responsibilities Design and implement secure, scalable AWS cloud architectures Drive Infrastructure as Code (IaC) adoption using Terraform and CloudFormation Build, optimise, and automate CI/CD … GitHub Actions, and related tools Deploy and manage containerised solutions with Docker, Kubernetes, and Helm Implement strong security and access controls using IAM, Vault, and Secrets Manager Enhance platform observability using Prometheus, Grafana, and ELK Stack Collaborate with cross-functional teams to deliver robust, high-availability solutions Key Skills & Experience Extensive hands-on experience with AWS (Azure knowledge beneficial) Expertise … in Terraform, CloudFormation, and automation tooling Strong containerisation skills with Kubernetes, Docker, and related platforms Proven background in cloud security, IAM, and governance Solid understanding of monitoring and observability stacks Ability to influence architecture decisions and align solutions to best practices Desired Certifications AWS Certified Solutions Architect – Associate/Professional AWS Certified Security – Specialty HashiCorp Certified: Terraform Associate Kubernetes Certified More ❯
Posted:

Cloud Engineer DV Cleared

slough, south east england, united kingdom
Damia Group
automation, and container orchestration. You will be instrumental in shaping enterprise-ready cloud solutions by applying deep technical expertise in AWS alongside knowledge of multi-cloud environments, identity management, observability, and cost optimisation. Key Responsibilities Design and implement secure, scalable AWS cloud architectures Drive Infrastructure as Code (IaC) adoption using Terraform and CloudFormation Build, optimise, and automate CI/CD … GitHub Actions, and related tools Deploy and manage containerised solutions with Docker, Kubernetes, and Helm Implement strong security and access controls using IAM, Vault, and Secrets Manager Enhance platform observability using Prometheus, Grafana, and ELK Stack Collaborate with cross-functional teams to deliver robust, high-availability solutions Key Skills & Experience Extensive hands-on experience with AWS (Azure knowledge beneficial) Expertise … in Terraform, CloudFormation, and automation tooling Strong containerisation skills with Kubernetes, Docker, and related platforms Proven background in cloud security, IAM, and governance Solid understanding of monitoring and observability stacks Ability to influence architecture decisions and align solutions to best practices Desired Certifications AWS Certified Solutions Architect – Associate/Professional AWS Certified Security – Specialty HashiCorp Certified: Terraform Associate Kubernetes Certified More ❯
Posted:

Cloud Engineer DV Cleared

london (city of london), south east england, united kingdom
Damia Group
automation, and container orchestration. You will be instrumental in shaping enterprise-ready cloud solutions by applying deep technical expertise in AWS alongside knowledge of multi-cloud environments, identity management, observability, and cost optimisation. Key Responsibilities Design and implement secure, scalable AWS cloud architectures Drive Infrastructure as Code (IaC) adoption using Terraform and CloudFormation Build, optimise, and automate CI/CD … GitHub Actions, and related tools Deploy and manage containerised solutions with Docker, Kubernetes, and Helm Implement strong security and access controls using IAM, Vault, and Secrets Manager Enhance platform observability using Prometheus, Grafana, and ELK Stack Collaborate with cross-functional teams to deliver robust, high-availability solutions Key Skills & Experience Extensive hands-on experience with AWS (Azure knowledge beneficial) Expertise … in Terraform, CloudFormation, and automation tooling Strong containerisation skills with Kubernetes, Docker, and related platforms Proven background in cloud security, IAM, and governance Solid understanding of monitoring and observability stacks Ability to influence architecture decisions and align solutions to best practices Desired Certifications AWS Certified Solutions Architect – Associate/Professional AWS Certified Security – Specialty HashiCorp Certified: Terraform Associate Kubernetes Certified More ❯
Posted:

DevOps Engineer - AWS

IRELAND, Republic of Ireland
The Recruitment Company
/year + Benefits Your Role: Design and implement Kubernetes (EKS) platforms on AWS Maintain and optimize CI/CD pipelines using GitHub Actions and Harness.io Implement and manage observability with Datadog across platforms Support and enhance AWS cloud infrastructure Review, audit, optimize, and document deployment processes Adhere to change management processes aligned with ISO27001 and PCI-DSS Enable self … costs across infrastructure Your Experience & Qualifications: 5+ years’ experience in a senior Platform/DevOps role Strong Kubernetes experience, particularly with AWS ECS Expertise in AWS EKS and Datadog observability Solid AWS knowledge across compute, database, and security services Proficiency in Infrastructure as Code using Terraform A solid understanding of container security and best practices Strong scripting skills (Python, Bash More ❯
Employment Type: Permanent
Salary: £80000 - £105000/annum
Posted:

Technical Lead

Cardiff, South Glamorgan, United Kingdom
Aryza Group
to deployment and maintenance. • Ensure on-time delivery by identifying and mitigating risks early. • Champion CI/CD practices and ensure smooth, automated deployment pipelines. 5. Reliability, Security, and Observability • Own the uptime, latency, and performance SLAs of financial APIs and services. • Proactively monitor risk vectors and enforce observability via metrics, logging, and alerting. • Work with DevSecOps to embed security More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

AWS Cloud Architect

Tarrytown, New York, United States
Robotics technology LLC
Define and enforce SSDLC (Secure Software Development Lifecycle) and DevOps processes, integrating security and compliance controls Design and implement Terraform Enterprise strategies for modular, reusable IaC at scale Establish observability frameworks: metrics, logs, traces (Prometheus, Grafana, Loki, Jaeger, Dynatrace, Splunk) Design robust network, security, backup, and disaster recovery architectures across clouds Facilitate client workshops, gather requirements, and present solution designs … Anthos and hybrid cloud orchestration Proven experience designing and implementing Terraform Enterprise for large portfolios Strong background in DevOps tooling, CI/CD pipelines, and SSDLC integration Expertise in observability solutions: PrometheGrafana, Dynatrace, Splunk, distributed tracing Solid understanding of networking (VPC, VPN, load balancing) and cloud security (IAM, encryption, WAF, zero trust) Experience with multicloud backup and disaster recovery strategies More ❯
Employment Type: Any
Salary: USD Annual
Posted:

Senior Full-Stack Developer (Golang/ React)

Edinburgh & Lothians, Scotland, United Kingdom
Hybrid / WFH Options
Neogen Recruitment Solutions Ltd
will own full-stack features end-to-end: design and implement backend services in Go, build responsive React frontends, integrate third-party systems, and help maintain CI/CD, observability and security. Youll work closely with product, design and customer success to deliver production-grade software. Key responsibilities - Design, implement and maintain backend services and APIs in Go (Golang). … with CI/CD tooling (GitHub Actions, GitLab CI, CircleCI, etc.). - Automated testing experience (unit/integration/e2e). - Good engineering practices: code reviews, TDD/BDD, observability (metrics/logs/tracing). - Strong communication skills and ability to work asynchronously in a remote team. Nice-to-have - Experience with procurement, finance or ERP integrations (SAP, Oracle More ❯
Employment Type: Permanent, Work From Home
Salary: £80,000
Posted:

Cloud Architect

Tarrytown, New York, United States
Robotics technology LLC
Define and enforce SSDLC (Secure Software Development Lifecycle) and DevOps processes, integrating security and compliance controls Design and implement Terraform Enterprise strategies for modular, reusable IaC at scale Establish observability frameworks: metrics, logs, traces (Prometheus, Grafana, Loki, Jaeger, Dynatrace, Splunk) Design robust network, security, backup, and disaster recovery architectures across clouds Facilitate client workshops, gather requirements, and present solution designs … Anthos and hybrid cloud orchestration Proven experience designing and implementing Terraform Enterprise for large portfolios Strong background in DevOps tooling, CI/CD pipelines, and SSDLC integration Expertise in observability solutions: PrometheGrafana, Dynatrace, Splunk, distributed tracing Solid understanding of networking (VPC, VPN, load balancing) and cloud security (IAM, encryption, WAF, zero trust) Experience with multicloud backup and disaster recovery strategies More ❯
Employment Type: Any
Salary: USD Annual
Posted:

Data Engineer London, Singapore

London, United Kingdom
GSR Markets Limited
Monitor, troubleshoot, and optimize data pipelines to ensure performance and cost efficiency. Implement data governance, access controls, and security measures in line with best practices and regulatory standards. Develop observability and anomaly detection tools to support Tier 1 systems. Work with engineers and business teams to gather requirements and translate them into technical solutions. Maintain documentation, follow coding standards, and … to work across technical and non-technical teams. Additional Strengths Experience with orchestration tools like Apache Airflow. Knowledge of real-time data processing and event-driven architectures. Familiarity with observability tools and anomaly detection for production systems. Exposure to data visualization platforms such as Tableau or Looker. Relevant cloud or data engineering certifications. What we offer: A collaborative and transparent … ELT workflows with Apache Airflow (or similar) and integrating them into containerised CI/CD pipelines (Docker, GitHub Actions, Jenkins, etc.)? Select Which option best describes your experience building observability and automated anomaly detection tooling for data pipelines? Select What best describes your current location and working rights status? Select By submitting your application, you confirm that you have read More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Senior Director, Engineering

Dublin, Ireland
Cornerstone Research
lead the global Cloud & Automation team. As a key member of the Engineering leadership team, you will drive the modernization of our SaaS platform, delivering intelligent automation, AI-powered observability, and highly scalable infrastructure across AWS and GCP. You will lead the transformation of legacy monolithic systems into modern, service-based architectures while building a high-performing, diverse engineering team. … architectures. Hands-on with Terraform, AWS CloudFormation to manage and version infrastructure. Ability to design modular, reusable infrastructure components across environments. Deep understanding of cloud-native practices, security, and observability tooling. Experience with relational databases: SQL Server, Oracle, MySQL. Strong time management, communication, and cross-functional leadership skills. Master's degree in computer science, Engineering, or equivalent experience. Culture and More ❯
Employment Type: Permanent
Salary: EUR 150,000 - 200,000 Annual
Posted:

Principal Architect

London, United Kingdom
Fractal
and automation. Collaborate closely with Product Managers and GenAI/Data Science leaders to translate business vision into technical delivery. Scalability & Reliability Implement frameworks for CI/CD, testing, observability, and MLOps integration. Ensure systems meet performance, uptime, and data security standards required by global CPG enterprises. Plan for multi-market, multi-category deployments. Long-term Productization Create a … intensive applications (streaming, APIs, ETL, data warehouses, event-driven systems). Experience in enterprise SaaS, AI/ML integration, or analytics platforms. Deep knowledge of DevOps, CI/CD, observability, and security best practices. Degree in Computer Science, Engineering, or related field (MS preferred). Benefits Lead engineering for a high-growth AI-first company disrupting the global CPG industry. More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Systems Engineer - Kubernetes Implementation (Ground-Up Build) with Security Clearance

Maryland, United States
FUSE Engineering
or integrate with existing runtimes (e.g., containerd, CRI-O). Implement cluster networking, load balancing, and service discovery mechanisms. Ensure robust authentication, RBAC, auditing, and security policy enforcement. Integrate observability tooling for metrics, logging, and tracing. Work with cross-functional teams to ensure the platform supports internal use cases at scale. Write clean, maintainable, well-tested code and documentation. Stay More ❯
Employment Type: Permanent
Salary: USD Annual
Posted:

Application Support Specialist

London, Cathedrals, United Kingdom
SR2
a collaborative, low-ego environment to maintain and improve a market-leading platform that drives revenue and builds stronger audience relationships. As an Application Support Engineer, you will: Ensure observability tools are configured to give full visibility of system health. Respond to incidents and urgent issues in a timely and effective manner. Investigate and triage operational issues in collaboration with More ❯
Employment Type: Permanent
Salary: £40000 - £50000/annum
Posted:

GenAI Architect - Global Insurance Firm

London, South East, England, United Kingdom
Robert Walters
Databricks, M365, SharePoint, Salesforce, ServiceNow). Guide engineering teams through reference implementations and scalable architecture models. Partner with platform teams on enabling LLMOps/AgentOps-covering prompt management, evaluation, observability, and governance. Collaborate with business stakeholders to shape AI use cases-automated summarisation, triage, classification, and decision support. Ensure secure, policy-compliant access to enterprise data in AI pipelines. What More ❯
Employment Type: Full-Time
Salary: £100,000 - £200,000 per annum
Posted:

Head of Engineering (Hands on)- London Hybrid

London, United Kingdom
Hybrid / WFH Options
Develop
hosting Ability to design data pipelines and feedback loops for improving AI-driven features Awareness of emerging AI areas such as multimodal, edge AI, or AI in DevOps/observability Why Join? A leadership role with real ownership and autonomy A chance to shape the future of the engineering function while remaining close to the code Competitive salary (£90k-£120k More ❯
Employment Type: Permanent, Work From Home
Posted:

GPU Cluster Architect

Amsterdam, North Holland, Netherlands
Hybrid / WFH Options
Highfield Professional Solutions Ltd
solutions (InfiniBand HDR/NDR, RoCEv2) at rack, POD, and DC scale. Data & Storage: Partner with storage teams to optimise training data access, checkpointing, and high-performance throughput. Reliability & Observability: Translate signals from monitoring and telemetry systems into design improvements and reliability gains. Cross-Functional Collaboration: Work closely with reliability, networking, storage, and data centre engineering teams to deliver designs More ❯
Employment Type: Permanent
Salary: £90000 - £115000/annum
Posted:

Data Lead - HRIS Project

Birmingham, West Midlands, United Kingdom
Tarmac Trading Limited
role you'll need: Practical experience and execution of data governance & data management practices and tools such as data cataloguing, data discovery, lineage mapping, data dictionaries, data quality/observability and data security, data compliance The ability to align data initiatives with business goals and drive data maturity across the organisation. To be able to Identify opportunities where data can More ❯
Employment Type: Contract
Posted:

Customer Technology Advisory - AMS

Hoofddorp, Noord-Holland, Netherlands
Kyndryl
work with others. Required Skills and Experience Deep domain knowledge of Services offerings and technical solutions in Application Management Services covering but not limited to Application Modernization & Migration, Application Observability, Application Maintenance & Support, application Engineering including Development, Testing & Release Management Demonstrated experience translating distinctive technical knowledge into actionable customer insights and solutions Prior consultative selling experience Externally recognized as an More ❯
Employment Type: Permanent
Salary: EUR Annual
Posted:

Channel Security Solutions Engineer

London, United Kingdom
Hybrid / WFH Options
Cisco Systems
Channel Security Solutions Engineer Apply () Location:London, United Kingdom Alternate LocationLuxembourg, Belgium, Area of InterestEngineer - Pre Sales and Product Management Job TypeProfessional Technology InterestSecurity and Observability Job Id What You'll Do This role requires a growth mindset with an extensive skill set - the combination of creative vision, cross-functional collaboration, sales experience, strong execution, Service Provider, and global service More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Site Reliability Engineer

United Kingdom
Hybrid / WFH Options
Halian Technology Limited
We're Hiring: Mid-Level Site Reliability Engineer (SRE) This role would be Fully Remote, Permanent position Are you passionate about automation, observability, and scaling systems to support millions of users? Join ourclients SRE teamwithin thePlatform Engineeringorganization and help us build resilient, secure, and high-performing infrastructure. What Youll Do: Diagnose and resolve complex infrastructure and application issues Participate in … 24x7 on-call rotation, SCRUM, and deployment planning Perform Root Cause Analysis and guide application teams Improve system availability using industry-leading observability tools Influence architecture and design decisions with a security-first mindset Build tools and automation to streamline delivery and operations Tech Stack Highlights: Java, Kotlin, C++, Postgres AWS (EC2, ECS, Fargate, Route53, ALB/NLB) Observability: New More ❯
Employment Type: Permanent, Work From Home
Salary: £90,000
Posted:

Senior DevOps Engineer (AWS)

Seattle, Washington, United States
Sinclair Broadcast Group
Senior AWS Platform/DevOps Engineer to design, deploy, and operate the backbone of our new streaming platform. You will own our AWS environment end-to-end: infrastructure, automation, observability, and security. As our first DevOps hire, you'll work directly with the Head of Engineering (ex-Amazon, ex-Prime Video) and a small, senior development team. This is a … AWS CodePipeline). Security & Compliance Configure IAM roles, policies, and least-privilege access. Implement and monitor CloudTrail, GuardDuty, Security Hub, WAF. Enforce tagging, cost controls, and guardrails across environments. Observability & Reliability Set up CloudWatch metrics, dashboards, and alarms. Implement distributed tracing (AWS X-Ray) and log analytics (OpenSearch or equivalent). Create synthetic canaries to validate key user flows (login … acceptable with willingness to switch) Skilled in CI/CD pipelines (GitHub Actions, CodePipeline, or Jenkins) Strong grasp of IAM, VPC networking, and AWS security best practices Experience with observability: CloudWatch, X-Ray, logging pipelines, and alerting Proficiency in scripting languages (Python, Bash) Strong communication and documentation skills; able to work independently in a small, fast-moving team Nice to More ❯
Employment Type: Permanent
Salary: USD 170,000 Annual
Posted:

Cloud Engineer

London, South East, England, United Kingdom
Robert Walters
cloud infrastructure while championing automation through Infrastructure as Code solutions such as Terraform. Your day-to-day activities will involve collaborating with SRE and engineering teams to enhance system observability, proactively managing operational risks, maintaining high standards of security compliance, and ensuring robust disaster recovery capabilities. You will be responsible for documenting risks, tracking remediation actions, implementing process improvements, and … to cloud-hosted services to ensure seamless business operations.* Maintain the reliability and security of cloud environments by implementing robust monitoring tools and adhering to industry best practices.* Enhance observability and telemetry within cloud-hosted environments using SRE methodologies to deliver on Service Level Agreements (SLAs), Objectives (SLOs), and Indicators (SLIs).* Document and regularly review operational risks within the More ❯
Employment Type: Full-Time
Salary: £70,000 - £85,000 per annum
Posted:

Site Reliability Engineering (SRE) Manager

London, United Kingdom
Hybrid / WFH Options
SS&C
and postmortem processes, driving root cause analysis and long-term fixes. Automation & Tooling Champion automation to reduce toil and improve system reliability. Oversee the development and maintenance of internal observability, tools and platforms. Collaborate with engineering and DevOps teams to embed reliability into the software development lifecycle. Collaboration & Strategy Partner with product, engineering, DevOps and Customer Support teams to align … on priorities and roadmaps. Contribute to the strategic direction of infrastructure and reliability initiatives. Advocate for best practices in observability, CI/CD, and infrastructure as code. What You Will Bring: Proven experience managing or leading SRE, DevOps, or infrastructure teams. Strong background in systems engineering, cloud platforms (AWS, Azure), and container orchestration (Kubernetes). Proficiency in monitoring, alerting, and More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Lead DevOps Engineer

Nottingham, Nottinghamshire, United Kingdom
London Stock Exchange Group
CI/CD pipelineswithGitLab CI or Jenkinsto enable fast, secure, and reliable software delivery. o Champion Kubernetes-based platformsusingAmazon EKSandIstio Service Meshto build scalable, service-oriented architectures. o Drive observability and reliability engineeringthrough proactive monitoring, alerting, and incident response strategies. o Mentor and guide DevOps engineers, fostering a culture of continuous improvement, automation, and operational excellence. o Collaborate cross-functionallywith … We're looking for someone with deep expertise in: oInfrastructure as Code: Terraform, CloudFormation o Security best practices: IAM, KMS, encryption in transit/at rest, DevSecOps o Monitoring & observability: Datadog, Prometheus, Grafana, ELK, or similar What You Bring o 6+ years in DevOps or platform engineering, with experience in a technical lead role. o Proven experience designing and operating More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

AWS Cloud Developer

London, South East, England, United Kingdom
Hybrid / WFH Options
VML Enterprise Solutions
mindset to your work Communicate effectively with excellent written and verbal skills. Familiarity with Diagrams-as-Code for documenting infrastructure architecture. Designing solutions observing cross-cutting concerns such as observability and system security Taking ownership of deployments in a true devops model MINIMUM QUALIFICATIONS/SKILLS Bachelor's degree in Computer Science, Engineering, or a related field (or equivalent work … skills. Familiarity with Diagrams-as-Code for documenting infrastructure architecture is a plus. Understanding of modern authentication protocols such as OAuth2 and OIDC. Consideration of cross-cutting concerns like observability and security in infrastructure design. Contributions to Open Source projects are a plus. As an equal opportunity employer we welcome applications that reflect the diversity of our wider community. Please More ❯
Employment Type: Full-Time
Salary: Salary negotiable
Posted:
Observability
10th Percentile
£57,500
25th Percentile
£67,500
Median
£80,000
75th Percentile
£100,000
90th Percentile
£130,000