151 to 175 of 546 Observability Jobs

Sr. Project Manager/Program Manager - Digital Twin / AIOps (OSS)

Hiring Organisation
Stackstudio Digital Ltd
Location
Reading, Berkshire, South East, United Kingdom
Employment Type
Permanent
Salary
£70,000
streaming/data pipelines (Kafka, Pub/Sub, Dataflow) Familiarity with cloud-native (GCP/Azure/AWS), Kubernetes, API-first integration, and observability stacks (metrics, logs, traces) Experience in ML/AI (feature engineering, MLOps, model monitoring), automation frameworks (RPA/BPM/runbooks), and security/compliance. Proven ...

Infrastructure Networking Engineer (GKE Specialist)

Hiring Organisation
Searchability NS&D
Location
England, United Kingdom
Evaluate and optimise Cluster Architecture and Tenancy configurations. Assess and improve Networking and Connectivity setups within the cloud environment. Review Security protocols, Operations, and Observability standards. Analyse Automation processes and CI/CD pipelines for efficiency. Audit Cost Management, Billing structures, and Testing methodologies. Key Skills: Google Kubernetes Engine ...

DevOps Engineer

Hiring Organisation
Opus Recruitment Solutions Ltd
Location
Bristol, Avon, England, United Kingdom
Employment Type
Contractor
Contract Rate
£400 per day
/CD pipelines for Node.js services Architecting scalable, resilient cloud solutions for data-heavy workloads Managing containerised environments (Docker, Kubernetes/EKS) Improving observability across platforms, including monitoring, logging, and alerting Collaborating closely with Node.js engineers to streamline deployments and environments Embedding best practices across security, cost optimisation, and operational ...

Site Reliability Engineer Trainer / Coach

Hiring Organisation
CBSbutler Holdings Limited
Location
Cheshire, North West, United Kingdom
Employment Type
Contract, Work From Home
engineering teams on reliability engineering, automation, and incident management. - Guide teams in defining and implementing SLOs, SLIs, and error budgets. - Promote best practices in observability, monitoring and incident response. - Create and curate e-learning content, assessments, and certification pathways. Skills and Experience required: - Proven experience as an SRE, SRE Coach ...

Java Software Engineer

Hiring Organisation
La Fosse
Location
England, UK
Azure/GCP) • Strong testing mindset (unit, integration, contract) and automation awareness • Understanding of OAuth2/OIDC, JWT and general security patterns • Familiarity with observability tools (OpenTelemetry, Prometheus, Grafana, ELK etc.) Nice to have • Previous work with healthcare or regulated environments • Experience in distributed systems or platform teams • Experience mentoring ...

Principal Engineer

Hiring Organisation
Hays
Location
London, United Kingdom
Employment Type
Contract
Contract Rate
£600.0 - £640.0 per day + up to £640 pd (Inside IR35)
programming language (4GL) Strong skills in object-oriented analysis and design (OOAD) Proven experience with C#, Java (Spring Boot, JPA/Hibernate), REST APIs, observability, monitoring, queue technologies, security, and relational databases (especially Postgres). Strong understanding of SOLID principles, microservice development focused on high availability and data integrity ...

Senior Software Engineer - Backend

Hiring Organisation
Fruition Group
Location
Leeds, West Yorkshire, Yorkshire, United Kingdom
Employment Type
Permanent
Salary
£70,000
decisions across teams. Lead by example, writing high-quality, maintainable code in Node.js and TypeScript. Design and optimise CI/CD pipelines, improving automation, observability, and release processes. Collaborate cross-functionally with product and platform teams to deliver robust services. Mentor and coach engineers, helping to raise the overall ...

Senior Backend Engineer

Hiring Organisation
Fruition Group
Location
Leeds, West Yorkshire, Yorkshire, United Kingdom
Employment Type
Permanent
Salary
£70,000
decisions across teams. Lead by example, writing high-quality, maintainable code in Node.js and TypeScript. Design and optimise CI/CD pipelines, improving automation, observability, and release processes. Collaborate cross-functionally with product and platform teams to deliver robust services. Mentor and coach engineers, helping to raise the overall ...

AWS Solutions Architect

Hiring Organisation
Henderson Scott
Location
London, United Kingdom
Employment Type
Permanent, Work From Home
Salary
£550 - £575 per day
decision-making. Desirable (Optional) Experience: We value architects who can adapt quickly to unfamiliar tools. Experience in any of the following is highly advantageous: Observability: Elasticsearch Stack, Dynatrace, Prometheus, or Grafana. Security & Identity: Hashicorp Vault, LDAP, Redhat SSO, OIDC, and Firewalling (Fortigate/AWS Network Firewall). Infrastructure/DevOps ...

Infrastructure Architect - SC Cleared

Hiring Organisation
Sanderson Recruitment
Location
London, United Kingdom
Employment Type
Contract
Contract Rate
£600 - £650 per day + Inside IR-35
/CD. Desirable Skills Advanced networking (EVPN/VXLAN). Automation and DevOps tools (Ansible, Terraform, CI/CD pipelines). Advanced security and observability solutions. Kubernetes and modern application/data platforms. Storage and GPU technologies. Professional Requirements Proven experience leading technical design authority sessions. Strong documentation skills (HLDs ...

Java Developer – Market Connectivity

Hiring Organisation
Solytics Partners
Location
City of London, London, United Kingdom
resolve connectivity, performance, and messaging issues in production. Conduct latency tuning, load testing, and system optimization. Collaborate with SRE/DevOps teams to improve observability, CI/CD, and deployment pipelines. Key Requirements: Strong expertise in Core Java, including concurrency, memory management, and GC tuning. Hands-on experience integrating ...

Machine Learning Engineer

Hiring Organisation
algo1
Location
City of London, London, United Kingdom
serving latency or pipeline robustness. Month 3: Own and deliver a major infrastructure component (e.g., feature store, training orchestration, or model registry); improve system observability with logging, metrics, and alerting. Month 6: Lead the end-to-end productionisation of our foundation model, meeting latency, throughput, and reliability SLAs; mentor teammates ...

Senior Engineer Data, AI & Analytics (m/w/d) - Hybrid

Hiring Organisation
Purpose Green
Location
Charlottenburg, Berlin, Germany
Employment Type
Permanent
Salary
EUR 50,000 - 75,000 Annual
Redshift, OpenSearch) • Hands-on experience deploying AI/LLM-based systems into production • Experience using dbt Cloud for transformation pipelines • Familiarity with tracing and observability (e.g., Langfuse, OpenTelemetry) • Experience preparing datasets and running supervised fine-tuning (SFT) of LLMs • Exposure to reverse ETL tools (e.g., Census, Hightouch) or building custom ...

Platform Engineer / DevOps (AI / ML)

Hiring Organisation
Europe-Leading Media Technology Company
Location
Norwich, England, United Kingdom
/CD pipelines that enable frequent, safe releases Manage infrastructure using IaC (Terraform, Pulumi, or equivalent), keeping environments repeatable and auditable Build strong observability: metrics, logging, tracing, alerting, dashboards, and runbooks that are actually used Support storage and data systems, including S3-compatible object storage and PostgreSQL at scale Support ...

Lead Back End Engineer

Hiring Organisation
mkodo
Location
City of London, London, United Kingdom
projects and features to good outcomes, ensuring appropriate engineering decisions are made to factor in technical debt, systems design, stability/reliability, monitoring/observability and business need. Hands-On Guidance Contribute to key backend systems when your expertise is needed. Review and refine critical code, ensuring alignment with architectural ...

Senior Software Engineer, Infrastructure Security

Hiring Organisation
Klaviyo Inc
Location
Dublin, Ireland
Employment Type
Permanent
Salary
EUR 125,000 - 150,000 Annual
applying AI thoughtfully to improve engineering productivity and system capabilities. You take ownership of operational excellence for the systems you build, including performance, reliability, observability, and on call participation where required. You enjoy questioning convention and continuously improving how things work, whether that's architecture, tooling, workflows, or team practices. ...

Data and Solutions Engineer

Hiring Organisation
Perch Group
Location
Manchester, England, United Kingdom
Development : Build and manage high-quality, secure, and governed ETL processes using on-premise and cloud-based data sources. Data Quality & Monitoring: Design data observability and quality checks into all ETL processes, enabling proactive identification of discrepancies. Performance Optimisation: Optimise data platforms, ETL processes, and database solutions for reusability ...

MongoDB-Site Reliability

Hiring Organisation
Barclays Bank Plc
Location
Knutsford, Cheshire, UK
Employment Type
Full-time
paced environment, your role will be essential to ensuring our infrastructure remains resilient, secure, and scalable. You'll work on automating operations, enhancing system observability, and driving continuous improvements that reduce downtime and improve efficiency. If you're motivated by solving, multi-layered problems and building systems that perform reliably ...

Artificial Intelligence Engineer

Hiring Organisation
Zeus® | AI-Powered Logistics
Location
London Area, United Kingdom
intro (role, your goals) Python data task (clean/transform + quick viz) API & systems chat (design a small data pull + retries/observability) Simulation case (parameter trade-offs; short write-up) Stakeholder round (present findings; Q&A) What we offer Private health insurance Pension plan Central working location ...

Software Test & Verification Engineer

Hiring Organisation
AssetCool
Location
Leeds, England, United Kingdom
structured test reports. Investigate field failures by analysing logs, telemetry, and system behaviour to identify root causes. Collaborate with engineers to improve testability, observability, and fault handling in production software. Contribute to continuous integration pipelines for automated builds, static analysis, and test execution. Support lab and field-testing efforts ...

Senior Full-Stack AI Engineer — Frontend & LLM-Enabled UX (Level 3)

Hiring Organisation
Allegis Global Solutions
Location
London, England, United Kingdom
teams: sprint planning, incremental delivery, feature flagging, A/B tests and operationalizing metrics. Strong software engineering fundamentals: testing (unit/integration/E2E), observability, performance profiling, and security-aware development. Excellent communication skills and experience mentoring engineers and collaborating across product, design and ML teams. Preferred Requirements Familiarity with ...

Senior Platform Engineer

Hiring Organisation
Revybe IT Recruitment Ltd
Location
Cardiff, South Glamorgan, Wales, United Kingdom
Employment Type
Full-Time
Salary
£65,000 - £80,000 per annum
Tech Stack Cloud: AWS (EC2, RDS, S3, IAM, CloudWatch, Lambda) Infrastructure as Code: Terraform Containerisation & Orchestration: Docker, Kubernetes (EKS), Helm Configuration Management: Ansible Monitoring & Observability: Grafana, Prometheus CI/CD: GitHub Actions Automation & Scripting: Python, Bash, Go or Java What We’re Looking For Proven experience running AWS cloud infrastructure … regulated (financial) environment. Hands-on experience managing Kubernetes clusters (preferably EKS). Strong understanding of Infrastructure as Code using Terraform. Familiarity with monitoring and observability stacks such as Prometheus and Grafana. Experience building and maintaining CI/CD pipelines (GitHub Actions or similar). Strong scripting or automation skills using ...

Senior Site Reliability Engineering - Database Operations

Hiring Organisation
N26 GmbH
Location
Berlin, Germany
Employment Type
Permanent
Salary
EUR Annual
strategy, roadmap, and architecture. Lead and drive incident management and troubleshooting efforts, ensuring a stable and predictable environment. Define, implement, and maintain comprehensive observability solutions (metrics, logging, tracing) to ensure the Platform meets availability targets The ideal candidate is a seasoned SRE expert ready to tackle the challenges … with CI/CD pipelines (GitHub Actions, ArgoCD, Jenkins, or similar). Familiarity with networking and security best practices in cloud environments. Familiarity with observability tools (DataDog, Prometheus, Grafana, OpenTelemetry). Nice to have: Experience in scaled operational storage solution production systems. In-depth knowledge of database internals including performance ...

Senior SRE - Database Operations

Hiring Organisation
N26 GmbH
Location
Berlin, Germany
Employment Type
Permanent
Salary
EUR Annual
strategy, roadmap, and architecture. Lead and drive incident management and troubleshooting efforts, ensuring a stable and predictable environment. Define, implement, and maintain comprehensive observability solutions (metrics, logging, tracing) to ensure the Platform meets availability targets The ideal candidate is a seasoned SRE expert ready to tackle the challenges … with CI/CD pipelines (GitHub Actions, ArgoCD, Jenkins, or similar). Familiarity with networking and security best practices in cloud environments. Familiarity with observability tools (DataDog, Prometheus, Grafana, OpenTelemetry). Nice to have: Experience in scaled operational storage solution production systems. In-depth knowledge of database internals including performance ...

Software Engineer

Hiring Organisation
KMJJ Enterprise, LLC
Location
Fayette, Alabama, United States
Employment Type
Any
Salary
USD Annual
methodologies, and tools such as GitLab CI and Jenkins Git Source Control System Position Desired Skills Datacenter Infrastructure Management (DCIM) tools such as Netbox Observability and Analytics platform solutions such as Splunk Identity and Access Management (IAM) solutions such as Keycloak Secret Management tools such as HashiCorp Vault Familiar with ...