451 to 475 of 1,199 Permanent Observability Jobs

Director, MedTech Surgery Data Analytics & AI

Hiring Organisation
Jobleads-UK
Location
High Wycombe, England, United Kingdom
Responsible AI leader role patterns). Platform & Architecture Partnership (Modern Data Stack): Define target-state data architecture for Surgery (integration patterns, pipeline standards, interoperability, observability) and drive reusable components/patterns to accelerate delivery of data products. Partner with platform/architecture leaders to ensure scalable cloud foundations, APIs ...

Senior Software Engineer - Billing (VAT & Invoicing)

Hiring Organisation
Jobleads-UK
Location
Greater London, England, United Kingdom
testing, including E2E/Cypress. Backend Excellence: Engineer sophisticated backend solutions involving API versioning, caching strategies, and complex data migration plans. Operational Maturity: Lead observability and SRE practices; define SLOs, manage incident responses, and conduct blameless post‐mortems. Security & Risk: Oversee operational security, including secrets hygiene and dependency risk management ...

ERP Ecosystem Architect

Hiring Organisation
Jobleads-UK
Location
Greater London, England, United Kingdom
documenting architecture artefacts for business and technical audiences.**Architecture Skills*** **Application architecture:** DDD, modular monoliths, microservices, API gateways.* **Non-functional architecture:** scalability, resilience, observability, DR, SLAs.* **Technical architecture:** reference models, interim and target-state design, roadmaps.* **DevOps :** manage architecture deliverables, decisions, and cross-system dependencies as Azure DevOps work items ...

Enterprise Hybrid Platform Architect (Advisory) - Manager - National Security

Hiring Organisation
Jobleads-UK
Location
Manchester, England, United Kingdom
landing zone specific” solutions. You must understand the importance of both enterprise MI, for long‐term decision making, and (near) real‐time observability for operations. You should have an appreciation of different operating models associated with both legacy and cloud environments, and be able to contribute to op model design ...

Enterprise Hybrid Platform Architect (Advisory) - Manager - National Security

Hiring Organisation
Jobleads-UK
Location
Bristol, England, United Kingdom
landing zone specific” solutions. You must understand the importance of both enterprise MI, for long‐term decision making, and (near) real‐time observability for operations. You should have an appreciation of different operating models associated with both legacy and cloud environments, and be able to contribute to op model design ...

Enterprise Hybrid Platform Architect (Advisory) - Manager - National Security

Hiring Organisation
Jobleads-UK
Location
Greater London, England, United Kingdom
landing zone specific” solutions. You must understand the importance of both enterprise MI, for long‐term decision making, and (near) real‐time observability for operations. You should have an appreciation of different operating models associated with both legacy and cloud environments, and be able to contribute to op model design ...

Dynamics 365 CE Developer Lead

Hiring Organisation
Medline Industries
Location
Northbrook, Illinois, United States
Employment Type
Permanent
Salary
USD 174,000 Annual
preventative improvements. Participate in on call support during critical operational incidents, enterprise migrations, and hypercare periods. Leverage Application Insights and related tooling to support observability and performance optimization of the CRM platform. Job Requirements Must have: Minimum of 7 years of hands on development experience on the Microsoft Dynamics ...

Director, MedTech Surgery Data Analytics & AI

Hiring Organisation
Johnson & Johnson
Location
Buckinghamshire, United Kingdom
Employment Type
Full Time
leader role patterns). 5) Platform & Architecture Partnership (Modern Data Stack) Define target-state data architecture for Surgery (integration patterns, pipeline standards, interoperability, observability) and drive reusable components/patterns to accelerate delivery of data products. Partner with platform/architecture leaders to ensure scalable cloud foundations, APIs, and data ...

Staff SRE: Lead Reliability & Observability

Hiring Organisation
Jobleads-UK
Location
Belfast, Northern Ireland, United Kingdom
DailyPay in Belfast is looking for a Staff Site Reliability Engineer to lead operational excellence and reliability practices. You will champion the SRE mission while working collaboratively across the engineering organization. The ideal candidate will ...

Strategic Account Director, Capital Markets & Observability

Hiring Organisation
Jobleads-UK
Location
Greater London, England, United Kingdom
ITRS in Greater London is seeking an experienced Account Director to manage and grow revenue within financial services. You will be responsible for building strategic relationships, managing the sales cycle, and collaborating with internal teams ...

AWS DevOps Engineer - Cloud Native

Hiring Organisation
83zero Ltd
Location
London, United Kingdom
Employment Type
Permanent
Salary
£80000 - £90000/annum 5% Bonus, Pension 6% , PH
pipelines using tools such as GitLab CI, Jenkins, or ArgoCD Deploying and managing containerised workloads with Docker and Kubernetes Implementing monitoring, logging, and observability solutions using Prometheus, Grafana, ELK, and CloudWatch Improving platform scalability, automation, resilience, and security Working within Agile delivery teams across complex transformation programmes Supporting DevSecOps … pipeline engineering Docker and Kubernetes Scripting and automation using Python, Bash, or PowerShell AWS networking including VPCs, subnets, and security groups Monitoring and observability tooling Troubleshooting and optimising cloud infrastructure Working within secure or regulated environments What's on Offer Exposure to enterprise-scale cloud transformation programmes Access to industry ...

Senior Software Development Engineer in Test

Hiring Organisation
Jobleads-UK
Location
Greater London, England, United Kingdom
from traditional testing approaches towards a modern, engineering‐led quality strategy: unit testing, contract testing, component testing, integration testing, E2E flows, synthetics, and strong observability across our microservices. We’re looking for a Senior SDET who is hands‐on, highly technical, and passionate about setting teams up for long‐term … teams adopt best practices confidently. Collaborate with Engineering and DevOps to evolve CI/CD pipelines and embed automation earlier in the lifecycle. Improve observability around testing and reliability, integrating logs, traces, metrics, synthetics, and alerts to increase confidence in releases. Promote good testing principles and high‐quality engineering practices ...

Generative AI Engineer

Hiring Organisation
Immersum
Location
London Area, United Kingdom
event-driven architectures using Kafka , enabling real-time data processing, system decoupling, and auditability Ensure high performance, reliability, and scalability across distributed systems, including observability, monitoring, and production readiness Partner with product, operations, and investment stakeholders to refine requirements and deliver iterative solutions in Agile environments Core requirements 8+ years … Working with OpenAI or Anthropic APIs Using vector databases and embedding-based search systems Applying prompt engineering techniques effectively Building AI evaluation, monitoring, and observability frameworks Understanding ML fundamentals where relevant (embeddings, fine-tuning, context engineering) Additional note The business is also interested in speaking with UK-based engineers ...

Senior Solutions Architect

Hiring Organisation
Code Wizards Group
Location
Theale, Berkshire, UK
scaling strategies Integrate GameLift with AWS services (Lambda, DynamoDB, API Gateway, etc.) Optimise cost, performance, and multi-region deployments Implement monitoring, logging, and observability solutions Qualifications AWS Certified Solutions Architect – Professional (required) Additional AWS certifications (e.g., DevOps, Security) desirable SKILLS AND EXPERIENCE Proven experience in solutions architecture or senior technical … Code: Terraform, CloudFormation, AWS CDK Game Development: Unreal/Unity basics, backend integration patterns Multiplayer Systems: Backend architecture, session management, real-time systems Observability: CloudWatch, Prometheus, Grafana, or similar Soft Skills Strong communication and presentation skills Ability to engage both technical and non-technical stakeholders Strategic thinking and problem-solving ...

Senior Solutions Architect

Hiring Organisation
Code Wizards Group
Location
Theale, England, United Kingdom
scaling strategies Integrate GameLift with AWS services (Lambda, DynamoDB, API Gateway, etc.) Optimise cost, performance, and multi-region deployments Implement monitoring, logging, and observability solutions Qualifications AWS Certified Solutions Architect – Professional (required) Additional AWS certifications (e.g., DevOps, Security) desirable SKILLS AND EXPERIENCE Proven experience in solutions architecture or senior technical … Code: Terraform, CloudFormation, AWS CDK Game Development: Unreal/Unity basics, backend integration patterns Multiplayer Systems: Backend architecture, session management, real-time systems Observability: CloudWatch, Prometheus, Grafana, or similar Soft Skills Strong communication and presentation skills Ability to engage both technical and non-technical stakeholders Strategic thinking and problem-solving ...

DevOps & Infrastructure Engineer

Hiring Organisation
Computer Futures
Location
United Kingdom
Employment Type
Permanent
Salary
GBP 50,000 - 70,000 Annual
Cyber Essentials) Lead incident response and root cause analysis for security and infrastructure-related events Monitoring, Reliability & Support Implement monitoring, alerting, and observability across infrastructure and applications Define SLAs/SLOs and ensure systems meet availability and performance requirements Provide 3rd line support and escalation for complex infrastructure issues Conduct … Hyper-V) Understanding of networking concepts (VLANs, firewalls, VPNs) and enterprise storage Experience with databases and messaging systems (PostgreSQL, RabbitMQ) Exposure to monitoring and observability tools (e.g., Prometheus, Grafana) Strong understanding of cyber security best practices, patching, and vulnerability management Ability to produce clear technical documentation and communicate with both ...

Machine Learning Systems & Infrastructure Engineer

Hiring Organisation
Jobleads-UK
Location
Greater London, England, United Kingdom
Ship workloads with Docker and Kubernetes; maintain IaC (Terraform) for the surfaces you own and CI/CD pipelines, including self‐hosted GPU runners. Observability and reliability: Monitoring, logging, and alerting for job performance, data‐pipeline health, and cost (e.g., Prometheus/Grafana, OpenTelemetry); define SLOs and incident response … stores; and object storage with caching layers. Familiarity with ML workflow orchestration and experiment tracking (e.g., Kubeflow Pipelines, MLflow). Experience with monitoring and observability tooling (e.g., Prometheus/Grafana, OpenTelemetry) and CI/CD for infra and ML workflows (e.g., GitHub Actions). At SpAItial, we are committed ...

MLOps Architect - AWS

Hiring Organisation
Quantiphi
Location
United Kingdom
based systems. Serve as a technical authority across multiple internal and customer projects, contributing architectural patterns, best practices, and reusable frameworks. Enable observability, monitoring, drift detection, lineage tracking, and auditability across ML/LLM systems. Define and implement standards for model deployment, monitoring, governance, and automation to ensure production-grade … code (Terraform, Helm, CDK). Hands-on understanding of model drift detection, A/B testing, canary rollouts, and blue-green deployments. Familiarity with Observability stacks (Prometheus, Grafana, CloudWatch, OpenTelemetry). SQL and data transformation experience using Snowflake, Databricks, Spark. Ability to translate business goals into scalable AI/ ...

Network Analytics & Automation Leader with AI Platforms

Hiring Organisation
Jobleads-UK
Location
Chester, England, United Kingdom
Overview Automation Technologies and AI/ML-Driven Platforms and Analytics Tools; in the realm of automation technologies and AI/ML-driven observability platforms and analytics tools, the following are essential: Terraform Itential NetDevOps Splunk Python React JS Django Database Technologies Proficiency with database technologies is crucial, including: MySQL ...

Remote Principal Cloud Platform Engineer

Hiring Organisation
Jobleads-UK
Location
Cambridge, England, United Kingdom
cloud infrastructure, ensuring reliability and security of services for over 3 million users. Candidates should have substantial experience with Kubernetes, Infrastructure as Code, and observability tools. The position offers a remote work option and collaboration within a dynamic team committed to innovation. #J-18808-Ljbffr ...

Senior SRE & AI/ML Platform Engineer

Hiring Organisation
Jobleads-UK
Location
Greater London, England, United Kingdom
involves building scalable and resilient data solutions, coordinating incident management, and mentoring team members. The ideal candidate will have strong skills in site reliability, observability, and automation tools, and will play a key role in shaping a collaborative and innovative team culture. Competitive benefits, including comprehensive healthcare and retirement plans ...

Senior Site Reliability Engineer - Global Tech Ops

Hiring Organisation
Jobleads-UK
Location
Greater London, England, United Kingdom
major cloud platforms like AWS. The position emphasizes leadership and collaborative troubleshooting within the global technical operations team. Ideal applicants will bring expertise in observability tools and Infrastructure as Code practices. #J-18808-Ljbffr ...

Senior AI/ML Data Platform SRE Lead

Hiring Organisation
Jobleads-UK
Location
Greater London, England, United Kingdom
Greater London. In this role, you will develop scalable and resilient data solutions, manage incident resolutions, and foster team collaboration. Your experience with observability tools and site reliability principles will be key, along with proficiency in Python or PySpark. The position offers the chance to drive strategic change ...

Platform Engineer (Cloud)

Hiring Organisation
Paragon Alpha - Hedge Fund Talent Business
Location
London Area, United Kingdom
this role, you would be responsible for designing, developing and managing platform APIs to automate cloud workflows, as well as contributing to platform observability including monitoring, logging and tracing. The role involves collaboration with teams across the firm, primarily including Cloud Engineering and Security. Stack: Python/Go, AWS, Kubernetes ...

Site Reliability Engineer

Hiring Organisation
Arrows
Location
City of London, London, United Kingdom
media platform supporting high traffic, customer facing systems used by millions daily 🌍 You’ll be working across: ☁️ Kubernetes ⚙️ Terraform & Automation 🚀 CI/CD pipelines 📊 Observability & platform reliability 🌐 Fastly/Akamai CDN platforms Strong CDN experience is absolutely essential for this role. Hands on Fastly or Akamai exposure is a core ...