151 to 175 of 1,199 Permanent Observability Jobs

Senior Software Engineer-Typescript

Hiring Organisation: Jobleads-UK
Location: Belfast, Northern Ireland, United Kingdom

appetite for rapid prototyping and iteration. Collaborative communicator who can translate between technical and non‐technical stakeholders. Quality‐focused engineer with an interest in observability, reliability and security. What’s on offer Feel safe and secure whatever life brings, with health insurance (including access to a digital doctor), life assurance ...

Senior Fullstack / Cloud Engineer

Hiring Organisation: huru
Location: Tetbury, England, United Kingdom

enterprise SaaS, or regulated B2B environments. Systems that handle sensitive personal data. IoT, wearable devices, mobile-app-connected cloud systems, or device data ingestion. Observability tools such as Grafana, Prometheus, Datadog, OpenSearch, ELK, or similar. Working in a startup or scale-up where ownership is broad and systems are still ...

Site Reliability Engineer

Hiring Organisation: Jobleads-UK
Location: Greater London, England, United Kingdom

systems knowledge and strong communication skills. You’ll be responsible for tackling complex production issues, deploying resilient infrastructure, and continuously improving the stability and observability of our platform as we grow. A typical day may involve: Deploying clusters of 1,000+ GPUs using custom written playbooks; modifying these tools ...

Engineering Manager / Technical Lead – Offensive Security Infrastructure

Hiring Organisation: Unity Systems
Location: United Kingdom

cloud architectures, microservices, containerization, and platform engineer ing.Software Enginee ringExpert-level proficiency in Python and Go.Strong software engineering fundamentals including system design, architecture, testing, observability, and performance optimizat ion.Experience building scalable, maintainable, and production-grade syst ems.Innovation & Growth Min dsetPassion for research, experimentation, and solving complex security challen ges.Contributions ...

Director, Product Engineering

Hiring Organisation: Jobleads-UK
Location: United Kingdom

squads spanning Workforce Intelligence, Strategic Workforce Planning, Job & Skills Architecture, Organizational Design, and Synappy. Set the bar for engineering excellence - architecture, code quality, testing, observability, on-call practice, and developer experience - and hold the organization to it. Partner closely with Product Management, Design, and Data Science to convert customer problems ...

Python Developer

Hiring Organisation: Accenture
Location: Greater Leeds Area, United Kingdom

other architectural elements Deploy these applications using features such as containers to cloud leveraging CI/CD to support this process backed with good observability when running these in production Ensure quality through the creation of documentation and use of unit/integration/contract testing with a consideration ...

Senior ML Platform Engineer

Hiring Organisation: Cubiq Recruitment
Location: City of London, London, United Kingdom

high-throughput readers, prefetching, sharding, caching, and storage-format choices Infrastructure to support large-scale, multi-node distributed model training: orchestration, configuration, reproducibility, and observability Performance of CPU and IO-bound stages, including video encoding via ffmpeg and frame-level concurrency With the robotics team: The integration surface between ...

AI/MLOps Engineer

Hiring Organisation: Vaco LLC
Location: Dallas, Texas, United States
Employment Type: Permanent
Salary: USD Annual

with natural language processing (NLP) and conversational AI solutions. Familiarity with data engineering concepts and distributed data processing. Knowledge of monitoring tools and model observability best practices. Additional Information Ability to travel up to 25% as needed. Vaco by Highspring values a diverse workplace and strongly encourages women, people ...

Site Reliability Engineer

Hiring Organisation: Jobleads-UK
Location: United Kingdom

ServiceMesh, ODF, ACS, ACM, AMQ)Ability to work within complex multi‐cloud or hybrid environments, with a solid foundation in distributed systemsPractical knowledge of observability tooling such as Prometheus, Grafana, Loki, and TempoProficiency in IaC tools (Kustomize, Helm) and scripting languages (Bash, Python), with experience managing GitOps pipelines using Tekton ...

Senior Software Engineer

Hiring Organisation: Jobleads-UK
Location: Greater London, England, United Kingdom

strain, stress, immunity) backed by research. Infrastructure & Reliability:Maintain 99.5%+ uptime on a platform that processes billions of events. Design for fault tolerance, observability, and horizontal scale What We Are Looking For 5+ years of production backend engineering.You have shipped systems that handle real traffic at scale. ...

Senior Software Engineer, Platform & Production

Hiring Organisation: Moodsonic
Location: United Kingdom

attack surface, threat models, and access control as a matter of course, with working knowledge of cryptography and data protection fundamentals Code quality and observability as habits. Tests, substantive review, documentation, and the instinct to measure how software behaves in production are part of doing the work, not chores after ...

Principal / Consulting Engineer

Hiring Organisation: Jobleads-UK
Location: Greater London, England, United Kingdom

APIs, and data flows• Contribute directly to high-impact work, including system modernisation and risk reduction• Coach engineers and strengthen engineering practices such as observability, automation, and incident learning• Support teams in using AI-assisted development and production AI patterns where they add real value**Requirements:**• Experience working at senior ...

Cloud Platform Engineer - Salesforce & DevOps

Hiring Organisation: Vaco LLC
Location: Saint Louis, Missouri, United States
Employment Type: Permanent
Salary: USD 180,000 Annual

frameworks that improve deployment reliability, environment consistency, and delivery speed while troubleshooting pipeline and deployment issues. Reliability & Operations: Develop monitoring dashboards, reliability metrics, and observability capabilities supporting service-level objectives and Salesforce platform health. Support proactive monitoring, automated remediation, incident response, root cause analysis, and platform optimization. Platform Enablement & Collaboration ...

Multi-Cloud Network Engineer (533500)

Hiring Organisation: Vaco LLC
Location: Addison, Texas, United States
Employment Type: Permanent
Salary: USD Annual

Logging/Troubleshooting - Monitoring/Diagnostics utilizing VPC Flow Logs/Azure Network Watcher/GCP Network Intelligence Center/CloudWatch/3rd Party Observability Solutions Technical Escalation Point for Complex Network Issues IaC/Configuration Management Tools - AWS CloudFormation/Azure Bicep/ARM/GCP Deployment Manager ...

Lead DevOps Engineer

Hiring Organisation: Bounce Digital
Location: City of London, London, United Kingdom

role. You’ll take ownership across: • Kubernetes & container orchestration • AWS cloud infrastructure • Terraform/Terragrunt IaC • CI/CD & release automation • Customer deployment environments • Observability, telemetry & monitoring • Security, reliability & infrastructure standards The product sits within AI/drug discovery research, with enterprise customers already live and strong growth underway. Looking ...

DevOps Engineer

Hiring Organisation: Formula
Location: Newcastle Upon Tyne, England, United Kingdom

plus) • Big data database design: PostgreSQL, MongoDB, or Hadoop/S3 • Python/Flask for API development • Infrastructure-as-Code: Terraform or Ansible • Observability stacks: Prometheus, Grafana, ELK Contract Details: • Day Rate: Up to £300.00 per day Outside IR35 • Contract Length: 3 months • Location: North of England preferred, hybrid • Start ...

Senior Platform Engineer (Kubernetes)

Hiring Organisation: La Fosse
Location: City of London, London, United Kingdom

Engineer to support and evolve a large-scale, on-premises Kubernetes platform. This is a hands-on engineering role focused on platform reliability, automation, observability, security, and developer enablement using a modern cloud-native toolset, all running within private infrastructure. Hybrid Working – Up to 1 day per week on-site ...

Site Reliability Engineer

Hiring Organisation: Harvey Nash
Location: Glasgow, Lanarkshire, Scotland, United Kingdom
Employment Type: Full-Time
Salary: Salary negotiable

across the SDLC. I'm looking to speak with self-starting candidates with: Strong cloud environment experience (AWS, Azure or GCP) Solid experience in observability and monitoring tools . You should be proficient in a development language such as Python , with experience using it to automate tasks, build tools ...

Senior AI Engineer

Hiring Organisation: SplendIT
Location: City of London, London, United Kingdom

modern LLM frameworks Integrate LLMs into real-world applications and APIs Build and optimize RAG pipelines and vector search solutions Design evaluation, monitoring, and observability frameworks for AI systems Collaborate closely with architects, tech leads, and product teams in agile delivery pods Write clean, scalable, and well-tested Python code ...

Platform Engineer / DevOps Engineer

Hiring Organisation: Clarify Consultancy Ltd
Location: Liverpool, Merseyside, North West, United Kingdom
Employment Type: Permanent
Salary: £55,000

maintain scalable cloud infrastructure Improve CI/CD pipelines and deployment automation Support Kubernetes-based environments and containerised applications Enhance platform reliability, monitoring, and observability Troubleshoot production and infrastructure issues Contribute to infrastructure security and operational best practices Drive continuous improvement across tooling and platform operations This is a hands ...

GCP CloudOps Engineer

Hiring Organisation: Anson Mccade
Location: Manchester, North West, United Kingdom
Employment Type: Permanent, Work From Home

infrastructure-as-code tooling Python, Bash, or similar scripting languages CI/CD pipelines and DevOps tooling Kubernetes or containerised workloads Monitoring and observability tooling Linux administration Incident management and root cause analysis Reference: AMC-AQU-GCPA Postcode: Manchester (M1) #adqu ...

Front End Developer

Hiring Organisation: Trinity Global Consulting
Location: Springfield, Virginia, United States
Employment Type: Permanent
Salary: USD Annual

Security+ certification Experience with containerization technologies such as Docker and Kubernetes Familiarity with cloud platforms (Azure, AWS, or GCP) Experience with monitoring, logging, and observability tools Exposure to DevSecOps practices and secure development pipelines Experience working in Agile or Scrum development environments Benefits At Trinity Global Consulting (TGC), we value ...

Cloud Engineer

Hiring Organisation: Bristow Holland
Location: Ipswich, Suffolk, England, United Kingdom
Employment Type: Full-Time
Salary: £55,000 - £60,000 per annum

Kubernetes, ideally Azure Kubernetes Service (AKS) Docker and container technologies PowerShell and/or Python scripting Cloud networking, security and identity management Monitoring and observability tools Troubleshooting complex infrastructure and production issues Working within enterprise or business-critical environments Desirable Experience Infrastructure as Code (Terraform or Bicep) CI/ ...

Senior Data Platform Engineer

Hiring Organisation: Robert Half
Location: Hampshire, England, United Kingdom

tooling such as Fivetran. • Experience with infrastructure as code and CI/CD tooling, including Terraform and GitHub Actions. • Strong knowledge of monitoring and observability tooling such as Cloud Monitoring and PagerDuty. • Proven experience integrating CRM, ERP, and Billing/BSS systems. • Experience delivering critical cutover and hypercare phases within ...

Lead Software Engineer (MongoDB / Node.js / JavaScript)

Hiring Organisation: Adria Solutions
Location: Manchester, Lancashire, England, United Kingdom
Employment Type: Full-Time
Salary: £50,000 - £75,000 per annum

RESTful API design AWS or similar cloud platforms Microservices architecture Testing frameworks (Vitest, Jest, Mocha, etc.) CI/CD pipelines & DevOps practices GitHub workflows Observability tools (e.g., DataDog) Docker/Kubernetes Tech Stack You’ll Be Working With Node.js, JavaScript/TypeScript Express.js, Fastify MongoDB AWS Vue.js, Nuxt.js Nice ...