851 to 875 of 1,399 Permanent Observability Jobs

Senior Backend Engineer (NestJS / AWS)

Hiring Organisation
Eequ
Location
Chester, Cheshire, UK
Employment Type
Full-time
responsibility for our AWS-based infrastructure (EC2, RDS, S3, CloudWatch and related services). Manage resources using infrastructure as code tools. Maintain and improve observability: logging, metrics, alerts, and dashboards. Lead incident response when production issues occur and drive follow-up improvements. Technical direction and coaching Make and communicate architectural ...

Senior Backend Engineer (NestJS / AWS)

Hiring Organisation
Eequ
Location
Guildford, Surrey, UK
Employment Type
Full-time
responsibility for our AWS-based infrastructure (EC2, RDS, S3, CloudWatch and related services). Manage resources using infrastructure as code tools. Maintain and improve observability: logging, metrics, alerts, and dashboards. Lead incident response when production issues occur and drive follow-up improvements. Technical direction and coaching Make and communicate architectural ...

Senior Backend Engineer (NestJS / AWS)

Hiring Organisation
Eequ
Location
Cambridge, Cambridgeshire, UK
Employment Type
Full-time
responsibility for our AWS-based infrastructure (EC2, RDS, S3, CloudWatch and related services). Manage resources using infrastructure as code tools. Maintain and improve observability: logging, metrics, alerts, and dashboards. Lead incident response when production issues occur and drive follow-up improvements. Technical direction and coaching Make and communicate architectural ...

Senior Backend Engineer (NestJS / AWS)

Hiring Organisation
Eequ
Location
Shrewsbury, Shropshire, UK
Employment Type
Full-time
responsibility for our AWS-based infrastructure (EC2, RDS, S3, CloudWatch and related services). Manage resources using infrastructure as code tools. Maintain and improve observability: logging, metrics, alerts, and dashboards. Lead incident response when production issues occur and drive follow-up improvements. Technical direction and coaching Make and communicate architectural ...

Senior Backend Engineer (NestJS / AWS)

Hiring Organisation
Eequ
Location
Chelmsford, Essex, UK
Employment Type
Full-time
responsibility for our AWS-based infrastructure (EC2, RDS, S3, CloudWatch and related services). Manage resources using infrastructure as code tools. Maintain and improve observability: logging, metrics, alerts, and dashboards. Lead incident response when production issues occur and drive follow-up improvements. Technical direction and coaching Make and communicate architectural ...

Senior Backend Engineer (NestJS / AWS)

Hiring Organisation
Eequ
Location
Reading, Berkshire, UK
Employment Type
Full-time
responsibility for our AWS-based infrastructure (EC2, RDS, S3, CloudWatch and related services). Manage resources using infrastructure as code tools. Maintain and improve observability: logging, metrics, alerts, and dashboards. Lead incident response when production issues occur and drive follow-up improvements. Technical direction and coaching Make and communicate architectural ...

Senior Backend Engineer (NestJS / AWS)

Hiring Organisation
Eequ
Location
Slough, Berkshire, UK
Employment Type
Full-time
responsibility for our AWS-based infrastructure (EC2, RDS, S3, CloudWatch and related services). Manage resources using infrastructure as code tools. Maintain and improve observability: logging, metrics, alerts, and dashboards. Lead incident response when production issues occur and drive follow-up improvements. Technical direction and coaching Make and communicate architectural ...

Senior Backend Engineer (NestJS / AWS)

Hiring Organisation
Eequ
Location
Dartford, Kent, UK
Employment Type
Full-time
responsibility for our AWS-based infrastructure (EC2, RDS, S3, CloudWatch and related services). Manage resources using infrastructure as code tools. Maintain and improve observability: logging, metrics, alerts, and dashboards. Lead incident response when production issues occur and drive follow-up improvements. Technical direction and coaching Make and communicate architectural ...

Senior Backend Engineer (NestJS / AWS)

Hiring Organisation
Eequ
Location
Colchester, Essex, UK
Employment Type
Full-time
responsibility for our AWS-based infrastructure (EC2, RDS, S3, CloudWatch and related services). Manage resources using infrastructure as code tools. Maintain and improve observability: logging, metrics, alerts, and dashboards. Lead incident response when production issues occur and drive follow-up improvements. Technical direction and coaching Make and communicate architectural ...

Senior Backend Engineer (NestJS / AWS)

Hiring Organisation
Eequ
Location
Hemel Hempstead, Hertfordshire, UK
Employment Type
Full-time
responsibility for our AWS-based infrastructure (EC2, RDS, S3, CloudWatch and related services). Manage resources using infrastructure as code tools. Maintain and improve observability: logging, metrics, alerts, and dashboards. Lead incident response when production issues occur and drive follow-up improvements. Technical direction and coaching Make and communicate architectural ...

Senior Backend Engineer (NestJS / AWS)

Hiring Organisation
Eequ
Location
Hull, East Yorkshire, UK
Employment Type
Full-time
responsibility for our AWS-based infrastructure (EC2, RDS, S3, CloudWatch and related services). Manage resources using infrastructure as code tools. Maintain and improve observability: logging, metrics, alerts, and dashboards. Lead incident response when production issues occur and drive follow-up improvements. Technical direction and coaching Make and communicate architectural ...

Senior Backend Engineer (NestJS / AWS)

Hiring Organisation
Eequ
Location
York, North Yorkshire, UK
Employment Type
Full-time
responsibility for our AWS-based infrastructure (EC2, RDS, S3, CloudWatch and related services). Manage resources using infrastructure as code tools. Maintain and improve observability: logging, metrics, alerts, and dashboards. Lead incident response when production issues occur and drive follow-up improvements. Technical direction and coaching Make and communicate architectural ...

Senior Backend Engineer (NestJS / AWS)

Hiring Organisation
Eequ
Location
Bolton, Greater Manchester, UK
Employment Type
Full-time
responsibility for our AWS-based infrastructure (EC2, RDS, S3, CloudWatch and related services). Manage resources using infrastructure as code tools. Maintain and improve observability: logging, metrics, alerts, and dashboards. Lead incident response when production issues occur and drive follow-up improvements. Technical direction and coaching Make and communicate architectural ...

Senior Backend Engineer (NestJS / AWS)

Hiring Organisation
Eequ
Location
Wakefield, West Yorkshire, UK
Employment Type
Full-time
responsibility for our AWS-based infrastructure (EC2, RDS, S3, CloudWatch and related services). Manage resources using infrastructure as code tools. Maintain and improve observability: logging, metrics, alerts, and dashboards. Lead incident response when production issues occur and drive follow-up improvements. Technical direction and coaching Make and communicate architectural ...

Senior Backend Engineer (NestJS / AWS)

Hiring Organisation
Eequ
Location
Milton Keynes, Buckinghamshire, UK
Employment Type
Full-time
responsibility for our AWS-based infrastructure (EC2, RDS, S3, CloudWatch and related services). Manage resources using infrastructure as code tools. Maintain and improve observability: logging, metrics, alerts, and dashboards. Lead incident response when production issues occur and drive follow-up improvements. Technical direction and coaching Make and communicate architectural ...

Senior Backend Engineer (NestJS / AWS)

Hiring Organisation
Eequ
Location
Stoke-on-Trent, Staffordshire, UK
Employment Type
Full-time
responsibility for our AWS-based infrastructure (EC2, RDS, S3, CloudWatch and related services). Manage resources using infrastructure as code tools. Maintain and improve observability: logging, metrics, alerts, and dashboards. Lead incident response when production issues occur and drive follow-up improvements. Technical direction and coaching Make and communicate architectural ...

Senior Backend Engineer (NestJS / AWS)

Hiring Organisation
Eequ
Location
United Kingdom
responsibility for our AWS-based infrastructure (EC2, RDS, S3, CloudWatch and related services). Manage resources using infrastructure as code tools. Maintain and improve observability: logging, metrics, alerts, and dashboards. Lead incident response when production issues occur and drive follow-up improvements. Technical direction and coaching Make and communicate ...

Sportsbook Solutions Architect

Hiring Organisation
The Unit
Location
Ireland
Employment Type
Permanent
Salary
EUR 80,000 - 100,000 Annual
GitOps practices with GitHub Actions and ArgoCD, including automated testing, vulnerability scanning, and environment promotion workflows. Drive the definition and implementation of observability standards - Prometheus, Grafana, Loki/ELK, Jaeger, Sentry - enabling end-to-end visibility and SLA tracking. Define scalability and reliability patterns (KEDA, HPA, circuit breakers, bulkheads, caching … integration patterns. Proficiency in API and event contract design using OpenAPI and AsyncAPI; knowledge of GraphQL federation is a plus. Strong background in observability, monitoring, and tracing, with Prometheus/Grafana/ELK or equivalent. Familiarity with cloud-agnostic deployments (AWS, GCP, or Azure) and cost/performance trade-offs. ...

Senior Software Engineer - AI & ML (Based in Dubai)

Hiring Organisation
Property Finder
Location
City of London, London, United Kingdom
/semantic search infrastructure Evaluation dashboards, prompt/version management, and feedback loops Own services end-to-end: from design and implementation to monitoring, observability, and on-call, ensuring high availability, performance, and reliability. Collaborate with cross-functional teams (Product, Data Science, Data Engineering, Design, DevOps/SRE) to translate … NodeJS, or Python Solid understanding of cloud architecture and cloud-native technologies, preferably AWS. Experience designing and operating highly distributed, scalable services with strong observability (metrics, logs, traces, dashboards, alerts). Familiarity with MLOps practices and tools: CI/CD for ML, model deployment patterns, monitoring model performance and data ...

Senior Software Engineer - AI & ML (Based in Dubai)

Hiring Organisation
Property Finder
Location
London Area, United Kingdom
/semantic search infrastructure Evaluation dashboards, prompt/version management, and feedback loops Own services end-to-end: from design and implementation to monitoring, observability, and on-call, ensuring high availability, performance, and reliability. Collaborate with cross-functional teams (Product, Data Science, Data Engineering, Design, DevOps/SRE) to translate … NodeJS, or Python Solid understanding of cloud architecture and cloud-native technologies, preferably AWS. Experience designing and operating highly distributed, scalable services with strong observability (metrics, logs, traces, dashboards, alerts). Familiarity with MLOps practices and tools: CI/CD for ML, model deployment patterns, monitoring model performance and data ...

Senior Software Engineer, Platform Observability Remote - Ireland

Hiring Organisation
Twilio
Location
Dublin, Ireland
Employment Type
Permanent
Salary
EUR 60,000 - 90,000 Annual
thousands of businesses and empower millions of developers to craft personalized customer experiences. This role is for a Software Engineer on Twilio's Platform Observability team, focused on rebuilding and unifying our observability stack to enable faster incident response, deeper insights, and more cost-effective platform operations. You'll help … utilized at Twilio, making it structured, accessible, affordable, and actionable. Over the next 3 years, Twilio is rebuilding nearly every component of our observability platform, from data collection to real-time analytics. You will drive core initiatives that move Twilio from fragmented tooling to a unified, OpenTelemetry-first observability stack ...

Site Reliability Engineer - Global Hedge Fund

Hiring Organisation
Paragon Alpha - Hedge Fund Talent Business
Location
London Area, United Kingdom
performance trading platform, with a strong focus on automation, reliability, and system resilience. You will be responsible for building operational tooling and automation, improving observability and incident response, and applying core SRE principles to ensure the stability, performance, and scalability of mission-critical trading systems. Stack: Python, Linux, Kubernetes, Terraform ...

Site Reliability Engineer - Global Hedge Fund

Hiring Organisation
Paragon Alpha - Hedge Fund Talent Business
Location
City of London, London, United Kingdom
performance trading platform, with a strong focus on automation, reliability, and system resilience. You will be responsible for building operational tooling and automation, improving observability and incident response, and applying core SRE principles to ensure the stability, performance, and scalability of mission-critical trading systems. Stack: Python, Linux, Kubernetes, Terraform ...

OpenAI Architect (FDE)

Hiring Organisation
HCLTech
Location
London Area, United Kingdom
/function calling, Responses/Chat Completions, Embeddings, Files/Batch, Moderations), fine‐tuning pipelines, and agentic RAG then drive PoC → Production with governance, observability, and cost control. Keep solutions portable with pragmatic use of cloud services, LangChain/LangGraph/Semantic Kernel, and standard vector stores. What …/perf SLOs. • Fine‐tuning lifecycle: Own dataset curation, training/eval, bias checks, rollback/versioning, and telemetry for tuned models. • Operability: Add observability (OpenAI Observability/OpenTelemetry), token/cost telemetry, retries/backoff, idempotency, and feature flags/canaries; document runbooks and SOPs. Cross‐platform & enterprise integration ...

OpenAI Architect (FDE)

Hiring Organisation
HCLTech
Location
City of London, London, United Kingdom
/function calling, Responses/Chat Completions, Embeddings, Files/Batch, Moderations), fine‐tuning pipelines, and agentic RAG then drive PoC → Production with governance, observability, and cost control. Keep solutions portable with pragmatic use of cloud services, LangChain/LangGraph/Semantic Kernel, and standard vector stores. What …/perf SLOs. • Fine‐tuning lifecycle: Own dataset curation, training/eval, bias checks, rollback/versioning, and telemetry for tuned models. • Operability: Add observability (OpenAI Observability/OpenTelemetry), token/cost telemetry, retries/backoff, idempotency, and feature flags/canaries; document runbooks and SOPs. Cross‐platform & enterprise integration ...