1 to 25 of 121 Observability Jobs in the East of England

Senior Infrastructure Support Engineer

Hiring Organisation
Nscale
Location
Peterborough, Cambridgeshire, UK
Employment Type
Full-time
. Practical experience with GPU drivers and GPU logs investigation tools, e.g. nvidia-smi. Performance diagnostics using NCCL on large scale clusters. Observability and incident response. Build and use alerting stacks and dashboards, interpret metrics and alerts, and drive runbooks to resolution; contribute to SLOs and post‐incident reviews. Strong ...

Senior Infrastructure Support Engineer

Hiring Organisation
Nscale
Location
Bedford, Bedfordshire, UK
Employment Type
Full-time
. Practical experience with GPU drivers and GPU logs investigation tools, e.g. nvidia-smi. Performance diagnostics using NCCL on large scale clusters. Observability and incident response. Build and use alerting stacks and dashboards, interpret metrics and alerts, and drive runbooks to resolution; contribute to SLOs and post‐incident reviews. Strong ...

Senior Infrastructure Support Engineer

Hiring Organisation
Nscale
Location
Colchester, Essex, UK
Employment Type
Full-time
. Practical experience with GPU drivers and GPU logs investigation tools, e.g. nvidia-smi. Performance diagnostics using NCCL on large scale clusters. Observability and incident response. Build and use alerting stacks and dashboards, interpret metrics and alerts, and drive runbooks to resolution; contribute to SLOs and post‐incident reviews. Strong ...

Senior Infrastructure Support Engineer

Hiring Organisation
Nscale
Location
Watford, Hertfordshire, UK
Employment Type
Full-time
. Practical experience with GPU drivers and GPU logs investigation tools, e.g. nvidia-smi. Performance diagnostics using NCCL on large scale clusters. Observability and incident response. Build and use alerting stacks and dashboards, interpret metrics and alerts, and drive runbooks to resolution; contribute to SLOs and post‐incident reviews. Strong ...

Senior Infrastructure Support Engineer

Hiring Organisation
Nscale
Location
Cambridge, Cambridgeshire, UK
Employment Type
Full-time
. Practical experience with GPU drivers and GPU logs investigation tools, e.g. nvidia-smi. Performance diagnostics using NCCL on large scale clusters. Observability and incident response. Build and use alerting stacks and dashboards, interpret metrics and alerts, and drive runbooks to resolution; contribute to SLOs and post‐incident reviews. Strong ...

Senior Infrastructure Support Engineer

Hiring Organisation
Nscale
Location
Ipswich, Suffolk, UK
Employment Type
Full-time
. Practical experience with GPU drivers and GPU logs investigation tools, e.g. nvidia-smi. Performance diagnostics using NCCL on large scale clusters. Observability and incident response. Build and use alerting stacks and dashboards, interpret metrics and alerts, and drive runbooks to resolution; contribute to SLOs and post‐incident reviews. Strong ...

Senior Infrastructure Support Engineer

Hiring Organisation
Nscale
Location
Chelmsford, Essex, UK
Employment Type
Full-time
. Practical experience with GPU drivers and GPU logs investigation tools, e.g. nvidia-smi. Performance diagnostics using NCCL on large scale clusters. Observability and incident response. Build and use alerting stacks and dashboards, interpret metrics and alerts, and drive runbooks to resolution; contribute to SLOs and post‐incident reviews. Strong ...

Senior Infrastructure Support Engineer

Hiring Organisation
Nscale
Location
Hemel Hempstead, Hertfordshire, UK
Employment Type
Full-time
. Practical experience with GPU drivers and GPU logs investigation tools, e.g. nvidia-smi. Performance diagnostics using NCCL on large scale clusters. Observability and incident response. Build and use alerting stacks and dashboards, interpret metrics and alerts, and drive runbooks to resolution; contribute to SLOs and post‐incident reviews. Strong ...

AI Infra Engineer | $74-$168/hr

Hiring Organisation
HelixRecruit
Location
Norwich, Norfolk, UK
Employment Type
Full-time
high-performance compute environments. Collaborate closely with research and product teams to integrate model-serving pipelines, memory systems, and reasoning components. Implement monitoring, observability, and failover mechanisms to ensure high system reliability and fault tolerance. Evaluate and refine infrastructure performance, identifying bottlenecks and improving efficiency across data, compute, and model ...

AI Infra Engineer | $74-$168/hr

Hiring Organisation
HelixRecruit
Location
Basildon, Essex, UK
Employment Type
Full-time
high-performance compute environments. Collaborate closely with research and product teams to integrate model-serving pipelines, memory systems, and reasoning components. Implement monitoring, observability, and failover mechanisms to ensure high system reliability and fault tolerance. Evaluate and refine infrastructure performance, identifying bottlenecks and improving efficiency across data, compute, and model ...

AI Infra Engineer | $74-$168/hr

Hiring Organisation
HelixRecruit
Location
Watford, Hertfordshire, UK
Employment Type
Full-time
high-performance compute environments. Collaborate closely with research and product teams to integrate model-serving pipelines, memory systems, and reasoning components. Implement monitoring, observability, and failover mechanisms to ensure high system reliability and fault tolerance. Evaluate and refine infrastructure performance, identifying bottlenecks and improving efficiency across data, compute, and model ...

AI Infra Engineer | $74-$168/hr

Hiring Organisation
HelixRecruit
Location
Bedford, Bedfordshire, UK
Employment Type
Full-time
high-performance compute environments. Collaborate closely with research and product teams to integrate model-serving pipelines, memory systems, and reasoning components. Implement monitoring, observability, and failover mechanisms to ensure high system reliability and fault tolerance. Evaluate and refine infrastructure performance, identifying bottlenecks and improving efficiency across data, compute, and model ...

AI Infra Engineer | $74-$168/hr

Hiring Organisation
HelixRecruit
Location
Hemel Hempstead, Hertfordshire, UK
Employment Type
Full-time
high-performance compute environments. Collaborate closely with research and product teams to integrate model-serving pipelines, memory systems, and reasoning components. Implement monitoring, observability, and failover mechanisms to ensure high system reliability and fault tolerance. Evaluate and refine infrastructure performance, identifying bottlenecks and improving efficiency across data, compute, and model ...

Java Software Engineer : Global Bank : £150k + bonus : Hybrid

Hiring Organisation
Hunter Bond
Location
Stevenage, Hertfordshire, UK
Employment Type
Full-time
integrating new technologies and frameworks to support evolving business needs Bonus Experience Kafka, Spark, Trino, Redis SQL (Postgres, Oracle) Cloud technologies: AWS, Kubernetes, Docker Observability tools: Splunk, Prometheus, Grafana Secondary languages: Python, Ruby Compensation: Competitive (circa £150k total comp) + benefits Location: London (Hybrid, flexible working available ...

Java Software Engineer : Global Bank : £150k + bonus : Hybrid

Hiring Organisation
Hunter Bond
Location
Colchester, Essex, UK
Employment Type
Full-time
integrating new technologies and frameworks to support evolving business needs Bonus Experience Kafka, Spark, Trino, Redis SQL (Postgres, Oracle) Cloud technologies: AWS, Kubernetes, Docker Observability tools: Splunk, Prometheus, Grafana Secondary languages: Python, Ruby Compensation: Competitive (circa £150k total comp) + benefits Location: London (Hybrid, flexible working available ...

Java Software Engineer : Global Bank : £150k + bonus : Hybrid

Hiring Organisation
Hunter Bond
Location
Luton, Bedfordshire, UK
Employment Type
Full-time
integrating new technologies and frameworks to support evolving business needs Bonus Experience Kafka, Spark, Trino, Redis SQL (Postgres, Oracle) Cloud technologies: AWS, Kubernetes, Docker Observability tools: Splunk, Prometheus, Grafana Secondary languages: Python, Ruby Compensation: Competitive (circa £150k total comp) + benefits Location: London (Hybrid, flexible working available ...

Java Software Engineer : Global Bank : £150k + bonus : Hybrid

Hiring Organisation
Hunter Bond
Location
Watford, Hertfordshire, UK
Employment Type
Full-time
integrating new technologies and frameworks to support evolving business needs Bonus Experience Kafka, Spark, Trino, Redis SQL (Postgres, Oracle) Cloud technologies: AWS, Kubernetes, Docker Observability tools: Splunk, Prometheus, Grafana Secondary languages: Python, Ruby Compensation: Competitive (circa £150k total comp) + benefits Location: London (Hybrid, flexible working available ...

Java Software Engineer : Global Bank : £150k + bonus : Hybrid

Hiring Organisation
Hunter Bond
Location
Basildon, Essex, UK
Employment Type
Full-time
integrating new technologies and frameworks to support evolving business needs Bonus Experience Kafka, Spark, Trino, Redis SQL (Postgres, Oracle) Cloud technologies: AWS, Kubernetes, Docker Observability tools: Splunk, Prometheus, Grafana Secondary languages: Python, Ruby Compensation: Competitive (circa £150k total comp) + benefits Location: London (Hybrid, flexible working available ...

Java Software Engineer : Global Bank : £150k + bonus : Hybrid

Hiring Organisation
Hunter Bond
Location
Ipswich, Suffolk, UK
Employment Type
Full-time
integrating new technologies and frameworks to support evolving business needs Bonus Experience Kafka, Spark, Trino, Redis SQL (Postgres, Oracle) Cloud technologies: AWS, Kubernetes, Docker Observability tools: Splunk, Prometheus, Grafana Secondary languages: Python, Ruby Compensation: Competitive (circa £150k total comp) + benefits Location: London (Hybrid, flexible working available ...

Java Software Engineer : Global Bank : £150k + bonus : Hybrid

Hiring Organisation
Hunter Bond
Location
Chelmsford, Essex, UK
Employment Type
Full-time
integrating new technologies and frameworks to support evolving business needs Bonus Experience Kafka, Spark, Trino, Redis SQL (Postgres, Oracle) Cloud technologies: AWS, Kubernetes, Docker Observability tools: Splunk, Prometheus, Grafana Secondary languages: Python, Ruby Compensation: Competitive (circa £150k total comp) + benefits Location: London (Hybrid, flexible working available ...

Java Software Engineer : Global Bank : £150k + bonus : Hybrid

Hiring Organisation
Hunter Bond
Location
Norwich, Norfolk, UK
Employment Type
Full-time
integrating new technologies and frameworks to support evolving business needs Bonus Experience Kafka, Spark, Trino, Redis SQL (Postgres, Oracle) Cloud technologies: AWS, Kubernetes, Docker Observability tools: Splunk, Prometheus, Grafana Secondary languages: Python, Ruby Compensation: Competitive (circa £150k total comp) + benefits Location: London (Hybrid, flexible working available ...

Java Software Engineer : Global Bank : £150k + bonus : Hybrid

Hiring Organisation
Hunter Bond
Location
Cambridge, Cambridgeshire, UK
Employment Type
Full-time
integrating new technologies and frameworks to support evolving business needs Bonus Experience Kafka, Spark, Trino, Redis SQL (Postgres, Oracle) Cloud technologies: AWS, Kubernetes, Docker Observability tools: Splunk, Prometheus, Grafana Secondary languages: Python, Ruby Compensation: Competitive (circa £150k total comp) + benefits Location: London (Hybrid, flexible working available ...

Full Stack Engineer

Hiring Organisation
Consortia
Location
Ipswich, Suffolk, UK
Employment Type
Full-time
event driven: Akka, Rx • Messaging and streaming: Kafka, RabbitMQ, ActiveMQ • Databases: SQL and NoSQL • Containers & orchestration: Docker, Kubernetes, Helm, Kustomize • Tooling: Jenkins, Git • Observability: ELK, DataDog • Cloud: AWS, including EKS and Lambda Why this role stands out This is an environment where engineers are encouraged to take broad responsibility, contribute ...

Full Stack Engineer

Hiring Organisation
Consortia
Location
Bedford, Bedfordshire, UK
Employment Type
Full-time
event driven: Akka, Rx • Messaging and streaming: Kafka, RabbitMQ, ActiveMQ • Databases: SQL and NoSQL • Containers & orchestration: Docker, Kubernetes, Helm, Kustomize • Tooling: Jenkins, Git • Observability: ELK, DataDog • Cloud: AWS, including EKS and Lambda Why this role stands out This is an environment where engineers are encouraged to take broad responsibility, contribute ...

Full Stack Engineer

Hiring Organisation
Consortia
Location
Colchester, Essex, UK
Employment Type
Full-time
event driven: Akka, Rx • Messaging and streaming: Kafka, RabbitMQ, ActiveMQ • Databases: SQL and NoSQL • Containers & orchestration: Docker, Kubernetes, Helm, Kustomize • Tooling: Jenkins, Git • Observability: ELK, DataDog • Cloud: AWS, including EKS and Lambda Why this role stands out This is an environment where engineers are encouraged to take broad responsibility, contribute ...