1 to 25 of 28 Permanent OpenTelemetry Jobs in London

Kubernetes Linux AIOps Engineer – Elite Quant Hedge Fund

Hiring Organisation
Winston Fox
Location
City of London, London, United Kingdom
Automation/scripts from scratch. Configuration Management Tools (Ansible/Puppet/Kapitan/Terraform....) Observability: Experience within the modern open-source ecosystem (ELK, OpenTelemetry, LGTM stack, Prometheus, Grafana, Loki...) CI/CD and GitLab/GitOps : working with Development teams. A track-record in Engineering for Developer Experience/ ...

OTEL (OpenTelemetry) Architect

Hiring Organisation
Intuition IT Solutions Ltd
Location
London, United Kingdom
Employment Type
Permanent
Salary
GBP 75,000 Annual
Role: OTel (Open Telemetry) Architect Location: London, UK Type: Permanent Job Description: OTel Architect: Strong expertise in Open Telemetry architecture, telemetry pipelines, and observability strategy across cloud and hybrid environments. Skilled in designing scalable collector deployments, telemetry standardization, integration with monitoring platforms (Dynatrace, Datadog, Grafana, etc.), and enabling RCA/ ...

Senior DevOps, Infrastructure & Security Engineer

Hiring Organisation
Jobleads-UK
Location
Greater London, England, United Kingdom
deployment Modern CI/CD platforms including GitHub Actions, Cloud Build, Buildkite, CircleCI, or similar Cloud platforms, ideally GCP Observability tooling including Prometheus, Grafana, OpenTelemetry, or equivalent PostgreSQL operations, backup, recovery, and data durability Identity management, API gateways, networking, and access controls Bash and Python scripting for automation and tooling ...

Full Stack Engineer

Hiring Organisation
Prolo
Location
London Area, United Kingdom
skills and ability to work in a small team Adaptable : Comfortable with ambiguity and rapid iteration Nice-to-Have Qualifications Experience with observability tools (OpenTelemetry, Prometheus, Grafana) Familiarity with agentic coding workflows — using AI agents to scaffold, refactor, test, and document code autonomously Experience with FastAPI or similar async Python ...

Senior Full Stack Engineer

Hiring Organisation
Prolo
Location
City of London, London, United Kingdom
learn quickly Knowledge of event-driven architectures and message queues Knowledge of API design principles, data validation, and serialisation Experience with observability tools (OpenTelemetry, Prometheus, Grafana) Experience with AWS Lambda and serverless architectures Experience working across the stack (backend + some frontend) Understanding of web technologies, HTTP, and API integrations ...

Staff Site Reliability Engineer

Hiring Organisation
Jobleads-UK
Location
Greater London, England, United Kingdom
builders. Bonus Points Deep experience with Google Cloud Platform (GCP) services and tools. Expert-level knowledge of modern observability platforms (e.g., Prometheus, Grafana, Datadog, OpenTelemetry). Experience designing and building reliable systems capable of handling high throughput and low latency. Significant experience with Go and Terraform. Familiarity with working ...

Go Full Stack Developer

Hiring Organisation
itecopeople
Location
London, United Kingdom
Employment Type
Permanent, Work From Home
Salary
£60,000
governance Experience with any of the following would be highly advantageous: Microsoft Azure Python GitOps tooling (Argo CD/Flux) Observability tooling (Prometheus, Grafana, OpenTelemetry) AI/LLM-enabled applications Event-driven architectures and messaging platforms What's on Offer Opportunity to work on cutting-edge AI and cloud-native ...

Senior Software Development Engineer in Test

Hiring Organisation
Jobleads-UK
Location
Greater London, England, United Kingdom
knowledge (GitLab CI, Jenkins, etc.) and familiarity with Infrastructure‐as‐Code and cloud‐native environments (containers, Kubernetes). Understanding of observability (Grafana, Datadog, Prometheus, OpenTelemetry, etc.) and how it supports quality and testing. Ability to explain technical concepts clearly to different audiences. A collaborative style, with experience driving culture change ...

Python Developer

Hiring Organisation
Information Tech Consultants
Location
Greater London, England, United Kingdom
pytest-asyncio), Vitest/React Testing Library, and Playwright for E2E .Familiarity with observability tooling — structured logging, error tracking (e.g. Sentry), and tracing (e.g. OpenTelemetry/Jaeger) .Sound version control practices with Git and GitHub (branching, pull requests, conventional commits) .Nice to Hav eExperience with background processing and messaging (Celery ...

AI Platform/ DevOps Engineer

Hiring Organisation
The Portfolio Group
Location
City of London, London, Castle Baynard, United Kingdom
Employment Type
Permanent
Salary
£70000 - £80000/annum + Benefits
maintain CI/CD pipelines for inference services, retrievers, ingestion workflows, and RAG components Implement observability across AI workloads using CloudWatch, MLflow, and OpenTelemetry - covering latency, throughput, cost, and system health Apply secure-by-design principles including IAM, encryption, network controls, and audit logging Work closely with AI engineers ...

Lead Engineer

Hiring Organisation
Jobleads-UK
Location
Greater London, England, United Kingdom
platforms. Containerization & Deployment: Proficiency with containerization technologies such as Docker or Kubernetes. Observability: Hands-on experience with modern observability tooling (e.g., Prometheus, DataDog, Jaeger, OpenTelemetry). Data Governance: Experience with data privacy (GDPR/CCPA) and security compliance in a regulated financial environment. Bullish is proud to be an equal ...

Site Reliability Engineer, iCloud

Hiring Organisation
Jobleads-UK
Location
Greater London, England, United Kingdom
Operating System, including Kernel, Memory, Process, Threads, Static/Shared Libraries, IPC, Signals. Experience in developing iOS apps using Xcode and Swift. Experience in OpenTelemetry Standards/distributed tracing like jaeger At Apple, we believe in treating all applicants fairly and equally. Because to create products that serve everyone ...

Senior Software Development Engineer in Test

Hiring Organisation
Jobleads-UK
Location
Greater London, England, United Kingdom
knowledge (GitLab CI, Jenkins, etc.) and familiarity with Infrastructure‐as‐Code and cloud‐native environments (containers, Kubernetes). Understanding of observability (Grafana, Datadog, Prometheus, OpenTelemetry, etc.) and how it supports quality and testing. Ability to explain technical concepts clearly to different audiences. A collaborative style, with experience driving culture change ...

Platform Engineer (AI Infrastructure)

Hiring Organisation
We Love Alfa
Location
London, United Kingdom
Employment Type
Permanent
Salary
GBP 120,000 - 180,000 Annual
Rust Knowledge of confidential computing, including TEE, SEV, TDX or CoCo Experience with Ceph or distributed storage systems Familiarity with Prometheus, Grafana or OpenTelemetry Experience with BGP, RDMA or high performance networking Exposure to NVIDIA GPU infrastructure or bare metal cloud environments Why this role matters AI infrastructure is constrained ...

Senior Software Engineer - Identity & Authentication

Hiring Organisation
Jobleads-UK
Location
City Of London, England, United Kingdom
Gateway or similar Exposure to Aurora PostgreSQL or DynamoDB Knowledge of microservices architectures Exposure to security concepts (IAM, encryption, networking) Experience with observability tooling (OpenTelemetry, Honeycomb, Grafana) Experience in regulated or enterprise environments Our Tech Stack AWS TypeScript React Node.js (NestJS) REST APIs Auth0 Transmit Security Kong Gateway GitHub GitHub ...

Artificial Intelligence Engineer

Hiring Organisation
Coforge
Location
City of London, London, United Kingdom
cloud platforms (Azure, AWS, GCP) REST APIs & microservices architecture Docker/Kubernetes Security protocols (OAuth2, JWT, RBAC) Knowledge of observability tools (Prometheus, Grafana, OpenTelemetry) Preferred Skills Understanding of AI Gateway patterns (routing, caching, orchestration) Experience with RAG architectures and prompt engineering Exposure to AI governance, compliance, and regulatory frameworks Prior ...

Platform Principal Engineer

Hiring Organisation
Jobleads-UK
Location
Greater London, England, United Kingdom
/Open Tofu module design. (MUST) Kubernetes Engineering: GitOps (Argo CD/Flux), secrets management, ingress/mesh, and OPA/Gatekeeper. (MUST) Observability: OpenTelemetry (MUST) Tooling: Spacelift, Atlantis, or Terraform Cloud (Desired) Governance: EPAC (Enterprise Policy as Code) (Desired) What You'll Bring To Us Recent, hands-on experience ...

Database Reliability Engineer

Hiring Organisation
Jobleads-UK
Location
Greater London, England, United Kingdom
challenge of multi‐tenant, multi‐region, multi‐cloud scenarios with rigorous data integrity. Security & Observability mindset: build deep observability (Prometheus/Grafana/OpenTelemetry/Humio) and guardrails for secure operation. Engineering via code: deliver backend services in Java with clean relational modeling and performant DDL. Interview Process Stage ...

Infrastructure Engineering, AVP-2

Hiring Organisation
State Street
Location
Greater London, United Kingdom
Employment Type
Full Time
cluster provisioning, scaling, and recovery Observability, Monitoring & Reliability Engineering Design and maintain platform observability frameworks using: Prometheus, Grafana, Dynatrace, Elasticsearch Azure Monitor, Log Analytics OpenTelemetry (where applicable) Ensure proactive monitoring of cluster health, application performance, and infrastructure metrics Drive incident management practices , root cause analysis (RCA), and continuous reliability improvements … cloud-native architectures. Hands-on experience with DevOps Platform Tooling (i.e) ArgoCD, Terraform, Azure Devops, scripting Operational experience with observability tools Dynatrace, Prometheus, Grafana, OpenTelemetry Experience influencing or owning platform/product roadmaps in partnership with Product Management. Solid background in cloud native engineering concepts, performance optimization, security, and governance. ...

Principal AI Engineer

Hiring Organisation
Jobleads-UK
Location
Greater London, England, United Kingdom
behalf of users in a regulated enterprise environment. The Tech Stack Core Platform: Python (Primary), Go or TypeScript (Secondary), Kubernetes, Docker, Terraform. Observability & Evals: OTel, LangSmith, Arize, Braintrust. Who You Are An Architect at Heart: You have strong, reasoned opinions on Durable Execution vs. Standard Async, Vector Search vs. Keyword ...

Entry Level - Site Reliability Engineer - (Remote - United Kingdom)

Hiring Organisation
Jobleads-UK
Location
Greater London, England, United Kingdom
Summary Yelp engineering culture is driven by our values: we’re a cooperative team that values individual authenticity and encourages creative solutions to problems. All new engineers deploy working code their first week, and we ...

Integration Developer FTC

Hiring Organisation
itecopeople
Location
London, United Kingdom
Employment Type
Permanent, Work From Home
Salary
£60,000
with developers, data engineers, and stakeholders Technology Stack Kafka/Redpanda Docker & Kubernetes Microsoft Azure REST APIs & webhooks CI/CD & Infrastructure as Code OpenTelemetry, Prometheus & Grafana Required Skills Strong software engineering background Experience building integration or event-driven platforms Kafka, Redpanda, or similar streaming technologies Enterprise system integrations … development experience Strong communication and collaboration skills Desirable Skills Go and/or Python CDC pipeline development Azure cloud experience Observability tooling (Prometheus, Grafana, OpenTelemetry) Experience within regulated environments What's on Offer Hybrid working - 2 days per week in London Salary up to £60,900 Generous pension and holiday ...

Platform Engineer

Hiring Organisation
itecopeople
Location
London, United Kingdom
Employment Type
Permanent
Salary
£54000 - £60900/annum
environments Manage cloud infrastructure using Terraform and ARM Templates Support GitOps-based CI/CD and deployment pipelines Operate observability tooling including Grafana and OpenTelemetry Maintain platform networking, service mesh and API gateway layers Support event streaming infrastructure and platform reliability Participate in incident response and platform operations Contribute … Prometheus Kubernetes networking, security and service mesh technologies Experience with the following would be highly beneficial: Cilium Istio/Service Mesh ArgoCD or Flux OpenTelemetry Kafka or Redpanda Python automation tooling Policy as Code/Kubernetes security tooling To progress matters please send your CV Laura at (url removed) Services ...

Platform Engineer

Hiring Organisation
itecopeople
Location
London, England, United Kingdom
environments Manage cloud infrastructure using Terraform and ARM Templates Support GitOps-based CI/CD and deployment pipelines Operate observability tooling including Grafana and OpenTelemetry Maintain platform networking, service mesh and API gateway layers Support event streaming infrastructure and platform reliability Participate in incident response and platform operations Contribute … Prometheus Kubernetes networking, security and service mesh technologies Experience with the following would be highly beneficial: Cilium Istio/Service Mesh ArgoCD or Flux OpenTelemetry Kafka or Redpanda Python automation tooling Policy as Code/Kubernetes security tooling What's on Offer Opportunity to help shape a greenfield cloud-native ...

Machine Learning Systems & Infrastructure Engineer

Hiring Organisation
Jobleads-UK
Location
Greater London, England, United Kingdom
including self‐hosted GPU runners. Observability and reliability: Monitoring, logging, and alerting for job performance, data‐pipeline health, and cost (e.g., Prometheus/Grafana, OpenTelemetry); define SLOs and incident response for the systems you own. Security and access: Manage secrets, IAM, and network boundaries (e.g., Tailscale, cloud … layers. Familiarity with ML workflow orchestration and experiment tracking (e.g., Kubeflow Pipelines, MLflow). Experience with monitoring and observability tooling (e.g., Prometheus/Grafana, OpenTelemetry) and CI/CD for infra and ML workflows (e.g., GitHub Actions). At SpAItial, we are committed to creating a diverse and inclusive workplace. ...