26 to 50 of 57 Permanent OpenTelemetry Jobs

Database Reliability Engineer

Hiring Organisation
Jobleads-UK
Location
Manchester, England, United Kingdom
region, and multi‐cloud environments while ensuring data integrity and mobility. Security & Observability Mindset: Prioritize security and build deep observability (Prometheus/Grafana/OpenTelemetry/Humio) with automated guardrails. Engineering via Code: Primarily deliver via code; use Java to build robust, testable backend services that orchestrate our data layer ...

Platform Principal Engineer

Hiring Organisation
Jobleads-UK
Location
Greater London, England, United Kingdom
/Open Tofu module design. (MUST) Kubernetes Engineering: GitOps (Argo CD/Flux), secrets management, ingress/mesh, and OPA/Gatekeeper. (MUST) Observability: OpenTelemetry (MUST) Tooling: Spacelift, Atlantis, or Terraform Cloud (Desired) Governance: EPAC (Enterprise Policy as Code) (Desired) What You'll Bring To Us Recent, hands-on experience ...

Database Reliability Engineer

Hiring Organisation
Jobleads-UK
Location
Southampton, England, United Kingdom
challenge of multi‐tenant, multi‐region, multi‐cloud scenarios with rigorous data integrity. Security & Observability mindset: build deep observability (Prometheus/Grafana/OpenTelemetry/Humio) and guardrails for secure operation. Engineering via code: deliver backend services in Java with clean relational modeling and performant DDL. Interview Process Stage ...

Database Reliability Engineer

Hiring Organisation
Jobleads-UK
Location
Greater London, England, United Kingdom
challenge of multi‐tenant, multi‐region, multi‐cloud scenarios with rigorous data integrity. Security & Observability mindset: build deep observability (Prometheus/Grafana/OpenTelemetry/Humio) and guardrails for secure operation. Engineering via code: deliver backend services in Java with clean relational modeling and performant DDL. Interview Process Stage ...

Lead Software Engineer, AI Platform (Remote)

Hiring Organisation
Lennar Homes
Location
Irving, Texas, United States
Employment Type
Permanent
Salary
USD Annual
AgentCore Gateway. Manage container deployment on AWS ECS Fargate using read-only filesystems. Deploy infrastructure-as-code using Terraform or CloudFormation. Instrument observability using OpenTelemetry tracing and structured JSON logging. Conduct technical design reviews and architecture governance. Transfer knowledge to the platform team through live sessions and documentation. Requirements Python ...

Senior DevOps Engineer

Hiring Organisation
Morgan McKinley
Location
Oxford, Oxfordshire, England, United Kingdom
Employment Type
Full-Time
Salary
Salary negotiable
data in a hybrid cloud environment Designing and maintaining robust CI/CD Automation pipelines Implementation of open-source standards for observability (e.g., OpenTelemetry ) Strong troubleshooting, analytical, and system-debugging skills Desired Skills We are also keen to discuss experience in: FinOps practices, including cost control, optimization, and calculation ...

Mid-Senior AI Engineer

Hiring Organisation
Renude
Location
United Kingdom
tuning (SFT), instruction tuning, and parameter-efficient model adaptation workflows Experience with evaluation and monitoring frameworks for LLM systems, including RAGAS, OpenEvals, DeepEval, LangSmith, OpenTelemetry or LLM-as-a-Judge evaluation frameworks Familiarity with graph databases such as Neo4j or Amazon Neptune Experience with recommendation systems, personalisation pipelines or ranking ...

Staff Site Reliability Engineer

Hiring Organisation
Jobleads-UK
Location
Belfast, Northern Ireland, United Kingdom
mission, leading both our dedicated SRE team and the wider engineering organisation. Our Tech Stack: Go, Terraform, Kubernetes, AWS, DataDog, Loki, Grafana, Tempo, Mimir, OpenTelemetry How You Will Make an Impact: Act as a technical team leader and SRE subject matter expert across the organisation Influence engineering culture to promote ...

Infrastructure Engineering, AVP-2

Hiring Organisation
State Street
Location
Greater London, United Kingdom
Employment Type
Full Time
cluster provisioning, scaling, and recovery Observability, Monitoring & Reliability Engineering Design and maintain platform observability frameworks using: Prometheus, Grafana, Dynatrace, Elasticsearch Azure Monitor, Log Analytics OpenTelemetry (where applicable) Ensure proactive monitoring of cluster health, application performance, and infrastructure metrics Drive incident management practices , root cause analysis (RCA), and continuous reliability improvements … cloud-native architectures. Hands-on experience with DevOps Platform Tooling (i.e) ArgoCD, Terraform, Azure Devops, scripting Operational experience with observability tools Dynatrace, Prometheus, Grafana, OpenTelemetry Experience influencing or owning platform/product roadmaps in partnership with Product Management. Solid background in cloud native engineering concepts, performance optimization, security, and governance. ...

Platform Development Engineer in Test

Hiring Organisation
Pyramid Consulting, Inc
Location
United States
Employment Type
Permanent
Salary
USD 960 Annual
Immediate need for a talented Platform Development Engineer in Test . This is a 12+ months contract opportunity with long-term potential and is located in US(Remote- CST). Please review the job description ...

Principal AI Engineer

Hiring Organisation
Jobleads-UK
Location
Greater London, England, United Kingdom
behalf of users in a regulated enterprise environment. The Tech Stack Core Platform: Python (Primary), Go or TypeScript (Secondary), Kubernetes, Docker, Terraform. Observability & Evals: OTel, LangSmith, Arize, Braintrust. Who You Are An Architect at Heart: You have strong, reasoned opinions on Durable Execution vs. Standard Async, Vector Search vs. Keyword ...

Senior AI Platform Engineer

Hiring Organisation
Vaco LLC
Location
San Francisco, California, United States
Employment Type
Permanent
Salary
USD 200,000 Annual
Kafka, Pulsar). Understanding of retrieval-augmented generation (RAG) patterns. Background in authorization/identity systems (ReBAC, RBAC, Zanzibar-style). Familiarity with observability (OpenTelemetry, DataHub, MLflow, Prometheus). Experience in enterprise AI governance (audit, lineage, compliance). Contributions to open-source AI frameworks (LangChain, OpenAI MCP, Hugging Face, etc. ...

Entry Level - Site Reliability Engineer - (Remote - United Kingdom)

Hiring Organisation
Jobleads-UK
Location
Greater London, England, United Kingdom
Summary Yelp engineering culture is driven by our values: we’re a cooperative team that values individual authenticity and encourages creative solutions to problems. All new engineers deploy working code their first week, and we ...

SRE Managing Consultant - Cloud Operating Model

Hiring Organisation
Jobleads-UK
Location
Manchester, England, United Kingdom
/SLOs, incident management, observability, and continuous improvement across cloud and hybrid platforms. Exposure to modern observability tooling and ecosystems (e.g. Datadog, Dynatrace, Prometheus, OpenTelemetry, Loki), with a strong understanding of how metrics, logs, and traces are applied to inform reliability strategy, incident management, and operational decision‐making. To obtain ...

Cloud SRE - Global Observability Lead (Remote UK)

Hiring Organisation
Jobleads-UK
Location
Newcastle upon Tyne, England, United Kingdom
Staff Site Reliability Engineer - Cloud to architect the Observability Centre of Excellence, ensuring reliability and uptime of global platforms. This role involves implementing OpenTelemetry, developing automation scripts, and optimizing platform performance while collaborating with engineering teams. Required skills include experience with observability tools like NewRelic and Terraform, alongside coding proficiency ...

Lead Software Development Engineer - Shared Platforms

Hiring Organisation
Jobleads-UK
Location
Nottingham, England, United Kingdom
software engineering best practices. Experience with CI/CD tools such as Jenkins and unit testing frameworks (JUnit, Mockito). Familiarity with OpenTelemetry, data lakes and observability products (metrics, traces, logs, Go). Effective communication across engineering teams to maximize inner‐sourcing opportunities and reduce waste. Proven ability to deliver ...

Engineering Manager

Hiring Organisation
Smoothwall (part of the Qoria family)
Location
Leeds, England, United Kingdom
ability to champion and integrate AI-driven tooling into the team's daily development lifecycle. Technologies to expect GCP, Go, React, Terraform, Github, Otel, DataDog, Postgres, BiqQuery, Firestore Why choose Smoothwall by Qoria? In this role, you can expect: Employee stock options Enhanced holiday & family leave Tech Allowance ...... ...

Integration Developer FTC

Hiring Organisation
itecopeople
Location
London, United Kingdom
Employment Type
Permanent, Work From Home
Salary
£60,000
with developers, data engineers, and stakeholders Technology Stack Kafka/Redpanda Docker & Kubernetes Microsoft Azure REST APIs & webhooks CI/CD & Infrastructure as Code OpenTelemetry, Prometheus & Grafana Required Skills Strong software engineering background Experience building integration or event-driven platforms Kafka, Redpanda, or similar streaming technologies Enterprise system integrations … development experience Strong communication and collaboration skills Desirable Skills Go and/or Python CDC pipeline development Azure cloud experience Observability tooling (Prometheus, Grafana, OpenTelemetry) Experience within regulated environments What's on Offer Hybrid working - 2 days per week in London Salary up to £60,900 Generous pension and holiday ...

Platform Engineer

Hiring Organisation
itecopeople
Location
London, United Kingdom
Employment Type
Permanent
Salary
£54000 - £60900/annum
environments Manage cloud infrastructure using Terraform and ARM Templates Support GitOps-based CI/CD and deployment pipelines Operate observability tooling including Grafana and OpenTelemetry Maintain platform networking, service mesh and API gateway layers Support event streaming infrastructure and platform reliability Participate in incident response and platform operations Contribute … Prometheus Kubernetes networking, security and service mesh technologies Experience with the following would be highly beneficial: Cilium Istio/Service Mesh ArgoCD or Flux OpenTelemetry Kafka or Redpanda Python automation tooling Policy as Code/Kubernetes security tooling To progress matters please send your CV Laura at (url removed) Services ...

Platform Engineer

Hiring Organisation
itecopeople
Location
London, England, United Kingdom
environments Manage cloud infrastructure using Terraform and ARM Templates Support GitOps-based CI/CD and deployment pipelines Operate observability tooling including Grafana and OpenTelemetry Maintain platform networking, service mesh and API gateway layers Support event streaming infrastructure and platform reliability Participate in incident response and platform operations Contribute … Prometheus Kubernetes networking, security and service mesh technologies Experience with the following would be highly beneficial: Cilium Istio/Service Mesh ArgoCD or Flux OpenTelemetry Kafka or Redpanda Python automation tooling Policy as Code/Kubernetes security tooling What's on Offer Opportunity to help shape a greenfield cloud-native ...

Staff Site Reliability Engineer - Cloud

Hiring Organisation
Jobleads-UK
Location
Newcastle upon Tyne, England, United Kingdom
Observability Centre of Excellence, directly influencing the reliability and uptime of global platforms that keep world industries moving.**Key Exciting Responsibilities:*** Lead a global "OTel First" strategy, **implementing OpenTelemetry** at scale across a diverse technological landscape.* Spearhead the development of automation scripts and Infrastructure as Code using Terraform to ensure … Drive root cause analysis and problem management to proactively prevent incidents and improve the customer experience.**Essential Skills & Experience:*** Hands-on **experience with the OpenTelemetry Collector**, APIs, and SDKs.* Extensive experience with observability tools like NewRelic, Datadog, or Splunk.* Strong proficiency in Infrastructure as Code (Terraform, Ansible) and cloud platforms ...

Frontend Developer JBLE1 NI

Hiring Organisation
VANRATH
Location
Belfast, UK
Jira or similar project management tools Desirable Technologies React or Angular frameworks D3.js or WebSocket integration Figma or similar UX/UI design tools OpenTelemetry Testing tools including: Jest Selenium Playwright Cypress Accessibility standards (WCAG) Exposure to AWS or Google Cloud Platform Advantageous Background Active GitHub portfolio or open-source ...

Product Owner with Healthcare

Hiring Organisation
HAN IT STAFFING INC
Location
United States
Employment Type
Permanent
Salary
USD 70 Annual
lineage and provenance Work closely with engineering and architecture teams on: APIs and schema contracts,Event-driven integrations, Cloud events, Data observability, Traceability and OTEL concepts Cross-Functional Collaboration Collaborate with: Engineering teams, Architects, Provider operations, Claims teams, Network management, Compliance, Analytics and reporting teams Facilitate alignment between business strategy ...

Machine Learning Systems & Infrastructure Engineer

Hiring Organisation
Jobleads-UK
Location
Greater London, England, United Kingdom
including self‐hosted GPU runners. Observability and reliability: Monitoring, logging, and alerting for job performance, data‐pipeline health, and cost (e.g., Prometheus/Grafana, OpenTelemetry); define SLOs and incident response for the systems you own. Security and access: Manage secrets, IAM, and network boundaries (e.g., Tailscale, cloud … layers. Familiarity with ML workflow orchestration and experiment tracking (e.g., Kubeflow Pipelines, MLflow). Experience with monitoring and observability tooling (e.g., Prometheus/Grafana, OpenTelemetry) and CI/CD for infra and ML workflows (e.g., GitHub Actions). At SpAItial, we are committed to creating a diverse and inclusive workplace. ...

Senior Product Manager

Hiring Organisation
Jobleads-UK
Location
Greater London, England, United Kingdom
regulatory frameworks (SEBI, MAS, PRA, APRA CPS 230, DORA-adjacent regimes). Strong working knowledge of the observability and monitoring landscape — metrics, logs, traces, OpenTelemetry, alerting, dashboards, AIOps. Demonstrated ability to operate at the senior end of customer relationships: head of production, head of market data, head of trading technology ...