476 to 500 of 1,305 Observability Jobs

Senior Frontend Engineer II - eCommerce

Hiring Organisation: Prenuvo
Location: Santa Rosa, California, United States
Employment Type: Permanent
Salary: USD Annual

using a hook-first architecture (separating business logic from presentational components) and modern state patterns (optimistic updates, single-flight request patterns). Ensure Reliability & Observability: Set up and maintain Datadog RUM dashboards, implement comprehensive error tracking, and debug production issues to maintain high system reliability. Champion Testing & Quality: Write ...

Staff Frontend Engineer, DC Infrastructure Tooling

Hiring Organisation: CoreWeave
Location: Sunnyvale, California, United States
Employment Type: Permanent
Salary: USD Annual

directly deploying infrastructure, but understanding the ecosystem helps you build better tools for the engineers who do. Experience with frontend observability, CI/CD pipelines, and modern build systems. We're building this team's practices from scratch, and having strong opinions about how to ship tooling matters. Familiarity with ...

Staff Frontend Engineer, DC Infrastructure Tooling

Hiring Organisation: CoreWeave
Location: San Jose, California, United States
Employment Type: Permanent
Salary: USD Annual

Senior Software Engineer - Flights

Hiring Organisation: Jobleads-UK
Location: City of Edinburgh, Scotland, United Kingdom

e.g., E2E/Cypress). Backend Excellence: Engineers sophisticated backend solutions involving API versioning, caching strategies, and complex data migration plans. Operational Maturity: Leads observability and SRE practices; defines SLOs, manages incident responses, and conducts blameless post-mortems. Security & Risk: Oversees operational security, including secrets hygiene and dependency risk management ...

SRE-NOC Engineer (24/7): Reliability, Automation & Observability

Hiring Organisation: Jobleads-UK
Location: United Kingdom

NICE is looking for an SRE – NOC to combine traditional Network Operations with engineering-driven reliability practices. The role emphasizes 24/7 service reliability, incident response, and operational automation. You will lead incident responses ...

Senior Java Software Engineer

Hiring Organisation: Harrington Starr
Location: London Area, United Kingdom

requirements into technical solutions Apply modern software engineering practices including TDD, BDD, CI/CD, automated testing, and code review Contribute to platform reliability, observability, monitoring, and operational improvement Mentor and support junior developers while helping drive engineering standards and delivery quality Collaborate with distributed and offshore teams where required … JUnit, Mockito, Spock Git/Maven Cloud-native infrastructure (AWS/Azure/GCP) Terraform and Infrastructure as Code Messaging and event-driven architecture Observability and monitoring tooling AI-assisted development tooling and automation Requirements Strong commercial Java engineering background within financial services Experience building scalable distributed systems and APIs ...

Full Stack Engineer, External Data Analytics

Hiring Organisation: Talent Software Services
Location: Reston, Virginia, United States
Employment Type: Permanent
Salary: USD 10,875 Annual

dashboard builders, we engineer the analytics backbone that supports 's most critical exam experiences and decision-making. We celebrate deep expertise in data engineering, observability, security, and platform reliability while delivering as a cohesive, high-performing team aligned to enterprise outcomes. About the Opportunity As a Full Stack Engineer … JavaScript, and solid communication skills and naturally curious about technology. You will contribute to engineering excellence by following established architectural patterns, automation standards, and observability practices that improve system reliability and operational readiness. This includes implementing robust automated testing strategies using tools to ensure quality, reliability, and seamless user experiences. ...

Senior Platform Engineer

Hiring Organisation: Team17
Location: Wakefield, England, United Kingdom

PowerShell, Python, Bash, or similar technologies. Assist with Infrastructure as Code and platform configuration management. Improve repeatability, documentation, and operational consistency across platform services. Observability, Support & Reliability Support monitoring, alerting, and operational dashboards across platform services. Troubleshoot and resolve complex infrastructure, cloud, and platform issues. Participate in incident response, root … source control platforms such as Perforce, GitHub, GitLab Experience supporting TeamCity, Jenkins, GitHub Actions, or similar CI/CD platforms. Experience with monitoring and observability platforms such as Datadog, Grafana, Prometheus, or similar. Experience with Docker, Kubernetes, or container-based platforms. Experience with package management platforms such as Artifactory ...

Senior Software Engineer

Hiring Organisation: Jobleads-UK
Location: Reading, England, United Kingdom

solution design for complex cross‐cutting services Make pragmatic architectural decisions balancing scalability, security and maintainability Improve performance, reliability and observability across shared services Define engineering standards and patterns adopted by other teams Lead by example through high‐quality, production‐ready code Partner with product squads to understand their needs … patterns, access control and service integration Mentor engineers and raise technical capability across the organisation Drive CI/CD maturity and deployment confidence Embed observability (metrics, tracing, logging) as first‐class concerns Perform thoughtful code reviews and provide constructive feedback Continuously improve team practices and technical standards Required Strong Java ...

Platform Development Engineer in Test

Hiring Organisation: Pyramid Consulting, Inc
Location: United States
Employment Type: Permanent
Salary: USD 960 Annual

Product, Engineering, SRE, DevOps, and Security teams to improve platform testability and reliability. Drive shift-left quality engineering practices across agile product teams. Implement observability-driven testing using logs, traces, and metrics. Leverage AI-assisted testing tools to accelerate test creation, maintenance, and coverage analysis. Define quality metrics, defect tracking … Familiarity with Kafka, event-driven systems, and asynchronous testing strategies. Experience embedding SAST, DAST, dependency scanning, and secrets detection into pipelines. Strong understanding of observability platforms such as Datadog, Splunk, Grafana, Prometheus, or OpenTelemetry. Experience implementing test data management and data masking strategies in regulated environments. Knowledge of AI-assisted ...

Global Banking & Markets - Software Engineer - Vice President - London London · United Kingdom [...]

Hiring Organisation: Jobleads-UK
Location: Greater London, England, United Kingdom

standard for years to come. What You Will Do Design, build, and operate high‐availability, multi‐region, cloud‐native services with security and comprehensive observability (metrics, distributed tracing, structured logging) built in at every layer. Develop event‐driven architectures, multi‐stage processing pipelines, and optimized data paths for high‐throughput … patterns (retry, dead‐letter queues, error isolation). Cloud & Infrastructure : Cloud platforms (GCP, AWS), container orchestration (Kubernetes, Docker), and JVM tuning for containerized workloads. Observability & Operations : Application instrumentation (metrics, distributed tracing, structured logging) and production support in high‐availability environments. Data & Performance : Data modeling, SQL/NoSQL databases, caching strategies ...

Global Banking & Markets - Software Engineer - Vice President - London

Hiring Organisation: Jobleads-UK
Location: City Of London, England, United Kingdom

standard for years to come. What You Will Do Design, build, and operate high‐availability, multi‐region, cloud‐native services with security and comprehensive observability (metrics, distributed tracing, structured logging) built in at every layer. Develop event‐driven architectures, multi‐stage processing pipelines, and optimized data paths for high‐throughput … patterns (retry, dead‐letter queues, error isolation). Cloud & Infrastructure: Cloud platforms (GCP, AWS), container orchestration (Kubernetes, Docker), and JVM tuning for containerized workloads. Observability & Operations: Application instrumentation (metrics, distributed tracing, structured logging) and production support in high‐availability environments. Data & Performance: Data modeling, SQL/NoSQL databases, caching strategies ...

Head of Platform Engineering

Hiring Organisation: Anonymous
Location: London, South East, England, United Kingdom
Employment Type: Full-Time
Salary: Competitive salary

improvements across platform engineering and operational practices Drive CI/CD maturity, deployment automation, and release reliability Improve environment consistency and operational scalability Strengthen observability, resilience, and production readiness Support modern infrastructure and container-based platform initiatives Partner with engineering teams to improve delivery confidence and operational effectiveness Contribute ...

Principal Consultant

Hiring Organisation: Searchability®
Location: United Kingdom

Experience delivering AI solutions into production • Strong Python engineering fundamentals • Azure AI Foundry, Azure ML, Databricks, SageMaker, Bedrock, or similar • MLOps, CI/CD, observability, and production AI delivery • Experience leading teams, workshops, and technical discussions You’ll be: • Leading client workshops and shaping AI strategy • Acting as a trusted ...

Senior Consultant VMware

Hiring Organisation: COMPUTACENTER (UK) LIMITED
Location: South East London, London, United Kingdom
Employment Type: Permanent

design/build/operate experience Strong NSX (T0/T1, DFW, VRFs, EVPN), vSAN (ESA/OSA) Automation with Terraform, Ansible, PowerCLI, APIs Observability with Aria Ops/Ops for Networks/Logs Migration experience with HCX Strong communication, documentation, and stakeholder engagement Preferred Skills Kubernetes on vSphere ...

Site Reliability Engineer (£90k+ + Equity) at EQUALS

Hiring Organisation: Jobleads-UK
Location: Greater London, England, United Kingdom

What you will do: Manage and evolve AWS infrastructure via Pulumi (TypeScript), covering ECS/Fargate, RDS, ElastiCache, and Lambda. Own the monitoring and observability stack using Datadog APM to reduce alert fatigue and lead incident response. Optimize data pipelines and performance, including Airbyte replication and PostgreSQL tuning for high ...

AI Architect

Hiring Organisation: Charles Simon Associates Ltd
Location: City of London, London, United Kingdom
Employment Type: Permanent
Salary: £95,000

influencing technical teams Nice to have: Experience in consulting/client-facing roles Exposure to bids, RFPs, or pre-sales Knowledge of AI observability & evaluation frameworks Why this role: Work on enterprise AI transformation (not PoCs) Own solutions from idea architecture production High visibility with senior stakeholders Access to cutting ...

Specialty Software Engineer Lead

Hiring Organisation: V2Soft
Location: Dallas, Texas, United States
Employment Type: Permanent
Salary: USD Annual

operational stability. Experience mentoring engineers and resolving complex production issues. Nice to Have Skills: Experience with CI/CD pipelines, automated testing frameworks, and observability practices (logging, metrics, tracing). Experience operating services on container/cloud platforms and supporting high-availability systems. Experience defining governance and change management practices ...

Senior Data engineer

Hiring Organisation: Tenth Revolution Group
Location: London, South East, England, United Kingdom
Employment Type: Full-Time
Salary: £65,000 - £80,000 per annum

fact optimisation) Deep understanding of medallion (Bronze/Silver/Gold) lakehouse and data mesh principles Proven experience defining data contracts, cataloguing standards, and observability practices Strong performance tuning and cost optimisation experience in cloud data warehouses Hands-on experience with Snowflake, dbt, S3, Athena Deep expertise across ...

Data Architect

Hiring Organisation: Tria
Location: London, United Kingdom
Employment Type: Permanent

management, and regulatory compliance It would be a bonus if you have: Familiarity with emerging technologies such as Apache Iceberg, Delta Lake, or data observability tooling Experience leading architecture workshops with both technical and non-technical stakeholders A background providing technical mentorship to BI Engineers or Developers To apply, please ...

Tetragon Senior Linux Security Engineer

Hiring Organisation: Jobleads-UK
Location: Greater London, England, United Kingdom

About this role: Cilium Tetragon is a flexible, Kubernetes-aware security tool, with real-time observability and enforcement. Leveraging the power of eBPF, Tetragon offers a low-overhead, in kernel solution that enhances security posture by monitoring system behaviors such as process executions, system call activities, and both network ...

Senior Data Engineer

Hiring Organisation: Tenth Revolution Group
Location: London, South East, England, United Kingdom
Employment Type: Full-Time
Salary: £60,000 - £80,000 per annum

fact optimisation) Deep understanding of lakehouse patterns (Bronze/Silver/Gold) and data mesh principles Proven experience defining data contracts, cataloguing standards and observability practices Strong background in performance tuning and cloud cost optimisation Hands-on experience with Snowflake, dbt, S3 and Athena Deep knowledge of the AWS data ...

Data Engineer

Hiring Organisation: Robert Half
Location: London, South East, England, United Kingdom
Employment Type: Contractor
Contract Rate: Salary negotiable

cutover, including responsibility for build, integration, and operational readiness. Role The Data Engineer will own the end-to-end data platform (ingestion, orchestration, infrastructure, observability) Design and deliver scalable data pipelines across CRM, ERP and billing systems Lead platform readiness for cutover and hypercare phases Build and maintain integrations incl. ...

Remote Java Architect

Hiring Organisation: The Computer Merchant, Ltd
Location: United States
Employment Type: Permanent
Salary: USD 625 Annual

driven integration patterns Guide cloud-native deployment architecture on Azure Support FHIR-based interoperability initiatives and healthcare integration requirements Establish engineering standards for resiliency, observability, security, and performance Conduct architecture and design reviews across development teams Mentor senior engineers and provide technical leadership across modernization workstreams Collaborate with Product ...

Full Stack Engineer

Hiring Organisation: Wave Talent
Location: Maidenhead, England, United Kingdom

systems and accessible UI principles Backend: Node.js with TypeScript, REST APIs, SQL and NoSQL databases Infrastructure: Modern CI/CD pipelines, automated testing and observability tooling What you’ll bring Strong full-stack experience with JavaScript/TypeScript, React and Node.js Proven ability to take ownership of engineering projects ...