Observability Jobs in the East Midlands

1 to 25 of 59 Observability Jobs in the East Midlands

SR Site Reliability Engineer

Mablethorpe, England, United Kingdom
Wakapi
systems using load balancing, auto-scaling, canary releases, and blue-green deployments. Develop and maintain monitoring and logging dashboards with tools like New Relic, Prometheus, Grafana, and Datadog, ensuring observability through metrics, tracing, log aggregation, and alerting. Help teams determine settings and thresholds for alerts and automations based on application performance requirements. Monitor, optimize, and ensure system reliability and performance … like Terraform. Strong understanding of scalability, high availability patterns, and DevOps metrics such as DORA. Knowledge of SLM metrics (SLAs, SLOs, SLIs) and their application. Experience with monitoring and observability tools like New Relic, Prometheus, Grafana, and Datadog. Experience working with Kafka and improving performance in event-driven, real-time data architectures. Familiarity with cloud providers like AWS, Azure, or … GCP. Experience with CI/CD tools such as GitHub Actions, Jenkins, or GitLab CI. Strong analytical and communication skills. Nice-to-haves Familiarity with Observability-as-Code tooling and practices. Knowledge of Chaos Engineering practices. Senior Level: Mid-Senior, Employment: Full-time, Industry: Software Development #J-18808-Ljbffr More ❯
Posted:

Site Reliability Engineer

Chesterfield, England, United Kingdom
Hybrid / WFH Options
JR United Kingdom
automation and internal tools for deployment, monitoring, and incident response Tune performance across OS, network, and cloud layers — this role is hands-on and detail-oriented Improve system resilience, observability, and security in a high-stakes production environment Requirements: Fluent in Linux — not just using it, but understanding how it works under the hood Advanced terminal skills — manipulating systems efficiently … time environments Hands-on with Docker (Kubernetes is a plus), infrastructure-as-code, and CI/CD tooling Strong scripting and automation experience in Python and Bash Familiarity with observability stacks (Prometheus, OpenTelemetry, eBPF) Cloud infrastructure experience (AWS/GCP/Azure), with attention to IAM and software supply chain security Curious, persistent, and comfortable experimenting at the lowest levels More ❯
Posted:

Cloud Platform Lead

Derby, England, United Kingdom
JR United Kingdom
AWS in a production environment Expertise in Kubernetes including AKS EKS containerization and Helm Proven ability to meet and maintain SOC 2 or equivalent compliance Strong background in automation observability and GitOps workflows Comfortable using AI coding tools like GitHub Copilot Cursor or Claude to enhance delivery Bonus if you have experience supporting hybrid or disconnected deployment environments or working … Be Using Cloud : Azure including AKS API Management and DevOps Pipelines and AWS including EKS Lambda and CloudFormation Infrastructure as Code and GitOps : Terraform Bicep Pulumi ArgoCD and FluxCD Observability : Prometheus Grafana OpenTelemetry and Datadog Security and Compliance : HashiCorp Vault Azure Key Vault AWS KMS OPA Gatekeeper and Drata or similar ? Interested in exploring this further This is a high More ❯
Posted:

Python Developer

Derby, England, United Kingdom
Xpertise Recruitment
doing: Developing secure, high-performance APIs Working with databases like PostgreSQL, DynamoDB and MongoDB Deploying cloud-native services using tools like Terraform, Docker and Kubernetes Improving performance, security and observability in backend systems Collaborating with product managers, cloud engineers and designers What they’re looking for: Proven experience writing Python APIs (Django, Flask or FastAPI) Solid database design and cloud More ❯
Posted:

Digital Enterprise Architect

Chesterfield, England, United Kingdom
JR United Kingdom
of practices (e.g., Cloud, Platforms. AI, Strategy, Custom Application Development, Network & Edge, Security, Resiliency, etc.) Articulate the vision for modern engineering (e.g., agile, cloud-native, DevOps) and operations (e.g., observability, automated response, SRE, etc.) and able to articulate a path toward a target operating model (people, process, and tools) SoftServe is an Equal Opportunity Employer. All qualified applicants will receive More ❯
Posted:

Digital Enterprise Architect

Nottingham, England, United Kingdom
JR United Kingdom
of practices (e.g., Cloud, Platforms. AI, Strategy, Custom Application Development, Network & Edge, Security, Resiliency, etc.) Articulate the vision for modern engineering (e.g., agile, cloud-native, DevOps) and operations (e.g., observability, automated response, SRE, etc.) and able to articulate a path toward a target operating model (people, process, and tools) SoftServe is an Equal Opportunity Employer. All qualified applicants will receive More ❯
Posted:

Digital Enterprise Architect

Northampton, England, United Kingdom
JR United Kingdom
of practices (e.g., Cloud, Platforms. AI, Strategy, Custom Application Development, Network & Edge, Security, Resiliency, etc.) Articulate the vision for modern engineering (e.g., agile, cloud-native, DevOps) and operations (e.g., observability, automated response, SRE, etc.) and able to articulate a path toward a target operating model (people, process, and tools) SoftServe is an Equal Opportunity Employer. All qualified applicants will receive More ❯
Posted:

Site Reliability Engineer

Lincoln, England, United Kingdom
JR United Kingdom
services. Strong background in Linux administration and troubleshooting. Proven experience in implementing and managing CI/CD pipelines and Infrastructure as Code (IAC) solutions. Proven experience in monitoring and observability tools to proactively manage system health. Skills and Strengths: AWS (Amazon Web Services) Auto Scaling Fargate Route53 Observability tools (New Relic, DataDog, Splunk) Scripting (Ansible, Bash, Python, GO) CI/ More ❯
Posted:

Site Reliability Engineer

Northampton, England, United Kingdom
JR United Kingdom
services. Strong background in Linux administration and troubleshooting. Proven experience in implementing and managing CI/CD pipelines and Infrastructure as Code (IAC) solutions. Proven experience in monitoring and observability tools to proactively manage system health. Skills and Strengths: AWS (Amazon Web Services) Auto Scaling Fargate Route53 Observability tools (New Relic, DataDog, Splunk) Scripting (Ansible, Bash, Python, GO) CI/ More ❯
Posted:

Site Reliability Engineer

Derby, England, United Kingdom
JR United Kingdom
services. Strong background in Linux administration and troubleshooting. Proven experience in implementing and managing CI/CD pipelines and Infrastructure as Code (IAC) solutions. Proven experience in monitoring and observability tools to proactively manage system health. Skills and Strengths: AWS (Amazon Web Services) Auto Scaling Fargate Route53 Observability tools (New Relic, DataDog, Splunk) Scripting (Ansible, Bash, Python, GO) CI/ More ❯
Posted:

Site Reliability Engineer

Leicester, England, United Kingdom
JR United Kingdom
services. Strong background in Linux administration and troubleshooting. Proven experience in implementing and managing CI/CD pipelines and Infrastructure as Code (IAC) solutions. Proven experience in monitoring and observability tools to proactively manage system health. Skills and Strengths: AWS (Amazon Web Services) Auto Scaling Fargate Route53 Observability tools (New Relic, DataDog, Splunk) Scripting (Ansible, Bash, Python, GO) CI/ More ❯
Posted:

Site Reliability Engineer

Nottingham, England, United Kingdom
JR United Kingdom
services. Strong background in Linux administration and troubleshooting. Proven experience in implementing and managing CI/CD pipelines and Infrastructure as Code (IAC) solutions. Proven experience in monitoring and observability tools to proactively manage system health. Skills and Strengths: AWS (Amazon Web Services) Auto Scaling Fargate Route53 Observability tools (New Relic, DataDog, Splunk) Scripting (Ansible, Bash, Python, GO) CI/ More ❯
Posted:

Senior Site Reliability Engineering Manager

Nottingham, Nottinghamshire, United Kingdom
Capital One (Europe) plc
reliability engineering teams (sourced from internal associates and preferred third party vendors) in applying Site Reliability Engineering principles to in-house developed applications. Optimise and reduce operational overheads through observability and service automation. Identify growth opportunities for your manager level reportees on how to achieve their technical, business and personal goals. Work closely with peer senior manager people leader(s … Technical leadership coupled with a passion for software engineering and operational processes. Strong background in software/system engineering and architecture within the cloud. Strong background/appreciation in observability principles, techniques and toolsets. Demonstrable knowledge in the software development lifecycle within a cloud based environment. Demonstrable knowledge of developing and managing RESTful API services written within a modern OO More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Lead DevOps Engineer

Leicester, England, United Kingdom
Hybrid / WFH Options
JR United Kingdom
balancing technical, security, and operational priorities. Bonus Skills (Nice to Have) Proficiency in infrastructure-as-code using Terraform. Experience setting up and managing CI/CD pipelines. Familiarity with observability tools and techniques. Fully remote role or hybrid option from Belfast. Long-term incentive scheme participation. Private health coverage, including critical illness and life insurance. Wellness support including gym discounts More ❯
Posted:

Lead DevOps Engineer

Lincoln, England, United Kingdom
Hybrid / WFH Options
JR United Kingdom
balancing technical, security, and operational priorities. Bonus Skills (Nice to Have) Proficiency in infrastructure-as-code using Terraform. Experience setting up and managing CI/CD pipelines. Familiarity with observability tools and techniques. Fully remote role or hybrid option from Belfast. Long-term incentive scheme participation. Private health coverage, including critical illness and life insurance. Wellness support including gym discounts More ❯
Posted:

Lead DevOps Engineer

Nottingham, England, United Kingdom
Hybrid / WFH Options
JR United Kingdom
balancing technical, security, and operational priorities. Bonus Skills (Nice to Have) Proficiency in infrastructure-as-code using Terraform. Experience setting up and managing CI/CD pipelines. Familiarity with observability tools and techniques. Fully remote role or hybrid option from Belfast. Long-term incentive scheme participation. Private health coverage, including critical illness and life insurance. Wellness support including gym discounts More ❯
Posted:

Lead DevOps Engineer

Chesterfield, England, United Kingdom
Hybrid / WFH Options
JR United Kingdom
balancing technical, security, and operational priorities. Bonus Skills (Nice to Have) Proficiency in infrastructure-as-code using Terraform. Experience setting up and managing CI/CD pipelines. Familiarity with observability tools and techniques. Fully remote role or hybrid option from Belfast. Long-term incentive scheme participation. Private health coverage, including critical illness and life insurance. Wellness support including gym discounts More ❯
Posted:

Lead DevOps Engineer

Derby, England, United Kingdom
Hybrid / WFH Options
JR United Kingdom
balancing technical, security, and operational priorities. Bonus Skills (Nice to Have) Proficiency in infrastructure-as-code using Terraform. Experience setting up and managing CI/CD pipelines. Familiarity with observability tools and techniques. Fully remote role or hybrid option from Belfast. Long-term incentive scheme participation. Private health coverage, including critical illness and life insurance. Wellness support including gym discounts More ❯
Posted:

Senior AI Engineer

Derby, England, United Kingdom
JR United Kingdom
with AI from day one: Make a real impact while developing your skills in generative AI - building chatbots/agents using LLM. ? Tackle complex engineering challenges: Think scalable infrastructure, observability, CI/CD pipelines, and model deployment. ? Work cross-functionally: Partner with product managers and machine learning teams to deliver practical, user-facing AI features. ? Join a company with serious More ❯
Posted:

Azure Architect

Nottingham, England, United Kingdom
JR United Kingdom
used for this (Visual Studio Code, Git). A great candidate will have good knowledge of Infrastructure as Code with Azure CI/CD integration, and the use of observability tools like Prometheus or Loki. Excellent skills in creating clear, detailed High Level & Low Level documentation with technical diagrams. A strong understanding of Microsoft Azure Best Practices – Well-Architected Framework More ❯
Posted:

AI Tech Lead – Agentic AI, LangGraph, ML, Python, CI/CD, LLM’s, Startup, UK Remote

Chesterfield, England, United Kingdom
Hybrid / WFH Options
JR United Kingdom
core systems, focusing on scalability, performance, and reliability. Write clean, maintainable code and contribute actively to the codebase. Define and uphold engineering best practices (code quality, CI/CD, observability, etc.). Collaborate closely with the CTO and product team to align technical delivery with strategic goals. Continuously improve team operations, development workflows, and developer experience. Play a key role More ❯
Posted:

AI Tech Lead – Agentic AI, LangGraph, ML, Python, CI/CD, LLM’s, Startup, UK Remote

Derby, England, United Kingdom
Hybrid / WFH Options
JR United Kingdom
core systems, focusing on scalability, performance, and reliability. Write clean, maintainable code and contribute actively to the codebase. Define and uphold engineering best practices (code quality, CI/CD, observability, etc.). Collaborate closely with the CTO and product team to align technical delivery with strategic goals. Continuously improve team operations, development workflows, and developer experience. Play a key role More ❯
Posted:

AI Tech Lead – Agentic AI, LangGraph, ML, Python, CI/CD, LLM’s, Startup, UK Remote

Northampton, England, United Kingdom
Hybrid / WFH Options
JR United Kingdom
core systems, focusing on scalability, performance, and reliability. Write clean, maintainable code and contribute actively to the codebase. Define and uphold engineering best practices (code quality, CI/CD, observability, etc.). Collaborate closely with the CTO and product team to align technical delivery with strategic goals. Continuously improve team operations, development workflows, and developer experience. Play a key role More ❯
Posted:

AI Tech Lead – Agentic AI, LangGraph, ML, Python, CI/CD, LLM’s, Startup, UK Remote

Lincoln, England, United Kingdom
Hybrid / WFH Options
JR United Kingdom
core systems, focusing on scalability, performance, and reliability. Write clean, maintainable code and contribute actively to the codebase. Define and uphold engineering best practices (code quality, CI/CD, observability, etc.). Collaborate closely with the CTO and product team to align technical delivery with strategic goals. Continuously improve team operations, development workflows, and developer experience. Play a key role More ❯
Posted:

AI Tech Lead – Agentic AI, LangGraph, ML, Python, CI/CD, LLM’s, Startup, UK Remote

Leicester, England, United Kingdom
Hybrid / WFH Options
JR United Kingdom
core systems, focusing on scalability, performance, and reliability. Write clean, maintainable code and contribute actively to the codebase. Define and uphold engineering best practices (code quality, CI/CD, observability, etc.). Collaborate closely with the CTO and product team to align technical delivery with strategic goals. Continuously improve team operations, development workflows, and developer experience. Play a key role More ❯
Posted:
Observability
the East Midlands
10th Percentile
£51,500
25th Percentile
£51,875
Median
£55,000
75th Percentile
£57,500