Observability Jobs in the Midlands

DevOps Engineer

Nottingham, England, United Kingdom
GTS Group Ltd
based microservices. Troubleshoot production issues, ensuring uptime and documenting processes on the internal wiki. Automate deployments, testing processes, and infrastructure provisioning (Terraform, Ansible, GitHub Actions). Implement monitoring and observability solutions for proactive issue detection. Provide occasional support for internal IT infrastructure (e.g., laptops, printers, office networking). Occasionally maintain and support CMS platforms (Magento, Joomla, WordPress). Experience Required … management) Docker containerization Python scripting for automation Git version control Desirable (Future-Facing Skills): Infrastructure as Code (Terraform, Pulumi, Ansible) Container orchestration (Kubernetes) Go development for microservice utilities Modern observability tools (Prometheus, Grafana, Datadog) CI/CD pipeline management (GitHub Actions, GitLab CI, Jenkins) Firewall-as-a-Service solutions (e.g., Cloudflare) Endpoint/device management (e.g., Intune, NinjaOne) Exposure to …

DevOps Lead

Birmingham, West Midlands, United Kingdom
Hybrid/Remote Options
Robert Walters
to improve performance Develop strategies to improve performance across group technology DevOps Lead: Experience Technical depth across, but not limited to: Java, UNIX, Linux, Middleware, WebLogic, Cloud Platforms Observability tools Designing/Developing/Implementing technology advancements Experience of improving resilience of complex production environments The permanent opportunity for a DevOps Lead will pay a salary range of …
Employment Type: Permanent, Work From Home
Salary: £80,000

DevOps Engineer

Birmingham, England, United Kingdom
Explore Group
next-generation AI products. You’ll join a small, experienced team developing an internal Kubernetes-based platform that enables AI innovation across the organisation, automating everything from deployments to observability, and helping developers build smarter applications with confidence. What you’ll be doing: Designing, deploying, and maintaining Azure Kubernetes (AKS) environments Managing Infrastructure as Code with Terraform and improving GitOps … workflows (ArgoCD/GitHub Actions) Building observability and monitoring stacks using Prometheus, Grafana, and Loki Supporting AI workloads (LLMs, RAG, and document processing applications) running on Kubernetes Automating platform operations with Python, Go, and shell scripting Implementing security guardrails, PII compliance tooling, and best practices for production AI systems What you’ll need: 3+ years’ experience in DevOps or Platform … Engineering Strong background in Azure and Kubernetes Hands-on experience with Terraform, CI/CD, and container orchestration Familiarity with observability tools (Prometheus, Grafana, Loki) Scripting or programming skills in Python or Go Interest in AI infrastructure, LLMOps, or large language model deployment …

Site Reliability Engineer

Hereford, Herefordshire, England, United Kingdom
Hybrid/Remote Options
Hays Specialist Recruitment Limited
role focused on ensuring service availability, performance, and cost-efficiency across both cloud and on-prem infrastructure. You'll work closely with development and support teams to evolve infrastructure, enhance observability, and proactively mitigate reliability risks. Key Responsibilities: Collaborate with software engineers to improve reliability and performance Automate operational tasks and reduce alert fatigue Enhance monitoring and observability to pre-empt issues Support development environments … protocols Experience with cloud platforms, ideally AWS (EC2, RDS, S3, Lambda) Desirable: Coding experience in Java, Go, Python or similar Knowledge of cross-domain technologies Experience in service management environments Practical application of observability patterns Experience with Azure Additional Information: Due to the nature of the work, successful candidates will be required to undergo security vetting. We welcome applications from all backgrounds and are committed to creating …
Employment Type: Contractor
Rate: £500 - £600 per day

Machine Learning Engineer

Greater Coventry Area, United Kingdom
Explore Group
across the organization. What you’ll be doing: Building and maintaining a Kubernetes-hosted AI platform (AKS) Deploying and managing LLMOps tools such as LiteLLM, Langflow, and Langfuse Implementing observability with Prometheus, Grafana, and Loki Managing infrastructure through Terraform, ArgoCD, and GitHub Actions Supporting internal AI applications including RAG, document processing, and internal AI assistants What you’ll need … years in Platform or DevOps Engineering (Azure preferred) Strong experience with Kubernetes, Docker, and Terraform Programming or scripting skills in Python or Go Familiarity with GitOps, Helm, and observability tools A learning mindset and interest in LLM operations …

Site Reliability Engineer

Nottingham, England, United Kingdom
Hybrid/Remote Options
KDR Talent Solutions
importantly, drive the blameless post-mortem process to find the root cause and engineer a permanent fix. Partner with development teams to consult on new features, ensuring reliability and observability are designed in from day one. What You'll Need: Deep experience in the Microsoft Azure ecosystem (especially PaaS, App Services). Strong commercial experience with Infrastructure as Code (especially … proven background in an SRE, DevOps, or Software Engineering (with an operations focus) role. Solid scripting/programming skills for automation (e.g., PowerShell, Python, Bash). Expertise with modern observability tools (e.g., Datadog, Application Insights, Log Analytics, Grafana). A collaborative mindset with a strong sense of ownership and a passion for engineering reliability. What's In It For You …

Senior Machine Learning Engineer

Warwick, England, United Kingdom
DeepRec.ai
A fast-growing technology business is developing advanced software for accounting, payroll, tax, and practice management. With a strong engineering foundation and a clear commercial vision, the company is now expanding its focus on artificial intelligence to transform how professional …

Senior Agentic AI Engineer

Birmingham, England, United Kingdom
Method Resourcing
act? This is a chance to design and deliver agentic AI systems on Azure that automate real business workflows through tool use, retrieval, and reasoning, with the reliability and observability of true production engineering. In this position you’ll take ownership of designing and scaling end-to-end agentic solutions on Azure, combining LLMs, APIs, and orchestration frameworks to deliver … Productionise on Azure using AI Foundry/OpenAI, Azure ML, Functions, Event Grid/Service Bus, and Kubernetes. Build LLMOps pipelines for evaluation, monitoring, safety, and cost control. Define observability standards across prompts, tools, and data flows. Establish governance patterns, safety, privacy, and auditability. Stay hands-on with critical code paths while guiding architecture and best practice. 🧠 Required Skills/ …

Head of Cloud – Contract (Outside IR35)

Derby, England, United Kingdom
Hybrid/Remote Options
Experis UK
Head of Cloud – Contract (Outside IR35) Location: Hybrid (East Midlands/London 1-2 days/week onsite) Rate: Up to £700/day Contract Type: Outside IR35 Duration: 3-6 months (initial), with potential extension Start Date: ASAP About …

Senior Software Engineer - Building APIs - C# - .NET - SQL - Azure - AI

Royal Leamington Spa, England, United Kingdom
InterCity Partners Ltd
patterns where appropriate Ensure APIs are well-documented using OpenAPI/Swagger standards Build and maintain a developer portal for internal and external API consumers Quality & Operations Implement comprehensive observability including logging, monitoring, and alerting Design for reliability, fault tolerance, and graceful degradation Optimize API performance, scalability, and cost efficiency Write clean, maintainable code with thorough testing and documentation Configure … and modern security patterns Testing mindset - you write unit tests and understand integration testing API documentation experience using OpenAPI/Swagger and maintaining developer portals Production systems mindset covering observability, reliability, and operational excellence Architectural thinking - ability to design systems for scale, security, and evolution Keywords RESTful APIs C# .Net Azure AI LLM ML Machine Learning SaaS Scale Up OAuth …

CDS Platform Engineer (Splunk)

Telford, Shropshire, West Midlands, United Kingdom
Sanderson Government and Defence
insight, and proactive incident management. Key Responsibilities Translate high-level monitoring non-functional requirements (NFRs) into actionable configurations across tools such as Splunk, Dynatrace, and AppDynamics. Deliver full-stack observability solutions, including application-aware network performance monitoring (NPM), synthetics, log analytics, and infrastructure metrics. Provide live support for monitoring technologies and assist with live service support, including key business events … improvement initiatives and tooling exploitation to enhance operational efficiency within immature teams Required Skills and Experience Strong understanding and experience of SRE principles and methodologies Strong understanding of Observability within a complex tech stack Hands-on experience with monitoring tools such as Splunk, Splunk ITSI, Dynatrace, AppDynamics, and synthetic monitoring platforms. Strong understanding and experience with implementing and using …
Employment Type: Contract
Rate: £500 - £550 per day

Full Stack Engineer - AI

Birmingham, England, United Kingdom
Hybrid/Remote Options
Amberes
product features. You will move fast from concept to customer, working across the stack to design APIs, build front-end interfaces, integrate AI models, and ensure performance, reliability, and observability in production. Key Responsibilities Build and ship AI-driven features end-to-end, from prototype to production Design, implement, and maintain inference services with strong observability Develop and optimise retrieval …

SC OaaS CDS Platform Engineer

Telford, Shropshire, United Kingdom
Hybrid/Remote Options
Experis IT
Dependent on business needs. Rate: up to £552 p/d Umbrella inside IR35 Clearance required: SC eligible but Active Security Clearance is desired Role purpose/summary The Observability as a Service (OaaS) Programme comprising an Observability Centre of Excellence (COE) and Dynatrace SaaS platform requires an engineer with Dynatrace understanding to be responsible for supporting the Dynatrace OaaS …
Employment Type: Contract
Rate: GBP Daily

Observability, the Midlands
10th Percentile: £52,500
25th Percentile: £57,188
Median: £73,801
75th Percentile: £112,406
90th Percentile: £195,000