Mansfield, England, United Kingdom Hybrid/Remote Options
Future Talent Group
using Terraform. Implement and optimise CI/CD pipelines using GitHub Actions, Docker, and GitOps practices. Deploy, orchestrate, and manage Kubernetes (AKS/Container Apps) workloads. Configure monitoring and observability with Azure Monitor, Application Insights, Log Analytics, and OpenTelemetry. Partner with software engineering and infrastructure teams to drive DevOps best practices across the organisation. Manage security and compliance in Azure More ❯
/mentoring development teams in the correct application of DevOps principles It would be great if you also have: Hands-on experience with Cloud Security Experience using Monitoring/Observability, Logging, and Alerting platforms High level understanding of networking fundamentals covering Load balancers, DNS, virtual networks and firewalls Experience delivering technical documentation for the environments you setup Experience with serverless More ❯
Birmingham, West Midlands, United Kingdom Hybrid/Remote Options
Inspire People
Edinburgh or Belfast. About the Role As a Senior Site Reliability Engineer, you will: - Build and scale DBT's product platform and services in AWS. - Provide development teams with observability, monitoring, CI/CD pipelines and service-level objectives. - Participate in an on-call rota (with allowance), helping to keep DBT services resilient and reliable. - Mentor junior engineers and contribute More ❯
tools such as Airflow ● Experience with cloud platforms (AWS, GCP, or Azure) and infrastructure-as-code tools (e.g., Docker, Terraform, CloudFormation). ● Familiarity with data quality, data governance, and observability tools (e.g., Great Expectations, Monte Carlo).[3] ● Experience with BI and data visualization tools (e.g., Looker, Tableau, Power BI). ● Experience working with product analytics solution (Amplitude, Mixpanel) ● Experience More ❯
using Playwright . Drive performance, load, and scalability testing with tools like k6 . Embed quality checks into CI/CD pipelines (GitHub Actions, Jenkins, GitLab CI). Use observability tools (logs, metrics, traces) for smarter system feedback. Champion BDD and collaborative quality practices across teams. Mentor others in modern QE methods and system thinking. 🧠 What You’ll Bring Strong More ❯
Warwick, England, United Kingdom Hybrid/Remote Options
Ocho
in Git, SQL optimisation, and async architecture. Excellent communicator who values clarity, documentation, and collaboration. Nice to Have Experience with Supabase , Kubernetes , Docker , Azure , GitHub Actions , vector databases , or observability tools like Prometheus , Grafana , and Langfuse . What Success Looks Like 3 months: You’ve established your 1:1 rhythm, shipped your first automation workflow, and built a trusted partnership More ❯
Nottingham, Nottinghamshire, United Kingdom Hybrid/Remote Options
London Stock Exchange Group
not static and must evolve over time as technology and standards change. You are not afraid to dive deep - writing code, defining standards around CI/CD, maximizing automation, observability and supportability whilst making sure solutions are cost effective. A confident communicator, you will lead with data when collaborating with stakeholders. You will lead by example and mentor more junior More ❯
with React, Vue, or Blazor Integrate LLMs and GenAI features into core product experiences Lead technical decision-making and mentor engineers within your squad Ensure best practices across testing, observability, and code quality What We’re Looking For Proven experience delivering AI/ML-powered production systems (not prototypes) Strong full-stack capability – C# .NET + modern JavaScript frameworks Solid More ❯
Worcester, England, United Kingdom Hybrid/Remote Options
Chapman Tate Associates
Infrastructure as Code (Terraform, Bicep, PowerShell). Solid grasp of Azure security and identity management in line with Zero Trust principles. Experience with CI/CD pipelines , monitoring , and observability tools such as Azure Monitor and Log Analytics. Excellent communication skills, stakeholder engagement, and a proactive approach to problem-solving. Strategic mindset with the ability to balance hands-on delivery More ❯
Loughborough, Leicestershire, East Midlands, United Kingdom Hybrid/Remote Options
Previsico Limited
Science, Operations, and Design to shape roadmaps and deliver impactful outcomes. Lead system refactoring and optimisation initiatives , managing technical debt pragmatically and promoting long-term system health. Ensure high observability and resilience through monitoring, alerting, and proactive issue prevention. Contribute to and influence technical strategy within your team or domain, linking engineering priorities to business objectives. Represent engineering in external More ❯
experience with LLMs/GenAI/ML in production Strong background in C#, .NET, REST APIs , and cloud platforms (Azure, AWS, or GCP) Agile mindset with focus on testing, observability, and secure delivery Excellent communication and cross-functional collaboration skills Nice to have Experience with vector databases , RAG systems , or multi-agent AI Python skills for AI/ML development More ❯
West Midlands (County), Birmingham, United Kingdom Hybrid/Remote Options
Sherborne Talent Solutions
automation, and optimisation of CI/CD pipelines to drive speed, reliability, and consistency. Manage and optimise Azure infrastructure for scalability, security, performance, and cost control. Champion modern monitoring, observability, and incident management practices to maintain high availability. Partner with engineering, architecture, and product leadership to accelerate delivery and reduce operational friction. Drive adoption of FinOps principles to balance technical More ❯
ensure efficient delivery of software updates. Senior DevOps Engineer (in addition to above) Contribute to the architecture and evolution of our cloud infrastructure strategy. Drive best practices for automation, observability, and security within DevOps. Mentor and coach junior team members, supporting their technical growth. Evaluate new technologies and tools to improve operational efficiency. Champion continuous improvement across our delivery pipelines … GitHub Actions, or Jenkins). Experience with Infrastructure as Code (Terraform, Bicep, or ARM templates). Proficiency in scripting languages (PowerShell, Bash, or Python). Experience with monitoring and observability tools (e.g., Application Insights, Grafana, Prometheus). Understanding of containerisation and orchestration (Docker, Kubernetes). Familiarity with security best practices in cloud environments. Desirable Experience within SaaS or FinTech environments. More ❯
for leading and executing the migration of data, dashboards, alerts, and configurations from Splunk systems to Elasticsearch. This role involves deep technical expertise in Splunk architecture, data ingestion, and observability tools, along with strong project management and stakeholder communication skills. Must have skills: -Splunk -ELK Stack -Kibana Nice to have skills: -stakeholder communication skills -strong project management Responsibilities: Minimum number More ❯
Birmingham, England, United Kingdom Hybrid/Remote Options
EML
deployment processes with a focus on minimizing security risks. Site Reliability Engineering (SRE): Ensure system reliability, scalability, and performance through proactive monitoring and secure incident response. Develop and implement observability tools to monitor system health, detect anomalies, and identify security threats. Perform root cause analysis and implement solutions to prevent recurring issues, including security vulnerabilities. Define and measure Service Level More ❯
Mansfield, England, United Kingdom Hybrid/Remote Options
develop
written A collaborative team player who thrives in a hybrid work environment Nice to Have Experience working in retail or consumer-facing product teams Knowledge of monitoring, analytics, or observability tools Exposure to testing in microservices architectures More ❯
Birmingham, West Midlands, United Kingdom Hybrid/Remote Options
ByteHire
or communicating with robotic automation systems and integrating with physical devices Desktop app development with Electron CI/CD setup, rollback strategies, and deployment automation Sentry, NewRelic, or other observability tooling implementation More ❯
United Kingdom, Birmingham, West Midlands (County)
Uniting Ambition
with MLOps practices and AI development frameworks (e.g., Azure AI, LangChain, Hugging Face). Relevant certifications in Azure Architecture, Data, or AI disciplines. Knowledge of automation tools, monitoring, and observability platforms. If you have these skills and would like to find out more, please apply now. More ❯
and tuning system performance across multiple services and environments. Supporting development teams with deployment pipelines, CI/CD processes, and platform tools. Troubleshooting complex application and infrastructure challenges. Championing observability, incident response, and continuous improvement within SRE practices. What We’re Looking For Strong experience with Microsoft Azure and cloud-native technologies. Deep knowledge of Terraform, Kubernetes, and App Services. … pipelines with Azure DevOps. Experience in a Site Reliability, DevOps, or Platform Engineering role. Solid scripting or programming ability (PowerShell, Bash, Python, or similar). Familiarity with monitoring and observability tools such as Datadog, Azure Application Insights, or Log Analytics. Excellent collaboration and communication skills with the ability to work cross-functionally. A proactive mindset with a genuine passion for More ❯
based microservices. Troubleshoot production issues, ensuring uptime and documenting processes on the internal wiki. Automate deployments, testing processes, and infrastructure provisioning (Terraform, Ansible, GitHub Actions). Implement monitoring and observability solutions for proactive issue detection. Provide occasional support for internal IT infrastructure (e.g., laptops, printers, office networking). Occasionally maintain and support CMS platforms (Magento, Joomla, WordPress). Experience Required … management) Docker containerization Python scripting for automation Git version control Desirable (Future-Facing Skills): Infrastructure as Code (Terraform, Pulumi, Ansible) Container orchestration (Kubernetes) Go development for microservice utilities Modern observability tools (Prometheus, Grafana, Datadog) CI/CD pipeline management (GitHub Actions, GitLab CI, Jenkins) Firewall-as-a-Service solutions (e.g., Cloudflare) Endpoint/device management (e.g., Intune, NinjaOne) Exposure to More ❯
Birmingham, West Midlands, United Kingdom Hybrid/Remote Options
Robert Walters
to improve performance Develop strategies to improve performance across group technology DevOps Lead: Experience Technical dept across but not limited to: Java, UNIX, Linux, Middleware, Web-Logic, Cloud Platforms Observability tools Designing/Developing/Implementing technology advancements Experience of improving resilience of complex production environments The permanent opportunity for a DevOps Lead will pay a salary range of More ❯
next-generation AI products. You’ll join a small, experienced team developing an internal Kubernetes-based platform that enables AI innovation across the organisation automating everything from deployments to observability, and helping developers build smarter applications with confidence. What you’ll be doing: Designing, deploying, and maintaining Azure Kubernetes (AKS) environments Managing Infrastructure as Code with Terraform and improving GitOps … workflows (ArgoCD/GitHub Actions) Building observability and monitoring stacks using Prometheus, Grafana, and Loki Supporting AI workloads (LLMs, RAG, and document processing applications) running on Kubernetes Automating platform operations with Python, Go, and shell scripting Implementing security guardrails, PII compliance tooling, and best practices for production AI systems What you’ll need: 3+ years’ experience in DevOps or Platform … Engineering Strong background in Azure and Kubernetes Hands-on experience with Terraform, CI/CD, and container orchestration Familiarity with observability tools (Prometheus, Grafana, Loki) Scripting or programming skills in Python or Go Interest in AI infrastructure, LLMOps, or large language model deployment More ❯
across the organization. What you’ll be doing: Building and maintaining a Kubernetes-hosted AI platform (AKS) Deploying and managing LLMOps tools such as LiteLLM, Langflow, and Langfuse Implementing observability with Prometheus, Grafana, and Loki Managing infrastructure through Terraform, ArgoCD, and GitHub Actions Supporting internal AI applications including RAG, document processing, and internal AI assistants What you’ll need … years in Platform or DevOps Engineering (Azure preferred) Strong experience with Kubernetes, Docker, and Terraform Programming or scripting skills in Python or Go Familiarity with GitOps, Helm, and observability tools A learning mindset and interest in LLM operations More ❯