Birmingham, West Midlands, United Kingdom Hybrid/Remote Options
DWP Digital
Senior SiteReliability Engineer Pay up to £78,517 plus 28.97% employer pension contributions, hybrid working, flexible hours, and a truly great work life balance. DWP. … Digital with Purpose. We have a fantastic opportunity to join our community of experts at DWP Digital as a Senior SiteReliability Engineer, within one of our SRE teams at the heart of Digital Transformation. We're using fresh ideas and leading-edge tech to build and maintain digital solutions that will be used by nearly every person … Demonstrable experience of developing cloud based and supporting cloud-based applications in AWS & Azure. Incident Resolution: Strong experience in resolving complex technical incidents, ensuring minimal downtime and swift recovery. ReliabilityEngineering: Expertise in reliabilityengineering, including capacity and performance management through effective monitoring, logging, and alerting. Leadership: Demonstrated ability to engage with stakeholders at all levels More ❯
About the Role As a Senior SRE Engineer , you will be a hands-on technical expert driving the reliability, scalability, and availability of the engineering platform. Working collaboratively across teams, you will develop and implement automated solutions, address operational challenges, and ensure the platform's robust performance. This role demands strong technical acumen, a proactive mindset, and the … ability to influence platform improvements through technical excellence. Job Responsibilities Platform Stability and Reliability Ensure the platform meets performance, availability, and reliability SLAs. Proactively identify and resolve performance bottlenecks and risks in production environments. Maintain and improve monitoring, logging, and alerting frameworks to detect and prevent incidents. Incident Management Act as the primary responder for critical incidents, ensuring … faster and more reliable software delivery. Write and maintain Infrastructure as Code (IaC) using tools like Pulumi or Terraform to provision and manage resources. Collaboration and Mentorship Collaborate with SRE, CI/CD, Developer Experience, and Templates teams to improve the platform’s reliability and usability. Mentor junior engineers by sharing knowledge and best practices in SRE and operational More ❯
Job Title: Splunk SiteReliability Engineer/Migration Specialist (Contract) Location: Birmingham (Hybrid/On-site, required 3 days per week) Contract Type: Contract Duration: 3 months rolling Job Summary: We are seeking an experienced Splunk SME/Migration Specialist to lead and support the migration of observability workloads from Splunk to Elasticsearch (ELK Stack) . The … third-party tools. Ensure migration plans include validation, rollback procedures, and knowledge transfer. Platform Operations & Incident Response: Maintain Splunk infrastructure in both Production and Non-Production environments. Support Splunk SRE and Application teams in incident investigation and resolution. Proactively monitor system health and performance metrics. Upgrades and Change Management: Plan and execute upgrades to Splunk components. Perform pre- and post … tooling. Proficiency in Linux/Unix systems and networking protocols . Hands-on experience with Scripting (eg, Python, Shell/Bash). Experience supporting or working alongside DevOps/SRE teams . Strong analytical, troubleshooting, and communication skills. Desirable: Experience with containerized environments such as Docker or Kubernetes . Industry certifications such as Splunk Certified Power User/Admin/ More ❯