Birmingham, England, United Kingdom Hybrid / WFH Options
Deutsche Bank
environment for delivering change into production. In this role you will work in a team as the ITAO for Automation platforms in DB, focusing on operational maintenance automation and observability portfolio of applications. What we'll offer you A healthy, engaged and well-supported workforce are better equipped to do their best work and, more importantly, enjoy their lives inside More ❯
Derby, England, United Kingdom Hybrid / WFH Options
Cooper Parry
work cross-functionally in a hybrid business/technical team, so strong communication is a must. Of course you'll need an understanding of data pipeline monitoring, alerting, and observability best practices. You also need: Proven experience in a Data Engineering role within enterprise environments Hands-on experience designing, implementing and maintaining complex data pipelines using Azure Data Services. (Synapse More ❯
Stoke-on-Trent, England, United Kingdom Hybrid / WFH Options
JR United Kingdom
We are seeking Grafana Consultants to join a rapidly expanding AI Ops consultancy. The role involves working with clients on AI Ops projects, streamlining cloud operations, and advising on observability solutions. You will be part of a team scaling to support enterprise to SME clients, working remotely within the UK. Ideal candidates will have a couple of years of experience … as a System Engineer or SRE, with a keen interest in AI Ops, observability, and modern cloud technologies such as Grafana, PagerDuty, Prometheus, and related tools. This position offers a salary of £55,000 and the flexibility to work remotely within the UK. #J-18808-Ljbffr More ❯
Leicester, England, United Kingdom Hybrid / WFH Options
JR United Kingdom
Job Title: AI Ops | Grafana | Observability | Pagerduty | Prometheus | SRE | Site Reliability Engineer | Telecommunications | Consultant | Dashboard | Systems Engineer Client: Opus Recruitment Solutions Location: UK (Remote) Job Category: Other EU work permit required: Yes Job Views: 2 Posted: 06.06.2025 Expiry Date: 21.07.2025 Job Description: Are you looking to advance your career into SRE and AI Ops? An exciting opportunity has arisen with … advise on and implement AI Ops solutions for large enterprise to SME clients. This role involves working with international clients and scaling a team of 4 SREs in the Observability domain. The focus is on modern AI Ops projects to optimize cloud operations. If you have a couple of years of experience as a System Engineer or SRE and want More ❯
models for a suite of digital platforms, including web, mobile, APIs, and customer data services. Youll ensure 24/7 support through both internal and third-party resources, drive observability, automation, and scalability best practices, and help shape the modernisation of support functions as digital capabilities are increasingly brought in-house. Key Responsibilities: Lead and develop a high-performing digital … Page Load Time, MTTR, API Response Time, and Error Rates Work closely with engineering, product, infrastructure, and third-party vendors to support rapid change and stable delivery Establish strong observability and monitoring frameworks , making IT the first to know when issues arise Define and automate code handover processes , collaborating with product and engineering teams on Operational Acceptance Testing Contribute to … and communication skills A clear customer-first mindset and ability to deliver under pressure Ability to ruthlessly prioritise and adapt in rapidly changing environments Track record of embedding automation, observability, and performance monitoring tools in support teams ITIL v3/v4 Certification preferred If you're passionate about driving digital excellence and want to lead the future of application support More ❯
never the limit. The Role: Head of Platform Operations and Performance Join British Airways as our Head of Platform Operations and Performance and take the lead in ensuring resilience, observability, and excellence across our customer-facing digital journeys. This pivotal leadership role sits within the Commercial directorate and drives operational effectiveness for our Ecommerce & Platforms division. What You'll Do … Own and drive operational performance management across Ecommerce & Platforms, using operational, commercial and journey metrics (e.g. uptime, response times, bugs) to guide improvements. Foster a culture of observability by leveraging data insights to proactively monitor, troubleshoot, and optimise platform performance. Lead operational resilience, ensuring secure, scalable, and uninterrupted platform services, while managing SLAs and engaging with cybersecurity teams. Direct incident More ❯
never the limit. The Role: Head of Platform Operations and Performance Join British Airways as our Head of Platform Operations and Performance and take the lead in ensuring resilience, observability, and excellence across our customer-facing digital journeys. This pivotal leadership role sits within the Commercial directorate and drives operational effectiveness for our Ecommerce & Platforms division. What You’ll Do … Own and drive operational performance management across Ecommerce & Platforms, using operational, commercial and journey metrics (e.g. uptime, response times, bugs) to guide improvements. Foster a culture of observability by leveraging data insights to proactively monitor, troubleshoot, and optimise platform performance. Lead operational resilience, ensuring secure, scalable, and uninterrupted platform services, while managing SLAs and engaging with cybersecurity teams. Direct incident More ❯
Coventry, England, United Kingdom Hybrid / WFH Options
JR United Kingdom
technical teams, including hiring, coaching, and performance management Solid commercial C++ experience on complex systems Proven experience with large, multi-component systems and distributed team practices Strong background in observability and logging Familiarity with infrastructure-as-code and automated deployments (Terraform, Helm or Flux) We make an active choice to be inclusive towards everyone every day. Please let us know More ❯
Stratford-upon-avon, Warwickshire, United Kingdom Hybrid / WFH Options
Big Red Recruitment
strategy and improvement We're looking for someone who: Proven experience in IT operational leadership roles Strong knowledge of Azure, DevOps/cloud engineering, and platform reliability Experience driving observability and monitoring strategies across large-scale services Good working knowledge of cybersecurity practices, frameworks (NIST, ISO 27001, Cyber Essentials), and audit processes Vendor, contract, and license management experience across software More ❯
Birmingham, England, United Kingdom Hybrid / WFH Options
SCC
hybrid cloud solutions. Define and maintain reference architectures, ensuring consistency and scalability across the product portfolio. Develop and implement Infrastructure as Code templates to standardise deployments. Establish and promote observability practices, including monitoring, logging, and alerting. Support the engineering teams with design reviews, coding tasks, and troubleshooting complex issues. Oversee performance and reliability objectives, ensuring products meet agreed service levels. More ❯
Telford, England, United Kingdom Hybrid / WFH Options
JR United Kingdom
Investigations: Lead technical deep-dives and spike solutions to evaluate technologies, libraries, and approaches for improving system reliability, auditing, and financial reconciliation accuracy. Open Standards: Support our commitment to observability and open standards. Contribute to initiatives around OpenTelemetry, OpenAPI, and other tools that improve transparency and traceability across services. About you At least 5 years of professional experience in software More ❯
Lincoln, England, United Kingdom Hybrid / WFH Options
JR United Kingdom
Investigations: Lead technical deep-dives and spike solutions to evaluate technologies, libraries, and approaches for improving system reliability, auditing, and financial reconciliation accuracy. Open Standards: Support our commitment to observability and open standards. Contribute to initiatives around OpenTelemetry, OpenAPI, and other tools that improve transparency and traceability across services. About you At least 5 years of professional experience in software More ❯
Birmingham, England, United Kingdom Hybrid / WFH Options
JR United Kingdom
Investigations: Lead technical deep-dives and spike solutions to evaluate technologies, libraries, and approaches for improving system reliability, auditing, and financial reconciliation accuracy. Open Standards: Support our commitment to observability and open standards. Contribute to initiatives around OpenTelemetry, OpenAPI, and other tools that improve transparency and traceability across services. About you At least 5 years of professional experience in software More ❯
Wolverhampton, England, United Kingdom Hybrid / WFH Options
JR United Kingdom
Investigations: Lead technical deep-dives and spike solutions to evaluate technologies, libraries, and approaches for improving system reliability, auditing, and financial reconciliation accuracy. Open Standards: Support our commitment to observability and open standards. Contribute to initiatives around OpenTelemetry, OpenAPI, and other tools that improve transparency and traceability across services. About you At least 5 years of professional experience in software More ❯
Northampton, England, United Kingdom Hybrid / WFH Options
JR United Kingdom
Investigations: Lead technical deep-dives and spike solutions to evaluate technologies, libraries, and approaches for improving system reliability, auditing, and financial reconciliation accuracy. Open Standards: Support our commitment to observability and open standards. Contribute to initiatives around OpenTelemetry, OpenAPI, and other tools that improve transparency and traceability across services. About you At least 5 years of professional experience in software More ❯
Nottingham, England, United Kingdom Hybrid / WFH Options
JR United Kingdom
Investigations: Lead technical deep-dives and spike solutions to evaluate technologies, libraries, and approaches for improving system reliability, auditing, and financial reconciliation accuracy. Open Standards: Support our commitment to observability and open standards. Contribute to initiatives around OpenTelemetry, OpenAPI, and other tools that improve transparency and traceability across services. About you At least 5 years of professional experience in software More ❯
Stoke-on-Trent, England, United Kingdom Hybrid / WFH Options
JR United Kingdom
Investigations: Lead technical deep-dives and spike solutions to evaluate technologies, libraries, and approaches for improving system reliability, auditing, and financial reconciliation accuracy. Open Standards: Support our commitment to observability and open standards. Contribute to initiatives around OpenTelemetry, OpenAPI, and other tools that improve transparency and traceability across services. About you At least 5 years of professional experience in software More ❯
Leicester, England, United Kingdom Hybrid / WFH Options
TieTalent
QA team, fostering a safe, inclusive and supportive culture Embed quality thinking across our squads, ensuring QA is a shared responsibility from ideation to release Champion test automation, performance, observability and secure-by-design thinking across our systems Evolve tools and frameworks that support fast, scalable and confident delivery Collaborate with Engineering, Product and Platform teams to deliver reliable, member … and Design to embed a quality-first culture Familiarity with test planning and scenario development as part of day-to-day delivery A working knowledge of performance, security, and observability testing in modern systems A background in product-led, fast-growing environments where quality and speed go hand in hand Perspective gained from working in more than one organisation, bringing More ❯
the Global FinOps services. Analysing capacity usage data, generating reports, and presenting findings to senior management. Identifying opportunities to enhance reporting quality, grow the scope and coverage of capacity observability and improve capacity reporting. Identifying potential capacity-related risks and developing mitigation strategies to prevent performance issues or service disruptions. Suitable candidates will have: Experience of leading technical IT infrastructure More ❯
Birmingham, West Midlands, United Kingdom Hybrid / WFH Options
Halian Technology Limited
within the last 2 years Strong understanding of CI/CD , Continuous Testing , and Shift Left/Right principles Hands-on technical skills, including TDD , pairing , and experience with observability practices (e.g. logs, metrics, APM) Able to coach and mentor developers in testing and quality ownership Comfortable working in cross-functional teams embedded with engineers Excellent grasp of modern quality More ❯
Stratford-upon-avon, Warwickshire, United Kingdom Hybrid / WFH Options
NFU Mutual
also be or have: Working and demonstrable knowledge and experience of Citrix NetScaler or Checkpoint Firewalls (essential) In depth knowledge of Cisco Switches (essential) Good working experience of Network observability and troubleshooting issues (essential) Exposure to cloud technologies (essential) Knowledge of governance or architectural practices and working within in a regulated industry would be advantageous. Internally this role is known More ❯
East Midlands, United Kingdom Hybrid / WFH Options
Halian Technology Limited
support team - Crafting and executing support strategies for web, app, and API frameworks - Guaranteeing round-the-clock support for critical systems with a robust incident management approach - Championing automation, observability, and performance tracking - Collaborating closely with product development, engineering teams, and external vendors - Shaping the trajectory of our internal digital capabilities for the future More ❯