operational insights. Last updated 5 days ago Collaborate with SRE teams on building and enhancing tooling and automation solutions Work with customers to understand pain points around Supportability and SLO attainment Be the single point of contact for enterprise customer service escalations Implement changes to service telemetry for automation consumption Enhance customer More ❯
willing to present and defend your ideas to technical and non-technical audiences. Additional Desired Skills Experience with incident management platforms like PagerDuty, OpsGenie, or similar tools Understanding of SLO/SLA management and implementations Knowledge of industry standard incident management frameworks and best practices Familiarity with automated remediation and runbook automation Experience with DevOps and SRE practices Cultural Fit More ❯
teamwork. Build rapport with each member of the team and support them as they level up their skills. Define and maintain company-wide practices around SLO definition and management, incident management, postmortem analysis, and disaster testing and recovery. Generate informed insights regarding service quality and interface directly with executive leadership to communicate More ❯
Newcastle Upon Tyne, Tyne and Wear, North East, United Kingdom Hybrid / WFH Options
Develop
platform's core value streams. Key Responsibilities Technical Leadership & Strategy Champion engineering best practices, system reliability, and architectural integrity Define and track progress toward ServiceLevelObjectives (SLOs) Collaborate with product stakeholders to shape robust and scalable solutions Take responsibility for non-functional areas such as performance, maintainability, and security Provide More ❯
Required Provisioning and maintaining cloud-hosted environments in Amazon Web Services with Terraform Programming experience with React (or other JavaScript frameworks) Setting and maintaining servicelevelobjectives and servicelevel indicators Qualities We're Looking For Kind, passionate, and collaborative problem-solver who More ❯
Newcastle Upon Tyne, Tyne And Wear, United Kingdom
Bede Gaming Limited
you'll be doing Technical Leadership & Strategy Champion technical quality, system health, and architectural integrity across your value stream Define and drive progress towards ServiceLevelObjectives (SLOs) in collaboration with Principal Engineers Work closely with Product Owners and Product Managers to design scalable, high-performing technical solutions that align with More ❯
Newcastle Upon Tyne, Tyne And Wear, United Kingdom
Bede Gaming
to meet business needs. Champion the non-functional qualities of our data products-supportability, testability, security, compliance, maintainability, and performance. Drive progress towards our ServiceLevelObjectives (SLOs), ensuring our systems are reliable and resilient. Partner closely with Principal Engineers and technical architects to define and design data solutions aligned with More ❯
to build and enhance tooling and automation solutions, enabling faster resolution of issues impacting SLOs and preventing incidents when possible. Engage with customers to understand their supportability challenges and SLO attainment concerns, developing sustainable strategies to address recurring issues. Serve as the primary technical contact for interfacing with large enterprise customers, managing service escalations, and driving More ❯
feature ideas and betterments through tight cooperation inside and outside of your immediate team. Build trust and reliability in your products, review performance against servicelevelobjectives, address incidents and prioritize improvements. Qualifications: Not all applications will have skills that match a job description exactly. Ciptex values diverse experiences in other More ❯
their full potential through the Microsoft Cloud. We are fast growing team, but we make sure we are committed to remain agile. Customer first, nurturing trust, high responsiveness, automation, SLO/SLI/SLA, blameless post-mortem, observability, monitoring, alerting, and toil reduction form the foundations of our code and we work with teams across Microsoft and external customers to … Baseline Personnel Security Standards; UK Security Clearance Responsibilities Collaborating closely with the existing SRE teams on building and enhancing tooling and automation solutions for faster resolution of issues impacting SLO's and averting incidents altogether when possible. Collaborating with the customers to understand their pain points around Supportability and SLO attainment and formulate strategies for addressing recurring issues in a More ❯
Security Analyst, Security Operations and Incident Response Meta is seeking a Security Analyst to join the Global Security Operations and Incident Response team. The Analyst will serve on the front lines of Meta's Security team and will lead and More ❯