Southampton, Hampshire, South East, United Kingdom Hybrid / WFH Options
Spectrum IT Recruitment Limited
on familiarity with the Grafana Observability Suite, including tools like Loki, Mimir, and Tempo. Background in administering or developing with popular monitoring and automation tools such as Splunk, Datadog, PagerDuty, or Rundeck. Experience using configuration management platforms like Ansible, Puppet, or Chef. Professional certifications in cloud DevOps, such as AWS Certified DevOps Engineer or Google Cloud Professional DevOps Engineer, or More ❯
Hampshire, England, United Kingdom Hybrid / WFH Options
Spectrum IT Recruitment
on familiarity with the Grafana Observability Suite, including tools like Loki, Mimir, and Tempo. Background in administering or developing with popular monitoring and automation tools such as Splunk, Datadog, PagerDuty, or Rundeck. Experience using configuration management platforms like Ansible, Puppet, or Chef. Professional certifications in cloud DevOps, such as AWS Certified DevOps Engineer or Google Cloud Professional DevOps Engineer, or More ❯
making large directional technical decisions (e.g. deciding which technology or pattern to create or leverage). Experience being "on-call" for a service, and familiarity with incident notification tooling (e.g. PagerDuty, Opsgenie). Comprehensive understanding of SRE principles (e.g. working knowledge of the Google SRE book). Demonstrated strength in leading a project in an agile/scrum environment. Thrives in a diverse More ❯
proper technological solutions impacting patients and clinicians worldwide. Responsibilities: Production system monitoring and alerting. Automation of system provisioning and deployments. Incident response and troubleshooting (willingness to be on a PagerDuty rota). Managing post-mortems, documenting run books and proposing improvements. Design and development of features and tools to support the reliability of the system. Process refinement, documentation, and communication More ❯
Stoke-on-Trent, England, United Kingdom Hybrid / WFH Options
bet365
Who we are looking for: A Site Reliability Engineer who will enhance system reliability, observability and performance through a strong engineering approach and assist with incident resolution and best practices. You will have software engineering skills, focusing on system reliability More ❯
Manchester, England, United Kingdom Hybrid / WFH Options
bet365
Who we are looking for: A Site Reliability Engineer who will enhance system reliability, observability and performance through a strong engineering approach and assist with incident resolution and best practices. You will have software engineering skills, focusing on system reliability More ❯
DHCP, TCP/IP, firewalls, routing). Experience with Windows and Linux server environments. Experience with monitoring and notification tools (e.g., CloudWatch, Datadog, Zabbix, SolarWinds, Nagios, PRTG, Opsgenie or PagerDuty). Scripting skills (PowerShell, JavaScript, Bash, Python preferred). Knowledge of backup and recovery tools (e.g., Veeam, Azure Backup, AWS Backup). Understanding of identity and access management (Azure AD More ❯
A Site Reliability Engineer, who will enhance system reliability, observability and performance through a strong engineering approach and assist with incident resolution and best practices. You will have software engineering skills, focusing on system reliability and observability. You will monitor More ❯
/Unix server and Oracle RDS knowledge. Working experience with automation/job scheduling tools such as Autosys and GitHub Actions. Working experience with monitoring tools like Grafana, Splunk, Obstack, and PagerDuty. Good communication and organization skills working within a DevOps team supporting a wider IT delivery team. Nice-to-Have Skills Description: PL/SQL, scripting languages (Python). Advanced troubleshooting experience More ❯
the wheels are greased and the ship is humming along. Automation, Packaging, Deployments (custom hypervisor, Puppet, Git, scripting, CI services). Source code version control systems: Git & SVN. Monitoring: New Relic, PagerDuty, etc. Programming languages: Bash, PHP, Go. Virtualisation: We have a custom hypervisor (ZPF) but things like VMware ESXi/KVM/oVirt/RHEV. Classics: Apache, MySQL, Redis, Varnish Logstash More ❯
customers. In addition to day-to-day responsibilities, all of our engineers participate in operating and supporting the product in production and take part in a 12x7 on-call PagerDuty rotation, typically every 5-7 weeks depending on team size. More ❯
verbal and written communication skills and are willing to present and defend your ideas to technical and non-technical audiences. Additional Desired Skills: Experience with incident management platforms like PagerDuty, OpsGenie, or similar tools. Understanding of SLO/SLA management and implementations. Knowledge of industry-standard incident management frameworks and best practices. Familiarity with automated remediation and runbook automation. Experience More ❯
successful in this role, you should have: Experience in architecture and engineering of Event Intelligence Solutions/AIOps platforms. Experience engineering monitoring platforms such as IBM Netcool, Moogsoft, BigPanda, PagerDuty, ServiceNow AIOps. Proficiency in Python, and hands-on knowledge of Ansible Automation Platform. Other highly valued skills include: Knowledge of Observability Platforms: Prometheus, Grafana, ELK, Splunk. Experience with integration into More ❯
healing services that allow you to sleep through the night. Support the production systems on prem (legacy) and GCP which includes build, deployment (CI/CD), and monitoring (Datadog, PagerDuty). Deliver high quality code by following coding standards, performing code reviews and ensuring test coverage at all levels. Collaborate with engineers and other teams to elevate technology, follow agile More ❯
Job Description Grow with the challenge At the time of inception, cloud-based systems and API fluency were new. But that didn't scare us. We waved goodbye to legacy systems and embraced modern technology - where our true innovation began More ❯
AI Ops | Grafana | Observability | PagerDuty | Prometheus | SRE | Site Reliability Engineer | Telecommunications | Consultant | Dashboard | Systems Engineer Looking to make a step into SRE? Excited by the prospect of AI Ops? I've partnered with an exciting business that has recently been acquired by a European leader in the AI Ops consultancy space. Taking on the UK market to replicate their consistent success … in mainland Europe, they're now adding multiple Grafana and PagerDuty consultants to the business to advise and consult on AI Ops for clients ranging from large-scale enterprises right through to SME-size businesses. Working with clients across the world, this one will be scaling a team of 4 to join their expert SREs in the Observability domain. This one, in particular … is hiring multiple people, such as Grafana and PagerDuty specialists, to take on modern, state-of-the-art AI Ops projects and streamline businesses' cloud operations. If you've got a couple of years' experience working as a Systems Engineer or SRE and are looking for somewhere to really get hands-on with AI Ops, then get in touch. In return the More ❯
City of London, London, United Kingdom Hybrid / WFH Options
Opus Recruitment Solutions
AI Ops | Grafana | Observability | PagerDuty | Prometheus | SRE | Site Reliability Engineer | Telecommunications | Consultant | Dashboard | Systems Engineer Looking to make a step into SRE? Excited by the prospect of AI Ops? I've partnered with an exciting business that has recently been acquired by a European leader in the AI Ops consultancy space. Taking on the UK market to replicate their consistent success … in mainland Europe, they're now adding multiple Grafana and PagerDuty consultants to the business to advise and consult on AI Ops for clients ranging from large-scale enterprises right through to SME-size businesses. Working with clients across the world, this one will be scaling a team of 4 to join their expert SREs in the Observability domain. This one, in particular … is hiring multiple people, such as Grafana and PagerDuty specialists, to take on modern, state-of-the-art AI Ops projects and streamline businesses' cloud operations. If you've got a couple of years' experience working as a Systems Engineer or SRE and are looking for somewhere to really get hands-on with AI Ops, then get in touch. In return the More ❯
East London, London, United Kingdom Hybrid / WFH Options
Opus Recruitment Solutions
AI Ops | Grafana | Observability | PagerDuty | Prometheus | SRE | Site Reliability Engineer | Telecommunications | Consultant | Dashboard | Systems Engineer Looking to make a step into SRE? Excited by the prospect of AI Ops? I've partnered with an exciting business that has recently been acquired by a European leader in the AI Ops consultancy space. Taking on the UK market to replicate their consistent success … in mainland Europe, they're now adding multiple Grafana and PagerDuty consultants to the business to advise and consult on AI Ops for clients ranging from large-scale enterprises right through to SME-size businesses. Working with clients across the world, this one will be scaling a team of 4 to join their expert SREs in the Observability domain. This one, in particular … is hiring multiple people, such as Grafana and PagerDuty specialists, to take on modern, state-of-the-art AI Ops projects and streamline businesses' cloud operations. If you've got a couple of years' experience working as a Systems Engineer or SRE and are looking for somewhere to really get hands-on with AI Ops, then get in touch. In return the More ❯
Leigh, Greater Manchester, United Kingdom Hybrid / WFH Options
Opus Recruitment Solutions
AI Ops | Grafana | Observability | PagerDuty | Prometheus | SRE | Site Reliability Engineer | Telecommunications | Consultant | Dashboard | Systems Engineer Looking to make a step into SRE? Excited by the prospect of AI Ops? I've partnered with an exciting business that has recently been acquired by a European leader in the AI Ops consultancy space. Taking on the UK market to replicate their consistent success … in mainland Europe, they're now adding multiple Grafana and PagerDuty consultants to the business to advise and consult on AI Ops for clients ranging from large-scale enterprises right through to SME-size businesses. Working with clients across the world, this one will be scaling a team of 4 to join their expert SREs in the Observability domain. This one, in particular … is hiring multiple people, such as Grafana and PagerDuty specialists, to take on modern, state-of-the-art AI Ops projects and streamline businesses' cloud operations. If you've got a couple of years' experience working as a Systems Engineer or SRE and are looking for somewhere to really get hands-on with AI Ops, then get in touch. In return the More ❯
Altrincham, Greater Manchester, United Kingdom Hybrid / WFH Options
Opus Recruitment Solutions
AI Ops | Grafana | Observability | PagerDuty | Prometheus | SRE | Site Reliability Engineer | Telecommunications | Consultant | Dashboard | Systems Engineer Looking to make a step into SRE? Excited by the prospect of AI Ops? I've partnered with an exciting business that has recently been acquired by a European leader in the AI Ops consultancy space. Taking on the UK market to replicate their consistent success … in mainland Europe, they're now adding multiple Grafana and PagerDuty consultants to the business to advise and consult on AI Ops for clients ranging from large-scale enterprises right through to SME-size businesses. Working with clients across the world, this one will be scaling a team of 4 to join their expert SREs in the Observability domain. This one, in particular … is hiring multiple people, such as Grafana and PagerDuty specialists, to take on modern, state-of-the-art AI Ops projects and streamline businesses' cloud operations. If you've got a couple of years' experience working as a Systems Engineer or SRE and are looking for somewhere to really get hands-on with AI Ops, then get in touch. In return the More ❯
Bury, Greater Manchester, United Kingdom Hybrid / WFH Options
Opus Recruitment Solutions
AI Ops | Grafana | Observability | PagerDuty | Prometheus | SRE | Site Reliability Engineer | Telecommunications | Consultant | Dashboard | Systems Engineer Looking to make a step into SRE? Excited by the prospect of AI Ops? I've partnered with an exciting business that has recently been acquired by a European leader in the AI Ops consultancy space. Taking on the UK market to replicate their consistent success … in mainland Europe, they're now adding multiple Grafana and PagerDuty consultants to the business to advise and consult on AI Ops for clients ranging from large-scale enterprises right through to SME-size businesses. Working with clients across the world, this one will be scaling a team of 4 to join their expert SREs in the Observability domain. This one, in particular … is hiring multiple people, such as Grafana and PagerDuty specialists, to take on modern, state-of-the-art AI Ops projects and streamline businesses' cloud operations. If you've got a couple of years' experience working as a Systems Engineer or SRE and are looking for somewhere to really get hands-on with AI Ops, then get in touch. In return the More ❯
Leeds, West Yorkshire, United Kingdom Hybrid / WFH Options
Opus Recruitment Solutions
AI Ops | Grafana | Observability | PagerDuty | Prometheus | SRE | Site Reliability Engineer | Telecommunications | Consultant | Dashboard | Systems Engineer Looking to make a step into SRE? Excited by the prospect of AI Ops? I've partnered with an exciting business that has recently been acquired by a European leader in the AI Ops consultancy space. Taking on the UK market to replicate their consistent success … in mainland Europe, they're now adding multiple Grafana and PagerDuty consultants to the business to advise and consult on AI Ops for clients ranging from large-scale enterprises right through to SME-size businesses. Working with clients across the world, this one will be scaling a team of 4 to join their expert SREs in the Observability domain. This one, in particular … is hiring multiple people, such as Grafana and PagerDuty specialists, to take on modern, state-of-the-art AI Ops projects and streamline businesses' cloud operations. If you've got a couple of years' experience working as a Systems Engineer or SRE and are looking for somewhere to really get hands-on with AI Ops, then get in touch. In return the More ❯
Bolton, Greater Manchester, United Kingdom Hybrid / WFH Options
Opus Recruitment Solutions
AI Ops | Grafana | Observability | PagerDuty | Prometheus | SRE | Site Reliability Engineer | Telecommunications | Consultant | Dashboard | Systems Engineer Looking to make a step into SRE? Excited by the prospect of AI Ops? I've partnered with an exciting business that has recently been acquired by a European leader in the AI Ops consultancy space. Taking on the UK market to replicate their consistent success … in mainland Europe, they're now adding multiple Grafana and PagerDuty consultants to the business to advise and consult on AI Ops for clients ranging from large-scale enterprises right through to SME-size businesses. Working with clients across the world, this one will be scaling a team of 4 to join their expert SREs in the Observability domain. This one, in particular … is hiring multiple people, such as Grafana and PagerDuty specialists, to take on modern, state-of-the-art AI Ops projects and streamline businesses' cloud operations. If you've got a couple of years' experience working as a Systems Engineer or SRE and are looking for somewhere to really get hands-on with AI Ops, then get in touch. In return the More ❯