Newcastle Upon Tyne, Tyne And Wear, United Kingdom
One Utility Bill
Cloud platform (preferably AWS) Experience of CI/CD pipelines and maintaining live systems Experience conducting code reviews Used monitoring tools (e.g. Datadog, Sentry, Prometheus) Understand security principles and experience assisting with security issues What you'll get from us You can expect a salary of up to More ❯
very welcome) in different cloud technologies and Azure and AWS Cloud Platforms. Experience managing monitoring, alerting, observability, and dashboarding platforms (such as AWS Monitor, Prometheus, Grafana, and Elasticsearch). Good understanding of NOC and DevOps practices. Experience and in-depth knowledge of databases and data handling. Solid understanding of ITIL More ❯
teams Experience with automation tools (Ansible, Puppet, etc.) Experience supporting message-based architecture (Solace, Tibco, MQ, etc.) Experience with industry-standard monitoring tools (ITRS, Prometheus or similar) Working knowledge and experience working with SNMP and iLO protocols AWS-certified to SysOps Administrator level Basic knowledge of the FIX protocol and More ❯
Wrexham, Clwyd, United Kingdom Hybrid / WFH Options
TieTalent
cloud offerings from major providers (Azure experience is a plus). Experience with metrics, monitoring, alerting, and logging systems such as Sensu, Graphite, ELK, Prometheus, or CloudWatch. Company: Our client is one of the largest pharmaceutical companies in the world, focusing on addressing urgent medical needs. Location: This role is More ❯
with public cloud offerings from major cloud providers (Azure experience is a plus) Managing metrics, monitoring, alerting and logging systems; e.g. Sensu, Graphite, ELK, Prometheus, CloudWatch Company: Our client is one of the largest pharmaceutical companies in the world. They focus on finding answers for some of the world's More ❯
public cloud offerings from major cloud providers (Azure experience is a plus). Managing metrics, monitoring, alerting and logging systems; e.g. Sensu, Graphite, ELK, Prometheus, CloudWatch. Company: Our client is one of the largest pharmaceutical companies in the world. They focus on finding answers for some of the world's More ❯
managing databases (ideally Postgres, MongoDB). Experience of event streaming (Apache Kafka) would also be beneficial. Familiarity with observability platforms such as Grafana, Zabbix, Prometheus, OpenTelemetry/SigNoz. Experience of mobile telecoms principles and platforms would be advantageous but is not mandatory (such as EPC, DIAMETER/SS7 signalling, GTP More ❯
to transform the TechOps team Participate in the operational management of OpenShift Work with technologies such as Ansible, PowerShell, C#, SQL Server, Elastic, Grafana, Prometheus, OpenTelemetry, bare-metal builds, Hyper-V automation What we are looking for: Experience in TechOps, especially with Infrastructure as Code Familiarity with development technologies like More ❯
Solace PubSub+ messaging Strong knowledge of production support Good understanding of WAN, networking and latency etc Solid knowledge of tools such as Grafana and Prometheus etc DevOps tooling experience would be ideal Proficiency in troubleshooting message delivery, persistence, and topic routing etc Good Linux/Unix knowledge Excellent communication skills More ❯
Be a key player involved in decision-making and collaborating with other stakeholders in the team 📈 💻 Tech Stack: React, TypeScript, Go, GCP, Docker, Terraform, Prometheus You’ll be a great fit if you have: A degree in Computer Science (or related field) from a top global university Strong fullstack experience More ❯
have worked in offshore and onshore model Should have hands-on experience with tools such as OpsCenter, OpsManager, Talena, Icinga, Splunk, Grafana, ELK/ELF, CTAM Prometheus, Cohesity, Nimsoft, GCP Knowledge of the banking domain is good to have Experience in Linux operating systems Experience in shell scripting and Ansible ROLES AND RESPONSIBILITIES More ❯
HAProxy, Nginx) and network monitoring tools. Experience in DNS management and troubleshooting. Experience in network security best practices. Proficiency in monitoring and observability tools (Prometheus, Grafana, Splunk). Proficiency in at least one scripting language (Python, Bash) for automation. Experience with CI/CD pipeline management and DevOps practices. Strong … system performance. Experience in tools like df, du, lsblk, and fdisk for managing and troubleshooting file systems and disk partitions. Familiarity with tools like Prometheus and Grafana for monitoring and observability More ❯
London (Hammersmith), South East England, United Kingdom
OpenSource
infrastructure. Recruit and lead a growing team of data engineers. Tech Stack Python (3.10+), Pandas, NumPy PostgreSQL (TimescaleDB), SQL optimization RabbitMQ, ZeroMQ, Linux servers Prometheus, Grafana, Zabbix Requirements 5+ years of Data Engineering experience with expertise in Python and SQL. Proven leadership experience guiding teams and projects. Strong background in More ❯
Solid understanding of software design principles and coding best practices. Hands-on experience with GitHub Actions and SDLC automation. Proficient in observability tools like Prometheus and Grafana. Strong communication skills and the ability to work in a collaborative, distributed team environment. Bonus: Familiarity with Jira APIs. More ❯
Sheffield, South Yorkshire, Yorkshire and the Humber, United Kingdom Hybrid / WFH Options
PURVIEW
ability to interpret and apply control requirements in technical design contexts. • Hands-on experience with performance monitoring, alerting systems, and diagnostic tooling (e.g., Geneos, Prometheus, Grafana, AppDynamics, or similar tools). • Strong communication skills — able to convey technical concepts to senior stakeholders and control partners. Desirable: • Experience in implementing or More ❯
London, South East England, United Kingdom Hybrid / WFH Options
Maxwell Bond
Senior Developer, you’ll work with Java (17+), Spring Boot, Kotlin and NoSQL databases. They’ve got their own internal cloud and tooling using Prometheus and Grafana. As their Senior Developer, you’ll be: A key advocate for best coding practice, standards, and innovation. TDD approach is key for this More ❯
Manchester, Lancashire, United Kingdom Hybrid / WFH Options
Tbwa Chiat/Day Inc
a SaaS or DBaaS environment. Strong understanding of cloud infrastructure components (e.g., compute, storage, networking) and their cost drivers. Experience with observability tools (e.g., Prometheus, Grafana, OpenTelemetry) and a deep understanding of monitoring and alerting best practices. Exceptional communication skills, capable of articulating complex technical concepts to diverse audiences. Demonstrated More ❯
and reliability across production and non-production environments. This includes working on incident response, capacity planning, WAN optimization, and system observability using tools like Prometheus and Grafana. Key Responsibilities: Administer and maintain Solace PubSub+ appliances and software brokers across environments (on-prem and cloud). Provide production support for … messaging-related incidents, including root cause analysis and resolution. Monitor system performance and health using Prometheus and Grafana; proactively identify and address anomalies. Configure and optimize Solace across WAN environments, ensuring low-latency, secure, and reliable messaging. Collaborate with development and application support teams to troubleshoot message flow issues and … in a 24x7 enterprise environment. Experience working with distributed systems over WAN, with an understanding of networking, latency, and failover strategies. Solid experience with Prometheus and Grafana for system monitoring and alerting. Proficiency in troubleshooting message delivery, persistence, and topic routing. Experience with capacity management, performance tuning, and system scaling. More ❯
and applications across their entire IT estate. You’ll help drive the vision, design and implementation of monitoring and observability systems including OpenTelemetry, Grafana, Prometheus and Splunk etc. Working side by side with DevOps teams you’ll also have the chance to work with containers and Kubernetes, OpenShift, Docker and … monitoring, DevOps and automation tools. Requirements: Excellent previous experience in a similar Observability/Monitoring role. Experience of engineering and supporting solutions (OpenTelemetry, Grafana, Prometheus, Splunk etc) Experience with tools such as Jenkins, Ansible or Puppet Good knowledge of Linux and infrastructure support Experience of CI/CD, Cloud (AWS More ❯