Leeds, Yorkshire, United Kingdom Hybrid / WFH Options
William Hill PLC
our ethos. To apply to this post, you will have: A base in Leeds with working experience of an incident response model and fluency with observability and monitoring (Prometheus, Grafana) Experience defining alerts and implementing dashboards from existing monitoring and logging data Relentless focus on customer experience with good understanding of security best practice Fluency in cloud infrastructure (AWS) - using More ❯
collaborate effectively with cross-functional teams, including DevOps, Engineering, Service Reliability, and Service Delivery teams. Technical Expertise: In-depth knowledge of open-source and commercial observability tools (e.g., Prometheus, Grafana, NewRelic). Expertise in cloud environments (e.g., AWS, Azure) and infrastructure as code (IaC) tools like Terraform. Monitoring and Observability: Experience in creating and maintaining dashboards for proactive monitoring of More ❯
to managing our infrastructure, using Terraform. - We follow a GitOps approach to managing our Kubernetes configuration, using ArgoCD and Helm. - We manage a high-availability metrics collection system using Grafana, Thanos & Prometheus. We're in the process of transitioning to OpenTelemetry and Honeycomb for our application telemetry (traces and metrics). - We manage a data pipeline using Pub/Sub More ❯
systems, such as Puppet, Chef, Ansible, DSC, or related systems - Experience with performance testing and optimisation in a 24x7 production environment - Experience using monitoring platforms, such as CloudWatch, Datadog, Grafana, Elastic or similar Amazon is an equal opportunities employer. We believe passionately that employing a diverse workforce is central to our success. We make recruiting decisions based on your experience More ❯
provided by GCP/AWS, such as S3, FSX, EKS, SQS, SNS, Kinesis, AmazonMQ, DynamoDB, GKE, CloudStorage, PubSub, Filestore, Knowledge of modern observability technologies such as ELK, Splunk, Prometheus, Grafana, Micrometer "What-if" thinking, while designing or reviewing solutions, to foresee or catch potential problems as early in the development process, as only possible Nice to have: Good knowledge of More ❯
primary language for our backend codebase AWS & GCP - we're cloud-native Kubernetes (EKS) Microservice based architecture RESTful APIs PostgreSQL, JDBI, Flyway TeamCity for CI/CD Terraform and Grafana The Team: The Core Banking group is seeking passionate engineers ready to tackle complex challenges and contribute to foundational systems, powering modern banking, that process millions of transactions daily, ensuring More ❯
and distributed storage. Proficiency in Python, Bash, and experience with automation scripting for system monitoring and troubleshooting. Knowledge of POSIX, NFS, S3 protocols, log management, and monitoring tools (Prometheus, Grafana). It's nice if you have: Experience with JIRA, Confluence, Slack, and other collaboration tools. Experience collaborating between customer support and product development teams. Familiarity with Kubernetes, Containers, LXC More ❯
Department: Tech Services Location: SEGA West London Reporting To: Head of Corporate Infrastructure Position Overview: We are seeking an experienced Senior Build + Release Engineer with games industry experience to design, deploy, and maintain our CI/CD and build More ❯
/Polygraph Clearance Required Qualifications Experience building distributed systems. Experience performing application, network, and infrastructure monitoring and analysis. Familiarity with open source tools such as Istio, Keycloak, Nginx, Prometheus, Grafana, Accumulo, and Elasticsearch. Experience with administering Kubernetes clusters including deploying and configuring operators and helm charts. Experience with one or more of the following programming languages: Go, Java, Javascript, Kotlin More ❯
IaC principles and automation tools such as Ansible and SaltStack Experience with Elastic Stack (Elasticsearch/Kibana/Logstash/Beats) Experience with time-series visualization tools such as GrafanaMore ❯
Operations & Maintenance, Networks, VLANs, VPNs, Firewalls, SSPs, STE/STN, Netseer, SEAR logging, patching, IAVMs, Docker, MongoDB, Elastic Search, Ansible, GitLab, RedHat Satellite, Prometheus, Grafana, SELinux, LatteArt, Biscotti Due to federal contract requirements, United States citizenship and an active TS/SCI security clearance and polygraph are required for the position. Required: Must be a US Citizen. Must have TS … MongoDB, and Elastic Search; Ansible and GitLab; RedHat Satellite. Must have working knowledge of software-defined networks. Experience deploying and maintaining nginx; system monitoring platforms like OpenVAS, Prometheus and Grafana; SELinux and FIPS. Must have knowledge of LatteArt/Biscotti, Netseer, SEAR logging. $210,000 - $235,000 a year The pay range for this job, with multi-levels, is a More ❯
Hertfordshire office. In this role, you'll take ownership of the end-to-end monitoring and alerting stack, designing and maintaining infrastructure and alert configurations (e.g., with Prometheus/Grafana or equivalent), and building dashboards that clearly communicate metrics to business stakeholders. You'll drive system automation and integration, crafting scripts and workflows-primarily in Python-to onboard new services More ❯
Desired Skills General HPC technical knowledge regarding compute, network, memory, and storage components Experience with the Elastic Stack (Elasticsearch/Kibana) Experience with time-series visualization tools such as Grafana Experience writing scripts using Bash/Python Experience with IaC principles and automation tools such as Ansible (Puppet and SaltStack acceptable) Requirements Twenty (20) years' experience as a SE in More ❯
Nottingham, Nottinghamshire, East Midlands, United Kingdom Hybrid / WFH Options
Oscar Associates (UK) Limited
Azure AD) and support security/compliance across the estate Troubleshoot and manage network infrastructure - Cisco switches, VLANs, firewalls Support backup (Rubrik), DR (Zerto), and monitoring tools (Dynatrace, Zabbix, Grafana) What We're Looking For: Strong hands-on experience with Linux in enterprise environments Solid background in escalated infrastructure support (3rd/4th line) Scripting and automation skills (Bash, Python More ❯
Manage and configure host-based security systems. Experience with DevOps methodologies and tools, supporting Infrastructure as Code (IaC) within RHEL containers. Use monitoring and logging tools, such as Prometheus, Grafana, ELK, IBM NetCool, and Solarwinds. Patch and manage systems using Red Hat Satellite Server. Provide hardware support for servers and workstations. Experience/knowledge of cloud platforms such as AWS More ❯
years of experience Ensuring Uptime of Critical Systems (Incident Response/Triage) Automating Systems Administration Activities (Bash/Python/Ansible are preferred) Monitoring, and Troubleshooting Enterprise Services (Prometheus, Grafana, Splunk) Configuring Enterprise Services (Ansible, YAML, JSON) Developing recovery procedures for large systems (Backup and Restore, Blue/Green Deployment) Moseley Technical Services, Inc. is an AA/EEO/ More ❯
Town Centre, Telford, Shropshire, England, United Kingdom Hybrid / WFH Options
TXP
progress and milestones. Skills & Experience Strong background in test management and Agile delivery. Hands-on experience with CI/CD (GitLab), automation tools (Playwright, OWASP Zap, Gatling), and monitoring (Grafana, Splunk). Familiarity with cloud (AWS/Azure), Kubernetes, and databases (Oracle RDS, SQL, MongoDB). Technical knowledge of Java 21 and Spring Boot. Deep understanding of all test levels More ❯
predictive analytics. Understanding of AI frameworks and libraries (e.g., TensorFlow, PyTorch, Scikit-learn) and their application in network automation and monitoring. Experience with telemetry and observability frameworks (e.g., Prometheus, Grafana) for real-time network monitoring and troubleshooting. Experience : Minimum of 7 years' of experience in network engineering, operations, and support. Proven ability to work hands-on and take strong technical More ❯
principles and automation tools such as Ansible, Puppet and SaltStack General HPC technical knowledge regarding compute, network, memory, and storage components Experience with monitoring and observability tools such as Grafana Clearance: TS/SCI clearance with polygraph is required. Total Compensation Package We offer a comprehensive compensation package designed to support your well-being and professional growth. Our competitive base More ❯
for you. Ideally you have several years experience using Go in production. You'll be comfortable with Docker, and familiar with modern observability tools such as Prometheus, Alert Manager, Grafana and X-Ray/Tempo/Jaeger. We're looking for 3+ years tackling hard backend problems Seasoned database experience - we use MySQL, DynamoDB, Elasticsearch and Redis Experience with microservices More ❯
for you. Ideally you have several years experience using Go in production. You'll be comfortable with Docker, and familiar with modern observability tools such as Prometheus, Alert Manager, Grafana and X-Ray/Tempo/Jaeger. We're looking for 3+ years tackling hard backend problems Seasoned database experience - we use MySQL, DynamoDB, Elasticsearch and Redis Experience with microservices More ❯
Private Networks, DWDM and Optical Networking, Data Centre builds and design fundamentals. etc. Experience with network modelling Eagerness to learn new technologies and mentor others Experience with Telemetry: Splunk, Grafana, Humio Experience with continuous integration and deployment tools Experience implementing, maintaining and troubleshooting MPLS, BGP, OSPF, IGMP, PIM related internal and external network routing issues in a production environment Knowledge More ❯
Willingness to tackle challenging problems and make meaningful contributions to the success of both the team and the organization. Nice to Have: Experience with Docker and Kubernetes. Familiarity with Grafana and other monitoring tools. Prior experience with Scala and Java is an advantage. What we offer You will have the chance to be involved in something impactful, large-scale, and More ❯
Bracknell, Berkshire, United Kingdom Hybrid / WFH Options
Techex
Experience of public cloud platform architecture/design CCNP or higher/equivalent non-cisco qualification (Routing & Switching or Data-Centre/SDN) Experience with either Influx, Redis, Kafka, Grafana, Kibana Our Values and Benefits We have secured Great Place to work accreditation for the past two years and we seek out individuals who enjoy developing their professional skills, are More ❯
plus Knowledge of Redis and log queries is a plus Experience in automations/AI would be an advantage Experience administering multiple monitoring systems such as Datadog, NewRelic, Kubernetes, Grafana and Elastic Cloud Experience with Cloud Computing, AWS, Microservices Architecture, Unix and Linux Systems Life @ Empowered to think big. Try new opportunities while working with a talented, ambitious and supportive More ❯