IBM V7000/V5000 storage systems Veritas NetBackup Proficiency in Linux, Windows, and AWS environments Strong Bash and Python scripting skills, with a focus on automation Experience with Prometheus, Grafana and Zabbix monitoring tools Experience with Docker Strong problem-solving skills and the ability to work independently and take initiative. Excellent collaboration skills, with the ability to work efficiently within More ❯
plus Knowledge of Redis and log queries is a plus Experience in automations/AI would be an advantage Experience administering multiple monitoring systems such as Datadog, NewRelic, Kubernetes, Grafana and Elastic Cloud Experience with Cloud Computing, AWS, Microservices Architecture, Unix and Linux Systems Empowered to think big. Try new opportunities while working with a talented, ambitious and supportive team. More ❯
Strong proficiency in Bash, Powershell and Ansible scripting, Python experience is desirable Expertise in virtualisation platforms and container orchestration and related tooling. Familiarity with monitoring and observability stacks (Prometheus, Grafana, ELK/EFK, or equivalents). Ability to diagnose and resolve complex technical issues with a clear methodical approach Ability to manage multiple tasks and prioritise effectively Is highly organised More ❯
JMeter/Gatling/wrk2 etc) and JVM profiling to identify and fix performance bottlenecks Hands-on experience with instrumentation and analysis of production metrics using tools like Prometheus, Grafana, InfluxDB, or the ELK stack to identify performance bottlenecks and ensure system health. As an industry pioneer, our work is constantly evolving and challenging us in new ways that require More ❯
such as: Puppet, Chef, Ansible, or related systems - Experience with performance testing and tuning - Experience in a 24x7 production environment - Significant experience of monitoring frameworks (such as CloudWatch, Datadog, Grafana, Elastic or similar) Amazon is an equal opportunities employer. We believe passionately that employing a diverse workforce is central to our success. We make recruiting decisions based on your experience More ❯
our risk appetite and provide and progress appropriate controls Help engineering teams to measure application performance and delivering insights, metrics and relevant alerts to the engineering teams with ELK, Grafana and New Relic Develop and support the processes necessary to ensure smooth operation of internal IT that connect all members of Lyst with the tooling and services they need every More ❯
incidents The Person: 5+ years in SRE, DevOps, or infrastructure engineering Strong experience with AWS, EKS/Kubernetes, and Terraform Familiar with Kafka and observability tools like Datadog or Grafana Able to troubleshoot issues across infrastructure and application layers Reference number: BBBH259300 To apply for this role or for to be considered for further roles, please click "Apply Now" or More ❯
City of London, London, United Kingdom Hybrid / WFH Options
Rise Technical Recruitment
incidents The Person: *5+ years in SRE, DevOps, or infrastructure engineering *Strong experience with AWS, EKS/Kubernetes, and Terraform *Familiar with Kafka and observability tools like Datadog or Grafana *Able to troubleshoot issues across infrastructure and application layers Reference number: BBBH(phone number removed) To apply for this role or for to be considered for further roles, please click More ❯
Employment Type: Permanent
Salary: £80000 - £90000/annum 38 Days Holiday, Healthcare, Pension
London, South East, England, United Kingdom Hybrid / WFH Options
Rise Technical Recruitment Limited
incidents The Person: *5+ years in SRE, DevOps, or infrastructure engineering*Strong experience with AWS, EKS/Kubernetes, and Terraform*Familiar with Kafka and observability tools like Datadog or Grafana*Able to troubleshoot issues across infrastructure and application layers Reference number: BBBH259300 To apply for this role or for to be considered for further roles, please click "Apply Now" or More ❯
South East London, London, United Kingdom Hybrid / WFH Options
Client Server
are degree educated, having achieved a 2.1 or above in Computer Science or closely related discipline Ideally you will also have some experience with Django or Flask, React.js, NoSQL, Grafana, Gitlab, AWS You're collaborative and pragmatic with great communication skills What's in it for you: As a Senior Full Stack Engineer you will receive a competitive package: Salary More ❯
London, South East, England, United Kingdom Hybrid / WFH Options
Client Server Ltd
are degree educated, having achieved a 2.1 or above in Computer Science or closely related discipline Ideally you will also have some experience with Django or Flask, React.js, NoSQL, Grafana, Gitlab, AWS You're collaborative and pragmatic with great communication skills What's in it for you: As a Senior Full Stack Engineer you will receive a competitive package: Salary More ❯
profession. Quality Engineering Tools and Technologies We use a diverse portfolio of tools, including: Frameworks: Serenity BDD (Selenium + RestAssured), bespoke frameworks Performance testing: JMeter Observability and Monitoring: ELK, Grafana, Dynatrace Responsibilities Your daily responsibilities will include: Maintaining and developing test frameworks, including code refactoring Developing predictable, repeatable tests with automated verification Choosing tools and approaches for virtualization, data mocking More ❯
Private Networks, DWDM and Optical Networking, Data Centre builds and design fundamentals. etc. Experience with network modelling Eagerness to learn new technologies and mentor others Experience with Telemetry: Splunk, Grafana, Humio Experience with continuous integration and deployment tools Experience implementing, maintaining and troubleshooting MPLS, BGP, OSPF, IGMP, PIM related internal and external network routing issues in a production environment Knowledge More ❯
Private Networks, DWDM and Optical Networking, Data Centre builds and design fundamentals. etc. Experience with network modelling Eagerness to learn new technologies and mentor others Experience with Telemetry: Splunk, Grafana, Humio Experience with continuous integration and deployment tools Experience implementing, maintaining and troubleshooting MPLS, BGP, OSPF, IGMP, PIM related internal and external network routing issues in a production environment Knowledge More ❯
AWS & GCP - we're cloud-native Microservice based architecture Kubernetes (EKS) TeamCity for CI/CD (lots of team are releasing code 15-20 times per day!) Terraform and Grafana Our process: Interviewing is a two way process and we want you to have the time and opportunity to get to know us, as much as we are getting to More ❯
AWS & GCP - we're cloud-native Microservice based architecture Kubernetes (EKS) TeamCity for CI/CD (lots of team are releasing code 15-20 times per day!) Terraform and Grafana Our process: Interviewing is a two way process and we want you to have the time and opportunity to get to know us, as much as we are getting to More ❯
AWS & GCP - we're cloud-native Microservice based architecture Kubernetes (EKS) TeamCity for CI/CD (lots of team are releasing code 15-20 times per day!) Terraform and Grafana Our process: Interviewing is a two way process and we want you to have the time and opportunity to get to know us, as much as we are getting to More ❯
AWS & GCP - we're cloud-native Microservice based architecture Kubernetes (EKS) TeamCity for CI/CD (lots of teams are releasing code 15-20 times per day!) Terraform and Grafana The team The Customer Identity & Fincrime division is at the forefront of understanding who our customers are and protecting them and the company from financial crime while ensuring a seamless More ❯
AWS & GCP - we're cloud-native Microservice based architecture Kubernetes (EKS) TeamCity for CI/CD (lots of teams are releasing code 15-20 times per day!) Terraform and Grafana The team The Customer Identity & Fincrime division is at the forefront of understanding who our customers are and protecting them and the company from financial crime while ensuring a seamless More ❯
Cloud native applications working in a continuous delivery environment Modern observability practices Nice to have Not vital, but you'll have the edge if you also have experience with: Grafana Prometheus Kotlin or a least the willingness to learn it or have worked in: an eCommerce organisation a shipping/logistics/exports organisation What you bring Agile : Test-Driven More ❯
Cloud native applications Working in a Continuous Delivery environment Modern observability practices Nice to have Not vital, but you'll have the edge if you also have experience with: Grafana Prometheus Kotlin or a least the willingness to learn it Batch processing data pipelines or have worked in: an eCommerce organisation a shipping/logistics/exports organisation What you More ❯
security best practices across cloud and network environments. Troubleshoot deployment and performance issues across multiple environments. Set up and maintain observability tools for logging, monitoring, and alerting (e.g., Prometheus, Grafana, Loki). Contribute to internal tooling to streamline development, testing, and operations workflows. Stay current with DevOps trends and recommend improvements to tools and processes. Required Qualifications: Bachelor's degree … Exposure to multi-cloud or hybrid cloud architectures. Tech Stack: Cloud: AWS, OCI ZTN: Cloudflare Application: Kong (API Gateway), Java Spring Boot, Python, Go, TypeScript Monitoring: Prometheus Stack (Prometheus, Grafana, Loki) Compute: ECS, EC2, Lambda Frontend: S3, CloudFront Data: Glue, S3, PostgreSQL CI/CD: GitHub Actions IaC: Terraform, AWS SAM Why Join Us? At Intelmatix, you'll work on More ❯
and security. Automation & CI/CD: Implement and manage CI/CD pipelines for efficient deployment, testing, and monitoring of applications. Observability & Monitoring: Develop comprehensive monitoring solutions using Prometheus, Grafana, ELK stack, or similar tools to improve system reliability. Security & Compliance: Apply best practices for cloud security, IAM policies, and compliance frameworks (SOC2, ISO 27001, etc.). Incident Response & Performance … . Proficiency in scripting and automation using Python, Bash, or Go. Experience with Infrastructure as Code (Terraform, CloudFormation, or Ansible). Familiarity with monitoring, logging, and observability tools (Prometheus, Grafana, Datadog, ELK, etc.). Strong understanding of networking concepts (VPC, Load Balancers, DNS, Firewalls). Experience with DevOps methodologies, CI/CD pipelines, and GitOps practices. Experience with high-performance More ❯