Southampton, Hampshire, United Kingdom Hybrid / WFH Options
NICE
GitLab CI/CD, or CircleCI. Strong knowledge of containerization technologies (e.g., Docker, Kubernetes) and microservices architecture. Experience with monitoring and observability tools (e.g., Prometheus, Grafana, ELK stack, Cloudwatch). Excellent problem-solving skills and the ability to troubleshoot complex issues in distributed systems. Experience of Incident management and blameless More ❯
in at least one programming language that compiles to machine code such as Rust, C++, or Go. Expert knowledge of monitoring technologies such as Prometheus, Grafana, and PagerDuty. Expert knowledge of deployment technologies such as Pulumi or Terraform. Expert knowledge of Kubernetes. Responsibilities: Improving our observability by adding/adjusting More ❯
Cambridge, Cambridgeshire, United Kingdom Hybrid / WFH Options
AI Tech Suite
in at least one programming language that compiles to machine code such as Rust, C++, or Go. Expert knowledge of monitoring technologies such as Prometheus, Grafana, and PagerDuty. Expert knowledge of deployment technologies such as Pulumi or Terraform. Expert knowledge of Kubernetes. Responsibilities: Improving our observability by adding/adjusting More ❯
AWS cloud services to implement highly efficient architecture. Ability to analyze infrastructure and implement security best practices. Experience with infrastructure monitoring tools like Nagios, Prometheus, Grafana. Expertise in containerization platforms like Docker and container orchestration platforms like Kubernetes and Rancher. Familiarity with infrastructure as code tools such as Terraform, CloudFormation More ❯
Bash, or PowerShell for automation. Understanding of AWS networking concepts, including VPCs, subnets, and security groups. Experience with monitoring and logging solutions such as Prometheus, Grafana, ELK Stack, or AWS CloudWatch. Familiarity with Zero Trust security models and best practices for securing cloud workloads. Ability to troubleshoot complex infrastructure issues More ❯
london, south east england, united kingdom Hybrid / WFH Options
LHH
Bash, or PowerShell for automation. Understanding of AWS networking concepts, including VPCs, subnets, and security groups. Experience with monitoring and logging solutions such as Prometheus, Grafana, ELK Stack, or AWS CloudWatch. Familiarity with Zero Trust security models and best practices for securing cloud workloads. Ability to troubleshoot complex infrastructure issues More ❯
own end to end delivery of solutions. Expert knowledge of SRE fundamentals and a commitment to best practice Fluency with common observability tooling like Prometheus, Grafana, OTEL and Cloudwatch Experience analysing and building data telemetry, querying (PromQL), modelling, pipelines and dashboards to provide concise, focused insights and alerts for distributed More ❯
or Google Cloud Platform. • Security: Experience with tools for delivering SCA, SAST, DAST capabilities. • Monitoring and Logging: Proficiency with tools like Splunk, Dynatrace, Datadog, Prometheus, Grafana. • Version Control: Strong understanding of Git and version control practices. • Scripting: Skills in scripting languages like Bash, PowerShell, or Perl. • Containerization: Familiarity with Docker More ❯
or Google Cloud Platform. • Security: Experience with tools for delivering SCA, SAST, DAST capabilities. • Monitoring and Logging: Proficiency with tools like Splunk, Dynatrace, Datadog, Prometheus, Grafana. • Version Control: Strong understanding of Git and version control practices. • Scripting: Skills in scripting languages like Bash, PowerShell, or Perl. • Containerization: Familiarity with Docker More ❯
Bristol, Gloucestershire, United Kingdom Hybrid / WFH Options
Searchability
CI/CD) and automation tools like Terraform and Ansible Programming : Proficiency in Python, Go, or Ruby Monitoring and Observability : Hands-on experience with Prometheus, Grafana, ELK Stack, or similar technologies Core Attributes A passion for solving complex technical challenges in high-availability production environments Strong communication and collaboration skills More ❯
Greater Bristol Area, United Kingdom Hybrid / WFH Options
Searchability NS&D
GitLab CI/CD) and automation tools like Terraform and Ansible Programming : Proficiency in Python, Go, or Ruby Monitoring & Observability : Hands-on experience with Prometheus, Grafana, ELK Stack, or similar technologies Core Attributes A passion for solving complex technical challenges in high-availability production environments Strong communication and collaboration skills More ❯
Kubernetes). Understanding of CI/CD pipelines. Familiarity with scripting languages like Python, Bash, or Go. Experience with monitoring tools such as Datadog, Prometheus, Grafana, or ELK stack. Strong problem-solving, communication skills, and ability to work independently or in teams. Additional notes We value diverse backgrounds and perspectives. More ❯
infrastructure as code (IaC) tools such as Terraform, Ansible, or Chef for automation and configuration management. Strong understanding of monitoring and observability tools like Prometheus, Grafana, Azure App Insights for proactive system monitoring and troubleshooting. Knowledge of networking, security principles, and best practices in a cloud environment. Demonstrated experience of More ❯
/IP, DNS, HTTP Experience of deploying Continuous Integration solutions An awareness of security considerations in web application deployment Monitoring/Logging aka ELK, Prometheus/Grafana etc Strong AWS knowledge - EC2, EKS, RDS, Aurora, networking, cost management If you'd like to discuss this DevOps Engineer in more detail More ❯
Experience with TDD, BDD, and automated testing frameworks (PyTest, Selenium). Familiarity with security best practices in software development. Knowledge of observability tools like Prometheus, Grafana, and ELK stack. More ❯
Deploy, GitHub Actions, TeamCity, Jenkins, Azure DevOps). Monitoring and Observability Familiarity with observability concepts and ensuring system reliability. Experience with monitoring tools like Prometheus, Grafana and Sumo Logic. System Configuration and Documentation Experience designing, documenting, amending and refactoring moderately complex software configurations for deployment and system components. Server Administration More ❯
Deploy, GitHub Actions, TeamCity, Jenkins, Azure DevOps). Monitoring and Observability Familiarity with observability concepts and ensuring system reliability. Experience with monitoring tools like Prometheus, Grafana and Sumo Logic. System Configuration and Documentation Experience designing, documenting, amending and refactoring moderately complex software configurations for deployment and system components. Server Administration More ❯
Strong understanding of modern infrastructure and site reliability engineering practice, including Infrastructure-as-code tools (e.g. Terraform, Ansible ) and metrics and observability tools (e.g. Prometheus, Grafana ). Strong understanding of modern DevOps practice, including DevOps stacks (e.g. Jenkins, GitLab, CircleCI ). Cloud experience (e.g. AWS, Google Cloud, Azure, Kubernetes). More ❯
in Java and C++. We use Airflow for workflow management, Kafka for data pipelines, Bitbucket for source control, Jenkins for continuous integration, Grafana + Prometheus for metrics collection, ELK for log shipping and monitoring, Docker and Kubernetes for containerisation, OpenStack for our private cloud, Ansible and Terraform for architecture automation More ❯
culture by leading knowledge-sharing sessions and supporting issue resolution. Skills Strong grounding in SRE principles and operational best practices. Proficient with observability tools (Prometheus, Grafana, OTEL, Cloudwatch) and telemetry pipelines. Solid programming skills in Python and/or Go; Java experience a plus. Hands-on AWS expertise (EKS, SQS More ❯
culture by leading knowledge-sharing sessions and supporting issue resolution. Skills Strong grounding in SRE principles and operational best practices. Proficient with observability tools (Prometheus, Grafana, OTEL, Cloudwatch) and telemetry pipelines. Solid programming skills in Python and/or Go; Java experience a plus. Hands-on AWS expertise (EKS, SQS More ❯
City of London, London, United Kingdom Hybrid / WFH Options
Future Talent Group
culture by leading knowledge-sharing sessions and supporting issue resolution. Skills Strong grounding in SRE principles and operational best practices. Proficient with observability tools (Prometheus, Grafana, OTEL, Cloudwatch) and telemetry pipelines. Solid programming skills in Python and/or Go; Java experience a plus. Hands-on AWS expertise (EKS, SQS More ❯