Swindon, Wiltshire, South West England, United Kingdom
Global Technology Solutions Ltd
the provisioning and management of systems using Infrastructure as Code (IaC). Support containerisation and orchestration technologies such as Docker and Kubernetes. Monitor platform performance, availability, and security using modern observability tools. Collaborate with DevOps, security, and application teams to ensure seamless and secure delivery pipelines. Implement and maintain CI/CD pipelines and deployment automation. Manage secure configurations, patching, and …
tools, including experience with some of the following tools: GitLab CI, GitHub Actions, Concourse CI, Jenkins X, TeamCity, Artifactory, etc.; Infrastructure provisioning (at least one of Terraform, Ansible, CloudFormation); Observability and Application monitoring (ELK stack, TICK stack, Grafana, Prometheus, New Relic, Datadog, etc.); Networking concepts - Bastion hosts, Reverse Proxies, Load Balancing, TLS, etc. Key Soft Skills required: Naturally resilient, tenacious More ❯
and firewalls. Experience with load balancers (F5, HAProxy, Nginx) and network monitoring tools. Experience in DNS management and troubleshooting. Experience in network security best practices. Proficiency in monitoring and observability tools (Prometheus, Grafana, Splunk). Proficiency in at least one scripting language (Python, Bash) for automation. Experience with CI/CD pipeline management and DevOps practices. Strong understanding of disaster recovery …
City of London, London, United Kingdom Hybrid / WFH Options
Amber Labs
working in Agile teams using tools like Git, Jira, and Confluence. Eligible for SC and NPPV3 clearance. Desirable: Container orchestration with Kubernetes; HashiCorp tools: Vault, Consul, Packer; Monitoring and observability with Grafana, Prometheus, or similar; Familiarity with cloud networking, VPCs, NAT Gateways, security groups, etc. Personal Attributes: Proactive and self-driven with a passion for technology. Strong problem-solving mindset …
ARM, or Pulumi. Experience in building secure applications and infrastructure. Strong communication skills, with the ability to convey and understand complex technical concepts clearly and concisely. SRE skills including observability and telemetry monitoring. Familiarity with the HashiCorp Suite (Packer, Terraform, Vault, Vagrant, Consul). Experience in containerization using Docker, Kubernetes, OpenShift, and Helm. Programming skills in languages such as Python More ❯
and firewalls. Experience with load balancers (F5, HAProxy, Nginx) and network monitoring tools. Experience in DNS management and troubleshooting. Experience in network security best practices. Proficiency in monitoring and observability tools (Prometheus, Grafana, Splunk). Proficiency in at least one scripting language (Python, Bash) for automation. Experience with CI/CD pipeline management and DevOps practices. Strong understanding of disaster More ❯
London, England, United Kingdom Hybrid / WFH Options
BlackRock, Inc
access to the best tools available. We combine problem-solving skills with software and systems engineering to take a proactive approach in building fault-tolerant and secure systems, improving observability and zealously automating away toil. In this role you will: Use your site reliability expertise to design, operate and support Preqin's infrastructure, middleware and internal services. Improving their performance More ❯
and high availability CI/CD Pipeline Development: Develop and maintain robust CI/CD pipelines for continuous integration and deployment of ML models and related infrastructure Monitoring and Observability: Build and maintain comprehensive monitoring and alerting systems for our ML infrastructure and models, leveraging tools like DataDog to ensure system health and performance Collaboration and Mentorship: Collaborate effectively with More ❯
operating system for effective troubleshooting activities Awareness of any cloud infrastructure principles (like AWS, GCP or OCI), understanding basic principles of secure software delivery is a plus Familiar with Observability tools like Grafana or Prometheus, understanding the importance of giving the correct visibility to our platforms and environments We highly value ownership and initiative with capabilities to drive projects independently More ❯
etc. Infrastructure as Code and CI/CD paradigms and systems such as: Ansible, Terraform, Jenkins, Bamboo, Concourse etc. Monitoring utilising products such as: Prometheus, Grafana, ELK, filebeat etc. Observability - SRE Big Data solutions (ecosystems) and technologies such as: Apache Spark and the Hadoop Ecosystem Edge technologies e.g. NGINX, HAProxy etc. Excellent knowledge of YAML or similar languages The following More ❯
Manual Tester (DV Security Clearance) Position Description Are you an experienced Test Analyst with a background in secure or classified programmes, ready to contribute to projects of national importance? Step into a role where you'll challenge the complex to More ❯
Consultant IRC250319 Job: IRC250319 Location: United Kingdom - London Designation: Senior Consultant Experience: 5-10 years Function: Engineering Skills: Cloud(Azure/AWS/GCP), Containers, DevOps Practices, Grafana, Kubernetes, Observability stack, SRE Management, Terraform Work Model: Hybrid We are seeking an experienced Platform Engineering leader with a hands-on engineering background, who can articulate the business benefits that Observability and … on the responsibility of handling client engagements from both technical and business perspectives. Requirements: We are ideally looking for someone with a strong background and experience in the following: Observability and SRE Practices: In-depth understanding of observability and Site Reliability Engineering practices. Familiarity with tools in the LGTM stack (Loki, Grafana, Tempo, Mimir) or equivalent observability platforms. Containerisation: Strong More ❯
and firewalls. Experience with load balancers (F5, HAProxy, Nginx) and network monitoring tools. Experience in DNS management and troubleshooting. Experience in network security best practices. Proficiency in monitoring and observability tools (Prometheus, Grafana, Splunk). Proficiency in at least one scripting language (Python, Bash) for automation. Experience with CI/CD pipeline management and DevOps practices. Strong understanding of disaster … performance. Experience with tools like df, du, lsblk, and fdisk for managing and troubleshooting file systems and disk partitions. Familiarity with tools like Prometheus and Grafana for monitoring and observability.
engineering teams to deploy faster and more confidently-without compromising stability or uptime. As the SRE Lead, you'll mentor a growing team of SREs, drive best practices in observability, automation, and incident management, and collaborate cross-functionally to ensure a seamless experience for both our internal teams and customers. What You'll Be Doing: Leadership & Strategy -Lead and grow … Proven experience in an SRE or DevOps leadership role. -Deep understanding of networking, containers (Docker, Kubernetes), and cloud infrastructure (AWS/GCP/Azure). -Strong skills in monitoring, observability, and alerting systems (Prometheus, Grafana, Datadog, etc.). -Proficiency with infrastructure-as-code tools like Terraform or Pulumi. -Experience with CI/CD pipelines and GitOps practices. -Excellent communication and More ❯
and firewalls. Experience with load balancers (F5, HAProxy, Nginx) and network monitoring tools. Experience in DNS management and troubleshooting. Experience in network security best practices. Proficiency in monitoring and observability tools (Prometheus, Grafana, Splunk). Proficiency in at least one scripting language (Python, Bash) for automation. Experience with CI/CD pipeline management and DevOps practices. Strong understanding of disaster … performance. Experience with tools like df, du, lsblk, and fdisk for managing and troubleshooting file systems and disk partitions. Familiarity with tools like Prometheus and Grafana for monitoring and observability. Seniority level: Not Applicable. Employment type: Full-time. Job function: Information Technology. Industries: Computer and Network Security.
London, England, United Kingdom Hybrid / WFH Options
ZipRecruiter
premises infrastructure to the cloud and understanding the challenges involved Familiarity with cloud security best practices, and access management (IAM), and encryption techniques Microsoft Azure certifications are a plus Observability Designing, implementing and day-to-day use of logging and monitoring tools to capture data for alerting and issue identification and resolution using DataDog, App Insights or similar tools. Designing … applications and infrastructure for observability, security, and reliability. Networking & Security Monitor and enhance network performance, ensuring high levels of security and scalability across all cloud environments. Enforce security best practices in AKS, including network policies, RBAC (Role-Based Access Control), and integration with Azure Active Directory Core Services Azure core services such as Azure Storage, including Blob, Azure VMs, Azure More ❯
WAF, CloudFront, API GW, AWS Organizations, S3, ECS, EKS, Route 53, ELBs, OpenShift, Kubernetes, Docker Languages: TypeScript, Python Security & Scanning: AWS Guardrails, Checkov, Prisma Cloud, OSV Scanner, SonarQube, Renovate Observability & Logging: CloudWatch, OpenSearch Operating System Management: RedHat Satellite, AMI lifecycle management, Ubuntu Landscape Testing Tools: Pytest, Jest, Cypress APIs/Microservices: RESTful APIs, API Gateway, containerised services Version Control: GitLab … to as Senior DevOps Engineer. On a typical day you will Design, build, and maintain scalable, high-quality software and platform systems Implement and manage CI/CD pipelines, observability, security automation, automated testing, and engineering standards Lead feature development from concept to production with focus on quality and performance Troubleshoot issues, ensuring resilience, reliability, and minimal user disruption Contribute More ❯
emergency events outside of your local time-zone. Here's what you need: Technical Expertise In-depth understanding of the Linux operating environment: kernel tuning, network stack tuning, system observability & instrumentation, and security & access management. Solid understanding of layer 2-7 networking fundamentals and the relationship between servers & services, and the transit of their packets through network hardware. In-depth … experience engineering and maintaining a private-cloud infrastructure: Bare-metal, vSphere, KVM, Kubernetes. Experience with tools like Ansible, Terraform, Docker, Kafka, Nexus Experience with observability platforms: InfluxDB, Prometheus, ELK, Jaeger, Grafana, Nagios, Zabbix Familiarity with Big Data tools: Hadoop, HDFS, Spark, HBase Ability to write code in Go, Python, Bash, or Perl for automation. Work Experience 5-7+ years More ❯
a fast-paced, dynamic environment. Previous experience working on large App/Data migrations engagements. Cloud Platforms and Technology Experience Core Skills: GCP – Networking, Security tool/Best Practices Observability - Operations suite, Logging, Monitoring, Alerting. Additional Skills: Good understanding of Linux OS. Bash, Scripting, Automation, Ansible, Networking, Security. Hands-on experience with DevOps Principles and Tools. Hands-on with Terraform More ❯
built on Solace PubSub+, ensuring high availability, optimal performance, and reliability across production and non-production environments. This includes working on incident response, capacity planning, WAN optimization, and system observability using tools like Prometheus and Grafana. Key Responsibilities: Administer and maintain Solace PubSub+ appliances and software brokers across environments (on-prem and cloud). Provide production support for messaging …
Go • Significant experience with AWS cloud infrastructure • Deep understanding of IaC tools: Terraform, Packer, CloudFormation • Proven leadership in multidisciplinary delivery teams • Skills in Databases: MongoDB/Atlas, Messaging: Kafka, Observability: Prometheus, Grafana, Splunk • Experience of working in a DevOps environment - favouring and implementing Continuous Integration & Deployment over manual processes. • Experience of designing, implementing, securing and supporting Unix/Linux based More ❯