Southampton, Hampshire, United Kingdom Hybrid / WFH Options
NICE
you also have: Hands-on experience of working with large Kubernetes Cluster. Certification will be an added plus. Working experience of Grafana Observability Suite (Loki, Mimir, Tempo). Administration and/or development experience of standard monitoring and automation tools such as Splunk, Datadog, Pagerduty, Rundeck. Familiarity with configuration More ❯
to learn more about this opening. Desired Skills and Experience Linux, Ubuntu, Python, Bash, Jenkins, Bitbucket, Jira, Terraform, Ansible, Helm, Argo CD, Prometheus, Grafana, Loki, VMWare, Docker, PXE, IP networks, Kubernetes, AWX, APT, AWS, Azure Darwin Recruitment is acting as an Employment Agency in relation to this vacancy. More ❯
to learn more about this opening. Desired Skills and Experience Linux, Ubuntu, Python, Bash, Jenkins, Bitbucket, Jira, Terraform, Ansible, Helm, Argo CD, Prometheus, Grafana, Loki, VMWare, Docker, PXE, IP networks, Kubernetes, AWX, APT, AWS, Azure Darwin Recruitment is acting as an Employment Agency in relation to this vacancy. More ❯
efficiency. Technical Expertise Observability and SRE Practices: In-depth understanding of observability and Site Reliability Engineering practices. Familiarity with tools in the LGTM stack (Loki, Grafana, Tempo, Mimir) or equivalent observability platforms. Containerisation: Strong experience building and managing containerised applications, effectively leveraging container orchestration platforms such as Kubernetes. Cloud More ❯
of containerization for applications and their subsequent orchestration within Kubernetes environments. Experience working on at least one monitoring/observability stack (Datadog, ELK, Splunk, Loki, Grafana). Strong knowledge of Unix or Linux Strong communication skills to collaborate with various stakeholders Able to work independently in a fast-paced More ❯
Cloudfront, Route53, Workspace, ) Experience with security standards (PCI-DSS ISO27001, ) Focus on automation Excellent communication skills in English Nice to have Experience with ELK, Loki Docker, Kubernetes Experience in Nosql Experience in Message Bus systems is nice to have (rabbitmq/activemq/) Knowledge in Rundeck, Jfrog Artifactory, Hashicorp More ❯
in the following: Observability and SRE Practices: In-depth understanding of observability and Site Reliability Engineering practices. Familiarity with tools in the LGTM stack (Loki, Grafana, Tempo, Mimir) or equivalent observability platforms. Containerisation: Strong experience building and managing containerised applications, effectively leveraging container orchestration platforms such as Kubernetes. Cloud More ❯
efficiency. Technical Expertise Observability and SRE Practices: In-depth understanding of observability and Site Reliability Engineering practices. Familiarity with tools in the LGTM stack (Loki, Grafana, Tempo, Mimir) or equivalent observability platforms. Containerisation: Strong experience building and managing containerised applications, effectively leveraging container orchestration platforms such as Kubernetes. Cloud More ❯
efficiency. Technical Expertise Observability and SRE Practices: In-depth understanding of observability and Site Reliability Engineering practices. Familiarity with tools in the LGTM stack (Loki, Grafana, Tempo, Mimir) or equivalent observability platforms. Containerisation: Strong experience building and managing containerised applications, effectively leveraging container orchestration platforms such as Kubernetes. Cloud More ❯
/CD automation for containerized applications. Strong knowledge of Kubernetes networking (CNI, ingress controllers, service meshes). Hands-on experience with observability tools (Prometheus, Loki, Open Telemetry, Azure Monitor). Proficiency in Infrastructure as Code (Terraform, Bicep, Pulumi). Security expertise in Kubernetes, including RBAC, pod security policies, network More ❯
and be an autonomous, proactive, confident, credible, and persuasive team player. Experience Required: Expertise with monitoring and alerting platforms, such as ELK, DataDog, Grafana, Loki, etc. Solid understanding of monitoring and alerting best practices. Previous experience as DevOps/Platform Engineer or SRE. Expertise with IaC tooling (Terraform) and More ❯
a focus on GitHub Actions and GitOps tooling such as Flux, Argo, etc. Experience with monitoring and logging tools such as Splunk, Prometheus, Grafana, Loki, Jaeger. Able to develop scripts in Python and/or Bash. Ability to create technical documentation in regard to incidents, etc. Experience in Operations More ❯
C/C++ Linux environment, with Azure Cloud Docker/k8s/helm Kubernetes cluster management Database management (PostgreSQL) Monitoring and alerting (Prometheus, Grafana, Loki) Desirable Distributed logging platforms Cache and data stores (Redis) Distributed message passing (Kafka, AMQP) Micro-services & Restful APIs If this role seems like a More ❯
C/C++ Linux environment, with Azure Cloud Docker/k8s/helm Kubernetes cluster management Database management (PostgreSQL) Monitoring and alerting (Prometheus, Grafana, Loki) Desirable Distributed logging platforms Cache and data stores (Redis) Distributed message passing (Kafka, AMQP) Micro-services & Restful APIs If this role seems like a More ❯
Your Role: Join the cloud transformation team, where you will take ownership of observability capabilities built on OpenTelemetry and the LGTM stack (Grafana, Mimir, Loki & Tempo). As a DevOps Engineer, you'll lead the way in improving observability for cloud-native applications migrated to AWS. Your contributions will More ❯
degree in Computer Science, Engineering, or a related field (or equivalent experience) Other preferred skills Startup experience Usage and integration of OpenTelemetry Grafana LGTM (Loki, Grafana, Tempo, Mimir (or Prometheus stack Worked with Microservices in a production environment Experience with at least 2 programming languages, preferably one of them More ❯
orchestration technologies such as Kubernetes Experience creating Helm Charts to deploy containerized services on Kubernetes Familiar with Log Aggregation and Management systems such as Loki Clearance: Applicants selected will be subject to a security investigation and may need to meet eligibility requirements for access to classified information; TS/ More ❯
ability to implement security controls at the infrastructure level Experience with monitoring and logging tools like DataDog or Grafana's observability stack (Prometheus, Tempo, Loki, Grafana) Familiarity with the open standard OpenTelemetry Excellent written and verbal communication skills, we're a collaborative team! PLEASE NOTE: Our engineering teams work More ❯
Terraform and maintain CI/CD pipelines Build and orchestrate containers using Docker and Kubernetes Implement monitoring and alerting with tools like Grafana and Loki Qualifications Experience in a DevOps or similar role, with strong AWS expertise Proficiency in Terraform, Kubernetes, and CI/CD pipeline development Skilled in More ❯
least 7 years of experience as a Site Reliability Engineer Required Skills Deep expertise with custom implementations of Prometheus, OpenTelemetry, and the LGTM stack (Loki, Grafana, Tempo), customized using Helm Charts Strong understanding of Kubernetes - experience building Custom Operators is essential Solid programming skills in Python and or Golang More ❯