Code (IaC) using Helm and Ansible Write scripts in Bash and Python to support infrastructure operations Configure and maintain containerized services using Docker Monitor system health using Prometheus and Grafana Support seamless Git-based workflows and code management Resolve issues related to networking, ingress, storage, and performance within the Kubernetes stack Enforce DevSecOps best practices across environments Required Skills: 8+ … Proficient with Linux command-line interface Scripting with Bash and/or Python Strong Kubernetes experience (troubleshooting, networking, storage) Hands-on with Docker, Helm, Ansible, Git, and Prometheus/Grafana Experience implementing Infrastructure as Code (IaC) solutions Preferred Qualifications: Experience with Atlassian tools (JIRA, Confluence) Familiarity with CI/CD pipelines and secure deployment practices CKA (Certified Kubernetes Administrator) certification More ❯
using Infrastructure as Code (Ansible, Terraform). Manage Kubernetes environments including virtualization in K8s. Leverage Istio (Service Mesh) for service-level traffic management and security. Monitor systems with Prometheus, Grafana, and Loki; establish centralized logging and alerting. Ensure infrastructure security through PKI, firewalls (hardware-based), and compliance best practices. Manage storage, Linux systems, and basic to advanced Windows environments. Collaborate … a DevOps Engineer or similar role. Strong understanding of: Kubernetes (including virtualization in K8s) Networking (basic and advanced) Istio, PKI, Linux, Windows (basic and advanced) Ansible, GitLab, Terraform Prometheus, Grafana, Loki Firewall hardware and storage solutions Hands-on experience with cloud platforms, preferably Google Cloud Platform (GCP). Possession of GCP certifications: L100, L200, and L300 - if not yet obtained More ❯
in developing CI/CD pipelines, and knowledge of DevOps tools including but not limited to OpenShift, Kubernetes, Docker, Jenkins, Git, and Terraform Experience with technologies such as Dynatrace, Grafana, DataDog, Elastic, Kafka, PostgreSQL, and shell scripting will be a bonus Proficiency in English , Dutch and/or French will be beneficial Benefits: Permanent role within the company Flexible working More ❯
and using REST and/or RPC APIs Desired Skills Experience with Messaging Frameworks such as Kafka, ActiveMQ, and RabbitMQ Experience with tools used for metrics visualization such as Grafana and Kibana Experience with Git Source Control System Experience with the Atlassian Tool Suite (JIRA, Confluence More ❯
across environments 5+ years of experience troubleshooting, triaging, and resolving issues for CI/CD pipeline failures or slowness Experience with some of the following: Kubernetes, ArgoCD, Helm, Prometheus, Grafana, and Python or Golang Experience developing enterprise cloud-native solutions Experience working with application development teams across the software development life cycle and creating solutions to complex problems within a More ❯
Birmingham, West Midlands (County), United Kingdom
Syntax Consultancy Ltd
Perl, JAVA) and automation skills. Thorough knowledge of Jenkins and pipeline using Groovy script. Experience with Docker containers and Amazon Linux 2023 AMI. Experience with system monitoring tools (e.g., Grafana, Alert Manager, Prometheus, and Node exporter). Experience with Git, Jira, Confluence, and ServiceNow for incident and change management. Desired Skills and Experience: Hands-on DevOps delivery experience working on More ❯
role Hands-on expertise with AWS, Docker, Kubernetes, and Terraform Strong understanding of CI/CD tools and modern development workflows Experience implementing monitoring and observability tools (e.g., Prometheus, Grafana) Solid grasp of cloud security, IAM, and best practices for operational excellence Proactive problem solver with excellent troubleshooting skills Strong communication and collaboration skills, ideally within Agile teams Comfortable working More ❯
best practices and automation tools such as Ansible and SaltStack Task Desired Skills Experience developing with programming languages including Go and Java Experience with metrics visualization technologies such as Grafana and Kabana Experience using the Atlassian product suite (Confluence, Jira) Experience with Git Source Control System EQUAL OPPORTUNITY EMPLOYER VETERANS DISABLED More ❯
role Hands-on expertise with AWS, Docker, Kubernetes, and Terraform Strong understanding of CI/CD tools and modern development workflows Experience implementing monitoring and observability tools (e.g., Prometheus, Grafana) Solid grasp of cloud security, IAM, and best practices for operational excellence Proactive problem solver with excellent troubleshooting skills Strong communication and collaboration skills, ideally within Agile teams Comfortable working More ❯
enterprise solutions. Experience with multiprocessing, async I/O, and performance profiling. Unit testing, performance testing, and BDD. Understanding of OAuth 2.0 and secure authorization. Proficiency with observability tools (Grafana, Prometheus, etc.). DevOps and CI/CD (Jenkins, GitOps). Strong communication and collaboration skills. Understanding of deep learning and ML frameworks (TensorFlow, PyTorch). Secure coding practices and More ❯
complex network solutions (routing, VLANs, firewalls, VPNs) Connectivity between on-premises Vmware and cloud environments Network security best practices and segmentation Experience with monitoring/logging tools (e.g., Prometheus, Grafana, Splunk) Scripting experience (e.g., PowerShell, Bash, Python) Experience with version control (Git) Experience with automation and orchestration platforms .Experience of working in an Agile Environment More ❯
team success • Strong Linux systems administration skills • Infrastructure as Code tools like Terraform and Ansible • Basic knowledge of VLAN networking and Bash scripting • Familiarity with monitoring tools like Prometheus, Grafana, and Sensu Go • Experience with Kubernetes, OpenShift, and KubeVirt Workplace Options: This position is onsite or hybrid/flex as desired. While on-site, you will be a part of More ❯
AWS, Azure, or GCP, and their services for scalable, resilient systems. Expertise in containerization technologies (e.g., Docker, Kubernetes) and orchestration tools. Knowledge of monitoring and logging tools (e.g., Prometheus, Grafana, ELK Stack) for maintaining system health and performance. Ability to lead and mentor junior engineers in reliability and system optimization best practices. Excellent communication skills for effective collaboration with cross More ❯
Code) principles and automation tools such as Ansible, SaltStack, Puppet In-depth experience with trouble-shooting large Linux Clusters Demonstrated experience using system monitoring tools such as Prometheus/Grafana Experience with the Atlassian Tool Suite (JIRA, Confluence) Experience using Git for version control Position Desired Skills Experience using Helm to deploy Kubernetes applications Experience using GitLab CI/CD More ❯
as log ingestion and communication issues. Design and develop scalable, robust, and high-performance data pipelines and data storage solutions. Develop and maintain observability frameworks using tools like Kibana, Grafana, or similar Work with cross-functional teams to define observability and search requirements. Scale, script and maintain our development and production platform foundation with AWS and GCP Stay updated on More ❯
in agile environments using Scrum and Kanban. Engaging with high-level stakeholders internally and externally. Technologies such as GitLab, Jenkins, Kubernetes, Docker, Terraform, Packer, Vault, Serverless, Elastic Stack, Prometheus, Grafana, Artifactory, Nexus. Due to the sector's nature, applicants should hold high-level security clearance, which requires being a British passport holder and having lived permanently in the UK for More ❯
Manchester, Lancashire, United Kingdom Hybrid / WFH Options
Embarcaderomediagroup
DevOps, YAML-based) with security scanning and progressive delivery Supporting AKS clusters and Azure services (SQL, Cosmos DB, ADF, Functions, Logic Apps, etc.) Improving monitoring and alerting with Datadog, Grafana, ELK, and proactive failure detection Participating in the on-call rota and leading incident response workflows and blameless postmortems Coaching engineers, upskilling teams, and contributing to a culture of continuous More ❯
Enterprise repository management systems (Artifactory) Workflow management and collaboration tools (Jira, Confluence, Google Suite) Cloud infrastructure (AWS) Monitor, debug and improve system performance and reliability using tools like Datadog, Grafana, or OpenSearch Update and maintain the development experience via automated pipelines that includes timely feedback and a seamless path to release-level quality. Maintain and administer cloud infrastructure. Troubleshoot issues More ❯
infrastructure Other duties as needed About You 5+ years' experience in Site Reliability Engineer roles Expert+ level Linux administration, scripting, and troubleshooting Demonstrable knowledge of Observability tools (Prometheus/Grafana, New Relic, Splunk, DataDog) Comprehensive experience with AWS (Amazon Web Services) and its core capabilities (VPC, EC2, ECS, Route53, Fargate, ALB/NLB distributions, etc) Extensive experience with cloud automation More ❯
infrastructure Other duties as needed About You 5+ years' experience in Site Reliability Engineer roles Expert+ level Linux administration, scripting, and troubleshooting Demonstrable knowledge of Observability tools (Prometheus/Grafana, New Relic, Splunk, DataDog) Comprehensive experience with AWS (Amazon Web Services) and its core capabilities (VPC, EC2, ECS, Route53, Fargate, ALB/NLB distributions, etc) Extensive experience with cloud automation More ❯
troubleshooting and problem solving skills • A passion for learning new technologies and innovation Desirable: • Certifications on Amazon Web Services, including Solutions Architect, Developer, Google Cloud or Azure • Amazon Managed Grafana • JetBrains TeamCity • Google Apps Script • Agile Development #LI-JS2 Together, as owners, let's turn meaningful insights into action. Life at CGI is rooted in ownership, teamwork, respect and belonging. More ❯
Washington, Washington DC, United States Hybrid / WFH Options
ESimplicity Inc
in relational and non-relational databases like SQL, MySQL, NoSQL, PostgreSQL, MongoDB or similar Experience with Redis Experience with benchmarking, monitoring, and performance engineering applications with tools such as Grafana, Sentry, and Prometheus Knowledge of AuthN and AuthZ systems, including Active Directory, Okta, and AWS IAM Policies/Roles using attribute-based access controls Knowledge of automated end-to-end More ❯
distributed delivery models Additional skills that are a plus: Programming languages such as Scala, Rust, Go, Angular, React, Kotlin Database management with PostgreSQL Experience with ElasticSearch, observability tools like Grafana and Prometheus What this role can offer Opportunity to deepen understanding of AI and Data Science applications Mentorship and support from colleagues to apply your talents Career growth and development More ❯