pipelines for microservices and Kafka-related applications using tools like Drone Automate infrastructure provisioning using Terraform or Infrastructure-as-Code tools Build and maintain monitoring and alerting systems using Prometheus, Grafana, or AWS native monitoring tools like CloudWatch Collaborate with development and DevOps teams to design MSK and Kubernetes-based solutions Troubleshoot complex issues related to Kafka and container orchestration. … Infrastructure as Code (IaC) tools like Terraform Knowledge of container build and deployment automation using CI/CD pipelines Experience in observability tools for both MSK and Kubernetes, including Prometheus, Grafana, and AWS CloudWatch for metrics and logs Deep understanding of Kafka and Kubernetes security practices, including network policies and IAM roles Experience with Vault Strong analytical and troubleshooting skills More ❯
London, England, United Kingdom Hybrid / WFH Options
Government Digital and Data
managing, and scaling applications on this platform is highly desirable. This includes understanding of OpenShift's architecture, its project and application concepts, and its command-line client. Proficiency with Prometheus and Terraform: Experience with Prometheus for monitoring and alerting purposes is desirable. Familiarity with Terraform for infrastructure as code (IaC) to provision and manage any cloud, infrastructure, or service is More ❯
technologies, and network management. Proficiency in scripting and automation (e.g., Bash, Python, PowerShell). Familiarity with CI/CD pipelines and deployment automation. Experience with environment monitoring tools (e.g., Prometheus, Nagios, Datadog). Knowledge of security best practices and compliance standards in IT environments. Excellent problem-solving, troubleshooting, and analytical skills. Strong communication skills, with the ability to collaborate across More ❯
principles in production environments. Desirable (Not Essential): Exposure to Kubernetes or other container orchestration tools. Experience with AWS monitoring and logging tools (e.g., CloudWatch). Familiarity with tools like Prometheus, Grafana, or the ELK Stack. Understanding of networking concepts or configuration management tools (Ansible, Chef, Puppet). Awareness of Agile methodologies. More ❯
tools (Terraform, Bicep, CloudFormation). Experience with CI/CD tools and containerization (Docker, Kubernetes, EKS, AKS). Scripting skills in Bash, Python, PowerShell, etc. Familiarity with monitoring tools (Prometheus, Grafana, ELK, Azure Monitor). Jira administration skills. Proactive learning attitude and excellent communication skills. We offer flexible working, 35 days holiday, pension, health support, development courses, and other benefits. More ❯
of experience in DevOps, SRE, or platform engineering roles. Experience with software development (at least Python, Git - Golang or Rust additionally appreciated) Experience with an observability stack such as Prometheus, VictoriaMetrics, Vector, Elastic stack, Grafana, and AlertManager. Experience with operating highly distributed applications at scale in Kubernetes. Experience with system administration and troubleshooting (Bash, Linux, Containerization) Excellent written and verbal More ❯
the migration process where possible. Experience in versioning and testing migration scripts to ensure smooth transitions with minimal service interruption. Monitoring & Troubleshooting: Strong experience with database monitoring tools (e.g., Prometheus, Grafana, New Relic, Percona Monitoring and Management) for tracking performance metrics and ensuring database health. Expertise in diagnosing and resolving performance issues related to replication, slow queries, schema design, and … query performance. Hands-on experience with upgrading MongoDB versions and migrating data between MongoDB clusters or from on-premises to cloud environments. Additional experience: Experience with monitoring systems (Zabbix, Prometheus) Containerisation and Orchestration : Familiarity with deploying and managing databases in containerized environments (e.g., Docker, Kubernetes). Experience of JIRA and Confluence Openness to learn the required technologies. An interest in More ❯
Expertise in scripting and automation using Python, Bash, or PowerShell. Solid understanding of networking, security, and system administration within cloud environments. Experience with monitoring and logging tools such as Prometheus, Grafana, ELK Stack, or AWS CloudWatch. Knowledge of version control systems like Git and collaboration tools like Jira. Please reach out ASAP to nick.oldridge@areti.io for more information! Areti Group More ❯
and Kubernetes services (EKS, GKE, AKS). Understanding of infrastructure-as-code (IaC) tools such as Terraform or CloudFormation. What makes you stand out? Experience with monitoring tools like Prometheus, Grafana, or Datadog. Knowledge of centralized logging systems like Fluentd, Logstash, or Loki. Experience with CI/CD pipelines using Jenkins, GitLab, or similar tools, and scripting languages such as More ❯
or Azure is a plus). Deep understanding of Container Orchestration technologies such as Kubernetes and Docker . Proficiency in monitoring and logging tools including: Datadog , Splunk , Dynatrace , AppDynamics , Prometheus , Grafana , ELK Stack , CloudWatch , Gremlin , ThousandEyes . Experience with Terraform , Jenkins , GitLab CI , PostgreSQL , Redis , and Kong API Gateway . Solid understanding of networking , security best practices , and infrastructure automation More ❯
Slough, England, United Kingdom Hybrid / WFH Options
JR United Kingdom
as Terraform, Vagrant, and scripting languages like Shell, Python, and PowerShell. Manage containerized applications with Docker to support streamlined deployment and environment consistency. Implement monitoring and alerting solutions with Prometheus, Grafana, and AlertManager to proactively address system performance issues. Promote DevOps best practices in automation, security, and agile delivery to improve operational efficiency and team productivity. Essential Experience Hands-on More ❯
6+ years of Linux system administration and engineering experience in performance-critical environments Proficiency in Python and bash Scripting, with hands-on Ansible experience Familiarity with observability tools like Prometheus, Grafana, and ELK Infrastructure-as-code experience with Terraform and CI/CD pipelines Proven ability to resolve complex system-level issues and performance challenges Knowledge of container orchestration tools More ❯
technologies, and network management. Proficiency in scripting and automation (e.g., Bash, Python, PowerShell). Familiarity with CI/CD pipelines and deployment automation. Experience with environment monitoring tools (e.g., Prometheus, Nagios, Datadog). Knowledge of security best practices and compliance standards in IT environments. Excellent problem-solving, troubleshooting, and analytical skills. Strong communication skills, with the ability to collaborate across More ❯
Automation: Design, implement, and maintain automated infrastructure provisioning using Python, Ansible, and Typescript CDK. Monitoring and Alerting: Set up systems to detect and address issues proactively, using tools like Prometheus/Thanos, Grafana Cloud, and Loki. Database Management: Manage hundreds of PostgreSQL databases, including performance tuning, backups, and disaster recovery, both on-premise and in AWS. Collaboration: Work with cross More ❯
like AWS, Azure, or GCP, and their services for scalable, resilient systems. Expertise in containerization technologies (e.g., Docker, Kubernetes) and orchestration tools. Knowledge of monitoring and logging tools (e.g., Prometheus, Grafana, ELK Stack) for maintaining system health and performance. Ability to lead and mentor junior engineers in reliability and system optimization best practices. Excellent communication skills for effective collaboration with More ❯
Terraform creating Infrastructure as Code. Strong scripting skills in Bash, Python, or similar languages. Deep understanding of CI/CD tools like Jenkins. Solid knowledge of system monitoring tools (Prometheus, Grafana, etc.). What We Offer At MrQ, we take pride in providing an array of fantastic benefits to our valued team members. Enjoy a competitive salary package that recognizes More ❯
High Wycombe, England, United Kingdom Hybrid / WFH Options
JR United Kingdom
like GitHub Actions, CircleCI, or similar. Experience with messaging systems (RabbitMQ, Kafka, etc.) and event-driven architectures. Proficiency in infrastructure as code (Terraform preferred). Familiarity with monitoring stacks (Prometheus, Grafana, ELK, etc.) and system tuning. Security-conscious mindset; experience implementing controls in regulated or financial environments is a plus. Excellent problem-solving skills and a proactive attitude. Strong communication More ❯
Bournemouth, England, United Kingdom Hybrid / WFH Options
JR United Kingdom
tools like GitHub Actions, CircleCI, or similar. ● Experience with messaging systems (RabbitMQ, Kafka, etc.) and event-driven ● Proficiency in infrastructure as code (Terraform preferred). ● Familiarity with monitoring stacks (Prometheus, Grafana, ELK, etc.) and system tuning. ● Security-conscious mindset; experience implementing controls in regulated or financial environments is a plus. ● Excellent problem-solving skills and a proactive attitude. ● Strong communication More ❯
Reading, England, United Kingdom Hybrid / WFH Options
JR United Kingdom
tools like GitHub Actions, CircleCI, or similar. ● Experience with messaging systems (RabbitMQ, Kafka, etc.) and event-driven ● Proficiency in infrastructure as code (Terraform preferred). ● Familiarity with monitoring stacks (Prometheus, Grafana, ELK, etc.) and system tuning. ● Security-conscious mindset; experience implementing controls in regulated or financial environments is a plus. ● Excellent problem-solving skills and a proactive attitude. ● Strong communication More ❯
Slough, England, United Kingdom Hybrid / WFH Options
JR United Kingdom
tools like GitHub Actions, CircleCI, or similar. ● Experience with messaging systems (RabbitMQ, Kafka, etc.) and event-driven ● Proficiency in infrastructure as code (Terraform preferred). ● Familiarity with monitoring stacks (Prometheus, Grafana, ELK, etc.) and system tuning. ● Security-conscious mindset; experience implementing controls in regulated or financial environments is a plus. ● Excellent problem-solving skills and a proactive attitude. ● Strong communication More ❯
Glasgow, Scotland, United Kingdom Hybrid / WFH Options
JR United Kingdom
tools like GitHub Actions, CircleCI, or similar. ● Experience with messaging systems (RabbitMQ, Kafka, etc.) and event-driven ● Proficiency in infrastructure as code (Terraform preferred). ● Familiarity with monitoring stacks (Prometheus, Grafana, ELK, etc.) and system tuning. ● Security-conscious mindset; experience implementing controls in regulated or financial environments is a plus. ● Excellent problem-solving skills and a proactive attitude. ● Strong communication More ❯
Brighton, England, United Kingdom Hybrid / WFH Options
JR United Kingdom
tools like GitHub Actions, CircleCI, or similar. ● Experience with messaging systems (RabbitMQ, Kafka, etc.) and event-driven ● Proficiency in infrastructure as code (Terraform preferred). ● Familiarity with monitoring stacks (Prometheus, Grafana, ELK, etc.) and system tuning. ● Security-conscious mindset; experience implementing controls in regulated or financial environments is a plus. ● Excellent problem-solving skills and a proactive attitude. ● Strong communication More ❯
Bath, England, United Kingdom Hybrid / WFH Options
JR United Kingdom
tools like GitHub Actions, CircleCI, or similar. ● Experience with messaging systems (RabbitMQ, Kafka, etc.) and event-driven ● Proficiency in infrastructure as code (Terraform preferred). ● Familiarity with monitoring stacks (Prometheus, Grafana, ELK, etc.) and system tuning. ● Security-conscious mindset; experience implementing controls in regulated or financial environments is a plus. ● Excellent problem-solving skills and a proactive attitude. ● Strong communication More ❯
Aberdeen, Scotland, United Kingdom Hybrid / WFH Options
JR United Kingdom
tools like GitHub Actions, CircleCI, or similar. ● Experience with messaging systems (RabbitMQ, Kafka, etc.) and event-driven ● Proficiency in infrastructure as code (Terraform preferred). ● Familiarity with monitoring stacks (Prometheus, Grafana, ELK, etc.) and system tuning. ● Security-conscious mindset; experience implementing controls in regulated or financial environments is a plus. ● Excellent problem-solving skills and a proactive attitude. ● Strong communication More ❯
Cheltenham, England, United Kingdom Hybrid / WFH Options
JR United Kingdom
tools like GitHub Actions, CircleCI, or similar. ● Experience with messaging systems (RabbitMQ, Kafka, etc.) and event-driven ● Proficiency in infrastructure as code (Terraform preferred). ● Familiarity with monitoring stacks (Prometheus, Grafana, ELK, etc.) and system tuning. ● Security-conscious mindset; experience implementing controls in regulated or financial environments is a plus. ● Excellent problem-solving skills and a proactive attitude. ● Strong communication More ❯