Security & Compliance: Implement secure cloud architectures, automate security controls, and ensure compliance with GDS and NCSC security standards. Monitoring & Logging: Use tools like Prometheus, Grafana, ELK Stack, or AWS CloudWatch to monitor system performance and health, ensuring uptime and identifying issues proactively. Automation Support: Assist software engineers by automating repetitive More ❯
at least one programming language that compiles to machine code such as Rust, C++, or Go. Expert knowledge of monitoring technologies such as Prometheus, Grafana, and PagerDuty. Expert knowledge of deployment technologies such as Pulumi or Terraform. Expert knowledge of Kubernetes. Responsibilities: Improving our observability by adding/adjusting metrics. More ❯
Cambridge, Cambridgeshire, United Kingdom Hybrid / WFH Options
AI Tech Suite
at least one programming language that compiles to machine code such as Rust, C++, or Go. Expert knowledge of monitoring technologies such as Prometheus, Grafana, and PagerDuty. Expert knowledge of deployment technologies such as Pulumi or Terraform. Expert knowledge of Kubernetes. Responsibilities: Improving our observability by adding/adjusting metrics. More ❯
Jenkins, GitLab CI). • Solid understanding of networking concepts, security principles, and identity management in cloud environments. • Experience with monitoring and observability tools (Prometheus, Grafana, ELK stack). • Scripting and automation skills (Python, Bash, PowerShell). • Knowledge of cost optimization strategies for cloud resources. • Ability to work collaboratively in cross More ❯
or PowerShell for automation. Understanding of AWS networking concepts, including VPCs, subnets, and security groups. Experience with monitoring and logging solutions such as Prometheus, Grafana, ELK Stack, or AWS CloudWatch. Familiarity with Zero Trust security models and best practices for securing cloud workloads. Ability to troubleshoot complex infrastructure issues and More ❯
london, south east england, United Kingdom Hybrid / WFH Options
LHH
or PowerShell for automation. Understanding of AWS networking concepts, including VPCs, subnets, and security groups. Experience with monitoring and logging solutions such as Prometheus, Grafana, ELK Stack, or AWS CloudWatch. Familiarity with Zero Trust security models and best practices for securing cloud workloads. Ability to troubleshoot complex infrastructure issues and More ❯
understanding of CI/CD pipelines and tools (e.g., Github CI, GitLab CI, CircleCI, Jenkins). Familiarity with monitoring and logging tools (e.g., Prometheus, Grafana, ELK stack). Strong scripting skills (e.g., Bash, Python) for automation tasks. Excellent problem-solving skills and attention to detail. Strong communication and collaboration skills. More ❯
understanding of CI/CD pipelines and tools (e.g., Github CI, GitLab CI, CircleCI, Jenkins). Familiarity with monitoring and logging tools (e.g., Prometheus, Grafana, ELK stack). Strong scripting skills (e.g., Bash, Python) for automation tasks. Excellent problem-solving skills and attention to detail. Strong communication and collaboration skills. More ❯
end to end delivery of solutions. Expert knowledge of SRE fundamentals and a commitment to best practice Fluency with common observability tooling like Prometheus, Grafana, OTEL and Cloudwatch Experience analysing and building data telemetry, querying (PromQL), modelling, pipelines and dashboards to provide concise, focused insights and alerts for distributed systems More ❯
NSGs, ASGs), and governance policies to ensure compliance and risk mitigation. Monitoring & Logging : Experience with Azure Monitor, Application Insights, Log Analytics, and Prometheus/Grafana for observability and performance monitoring. Scripting & Automation : Strong scripting skills in PowerShell, Bash, and Python , along with automation frameworks like Ansible . Collaboration & Problem-Solving More ❯
Tracking - e.g. JIRA, Confluence Monitoring, Logging, and Performance Tuning - Skills in monitoring systems' performance and logs to ensure uptime and identify performance bottlenecks - e.g. Grafana, Datadog Networking Concepts - Knowledge in TCP/IP, DNS, VPN, load balancing, and firewalls Security Best Practices - Implementing security in DevOps (e.g., IAM policies, network More ❯
Greater Bristol Area, United Kingdom Hybrid / WFH Options
Searchability NS&D
CI/CD) and automation tools like Terraform and Ansible Programming : Proficiency in Python, Go, or Ruby Monitoring & Observability : Hands-on experience with Prometheus, Grafana, ELK Stack, or similar technologies Core Attributes A passion for solving complex technical challenges in high-availability production environments Strong communication and collaboration skills, able More ❯
and Orchestration: Knowledge of containerization and orchestration technologies (e.g., Docker, Kubernetes) Monitoring and Logging: Experience with monitoring and logging tools like DataDog, Prometheus, or Grafana Data Engineering Skills: Knowledge of event streaming platforms (e.g., Apache Kafka) and SQL database management Strong Communication and Collaboration: Excellent communication skills and the ability More ❯
languages like Python, Bash, or Shell. Experience with Docker and Kubernetes for container orchestration and management. Familiarity with monitoring tools like CloudWatch, Prometheus, or Grafana, and best practices in security for cloud environments. Strong communication skills and experience working with development, operations, and security teams. Preferred Qualifications AWS Certified DevOps More ❯
/CD pipelines using the likes of GitLab, Jenkins, CircleCI, CodeBuild etc. Familiarity with scripting (Bash or Python). Monitoring and alerting tools - Prometheus, Grafana or Splunk, ELK. We're looking for someone who wants to progress their career into the DevOps arena. Submit your CV now to be considered. More ❯
containerization for applications and their subsequent orchestration within Kubernetes environments. Experience working on at least one monitoring/observability stack (Datadog, ELK, Splunk, Loki, Grafana). Strong knowledge of Unix or Linux Strong communication skills to collaborate with various stakeholders Able to work independently in a fast-paced environment Detail More ❯
north yorkshire, yorkshire and the humber, United Kingdom Hybrid / WFH Options
Queen Square Recruitment
. Knowledge of DevSecOps best practices and compliance standards (e.g., ISO 27001, GDPR, NIST). Proficiency in monitoring tools and centralized logging (ELK, Prometheus, Grafana, AWS CloudWatch). Strong scripting skills (e.g., Python, Bash) for automation and tool integration. Demonstrated ability to lead DevOps teams and define scalable best practices. More ❯
Perl, Java, C++ CICD tools such as Azure DevOps, GitHub Actions, Gitlab, Jenkins, TeamCity Scripting languages such as PowerShell, bash Observability/Monitoring: Prometheus, Grafana, Splunk Containerisation tools such as Docker, K8S, OpenShift, EC, containers Hosting technologies such as IIS, nginx, Apache, App Service, LightSail Analytical and creative approach to More ❯
or other build tools; Ansible or other IT Automation/software provisioning tools; JIRA, Confluence; * Experience in monitoring/reporting tools such as Splunk, Grafana/Prometheus etc * Experience in Agile practices * Working knowledge of environment monitoring tools such as GCO, NewRelic, Prometheus, Grafana. * Collaboration Skills: Proactive can-do attitude More ❯
or other build tools; Ansible or other IT Automation/software provisioning tools; JIRA, Confluence; * Experience in monitoring/reporting tools such as Splunk, Grafana/Prometheus etc * Experience in Agile practices * Working knowledge of environment monitoring tools such as GCO, NewRelic, Prometheus, Grafana. * Collaboration Skills: Proactive can-do attitude More ❯
management tools (e.g., Ansible, Puppet, Chef), containerization and orchestration (e.g., Docker, Kubernetes) Experience with Cloud services (e.g., AWS, Azure, GCP), Monitoring tools (e.g., Prometheus, Grafana). Infrastructure as Code (IaC) tools (e.g., Terraform, CloudFormation) About the Team J.P. Morgan is a global leader in financial services, providing strategic advice and More ❯
using Infrastructure as Code (IaC) tools such as Terraform (preferred), Ansible, or CloudFormation. Experience with monitoring, observability and logging tools such as DataDog, Prometheus, Grafana, or similar. Proven track record of maintaining highly-available and performant production environments. Ability to identify and implement effective mitigation strategies and operational playbooks. Useful More ❯
understanding of modern infrastructure and site reliability engineering practice, including Infrastructure-as-code tools (e.g. Terraform, Ansible ) and metrics and observability tools (e.g. Prometheus, Grafana ). Strong understanding of modern DevOps practice, including DevOps stacks (e.g. Jenkins, GitLab, CircleCI ). Cloud experience (e.g. AWS, Google Cloud, Azure, Kubernetes). Familiar More ❯