Southampton, Hampshire, United Kingdom Hybrid / WFH Options
NICE
experience of Grafana Observability Suite (Loki, Mimir, Tempo). Administration and/or development experience of standard monitoring and automation tools such as Splunk, Datadog, Pagerduty, Rundeck. Familiarity with configuration management tools like Ansible, Puppet, or Chef. Certifications such as AWS Certified DevOps Engineer, Google Cloud Professional DevOps Engineer, or More ❯
IAT Certification; higher levels preferred. • Experience with serverless computing (AWS Lambda, Azure Functions, etc.). • Familiarity with logging and monitoring platforms like CloudWatch, Prometheus, Datadog, or Splunk. • Experience with CI/CD tools like Jenkins, GitHub Actions, or GitLab CI. • An adjudicated Counterintelligence Polygraph. Soft Skills: • Self-driven • Strong communication More ❯
modeling. Proficiency in cloud platforms (AWS, Azure, GCP) and associated reliability tools. Hands-on experience with monitoring and logging tools such as Prometheus, Grafana, Datadog, Splunk, or ELK stack. Familiarity with containerization and orchestration tools (Docker, Kubernetes). Strong understanding of distributed systems, fault tolerant design, and high availability architectures. More ❯
Kubernetes Proficiency in CI/CD using Jenkins, GitHub Actions, Azure DevOps CircleCl or similar systems Experience with monitoring and observability tools such as Datadog, Cloudwatch, Prometheus, Grafana, Splunk or similar Experience using scripting languages and configuration-as-code best practices for automation and to create end-to-end solutions More ❯
Azure, or Google Cloud Platform. • Security: Experience with tools for delivering SCA, SAST, DAST capabilities. • Monitoring and Logging: Proficiency with tools like Splunk, Dynatrace, Datadog, Prometheus, Grafana. • Version Control: Strong understanding of Git and version control practices. • Scripting: Skills in scripting languages like Bash, PowerShell, or Perl. • Containerization: Familiarity with More ❯
Azure, or Google Cloud Platform. • Security: Experience with tools for delivering SCA, SAST, DAST capabilities. • Monitoring and Logging: Proficiency with tools like Splunk, Dynatrace, Datadog, Prometheus, Grafana. • Version Control: Strong understanding of Git and version control practices. • Scripting: Skills in scripting languages like Bash, PowerShell, or Perl. • Containerization: Familiarity with More ❯
optimization. Configure and maintain cloud-based services and resources. Monitoring and Logging: Implement and maintain monitoring and logging systems (e.g., Prometheus, Grafana, ELK stack, Datadog). Set up alerts and notifications for critical system events. Analyze logs and metrics to identify and resolve performance issues. Automation and Scripting: Develop and More ❯
optimization. Configure and maintain cloud-based services and resources. Monitoring and Logging: Implement and maintain monitoring and logging systems (e.g., Prometheus, Grafana, ELK stack, Datadog). Set up alerts and notifications for critical system events. Analyze logs and metrics to identify and resolve performance issues. Automation and Scripting: Develop and More ❯
Deep understanding of cloud computing platforms (e.g., AWS, Azure, GCP) and containerization technologies (e.g., Kubernetes). Experience with monitoring, logging, and observability tools like DataDog, AWS Cloudwatch, ELK, Prometheus, Splunk etc. Knowledge of infrastructure as code tools (e.g., Terraform, Ansible, ArgoCD) and CI/CD pipelines. Experience deploying enterprise software More ❯
e.g. JIRA, Confluence Monitoring, Logging, and Performance Tuning - Skills in monitoring systems' performance and logs to ensure uptime and identify performance bottlenecks - e.g. Grafana, Datadog Networking Concepts - Knowledge in TCP/IP, DNS, VPN, load balancing, and firewalls Security Best Practices - Implementing security in DevOps (e.g., IAM policies, network security More ❯
cloud environments. Familiarity with cloud security principles and best practices (e.g., IAM, encryption, threat monitoring). Experience with monitoring and alerting tools (e.g., CloudWatch, Datadog, Prometheus). Strong problem-solving, troubleshooting, and communication skills. Preferred Skills: Cloud certifications (e.g., AWS Certified SysOps Administrator, Microsoft Certified: Azure Administrator, Google Cloud Professional More ❯
GCP. Proficiency using Infrastructure as Code (IaC) tools such as Terraform (preferred), Ansible, or CloudFormation. Experience with monitoring, observability and logging tools such as DataDog, Prometheus, Grafana, or similar. Proven track record of maintaining highly-available and performant production environments. Ability to identify and implement effective mitigation strategies and operational More ❯
Strong knowledge of infrastructure as code tools (e.g., Terraform, Ansible, ArgoCD) and CI/CD pipelines. Experience with monitoring, logging, and observability tools like DataDog, AWS Cloudwatch, ELK, Prometheus, Splunk etc. Excellent communication and interpersonal skills, with the ability to collaborate effectively with cross-functional teams. Strong problem-solving and More ❯
GCP Background knowledge and hands-on practice in Observability, specifically experience working with one or more of the following tools - Kibana, Open-Search, Grafana, Datadog, Sumo Logic, New Relic, AppDynamics, Dynatrace, Prometheus, Logz.io, SignalFX, Instana, Splunk, Honeycomb, Jaeger Hands-on experience with Infrastructure as a Code (Terraform/Ansible) Hands More ❯
GCP Background knowledge and hands-on practice in Observability, specifically experience working with one or more of the following tools - Kibana, Open-Search, Grafana, Datadog, Sumo Logic, New Relic, AppDynamics, Dynatrace, Prometheus, Logz.io, SignalFX, Instana, Splunk, Honeycomb, Jaeger Hands-on experience with Infrastructure as a Code (Terraform/Ansible) Hands More ❯
GCP Background knowledge and hands-on practice in Observability, specifically experience working with one or more of the following tools - Kibana, Open-Search, Grafana, Datadog, Sumologic, NewRelic, AppDynamics, Dynatrace, Prometheus, Logz.io, SignalFX, Instana, Splunk, Honeycomb, Jaeger Hands-on experience with Infrastructure as a Code (Terraform/Ansible) Hands-on experience More ❯
. Knowledge of scripting or programming languages, such as Python, PowerShell, or Bash. Familiarity with log management and monitoring tools (e.g., Splunk, Datadog, or ELK stack). Experience with SIEM and/or SOAR tools and capabilities. Travel: Less than 10% travel is expected for this position. Travel may include More ❯
CloudFormation, and manage resources for optimal performance. Monitor, troubleshoot, and resolve incidents, optimizing systems to ensure reliability and minimize downtime. Implement monitoring (Prometheus, Grafana, Datadog) and set up alerting systems to proactively address issues and ensure scalability. Work with DevOps, engineering, and security teams to improve application deployment, infrastructure management More ❯
if capability can be demonstrated). Working with virtualisation technologies (VMware preferred). CI/CD Pipeline Deployments with Jenkins Experience of monitoring systems (Datadog, Grafana etc). Experience of Docker/containerisation. Optional/Desired: Experience of Kubernetes and Amazon EKS. Experience deploying and configuring web applications in multiple More ❯
Manchester, Lancashire, United Kingdom Hybrid / WFH Options
Smart DCC
Develop automated test suites for data pipelines, ensuring data quality and transformation integrity. Monitoring & Performance Optimization: Monitor data pipelines with tools like Prometheus and Datadog to ensure optimal performance and health. Proactively implement anomaly detection and optimize system performance and resource allocation. Collaborate with cross-functional teams to align DataOps More ❯
and Active Directory. Experience with disaster recovery and redundancy strategies in both cloud and on-premises environments. Proficiency with leading monitoring tools, such as Datadog, Splunk , Prometheus, Grafana, ELK Stack, and New Relic. Programming expertise, especially in systems programming languages (e.g., Java, Kotlin, Scala) and databases (e.g., SQL Server, PostgreSQL More ❯
and Active Directory. Experience with disaster recovery and redundancy strategies in both cloud and on-premises environments. Proficiency with leading monitoring tools, such as Datadog, Splunk , Prometheus, Grafana, ELK Stack, and New Relic. Programming expertise, especially in systems programming languages (e.g., Java, Kotlin, Scala) and databases (e.g., SQL Server, PostgreSQL More ❯
production environments Experience with networking services, such as: VPNs, DNS, load balancers, and firewalls Experience with logging and monitoring tools, such as: AWS CloudWatch, Datadog, New Relic, and Splunk Experience with software development automated testing practices Strong verbal and written communication skills Amazon is an equal opportunity employer and does More ❯
Dublin, City of Dublin, Republic of Ireland Hybrid / WFH Options
The Recruitment Company
plus) Deep knowledge of Kubernetes, containers, and cloud-native architectures Proficient in scripting and automation (Python, Shell, Go) Comfortable with tools like Terraform, Jenkins, DataDog, Prometheus, Splunk Solid background in networking, Linux systems, and infrastructure as code If you’re passionate about cloud reliability, automation, and solving complex problems at More ❯
testing frameworks and continuous delivery tools like Jenkins, GitLab CI, or CircleCI. Understanding of performance monitoring and observability tools such as CloudWatch , Prometheus , or Datadog . Interested? Please Apply! Golang Go AWS Kubernetes Terraform Bank Banking Finance Financial Services Crypto Blockchain Web3 Trading Exchange Digital Assets Hybrid Flexible More ❯