Southampton, Hampshire, United Kingdom Hybrid / WFH Options
NICE
experience of Grafana Observability Suite (Loki, Mimir, Tempo). Administration and/or development experience of standard monitoring and automation tools such as Splunk, Datadog, Pagerduty, Rundeck. Familiarity with configuration management tools like Ansible, Puppet, or Chef. Certifications such as AWS Certified DevOps Engineer, Google Cloud Professional DevOps Engineer, or More ❯
e.g., Kubernetes, Docker). Proficiency in scripting and programming languages (e.g., Python, Bash, Go). Experience with monitoring and observability tools (e.g., Prometheus, Grafana, Datadog). Solid understanding of security best practices, compliance standards, and DevSecOps . Proven ability to manage and deliver complex projects on time and within budget. More ❯
Azure, or Google Cloud Platform. • Security: Experience with tools for delivering SCA, SAST, DAST capabilities. • Monitoring and Logging: Proficiency with tools like Splunk, Dynatrace, Datadog, Prometheus, Grafana. • Version Control: Strong understanding of Git and version control practices. • Scripting: Skills in scripting languages like Bash, PowerShell, or Perl. • Containerization: Familiarity with More ❯
optimization. Configure and maintain cloud-based services and resources. Monitoring and Logging: Implement and maintain monitoring and logging systems (e.g., Prometheus, Grafana, ELK stack, Datadog). Set up alerts and notifications for critical system events. Analyze logs and metrics to identify and resolve performance issues. Automation and Scripting: Develop and More ❯
e.g. JIRA, Confluence Monitoring, Logging, and Performance Tuning - Skills in monitoring systems' performance and logs to ensure uptime and identify performance bottlenecks - e.g. Grafana, Datadog Networking Concepts - Knowledge in TCP/IP, DNS, VPN, load balancing, and firewalls Security Best Practices - Implementing security in DevOps (e.g., IAM policies, network security More ❯
Actions, Jenkins, GitLab CI). Expertise in Infrastructure-as-Code using Terraform (or similar tools). Experience with observability tools (e.g., Prometheus, Grafana, ELK, Datadog). Strong communication and collaboration skills. Bonus Points For Experience in containerization and orchestration (e.g., Docker, Kubernetes). Background in performance tuning and incident response. More ❯
GCP Background knowledge and hands-on practice in Observability, specifically experience working with one or more of the following tools - Kibana, Open-Search, Grafana, Datadog, Sumologic, NewRelic, AppDynamics, Dynatrace, Prometheus, Logz.io, SignalFX, Instana, Splunk, Honeycomb, Jaeger Hands-on experience with Infrastructure as a Code (Terraform/Ansible) Hands-on experience More ❯
CloudFormation, and manage resources for optimal performance. Monitor, troubleshoot, and resolve incidents, optimizing systems to ensure reliability and minimize downtime. Implement monitoring (Prometheus, Grafana, Datadog) and set up alerting systems to proactively address issues and ensure scalability. Work with DevOps, engineering, and security teams to improve application deployment, infrastructure management More ❯
if capability can be demonstrated). Working with virtualisation technologies (VMware preferred). CI/CD Pipeline Deployments with Jenkins Experience of monitoring systems (Datadog, Grafana etc). Experience of Docker/containerisation. Optional/Desired: Experience of Kubernetes and Amazon EKS. Experience deploying and configuring web applications in multiple More ❯
Manchester, Lancashire, United Kingdom Hybrid / WFH Options
Smart DCC
Develop automated test suites for data pipelines, ensuring data quality and transformation integrity. Monitoring & Performance Optimization: Monitor data pipelines with tools like Prometheus and Datadog to ensure optimal performance and health. Proactively implement anomaly detection and optimize system performance and resource allocation. Collaborate with cross-functional teams to align DataOps More ❯
and Active Directory. Experience with disaster recovery and redundancy strategies in both cloud and on-premises environments. Proficiency with leading monitoring tools, such as Datadog, Splunk , Prometheus, Grafana, ELK Stack, and New Relic. Programming expertise, especially in systems programming languages (e.g., Java, Kotlin, Scala) and databases (e.g., SQL Server, PostgreSQL More ❯
production environments Experience with networking services, such as: VPNs, DNS, load balancers, and firewalls Experience with logging and monitoring tools, such as: AWS CloudWatch, Datadog, New Relic, and Splunk Experience with software development automated testing practices Strong verbal and written communication skills Amazon is an equal opportunity employer and does More ❯
testing frameworks and continuous delivery tools like Jenkins, GitLab CI, or CircleCI. Understanding of performance monitoring and observability tools such as CloudWatch , Prometheus , or Datadog . Interested? Please Apply! Golang Go AWS Kubernetes Terraform Bank Banking Finance Financial Services Crypto Blockchain Web3 Trading Exchange Digital Assets Hybrid Flexible More ❯
Azure, or GCP. Skilled in Docker, Kubernetes, and CI/CD tools (Azure DevOps, Jenkins, GitLab, CircleCI). Familiar with monitoring tools like Prometheus, Datadog, New Relic, Grafana, PagerDuty. Solid grasp of networking (DNS, load balancing, firewalls). Understanding of real-time AI and streaming architectures. Preferred Qualifications: Experience with More ❯
in the process of containerization for applications and their subsequent orchestration within Kubernetes environments. Experience working on at least one monitoring/observability stack (Datadog, ELK, Splunk, Loki, Grafana). Strong knowledge of Unix or Linux Strong communication skills to collaborate with various stakeholders Able to work independently in a More ❯
an object-oriented language (preferably Java, .NET or C++) Expert+ level Linux administration, scripting, and troubleshooting Demonstratable knowledge of Observability tools (New Relic, Splunk, DataDog) Comprehensive experience with AWS (Amazon Web Services) and its core capabilities (VPC, EC2, ECS, Route53, Fargate, ALB/NLB distributions, etc) Extensive experience with cloud More ❯
an object-oriented language (preferably Java, .NET or C++) Expert+ level Linux administration, scripting, and troubleshooting Demonstratable knowledge of Observability tools (New Relic, Splunk, DataDog) Comprehensive experience with AWS (Amazon Web Services) and its core capabilities (VPC, EC2, ECS, Route53, Fargate, ALB/NLB distributions, etc) Extensive experience with cloud More ❯
experience with CI/CD, containerization and orchestration tools (Docker, Kubernetes ). Knowledge of monitoring, logging, alerting and observability tools (Prometheus, Grafana, ELK Stack, Datadog ). Familiarity with infrastructure-as-code tools like Terraform or CloudFormation. Proficiency in scripting languages (Python, Go, Bash ) and knowledge of software development best practices. More ❯
Security Best Practices: IAM, MFA, data encryption, firewall configurations. Programming/Scripting: Python, Terraform, or similar languages. Event-Driven Architectures: Kafka. Monitoring and Logging: Datadog, ELK Stack, Prometheus, etc. Experience in agile methodologies and DevOps practices. Location: Hybrid. Office located in London. (Hayes area). Office presence required: Yes. Frequency More ❯
programming languages such as Java, Python, C#, shell script (Linux/Powershell). Experience of monitoring, logging and alerting stacks or APMs such as Datadog, Dynatrace, Solarwinds, Prometheus, Grafana, TICK, ELK. Exposure to incident response processes and scenarios. Solid verbal and written/diagrammatical communication skills. Experience of quality assurance More ❯
using AWS services (SNS, SQS, EventBridge). Knowledge of GraphQL, WebSockets, or real-time data streaming. Exposure to DevOps and observability practices (e.g., Prometheus, Datadog, AWS CloudWatch, OpenTelemetry). Prior experience in leading distributed engineering teams. More ❯
CD pipelines using GitHub Actions. Familiarity with code quality tools like SonarCloud and security tools like Snyk. Extensive experience with monitoring tools such as Datadog or NewRelic. Knowledge of containerization technologies (e.g., Docker, Kubernetes). Excellent problem-solving skills and attention to detail. Strong communication and collaboration skills. Preferred Qualifications More ❯
Proficient in cloud platforms (AWS, Azure, GCP) and modern DevOps tooling (e.g., Terraform, Jenkins, Kubernetes). Hands-on with observability and monitoring tools (e.g., DataDog, Azure Monitor, AppDynamics). Expert in cyber security practices, identity management, encryption, and secure API development. Familiarity with compliance frameworks such as GDPR and PCI More ❯
secret management tools (e.g., HashiCorp Vault, Azure Key Vault) and SSO/authentication systems (e.g., Okta). Observability: Hands-on experience with platforms like DataDog, Grafana, or Azure Monitor. Networking: Strong understanding of networking principles, DNS, and related technologies. CI/CD: Skilled in creating and maintaining CI/CD More ❯
/or internals. Experience working with cloud solutions (GCP or AWS). Deep understanding and demonstrable experience with modern monitoring tools such as Prometheus, Datadog, Grafana, and Telegraf. Experience with infrastructure as code tools. Experience with complex Terraform deployments is a plus. Solid background with configuration management tools. Experience with More ❯