Southampton, Hampshire, United Kingdom Hybrid / WFH Options
NICE
experience of Grafana Observability Suite (Loki, Mimir, Tempo). Administration and/or development experience of standard monitoring and automation tools such as Splunk, Datadog, Pagerduty, Rundeck. Familiarity with configuration management tools like Ansible, Puppet, or Chef. Certifications such as AWS Certified DevOps Engineer, Google Cloud Professional DevOps Engineer, or More ❯
AWS, Azure, or GCP Manage infrastructure as code using tools like Terraform Monitor and maintain production systems using tools such as Prometheus, Grafana, or Datadog Collaborate with development and QA teams to improve deployment processes and system reliability Contribute to incident response, troubleshooting, and root cause analysis Requirements Approximately More ❯
optimization. Configure and maintain cloud-based services and resources. Monitoring and Logging: Implement and maintain monitoring and logging systems (e.g., Prometheus, Grafana, ELK stack, Datadog). Set up alerts and notifications for critical system events. Analyze logs and metrics to identify and resolve performance issues. Automation and Scripting: Develop and More ❯
e.g. JIRA, Confluence Monitoring, Logging, and Performance Tuning - Skills in monitoring systems' performance and logs to ensure uptime and identify performance bottlenecks - e.g. Grafana, Datadog Networking Concepts - Knowledge in TCP/IP, DNS, VPN, load balancing, and firewalls Security Best Practices - Implementing security in DevOps (e.g., IAM policies, network security More ❯
Manchester, Lancashire, United Kingdom Hybrid / WFH Options
Smart DCC
Develop automated test suites for data pipelines, ensuring data quality and transformation integrity. Monitoring & Performance Optimization: Monitor data pipelines with tools like Prometheus and Datadog to ensure optimal performance and health. Proactively implement anomaly detection and optimize system performance and resource allocation. Collaborate with cross-functional teams to align DataOps More ❯
Actions, Gitlab, Jenkins, Teamcity Scripting languages such as PowerShell, bash L1 to L3 networking Logging and monitoring systems, and visualisation tools, such as Splunk, Datadog, Log Analytics, Cloudwatch, ELK, Grafana, PowerBI, Prometheus, Application Insights IaC tools such as Terraform, Cloudformation, Chef, Ansible, Puppet, Pulumi, Bicep Database systems such as MSSQL More ❯
Unix Shell. Deep understanding of software applications and technical processes, with emerging expertise in specific disciplines. Experience with observability tools like Grafana, Dynatrace, Prometheus, Datadog, Splunk, including monitoring, SLO alerting, and telemetry collection. Knowledge of CI/CD tools such as Jenkins, GitLab, Terraform. Experience with containers and orchestration tools More ❯
flows. Develop applications and integrations across platforms such as ITSM, ITOM, PA, CSM, SPM, CSDM, CMDB, Employee Centre, Integration Hub, and observability tools (e.g., Datadog, Splunk, AWS CloudWatch, Prometheus, etc.). Ensure seamless interoperability between service operations tooling and cloud-native environments. Technical Leadership & Collaboration: Serve as a technical lead More ❯
programming languages such as Java, Python, C#, shell script (Linux/Powershell). Experience of monitoring, logging and alerting stacks or APMs such as Datadog, Dynatrace, Solarwinds, Prometheus, Grafana, TICK, ELK. Exposure to incident response processes and scenarios. Solid verbal and written/diagrammatical communication skills. Experience of quality assurance More ❯
the following would be a plus: Experience with Observability across multiple domains (APM, Infrastructure, Synthetics, Logs, etc ) within cloud and on-premise environments using Datadog, Azure Monitor and Application Insights, NewRelic and Grafana. Experience working in B2B SaaS companies. Experience with cloud containers, specifically Kubernetes. Responsibilities & Duties: Develop: Architecture, strategy More ❯
london, south east england, United Kingdom Hybrid / WFH Options
Annapurna
pipelines and container technologies like Docker and Kubernetes. Deep understanding of networking, distributed systems, and databases. Expertise in monitoring and observability tools such as DataDog, Prometheus, Grafana, ELK stack, or Splunk. Excellent communication skills and a meticulous approach to problem-solving. Desirable Experience: Familiarity with Azure. Experience working in the More ❯
APIs in a multi-system e-commerce environment Understanding of architecture patterns: Microservices , SOA , Hexagonal , Modular Monolith Monitoring & Observability: Grafana , Prometheus , CloudWatch , New Relic , Datadog , etc. Solid grasp of AI trends in software development , particularly in using GPT tools and agentic systems Education: Mathematics or Computer Science degree (or equivalent More ❯
london, south east england, United Kingdom Hybrid / WFH Options
Noir
pipelines, and be confident scripting in Python, C# or similar scripting languages. You’ll also be comfortable working with monitoring and performance tools like Datadog or Prometheus, and ideally, you’ll have worked in a fast-moving SaaS or product-led business before. Bonus points if you’ve helped shape More ❯
london, south east england, United Kingdom Hybrid / WFH Options
Merlin Entertainments
Proficient in cloud platforms (AWS, Azure, GCP) and modern DevOps tooling (e.g., Terraform, Jenkins, Kubernetes). Hands-on with observability and monitoring tools (e.g., DataDog, Azure Monitor, AppDynamics). Expert in cyber security practices, identity management, encryption, and secure API development. Familiarity with compliance frameworks such as GDPR and PCI More ❯
in at least one language (.Net, C#, node, java, go, c/C++, ruby). Experience with logging and monitoring tools such as ELK, DataDog or NewRelic, LogEntries, SumoLogic, etc. BENEFITS: Competitive Base Salary Employee Performance Related Bonus Clear Career Progression Flexible Working Hours Matched Employee Pension Generous Holiday Package More ❯
strategy while delivering incremental value. Technical Debt Management – Experience identifying and remediating inefficient architectures. Observability & Performance Optimization – Familiarity with monitoring and logging tools (e.g., Datadog, Splunk, Prometheus, New Relic). Stakeholder Management – Ability to engage with senior leadership, product managers, and engineering teams. Metrics-Driven Decision Making – Familiarity with engineering More ❯
Systems, or related fields is desirable but not essential. Nice to have but not essential: Service monitoring and graphing tools (Prometheus + Grafana, Nagios, Datadog) Elastic Stack Repository solutions (JFrog Artifactory, JFrog Bintray) OpenVPN SQL Databases (MongoDB, PostgreSQL, MySQL) Our Values: We work together We believe in people We won More ❯
Experience in data platform, data engineering, large scale data processing, ETL, Lake house and experience in micro services, API design, Kafka, Redis, MemCached, Observability (Datadog, Splunk, Grafana or similar), Orchestration (Airflow, Temporal) Proficient in SQL and in one or more DBMS: Oracle, PostgreSQL, Sybase, MongoDB, Cassandra, CockroachDB, MySQL, Couchbase, DynamoDB More ❯
Experience with AWS certifications (AWS Certified Solutions Architect, Developer, or DevOps Engineer). Experience with Monitoring and Logging solutions like CloudWatch , New Relic , or Datadog . More ❯
Proficiency in SQL and data analytics tools (e.g., Sigma, Snowflake) Experience with FIX protocol and market data analysis proficient in AWS, Kubernetes, monitoring tools (Datadog, Prometheus, Grafana), and automation frameworks (Terraform, Ansible, Pulumi) For more information, please apply with a relevant CV. More ❯
IAM, DynamoDB), HashiCorp (Vault, Consul, Nomad) Authentication: Auth0 for API authorization DevOps & Tooling: GitLab CI (CI/CD pipelines), Kong, Postman, Apollo GraphQL Observability: Datadog Infrastructure & Automation: Infrastructure as Code (IaC), Distributed Caching with Redis About You: You're an enthusiastic and curious engineer who thrives in collaborative environments and More ❯