Southampton, Hampshire, United Kingdom Hybrid / WFH Options
NICE
experience of Grafana Observability Suite (Loki, Mimir, Tempo). Administration and/or development experience of standard monitoring and automation tools such as Splunk, Datadog, Pagerduty, Rundeck. Familiarity with configuration management tools like Ansible, Puppet, or Chef. Certifications such as AWS Certified DevOps Engineer, Google Cloud Professional DevOps Engineer, or More ❯
Azure, or Google Cloud Platform. • Security: Experience with tools for delivering SCA, SAST, DAST capabilities. • Monitoring and Logging: Proficiency with tools like Splunk, Dynatrace, Datadog, Prometheus, Grafana. • Version Control: Strong understanding of Git and version control practices. • Scripting: Skills in scripting languages like Bash, PowerShell, or Perl. • Containerization: Familiarity with More ❯
optimization. Configure and maintain cloud-based services and resources. Monitoring and Logging: Implement and maintain monitoring and logging systems (e.g., Prometheus, Grafana, ELK stack, Datadog). Set up alerts and notifications for critical system events. Analyze logs and metrics to identify and resolve performance issues. Automation and Scripting: Develop and More ❯
e.g. JIRA, Confluence Monitoring, Logging, and Performance Tuning - Skills in monitoring systems' performance and logs to ensure uptime and identify performance bottlenecks - e.g. Grafana, Datadog Networking Concepts - Knowledge in TCP/IP, DNS, VPN, load balancing, and firewalls Security Best Practices - Implementing security in DevOps (e.g., IAM policies, network security More ❯
GCP Background knowledge and hands-on practice in Observability, specifically experience working with one or more of the following tools - Kibana, Open-Search, Grafana, Datadog, Sumo Logic, New Relic, AppDynamics, Dynatrace, Prometheus, Logz.io, SignalFX, Instana, Splunk, Honeycomb, Jaeger Hands-on experience with Infrastructure as a Code (Terraform/Ansible) Hands More ❯
if capability can be demonstrated). Working with virtualisation technologies (VMware preferred). CI/CD Pipeline Deployments with Jenkins Experience of monitoring systems (Datadog, Grafana etc). Experience of Docker/containerisation. Optional/Desired: Experience of Kubernetes and Amazon EKS. Experience deploying and configuring web applications in multiple More ❯
Manchester, Lancashire, United Kingdom Hybrid / WFH Options
Smart DCC
Develop automated test suites for data pipelines, ensuring data quality and transformation integrity. Monitoring & Performance Optimization: Monitor data pipelines with tools like Prometheus and Datadog to ensure optimal performance and health. Proactively implement anomaly detection and optimize system performance and resource allocation. Collaborate with cross-functional teams to align DataOps More ❯
Unix Shell. Deep understanding of software applications and technical processes, with emerging expertise in specific disciplines. Experience with observability tools like Grafana, Dynatrace, Prometheus, Datadog, Splunk, including monitoring, SLO alerting, and telemetry collection. Knowledge of CI/CD tools such as Jenkins, GitLab, Terraform. Experience with containers and orchestration tools More ❯
london, south east england, united kingdom Hybrid / WFH Options
Noir
pipelines, and be confident scripting in Python, C# or similar scripting languages. You’ll also be comfortable working with monitoring and performance tools like Datadog or Prometheus, and ideally, you’ll have worked in a fast-moving SaaS or product-led business before. Bonus points if you’ve helped shape More ❯
london, south east england, united kingdom Hybrid / WFH Options
Parser
Security Best Practices: IAM, MFA, data encryption, firewall configurations. Programming/Scripting: Python, Terraform, or similar languages. Event-Driven Architectures: Kafka. Monitoring and Logging: Datadog, ELK Stack, Prometheus, etc. Experience in agile methodologies and DevOps practices. Location: Hybrid. Office located in London. (Hayes area). Office presence required: Yes. Frequency More ❯
Cambridge, England, United Kingdom Hybrid / WFH Options
Tokenovate
mesh architectures, and cloud-native security patterns. Exposure to compliance and auditing requirements in regulated industries. Experience with modern observability stacks (OpenTelemetry, Prometheus, Grafana, Datadog, etc.). Familiarity with authentication and authorisation protocols (OAuth2, OIDC, SAML). WHY JOIN US? Cutting-Edge Work – Build innovative solutions at the intersection of More ❯
cambridge, east anglia, united kingdom Hybrid / WFH Options
Tokenovate
mesh architectures, and cloud-native security patterns. Exposure to compliance and auditing requirements in regulated industries. Experience with modern observability stacks (OpenTelemetry, Prometheus, Grafana, Datadog, etc.). Familiarity with authentication and authorisation protocols (OAuth2, OIDC, SAML). WHY JOIN US? Cutting-Edge Work – Build innovative solutions at the intersection of More ❯
Cambridge, south west england, united kingdom Hybrid / WFH Options
Tokenovate
mesh architectures, and cloud-native security patterns. Exposure to compliance and auditing requirements in regulated industries. Experience with modern observability stacks (OpenTelemetry, Prometheus, Grafana, Datadog, etc.). Familiarity with authentication and authorisation protocols (OAuth2, OIDC, SAML). WHY JOIN US? Cutting-Edge Work – Build innovative solutions at the intersection of More ❯
programming languages such as Java, Python, C#, shell script (Linux/Powershell). Experience of monitoring, logging and alerting stacks or APMs such as Datadog, Dynatrace, Solarwinds, Prometheus, Grafana, TICK, ELK. Exposure to incident response processes and scenarios. Solid verbal and written/diagrammatical communication skills. Experience of quality assurance More ❯
the following would be a plus: Experience with Observability across multiple domains (APM, Infrastructure, Synthetics, Logs, etc ) within cloud and on-premise environments using Datadog, Azure Monitor and Application Insights, NewRelic and Grafana. Experience working in B2B SaaS companies. Experience with cloud containers, specifically Kubernetes. Responsibilities & Duties: Develop: Architecture, strategy More ❯
using AWS services (SNS, SQS, EventBridge). Knowledge of GraphQL, WebSockets, or real-time data streaming. Exposure to DevOps and observability practices (e.g., Prometheus, Datadog, AWS CloudWatch, OpenTelemetry). Prior experience in leading distributed engineering teams. More ❯
london, south east england, united kingdom Hybrid / WFH Options
Merlin Entertainments
Proficient in cloud platforms (AWS, Azure, GCP) and modern DevOps tooling (e.g., Terraform, Jenkins, Kubernetes). Hands-on with observability and monitoring tools (e.g., DataDog, Azure Monitor, AppDynamics). Expert in cyber security practices, identity management, encryption, and secure API development. Familiarity with compliance frameworks such as GDPR and PCI More ❯
in at least one language (.Net, C#, node, java, go, c/C++, ruby). Experience with logging and monitoring tools such as ELK, DataDog or NewRelic, LogEntries, SumoLogic, etc. BENEFITS: Competitive Base Salary Employee Performance Related Bonus Clear Career Progression Flexible Working Hours Matched Employee Pension Generous Holiday Package More ❯
strategy while delivering incremental value. Technical Debt Management – Experience identifying and remediating inefficient architectures. Observability & Performance Optimization – Familiarity with monitoring and logging tools (e.g., Datadog, Splunk, Prometheus, New Relic). Stakeholder Management – Ability to engage with senior leadership, product managers, and engineering teams. Metrics-Driven Decision Making – Familiarity with engineering More ❯
operations of web applications. Desirable Skills: Serverless & Microservices: Experience withAWS Lambda,Azure Functions, and event-driven architectures. Observability & Monitoring: Familiarity with monitoring tools likeSplunk,Datadog, orNew Relicfor enhanced visibility and observability. Networking: Knowledge ofVPCs,VPNs, andload balancingin cloud environments. GDS Standards: Awareness ofGDS Service Standardsand accessibility requirements, especially for public More ❯
Reigate, Surrey, United Kingdom Hybrid / WFH Options
Willis Towers Watson
cost effectiveness Implement infrastructure as code with Pulumi Support the team in infrastructure and networking related issues Maintain and configure observability platforms such as Datadog Proactively monitor production and other environments to ensure stability, availability, security and integrity Participate in incident response, troubleshooting, and root cause analysis to mitigate and More ❯
Cambridge, Cambridgeshire, United Kingdom Hybrid / WFH Options
AI Tech Suite
Find the latest job opportunities in AI and tech. RunPod offers GPU cloud computing for AI/ML, providing secure and community cloud options, on-demand and spot pods, and serverless GPU scaling. The flexibility of remote work with an More ❯
Proficiency in SQL and data analytics tools (e.g., Sigma, Snowflake) Experience with FIX protocol and market data analysis proficient in AWS, Kubernetes, monitoring tools (Datadog, Prometheus, Grafana), and automation frameworks (Terraform, Ansible, Pulumi) For more information, please apply with a relevant CV. More ❯