UrbanCode etc. • Containers – Docker, Kubernetes, Mesosphere etc. • Configuration Management – Ansible, Chef, Puppet etc. • Cloud – AWS preferred; multi clould experience ie with Azure, GCP etc. highly desirable • Monitoring – ELK, Prometheus, Splunk etc. • Experience in one of the following scripting language: Java, Bash, Python, Powershell, Golang, etc. • Experience working with Linux and/or Windows systems About you (ideally): • Demonstrate a working More ❯
UrbanCode etc. • Containers – Docker, Kubernetes, Mesosphere etc. • Configuration Management – Ansible, Chef, Puppet etc. • Cloud – AWS preferred; multi clould experience ie with Azure, GCP etc. highly desirable • Monitoring – ELK, Prometheus, Splunk etc. • Experience in one of the following scripting language: Java, Bash, Python, Powershell, Golang, etc. • Experience working with Linux and/or Windows systems About you (ideally): • Demonstrate a working More ❯
their operational success through proactive insight and incident prevention. What you'll do Design, implement, and manage observability solutions using industry-leading tools such as Dynatrace (primary), Grafana, and Splunk Collect and analyse telemetry data (metrics, logs, traces, events) to diagnose and resolve system and application performance issues Integrate monitoring platforms with ITSM tools (e.g. ServiceNow) and CI/CD … technical designs Proactively identify and highlight risks that could impact solution success What you'll need Strong experience deploying and managing observability platforms including Dynatrace, Grafana, and/or Splunk Deep understanding of telemetry signal analysis and performance monitoring Experience integrating observability tools with ITSM platforms and DevOps toolchains Ability to troubleshoot complex infrastructure and application issues using monitoring insights … and non-technical audiences Experience working in Agile project environments (Scrum, Kanban, etc.) A proactive mindset with a passion for continuous improvement and knowledge sharing Certifications Dynatrace Associate & Pro Splunk Core Certified Power User Desirable Experience DevOps or Site Reliability Engineering (SRE) experience Automation with Terraform or similar tools Building CI/CD pipelines Experience with Docker and Kubernetes for More ❯
their operational success through proactive insight and incident prevention. What you'll do Design, implement, and manage observability solutions using industry-leading tools such as Dynatrace (primary), Grafana, and Splunk Collect and analyse telemetry data (metrics, logs, traces, events) to diagnose and resolve system and application performance issues Integrate monitoring platforms with ITSM tools (e.g. ServiceNow) and CI/CD … technical designs Proactively identify and highlight risks that could impact solution success What you'll need Strong experience deploying and managing observability platforms including Dynatrace, Grafana, and/or Splunk Deep understanding of telemetry signal analysis and performance monitoring Experience integrating observability tools with ITSM platforms and DevOps toolchains Ability to troubleshoot complex infrastructure and application issues using monitoring insights … and non-technical audiences Experience working in Agile project environments (Scrum, Kanban, etc.) A proactive mindset with a passion for continuous improvement and knowledge sharing Certifications Dynatrace Associate & Pro Splunk Core Certified Power User DevOps or Site Reliability Engineering (SRE) experience Automation with Terraform or similar tools Experience with Docker and Kubernetes for packaging and deployment Ability to adapt to More ❯
development in web technologies or OOP (e.g., Python, Java, etc.) Database tech: Oracle SQL, PostgreSQL, MongoDB Proficient with Linux/Windows command line (Bash, PowerShell) Monitoring: Grafana, Prometheus, ELK, Splunk Agile working and tooling (e.g., Jira, Confluence) Diagnosing and resolving complex system issues ITIL knowledge or exposure to IT service operations Containerisation: Docker, Kubernetes, OpenShift Awareness of modern tech trends More ❯
development in web technologies or OOP (e.g., Python, Java, etc.) Database tech: Oracle SQL, PostgreSQL, MongoDB Proficient with Linux/Windows command line (Bash, PowerShell) Monitoring: Grafana, Prometheus, ELK, Splunk Agile working and tooling (e.g., Jira, Confluence) Diagnosing and resolving complex system issues ITIL knowledge or exposure to IT service operations Containerisation: Docker, Kubernetes, OpenShift Awareness of modern tech trends More ❯
development in web technologies or OOP (e.g., Python, Java, etc.) Database tech: Oracle SQL, PostgreSQL, MongoDB Proficient with Linux/Windows command line (Bash, PowerShell) Monitoring: Grafana, Prometheus, ELK, Splunk Agile working and tooling (e.g., Jira, Confluence) Diagnosing and resolving complex system issues ITIL knowledge or exposure to IT service operations Containerisation: Docker, Kubernetes, OpenShift Awareness of modern tech trends More ❯
such as C#, Python, Perl, Java, C++ CICD tools such as Azure DevOps, GitHub Actions, Gitlab, Jenkins, TeamCity Scripting languages such as PowerShell, bash Observability/Monitoring: Prometheus, Grafana, Splunk Containerisation tools such as Docker, K8S, OpenShift, EC, containers Hosting technologies such as IIS, nginx, Apache, App Service, LightSail Analytical and creative approach to problem solving We encourage you to More ❯
Monitoring & Observability Engineer | Splunk, Dynatrace, IaC Location: London/Cheshire - 3 days on site/Hybrid - 6 Months - Inside IR35 Join a cutting-edge team driving the future of observability! We're looking for a Monitoring & Observability Engineer to lead the design and deployment of robust monitoring solutions using Splunk , Dynatrace , and OpenTelemetry (OTel) in a fast-paced, tech-forward … environment. Key Responsibilities: Design and implement end-to-end monitoring pipelines (Splunk, Dynatrace, OTel). Build and maintain dashboards and queries in Splunk. Automate monitoring tool deployments using Terraform , Ansible , and Jenkins . Integrate observability seamlessly into CI/CD pipelines. Support dev teams with logging, alerting, and tracing best practices. Ensure scalable, cost-efficient monitoring configurations. Perform root cause … Code (IaC): Strong Terraform skills. CI/CD Expertise: Jenkins and Bitbucket. Automation & Config Management: Hands-on with Ansible and scripting (Python, Bash). Monitoring Tools: Solid experience with Splunk (must-have) and ideally Dynatrace (nice to have). Artifact Management: Familiarity with Artifactory. Communication: Ability to collaborate with both technical and development teams. Desirables: Experience with Prometheus, Grafana, or More ❯
incident response. Key Responsibilities - Manage and monitor AWS infrastructure for performance and security - Respond to production incidents, perform root cause analysis, and implement fixes - Maintain observability tools (Prometheus, Grafana, Splunk) and write PromQL queries - Improve and operate CI/CD pipelines using GitHub Actions and Kubernetes - Automate infrastructure tasks with Python, Bash, Go or SQL - Work with Git-based workflows … call rotation to ensure system reliability Your Profile Essential: - Solid hands-on AWS experience in a DevOps setting - Background in incident, change, and problem management - Strong with Prometheus, Grafana, Splunk, and PromQL - Proficient in scripting (Python, Go, Bash, SQL) - Skilled in GitHub, CI/CD, and Kubernetes operations Desirable: - Experience with Terraform or CloudFormation - Advanced log analysis with Splunk - Strong More ❯
Science, Information Technology, or related field (or equivalent experience). Industry certifications such as: Certified Information Systems Security Professional (CISSP) Certified Incident Handler (GCIH) GIAC Security Automation Expert (GCSA) Splunk Certified Automation Consultant, or relevant SOAR certifications. Experience with automation tools (e.g., SOAR platforms, Ansible, Phantom or similar). Proficiency in scripting languages (e.g., Python, PowerShell, Bash). Strong understanding … of SOC processes, including incident response and threat detection. Experience with SIEM platforms (e.g., Splunk). Knowledge of security frameworks (e.g., NIST, MITRE ATT&CK). Skills Proficiency in automation tools (e.g., SOAR platforms, Ansible, Phantom). Expertise in scripting languages (e.g., Python, PowerShell, Bash). Strong knowledge of SOC processes (incident response, threat detection). Experience with SIEM platforms … e.g., Splunk). Ability to integrate and automate security tools. Strong problem-solving and analytical skills. Experience in developing automated workflows and playbooks. Knowledge of security frameworks (e.g., MITRE ATT&CK, NIST). Strong collaboration and communication skills. Experience with log management and event correlation automation. Experience 3-5 years of experience in SOC or cybersecurity roles. Hands-on experience More ❯
Implement and maintain DevOps tooling (Terraform, Ansible, GitLab CI/CD, Jenkins) Lead PoCs for new storage technologies and present results to technical leadership Support observability via Grafana, Prometheus, Splunk , and related platforms Contribute to containerization efforts with Docker and Kubernetes (preferred) What We’re Looking For: 8+ years of experience in storage systems administration and infrastructure/platform engineering … performance tuning , particularly in HPC or ML/AI contexts Programming/scripting experience in Python , Golang , or similar languages Familiarity with modern observability and monitoring tools (Grafana, Prometheus, Splunk) Experience supporting AI/ML modelling environments is highly desirable Knowledge of container and orchestration technologies (Docker, Kubernetes) is a plus Proactive, collaborative, and passionate about building world-class infrastructure More ❯
Implement and maintain DevOps tooling (Terraform, Ansible, GitLab CI/CD, Jenkins) Lead PoCs for new storage technologies and present results to technical leadership Support observability via Grafana, Prometheus, Splunk , and related platforms Contribute to containerization efforts with Docker and Kubernetes (preferred) What We’re Looking For: 8+ years of experience in storage systems administration and infrastructure/platform engineering … performance tuning , particularly in HPC or ML/AI contexts Programming/scripting experience in Python , Golang , or similar languages Familiarity with modern observability and monitoring tools (Grafana, Prometheus, Splunk) Experience supporting AI/ML modelling environments is highly desirable Knowledge of container and orchestration technologies (Docker, Kubernetes) is a plus Proactive, collaborative, and passionate about building world-class infrastructure More ❯
preferably within an MSP or fast-paced environment. Strong hands-on experience with SIEM, EDR, VPNs, firewalls, and cloud platforms (AWS, Azure, GCP). Expertise in Microsoft Sentinel, Cisco Splunk or Palo Alto QRadar, and KQL. Proven skills in threat detection, incident response, and forensic analysis. Knowledge of SOAR tools (especially Palo Alto XSOAR or similar). Familiarity with compliance More ❯
preferably within an MSP or fast-paced environment. Strong hands-on experience with SIEM, EDR, VPNs, firewalls, and cloud platforms (AWS, Azure, GCP). Expertise in Microsoft Sentinel, Cisco Splunk or Palo Alto QRadar, and KQL. Proven skills in threat detection, incident response, and forensic analysis. Knowledge of SOAR tools (especially Palo Alto XSOAR or similar). Familiarity with compliance More ❯
hands-on practice in Observability, specifically experience working with one or more of the following tools - Kibana, Open-Search, Grafana, Datadog, Sumologic, NewRelic, AppDynamics, Dynatrace, Prometheus, Logz.io, SignalFX, Instana, Splunk, Honeycomb, Jaeger Hands-on experience with Infrastructure as a Code (Terraform/Ansible) Hands-on experience in technical integrations (OpenTelemetry/fluentd/fluentbit/filebeat/logstash) Hands-on More ❯
of fundamental security concepts and terminologies (CIA, basic cryptography knowledge etc.) Advanced scripting skills in Python, Bash, PowerShell etc. Advanced Network knowledge, including Firewall management. Familiarity with SIEM tools (Splunk, Elastic Stack etc.) Good knowledge of identified operating system platforms; routers, network protocols, and security architecture. Good knowledge of industry standard security tools for implementation, administration and usage. Advanced networking More ❯
Site Reliability Engineer Location: London Hybrid (3 days WFH) Salary Range: Up to £140,000 Annapurna is working on behalf of a pioneering technology company to recruit a Site Reliability Engineer (SRE) . This is a unique opportunity to play More ❯
City of London, London, United Kingdom Hybrid / WFH Options
Annapurna
Site Reliability Engineer Location: London Hybrid (3 days WFH) Salary Range: Up to £140,000 Annapurna is working on behalf of a pioneering technology company to recruit a Site Reliability Engineer (SRE) . This is a unique opportunity to play More ❯
load balancers (F5, HAProxy, Nginx) and network monitoring tools. Experience in DNS management and troubleshooting. Experience in network security best practices. Proficiency in monitoring and observability tools (Prometheus, Grafana, Splunk). Proficiency in at least one scripting language (Python, Bash) for automation. Experience with CI/CD pipeline management and DevOps practices. Strong understanding of disaster recovery and business continuity More ❯
practice in Observability, specifically experience working with one or more of the following tools - Kibana, Open-Search, Grafana, Datadog, Sumo Logic, New Relic, AppDynamics, Dynatrace, Prometheus, Logz.io, SignalFX, Instana, Splunk, Honeycomb, Jaeger Hands-on experience with Infrastructure as a Code (Terraform/Ansible) Hands-on experience in technical integrations (OpenTelemetry/fluentd/fluentbit/filebeat/logstash) Hands-on More ❯
practice in Observability, specifically experience working with one or more of the following tools - Kibana, Open-Search, Grafana, Datadog, Sumo Logic, New Relic, AppDynamics, Dynatrace, Prometheus, Logz.io, SignalFX, Instana, Splunk, Honeycomb, Jaeger Hands-on experience with Infrastructure as a Code (Terraform/Ansible) Hands-on experience in technical integrations (OpenTelemetry/fluentd/fluentbit/filebeat/logstash) Hands-on More ❯