Infrastructure Monitoring Jobs in London

19 of 19 Infrastructure Monitoring Jobs in London

Sr. DevOps Engineer

London, United Kingdom
ANRA Technologies, LLC
setup and maintenance of CICD pipelines. Strong experience working with tools and platforms like Jenkins. Excellent knowledge of AWS cloud services to implement highly efficient architecture. Ability to analyze infrastructure and implement security best practices. Experience with infrastructure monitoring tools like Nagios, Prometheus, Grafana. Expertise in containerization platforms like Docker and container orchestration platforms like Kubernetes and … Rancher. Familiarity with infrastructure as code tools such as Terraform, CloudFormation, Ansible, etc. Experience in writing automation scripts using Bash, Powershell, and Python. Experience with Windows and Linux systems administration. Qualifications: Bachelor's degree in Computer Science or equivalent. Minimum of 5 years of relevant experience. Excellent communication skills, both oral and written. Willingness to explore and learn new More ❯
Employment Type: Permanent
Salary: GBP Annual

Operational Acceptance Test Lead

London, United Kingdom
Hybrid / WFH Options
Identity E2E Ltd
Key Responsibilities Lead and manage the Operational Acceptance Testing (OAT) process, ensuring systems meet operational and business requirements before deployment. Define OAT test strategies, plans, and criteria to validate infrastructure, system reliability, failover mechanisms, and non-functional requirements. Collaborate with DevOps, Infrastructure, and Release Management teams to ensure smooth transition from development to production on AWS cloud. Conduct … Expertise 5+ years of experience in Quality Assurance, or Operational Testing. Deep understanding of OAT methodologies, including but not limited to the following core areas: Build verification Alert and Monitoring Backup and Restore Resilience and Recovery Logging, Audit and House Keeping Release Management Config Management Work Instructions OWASP "top ten" security tests Experience with CI/CD pipelines, Jenkins … and test automation frameworks. Knowledge of cloud platforms (AWS/Azure), infrastructure monitoring, and IT service management. Familiarity with databases, Linux environments, and network protocols. Experience in working with Agile & DevOps methodologies. Strong analytical and problem-solving skills with attention to detail. Excellent communication and stakeholder management abilities. Why Apply? We invest in your growth with an ACloudGuru More ❯
Employment Type: Permanent
Salary: GBP Annual

Site Services Lead

Merton, Wisconsin, United States
Hybrid / WFH Options
QuadMed
related technologies and relevant processes. This role is ideally filled by someone who is adaptive, organized, strong in project management, has a customer service focus and background in technical infrastructure, networking, hardware and desktop support. The role involves overseeing one or more infrastructure or support teams, fostering a collaborative and positive environment and making decisions that balance the … continuous improvement and world-class customer service to develop and lead team to build Raving Fans through exemplary service and follow through. Accountable for all health center and corporate infrastructure, ensuring uptime, solution delivery, IT inventory and lifecycle management and adherence to site checklists and security protocols. Responsible for annual client infrastructure budget planning, presentation and adherence. Collaborate … with IT, Facilities & Implementation and Operations on setting health center hardware/technology/infrastructure standards, RFP proposals, implementations and closures. Facilitate Big Room Planning and Agile functions for the team, including capacity planning, tracking, and workload management, in partnership with Quad and QuadMed IT. Focus on IT Scorecard metrics and mentor team on establishing and performing proper infrastructure More ❯
Employment Type: Permanent
Salary: USD Annual

Observability/ Monitoring Engineer - Grafana Dashboarding

City of London, London, United Kingdom
Levy Global
We’re seeking an experienced contractor to support the delivery of observability solutions for a new, large-scale infrastructure environment. This role focuses on developing insightful and automated Grafana dashboards, with a strong emphasis on data integration and actionable telemetry. Required Skills Excellent, concise communication skills - essential for collaborating with technical teams to shape observability outputs. Deep experience with … Bonus/Nice-to-Have Skills: Experience deploying Grafana instances via code (provisioning dashboards, datasources). Familiarity with OpenTelemetry, metric instrumentation, and telemetry pipelines. Background in data center environments, infrastructure monitoring, or SRE practices. Exposure to CI/CD workflows, containers (Podman/Docker), and cloud-native systems. More ❯

Observability/ Monitoring Engineer - Grafana Dashboarding

London Area, United Kingdom
Levy Global
We’re seeking an experienced contractor to support the delivery of observability solutions for a new, large-scale infrastructure environment. This role focuses on developing insightful and automated Grafana dashboards, with a strong emphasis on data integration and actionable telemetry. Required Skills Excellent, concise communication skills - essential for collaborating with technical teams to shape observability outputs. Deep experience with … Bonus/Nice-to-Have Skills: Experience deploying Grafana instances via code (provisioning dashboards, datasources). Familiarity with OpenTelemetry, metric instrumentation, and telemetry pipelines. Background in data center environments, infrastructure monitoring, or SRE practices. Exposure to CI/CD workflows, containers (Podman/Docker), and cloud-native systems. More ❯

Network Operations Engineer

City of London, London, United Kingdom
Alexander Ash Consulting
Alexander Ash is seeking Network Operations Engineers on behalf of our global, technology-driven financial services client. As part of the global infrastructure team, you will play a key role in managing and maintaining the firm’s network infrastructure and automation systems. The team’s mission is to ensure the availability, reliability, and security of the platform while … driving operational excellence. Key responsibilities include: Supporting and enhancing existing network infrastructure Developing observability tools and self-healing/event-driven automation Performing advanced troubleshooting and incident resolution Contributing to the evolution of a high-performance compute datacentre Skills Required: Proficient in monitoring and resolving incidents across diverse environments Strong diagnostic skills in network infrastructure, collaborating closely … with vendor support teams for in-depth investigations when needed Lead the creation and enhancement of monitoring dashboards, proactively addressing alerts based on priority and driving continuous improvements in alerting mechanisms and system observability Facilitate post-incident reviews to identify and implement improvements that enhance infrastructure reliability and availability Implement BAU changes with a focus on automation, fostering More ❯

Network Operations Engineer

London Area, United Kingdom
Alexander Ash Consulting
Alexander Ash is seeking Network Operations Engineers on behalf of our global, technology-driven financial services client. As part of the global infrastructure team, you will play a key role in managing and maintaining the firm’s network infrastructure and automation systems. The team’s mission is to ensure the availability, reliability, and security of the platform while … driving operational excellence. Key responsibilities include: Supporting and enhancing existing network infrastructure Developing observability tools and self-healing/event-driven automation Performing advanced troubleshooting and incident resolution Contributing to the evolution of a high-performance compute datacentre Skills Required: Proficient in monitoring and resolving incidents across diverse environments Strong diagnostic skills in network infrastructure, collaborating closely … with vendor support teams for in-depth investigations when needed Lead the creation and enhancement of monitoring dashboards, proactively addressing alerts based on priority and driving continuous improvements in alerting mechanisms and system observability Facilitate post-incident reviews to identify and implement improvements that enhance infrastructure reliability and availability Implement BAU changes with a focus on automation, fostering More ❯

Observability Engineer - Grafana Dashboarding

London, UK
Levy Global
We’re seeking an experienced contractor to support the delivery of observability solutions for a new, large-scale infrastructure environment. This role focuses on developing insightful and automated Grafana dashboards, with a strong emphasis on data integration and actionable telemetry. Required Skills Excellent, concise communication skills - essential for collaborating with technical teams to shape observability outputs. Deep experience with … Bonus/Nice-to-Have Skills: Experience deploying Grafana instances via code (provisioning dashboards, datasources). Familiarity with OpenTelemetry, metric instrumentation, and telemetry pipelines. Background in data center environments, infrastructure monitoring, or SRE practices. Exposure to CI/CD workflows, containers (Podman/Docker), and cloud-native systems. More ❯
Employment Type: Full-time

Observability Engineer - Grafana Dashboarding

City of London, England, United Kingdom
Levy Global
We’re seeking an experienced contractor to support the delivery of observability solutions for a new, large-scale infrastructure environment. This role focuses on developing insightful and automated Grafana dashboards, with a strong emphasis on data integration and actionable telemetry. Required Skills Excellent, concise communication skills - essential for collaborating with technical teams to shape observability outputs. Deep experience with … Bonus/Nice-to-Have Skills: Experience deploying Grafana instances via code (provisioning dashboards, datasources). Familiarity with OpenTelemetry, metric instrumentation, and telemetry pipelines. Background in data center environments, infrastructure monitoring, or SRE practices. Exposure to CI/CD workflows, containers (Podman/Docker), and cloud-native systems. More ❯

Observability Engineer - Grafana Dashboarding

South East London, England, United Kingdom
Levy Global
We’re seeking an experienced contractor to support the delivery of observability solutions for a new, large-scale infrastructure environment. This role focuses on developing insightful and automated Grafana dashboards, with a strong emphasis on data integration and actionable telemetry. Required Skills Excellent, concise communication skills - essential for collaborating with technical teams to shape observability outputs. Deep experience with … Bonus/Nice-to-Have Skills: Experience deploying Grafana instances via code (provisioning dashboards, datasources). Familiarity with OpenTelemetry, metric instrumentation, and telemetry pipelines. Background in data center environments, infrastructure monitoring, or SRE practices. Exposure to CI/CD workflows, containers (Podman/Docker), and cloud-native systems. More ❯

Head of Operations

City of London, London, United Kingdom
Hybrid / WFH Options
Enertek Group
Job Title: Head of Operations Type: Full-time | Leadership | Equity Available Industry: Web3 | AI Infrastructure About Us We are building the future of compute. Our platform is a decentralized, enterprise-grade cloud network, delivering scalable, globally distributed GPU resources for AI training, high-performance gaming, and Web3 infrastructure. By democratizing access to GPU power, we are accelerating … the roadmap, scale the team, and help us deliver a robust, secure, and performant decentralized GPU network. Key Responsibilities – Head of Operations Operational Strategy: Own and evolve the operational infrastructure supporting a globally distributed technology and product ecosystem. Team Leadership: Hire, mentor, and scale high-performing operations, support, and cross-functional execution teams across logistics, compliance, and service delivery. … cloud-native, global teams, remote-first). Strong familiarity with blockchain/Web3 ecosystems, including decentralized networks, compliance nuances, and ecosystem partnerships. Exposure to AI/ML or compute infrastructure operations is a significant plus (e.g., managing GPU resources, vendor logistics, or infrastructure procurement). Understanding of open-source community dynamics and developer ecosystem needs. Experience leading remote More ❯

Head of Operations

London Area, United Kingdom
Hybrid / WFH Options
Enertek Group
Job Title: Head of Operations Type: Full-time | Leadership | Equity Available Industry: Web3 | AI Infrastructure About Us We are building the future of compute. Our platform is a decentralized, enterprise-grade cloud network, delivering scalable, globally distributed GPU resources for AI training, high-performance gaming, and Web3 infrastructure. By democratizing access to GPU power, we are accelerating … the roadmap, scale the team, and help us deliver a robust, secure, and performant decentralized GPU network. Key Responsibilities – Head of Operations Operational Strategy: Own and evolve the operational infrastructure supporting a globally distributed technology and product ecosystem. Team Leadership: Hire, mentor, and scale high-performing operations, support, and cross-functional execution teams across logistics, compliance, and service delivery. … cloud-native, global teams, remote-first). Strong familiarity with blockchain/Web3 ecosystems, including decentralized networks, compliance nuances, and ecosystem partnerships. Exposure to AI/ML or compute infrastructure operations is a significant plus (e.g., managing GPU resources, vendor logistics, or infrastructure procurement). Understanding of open-source community dynamics and developer ecosystem needs. Experience leading remote More ❯

Senior Data Scientist

City of London, London, United Kingdom
Xcede
data insights to define and evolve pricing approaches Partner with stakeholders to run and analyse effective pricing experiments Collaborate with product, marketing, finance and ops teams Directly shape pricing infrastructure, monitoring customer and financial performance Track competitor movement and customer needs to inform decisions You’ll need: Proven experience leading pricing or monetisation analytics at a top-tier More ❯

Senior Data Scientist

London Area, United Kingdom
Xcede
data insights to define and evolve pricing approaches Partner with stakeholders to run and analyse effective pricing experiments Collaborate with product, marketing, finance and ops teams Directly shape pricing infrastructure, monitoring customer and financial performance Track competitor movement and customer needs to inform decisions You’ll need: Proven experience leading pricing or monetisation analytics at a top-tier More ❯

Senior Data Scientist

South East London, England, United Kingdom
Xcede
data insights to define and evolve pricing approaches Partner with stakeholders to run and analyse effective pricing experiments Collaborate with product, marketing, finance and ops teams Directly shape pricing infrastructure, monitoring customer and financial performance Track competitor movement and customer needs to inform decisions You’ll need: Proven experience leading pricing or monetisation analytics at a top-tier More ❯

ETL Developer

City, London, United Kingdom
Deerfoot Recruitment Solutions Limited
ensuring that critical business services have the capacity and resilience to meet regulatory and operational demands. You will focus on developing robust data pipelines and solutions that support capacity monitoring, reporting, and forecasting across complex IT environments. Key Responsibilities: Design, develop, and maintain ETL processes to collect, transform, and load capacity and infrastructure data from multiple sources. Collaborate … with cross-functional teams and DBAs to ensure seamless data integration and delivery. Analyse and normalise raw infrastructure and monitoring data for reporting, trending, and predictive analytics. Develop and implement automated methods to forecast future capacity requirements using historical data. Support compliance with regulatory standards (including DORA) by providing accurate, transparent, and timely capacity data. Create clear technical … contribute to ongoing process improvement. Technical Requirements: Proven expertise in ETL (Extract, Transform, Load) development. Strong hands-on experience with SQL Server and data engineering tools. Solid understanding of infrastructure monitoring, capacity management, and data analytics. Ability to work with large, complex datasets and automate data workflows. Excellent problem-solving, communication, and stakeholder management skills. Desirable: Experience in More ❯
Employment Type: Contract
Rate: GBP Annual

ETL Developer

London, Coleman Street, United Kingdom
Deerfoot Recruitment Solutions Limited
ensuring that critical business services have the capacity and resilience to meet regulatory and operational demands. You will focus on developing robust data pipelines and solutions that support capacity monitoring, reporting, and forecasting across complex IT environments. Key Responsibilities: Design, develop, and maintain ETL processes to collect, transform, and load capacity and infrastructure data from multiple sources. Collaborate … with cross-functional teams and DBAs to ensure seamless data integration and delivery. Analyse and normalise raw infrastructure and monitoring data for reporting, trending, and predictive analytics. Develop and implement automated methods to forecast future capacity requirements using historical data. Support compliance with regulatory standards (including DORA) by providing accurate, transparent, and timely capacity data. Create clear technical … contribute to ongoing process improvement. Technical Requirements: Proven expertise in ETL (Extract, Transform, Load) development. Strong hands-on experience with SQL Server and data engineering tools. Solid understanding of infrastructure monitoring, capacity management, and data analytics. Ability to work with large, complex datasets and automate data workflows. Excellent problem-solving, communication, and stakeholder management skills. Desirable: Experience in More ❯
Employment Type: Contract

Quant Systems Developer

London Area, United Kingdom
Quant Capital
SYSTEMS DEVELOPER – High Performance Trading Infrastructure London - Hybrid £100,000-150,000 + bonus Quant Capital is recruiting on behalf of a leading global quant investment firm seeking a Systems Developer to join its high-impact technology group in London. This role is part of a core engineering team focused on building and maintaining the backbone infrastructure supporting … tools for managing system performance, reliability, and software lifecycles (SDLC) Tackle performance bottlenecks across kernel, file system, and network layers to support low-latency, high-throughput applications Contribute to infrastructure monitoring, data collection pipelines, and real-time analytics Collaborate across teams to align system design with evolving trading and research requirements Skills and Experience: A strong interest in More ❯

Quant Systems Developer

City of London, London, United Kingdom
Quant Capital
SYSTEMS DEVELOPER – High Performance Trading Infrastructure London - Hybrid £100,000-150,000 + bonus Quant Capital is recruiting on behalf of a leading global quant investment firm seeking a Systems Developer to join its high-impact technology group in London. This role is part of a core engineering team focused on building and maintaining the backbone infrastructure supporting … tools for managing system performance, reliability, and software lifecycles (SDLC) Tackle performance bottlenecks across kernel, file system, and network layers to support low-latency, high-throughput applications Contribute to infrastructure monitoring, data collection pipelines, and real-time analytics Collaborate across teams to align system design with evolving trading and research requirements Skills and Experience: A strong interest in More ❯

Infrastructure Monitoring
London
10th Percentile: £61,250
25th Percentile: £75,000
Median: £87,500
75th Percentile: £102,813
90th Percentile: £107,500