Infrastructure Monitoring Jobs in London

1 to 25 of 41 Infrastructure Monitoring Jobs in London

Principal Infrastructure Engineer

London, England, United Kingdom
Markerstudy Group
Role Purpose: The Principal Infrastructure Engineer is responsible for leading the design, implementation, and optimization of complex infrastructure environments. This role requires a seasoned professional with extensive experience in infrastructure engineering, capable of driving strategic initiatives and ensuring the highest levels of performance, security, and reliability. Key Accountabilities and Responsibilities: Leadership and Strategy Lead the planning, installation … maintenance, and acceptance of infrastructure components and services, including physical and virtual servers (Windows Server, Linux). Develop and implement infrastructure strategies that align with organizational goals and service expectations. Drive the adoption of tools and processes for effective operational management and delivery. Infrastructure Design and Implementation: Design and implement robust architectures, ensuring scalability, security, and performance. … Oversee the configuration and deployment of infrastructure components, including servers, storage solutions (SAN, NAS, cloud storage), and virtualization platforms (Nutanix). Ensure compliance with industry standards and best practices. Operational Excellence: Monitor infrastructure performance, load, and security metrics using advanced management tools (e.g., System Center, vRealize, Nagios). Investigate and resolve complex infrastructure issues, ensuring minimal downtime More ❯
Posted:

Senior DevOps Engineer (Mainframe) 678

London, England, United Kingdom
Protegrity USA, Inc
and ability to maintain continuous integration, delivery, and deployment (CI/CD) process for a complex set of software requirements and products spread across multiple platforms. Monitor and manage infrastructure, ensuring optimal performance, security, and scalability. Define and develop, test, release, update, and support processes for DevOps operations. Troubleshoot and resolve issues related to application development, deployment, and operations. … Strong knowledge of Shell Scripting and any other programming languages such as Python, C, Groovy, Java , YAML. Experience working on Linux infrastructure. Familiarity with IBM z/OS-based infrastructure is advantageous, particularly in hybrid enterprise environments. Experiential learning on Infrastructure as Code tools (like Terraform, Ansible). Hands-On experience on Container & Container Orchestration tools like Docker … AWS ECS, Kubernetes and infrastructure monitoring tools like Prometheus and Grafana. Experience with designing, building, and maintaining cloud-native applications across major cloud platforms such as AWS, Azure or GCP is a strong plus. Knowledge of Data Protection, Privacy and Security domain. · Understanding of agile methodologies and principles. Knowledge of databases and SQL. Excellent communication and collaboration skills More ❯
Posted:

Lead DevOps

London, England, United Kingdom
Smartedgesolutions
GitLab CI), configuration management tools (Ansible, Puppet), and containerization technologies (Docker, ECS, Kubernetes) Monitor system performance, identify bottlenecks, and implement optimizations to improve reliability and efficiency Develop and maintain Infrastructure as Code (IaC) using Terraform, Ansible, AWS CloudFormation, ensuring consistency, repeatability, and compliance Identify and automate application deployment, scaling, and security processes, reducing manual effort and improving reliability Work … Expertise in Docker, ECS, EKS, Kubernetes, implementing security best practices like image vulnerability scanning, Kubernetes RBAC, IAM Roles for Service Accounts (IRSA), Pod Security Policies, and automated compliance enforcement Infrastructure Automation & Security: Strong experience in IaC tools (Terraform, CloudFormation, Ansible), applying least privilege IAM policies, role-based access controls (RBAC), automated compliance checks, and zero-trust security principles Monitoring, Logging & Alerting: Expertise in building centralized logging solutions, integrating ELK Stack, Prometheus, Grafana, Splunk, and AWS-native security monitoring tools such as CloudWatch, Security Hub, SIEM integrations CI/CD Security & Automation: Proficient in Jenkins, Git, GitHub Actions, ensuring secure CI/CD pipelines with artifact encryption, automated security scanning, and DevSecOps best practices Cloud-Based Database Security More ❯
Posted:

Senior DevOps Engineer 678

London, England, United Kingdom
Hybrid / WFH Options
Protegrity
platforms. You will receive comprehensive training on existing zOS product applications and assembly processes, with a focus on developing and implementing CI/CD capabilities tailored for mainframe systems. Monitoring and managing infrastructure, ensuring optimal performance, security, and scalability. Defining and setting development, test, release, update, and support processes for DevOps operations. Troubleshooting and resolving issues related to … Strong knowledge of Shell Scripting and any other programming languages such as Python, C, Groovy, Java, YAML. Experience working on Linux infrastructure. Familiarity with IBM z/OS-based infrastructure is advantageous, particularly in hybrid enterprise environments. Experiential learning on Infrastructure as Code tools (like Terraform, Ansible). Hands-On experience on Container & Container Orchestration tools like Docker … AWS ECS, Kubernetes and infrastructure monitoring tools like Prometheus and Grafana. Experience with designing, building, and maintaining cloud-native applications across major cloud platforms such as AWS, Azure or GCP is a strong plus. Excellent communication and collaboration skills, as well as the ability to work effectively in cross-functional teams including nearshore and offshore. Why Choose Protegrity More ❯
Posted:

Senior Azure Cloud Engineer

London, England, United Kingdom
Netcompany
Engineer, you will be part of a dynamic team responsible for designing, implementing, and managing cloud solutions for our prestigious clients. You will leverage your expertise in Azure and Infrastructure as Code to deliver robust, scalable, and secure cloud environments. This role offers significant opportunities for professional growth and the chance to work on some of the largest, most …/Experience Essential Minimum of 3 years' extensive experience with Azure cloud services, including advanced resource management, networking, and security implementations 2 years' hands-on experience with Terraform and Infrastructure as Code methodologies in enterprise environments Proven track record of architecting and implementing complex cloud solutions at scale Advanced troubleshooting skills with the ability to diagnose and resolve intricate … technical issues in production environments Strong knowledge in at least one scripting/programming language (PowerShell, Python, Bash) with demonstrable complex automation implementations Experience designing and implementing robust monitoring solutions and advanced alerting strategies Strong background in cloud security principles and implementation of security controls in Azure environments Demonstrated ability to lead technical discussions with stakeholders and translate business More ❯
Posted:

IT Engineering Infrastructure Engineer

London, England, United Kingdom
Hybrid / WFH Options
Surrey Satellite Technology Ltd
This role is an exciting opportunity to join the IT Infrastructure team with responsibility for system architecture, providing technical expertise in implementation, maintenance and day to day administration of SSTL corporate and satellite ground station infrastructure. Alongside new and exciting on-prem and cloud projects on our corporate infrastructure, SSTL IT Infrastructure engineers have real involvement in … influencing how satellite mission infrastructure is architected, implemented and deployed. This role involves interaction with other technical leads across multiple disciplines to generate technical solutions for innovative projects. Key Tasks Systems Architecture & Security : Develop and review architecture, assess new technologies, and recommend enhancements to ensure robust security and efficiency. Infrastructure Implementation : Execute new infrastructure services, manage project … management processes. System Modernisation & Automation : Modernise existing systems, focusing on implementing automation to enhance operational efficiency. Scripting & Templates : Create scripts and templates for repeatable machine builds to streamline deployments. Infrastructure Maintenance : Conduct routine maintenance, including firmware updates, system upgrades, and virtual infrastructure management, aiming to automate processes where feasible. 3rd Line Support : Provide advanced support within the Service More ❯
Posted:

Azure Cloud Engineer (Security Clearance Required)

London, England, United Kingdom
Netcompany
Engineer, you will be part of a dynamic team responsible for, implementing, managing and maintaining cloud solutions for our prestigious clients. You will leverage your expertise in Azure and Infrastructure as Code to deliver robust, scalable, and secure cloud environments. This role offers significant opportunities for professional growth and the chance to work on some of the largest, most … industry. Key Skills/Experience Essential Minimum of 18 months experience with Azure cloud services, including advanced resource management, networking, and security implementations Hands-on experience with Terraform and Infrastructure as Code methodologies in enterprise environments, ability to create and modify infrastructure. Strong troubleshooting skills with the ability to diagnose and resolve intricate technical issues in production environments Experience … in at least one scripting/programming language (PowerShell, Python, Bash) with demonstrable complex automation implementations Experience implementing robust monitoring solutions and advanced alerting strategies Strong background in cloud security principles and implementation of security controls in Azure environments Experience mentoring team mates and contributing to team knowledge growth Strong analytical mindset with exceptional problem-solving abilities Must be More ❯
Posted:

Software Architect

London, England, United Kingdom
Verint
solutions for third-party products Experience with Agile and DevOps methodologies Experience with Linux operating system Experience with relational and NoSQL databases (Postgres, Dynamo and others) Familiarity with application monitoring, infrastructure monitoring and log aggregation tools like Datadog. Experience managing source code control and CI/CD tools like GitHub, Jenkins or similar Experience with developing multi More ❯
Posted:

Infrastructure Engineer

London, England, United Kingdom
Hybrid / WFH Options
Story Terrace Inc
we are already serving some of the world’s largest companies, providing them with superior credit protection and innovative risk management technology About the role: We’re seeking an Infrastructure Engineerto join our technical team. This role is ideal for someone with 2+ years of experience who is passionate about maintaining and improving robust infrastructure systems while contributing … to data management initiatives. You’ll work closely with our CTO and engineering team to ensure our infrastructure remains reliable, cost-efficient, and adaptable to meet the needs of high-value users. This is a fantastic opportunity to join a fast-growing, well-funded start-up that is disrupting the global credit insurance industry, growing and developing with the … company. Responsibilities: Maintain and improve existing infrastructure built on AWS ECS with observability tools in place Improve automation for deployments and infrastructure management Collaborate with development teams to streamline the CI/CD pipeline Maintain and enhance monitoring and alerting systems Monitor system performance to ensure reliability for high-value users Proactively manage alerts, licensing, costs and More ❯
Posted:

Infrastructure Engineer

London, England, United Kingdom
Hybrid / WFH Options
Bondaval
Join to apply for the Infrastructure Engineer role at Bondaval Continue with Google Continue with Google Join to apply for the Infrastructure Engineer role at Bondaval We Look For Character Over Credentials We’re a specialist credit and surety underwriter transforming B2B credit protection with technology-enabled solutions (website) We Look For Character Over Credentials We’re a … we are already serving some of the world’s largest companies, providing them with superior credit protection and innovative risk management technology About The Role We’re seeking an Infrastructure Engineer to join our technical team. This role is ideal for someone with 2+ years of experience who is passionate about maintaining and improving robust infrastructure systems while … contributing to data management initiatives. You’ll work closely with our CTO and engineering team to ensure our infrastructure remains reliable, cost-efficient, and adaptable to meet the needs of high-value users. This is a fantastic opportunity to join a fast-growing, well-funded start-up that is disrupting the global credit insurance industry, growing and developing with More ❯
Posted:

Cloud Engineer (Full time - Remote Europe)

London, England, United Kingdom
Hybrid / WFH Options
Ikerian
are collectively shaping the future of healthcare. We are looking for a Cloud Engineer to join our team, where you will play a critical role in optimising our cloud infrastructure to meet our customers' needs. This position is pivotal to our mission of providing a seamless, secure, and scalable experience on our platform About Us Ikerian AG (formerly RetinAI … are collectively shaping the future of healthcare. We are looking for a Cloud Engineer to join our team, where you will play a critical role in optimising our cloud infrastructure to meet our customers' needs. This position is pivotal to our mission of providing a seamless, secure, and scalable experience on our platform Key Responsibilities Enhance and maintain highly … available, secure, and scalable AWS cloud environments and services. Manage and prioritise tasks in the cloud infrastructure backlog to address immediate needs and plan long-term improvements. Set up infrastructure monitoring and observability solutions, proactively addressing availability, performance or security issues. Assess new technologies, systems, and services for production readiness, ensuring seamless and stable integration. Prepare and More ❯
Posted:

IT Support Engineer / Service Desk Analyst / IT Technician

London, England, United Kingdom
AWD online
volume calls, to specific SLAs Experience of working in a structured ISO27001 environment with specific awareness of Security Incident processes Experience working within a Managed Service Provider (MSP) PTRG monitoring or infrastructure monitoring toolsets Windows Server Administration BENEFITS Annual Salary up to £32,000 per annum Working in an inclusive environment Industry renowned training/certifications (sponsored More ❯
Posted:

IT Support Engineer / Service Desk Analyst / IT Technician

City of London, London, England, United Kingdom
Hybrid / WFH Options
AWD online
volume calls, to specific SLAs Experience of working in a structured ISO27001 environment with specific awareness of Security Incident processes Experience working within a Managed Service Provider (MSP) PTRG monitoring or infrastructure monitoring toolsets Windows Server Administration BENEFITS Annual Salary up to £32,000 per annum Working in an inclusive environment Industry renowned training/certifications (sponsored More ❯
Employment Type: Full-Time
Salary: £32,000 per annum
Posted:

Software Engineer

London, England, United Kingdom
Park Place Technologies
Knowledge of network technologies and device types (routers, switches, load balancers, etc.). Preferred Qualifications: Network device vendor certifications (CCNA, JNCIA). Experience of operating and/or supporting Infrastructure Monitoring tools. Travel: 0% #J-18808-Ljbffr More ❯
Posted:

Infrastructure Monitoring Engineer

London, England, United Kingdom
Hybrid / WFH Options
Cloud Decisions
Infrastructure Monitoring & Support Engineer - 3 Positions Available! To £45,000 + Excellent Benefits (Negotiable for the right person) Hybrid Role with an Exceptional Number of Days off Work Location: London (SE1) or Newport in Wales (NP10) Enterprise Microsoft Consulting Partner | Rapidly growing organization - 50% revenue growth last year | Enviable Company Culture with an Inclusive and Fun Atmosphere “Without … for enterprise level customers, this diverse and multi-skilled partner have fully submerged themselves within the Cloud, giving you exposure to a wide range of cutting-edge technologies. “The Infrastructure Monitoring Engineer – The heartbeat of the support team” The Infrastructure Monitoring Team is responsible for maintaining the health of the organization's estate (both customer and … internal). This is achieved by maintaining a 24/7/365 engineer presence and responding to proactive and real-time alerting from a Remote Monitoring and Management (RMM) platform and carrying our investigation and remediation for areas covering availability, network health, storage capacity, antivirus, patching, backup jobs, performance, hardware issues and other infrastructure related issues. In More ❯
Posted:

Senior Infrastructure Engineer - Managed Systems Operability London, GBR

London, England, United Kingdom
Bloomberg L.P
combine Appliance and Datacenter server environments regardless of the underlying infrastructure. The team uses the latest automation technologies to reduce operational toil, improve the stability of our mission critical infrastructure services, and to facilitate maintenance of Bloomberg's server fleet at scale. You'll work closely with Software Developers, Frontline teams, and other key stakeholders to ensure that we … flow charts & procedures, perform capacity planning, and implement changes Diagnose and resolve critical system issues Continuously refine processes and procedures with a focus on standardization and automation. Enhance our monitoring and alerting solutions You’ll Need To Have: Experience programming in Python (or related language) and a good understanding of software development methodologies, open source systems, and familiarity with … and share knowledge We’d Love To See: RHEL System Administrator level of competency An understanding of some or all of the following: configuration management, orchestration, CI/CD, infrastructure monitoring and telemetry Experience using Agile (e.g. Kanban or Scrum) Familiarity with telemetry tools such as Splunk, Grafana Experience with Web frameworks (BENTO, REACT, Angular, DJANGO) Bloomberg is More ❯
Posted:

Observability/ Monitoring Engineer - Grafana Dashboarding

London Area, United Kingdom
Levy Global
We’re seeking an experienced contractor to support the delivery of observability solutions for a new, large-scale infrastructure environment. This role focuses on developing insightful and automated Grafana dashboards, with a strong emphasis on data integration and actionable telemetry. Required Skills Excellent, concise communication skills - essential for collaborating with technical teams to shape observability outputs. Deep experience with … Bonus/Nice-to-Have Skills: Experience deploying Grafana instances via code (provisioning dashboards, datasources). Familiarity with OpenTelemetry, metric instrumentation, and telemetry pipelines. Background in data center environments, infrastructure monitoring, or SRE practices. Exposure to CI/CD workflows, containers (Podman/Docker), and cloud-native systems. More ❯
Posted:

Observability/ Monitoring Engineer - Grafana Dashboarding

City of London, London, United Kingdom
Levy Global
We’re seeking an experienced contractor to support the delivery of observability solutions for a new, large-scale infrastructure environment. This role focuses on developing insightful and automated Grafana dashboards, with a strong emphasis on data integration and actionable telemetry. Required Skills Excellent, concise communication skills - essential for collaborating with technical teams to shape observability outputs. Deep experience with … Bonus/Nice-to-Have Skills: Experience deploying Grafana instances via code (provisioning dashboards, datasources). Familiarity with OpenTelemetry, metric instrumentation, and telemetry pipelines. Background in data center environments, infrastructure monitoring, or SRE practices. Exposure to CI/CD workflows, containers (Podman/Docker), and cloud-native systems. More ❯
Posted:

Network Operations Engineer

City of London, London, United Kingdom
Alexander Ash Consulting
Alexander Ash is seeking Network Operations Engineers on behalf of our global, technology-driven financial services client. As part of the global infrastructure team, you will play a key role in managing and maintaining the firm’s network infrastructure and automation systems. The team’s mission is to ensure the availability, reliability, and security of the platform while … driving operational excellence. Key responsibilities include: Supporting and enhancing existing network infrastructure Developing observability tools and self-healing/event-driven automation Performing advanced troubleshooting and incident resolution Contributing to the evolution of a high-performance compute datacentre Skills Required: Proficient in monitoring and resolving incidents across diverse environments Strong diagnostic skills in network infrastructure, collaborating closely … with vendor support teams for in-depth investigations when needed Lead the creation and enhancement of monitoring dashboards, proactively addressing alerts based on priority and driving continuous improvements in alerting mechanisms and system observability Facilitate post-incident reviews to identify and implement improvements that enhance infrastructure reliability and availability Implement BAU changes with a focus on automation, fostering More ❯
Posted:

Network Operations Engineer

London Area, United Kingdom
Alexander Ash Consulting
Alexander Ash is seeking Network Operations Engineers on behalf of our global, technology-driven financial services client. As part of the global infrastructure team, you will play a key role in managing and maintaining the firm’s network infrastructure and automation systems. The team’s mission is to ensure the availability, reliability, and security of the platform while … driving operational excellence. Key responsibilities include: Supporting and enhancing existing network infrastructure Developing observability tools and self-healing/event-driven automation Performing advanced troubleshooting and incident resolution Contributing to the evolution of a high-performance compute datacentre Skills Required: Proficient in monitoring and resolving incidents across diverse environments Strong diagnostic skills in network infrastructure, collaborating closely … with vendor support teams for in-depth investigations when needed Lead the creation and enhancement of monitoring dashboards, proactively addressing alerts based on priority and driving continuous improvements in alerting mechanisms and system observability Facilitate post-incident reviews to identify and implement improvements that enhance infrastructure reliability and availability Implement BAU changes with a focus on automation, fostering More ❯
Posted:

Solutions Engineer - 32882

London, England, United Kingdom
Splunk Inc
data access and management at scale with AI. These solutions include Asset and Risk intelligence, Attack Analysis, Orchestration Automation and Response, User Behavior Analytics, SIEM Enterprise Security, Application Performance Monitoring, Infrastructure Monitoring, Log Analysis, Incident Response, Network Monitoring, Business Risk Observability, AIOps, Digital Experience Monitoring and the ecosystem continues to expand and integrate at a … demonstration skills with an ability to use stories effectively. Ability to actively listen. Goodorganisational, prioritisation and time management skills. Technical Knowledge: Linux or Windows knowledge Basic network connectivity troubleshooting Infrastructure as aService platforms: AWS, GCP and/or Azure IT architecture concepts such as High Availability, Disaster Recovery Desirable Knowledge and Experience: Experience selling SaaS services or data solutions More ❯
Posted:

Network Engineer

London, England, United Kingdom
Piran Technologies Ltd
Advanced experience and knowledge of Active Directory Experience and knowledge of virtualisation platforms, VMware and/or Hyper-V Experience and knowledge of wireless technology Experience and knowledge of infrastructure monitoring technology Experience with Windows server software (all versions), and Linux Experience in configuring and troubleshooting high availability servers and storage (DA/SAN) Experience in cloud services More ❯
Posted:

Technical Product Manager

London, England, United Kingdom
ZipRecruiter
Job Description Product Manager – Infrastructure Monitoring We’re looking for a skilled and strategic Product Manager to take ownership of the Infrastructure Monitoring domain within a cutting-edge, full-stack observability platform. This role is central to building a robust foundation that empowers both Observability and Security product teams, while delivering a seamless and powerful experience … for end users. If you have experience in observability, security products, or cloud infrastructure, and you're passionate about shaping high-impact platform capabilities, this is your opportunity to make a difference. What You’ll Do Own and drive the full product lifecycle—from discovery and roadmap planning to execution and launch. Work closely with design and engineering to … create intuitive, scalable infrastructure monitoring capabilities. Understand and represent customer and user needs, balancing long-term vision with near-term priorities. Stay on top of market trends, evolving technologies, and competitive landscapes to inform product strategy. Collaborate cross-functionally with marketing, sales, and support to ensure go-to-market success and optimize product adoption. What You Bring 5+ More ❯
Posted:

DevOps Engineer

London, England, United Kingdom
Hybrid / WFH Options
Octopus Money
DevOps Engineer (Engineer 2) , you’ll play a key role in scaling and securing our cloud infrastructure. You’ll work alongside experienced platform engineers to improve our deployment pipelines, infrastructure-as-code, monitoring, and cost efficiency—while ensuring the systems we build are resilient, secure, and easy to manage. This is a great opportunity for someone ready to … deepen their expertise in cloud platforms, CI/CD automation, observability, and infrastructure design. You’ll be part of a collaborative engineering culture that encourages continuous learning and pragmatic problem-solving. About Octopus Money At Octopus Money, we’re on a mission to make money advice accessible to all – because the right advice can turn your life dreams into … you'll be responsible for Platform Modernisation: Contribute to the migration of build and deployment pipelines to GitHub Actions, helping us consolidate and improve our CI/CD strategy. Infrastructure as Code: Help standardise our infrastructure using Terraform, ensuring environments are consistent, auditable, and scalable. Containerised Workloads on ECS: Support and improve our usage of Amazon ECS for More ❯
Posted:

Security Operations Lead

London, United Kingdom
Hybrid / WFH Options
Square Enix Co Ltd
Job Summary: The Security Operations Lead is responsible for our security monitoring and incident response capabilities within the Square Enix Cyber Security team (covering Europe and North America). The primary goals of the role are the timely detection of security incidents, effective response and the continuous improvement of our preventative and detective controls. This role will work alongside … Maintaining and optimising our Cyber Security tools and platforms to continuously improve our detection and response capability. Supporting the management, administration and support of our SIEM platform, including general infrastructure and system administration, troubleshootingand user access management Maintaining and tuning security detections and alerts within our SIEM platform. Onboarding and managing security log sources for our SIEM platform, including … software development. Experience responding to or handling major cyber security incidents and following common response frameworks. Experience within the gaming industry providing security operations support to game releases, game infrastructure monitoring and live game operations. Strong appreciation of the cyber threat landscape and attacker tactics, techniques and procedures. Experience developing operational processes and playbooks. Desirable Interpersonal Skills: Ability More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:
Infrastructure Monitoring
London
10th Percentile
£61,250
25th Percentile
£75,000
Median
£87,500
75th Percentile
£102,813
90th Percentile
£107,500