Proactive Monitoring Jobs in the UK

351 to 375 of 597 Proactive Monitoring Jobs in the UK

DevOps Engineer

Glasgow, Scotland, United Kingdom
ELLIOTT MOSS CONSULTING PTE. LTD
CD pipelines using GitHub Actions, ensuring fast and reliable software delivery. · Manage containerized applications using Docker, Kubernetes, Amazon EKS, and Helm. · Administer and enhance observability using log aggregation and monitoring tools such as CloudWatch, Splunk, and Datadog. · Maintain and manage artifact repositories (e.g., JFrog Artifactory) and ensure effective dependency management. · Automate and streamline system operations, deployments, and workflows to … that align with business and technical goals. · Advocate for and implement best practices in DevOps, Site Reliability Engineering (SRE), and Software Engineering. · Ensure infrastructure security, scalability, and resilience through proactive monitoring, patching, and maintenance. · Contribute to knowledge sharing and mentoring of junior team members on DevOps practices and tools. Requirements · software engineering experience with proficiency in at least … 3+ years of practical experience with AWS cloud services and infrastructure management. AWS certifications are advantageous. · Strong experience with Infrastructure as Code tools (Terraform, CloudFormation) · Familiarity with observability and monitoring tools (CloudWatch, Splunk, Datadog). · Experience managing CI/CD workflows, especially with GitHub Actions. · Strong knowledge of artifact repository management systems like JFrog. · Proficient in Linux administration and More ❯
Posted:

Security Analyst

Manchester, England, United Kingdom
ANS Group
Alerts in Sentinel Tuning security alerting and prevention rulesets Managing security technologies Communicating with clients during an incident (email and phone) On-boarding of customers into Managed XDR services Monitoring and responding to SOC alerts Assisting with compromise investigations and e-discovery requests Liaising with clients via ticket and phone call and also during incidents Reviewing vulnerability scans and … providing remediation advice to clients Proactive monitoring of clients solutions Assisting Sales/Account managers with client security What will I bring to the role? A good understanding of the following technologies is required: Previous Security engineering or Security Operations experience, ideally within another MSSP Microsoft sentinel Microsoft Defender A good understanding of Microsoft Cloud Technologies Experience with More ❯
Posted:

Machine Learning Operations Engineer - UK

London, England, United Kingdom
Hybrid / WFH Options
DraftWise
Auto-Evaluation Framework: Develop and deploy an automated system that continuously monitors and validates our LLM outputs against stringent quality thresholds. Incorporate real-time performance metrics, anomaly detection, and proactive remediation triggers—ensuring that any dip in response quality is promptly identified and corrected. Integrate feedback loops that sustain our commitment to excellence, thereby ensuring our platform maintains consistently … distributes API calls across multiple LLM endpoints. Implement robust fallback mechanisms to automatically reroute traffic when an LLM instance is down or reaching usage quotas. Integrate real-time health monitoring and performance analytics to assess endpoint availability and adjust routing in real time. Deploying and Maintaining Proprietary LLM Models with Training and Fine-Tuning Support: Deploy and scale in … continuous model improvement and adaptation. Collaborate closely with ML and NLP teams to integrate the latest techniques, rigorous evaluation metrics, and industry best practices into the training workflow. Implement proactive monitoring and maintenance protocols to quickly identify and address performance bottlenecks or operational issues. About You We're looking for teammates who possess: Proven expertise in Python with More ❯
Posted:

Senior Network Engineer

United Kingdom
Square One Resources
refreshes and migrations from Cisco to Arista, ensuring minimal downtime in production environments. Collaborate with global teams to align network delivery with broader CI/CD pipelines, and implement proactive monitoring and remediation frameworks. Provide senior-level L3 support, mentor junior engineers, and drive best practices in operational excellence. Essential Skills & Experience Strong hands-on experience with Arista … 2k–9k) and migrations to Arista. Exposure to network security platforms (Palo Alto, Fortinet, F5). Familiarity with CI/CD pipelines and infrastructure-as-code principles. Experience with monitoring tools (SolarWinds, PRTG, Logic Monitor) and SNOW for incident/change management. Relevant certifications (CCNP R&S, CCNP DC, PCNSA, ACI Specialist, or CCIE in progress) are a strong More ❯
Posted:

Senior Network Engineer

London, England, United Kingdom
Square One Resources
refreshes and migrations from Cisco to Arista, ensuring minimal downtime in production environments. Collaborate with global teams to align network delivery with broader CI/CD pipelines, and implement proactive monitoring and remediation frameworks. Provide senior-level L3 support, mentor junior engineers, and drive best practices in operational excellence. Essential Skills & Experience Strong hands-on experience with Arista … 2k–9k) and migrations to Arista. Exposure to network security platforms (Palo Alto, Fortinet, F5). Familiarity with CI/CD pipelines and infrastructure-as-code principles. Experience with monitoring tools (SolarWinds, PRTG, Logic Monitor) and SNOW for incident/change management. Relevant certifications (CCNP R&S, CCNP DC, PCNSA, ACI Specialist, or CCIE in progress) are a strong More ❯
Posted:

Junior NOC L1 Engineer

London, England, United Kingdom
Hybrid / WFH Options
Smartnumbers
into the First Line queue, management of new customer orders, facilitating customer on-boarding, scheduling of ports and ensuring the smooth running of ports for customers and assisting with proactive monitoring. Customer satisfaction is critical to the success of this role; anything you do must have the customer experience in mind. Responsibilities include, but are not limited to: Working … the severity of issues and initiating appropriate troubleshooting procedures Basic Troubleshooting: Diagnose and resolve straightforward network issues, escalating more complex problems to higher-tier engineers when necessary Call Health Monitoring: Utilise advanced monitoring tools to continuously track Call health, identifying potential issues and bottlenecks Documentation: Maintain detailed records of incidents, actions taken, and solutions applied, ensuring transparency and … Customer Communication: Provide clear and concise updates to customers and internal stakeholders regarding incident status and resolution progress Champion the customer as part of the delivery process, taking a proactive approach to all aspects of customer delivery Train internal and external customers on the use of our products and services Work on projects set by the Manager as well More ❯
Posted:

Equities Algo Trader

London, United Kingdom
Hybrid / WFH Options
Citibank (Switzerland) AG
You'll collaborate closely with the Electronic Sales Trading team to oversee the deployment and configuration of client-specific algorithmic trading strategies, ensuring their performance and stability. This includes proactive monitoring, troubleshooting, and direct support to sales traders. You will also identify and integrate corporate actions and new securities into Citi's algorithmic trading framework, as well as … You'll collaborate closely with the Electronic Sales Trading team to oversee the deployment and configuration of client-specific algorithmic trading strategies, ensuring their performance and stability. This includes proactive monitoring, troubleshooting, and direct support to sales traders. You will also identify and integrate corporate actions and new securities into Citi's algorithmic trading framework, as well as More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Senior DevOps Engineer

Woking, England, United Kingdom
Fletcher Chase
implement Infrastructure as Code solutions that set industry standards Build resilient CI/CD pipelines using Bitbucket and Spacelift orchestration Develop sophisticated observability strategies with Grafana , CloudWatch , and advanced monitoring tools Leadership & Growth Opportunities Mentor emerging DevOps talent and shape team culture Influence architectural decisions across cross-functional teams Drive strategic initiatives that align technical excellence with business outcomes … Recent AWS experience with enterprise-scale deployments CI/CD Mastery : Advanced experience with Jenkins, Bitbucket Pipelines, and orchestration tools Observability : Hands-on expertise with Grafana, Splunk, CloudWatch for proactive monitoring Leadership & Delivery: Proven track record architecting scalable, secure infrastructure solutions Experience implementing advanced security measures across DevOps workflows Large-scale project management and delivery experience Strong mentoring More ❯
Posted:

Manager, SRE

London, England, United Kingdom
GroupM
and conduct regular performance reviews. Define and implement reliability standards. Develop and improve incident management processes in alignment with engineering support, ensuring effective resolution and root cause analysis. Drive proactive monitoring, alerting, and automation to minimize downtime and improve system reliability. Lead efforts to eliminate single points of failure. Collaborate with DevOps practice to ensure best practices to … related roles, with some experience in a leadership or managerial position. Strong knowledge of cloud platforms (AWS, GCP, Azure) and modern infrastructure technologies (Kubernetes, Docker, Terraform). Expertise in monitoring, logging, and observability tools (e.g., Prometheus, Grafana, Datadog, Splunk). Proficiency in at least one programming or scripting language (e.g., Python, Go, Bash). Deep understanding of networking, databases More ❯
Posted:

Senior Network Engineer

Bath, Somerset, South West, United Kingdom
Hybrid / WFH Options
TrueSpeed
Truespeed's network and interconnections Proactively monitor performance and implement continuous improvements Diagnose complex issues, lead root cause analysis, and deliver long-term fixes Drive the implementation of network monitoring and automation tools Collaborate on the network roadmap and infrastructure changes Lead post-incident reviews and maintain technical documentation Manage capacity planning, resilience, and customer onboarding Mentor junior engineers … days annual leave + bank holidays Employee referral bonus scheme Free on-site parking Wellbeing benefits including 24/7 online GP The Ideal Candidate You're a proactive, technically proficient professional with extensive experience in ISP or data centre environments. You excel at tackling complex challenges, designing scalable systems, and mentoring team members along the way. About you … optical Ethernet, BGP, ISIS, and IP routing Strong understanding of FTTH technologies and fibre troubleshooting Experience with network hardware from leading vendors Hands-on use of network automation and monitoring tools Strong communication and leadership capabilities Track record in technical team supervision Excellent documentation and analytical skills Certifications like CCNA or CCIE (desirable but not essential) Be part of More ❯
Employment Type: Permanent, Work From Home
Salary: £55,000
Posted:

IT Specialist - SAP Infrastructure & Platforms

Belfast, United Kingdom
Airbus
that will enable a smoother migration to a future SAP solution owned by Airbus . This requires a strategic mindset, the ability to see the big picture, and a proactive approach to long-term planning. Service Performance Management (Transition Period): During the transition, you will assess the IT services received from the existing supplier and contribute to the tracking … collaboration with them. Project & Activity Leadership: Take the lead on various infrastructure and platform-related projects and activities, ensuring successful delivery and alignment with strategic goals. SAP System Administration & Monitoring: Administer and maintain multiple SAP instances (development, test, and production) within the SAP environment. Proactively monitor system health, performance, and stability. HANA Landscape Administration: Administer and monitor the SAP … understanding and hands-on experience with change management and transport processes within SAP. Experience in coordinating and managing end-to-end incident resolution for complex SAP environments. Proficiency in monitoring and configuring SAP Basis settings, interface creation, and DDIC changes. Demonstrable experience in the administration and monitoring of SAP HANA landscapes . Solid understanding of the technical architecture More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Senior Network Engineer - Banking & Finance

London, United Kingdom
Hamilton Barnes Associates Limited
learn more about this opportunity, feel free to reach out and apply today! Key Responsibilities: Oversee and maintain a low-latency, global network infrastructure Provide high-level support and proactive monitoring of mission-critical systems Take ownership of network improvement projects from design through to deployment Collaborate closely with cross-functional teams to ensure security, resilience, and efficiency More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Senior PostgreSQL SRE

Knutsford, England, United Kingdom
Barclays UK
practices in incident response, to ensure the reliability, availability, and scalability of the systems, platforms, and technology through them. Accountabilities Availability, performance, and scalability of systems and services through proactive monitoring, maintenance, and capacity planning. Resolution, analysis and response to system outages and disruptions, and implement measures to prevent similar incidents from recurring. Development of tools and scripts … to automate operational processes, reducing manual workload, increasing efficiency, and improving system resilience. Monitoring and optimisation of system performance and resource usage, identify and address bottlenecks, and implement best practices for performance tuning. Collaboration with development teams to integrate best practices for reliability, scalability, and performance into the software development lifecycle, and work closely with other teams to ensure More ❯
Posted:

PostgreSQL SRE

Glasgow, Scotland, United Kingdom
Barclays Business Banking
based in Knutsford or Glasgow. Purpose: Apply software engineering techniques, automation, and best practices to ensure system reliability, availability, and scalability. Responsibilities: Ensure system performance, scalability, and availability through proactive monitoring and capacity planning. Respond to system outages, analyze issues, and implement preventive measures. Develop automation tools and scripts to improve operational efficiency and system resilience. Monitor and More ❯
Posted:

Release Triage Engineer

London, England, United Kingdom
Hybrid / WFH Options
Capgemini Engineering
the same. About The Job You’re Considering We are looking for a Release Triage Engineer to ensure timely and high-quality software releases across various devices by proactively monitoring metrics and alerts. Hybrid working: The places that you work from day to day will vary according to your role, your needs, and those of the business; it will … going back 3 continuous years, and unspent criminal record check (known as Disclosure and Barring Service) Your role Ensuring timely and high-quality software releases across various devices using proactive monitoring of metrics and alerting Troubleshoot problems using Linux systems and networking protocols, including packet capture analysis Diagnose and resolve issues in broadband devices, covering various components like … lead and mentoring a small team of engineers 3 + Experience with log management and analysis tools such as Elastic Stack (ELK), Splunk, and Grafana for data visualization and monitoring One of Scripting language experience - BASH, Go, Python Nice To Have Embedded Systems experience Experience AWS, Asure or other cloud infrastructure Experience with monitoring systems (i.e. Datadog, CloudWatch More ❯
Posted:

Database Engineer

Knutsford, England, United Kingdom
Barclays
practices in incident response, to ensure the reliability, availability, and scalability of the systems, platforms, and technology through them. Accountabilities Availability, performance, and scalability of systems and services through proactive monitoring, maintenance, and capacity planning. Resolution, analysis and response to system outages and disruptions, and implement measures to prevent similar incidents from recurring. Development of tools and scripts … to automate operational processes, reducing manual workload, increasing efficiency, and improving system resilience. Monitoring and optimisation of system performance and resource usage, identify and address bottlenecks, and implement best practices for performance tuning. Collaboration with development teams to integrate best practices for reliability, scalability, and performance into the software development lifecycle, and work closely with other teams to ensure More ❯
Posted:

Service Delivery Manager

Birmingham, England, United Kingdom
Mitel
all future applications, and you will be notified whenever a new position that matches your criteria becomes available. Overview Responsibilities: Ensure contractual service levels are met or exceeded, with proactive monitoring and issue management. Lead regular service reviews, providing performance reporting and driving actions for service improvement. Act as the primary operational contact for key customer accounts, ensuring … Understanding the financial aspects of contract management, including cost management, budget forecasting, and profitability analysis. Communicate effectively with customers, internal teams, and stakeholders through clear reporting, service reviews, and proactive dialogue. Growth Mindset - Demonstrate a growth mindset by embracing feedback, seeking continuous improvement, and proactively developing new skills to adapt to evolving customer and business needs. Full UK driving More ❯
Posted:

Service Delivery Manager

Glasgow, Scotland, United Kingdom
Mitel
retention and value realisation. The role ensures customer satisfaction, SLA adherence, and alignment between service operations and business needs. Responsibilities Ensure contractual service levels are met or exceeded, with proactive monitoring and issue management. Lead regular service reviews, providing performance reporting and driving actions for service improvement. Act as the primary operational contact for key customer accounts, ensuring … Understanding the financial aspects of contract management, including cost management, budget forecasting, and profitability analysis. Communicate effectively with customers, internal teams, and stakeholders through clear reporting, service reviews, and proactive dialogue. Growth Mindset - Demonstrate a growth mindset by embracing feedback, seeking continuous improvement, and proactively developing new skills to adapt to evolving customer and business needs. Full UK driving More ❯
Posted:

Manager, SRE

London, England, United Kingdom
WPP Media
and conduct regular performance reviews. Define and implement reliability standards. Develop and improve incident management processes in alignment with engineering support, ensuring effective resolution and root cause analysis. Drive proactive monitoring, alerting, and automation to minimize downtime and improve system reliability. Lead efforts to eliminate single points of failure. Collaborate with DevOps practice to ensure best practices to … related roles, with some experience in a mentorship or managerial position. Strong knowledge of cloud platforms (AWS, GCP, Azure) and modern infrastructure technologies (Kubernetes, Docker, Terraform). Expertise in monitoring, logging, and observability tools (e.g., Prometheus, Grafana, Datadog, Splunk). Proficiency in at least one programming or scripting language (e.g., Python, Go, Bash). Deep understanding of networking, databases More ❯
Posted:

Technical Support Manager

London, England, United Kingdom
Hybrid / WFH Options
Littlepay
and platform, working closely with Product, Engineering and Project teams to diagnose and resolve customer and partner issues Lead cross-functional initiatives to optimize support processes and tools and proactive monitoring to improve efficiency and reliability Collaborate and advocate with Product and Engineering to proactively improve the platform based on user feedback Create and refine documentation and our … creation of knowledge base articles and FAQs Engage with users directly to gather information and provide clear, concise explanations Work with Engineering to design, implement, and manage a comprehensive monitoring framework to proactively identify and address potential issues Drive initiatives and implement strategies to ensure consistently high levels of customer satisfaction, exceeding service level agreements (SLAs) and fostering positive More ❯
Posted:

Lead Software Engineer - ServiceNow

London, England, United Kingdom
ZipRecruiter
CSS, Angular). Experience with system integrations between ServiceNow and other applications using REST API. Strong analysis, problem-solving, and decision-making skills. Ability to debug issues and implement proactive monitoring for platform stability. Experience mentoring junior developers and fostering a collaborative team environment. Excellent communication skills to clarify requirements with stakeholders and collaborate effectively. Additional qualifications Experience More ❯
Posted:

Account Executive - Programmatic

London, England, United Kingdom
Hybrid / WFH Options
Genius Sports Group
and external note taking and presenting information to clients as directed. Prepare for all internal client meetings with relevant activation updates across assigned accounts. Pacing, Performance Reporting & Delivery management - Proactive monitoring of performance. Work with Customer Success & Ad Ops to monitor and improve performance over time. Responsible for performance and pacing updates in internal and external client account … Experience of Demand Side Platforms and Ad Serving technologies Excellent communication and organisational skills Strong analytical skills and uncompromising attention to detail Ability to clearly prioritize work in a proactive manner and remain flexible in a changing environment A keen interest in digital media and ad tech. This role is eligible for hybrid working - 3 days in the office More ❯
Posted:

Senior Full Stack Developer

Bromsgrove, England, United Kingdom
AFH Financial Group
define project requirements and deliver innovative software solutions. Working with developers to create automations to support source code checking with continuous integration, automated test tool configuration, automated deployment and proactive environment monitoring & management. Building and implementing new development tools and instilling best practice around ways of working. What we are looking for in our ideal Senior Software Developer More ❯
Posted:

Senior Software Engineer, Fixed term for 12 months

London, United Kingdom
T. Rowe Price
of the Data and Analytics platform Unit testing, integration testing. Contributing to maintaining and constantly improving the CI/CD pipeline Code reviews, design reviews Code instrumentation, setting up proactive monitoring dashboards Liaising with users and subject matter experts in order to gather requirements, analyse solutions and triage feedback and incidents Understanding business requirements and translating them into More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Cloud Systems Engineer

London, England, United Kingdom
Hybrid / WFH Options
BASE Media Cloud Limited
tools Support & Service Operations: Provide 2nd and 3rd line engineering support in collaboration with internal teams and vendor partners Analyze infrastructure and system logs, troubleshoot technical issues, and implement proactive monitoring strategies Contribute to service documentation, including architecture diagrams and technical runbooks Collaboration & Innovation: Assist software developers with CI/CD environments and application deployment Act as a More ❯
Posted:
Proactive Monitoring
10th Percentile
£37,170
25th Percentile
£42,500
Median
£53,594
75th Percentile
£67,500
90th Percentile
£81,125