responsible for maintaining high system availability, enabling rapid delivery through CI/CD, and supporting development teams with robust infrastructure and tooling. A key part of the role includes proactivemonitoring using Prometheus, Grafana, and Splunk, as well as participating in on-call rotations to respond to live incidents. Collaboration across engineering, security, and product teams is essential … recurring issues, and work on long-term fixes to improve system stability. 4. Implement and maintain observability solutions using Prometheus, Grafana, and Splunk. 5. Write PromQL queries for custom monitoring dashboards, alerting, and diagnostics. 6. Manage and optimize CI/CD pipelines for automated testing, deployment, and rollback strategies. 7. Develop and maintain automation scripts in Python, Bash, Go More ❯
and problem-solving abilities, with a hands-on approach to implementation. Communication Prowess: Excellent verbal and written communication skills, with the ability to articulate complex ideas clearly and concisely. Proactive & Adaptable: A proactive and resourceful individual with a strong ability to work independently, lead a team, and adapt quickly to changing priorities. Language Skills: Fluent in English (C1 … continuous optimization. Strategic Direction: Define clear, measurable product goals that align with overarching business objectives and drive revenue growth. Market & User Expert: Conduct comprehensive user and market research, proactively monitoring industry trends and competitive landscapes, with a specific focus on the dynamic gambling and entertainment sectors. Data-Driven Insights: Translate user feedback and data analytics into actionable product improvements More ❯
London, England, United Kingdom Hybrid / WFH Options
JR United Kingdom
to current demand and your professional development needs. Typical projects include: Designing performance test plans to address performance risk Demand forecasting/planning Development of capacity and performance models Proactive system monitoring to identify performance risks You will report to your line manager and your career development will be supported by the Talent department. A day in the … or Python SQL Experience with RDMS systems e.g. MySQL, Microsoft SQL Server, Oracle Technical tools you may use as a Junior Consultant Cost-explorer Cloudwatch APIs (further performance metrics) Monitoring tools for CPU PRTG MatLab GitHub J-Meter Gatling Scripting and coding software The things our Junior Consultants most like about their job Lots of responsibility Working with people More ❯
London, England, United Kingdom Hybrid / WFH Options
Magentus Group
infrastructure as code, observability, and DevOps methodologies. Key Responsibilities of the role: Platform Development & Automation Design, develop, and maintain cloud-native infrastructure and platform services. Automate provisioning, scaling, and monitoring of infrastructure and application workloads. Develop CI/CD pipelines to enhance deployment efficiency and reliability. Implement Infrastructure as Code (IaC) using tools such as CDK, Terraform or CloudFormation. … Reliability & Security Ensure platform reliability, scalability, and security through best practices and proactive monitoring. Implement observability solutions including logging, metrics, and distributed tracing. Support incident response and post-mortem analysis, driving continuous improvements. Collaborate with security teams to ensure compliance with security and regulatory requirements. Collaboration & Technical Leadership Work closely with software engineers to enable self-service capabilities and … Bash, etc.). Understanding of networking, security principles, and best practices. Knowledge of observability tools such as Datadog, Prometheus, Grafana, etc. Desired Attributes Strong problem-solving skills with a proactive approach to improving systems and processes. Excellent communication and collaboration skills, able to work effectively with cross-functional teams. Ability to work autonomously and take ownership of platform initiatives. More ❯
security updates. Identify opportunities to drive business value through enhanced productivity and collaboration by understanding colleague needs and wider collaboration technology landscape. Ensure product reliability, performance, and security through proactivemonitoring and incident management, leveraging observability tooling to increase issue prevention and proactive support. Use data-led insights and continuous improvement to increase Engineering capacity by driving More ❯
London, England, United Kingdom Hybrid / WFH Options
Menlo Ventures
/Trainium workloads. Runtime Platform: We build and maintain the infrastructure that monitors the health, performance, and efficiency of our AI systems. You'll work across teams to implement monitoring solutions using tools like Prometheus, Splunk, and Grafana, while developing automated approaches for dashboards and alerts. Your work will create reliable, low-maintenance systems that enable proactivemonitoringMore ❯
analyze and resolve mechanical and system issues across the MSP product suite. Design and implement electro-mechanical tools and fixtures to support deployed and new products. Develop and implement proactivemonitoring tools to measure the safety, reliability, and performance of advanced robotics systems. Develop and implement diagnostic tools to minimize troubleshooting and machine downtime. Support hardware and systems More ❯
Excellence & Risk Management Identify and mitigate technical risks and obstacles, ensuring that the product development pipeline runs smoothly and efficiently. Ensure the security, performance, and scalability of products through proactivemonitoring and improvements. Ensure adherence to high coding standards, software quality, and efficient development processes, including continuous integration and deployment practices Implement key performance indicators (KPIs) to track … and collaboration, we drive innovation and power global markets. S&P Global has a Securities Disclosure and Trading Policy ("the Policy") that seeks to mitigate conflicts of interest by monitoring and placing restrictions on personal securities holding and trading. The Policy is designed to promote compliance with global regulations. In some Divisions, pursuant to the Policy's requirements, candidates More ❯
technical issues reported by customers/internal teams. Triage, analyze, and resolve issues across the full technology stack. Collaborate with engineering and product teams to ensure timely resolution. Observability & Monitoring Set up and manage observability tools, with a focus on Dynatrace. Create and maintain dashboards, alerts, and logs for proactive monitoring. Continuously refine observability practices to improve system More ❯
and resolve mechanical and system issues across the MSP product suite. - Design and implement electro-mechanical tools and fixtures to support deployed and new products . - Develop and implement proactivemonitoring tools to measure the safety, reliability, and performance of advanced robotics systems. - Develop and implement diagnostic tools to minimize troubleshooting and machine downtime. - Support hardware and systems More ❯
team processes Write clean, efficient, and well-documented Java code Contribute to all phases of the development lifecycle—from planning to deployment Ensure system reliability, scalability, and performance through proactivemonitoring and optimisation You'll be a great fit if: 10+ years of experience in Java development, with at least 3 years in a senior or principal role. More ❯
Excellence & Risk Management Identify and mitigate technical risks and obstacles, ensuring that the product development pipeline runs smoothly and efficiently. Ensure the security, performance, and scalability of products through proactivemonitoring and improvements. Ensure adherence to high coding standards, software quality, and efficient development processes, including continuous integration and deployment practices Implement key performance indicators (KPIs) to track More ❯
Responsibilities Functional: Planning and delivering large/complex projects and the ability to manage multiple projects and work-streams at any one time Drive project execution; tracking delivery expecting, monitoring and controlling change with a pro-active approach to seek and resolve blockers. Engaging with stakeholders and business areas to identify business requirements and opportunities and the ability to … transition this into a delivery framework. Engaging with sponsors to develop specifications, requirements, deliverables and key dependencies. Planning and designing the projects and portfolio proactively monitoring its progress, resolving issues and initiating appropriate corrective action. Ensuring effective quality assurance and the overall integrity of the project and portfolio – conduct progress reviews as required Contribute to the evolution of the … whether as document author, reviewer, or user Operate within agreed authority levels, as determined by the Board Skills, Knowledge & Experience Experienced with a strong knowledge of techniques for planning, monitoring and controlling of projects and programmes Effective leadership, interpersonal and communication skill. Ability to understand problem statements and evaluate prioritised options and activities. Ability to resolve conflicting demands and More ❯
London, England, United Kingdom Hybrid / WFH Options
Drata
app development. Familiarity with UNIX command-line tools. Ability to analyze server logs and database operations. Strong problem-solving and analytical skills. Experience handling escalations and leading resolution efforts. Proactive issue identification skills. Knowledge of APIs, integrations, and automation tools is a plus. Excellent written and verbal communication skills. Customer-focused attitude and ability to work independently and in More ❯
and roles Continuously improves work processes and seeks efficiencies Adapts well to change and stays positive during setbacks Goes above and beyond to meet or exceed targets Demonstrates a proactive, committed approach to work and objectives Note: This role requires on-site presence and may involve travel to project sites as necessary. Flexibility in working hours may be required More ❯
and roles Continuously improves work processes and seeks efficiencies Adapts well to change and stays positive during setbacks Goes above and beyond to meet or exceed targets Demonstrates a proactive, committed approach to work and objectives Note: This role requires on-site presence and may involve travel to project sites as necessary. Flexibility in working hours may be required More ❯
cascading information from the CS Head and other departments. Your responsibilities extend to recruitment and training of new joiners, ensuring proper onboarding, setup, and access to necessary tools. A proactive approach to managing workloads to ensure team KPIs are met is crucial to success in this role. Essential: Leadership and team management - Ability to motivate, guide, and manage a More ❯
London, England, United Kingdom Hybrid / WFH Options
Careers at Drata
technical issues, leading the resolution process as a SWAT leader, as appropriate from lower tiered Technical Support teammates Proactively monitor error logs and performance metrics for large customers, providing proactive support to prevent potential issues. Collaborate with internal teams to explore opportunities for evolving our internal tech stack, including APIs, integrations, and automation. Document and track customer issues and More ❯
and roles Continuously improves work processes and seeks efficiencies Adapts well to change and stays positive during setbacks Goes above and beyond to meet or exceed targets Demonstrates a proactive, committed approach to work and objectives Note: This role requires on-site presence and may involve travel to project sites as necessary. Flexibility in working hours may be required More ❯
London, England, United Kingdom Hybrid / WFH Options
Financial Ombudsman Services
challenges that will be faced Leading and motivating your team to deliver the objectives you set for them, you’ll be accountable for service, quality and performance standards – proactively monitoring and managing performance and addressing issues effectively Having demonstrable experience working in a senior or lead position in an infrastructure team Being able to plan and organise what’s More ❯
interpersonal skills Outstanding organizational, problem solving, and multi-tasking skills Team-player and positive attitude, eager to do more and help out, views challenging situations as opportunities Self-motivated, proactive, and able to work with minimal supervision Passion and commitment to succeed by working hard and never giving up Can-do approach, Problem solving attitude Ability to Influence without More ❯
London, England, United Kingdom Hybrid / WFH Options
Funding Circle UK
Support the implementation and adoption of secure software development lifecycle (SSDLC) practices and secure coding standards across engineering teams through collaboration, training, and tooling. Assist with configuring, managing and monitoring automated security controls, tooling, and "security rails" within CI/CD pipelines to ensure secure and efficient deployments. Collaborate closely with Cloud Platform Engineers, DevX and Product Engineering to More ❯
London, England, United Kingdom Hybrid / WFH Options
Assent Inc
Assent is the leading solution for supply chain sustainability tailored for the world's top-tier, sustainability-driven manufacturers. Hidden risks riddle supply chains, many of which weren't built with sustainability in mind. That's where we step in. More ❯
Department Description: Oracle's Cloud Data Center Operations team is building and supporting new Infrastructure-as-a-Service technologies that operate at high scale in a broadly distributed multi-tenant cloud environment. Our customers run their businesses on our cloud More ❯
real-time threat detection strategies, and ensure smooth implementation of Hexagate's solutions. This role demands technical problem-solving skills, strong blockchain security knowledge, client-facing abilities, and a proactive approach to ensuring customer success. In this role, you’ll : Partner with Account Executives and Inside Sales reps to run discovery calls, security risk reviews, and product demos Help … prospects evaluate Hexagate’s capabilities, align solutions with their risk profile, and build proof-of-concepts Design and deploy customized real-time monitoring solutions for smart contracts, transactions, and blockchain entities Lead technical onboarding and solution deployment, including custom monitor rules, API integrations, and alerting workflows Assist with integrations into customer pipelines and tools Provide ongoing advisory on threat … in a technical role such as Solutions Engineer, Security Engineer, Sales Engineer, or Blockchain Analyst Strong knowledge of Web3 security, including smart contract vulnerabilities, DeFi risks, and blockchain transaction monitoring Hands-on programming experience and APIs for data integrations and monitoring configurations Ability to read Solidity and analyze smart contract interactions (some coding experience is a requirement) Excellent More ❯