love to hear from you. Good knowledge of Azure Cloud hosting technologies Experience with PostgreSQL databases (including PostGIS spatial extension) Good understanding of SiteReliabilityEngineering (SRE) and software engineering best ractices Experience investigating the root cause of failures to understand why they have occurred and propose/enact solutions, and work with external suppliers if … anyway. You might be the ideal candidate for this or other roles within Ordnance Survey. Skills required for this role: Information Technology (IT) Systems, Infrastructure Services, Process Improvements, Software Engineering, Technical Consulting, Waterfall Model Support Engineer (Fulltime) - Salary - £41,892.00 to £48,874.00 (dependent on experience) - Hybrid working Join Our Dynamic API and Data Distribution Team! Are you passionate More ❯
faceted and complex environment. This ICG-Tech business aligned tech resource will collaborate with ICG-Markets the business, CTI Enterprise Infrastructure, and CISO Security teams driving technical architecture and engineering of CTB and End of Vendor Support (EoVS) initiatives where we choose to innovate. Role Overview/What will you do: Reporting to the head of HPAi, the Systems … Architect will be responsible for ensuring best in breed technology architecture and engineering are in alignment with business requirements. This will be done in collaboration with CISO and CTI Compute resources aligned to HPAi who represent Enterprise and CISO priorities and standards. Active collaboration with all parties is required to be successful. The resource should be able to effectively … as an example. Kernel-bypass technology knowledge (such as Solarflare OpenOnload/TCPDirect/ef_vi, RoCE, DPDK) Prior experience in a similar SiteReliabilityEngineering (SRE), DevOps, distributed computing, systems engineering/administration Ability to context switch to work on several projects at once Knowledge of JIRA, SCRUM, Sprints This job description provides a high More ❯
you have demonstrated your capability as a Support Engineer you will be able to choose the path you want to take into Software Engineering, either product development or SRE (sitereliabilityengineering). This makes the role ideal for someone looking to take their first steps in a software engineering career. The role Part of … cares about customer experience, dealing with other Cronofy team members across the business, and ensures that customers have positive experiences with Cronofy. This role will also work alongside the Engineering team on a range of product development projects that will, in the long term, aid the ability to support customers on technical product-related issues. This role reports to … First class documentation In-depth understanding of Cronofy's API and Scheduler tool to provide a high level of customer support, resolve issues and find practical solutions Engaging with Engineering/Development team to provide appropriate technical solutions, often through the full development lifecycle Working on development projects as part of the Product Engineering team Requirements: Experience with More ❯
software-defined networking principles. Embed zero-trust principles and user-centric design into all remote connectivity services. Align remote connectivity architecture with broader enterprise network, security, and cloud strategies. Engineering & Operations: Lead the engineering, deployment, and lifecycle management of remote access solutions such as Cisco AnyConnect, Zscaler, and other mainstream VPN … platforms. Drive automation of remote access provisioning, policy enforcement, and configuration management through Infrastructure as Code (IaC) and zero-touch deployment practices. Apply SiteReliabilityEngineering (SRE) principles to improve performance, availability, and troubleshooting. Establish observability practices across all access points with real-time metrics, logs, and telemetry. Security, Compliance & Governance: Ensure compliance with corporate security and … segmentation, and endpoint-based access control. Proven ability to scale remote connectivity solutions to tens of thousands of users and devices. Experience with IaC, network automation, observability tooling, and SRE methodologies. Preferred Qualifications: Certifications such as CCNP, CCIE, PCNSE, Zscaler Certified, or equivalent. Familiarity with secure hybrid work and cloud networking models. Background in network performance optimization, user-centric design More ❯
occasional travel to Scotland Employment Type: 6 month Contract Rate: £550 per day, Outside of IR35 Role Overview Morgan Hunt are seeking an experienced SiteReliability Engineer (SRE)/Unix Infrastructure Engineer to support the deployment, migration, and optimisation of critical infrastructure services. The role involves ensuring high availability, disaster recovery readiness, and automation-driven improvements across RHEL More ❯
occasional travel to Scotland Employment Type: 6 month Contract Rate: £550 per day, Outside of IR35 Role Overview Morgan Hunt are seeking an experienced SiteReliability Engineer (SRE)/Unix Infrastructure Engineer to support the deployment, migration, and optimisation of critical infrastructure services. The role involves ensuring high availability, disaster recovery readiness, and automation-driven improvements across RHEL More ❯
faceted and complex environment. This ICG-Tech "business aligned tech" resource will collaborate with ICG-Markets "the business", CTI "Enterprise Infrastructure", and CISO "Security" teams driving technical architecture and engineering of CTB and End of Vendor Support (EoVS) initiatives where we choose to innovate. Role Overview/What will you do: Reporting to the head of HPAi, the Systems … Architect will be responsible for ensuring best in breed technology architecture and engineering are in alignment with business requirements. This will be done in collaboration with CISO and CTI Compute resources aligned to HPAi who represent Enterprise and CISO priorities and standards. Active collaboration with all parties is required to be successful. The resource should be able to effectively … as an example. Kernel-bypass technology knowledge (such as Solarflare OpenOnload/TCPDirect/ef_vi, RoCE, DPDK) Prior experience in a similar SiteReliabilityEngineering (SRE), DevOps, distributed computing, systems engineering/administration Ability to context switch to work on several projects at once Knowledge of JIRA, SCRUM, Sprints This job description provides a high More ❯
hub right in the heart of the city - with real backing, big ambition and the support of an established, award-winning company. You'll be part of a new engineering chapter in Manchester, helping shape how we grow and build the team from day one, and create software and platforms with a purpose: to make finance fairer and more … to date Designing scalable, self-service tools to reduce operational toil Supporting infrastructure observability through metrics, tracing, and alerting Working closely with product teams to foster a culture of reliabilityengineering About you: 4+ years in a Platform/SiteReliabilityEngineering or similar role Solid Linux admin skills (we use FlatcarOS & Amazon Linux) Experience More ❯
team of passionate thinkers, innovators, and dreamers - and help us connect people and build communities to create economic opportunity for all. About the team and the role: As a SiteReliability Engineer at eBay, you'll play a key role in managing major incidents and the overall health of our services, making sure they are both resilient and … high-performing. You'll create strategies for availability and reliability, enhance domain ecosystem observability, and support a shift toward a more engineering-focused culture. Your contributions will ensure that eBay's technology remains cutting-edge and reliable for our global community. What you will accomplish: Proactive Monitoring : Continuously monitor the health of eBay's critical services to identify … and address potential issues before they escalate. Solution Development : Collaborate with Architecture, Engineering, and Operations teams to develop solutions that ensure high site availability, reliability and performance. Collaborative Problem Solving : Work closely with partner teams to resolve recurring technical issues, onboard new alerts, and develop high-quality Standard Operating Procedures (SOPs). Automation and Process Enhancement : Identify More ❯
Mission We unleash the potential of organisations through the science of board effectiveness, building better businesses and benefiting society. The Opportunity As a Senior SiteReliability Engineer (SRE), you'll be joining a team whose mission is to ensure the availability, performance, security and reliability of our platform and core services, ensuring that they meet the needs … be responsible for visibility and monitoring of those systems, for building tooling and automation to reduce TOIL and for responding to incidents as part of our 24/7 SRE on-call team. The SRE team: Strives to provide the highest standards of Availability, Scalability, Performance and Security for our Software as a Service environments across multiple cloud vendors and … work Proactively monitors our platform and responds to incidents as part of a 24/7 rota Key responsibilities of the role We're looking for a great Senior SRE to be a hands on individual contributor to key technical projects and to help us build a first-class SRE function. This role will involve: Hands on work with technical More ❯
in high-impact delivery teams that support some of the worlds most well-known organisations. Youll play a key role in helping our customers achieve greater visibility, performance, and reliability across their IT estatescontributing to their operational success through proactive insight and incident prevention. What you'll do Design, implement, and manage observability solutions using industry-leading tools such … mindset with a passion for continuous improvement and knowledge sharing Certifications Dynatrace Associate & Pro Splunk Core Certified Power User Desirable Experience DevOps or SiteReliabilityEngineering (SRE) experience Automation with Terraform or similar tools Building CI/CD pipelines Experience with Docker and Kubernetes for packaging and deployment Ability to adapt to new technologies in fast-paced More ❯
impact delivery teams that support some of the world's most well-known organisations. You'll play a key role in helping our customers achieve greater visibility, performance, and reliability across their IT estates-contributing to their operational success through proactive insight and incident prevention. What you'll do Design, implement, and manage observability solutions using industry-leading tools … A proactive mindset with a passion for continuous improvement and knowledge sharing Certifications Dynatrace Associate & Pro Splunk Core Certified Power User DevOps or SiteReliabilityEngineering (SRE) experience Automation with Terraform or similar tools Experience with Docker and Kubernetes for packaging and deployment Ability to adapt to new technologies in fast-paced environments Please note that all More ❯
impact delivery teams that support some of the world's most well-known organisations. You'll play a key role in helping our customers achieve greater visibility, performance, and reliability across their IT estates-contributing to their operational success through proactive insight and incident prevention. What you'll do Design, implement, and manage observability solutions using industry-leading tools … A proactive mindset with a passion for continuous improvement and knowledge sharing Certifications Dynatrace Associate & Pro Splunk Core Certified Power User DevOps or SiteReliabilityEngineering (SRE) experience Automation with Terraform or similar tools Experience with Docker and Kubernetes for packaging and deployment Ability to adapt to new technologies in fast-paced environments Please note that all More ❯
become the UK's most loved retirement expert. Purpose As a Senior Application Support Engineer, you will play a crucial role in powering our Retail applications by partnering with engineering and business teams to build deep technical and business expertise. You'll be the go-to expert across a diverse, modern, and complex technology landscape, ensuring seamless support and … with a broad range of technologies, including: Practical experience with performance monitoring tools such as Dynatrace or equivalent. Skills & Knowledge Solid understanding of SiteReliabilityEngineering (SRE) principles, including incident management, monitoring, alerting, and performance tuning. Strong knowledge of Software Development Lifecycle (SDLC) processes. Familiarity with incident management platforms like ServiceNow, PagerDuty, or similar tools. Excellent analytical … e.g., annuities, equity release) is advantageous. Experience with automation and scripting to improve manual processes (e.g., PowerShell, Bash). Familiarity with agile methodologies and experience working in DevOps/SRE-driven environments. Company Benefits A Competitive Salary, Pension Scheme and Life Assurance Along with 25 Days Annual Leave plus an Additional Day on us for your Birthday Private Medical Cover More ❯
the opportunity for employees with a range of experience to develop their skills while shaping our platform and making meaningful contributions to the firm. Role: Development Operations Engineer DevOps Engineering requires engagement across development, product and operational teams to ensure Droit's enterprise software products are successfully built, tested and supported in production. Successful candidates will demonstrate skills in … build engineering, sitereliabilityengineering, and client support. At Droit, DevOps is involved in the full lifecycle of product development, from inception to deployment, optimization, and daily operation. As a DevOps Engineer, you will implement solutions that increase the velocity of product delivery, while identifying and resolving problems before they become production issues. As a member More ❯
the future? Let's talk. About the role The Production Support Analyst is responsible for triaging and resolving incidents and defects in production environments. This role ensures the stability, reliability, and performance of business-critical systems by providing timely support, collaborating with engineering and product teams, and following established defect management processes. This position has a strong focus … the defect backlog, ensuring all required information is documented (title, summary, classification, impacted journey, replication steps, screenshots, error messages, etc.). Collaborate with SiteReliabilityEngineering (SRE), scrum teams, and product teams to resolve production defects. Prioritize and manage defects according to severity (Critical, High, Medium, Low). Allocate and track sprint capacity for production defect resolution … technical and non-technical audiences. Excellent communication and collaboration skills. Ability to work under pressure and manage multiple priorities. Desirable Experience working in Agile/Scrum environments. Familiarity with SRE practices and production maintenance planning. Essential Experience with financial services, investment platforms, or enterprise software environments. Prior experience in a TA support, middle-office, or back-office technology role. Private More ❯
Support teams to input into business reviews Be a visionary Ops champion for our internal teams Skills Bachelor's or Master's degree in a STEM field (Computer Science, Engineering, Mathematics, etc.) or equivalent experience Demonstrable experience in product management or product operations Strong product and technical background with proven ability to communicate effectively with engineers and technical team … management best practices-user research, market insights, goal setting, prioritisation, execution, and leadership Familiarity with monitoring tools, incident management protocols, and collaboration with SiteReliabilityEngineering (SRE) teams Proven ability to develop relationships and align teams across product, engineering, and leadership to ensure the effective execution of strategic priorities Hands-on experience analysing workflows and implementing … of improvement, develop solutions, and inspire change with autonomy The Interview Process Online interview with the Senior Talent Partner In-person interview with the Director of Product Operations and Engineering team member Online interview with Director of Product Operations and CPO At Reward Gateway Edenred, we are committed to ensuring an inclusive and accessible recruitment process for all candidates. More ❯
MHP - Mieschke Hofmann und Partner Gesellschaft für Management- und IT-Beratung mbH
innovation in onboarding, deployment, and operations. DevSecOps Enablement & Team Leadership: You will build and steer high-performing DevSecOps teams across international locations, implementing SiteReliabilityEngineering (SRE) practices and promoting skill development and tool standardization. Technical Project Leadership: You will act as a technical lead in customer-facing Azure projects-translating business requirements into architecture and execution More ❯
innovation in onboarding, deployment, and operations. DevSecOps Enablement & Team Leadership: You will build and steer high-performing DevSecOps teams across international locations, implementing SiteReliabilityEngineering (SRE) practices and promoting skill development and tool standardization. Technical Project Leadership: You will act as a technical lead in customer-facing Azure projects-translating business requirements into architecture and execution More ❯
delivery of automation and improvements across the function. Essential Skillset/Experience: A strong background in Enterprise Technology including infrastructure, applications Understanding of sitereliabilityengineering (SRE), cloud and SDLC/devops concepts Understanding of risk and control frameworks Strong experience structuring and running workshops to elicit use cases, requirements etc Experience of reporting solutions Have a … Portfolio/Project Management frameworks & methodologies eg Waterfall/Agile and associated best practice techniques Experience of continuous improvement Strong Excel, PowerPoint, confluence and Jira skills Competent on SharePoint site design and administration Excellent stakeholder management capabilities and communication skills Experience reporting to and working with senior management - able to summarise findings and recommendations based on detail. Pro-active More ❯
of the development teams Configure Kubernetes environments to enable scalable, resilient, and high-performance application deployments Main Skills/Requirements: Over 5 years of hands-on experience in Platform Engineering, SiteReliabilityEngineering, Platform Integration, or related technical domains Proven track record in architecting and implementing CI/CD pipelines using tools like Jenkins, GitLab CI More ❯
looking for experienced SREs help grow our small team into a global footprint that can provide expert engagement across our core serving systems. As an early member of the SRE team you will report directly to the Director of Managed Infrastructure and play a foundational role in expanding our SRE practice, integrating reliability principles more deeply into Vercel's … Devise repeatable, low-toil operational practices through the development of automated systems for software delivery, system failover, and capacity management. About You: At least 3 years experience in an SRE role, or at least 5 years experience in an adjacent role (e.g. platform engineering), operating in a scaled environment. Firm grasp of the SRE philosophy and mindset, with practical … experience working on or directly with SRE teams that have proactively engaged in system design and improvement. Strong sense of accountability and commitment to problem solving, backed by a curiosity to dig deep and identify root causes. Willingness to proactively engage with development teams to influence the course of software design and operational practices. Capability to manage risk, make decisions More ❯
Barclays Payments is hiring a Director, Head of Tech Operations Lead . The role holder will: Lead the SiteReliabilityEngineering (SRE) function to ensure platform stability, scalability, and performance. Own and evolve the major incident management process, including escalation, resolution, and post-mortem analysis and follow-up to closure. Drive resilience and disaster recovery strategy, ensuring More ❯
design, implementation, testing, to exposing features to end users. Collaborate with multiple teams inside Google (e.g., Product Management, User Experience, Identity, Product Security, Customer Support, feature development teams, Database Engineering, SiteReliabilityEngineering). Contribute to integrating the software stack with Google Production systems and partner subscription lifecycle management solutions. Contribute to documentation and educational content More ❯
tools/systems in a really fast-paced dynamic team and environment. The key values that we strive towards in Technology Ownership, Continuous Improvement Mindset, Trust and Honesty. Delivery Engineering Team Delivery Engineering is an internal platform team focused on building and improving systems to accelerate the delivery of software and infrastructure. Our team's first target will … be enabling DevOps practices due to its direct impact on the overall processes and culture. The primary goal of Delivery Engineering is to build the "Golden Path" to production by automating and concentrating improvements around bottlenecks such as Integration points and handoffs. We will work closely with Development teams to break down silos, helping to build more resilient pipelines … equipping ourselves with the right tools like operating procedures (SOPs) and architectural diagrams Improve the team skill set by increasing and sharing knowledge, thus setting the quality and precision engineering benchmark Understand the importance of and be a strong advocate for non-functionals eg. monitoring, alerting, logging Required Skills and Experience 3+ years of experience in DevOps oriented roles More ❯