are working with a leading global travel company dedicated to providing exceptional experiences for travelers around the world. The SiteReliability Engineering (SRE) team plays a critical role in ensuring the reliability, performance, and scalability of their systems, enabling them to deliver best-in-class services to … skills and the ability to mentor team members. Passion for learning and staying updated on industry best practices. Nice to Haves: Experience as a SiteReliabilityEngineer or with high-availability systems. Background in production infrastructure and troubleshooting distributed systems. Familiarity with mobile development and distributed computing. more »
SiteReliabilityEngineer – High Frequency Trading – London - £120-150k base + bonus + share options It’s a very cool time to join this high frequency trading firm. They are past the shaky uncertainty phase of a start-up. The core blocks have been built. They more »
in upskilling, learning new tech Deeply curious, creative, and innovative Flexible in working hours/ability to collaborate in different time zones The Lead SiteReliabilityEngineer has a pivotal role at the forefront of our engineering operations, responsible for guiding the Platform Team toward achieving exceptional … standards of reliability, performance, and stability across all our applications. The successful candidate will possess deep expertise in these core areas and will be instrumental in defining and implementing industry-leading practices. As a key leader, this role will not only shape … the strategic direction of our platform operations but also establish the benchmarks and processes by which our engineering excellence is measured. Responsibilities Lead the SRE Team, setting clear goals and priorities in line with business objectives. In collaboration with the department Director develop and execute strategies that enhance technological capabilities more »
roles where you can make a significant impact on the availability, performance, and efficiency of critical services? If you've previously excelled in an SRE or similar operations environment and are looking for your next challenge, we want to hear from you! These opportunities require you to work one day … Role Overview: As part of our clients dedicated Mortgages team, you'll be instrumental in working within a new SiteReliability Engineering (SRE) Function, focusing on enhancing system reliability across key areas such as availability, performance, latency, efficiency, capability, and incident response. This role is crucial as … manage risks effectively. What They're Looking For: Proven experience in software engineering with a strong background in Java or C#. Experience in an SRE function or similar operations environment, excluding purely DevOps, infrastructure, or deployment analyst roles. Familiarity with AWS, Kubernetes, and moving systems from data centers to cloud more »
Saffron Walden, Essex, South East, United Kingdom Hybrid / WFH Options
EMBL-EBI
The IT & Technical Services department's Operations team is seeking a Senior SiteReliabilityEngineer to support the growing portfolio of services it provides to EMBl-EBIs service and research teams. The Operations team is responsible for maintaining and developing the Institutes Transfer Services , the application and … to the varied nature of this role, it may suit an individual with experience in a hands-on systems management role, a Senior Infrastructure Engineer, or someone from a sitereliability engineering background. The role will initially focus on the email systems - understanding and upgrading the infrastructure … cultural, multi-disciplinary staff, at different levels of their IT career. We are eager to welcome new talent who will join us in ensuring reliability and supporting EMBL-EBI's mission to advance scientific discovery. Your role During the first months, the role will focus on the upgrade of more »
SiteReliabilityEngineer- Lead, Mentoring, Kubernetes, PaaS, IaaS, SQL, Azure DevOps, CI/CD A leading provider of financial services is seeking two SiteReliability Engineers- Leads with a solid and proven background in Azure or GCP. This position will also be based onsite in … Will consider candidates from any of the key vendors across the Cloud- Azure, GCP, and AWS. Kubernetes & troubleshooting, managed services like AKS Using your SRE Attitude (understanding SLI, SLO & SLA) Container Image Management & Security like Aquasec Code Quality & repository Management like SonarQube & NexusQ Service Mesh (Istio) traffic shaping, canary, blue … Unit/Integration/Load Testing Azure Application Gateway & API Management Azure IAM - Identity & Access Management Azure Policy Management & Cloud Security Azure Express Route SiteReliabilityEngineer- Lead, Mentoring, Kubernetes, PaaS, IaaS, SQL, Azure DevOps, CI/CD McGregor Boyall is an equal opportunity employer and do more »
operating the best in class, most reliable access network for our customers. About the Team: As a Senior SiteReliabilityEngineer (SRE), you will be part of the SRE team within the CONNECT OpTek team. Our team is responsible for development and support of multiple tools and … applications used by Comcast field technicians to diagnose and troubleshoot issues within the Comcast nation-wide network. The SRE team is responsible for maintaining the existing systems, supporting our development teams, and implementing innovative solutions. You will work alongside software developers, testers, and project managers. What You'll Do: Your … supporting developers to help maintain/define best practices Configuring, watching, tuning and responding to monitoring events Supporting an on-call rotation with the SRE team Maintaining and improving CI/CD pipelines using Concourse and GoCD Supporting corporate initiatives (e.g., security hardening) Having a good time learning and working more »
operating the best in class, most reliable access network for our customers. About the Team: As a Senior SiteReliabilityEngineer (SRE), you will be part of the SRE team within the CONNECT OpTek team. Our team is responsible for development and support of multiple tools and … applications used by Comcast field technicians to diagnose and troubleshoot issues within the Comcast nation-wide network. The SRE team is responsible for maintaining the existing systems, supporting our development teams, and implementing innovative solutions. You will work alongside software developers, testers, and project managers. What You'll Do: Your … supporting developers to help maintain/define best practices Configuring, watching, tuning and responding to monitoring events Supporting an on-call rotation with the SRE team Maintaining and improving CI/CD pipelines using Concourse and GoCD Supporting corporate initiatives (e.g., security hardening) Having a good time learning and working more »
operating the best in class, most reliable access network for our customers. About the Team: As a Senior SiteReliabilityEngineer (SRE), you will be part of the SRE team within the CONNECT OpTek team. Our team is responsible for development and support of multiple tools and … applications used by Comcast field technicians to diagnose and troubleshoot issues within the Comcast nation-wide network. The SRE team is responsible for maintaining the existing systems, supporting our development teams, and implementing innovative solutions. You will work alongside software developers, testers, and project managers. What You'll Do: Your … supporting developers to help maintain/define best practices Configuring, watching, tuning and responding to monitoring events Supporting an on-call rotation with the SRE team Maintaining and improving CI/CD pipelines using Concourse and GoCD Supporting corporate initiatives (e.g., security hardening) Having a good time learning and working more »
operating the best in class, most reliable access network for our customers. About the Team: As a Senior SiteReliabilityEngineer (SRE), you will be part of the SRE team within the CONNECT OpTek team. Our team is responsible for development and support of multiple tools and … applications used by Comcast field technicians to diagnose and troubleshoot issues within the Comcast nation-wide network. The SRE team is responsible for maintaining the existing systems, supporting our development teams, and implementing innovative solutions. You will work alongside software developers, testers, and project managers. What You'll Do: Your … supporting developers to help maintain/define best practices Configuring, watching, tuning and responding to monitoring events Supporting an on-call rotation with the SRE team Maintaining and improving CI/CD pipelines using Concourse and GoCD Supporting corporate initiatives (e.g., security hardening) Having a good time learning and working more »
operating the best in class, most reliable access network for our customers. About the Team: As a Senior SiteReliabilityEngineer (SRE), you will be part of the SRE team within the CONNECT OpTek team. Our team is responsible for development and support of multiple tools and … applications used by Comcast field technicians to diagnose and troubleshoot issues within the Comcast nation-wide network. The SRE team is responsible for maintaining the existing systems, supporting our development teams, and implementing innovative solutions. You will work alongside software developers, testers, and project managers. What You'll Do: Your … supporting developers to help maintain/define best practices Configuring, watching, tuning and responding to monitoring events Supporting an on-call rotation with the SRE team Maintaining and improving CI/CD pipelines using Concourse and GoCD Supporting corporate initiatives (e.g., security hardening) Having a good time learning and working more »
Nottingham, Nottinghamshire, East Midlands, United Kingdom
Experian Ltd
age. If you have a disability or special need that requires accommodation, please let us know at the earliest opportunity. Job Description As a SiteReliability Engineering Manager, you will lead a global team of talented SREs in the development, deployment, and continuous improvement of our Cyber Threat … data storage and compute budget, ensuring effective allocation of resources through management of the data lifecycle. Qualifications This role requires a great deal of SRE technical and managerial skills in a large enterprise environment, such as: A great background in theSRE field supporting a Cyber Threat Detection function, with demonstrable more »
London, England, United Kingdom Hybrid / WFH Options
Bayside Solutions
SiteReliabilityEngineer Contract Salary Range: £91,400 - £108,000 per year Location: London, England - Hybrid Role Job Summary: We seek a SiteReliabilityEngineer to join our team and play a crucial role in ensuring our applications and services' reliability, availability, and … Willingness to adapt and learn new tools and technologies as needed Availability to participate in on-call rotations as required Desired Skills and Experience SiteReliability, Java, AWS, Azure, Kubernetes, GIT, CD Bayside Solutions, Inc. may collect your personal information during the position application process. Please reference Bayside more »
processing data at a scale comparable to Meta and Google! They are on the lookout for multiple count Senior SiteReliability Engineers (SRE) to join one of their incredibly talented teams. As a SiteReliabilityEngineer (SRE), you will play a crucial role in ensuring … the reliability, scalability, and performance of our systems and infrastructure. You will work closely with cross-functional teams to design, implement, and maintain robust and resilient systems, with a focus on automation, monitoring, and incident response. The role: • Working arrangements: Flexible – can be fully remote (UK residents only – unfortunately … automation tools and scripts for deployment, monitoring, and management of infrastructure components. Collaborate with software engineering teams to ensure that applications are designed with reliability, scalability, and performance in mind. Implement and maintain monitoring, alerting, and logging systems to proactively identify and resolve issues before they impact customers. Participate more »
SiteReliabilityEngineer – High Frequency Trading – London - £120-150k base + bonus + share options It’s a very cool time to join this high frequency trading firm. They are past the shaky uncertainty phase of a start-up. The core blocks have been built. They more »
SiteReliabilityEngineer – Google Cloud London Excellent Salary & Package including Bonus Key Skills – SRE, GCP (Enterprise Deployments), HELM, Python/Golang/Java, IAC/Automation, Blockchain Technologies, Node Infrastructure, Security Hardening Overview An influential member of a team of highly skilled engineers building out cloud native … as an enabler for the developers and business. Predominantly supporting Java, Typescript and Python workloads which are built upon open-source software. As an SRE subject matter expert you will: Enable cross functional teams to rapidly code, build and deliver. Own critical parts of the software development life cycle such … accountable for the cloud native deployment environments across dev, staging and production. Expertise Required: At least 5 years professional experience in a DevOps/SRE role Google Cloud Expertise - GCP Enterprise Level Deployments, Helm etc. Experience building tooling, scripts or applications to enhance the developer experience. 2+ years current experience more »
Job Title: SiteReliabilityEngineer Location: Hybrid with onsite requirements in London as and when required Contract Length: Six Months Role Summary Our client has chosen to do something incredible. They are totally transforming their business and building our future on smoke-free products that are a … up and deliver better, brighter solutions and the space to move your career forward in endlessly different directions. We are looking for world-class SiteReliabilityEngineer with experience in developing processes, tools and automation for managing distributed systems in production environments. Our team combines software and … ELK stack (including monitoring tools as mentioned), PagerDuty/OpsGenie or similar, and Jenkins. NON-TECHNICAL REQUIREMENTS: Awareness of SiteReliability Engineering (SRE) principles, including Service Level Objectives (SLOs), Service Level Indicators (SLIs), and error budgets. Understanding of development and operations models, with a holistic view of monitoring more »
Role: SiteReliabilityEngineer Job Type: Permanent Location: London, UK Ready to utilise your skills and experience in configuration and deployment management? Are you passionate about front line production support and quality assurance? Join us as a SiteReliabilityEngineer! Careers at TCS: It … bring products to life. Work with customers and identify opportunities to support their strategy and improve their processes across functions. The Role As a SiteReliabilityEngineer , you’ll be responsible for managing incidents at an Application Support Level and you’ll also be troubleshooting and resolving more »
Role: SiteReliabilityEngineer Job Type: Permanent Location: London, UK Ready to utilise your skills and experience in configuration and deployment management? Are you passionate about front line production support and quality assurance? Join us as a SiteReliabilityEngineer! Careers at TCS: It … bring products to life. Work with customers and identify opportunities to support their strategy and improve their processes across functions. The Role As a SiteReliabilityEngineer , you’ll be responsible for managing incidents at an Application Support Level and you’ll also be troubleshooting and resolving more »
A global high-frequency trading firm is seeking an experienced SiteReliabilityEngineer to join a Systems Infrastructure team working with Linux, Python and AWS. In this role, you will help deliver scalable, secure, and reliable solutions on their AWS Linux digital assets trading platform. Key Responsibilities … in both on-premise and cloud environments. Implement network latency optimization techniques. Create automation scripts and tools using Python. Qualifications: Extensive experience as an SRE with an AWS focus in high-frequency trading or similar performance-focused roles. Expertise in Linux system administration and system engineering. Proficient in Python for … This is a hybrid role in Central London inside an impressive building where you can shape your career. If you are a forward-thinking SRE with a passion for finance and a desire to work on cutting-edge technology, apply now. more »
A global high-frequency trading firm is seeking an experienced SiteReliabilityEngineer to join a Systems Infrastructure team working with Linux, Python and AWS. In this role, you will help deliver scalable, secure, and reliable solutions on their AWS Linux digital assets trading platform. Key Responsibilities … in both on-premise and cloud environments. Implement network latency optimization techniques. Create automation scripts and tools using Python. Qualifications: Extensive experience as an SRE with an AWS focus in high-frequency trading or similar performance-focused roles. Expertise in Linux system administration and system engineering. Proficient in Python for … This is a hybrid role in Central London inside an impressive building where you can shape your career. If you are a forward-thinking SRE with a passion for finance and a desire to work on cutting-edge technology, apply now. more »
SREEngineer should have knowledge of alerting and monitoring tools The tools can be Splunk, Log DNA, Grafana, AWS Cloud Watch Should have knowledge of CI/CD tools. The tools can be Team City, Jenkins, IBM Tool Chain etc Should have knowledge of APM and observability tools. The more »
Simply Commerce - Digital Commerce Recruitment Experts
a hands on Engineering Managers to join their newly created platform squad, this will involve building and hiring a sitereliability engineering (SRE) team. The purpose of this team to make life for the development teams in other parts of the business easier by providing a set of … still wants to be hands on. Responsibilities: This role is 60% hands on, 40% management. Lead, grow and hire a team of engineers and SRE's ensuring the platform and applications running on it are stable and secure. Create and Lead strategy for planned outages and DR exercises. Implement monitoring … of commercial expertise in core Java and Spring Boot. 3+ years experience building software products in Javascript/Typescript. 4+ years experience in an SRE/DevOps role Strong experience with AWS. Experience setting and managing Service Level Objectives (SLOs) and Service Level Agreements (SLAs) Salary up to more »
help millions of consumers build a brighter financial future and achieve yours along the way with a rewarding career. Sitereliability engineering (SRE) is a set of principles and practices that incorporates aspects of software engineering and applies them to IT infrastructure and operations. The main objectives are … availability, latency, performance, efficiency, change management, monitoring, emergency response and capacity planning of their services. As an Application SiteReliabilityEngineer (SRE) you will be part of team of people who are responsible for the availability of a number of Discover's most critical applications: our Card … only: technical proficiency rating of competent on the Dreyfus engineering scale , Preferred Qualifications Bonus Points If You Have: 3+ years of experience as an SRE Good understanding of hybrid infrastructure, ideally with AWS Expertise in one or more general purpose programming languages: Python, Go, shell scripting (Unix/Linux), Java more »
help millions of consumers build a brighter financial future and achieve yours along the way with a rewarding career. Sitereliability engineering (SRE) is a set of principles and practices that incorporates aspects of software engineering and applies them to IT infrastructure and operations. The main objectives are … availability, latency, performance, efficiency, change management, monitoring, emergency response and capacity planning of their services. As an Application SiteReliabilityEngineer (SRE) you will be part of team of people who are responsible for the availability of a number of Discover's most critical applications: our Card … only: technical proficiency rating of competent on the Dreyfus engineering scale , Preferred Qualifications Bonus Points If You Have: 3+ years of experience as an SRE Good understanding of hybrid infrastructure, ideally with AWS Expertise in one or more general purpose programming languages: Python, Go, shell scripting (Unix/Linux), Java more »