enhances and strengthens internal tooling while evangelizing new use cases among existing internal customers and stakeholders. Fulfil the responsibilities of a DevOps and automation engineer working on cloud-native technologies. Research the collection, parsing, and analysis of infrastructure data from various devices or services while developing/enhancing tool more »
enhances and strengthens internal tooling while evangelizing new use cases among existing internal customers and stakeholders. Fulfil the responsibilities of a DevOps and automation engineer working on cloud-native technologies. Research the collection, parsing, and analysis of infrastructure data from various devices or services while developing/enhancing tool more »
SiteReliabilityEngineer/SRE Reigate/WFH to £85k Global FinTech is seeking a skilled SiteReliabilityEngineer/SRE to collaborate across product focussed Agile engineering teams to ensure the reliability, availability and performance of client facing services. Responsibilities will include … week for team meet-ups and stakeholder meetings with the other three days work from home. About you: You have experience in a similar SRE/SiteReliabilityEngineer position You have experience of running 24x7 services in the public cloud - Azure preferred You have experience with … happy to collaborate with senior stakeholders and mentor others What's in it for you: As a SiteReliabilityEngineer/SRE you will receive a competitive salary plus a range of perks and benefits: Up to £85k salary plus bonus Hybrid working (3 days a week more »
help millions of consumers build a brighter financial future and achieve yours along the way with a rewarding career. SiteReliability Engineering (SRE) is a set of principles and practices that incorporates aspects of software engineering and applies them to IT infrastructure and operations. The main objectives are … availability, latency, performance, efficiency, change management, monitoring, emergency response and capacity planning of their services. As an Application SiteReliabilityEngineer (SRE) you will be part of team of people who are responsible for the availability of several of Discover's most critical applications: our PULSE network … operational goals (MTTR reduction, incident reduction, platform availability, SLO\SLA targets) Ensure the proper level of documentation exists, is maintained, and reviewed regularly Drive SRE community discussions Participate in an on call rotation Minimum Qualifications At a minimum, here's what we need from you: Bachelors - Computer Science or related more »
SiteReliabilityEngineer- Lead, Mentoring, Kubernetes, PaaS, IaaS, SQL, Azure DevOps, CI/CD A leading provider of financial services is seeking two SiteReliability Engineers- Leads with a solid and proven background in Azure or GCP. This position will also be based onsite in … Will consider candidates from any of the key vendors across the Cloud- Azure, GCP, and AWS. Kubernetes & troubleshooting, managed services like AKS Using your SRE Attitude (understanding SLI, SLO & SLA) Container Image Management & Security like Aquasec Code Quality & repository Management like SonarQube & NexusQ Service Mesh (Istio) traffic shaping, canary, blue … Unit/Integration/Load Testing Azure Application Gateway & API Management Azure IAM - Identity & Access Management Azure Policy Management & Cloud Security Azure Express Route SiteReliabilityEngineer- Lead, Mentoring, Kubernetes, PaaS, IaaS, SQL, Azure DevOps, CI/CD McGregor Boyall is an equal opportunity employer and do more »
La Fosse are currently partnered with a client who are looking to hire a SiteReliabilityEngineer into their team, on a contract that will initially run to the end of this year. This role is paying £550 a day, fully remote and inside IR35. Main Responsibilities more »
Leeds, West Yorkshire, Yorkshire, United Kingdom Hybrid / WFH Options
Evri
help you grow. We're never one-size-fits-all. Our careers are as unique as you are. We are looking for a Senior SiteReliabilityEngineer to be responsible for providing the tooling, processes and support that their team requires to Reliably deploy applications to production … standards to the TDA (Technical Design Authority) Ensuring that the Service Level Objectives in your area are met Helping to develop and promote the SRE service catalogue Ensuring the best security practices are followed Supporting and developing junior members of the team Capturing the SLIs and mapping them to the more »
Employment Type: Permanent, Part Time, Work From Home
Saffron Walden, Essex, South East, United Kingdom Hybrid / WFH Options
EMBL-EBI
The IT & Technical Services department's Operations team is seeking a Senior SiteReliabilityEngineer to support the growing portfolio of services it provides to EMBl-EBIs service and research teams. The Operations team is responsible for maintaining and developing the Institutes Transfer Services , the application and … to the varied nature of this role, it may suit an individual with experience in a hands-on systems management role, a Senior Infrastructure Engineer, or someone from a sitereliability engineering background. The role will initially focus on the email systems - understanding and upgrading the infrastructure … cultural, multi-disciplinary staff, at different levels of their IT career. We are eager to welcome new talent who will join us in ensuring reliability and supporting EMBL-EBI's mission to advance scientific discovery. Your role During the first months, the role will focus on the upgrade of more »
Nottingham, Nottinghamshire, East Midlands, United Kingdom
Microlise
Lead EngineerSRE When registering to this job board you will be redirected to the online application form. Please ensure that this is completed in full in order that your application can be reviewed. Our Engineering Team is 200 strong, from Apprentice Engineers through to Enterprise Architects, and were … currently in an exciting period of growth! As our new Lead Engineer, you would be key to maximising this growth through coaching in terms of technical performance, achieving technical evangelism and acting in a leadership role in terms of design review. We provide clear career ladders for each employee … you are looking for a new challenge and have a strong technical background, then we want to hear from you! As our new Lead SiteReliabilityEngineer , you will be key to maximising the observability of our infrastructure and applications, and to resolving error-prone manual processes more »
evolution of their applications to deliver a modern, first class, cloud based platform to their users. As such we are looking for an experienced SRE to join the team to drive best Agile practices, DevOps and software development ways of working. You must have worked within the FS industry previously … strong CI/CD experience and have the ability to automate to eliminate/reduce toil. This role will require you to be on site 3 days a week and is inside IR35. more »
JOB TITLE: Senior SiteReliabilityEngineer (SRE) LOCATION: Halifax, Leeds or Manchester HOURS: Full-time WORKING PATTERN: Our work style is hybrid, which involves spending at least two days per week, or 40% of our time, at one of our office sites. Who are Lloyds Banking Group … there are some specific skills that we'd need to see: · Experience of CI/CD across various tooling and methodologies. · Experience as a SRE, Service, DevOps Engineer or in a similar software or cloud role with a focus on service management. · Strong critical thinking skills, leading live support more »
build a brighter financial future and achieve yours along the way with a rewarding career. As a Principal SiteReliabilityEngineer (SRE), you'll tap into your passion for finding and fixing inefficiencies to solve our reliability and performance issues. You'll work on projects including … CI/CD, improving data monitoring, and work with our internal product group to help build and define our SRE practice within our Fraud value stream. Responsibilities Develop and run SRE own tooling and observability using automation like CI/CD, and Kubernetes. Build monitoring that alerts on symptoms rather … or related Internal applicants only: technical proficiency rating of proficient on the Dreyfus engineering scale Preferred Qualifications Bonus Points If You Have: 5+ years SRE experience Think about systems: edge cases, failure modes, behaviors, specific implementations. Strong knowledge of SDLC (System Development Life Cycle) Strong knowledge of git, Docker, Kubernetes more »
build a brighter financial future and achieve yours along the way with a rewarding career. As a Principal SiteReliabilityEngineer (SRE), you'll tap into your passion for finding and fixing inefficiencies to solve our reliability and performance issues. You'll work on projects including … CI/CD, improving data monitoring, and work with our internal product group to help build and define our SRE practice within our Fraud value stream. Responsibilities Develop and run SRE own tooling and observability using automation like CI/CD, and Kubernetes. Build monitoring that alerts on symptoms rather … or related Internal applicants only: technical proficiency rating of proficient on the Dreyfus engineering scale Preferred Qualifications Bonus Points If You Have: 5+ years SRE experience Think about systems: edge cases, failure modes, behaviors, specific implementations. Strong knowledge of SDLC (System Development Life Cycle) Strong knowledge of git, Docker, Kubernetes more »
Nottingham, Nottinghamshire, East Midlands, United Kingdom
Experian Ltd
age. If you have a disability or special need that requires accommodation, please let us know at the earliest opportunity. Job Description As a SiteReliability Engineering Manager, you will lead a global team of talented SREs in the development, deployment, and continuous improvement of our Cyber Threat … data storage and compute budget, ensuring effective allocation of resources through management of the data lifecycle. Qualifications This role requires a great deal of SRE technical and managerial skills in a large enterprise environment, such as: A great background in theSRE field supporting a Cyber Threat Detection function, with demonstrable more »
architecture. Key Responsibilities : Design and manage Java based microservices, bash scripts, Redis, High-Availability design, while strictly adhering to SiteReliability Engineering (SRE) principles. Thrive in high-pressure environments, working swiftly and reliably to maintain system integrity and meet service level objectives (SLOs) and service level indicators (SLIs … Lead initiatives to enhance current systems and implement innovative solutions in collaboration with a fast-paced, mission-driven team, focusing on the implementation of SRE best practices. Conduct thorough root-cause analyses for production incidents and generate high-quality RCA reports, leveraging SRE methodologies to prevent recurrence. Apply software engineering … principles to rectify operational challenges and optimize system performance, with a specific focus on implementing SRE-driven solutions. Ensure the availability, latency, performance, efficiency, and security of our infrastructure, adhering rigorously to SRE principles and best practices. Design and maintain robust production monitoring systems to ensure timely detection and resolution more »
architecture. Key Responsibilities : Design and manage Java based microservices, bash scripts, Redis, High-Availability design, while strictly adhering to SiteReliability Engineering (SRE) principles. Thrive in high-pressure environments, working swiftly and reliably to maintain system integrity and meet service level objectives (SLOs) and service level indicators (SLIs … Lead initiatives to enhance current systems and implement innovative solutions in collaboration with a fast-paced, mission-driven team, focusing on the implementation of SRE best practices. Conduct thorough root-cause analyses for production incidents and generate high-quality RCA reports, leveraging SRE methodologies to prevent recurrence. Apply software engineering … principles to rectify operational challenges and optimize system performance, with a specific focus on implementing SRE-driven solutions. Ensure the availability, latency, performance, efficiency, and security of our infrastructure, adhering rigorously to SRE principles and best practices. Design and maintain robust production monitoring systems to ensure timely detection and resolution more »
architecture. Key Responsibilities : Design and manage Java based microservices, bash scripts, Redis, High-Availability design, while strictly adhering to SiteReliability Engineering (SRE) principles. Thrive in high-pressure environments, working swiftly and reliably to maintain system integrity and meet service level objectives (SLOs) and service level indicators (SLIs … Lead initiatives to enhance current systems and implement innovative solutions in collaboration with a fast-paced, mission-driven team, focusing on the implementation of SRE best practices. Conduct thorough root-cause analyses for production incidents and generate high-quality RCA reports, leveraging SRE methodologies to prevent recurrence. Apply software engineering … principles to rectify operational challenges and optimize system performance, with a specific focus on implementing SRE-driven solutions. Ensure the availability, latency, performance, efficiency, and security of our infrastructure, adhering rigorously to SRE principles and best practices. Design and maintain robust production monitoring systems to ensure timely detection and resolution more »
architecture. Key Responsibilities : Design and manage Java based microservices, bash scripts, Redis, High-Availability design, while strictly adhering to SiteReliability Engineering (SRE) principles. Thrive in high-pressure environments, working swiftly and reliably to maintain system integrity and meet service level objectives (SLOs) and service level indicators (SLIs … Lead initiatives to enhance current systems and implement innovative solutions in collaboration with a fast-paced, mission-driven team, focusing on the implementation of SRE best practices. Conduct thorough root-cause analyses for production incidents and generate high-quality RCA reports, leveraging SRE methodologies to prevent recurrence. Apply software engineering … principles to rectify operational challenges and optimize system performance, with a specific focus on implementing SRE-driven solutions. Ensure the availability, latency, performance, efficiency, and security of our infrastructure, adhering rigorously to SRE principles and best practices. Design and maintain robust production monitoring systems to ensure timely detection and resolution more »
architecture. Key Responsibilities : Design and manage Java based microservices, bash scripts, Redis, High-Availability design, while strictly adhering to SiteReliability Engineering (SRE) principles. Thrive in high-pressure environments, working swiftly and reliably to maintain system integrity and meet service level objectives (SLOs) and service level indicators (SLIs … Lead initiatives to enhance current systems and implement innovative solutions in collaboration with a fast-paced, mission-driven team, focusing on the implementation of SRE best practices. Conduct thorough root-cause analyses for production incidents and generate high-quality RCA reports, leveraging SRE methodologies to prevent recurrence. Apply software engineering … principles to rectify operational challenges and optimize system performance, with a specific focus on implementing SRE-driven solutions. Ensure the availability, latency, performance, efficiency, and security of our infrastructure, adhering rigorously to SRE principles and best practices. Design and maintain robust production monitoring systems to ensure timely detection and resolution more »
architecture. Key Responsibilities : Design and manage Java based microservices, bash scripts, Redis, High-Availability design, while strictly adhering to SiteReliability Engineering (SRE) principles. Thrive in high-pressure environments, working swiftly and reliably to maintain system integrity and meet service level objectives (SLOs) and service level indicators (SLIs … Lead initiatives to enhance current systems and implement innovative solutions in collaboration with a fast-paced, mission-driven team, focusing on the implementation of SRE best practices. Conduct thorough root-cause analyses for production incidents and generate high-quality RCA reports, leveraging SRE methodologies to prevent recurrence. Apply software engineering … principles to rectify operational challenges and optimize system performance, with a specific focus on implementing SRE-driven solutions. Ensure the availability, latency, performance, efficiency, and security of our infrastructure, adhering rigorously to SRE principles and best practices. Design and maintain robust production monitoring systems to ensure timely detection and resolution more »
architecture. Key Responsibilities : Design and manage Java based microservices, bash scripts, Redis, High-Availability design, while strictly adhering to SiteReliability Engineering (SRE) principles. Thrive in high-pressure environments, working swiftly and reliably to maintain system integrity and meet service level objectives (SLOs) and service level indicators (SLIs … Lead initiatives to enhance current systems and implement innovative solutions in collaboration with a fast-paced, mission-driven team, focusing on the implementation of SRE best practices. Conduct thorough root-cause analyses for production incidents and generate high-quality RCA reports, leveraging SRE methodologies to prevent recurrence. Apply software engineering … principles to rectify operational challenges and optimize system performance, with a specific focus on implementing SRE-driven solutions. Ensure the availability, latency, performance, efficiency, and security of our infrastructure, adhering rigorously to SRE principles and best practices. Design and maintain robust production monitoring systems to ensure timely detection and resolution more »
architecture. Key Responsibilities : Design and manage Java based microservices, bash scripts, Redis, High-Availability design, while strictly adhering to SiteReliability Engineering (SRE) principles. Thrive in high-pressure environments, working swiftly and reliably to maintain system integrity and meet service level objectives (SLOs) and service level indicators (SLIs … Lead initiatives to enhance current systems and implement innovative solutions in collaboration with a fast-paced, mission-driven team, focusing on the implementation of SRE best practices. Conduct thorough root-cause analyses for production incidents and generate high-quality RCA reports, leveraging SRE methodologies to prevent recurrence. Apply software engineering … principles to rectify operational challenges and optimize system performance, with a specific focus on implementing SRE-driven solutions. Ensure the availability, latency, performance, efficiency, and security of our infrastructure, adhering rigorously to SRE principles and best practices. Design and maintain robust production monitoring systems to ensure timely detection and resolution more »
architecture. Key Responsibilities : Design and manage Java based microservices, bash scripts, Redis, High-Availability design, while strictly adhering to SiteReliability Engineering (SRE) principles. Thrive in high-pressure environments, working swiftly and reliably to maintain system integrity and meet service level objectives (SLOs) and service level indicators (SLIs … Lead initiatives to enhance current systems and implement innovative solutions in collaboration with a fast-paced, mission-driven team, focusing on the implementation of SRE best practices. Conduct thorough root-cause analyses for production incidents and generate high-quality RCA reports, leveraging SRE methodologies to prevent recurrence. Apply software engineering … principles to rectify operational challenges and optimize system performance, with a specific focus on implementing SRE-driven solutions. Ensure the availability, latency, performance, efficiency, and security of our infrastructure, adhering rigorously to SRE principles and best practices. Design and maintain robust production monitoring systems to ensure timely detection and resolution more »
architecture. Key Responsibilities : Design and manage Java based microservices, bash scripts, Redis, High-Availability design, while strictly adhering to SiteReliability Engineering (SRE) principles. Thrive in high-pressure environments, working swiftly and reliably to maintain system integrity and meet service level objectives (SLOs) and service level indicators (SLIs … Lead initiatives to enhance current systems and implement innovative solutions in collaboration with a fast-paced, mission-driven team, focusing on the implementation of SRE best practices. Conduct thorough root-cause analyses for production incidents and generate high-quality RCA reports, leveraging SRE methodologies to prevent recurrence. Apply software engineering … principles to rectify operational challenges and optimize system performance, with a specific focus on implementing SRE-driven solutions. Ensure the availability, latency, performance, efficiency, and security of our infrastructure, adhering rigorously to SRE principles and best practices. Design and maintain robust production monitoring systems to ensure timely detection and resolution more »
architecture. Key Responsibilities : Design and manage Java based microservices, bash scripts, Redis, High-Availability design, while strictly adhering to SiteReliability Engineering (SRE) principles. Thrive in high-pressure environments, working swiftly and reliably to maintain system integrity and meet service level objectives (SLOs) and service level indicators (SLIs … Lead initiatives to enhance current systems and implement innovative solutions in collaboration with a fast-paced, mission-driven team, focusing on the implementation of SRE best practices. Conduct thorough root-cause analyses for production incidents and generate high-quality RCA reports, leveraging SRE methodologies to prevent recurrence. Apply software engineering … principles to rectify operational challenges and optimize system performance, with a specific focus on implementing SRE-driven solutions. Ensure the availability, latency, performance, efficiency, and security of our infrastructure, adhering rigorously to SRE principles and best practices. Design and maintain robust production monitoring systems to ensure timely detection and resolution more »