the systems that make many of these daily experiences possible. If you’ve used Apple products, you’ve likely interacted with us. iCloud Services SRE teams are responsible for the systems and services that directly support those customers and their experiences. We focus on availability and automation of key services … the world.Key QualificationsExperience with large scale distributed systems, especially ML infrastructure and services including LLMs, Generative AI, and transformersDemonstrable success leading engineering teams - ideally SRE or Production EngineeringKnowledge of core operating system principles, networking fundamentals, and systems managementUnderstanding of SRE principals, including monitoring, alerting, error budgets, fault analysis, and other … person to join this amazing team. You will be an accomplished builder and leader of teams looking to tackle your next challenge. You know SRE and you know what it will take to run services at Apple scale with a high degree of operational perfection. This role will position you more »
to do so. Preferred Technical and Professional Expertise RCHA/RCHE Linux Certification AWS Certified Cloud Practitioner AWS Certified SysOps Administrator AWS Certified DevOps Engineer AWS Certified Solutions Architect Docker Certified Associate Certified Kubernetes Administrator About Business UnitThe Client Innovation Centre (CIC) is an innovative and exciting part of more »
Nottingham, Nottinghamshire, East Midlands, United Kingdom
Experian Ltd
age. If you have a disability or special need that requires accommodation, please let us know at the earliest opportunity. Job Description As a SiteReliability Engineering Manager, you will lead a global team of talented SREs in the development, deployment, and continuous improvement of our Cyber Threat … data storage and compute budget, ensuring effective allocation of resources through management of the data lifecycle. Qualifications This role requires a great deal of SRE technical and managerial skills in a large enterprise environment, such as: A great background in theSRE field supporting a Cyber Threat Detection function, with demonstrable more »
The Opportunity: If you are passionate about digital projects and have the ability to ensure they run smoothly and reliably we would love to discuss our clients ongoing success story working on a number of high profile government projects that more »
The Opportunity: If you are passionate about digital projects and have the ability to ensure they run smoothly and reliably we would love to discuss our clients ongoing success story working on a number of high profile government projects that more »
to deliver new features rapidly, securely, and at scale. Play a critical role in evolving our infrastructure to address complex technical challenges related to reliability, latency, bandwidth, and security. Improve observability, monitoring, and alerting throughout the platform. Coordinate work across different areas of the company to ensure efficient execution. more »
Simply Commerce - Digital Commerce Recruitment Experts
a hands on Engineering Managers to join their newly created platform squad, this will involve building and hiring a sitereliability engineering (SRE) team. The purpose of this team to make life for the development teams in other parts of the business easier by providing a set of … still wants to be hands on. Responsibilities: This role is 60% hands on, 40% management. Lead, grow and hire a team of engineers and SRE's ensuring the platform and applications running on it are stable and secure. Create and Lead strategy for planned outages and DR exercises. Implement monitoring … of commercial expertise in core Java and Spring Boot. 3+ years experience building software products in Javascript/Typescript. 4+ years experience in an SRE/DevOps role Strong experience with AWS. Experience setting and managing Service Level Objectives (SLOs) and Service Level Agreements (SLAs) Salary up to more »
Are you passionate about ensuring a great customer experience behind the scenes? This role will be predominately operational, focused on improving & supporting front-line SRE operations. The focus will centre on operational readiness, resiliency & quality standards. In addition, there will be the opportunity to contribute & define exciting but scalable reliability … get the best possible experience. Competencies : Well organised, learning on the fly, self-development, problem-solving, functional/technical skills. Prior experience as an SRE, System Administrator, Application Support, or similar role. more »