SiteReliabilityEngineer (SRE) - LLM and Machine Learning London/Remote Roles we're searching for now: Software Engineering/We are a pioneering technology company specialising in cutting-edge Language Models (LLM) and Machine Learning solutions. We are seeking a highly skilled SiteReliabilityEngineer (SRE) to join our team and ensure the reliability, scalability, and performance of our LLM and Machine Learning infrastructure. As an SRE, you will play a critical role in maintaining the stability and efficiency of our LLM and Machine Learning platforms. You will work closely with … teams to implement security best practices, vulnerability assessments, and compliance requirements for LLM and Machine Learning systems. Continuous Improvement: Continuously evaluate and improve system reliability, performance, and efficiency through automation and optimisation. Documentation: Maintain comprehensive documentation for infrastructure configurations, procedures, and incident reports. Requirements Bachelor's or Master's more »
Manchester, North West, United Kingdom Hybrid / WFH Options
Searchability (UK) Ltd
SiteReliabilityEngineer Salary up £40,000 plus benefits Hybrid working options available Based in Manchester About the client: Due to the rapid growth of our company, we are looking for an enthusiastic SiteReliabilityEngineer , who is keen to learn and bring good … Exciting employee benefits and bonus scheme Income protection and life assurance policies Tax-free childcare scheme and eye care for staff Company pension The SiteReliabilityEngineer role: Maintain critical platform supporting functionality Optimize production systems in collaboration with Software Developers Communicate effectively with wider teams, both … verbally and in writing Ensure successful operation of business critical systems/platforms SiteReliabilityEngineer - Essential skills: Previous experience with Linux and Unix Preferred experience with Python Excellent communication skills Strong understanding of networking Knowledge of GNU utilities Capacity to learn and build on current knowledge more »
Bristol, Avon, South West, United Kingdom Hybrid / WFH Options
Twinstream Limited
SITERELIABILITYENGINEER/BRISTOL/UP TO £85K & GREAT BENEFTIS Are you an experienced SiteReliabilityEngineer looking for an exciting new challenge? If so, we have the perfect opportunity for you. Excellent pay and extensive benefits package. In 2019, our founders were … experience into one business, providing technical excellence and exceptional service to their clients. The business is headquartered in Cheltenham with teams working both on-site with clients and remotely from home. We are looking for skilled engineers to join a new team that will deploy and maintain our established … lease an electric vehicle via salary sacrifice Health and Well-being - Access to workplace Mental Health First Aider Life assurance Key Responsibilities of the SiteReliabilityEngineer: Collaborate with Feature Development teams to promote new component versions into production as efficiently as possible. Maintain the system to more »
BS1, Bristol, City of Bristol, United Kingdom Hybrid / WFH Options
Twinstream Limited
SITERELIABILITYENGINEER/BRISTOL/UP TO £85K & GREAT BENEFTIS Are you an experienced SiteReliabilityEngineer looking for an exciting new challenge? If so, we have the perfect opportunity for you. Excellent pay and extensive benefits package. In 2019, our founders were … experience into one business, providing technical excellence and exceptional service to their clients. The business is headquartered in Cheltenham with teams working both on-site with clients and remotely from home. We are looking for skilled engineers to join a new team that will deploy and maintain our established … lease an electric vehicle via salary sacrifice Health and Well-being - Access to workplace Mental Health First Aider Life assurance Key Responsibilities of the SiteReliabilityEngineer: Collaborate with Feature Development teams to promote new component versions into production as efficiently as possible. Maintain the system to more »
City of London, London, United Kingdom Hybrid / WFH Options
Akkodis
Azure SiteReliabilityEngineer Akkodis are currently working in partnership with a leading service provider to recruit an experienced Azure SiteReliabilityEngineer to join a growing team of talented Cloud Engineers providing high level support and project delivery for a large customer base. … fully remote role and you must be eligible to gain security clearance (do not need to hold currently). The Role As an Azure SiteReliabilityEngineer you will support the cloud infrastructure used to deliver cloud hosted managed services to customers. You will have a high more »
Manchester, Lancashire, United Kingdom Hybrid / WFH Options
Confidential
SiteReliabilityEngineer Salary up £40,000 plus benefits Hybrid working options available Based in Manchester About the client: Due to the rapid growth of our company, we are looking for an enthusiastic SiteReliabilityEngineer , who is keen to learn and bring good more »
Glasgow, Lanarkshire, Scotland, United Kingdom Hybrid / WFH Options
Sanderson Recruitment
SiteReliabilityEngineer -Remote/Glasgow -Salary to £75,000 + Bonus -Immediate Start Fantastic new opportunity to the market to join our Glasgow-based tech-for-good client, specialising in digital solutions and who have a huge global reach. Due to increased success in their space … and demand for their services, they are now recruiting for a SiteReliabilityEngineer to join the team as they embark on a hugely exciting roadmap. The core function of the role will be to maintain a secure and reliable infrastructure, define and manage the infrastructure and more »
Reference : BH-298c Job Role: Senior SiteReliabilityEngineer Job Type: Contract IR35 : Inside IR35 Day Rate: £600/Day Contract Duration: 6 months Working Hours: 5 days per week Remote Working : 4 days remote working. 1 day on-site in London Location: Hybrid Remote/… London (UK only) Role Overview: We re looking for a Senior SiteReliabilityEngineer with deep Google Cloud (GCP) experience, to join our customer s organisation. Responsibilities Influencing Service Level Objectives, Non-Functional Requirements, and infrastructure requirements Ensuring that the Service Level Objectives in the dev teams … Root Cause Analysis) Maintain existing compliance and governance standards established in the business Key Experience: Deep understanding of Google Cloud (GCP) Deep understanding of SRE ethos and principles Vast amounts of Terraform experience Solid experience with Python Solid experience of Observability tooling. Good experience in dashboard creation/data visualisation more »
passionate about Real Time data and automation? Our client, a leading Real Time data platform company, is on the lookout for a talented ClickHouse SiteReliabilityEngineer to join their innovative team. If you thrive in a fully remote, dynamic environment and have a knack for managing … and automating ClickHouse databases, this could be your next exciting opportunity! About the Role: As a ClickHouse SiteReliabilityEngineer, you will play a pivotal role in ensuring the seamless operation and reliability of ClickHouse databases. you'll be developing and implementing automation strategies for upgrades more »
Leeds, West Yorkshire, United Kingdom Hybrid / WFH Options
Confidential
of engineers around the world working on truly groundbreaking projects. So what will you be required to do? Provide leadership and guidance across the SRE team motivating and driving the team with technical leadership acting as a subject matter expert and leading best practice techniques. Lead the SRE team in … ensuring technical assurance in significant projects, for the delivery of quality technical deliverables, which may involve several teams or technologies. Oversee the SRE team to ensure they are involved in every step of the application software development lifecycle, including product design, development, testing, and transition into operation. Provide coaching and … mentoring to the SRE team to improve their skillset, increase knowledge and set the benchmark of quality and precision engineering Oversee the implementation of service transition and change and release process changes, ensuring that processes are reviewed and improved with onus on optimisation Evaluate risks and defects, analysing specifications, and more »
Stanmore, England, United Kingdom Hybrid / WFH Options
Sky
with development teams to streamline deployment processes using Jenkins pipelines and ArgoCD . Monitor system performance and troubleshoot issues to ensure high availability and reliability of our services. Optimise and automate infrastructure management tasks to improve efficiency and reduce manual intervention. Implement robust security practices to safeguard our infrastructure … respect. We embrace authenticity and inspire our people to thrive.Global Streaming Technology Global Operations team is seeking a SiteReliabilityEngineer (SRE) to enhance and expand our global operations. We develop and maintain tools that empower our colleagues to deliver a seamless streaming platform accessible to a more »
Lincoln, England, United Kingdom Hybrid / WFH Options
Jobleads-UK
that demonstrates your qualifications and skill set pertaining to this position. Applications without a resume will not be considered. We are seeking a talented SRE (SiteReliabilityEngineer) to join our growing team. The ideal candidate should be at a senior level, but we are open to … candidates who are close to this level and have relevant experience. As an SRE, you will be responsible for ensuring that our products are reliable, available, and scalable. You will work closely with our development teams to identify and resolve issues before they impact our customers while advocating for the … improvements needed to provide a world class cloud experience for our customers. The SRE will be responsible for building systems and tooling to enable and empower the dev teams to work more efficiently while fortifying a cloud-first mentality. As a member of this team, you’ll get exposure to more »
of Canonical s IT production services. The team is in charge of running services used by over 60 million Ubuntu users. As an Senior SRE & Gitops engineer you ll be in a unique position to drive operations automation to the next level, both in our own private clouds as … together and support each other to provide the best possible services to our company, Canonical s customers and the Ubuntu Community. As a Senior SiteReliability/Gitops Engineerengineer you will Drive the development of automation, Gitops in your team as an embedded tech lead more »
in upskilling, learning new tech Deeply curious, creative, and innovative Flexible in working hours/ability to collaborate in different time zones The Lead SiteReliabilityEngineer has a pivotal role at the forefront of our engineering operations, responsible for guiding the Platform Team toward achieving exceptional … standards of reliability, performance, and stability across all our applications. The successful candidate will possess deep expertise in these core areas and will be instrumental in defining and implementing industry-leading practices. As a key leader, this role will not only shape … the strategic direction of our platform operations but also establish the benchmarks and processes by which our engineering excellence is measured. Responsibilities Lead the SRE Team, setting clear goals and priorities in line with business objectives. In collaboration with the department Director develop and execute strategies that enhance technological capabilities more »
Bristol, Somerset, United Kingdom Hybrid / WFH Options
Talent Leaders
CLOUD PLATFORM OPERATIONS ENGINEER - RHEL, KUBERNETES, AUTOMATION, IAC, AWS, OPENSHIFT - SRE, £70-80k + BRISTOL - 3 DAYS A WEEK The Client: Well recognised £1bn "Tech for Good" UK organisation with a reputation for excellence & innovation; Currently undergoing a significant Agile digital transformation programme innovating their core end to … role will be varied challenging and involve be responsible for: Continuous Improvement: Support, maintain and improve highly scalable and reliable cloud platform infrastructure services. SRE - SiteReliabilityEngineer: You will be responsible for the day-to-day operations of the Cloud Platform and be accountable for its … availability. To be considered Strong command line experience with RHEL (minimum of 4 years), Cloud Operations & SRE Proficient at Scripting with one or more of the following: Java, Perl, Python, or Scripting in Shell and Perl Automating delivery of Infrastructure as Code eg Terraform Working with software development methodologies; agile more »
Cheltenham, Gloucestershire, United Kingdom Hybrid / WFH Options
Confidential
Must have active enhanced DV (West) Clearance Up to £85k DoE plus 15% clearance bonus Must be willing to be full-time on-site in Cheltenham (some hybrid work dependent on … project) Skills required in Java Spring Boot, Kubernetes & Docker, Elastic, Helm, Linux, Git, CI/CD Who are we? We are recruiting a Senior SRE with enhanced DV Clearance for a prestigious client to work on a portfolio of public and pr... more »