MAIN DUTIES/RESPONSIBILITIES OF THE ROLE: Deep understanding of core AWS cloud services including networking, storage solutions, load balancing, containers, messaging, serverless architectures, and monitoring/logging. Deploying and supporting production Kubernetes(EKS) clusters through their growth Strong knowledge More ❯
London, England, United Kingdom Hybrid / WFH Options
ZipRecruiter
Job Description You will have deep knowledge of Linux build, integration, and engineering, with secondary expertise in high-performance computing, storage, or networking, as well as a strong understanding of contemporary containerized, infrastructure-as-code technologies, to play a key role in the build of a sophisticated, highly performant, and highly optimized infrastructure. You should apply for this role … if you are/have: 15+ years of systems, infrastructure and/or platformengineering in financial markets or other critical environments Deep understanding of Unix/Linux systems engineering to Kernel level, for high performance/low latency requirements Strong systems design and optimization, low latency tuning and/or GPU processing Strong automation and/… or containerization experience in Ansible, Terraform, Puppet, Chef, Kubernetes, and similar tools Scripting experience in Shell, Bash, and Python; C++ desirable but not required Platformengineering/site reliability engineering experience Degree educated or higher in Computer Science, Engineering, or related This is a £600-£650/day contract role (dependent on experience), based in London More ❯
Join to apply for the SR Site Reliability Engineer role at Wakapi . We are seeking a highly skilled Senior Site Reliability Engineer to join our PlatformEngineering team. The ideal candidate will have a strong understanding of DevOps and Service Level Management (SLM) metrics, with experience in event-driven infrastructure projects using tools like Terraform, New Relic … Kubernetes, AWS, and Kafka. As a PlatformEngineering representative, you will collaborate with engineering teams to ensure our platform infrastructure tooling meets their needs and positively impacts Developer Experience. You will also assist in setting appropriate thresholds for alerts and automations related to their applications. Responsibilities Design, implement, and maintain scalable and highly available systems using … response times, and resolution times to ensure compliance with SLAs, SLOs, and SLIs. Implement and promote system resiliency practices, including Chaos Engineering. Collaborate with cross-functional teams to enhance platformengineering practices and gather metrics data. Requirements Proven experience with Infrastructure-as-Code tools like Terraform. Strong understanding of scalability, high availability patterns, and DevOps metrics such as More ❯
workplace - expectation is 1 to 2 days in the office. Closing Date: 29th April 2025. The role The Enterprise Platforms Principal Engineer will drive the evolution of the Enterprise Platform, ensuring the platform continues to evolve in line with changing business requirements for corporate application hosting. This senior role has responsibility for the development of the Enterprise Platforms … Enterprise Azure Infrastructure as a Service (IaaS) for Enterprise GCP Infrastructure as a Service (IaC) Windows Citrix Jira LogicMonitor Puppet PowerShell Main responsibilities Provide line management to the Consultative Engineering Squad within Enterprise platforms; ensure a collaborative culture, with no blame, where every individual can be themselves and their skills are recognized and valued. Provide coaching and mentoring to … team members to upskill the team. Act as an escalation point for the senior engineer within the Core Engineering Squad. Maintain and develop low-level architectural standards for Enterprise Platforms. Collaborate with Product and Engineering managers to develop new ways of working that promote collaboration, autonomy, and high standards. Develop the Enterprise Platforms product set, including automation with More ❯
London, England, United Kingdom Hybrid / WFH Options
Velocity Tech
people create visual content—powering popular apps used by millions, and helping creators and brands grow through cutting-edge technology. We're looking for an experienced Infrastructure/ML Platform Engineer to join our Machine Learning Platform team . This team builds and supports the platform that powers advanced AI models, helping bring research into production at … scale. Hybrid role – 3 days onsite in Central London What you’ll do: Design, build, and maintain a scalable and reliable ML serving platform Develop cloud infrastructure and internal tools to support research and engineering teams Set up and manage CI/CD pipelines and monitoring systems Build self-serve tools to simplify deployment and development Share best … practices across teams and help level up the platform Take part in an on-call rotation (weekends included, with extra pay) What we’re looking for: 5+ years of experience running scalable SaaS systems in GCP or AWS, or Azure 3+ years with Kubernetes, Helm/Kustomize, and tools like Terraform or Pulumi Experience with microservices, containerized environments, and More ❯
Amazon Web Services (AWS). Your role is crucial in ensuring the reliability, security, and scalability of the company's cloud-based applications and services. You Will: Translate the platformengineering team's goals into detailed tasks and collaborate with your team to achieve them. Work with the PlatformEngineering team manager to optimize cloud costs … and infrastructure. Plan and deliver projects and team objectives with your manager. Coach and guide platform engineers, reviewing their infrastructure designs and Git pull requests. Design and implement AWS cloud solutions based on company requirements and best practices. Focus on enhancing developer experience by building internal developer portals using MUFGIS templates with CI/CD pipelines. Collaborate with developers … on the cloud. Implement and maintain network and security configurations to protect sensitive financial data. #LI-Hybrid Qualifications You Have: A Bachelor's or Associate's degree in computer engineering or computer science. AWS Solution Architect - Associate and Professional certifications. Expertise in AWS Security or Networking. Technical skills include: Experience with Infrastructure as Code tools like Terraform. Hands-on More ❯
that drive business growth and enhance customer experiences. On behalf of our client, we are currently seeking a Senior Programme Manager. Someone with deep understanding of modern infrastructure and platformengineering to lead the delivery of mission-critical, low-latency systems. This role requires a blend of technical fluency and structured project management skills, particularly in environments built … on containerised platforms and high availability systems. You will work cross-functionally with engineering, networking, platform, and operations teams to deliver infrastructure that supports ultra-low-latency applications with strict performance and availability SLAs. Key Responsibilities: End-to-end project delivery of infrastructure and platformengineering initiatives Build and maintain detailed project plans, identifying hardware, software More ❯
why we're a leader in next-generation cybersecurity. Protection Starts with People. Proofpoint. The Role We are looking for a highly experienced Staff Software Engineer to join our PlatformEngineering team. This role is focused on building and scaling the foundational infrastructure and services that power our internal engineering ecosystem. As a senior individual contributor, you … support petabyte-scale workloads in production • Lead development of RESTful APIs using Java or similar high-performance languages (e.g., Go, Kotlin, Scala) • Contribute to the evolution of our internal platform by improving core infrastructure components and abstractions • Design and optimize data pipelines and backend systems leveraging distributed technologies like Apache Kafka and Cassandra • Drive platform modernization through Kubernetes … based deployments and configuration management using tools like Puppe • Champion best practices in CI/CD, observability, security, and cloud-native architecture • Collaborate with product and engineering teams to understand their needs and deliver reusable, scalable solutions • Mentor other engineers, review design documents and code, and provide leadership in solving complex technical challenges • Own the operational health of platformMore ❯
a Site Reliability Engineer to join their highly skilled, innovative team. Essential skills: Strong proficiency in Python for infrastructure and automation Hands-on experience in SRE, DevOps or production engineering roles Deep understanding of monitoring, incident response workflows, and system architecture Productive approach to improving systems and reducing technical debt Strong collaboration and communication skills – working closely with developers … quants, and platform engineers Experience designing and delivering scalable, reliable production systems Proficiency with Linux/Unix systems Bachelor’s degree in CS, Engineering or a related field Familiarity with Kubernetes, Docker, or container orchestration technologies Experience with automation tools such as Terraform or Ansible Background in Go, Bash or other system-level languages Exposure to low-latency … principles (SLIs, SLOs, error budgets) to enhance system reliability; build, document, and improve high-performance system designs; lead incident response and implement improvements; collaborate closely with quant developers/platform teams on evolving infrastructure; and evaluate and implement new tools, balancing performance, maintainability, and operational complexity. This is a rare and exciting opportunity to join a collaborative, fast-paced More ❯
a Site Reliability Engineer to join their highly skilled, innovative team. Essential skills: Strong proficiency in Python for infrastructure and automation Hands-on experience in SRE, DevOps or production engineering roles Deep understanding of monitoring, incident response workflows, and system architecture Productive approach to improving systems and reducing technical debt Strong collaboration and communication skills – working closely with developers … quants, and platform engineers Experience designing and delivering scalable, reliable production systems Proficiency with Linux/Unix systems Bachelor’s degree in CS, Engineering or a related field Familiarity with Kubernetes, Docker, or container orchestration technologies Experience with automation tools such as Terraform or Ansible Background in Go, Bash or other system-level languages Exposure to low-latency … principles (SLIs, SLOs, error budgets) to enhance system reliability; build, document, and improve high-performance system designs; lead incident response and implement improvements; collaborate closely with quant developers/platform teams on evolving infrastructure; and evaluate and implement new tools, balancing performance, maintainability, and operational complexity. This is a rare and exciting opportunity to join a collaborative, fast-paced More ❯
be based on your skills and experience — talk with your recruiter to learn more. Base pay range Direct message the job poster from Caspian One DevOps, SRE, and Data Engineering | Supporting FinTech, Healthcare, and Broadcast across Cloud, Data & GenAI We’re seeking a Network Engineer to join a PlatformEngineering team. In this role, you’ll be … maintaining the stability and security of networks, while working closely with development teams to ensure their solutions meet high standards and best practices. As a key member of the PlatformEngineering team, you’ll play a crucial role in supporting critical systems that power the business every day. From large-scale office moves and data centre expansions to More ❯
Gloucester, England, United Kingdom Hybrid / WFH Options
Searchability NS&D
+ hybrid working + AWS training support Deliver modern, cloud-native solutions for enterprise-scale clients Influence technical direction across major AWS projects Join a world-class engineering community built on Agile & innovation ABOUT THE CLIENT Our client is a leading digital transformation consultancy with a world-renowned software engineering division. Their teams deliver enterprise-grade solutions for … using the latest in open-source tooling, DevOps practices and cloud-native design. They are now seeking a Cloud Native Engineer with deep AWS experience to join their expanding platformengineering group. THE BENEFITS Flexible hybrid working – blend home, client site and office Support to pursue AWS certifications (training provided) Inclusive culture that values authenticity and growth Access … to active internal engineering communities and blogs Strong L&D culture, technical mentorship and career support Full benefits package including private healthcare and pension THE CLOUD NATIVE ENGINEER ROLE As a senior engineer in the AWS team, you’ll work on high-impact projects across public and private sectors, delivering modern infrastructure solutions with a DevOps-first approach. You More ❯
Cheltenham, England, United Kingdom Hybrid / WFH Options
JR United Kingdom
+ hybrid working + AWS training support Deliver modern, cloud-native solutions for enterprise-scale clients Influence technical direction across major AWS projects Join a world-class engineering community built on Agile & innovation ABOUT THE CLIENT Our client is a leading digital transformation consultancy with a world-renowned software engineering division. Their teams deliver enterprise-grade solutions for … using the latest in open-source tooling, DevOps practices and cloud-native design. They are now seeking a Cloud Native Engineer with deep AWS experience to join their expanding platformengineering group. THE BENEFITS Flexible hybrid working – blend home, client site and office Support to pursue AWS certifications (training provided) Inclusive culture that values authenticity and growth Access … to active internal engineering communities and blogs Strong L&D culture, technical mentorship and career support Full benefits package including private healthcare and pension THE CLOUD NATIVE ENGINEER ROLE As a senior engineer in the AWS team, you’ll work on high-impact projects across public and private sectors, delivering modern infrastructure solutions with a DevOps-first approach. You More ❯
and borderless – naturally inclusive in how you work with others. Required Technical and Professional Experience Experience from an architect’s perspective of 6 years or more of the following: PlatformEngineering – Experience of designing and working in platformengineering squads to auto-provision business and development platforms using DevSecOps (e.g. DevSecOps tooling, Kubernetes, serverless, messaging, databases … Cloud Service Adoption – Experience of adopting, enhancing and optimising a customer’s cloud solution. New Ways of Working – Experience of at least two of the following: DevSecOps, Site Reliability Engineering (SRE), Agile Delivery, Product-Based delivery to automate, optimise and provision cloud solutions and platforms. Architectural experience of the following. Large customer change and transformation projects, with particular emphasis … employee learningprograms give you access to the best learning in the industry to receive certifications, includingMicrosoft, Google, Amazon, Skillsoft, and many more. Through our company-wide volunteering and giving platform, you can donate, start fundraisers, volunteer, and search over 2 million non-profit organizations. At Kyndryl, we invest heavily in you, we want you to succeed so that together More ❯
London, England, United Kingdom Hybrid / WFH Options
Pandora
aiming to elevate our innovation and customer satisfaction to unprecedented heights. As an MLOps Engineer, you will play a crucial role in designing, developing, and maintaining the infrastructure and platform essential for scaling our machine learning systems efficiently from experimentation to production. Your contribution will be pivotal in enabling and automating data-driven decision-making processes, utilizing cutting-edge … techniques and ensuring the seamless integration of machine learning models into our operational workflows. Responsibilities: Design, implement and maintain the ML platform and infrastructure required for ML systems to scale Facilitate an efficient ML route to live from experiment to production. Develop and deploy scalable tools and services to handle machine learning training and inference Identify and evaluate new … technologies to improve performance, maintainability, and reliability of ML systems Apply software engineering rigor and best practices to machine learning, including CI/CD, automation, etc. Support model development, with an emphasis on auditability, versioning, and data security Communicate with product team and data scientists and machine learning engineers to build requirements and track progress Qualifications: Experience building end More ❯
and borderless – naturally inclusive in how you work with others. Required Technical and Professional Experience Experience from an architect’s perspective of 4 years or more of the following: PlatformEngineering – Experience of designing and working in platformengineering squads to auto-provision business and development platforms using DevSecOps (e.g. DevSecOps tooling, Kubernetes, serverless, messaging, databases … Cloud Service Adoption – Experience of adopting, enhancing and optimising a customer’s cloud solution. New Ways of Working – Experience of at least two of the following: DevSecOps, Site Reliability Engineering (SRE), Agile Delivery, Product-Based delivery to automate, optimise and provision cloud solutions and platforms. Architectural experience of the following. Large customer change and transformation projects, with particular emphasis … employee learningprograms give you access to the best learning in the industry to receive certifications, includingMicrosoft, Google, Amazon, Skillsoft, and many more. Through our company-wide volunteering and giving platform, you can donate, start fundraisers, volunteer, and search over 2 million non-profit organizations. At Kyndryl, we invest heavily in you, we want you to succeed so that together More ❯
the team, line-managing two ML Ops Engineers and reporting to the Director of Data Science & AI. Day-to-day you will partner with pricing and central data scientists, platform engineers and external vendors to ensure our models ship quickly, safely and at scale. What you'll be doing Crafting and owning a multi-year ML Ops roadmap that … forefront of innovation. Setting and iterating the company-wide MLOps & LLMOps strategy, covering classical ML and emerging generative-AI workloads. Hiring, coaching and inspiring a high-performing ML Ops Engineering team with a culture of ownership and excellence. Mentoring and developing the ML Ops Engineers, expanding the team as we grow and setting technical standards while promoting best practice … multiple deployment pipelines on AWS SageMaker (real-time and batch inference) and integrating them with TeamCity for automated CI/CD. Managing the relationship with Tecton as our feature platform vendor, advising Data Science teams on its use for training and inference, and overseeing related infrastructure cost and operation Who you are You are a strategic engineering leader More ❯
London, England, United Kingdom Hybrid / WFH Options
Learning People
DevOps Engineers to build in quality, using industry best practices, to the products that make up our citizen services. You will also be responsible for leading and driving our Engineering community ensuring DWP and DevOps practices evolve in line with technology advances. If you are an experienced DevOps Engineer with a passion for automation excellence this could be the … see 'Selection process details'. Job description In the Citizen Information team, you will be leading our DevOps engineers to build shared tooling and capabilities to improve their developer platform to improve developer experience and software development life cycle to release value faster with improved quality. They work with Kubernetes and AWS cloud services to provide an environment where … be part of the team providing strategy and direction to the engineers alongside you. Working across the Children's area you will be pushing a mindset change to foster engineering ownership, and the importance of the integrity and maintenance of the services we deliver. The team underpins all of Children Digitals ambitions via the creation and maintenance of a More ❯
Social network you want to login/join with: Roke is a leading technology & engineering company with clients spanning National Security, Defence and Intelligence. National Security Business Be part of a growing and highly trusted supplier into the NS domain working to deliver mission critical solutions helping to keep the nation safe, secure and prosperous. Working on leading edge … technology solutions including AI/DS, Cyber, Cloud, DevOps/SRE, PlatformEngineering We have secured long term work, across the full spectrum, on the latest framework with the client, which provides the springboard for our ongoing growth and development in this domain, so join us on what will be an incredible growth journey. Role As a SRE More ❯
Social network you want to login/join with: Roke is a leading technology & engineering company with clients spanning National Security, Defence and Intelligence. National Security Business Be part of a growing and highly trusted supplier into the NS domain working to deliver mission critical solutions helping to keep the nation safe, secure and prosperous. Working on leading edge … technology solutions including AI/DS, Cyber, Cloud, DevOps/SRE, PlatformEngineering We have secured long term work, across the full spectrum, on the latest framework with the client, which provides the springboard for our ongoing growth and development in this domain, so join us on what will be an incredible growth journey. Role As a SRE More ❯
about building unified IT solutions that simplify the way IT organizations work. We are currently looking for a Senior Site Reliability Engineer to join our SRE team in the PlatformEngineering organization and help us scale our products to millions of end-users. We are looking for individuals with a passion for automation and observability, ensuring the quality … About Us NinjaOne automates the hardest parts of IT to deliver visibility, security, and control over all endpoints for more than 20,000 customers. The NinjaOne automated endpoint management platform is proven to increase productivity, reduce security risk, and lower costs for IT teams and managed service providers. NinjaOne is obsessed with customer success and provides free and unlimited … management, remote monitoring and management, and mobile device management. What You'll Love Grow personally and professionally with one of the fastest growing companies. Access to our Corporate Benefits Platform (with discounts for brands such as Expedia, FitX, Zalando and many more). Develop your skills through our renowned training platform. Receive competitive compensation. Collaborate with a curious, kind More ❯
about building unified IT solutions that simplify the way IT organizations work. We are currently looking for a Senior Site Reliability Engineer to join our SRE team in the PlatformEngineering organization and help us scale our products to millions of end-users. We are looking for individuals with a passion for automation and observability, ensuring the quality … About Us NinjaOne automates the hardest parts of IT to deliver visibility, security, and control over all endpoints for more than 20,000 customers. The NinjaOne automated endpoint management platform is proven to increase productivity, reduce security risk, and lower costs for IT teams and managed service providers. NinjaOne is obsessed with customer success and provides free and unlimited … management, remote monitoring and management, and mobile device management. What You'll Love Grow personally and professionally with one of the fastest growing companies. Access to our Corporate Benefits Platform (with discounts for brands such as Expedia, FitX, Zalando and many more). Develop your skills through our renowned training platform. Receive competitive compensation. Collaborate with a curious, kind More ❯
more productive? Do you thrive on streamlining workflows, squashing bugs, and leading innovation? If so, we want you on our team! We're on a mission to supercharge our engineering culture by building a world-class Developer community. Join us as a Developer Experience Engineer and help us revolutionize our development tools, workflows, and processes to enhance the productivity … investigate software development tools, and collaborate with vendors to resolve any issues. Automation: Create tools that automate repetitive tasks and streamline development workflows. Collaboration: Partner with product teams and platform engineers to deliver the best tools and practices for development workflows. Documentation and Training: Develop clear guidelines and conduct training sessions to help developers adopt new tools and processes. … development environments. Scalability: Ensure our tools and systems scale efficiently as our team grows and our codebase becomes more complex. WHAT YOU'LL BRING: Experience: Professional experience in software engineering or development, with a focus on improving workflows and productivity. Technical Skills: Proficiency in languages like JavaScript, Python, Java, and experience with multiple CI/CD tools and cloud More ❯
more productive? Do you thrive on streamlining workflows, squashing bugs, and leading innovation? If so, we want you on our team! We're on a mission to supercharge our engineering culture by building a world-class Developer community. Join us as a Developer Experience Engineer and help us revolutionize our development tools, workflows, and processes to enhance the productivity … investigate software development tools, and collaborate with vendors to resolve any issues. Automation: Create tools that automate repetitive tasks and streamline development workflows. Collaboration: Partner with product teams and platform engineers to deliver the best tools and practices for development workflows. Documentation and Training: Develop clear guidelines and conduct training sessions to help developers adopt new tools and processes. … development environments. Scalability: Ensure our tools and systems scale efficiently as our team grows and our codebase becomes more complex. WHAT YOU'LL BRING: Experience: Professional experience in software engineering or development, with a focus on improving workflows and productivity. Technical Skills: Proficiency in languages like JavaScript, Python, Java, and experience with multiple CI/CD tools and cloud More ❯
Derby, England, United Kingdom Hybrid / WFH Options
Jacobs
to our communities through our Collectively program which incorporates matched-funding, paid volunteering time and charitable donations. About the Opportunity Amentum is currently seeking an Apps Support/DevOps Platform Engineer as part of a team that delivers the infrastructure for critical applications and development tools running within a secure environment. We are looking for platform engineers to … work with the support teams and developers to support our customer. The Apps Support/DevOps Platform Engineer will focus on administrating, maintaining the infrastructure that is used to deliver the services. Key Responsibilities: Build and maintain virtualization infrastructure to support the applications running in the client environment Management of the Azure DevOps infrastructure Plan, coordinate and execute deployment … software applications. Troubleshoot and resolve infrastructure, application & performance issues reported by users. Escalate complex issues to third-line support or development teams. Implement automation and optimisation strategies to enhance platform performance. Working closely with other departments to resolve complex technical issues Documenting any processes which are required in the support of the infrastructure and applications Continual service improvement to More ❯