t just write code-we build mission-driven systems that push the boundaries of what's possible. Join a collaborative team where your expertise in software engineering, DevOps, and high-performance computing contributes to real-world impact. Your ideas are valued, your skills are sharpened, and your contributions drive innovation that matters. What You'll Do: As a Software … Design, implement, and maintain CI/CD pipelines for C++ or Python applications using Kubernetes (K8s), Helm charts, and pod definition. • Kubernetes Administration: Manage and configure Kubernetes clusters for highavailability, scalability, and security. • Debugging and Defect Correction: Troubleshoot and resolve software defects with effective root cause analysis and debugging techniques. • GPU Configuration and Support: Configure and optimize … development processes, and interfaces to support long-term project continuity What You Bring: • Active TS/SCI clearance with Full Scope Polygraph • Experience with GPU/CUDA development for high-performance applications • Familiarity with message queue implementations and communication protocols • Proficiency with Linux system programming and development environments • Strong analytical and problem-solving mindset • Excellent verbal and written communication More ❯
privacy in all LLM implementations, adhering to AI governance, risk management and compliance requirements Perform rigorous testing, including model validation, stress testing, and debugging Conduct code reviews to uphold high-quality standards and enforce best practices Monitor and maintain LLM systems, troubleshooting issues, and ensuring highavailability and reliability Proactively identify and address technical debt, optimizing models … the ability to solve complex problems efficiently Able to deliver work in line with internal policies and procedures and external requirements Demonstrated ability to manage multiple priorities and deliver high-quality work under tight deadlines Skilled in designing evaluation metrics and benchmarks for LLM performance, including accuracy, coherence, and safety. Proficiency with version control systems (e.g., Git) and collaborative More ❯
by a team of experienced technology professionals who recognised an opportunity to provide highly secure enterprise data platforms to large organisations. We build and operate ground-breaking, ultra-secure, high performance, cloud-based data infrastructure for the enterprise. Our proprietary technology solutions drive performance and reduce costs while helping our clients to improve the management and sharing of data … Times Tech Track. The Company was also recognised at the Thames Valley Tech Awards 2020; winning the Thames Valley Tech Company of the year, the Emerging Tech Company and High Growth Tech Business categories. We encourage people of all different backgrounds and identities to apply. We are committed to maintaining an inclusive, and supportive place for you to do … development of backend services for a government client leveraging the latest commercial and open-source technologies. This involves designing, developing, documenting and deploying backend services with a focus on highavailability, low latency and scalability. Design, develop, document and deploy new or enhanced features to our backend services that meet functional/non-functional business requirements Actively participate More ❯
Reading, Berkshire, South East, United Kingdom Hybrid / WFH Options
Pertemps Thames Water
to improve performance, reduce downtime, and drive innovation in a business delivering lifes essential service every day. What youll do as a Regional SCADA Engineer: SCADA System Support: Maintain highavailability of the Regional SCADA platform (targeting 99.2% uptime or higher). Perform proactive system maintenance, patching, upgrades, and configuration management. Diagnose and resolve faults across SCADA applications … RTUs, and client workstations. Emergency & Out-of-Hours Coverage: Participate in the SCADA emergency support roster, providing in-hours and out-of-hours fault response. Investigate high-impact incidents and escalate recurring faults for permanent resolution. Incident & Request Management: Deliver responsive support for SCADA-related service requests and incidents. Manage access and data requests in line with OT policies … will be given for what is undoubtedly an incredibly rewarding experience. Its also a great opportunity to learn more about our business and meet colleagues. Disclaimer: due to the high volume of applications we receive, we may close the advert earlier than the advertised date, so we encourage you to apply as soon as possible to avoid disappointment. More ❯
Milton Keynes, Buckinghamshire, England, United Kingdom
Sonepar
with infrastructure upgrade projects and contribute to new solution designs. Manage and monitor servers, storage systems (SAN/NAS), and network devices (switches, firewalls, WAPs). Ensure system performance, availability, and data integrity through proactive monitoring and backups. Support and test Disaster Recovery plans and contribute to business continuity strategies. Maintain accurate technical documentation and contribute to knowledge sharing. … protocols, TCP/IP, DNS, DHCP, and VLANs, along with Proficient in managing and maintaining virtualized environments using VMware vSphere and Microsoft Hyper-V, including VM provisioning, resource optimization, highavailability configurations, and troubleshooting performance issues. hands-on experience configuring and managing routers, switches, and firewalls. Experience with SAN (Storage Area Network) and NAS (Network Attached Storage) technologies … and managing LogicMonitor for infrastructure monitoring, including custom dashboard creation, alert tuning. Strong time management, multitasking, and attention to detail. Ability to perform under pressure, especially during incidents or high-demand periods. First interviews will be held at the beginning of August WHAT SONEPAR UK CAN OFFER YOU Competitive Salary+ 5% performance bonus. Generous Time Off: Start with More ❯
Newcastle Upon Tyne, Tyne And Wear, United Kingdom
Ronald James Ltd
great opportunity for a .Net Developer to join a leading technology company focused on innovation and growth in the igaming industry. They're looking for someone passionate about building high-quality code and eager to explore cutting-edge technology. You'll be part of a collaborative, people-first culture that values creativity and technical excellence. What they offer: Work … handles 200k transactions per minute and supports 1M+ active users per month. Ensure compliance with industry-leading security standards (ISO 27001, PCI-DSS, SOC 2). Work in a high-availability ecosystem with Azure SQL and MongoDB for data management. What you'll be doing: Designing and implementing innovative solutions to meet project goals. Proactively supporting team-owned … available systems. A team player who thrives in a dynamic, growth-focused environment. Strong communication and time management skills, with the ability to prioritize tasks. Eagerness to contribute to high-quality software solutions and continuous learning. Note: The additional content about puppy visits, candidate contact methods, and in-person meet-ups appears irrelevant to the job role and should More ❯
Product, Machine Learning, Engineering, Research, Data Science teams , and executive stakeholders to iterate on existing Gen AI products and launch new ones Facilitate technical planning, code reviews, and ensure high-quality code and operational standards across projects Evaluate the technical tradeoffs of major decisions and be a strong technical mentor Manage and mentor a team of developers, in a … fast-paced, quick-to-market environment Hire, grow and retain high-performing team members Knowledge, Skills & Abilities: Experience setting the direction for customer-facing product teams leveraging Machine Learning/Generative AI Track record of delivery highavailability ML-based backend products at scale in rapidly changing, highly collaborative, multi-stakeholder environments Strong management and mentorship skills More ❯
safer. Our commitment is to extract, analyze, and visualize critical intelligence that provides law enforcement, intelligence agencies, and emergency services with real-time insights to help save lives. Our high-performance, flexible, and future-proof solutions also enable mobile network operators to achieve regulatory compliance with minimal disruption, time, and cost. SS8 is trusted by the largest government agencies … SS8 Products in secure environments. Ensure all client PCs and servers are appropriately hardened and secured according to CESG guidance and industry best practices. Manage Domain Controller roles, ensure highavailability, and monitor replication. Implement periodic patching schedules for all client PCs and Windows Servers. Deploy and configure antivirus solutions for servers and workstations. Deploy, configure, and maintain … Experience with Atlassian Confluence Knowledge of regulatory compliance Experience with Kubernetes and containerized deployments Experience and Education: BA/BS in a technology-related field (preferred) 5+ years in high-tech, preferably telecom Willing to travel and hold a driver's license Willing to acquire or hold Security Clearance More ❯
standards (e.g., GDPR, ISO 27001, Cyber Essentials+) and proactively manage cyber risks. Lead digital programmes including automation, data analytics, self-service, and agile delivery models. Build and develop a high-performing, collaborative IT and Digital team with a focus on continuous improvement. Manage IT budgets and vendor relationships, ensuring cost-effectiveness and value for money. Act as a key … Analytical and problem-solving mindset Resilience and ability to drive change Technology and Infrastructure Management Oversee the management and modernisation of IT infrastructure, enterprise applications, and cloud services. Ensure highavailability, scalability, and security of IT systems and digital platforms. Establish and manage vendor relationships, contracts, and service levels. Cybersecurity and Risk Management Lead the development and implementation … finance, and IT professionals, to understand solution needs and implement them. Lead user adoption initiatives and change management across digital tools. Team Leadership and Development Lead and develop a high-performing IT and digital team, fostering a culture of continuous learning, innovation, and collaboration. Define clear KPIs, performance goals, and professional development plans. Oversee recruitment, retention, and succession planning More ❯
Milton Keynes, Buckinghamshire, England, United Kingdom Hybrid / WFH Options
TC Group
responsibilities of a Systems Administrator will include: Information gathering and assessment to provide timely and accurate visibility of all systems. Ensuring suitable monitoring is in place for the ensuring high quality experience to the Partner firms with the goal for overall excellent Partner experience in services provided. Planning changes to align with prioritisation for the delivery of agreed roadmap … monitoring of new systems to ensure all are aligned with the relevant policies, standards and Technology operating model. Identify risks and propose mitigation strategies to ensure confidentiality, integrity and highavailability of all systems. About you We are looking for candidates with the following core competencies and skills: Technical Significant experience in a similar role Bachelor Degree (Computer … and aspirations About us TC Group is growing and changing at quite a pace! We’re an ambitious group of people with the same aim – to create a consistent, high-quality experience for our clients. Like us, our client list is growing – we’re supporting more and more of the UK’s ambitious business owners, SME’s and large More ❯
as part of Data Center workload and incident response processes o Responsible for developing, documenting, submitting, and executing changes to the storage and backup infrastructure o Provides secure, reliable, high-availability storage and backup services for hosted customers and datacenter infrastructure o Manages capacity reporting and monitoring to ensure successful datacenter hosting service delivery and support to evolving … 24x7x365 operations including on-call duties and occasional night and weekend patching or implementation requirements. Qualifications Basic Qualifications: • Bachelors degree 10-12, Masters degree 8-10, Phd 5-7, High school years • IAT II Certification required • Cohesity hybrid-cloud backup solution experience required • Senior-level experience with the following technologies supported in the hybrid-cloud datacenter: o Cohesity hybrid … for Storage and Backup solutions including requirements for STIG, FIPS implementation, POA&M development, and support for vulnerability management and accreditation requirements • Experience automating tasks in support of a high-availbility storage and backup environment • Experience implementing proactive monitoring and event/incident management to ensure prompt resolution of anomalies or outages (vROPS monitoring suite preferred) • Experience collaborating with More ❯
who brings a strong drive for continuous self-improvement. You know your way around working effectively within a team, but have the ability to work independently on large scale (highavailability) SaaS products with zero-downtime. You bring strong communicative skills, with the ability to explain your technical choices to the team and stakeholders. You enjoy working in … or PostgreSQL and a deep understanding of transactions and isolation levels. Extensive experience with PHP Laravel and related back-end technologies. Knowledge of AWS, Kubernetes, and Docker for scalable, high-performance back-end development. Familiarity with handling high-load systems, including performance tuning, zero-downtime upgrades, and scaling. Understanding of event-driven architectures and experience with technologies like More ❯
Senior Research Scientist, AWS Incident Tooling & Response Job ID: Amazon Development Centre Ireland Limited AWS Resilience owns service to prevent and response to availability and security issues for all AWS Services. In other words, we're the people who keep the cloud running. We work on the most challenging problems, with constant new services and possible failure modes to … and availability. You'll experience an inclusive culture that welcomes bold ideas and empowers you to own them to completion. AWS Incident Response is at the heart of the highavailability of Amazon Web Services. We make customer impacting events shorter and less frequent by driving large scale event and incident response. Our automated tooling quickly identifies the More ❯
Who You'll Work With Work Environment & Additional Information: Hybrid or on-site work model. Occasional travel for global team engagement or vendor meetings. Ability to operate in a high-availability, global IT operations environment. The Director - Software Asset and Configuration Management (SACM) is a critical leadership role, launching a new, fully integrated capability that will transform how … specific base salaries within the range depend on various factors such as experience and skill set. It is not common for new BCG employees to be hired at the high-end of the salary range. BCG regularly reviews its ranges to ensure market competitiveness. In addition to your base salary, your total compensation will include a bonus of up More ❯
Who You'll Work With Work Environment & Additional Information: Hybrid or on-site work model. Occasional travel for global team engagement or vendor meetings. Ability to operate in a high-availability, global IT operations environment. The Director - Software Asset and Configuration Management (SACM) is a critical leadership role, launching a new, fully integrated capability that will transform how … specific base salaries within the range depend on various factors such as experience and skill set. It is not common for new BCG employees to be hired at the high-end of the salary range. BCG regularly reviews its ranges to ensure market competitiveness. In addition to your base salary, your total compensation will include a bonus of up More ❯
environment. Deliver customer solutions in accordance with InterSystems service standards. Work with a broad range of proven and cutting-edge technologies in areas including object, web, relational, networking, storage, highavailability, performance, integration, cloud, and database. Continuous self-learning and gaining an in-depth knowledge of these technologies. Develop leadership skills working with developers, managers, and other stakeholders. … Passion for solving complex interesting problems Experience with developing debugging tools for Unix/Linux or Windows environments Ability to lead technical discussions with senior developers and managers A high level of understanding of interface development is an advantage. More ❯
you’ll lead multiple engineering teams, develop critical digital services, and support 24/7 live systems. This is a leadership role within a SAS team, focused on building high-quality applications and infrastructure using modern engineering practices. Responsibilities: Lead and mentor teams of engineers across complex digital service projects Engage with senior business and technical stakeholders Provide technical … Knowledge of AWS DevOps principles & Docker, Terraform, Kubernetes, Helm, Git C#, Java (Springboot, JPA/Hibernate), REST APIs, observability & monitoring, queue technologies & security History working on building new, evolving, highavailability microservices with data integrity Strong understanding of test methodologies: JUnit, TDD, Integration Tests & E2E Experience working with relational databases: PostgreSQL More ❯
Managers with a strong background in program management and working with Software Engineers. As part of the AWS Managed Operations team, you will play a pivotal role in delivering high-availability AWS services, including EC2, S3, Dynamo, Lambda, and Bedrock, exclusively for EU customers. For more information on ESC please check out our blog: European Sovereign Cloud (ESC … relocation support for successful applicants relocating within the European Union. Key job responsibilities In this role you will work closely with the software development and product teams to deliver high quality, scalable products deployed around the world. You will collaborate with the product teams to deliver the best product with most efficient use of resources and technologies. You will More ❯
This person will define technical direction, unblock and mentorteammates, and tackle the most complex system and architectural challenges. Principal engineers aredistinguished by their organization-level impact and focus on high-leverage activity, in addition to directcode contribution. Beyond pure technical focus, Principals bridge the gap between management and engineering - they obsess over not only the how, but also what … cloud computing. We primarily use AWS native technologies, Docker. Significant experience with configuration management and infrastructure as code. We often useTerraform and Ansible. Experience building and owning low-latency, high-availability critical systems or cloud-basedservices. Experience defining, managing, and executing a portfolio of complex engineering projects. Preferred Qualifications: Expertise in event-driven architecture. Expertise in instrumentation, observability More ❯
Job ID: Amazon Development Centre Ireland Limited Amazon Central Technical Operations Services (CTOS) maintains highavailability for the Amazon Retail Website and is the team that provides the first line of incident response to protect it. We make customer impacting events shorter, less frequent, severe, and impactful by providing large scale incident and response management. The Amazon Retail … automate the detection and resolution of issues within Amazon's Retail Website infrastructure. You will also spend a portion of your time of your time directing the resolution of high visibility incidents by leading conference calls, taking notes to collect data and help improve our processes. Using data and insights learned from those incidents you will drive further improvements More ❯
Join Barclays as an Engineering Lead and take ownership of guiding and managing engineering teams to deliver high-quality software solutions. In this role, you'll provide technical leadership, mentorship, and support while fostering a culture of innovation and continuous improvement. You'll collaborate closely with cross-functional teams to ensure technical decisions align with business goals and drive … A results-driven approach to team goals, with the ability to organise work effectively and ensure clear milestones are in place. Other highly valued skills include: Proven experience building high-availability systems. Previous experience deploying and monitoring production loads in AWS. Proficiency in Java, particularly with Spring Boot. Experience with React. Familiarity with source control tools such as … Luke Street, London or Radbroke Hall, Knutsford . Purpose of the role To lead and manage engineering teams, providing technical guidance, mentorship, and support to ensure the delivery of high-quality software solutions, driving technical excellence, fostering a culture of innovation, and collaborating with cross-functional teams to align technical decisions with business objectives. Accountabilities Lead engineering teams effectively More ❯
IR35 - Outside Duration - 6 months Required Technical Skills - Deep expertise in Amazon Connect, AWS architecture, and enterprise telephony. - Strong understanding of voice network infrastructure and SIP routing. - Experience with high-availability and disaster recovery planning. - Ability to lead technical teams and coordinate with external vendors. - Proficiency in infrastructure-as-code tools and CI/CD pipelines. Qualifications - Bachelor More ❯
City of London, London, United Kingdom Hybrid / WFH Options
DGH Recruitment
proficiency in designing and implementing CI/CD pipelines, and a deep understanding of monitoring and observability practices. Core responsibilities: - Architect, deploy, and manage Azure-based infrastructure to ensure highavailability, scalability, and security. - Develop and maintain Infrastructure as Code (IaC) using Terraform for automated and consistent environment provisioning. - Design and implement CI/CD pipelines to streamline More ❯
Data and Security practices. The Lead Cloud Engineer will fulfil a technical leadership role (very much player-coach to begin with) overseeing cloud infrastructure-primarily Microsoft Azure-to ensure highavailability, security, performance, and cost-efficiency. You'll get the opportunity to build out and lead a team of cloud operations engineers and coordinate across departments like DevOps More ❯
Manchester, Lancashire, England, United Kingdom Hybrid / WFH Options
Lorien
with containerisation & orchestration (Kubernetes preferred) Good understanding of software architecture and clean code principles Comfortable working independently and as part of a collaborative team Experience with performance tuning and high-availability systems Familiarity with Agile/Scrum methodologies Benefits £60,000-£70,000 DOE Remote working with flexible hours 25+ days holiday + bank holidays Private healthcare Training More ❯