Cambridge, England, United Kingdom Hybrid / WFH Options
ZipRecruiter
including High-Performance Computing (HPC), Engineering Platform Access, Engineering Collaboration and Linux Platforms. You will lead a global team to ensure seamless product development by maintaining and improving the infrastructure that supports engineering teams. Key Responsibilities: High-Performance Computing (HPC): Manage and lead a large-scale HPC environment (handling half a million cores), using LSF (or similar schedulers) to … ensure high availability, scalability, and operational efficiency. DevOps & Automation: Drive the implementation of DevOps best practices (CI/CD, Terraform, Ansible, GitLab) to automate infrastructure and improve the efficiency of development workflows. Engineering Collaboration Tools: Manage and optimize the Atlassian suite (Jira, Confluence) for enhanced engineering collaboration and compliance. Linux Platform Leadership: Oversee the Linux Platform team responsible … for managing Linux-based infrastructure, especially for HPC servers. Virtualization & Kubernetes: Lead virtualization efforts involving VMware and Kubernetes clusters, ensuring efficient orchestration and resource utilization. Platform Access & Security: Lead teams handling login servers and user access solutions, ensuring seamless authentication experiences for engineers using OpenText ETX. Leadership & Strategy: Strategic Roadmap: Define and implement a clear roadmap for the Engineering …
Cambridge, Cambridgeshire, United Kingdom Hybrid / WFH Options
Arm Limited
group provides the high-performance compute environment that fuels product and solutions development for Arm's engineering community. Whether it's high-performance compute (HPC) on Arm's on-prem infrastructure and/or in the cloud, Electronic Design Automation (EDA) tools, or customised software tools and workflows, the products and services we deliver are used by every engineer … in Arm. Responsibilities: Collaborate closely with technical leads, managers, and project teams to support and evolve HPC infrastructure tailored for semiconductor design workflows. Leverage strong Linux system administration skills and scripting (Bash, Python, Perl, etc.) to automate routine tasks and optimise engineering environments. Administer and tune job schedulers like IBM Spectrum LSF to ensure efficient workload management across hybrid … and on-prem infrastructure. Troubleshoot and resolve day-to-day technical issues through ticketing systems, proactively finding opportunities to automate and improve recurring tasks. Design and implement scalable, secure infrastructure solutions across public cloud platforms (AWS, Azure, GCP) and data centers, using tools like Terraform, Pulumi, and Ansible. Act as a key technical contributor on high-profile infrastructure …
any other duties as required. Deploying, maintaining, monitoring and upgrading production deployments of ARIC Risk Hub SaaS and third-party integrated services Building software and systems to manage platform infrastructure and applications Continually evaluating and improving our technology and processes to increase quality, decrease costs and improve time-to-market Periodically testing the service with predictable and unpredictable failures … improved ARIC Risk Hub capabilities This is a hybrid position. There is an expectation of 3 days in the office per week. Required experience: Hands-on experience administering cloud infrastructure or building cloud-native applications (preferably on AWS) Bachelor's, Master's, or higher qualification in Computer Science or a related field Comfortable using AWS EC2 Proficiency with command … line tools and shell scripting Experience with infrastructure as code and configuration management Proficiency in one or more programming languages (e.g., Python) Solid understanding of networking fundamentals (DNS, routing, firewalls) Experience with version control tools such as Git Familiarity with CI/CD pipelines and tools Proficient in setting up and managing monitoring, metrics, and alerting systems Experience operating …
role in designing, maintaining, and optimising serverless systems, as well as supporting internal teams and external clients with high-level technical queries. Key Responsibilities Design, deploy, and maintain scalable infrastructure using AWS and serverless technologies (e.g. Lambda, API Gateway, DynamoDB, S3). Act as a senior escalation point for infrastructure-related support issues raised by clients or internal … teams. Collaborate with developers and product teams to ensure reliable and secure cloud deployments. Monitor cloud performance and proactively troubleshoot issues to ensure uptime and efficiency. Automate infrastructure and deployment processes using tools like Terraform, CloudFormation, or CDK. Contribute to system design decisions and drive cloud best practices across the business. Document infrastructure, workflows, and support processes to … AWS in a production environment. Hands-on experience with serverless technologies (Lambda, API Gateway, Step Functions, etc.). Strong understanding of cloud security, monitoring, and performance tuning. Experience with Infrastructure-as-Code (e.g. Terraform, CloudFormation, CDK). Familiarity with CI/CD tools and cloud-based deployment pipelines. Confident communicator, comfortable handling escalations and client queries. Desirable AWS Certifications …
Cambridge, Cambridgeshire, United Kingdom Hybrid / WFH Options
Ecm Selection
An immediate, permanent requirement for a Cambridge-based developer of automated cloud infrastructure, possessing full existing UK work permission without time limit or sponsorship. This is a market-leading company operating a portfolio of secure and reliable services. Their friendly team includes cloud engineers, developers, product owners and testers. Infrastructure is automated for fast and effective deployment using …
FinTech on a 6-month contract with possible extension. The role requires hands-on technical expertise and strong collaboration with developers, architects, and DevOps teams to ensure our data infrastructure aligns with business needs and industry standards. Your Responsibilities Manage, configure and optimise PostgreSQL databases hosted on Amazon Aurora and Oracle databases hosted on on-premise Exadatas and AWS … experience with AWS services such as EC2, S3, VPC, and IAM. Solid understanding of database security principles and compliance frameworks. Experience implementing robust backup and recovery strategies. Proficiency with infrastructure automation tools like Terraform or CloudFormation. Familiarity with monitoring tools such as CloudWatch, Datadog, or similar. Experience with database replication tooling such as AWS DMS and GoldenGate. Experience … of collaborating with DevOps teams to integrate databases into CI/CD pipelines and infrastructure-as-code (IaC) frameworks like Terraform. Please APPLY NOW for immediate consideration!!!
Experts in AI, Machine Learning & Cloud Infrastructure This specialism covers the full spectrum of Artificial Intelligence and Machine Learning expertise - from cutting-edge research to scalable infrastructure deployment. It includes academic-leaning AI researchers, production-grade ML engineers, MLOps experts who bridge the gap between models and systems, and cloud infrastructure professionals who build the backbone these … models into real-world impact - at scale, and with speed. Key Areas of Usage: Research & development of new AI/ML methods Deployment and serving of models at scale Infrastructure automation and cloud-native design Responsible AI, LLM safety, and interpretability tooling Data pipelines, versioning, and observability in production A glimpse of roles we recruit for: AI Research … Scientist Machine Learning Engineer Data Engineer with ML experience Applied Scientist/Research Engineer DevOps for AI/AI Infrastructure Lead Industries and Sectors: AI-first start-ups & scale-ups Healthcare & Biotech (AI in diagnostics, drug discovery) FinTech & Quant Research Autonomous Systems (Robotics, Automotive, UAV) SaaS Platforms integrating ML features Government/Defence innovation labs Who we work with …
to low-level hardware. This is enabled, in addition to the work on Qualcomm's products, by the fact that the team also actively develops its own in-house automation infrastructure in collaboration with other teams. The group consists of multi-disciplined (RF, Analogue, Software and Digital) engineers who work closely with specialist design and systems engineers. This … Troubleshoots advanced issues with radio designs and finds solutions that are documented and shared with internal cross-functional teams working on similar products. The Engineer will be responsible for automation for development and validation of ASIC products: generating Python test scripts, writing embedded C code for FW radio control, supporting the business group with radio training materials and investigating …