Cambridge, Cambridgeshire, United Kingdom Hybrid / WFH Options
Arm Limited
collaborate across teams to: Modernise our infrastructure by leading the migration from Docker Swarm to Kubernetes Design and operate CI/CD pipelines using CloudBees and GitLab Build out observability with Prometheus, Grafana, OpenTelemetry, and Dynatrace Automate cloud deployments (AWS-first) using Terraform and platform tooling Improve security posture across IAM, secrets, and networking Help the team ship faster and … TypeScript, Python). Validated experience operating distributed systems at scale in production. Cloud AWS (primary), Kubernetes (future), Docker (current), Terraform. Excellent debugging skills across network, systems, and data stack. Observability tooling, e.g. custom metrics pipelines, OpenTelemetry tracing, or integrations across telemetry stacks. Security engineering and practical understanding of IAM hardening, zero-trust network principles, and secrets management in data-heavy More ❯
Cambourne, Cambridgeshire, United Kingdom Hybrid / WFH Options
Remotestar
strong track record of building and maintaining highly reliable infrastructure and services. Expertise in incident management, including incident response, resolution, and post-mortem analysis. Proficiency in monitoring, alerting, and observability tools such as Prometheus, Grafana, ELK stack or Datadog. Experience with cloud platforms such as AWS, Azure, or GCP, including infrastructure as code tools like Terraform or CloudFormation. Strong scripting More ❯
Cambridge, Cambridgeshire, United Kingdom Hybrid / WFH Options
Arm Limited
Java or Python. Deep understanding of AWS or other cloud providers (e.g. GCP, Azure). Strong understanding of key security technologies and protocols such as TLS, OAuth and SPIFFE. Observability, alerting, metrics collection and visualisation (e.g. Prometheus, Grafana, Elasticsearch, Dynatrace). "Nice To Have" Skills and Experience: We would be even more impressed if you are passionate about the following More ❯
Cambourne, Cambridgeshire, United Kingdom Hybrid / WFH Options
Remotestar
end tests. Ability to write and understand design documentation using C4, sequence diagrams and workflows. Excellent problem-solving skills and attention to detail. Solid understanding of logging, monitoring and observability to understand if software is functioning as required. Strong communication and teamwork skills. Preferred Skills: Experience with cloud platforms e.g., AWS, Azure, Google Cloud. Knowledge of DevOps practices and CI More ❯
management for Windows workloads Create tooling and automation around the deployment of a customer-specific Windows-based SaaS product Ensure high availability, reliability, and scalability of Windows services. Integrate observability tooling (metrics, logs, traces) into IIS-hosted services Harden Windows infrastructure for security, compliance, and operational best practices Lead incident response for Windows-related systems Contribute to internal documentation and … Windows internals Proven ability to build infrastructure-as-code and CI/CD for Windows environments Comfort wrapping a Windows software product with the surrounding infrastructure, services, automation, and observability required to run it as a SaaS offering. Hands-on experience administering cloud infrastructure or building cloud-native applications (preferably on AWS) Comfortable using AWS EC2 Proficiency with command-line More ❯
Cambridge, Cambridgeshire, United Kingdom Hybrid / WFH Options
Arm Limited
and attitude on automating common repetitive tasks A suitable sense of ownership and responsibility in driving tasks to timely full completion "Nice To Have" Skills and Experience: AIOps and Observability Meaningful experience in a distributed team Working in a sophisticated, multi-geography, engineering services environment! Providing technical support and mentoring to othe Accommodations at Arm At Arm, we want to More ❯
Cambridge, Cambridgeshire, United Kingdom Hybrid / WFH Options
Gearset Limited
experience automating the management of infrastructure using Terraform. Strong knowledge of CI/CD pipeline configuration and operation. Familiarity with .NET/C# applications and development environments. Experience implementing observability solutions and using them to gain insights into application and infrastructure performance. Benefits and perks (the stuff you'd expect!) This is a full time opportunity, working Monday to Friday More ❯
portals, dashboards, internal tools, and web applications. Collaborate closely with DevOps on CI/CD pipelines, deployment workflows, infrastructure, and SecOps compliance. Uphold high standards for code quality, system observability, and technical documentation. Act as the technical lead, setting direction and best practices for the full-stack engineering team. Mentor engineers, providing guidance on architecture, design patterns, and career growth. … cross-functional teams Deep experience with React, TypeScript, .NET Core, SOAP/REST APIs, and MySQL/PostgreSQL, Red Hat OpenShift, Kubernetes Understanding of DevOps, cloud deployments, and service observability Bonus: Interest/experience in AI, digital twins, Nvidia Omniverse SDK & APIs, Universal Scene Description What We Offer : Reimbursement for tuition and professional dues Three weeks of vacation and five More ❯
to build cost-effective solutions on Microsoft Azure while maintaining agility and fostering innovation. This position is perfect for engineers who are passionate about optimising cloud usage, enhancing cost observability, and championing a Fin Ops culture. Experience in some of the following would be ideal Partner with engineering, finance and product teams to drive cost-efficiency across Azure Clear understanding More ❯
to build cost-effective solutions on Microsoft Azure while maintaining agility and fostering innovation. This position is perfect for engineers who are passionate about optimising cloud usage, enhancing cost observability, and championing a Fin Ops culture. Experience in some of the following would be ideal Partner with engineering, finance and product teams to drive cost-efficiency across Azure Clear understanding More ❯
new AI/ML methods Deployment and serving of models at scale Infrastructure automation and cloud-native design Responsible AI, LLM safety, and interpretability tooling Data pipelines, versioning, and observability in production A glimpse of roles we recruit for: AI Research Scientist Machine Learning Engineer Data Engineer with ML experience Applied Scientist/Research Engineer DevOps for AI/AI More ❯