you have We're a multi-cloud team - experience with AWS and CDK is a big plus. Experience implementing/maintaining cloud compliance standards (SOC2, ISO-27001) Familiarity with observability tools like Datadog Background in implementing security best practices in cloud infrastructure Why Prolific is a great place to work We've built a unique platform that connects researchers and More ❯
you have We're a multi-cloud team - experience with AWS and CDK is a big plus. Experience implementing/maintaining cloud compliance standards (SOC2, ISO-27001) Familiarity with observability tools like Datadog Background in implementing security best practices in cloud infrastructure Why Prolific is a great place to work We've built a unique platform that connects researchers and More ❯
a collaborative and supportive team environment through experienced, empathetic leadership Commit to continuous learning and stay current with emerging technologies and best practices Implement and maintain application monitoring and observability, proactively identifying and resolving system issues Person Specification Experience Essential Relevant degree or qualification is desirable but not essential Previous experience using Cloud Platforms, Version Control Systems and Front and More ❯
build-and-load programs up to browser extensions and web applications. Develop software to analyse and interpret cryptocurrency usage behaviours and trends on the clear and dark web Implement observability mechanisms (we use DataDog) to detect problems in your environment(s), and run the associated business processes to resolve Work with the existing engineers on your team to foster their More ❯
teams to execute effectively. DataOps Enablement and Optimization: Drive the adoption of modern DataOps principles to streamline engineering workflows. Partner with platform teams to establish CI/CD pipelines, observability standards that improve operational efficiency, reliability, and speed across data pipelines. Data Governance and Quality Assurance: Embed governance, security, and data quality practices into engineering workflows. Define guardrails and reference More ❯
teams to execute effectively. DataOps Enablement and Optimization: Drive the adoption of modern DataOps principles to streamline engineering workflows. Partner with platform teams to establish CI/CD pipelines, observability standards that improve operational efficiency, reliability, and speed across data pipelines. Data Governance and Quality Assurance: Embed governance, security, and data quality practices into engineering workflows. Define guardrails and reference More ❯
enable fast analytics and experimentation. Partner with analysts and product managers to define data tracking specifications and ensure implementation alignment. Contribute to infrastructure automation, CI/CD pipelines, and observability for data systems. Stay up to date on industry best practices for scalable and secure data systems in mobile gaming. Requirements: Strong experience with Python and SQL for data engineering More ❯
others in the team. You have a bias to simplicity, where you care most about achieving impact Bonus Experience with evaluation harnesses and frameworks for Generative AI Experience with observability, monitoring, and safety techniques for deployed GenAI systems Experience in strongly typed languages such as Go The Company Our mission is to be the definitive food company. We are transforming More ❯
integration and continuous delivery tools with different tech stacks, web or mobile. You've previously worked with monitoring systems for availability, performance or security, stress and performance testing with observability patterns: Distributed Tracing/OpenTracing, Log Aggregation, Audit Logging, Exception Tracking, Health Check API, Application MetricS, Self-Healing/Multi-Cloud. You have an understanding of security concerns, threats and More ❯
generation of automotive software development. The right candidate will have excellent communication skills, solid coding skills, broad knowledge of software development across areas such as Cloud, Compute Frameworks, MLOps, Observability and Build Infra. RESPONSIBILITIES: Work on high-impact projects and innovate new solutions to problems in the self-driving space Work with Computer Vision and Machine Learning engineers on high More ❯
Cambridge, Cambridgeshire, United Kingdom Hybrid / WFH Options
Gearset Limited
over process and deliberation Great to haves Experience with .NET/C# Experience working in an agile development team with a focus on delivering value early Experience with building observability and alerting into systems Salary and benefits (the stuff you'd expect!) Salary is £78K - £100K (depending on experience) This is a full time opportunity, working Monday to Friday with More ❯
Write production-quality software with strong engineering rigor-designing clean APIs, building reliable systems, and collaborating closely with product engineers. Build high-reliability ML infrastructure: training pipelines, model registries, observability, and CI/CD for ML. Ensure ML solutions meet enterprise standards for security, compliance, data privacy (e.g., SOC2, GDPR), explainability, and auditability. Develop evaluation and monitoring frameworks that measure More ❯
our engineers Lead and contribute to cross-team initiatives from design through deployment and operations Write maintainable, well-tested, high-quality code and uphold engineering best practices Focus on observability and maintain Service Level Objectives, take operational responsibility for the Identity Platform, including joining the on-call rota Foster a strong engineering culture through mentorship, code reviews, and collaboration Lead More ❯
process and deliberation Great to haves Experience with .NET/C# You've worked in an agile development team with a focus on delivering value early Experience with building observability and alerting into systems Salary and benefits (the stuff you'd expect!) Salary is £78K - £100K (depending on experience) This is a full time opportunity, working Monday to Friday with More ❯
on: • Environment Platform: a Kubernetes-based PaaS spanning hundreds of production clusters • Apollo: secure, fleet-wide deployment and change-management for complex microservice suites • Signals: our full suite of observability and alerting tools Core Responsibilities As a Software Engineer at Palantir, you'll own every phase of the product lifecycle-from generating ideas and designing prototypes to executing features and More ❯
also play a part in mentoring other developers including more junior colleagues to impart knowledge and build their skills. Write high quality documentation and implement user & system metrics and observability as you go to continually learn, assess and improve 9fin's platforms and products. Our Tech Stack React via Typescript & Vite React query Jest, React Testing Library, Playwright Production workloads More ❯
value. Writing reliable, well-tested code, using tools such as pytest, Jest with React Testing Library, Storybook, and Cypress for end-to-end coverage. Monitoring performance and stability through observability tools, keeping our platform running smoothly. Champion knowledge-sharing by crafting documentation that's as delightful to read as it is useful. We are fully AWS hosted. You'll get More ❯
Remote ?? Up to £70,000 + annual share scheme + excellent benefits What You'll Do: You'll take a lead role in driving operational excellence, ensuring the resilience, observability, and performance of web-based systems across a growing digital platform. Working within a collaborative, cross-functional environment, you'll design scalable infrastructure, automate operations, and embed SRE principles to … web applications and distributed systems, including Micro Frontends and BFFs Hands-on expertise in React and TypeScript development with an eye for performance and resilience Proven ability to implement observability practices using tools like Prometheus, Grafana, or Azure Monitor Proficiency in containerisation and orchestration (Docker, Kubernetes - ideally AKS or GKE) Experience building and maintaining CI/CD pipelines for frontend More ❯
CI/CD pipelines (Jenkins, GitHub Actions) Define and enforce platform standards across environments (dev, staging, prod) Collaborate with developers and DevOps on deployment tooling and security Enable platform observability using tools like Datadog, Prometheus, and CloudWatch Maintain Helm charts and Terraform modules for shared infrastructure Contribute to onboarding documentation and platform adoption practices Participate in incident response and postmortem … containerisation using Docker and secure image management Scripting or programming experience in Bash, Python, or TypeScript Strong understanding of GitOps practices and infrastructure lifecycle management Desirable Skills Experience with observability tooling (Datadog, Prometheus, Fluent Bit) Knowledge of admission controllers, OPA/Gatekeeper (optional for governance) Familiarity with cloud cost optimisation and Kubernetes scaling strategies Exposure to security scanning tools (tfsec More ❯
Cambridge, Cambridgeshire, United Kingdom Hybrid / WFH Options
Arm Limited
collaborate across teams to: Modernise our infrastructure by leading the migration from Docker Swarm to Kubernetes Design and operate CI/CD pipelines using CloudBees and GitLab Build out observability with Prometheus, Grafana, OpenTelemetry, and Dynatrace Automate cloud deployments (AWS-first) using Terraform and platform tooling Improve security posture across IAM, secrets, and networking Help the team ship faster and … TypeScript, Python). Validated experience operating distributed systems at scale in production. Cloud AWS (primary), Kubernetes (future), Docker (current), Terraform. Excellent debugging skills across network, systems, and data stack. Observability tooling, e.g. custom metrics pipelines, OpenTelemetry tracing, or integrations across telemetry stacks. Security engineering and practical understanding of IAM hardening, zero-trust network principles, and secrets management in data-heavy More ❯
Support incident response and root-cause analysis through effective test coverage and tooling Mentor junior QA engineers and help scale best practices across teams Contribute to documentation and test observability (logs, metrics, dashboards) Tech Stack & Environment Languages/Frameworks: Kotlin, Java, Spring Boot Frontend: React, tested using Playwright Automation: Custom test frameworks in Kotlin/Java Cloud & Infrastructure: AWS (Azure … testing strategy, and building testable systems by design Nice-to-Haves: Exposure to regulated environments (e.g., BFSI, healthcare, public sector) Experience with performance, security, or chaos testing Familiarity with observability tooling (e.g., Prometheus, Grafana, OpenTelemetry) Knowledge of contract testing, mocking, or service virtualization Mindset & Cultural Fit A builder's mindset, focused on enabling early, frequent, and safe delivery through automated More ❯
across engineering teams to build, refine, and enrich data-driven solutions that span diverse systems, data models, and cloud-native architectures. By championing best practices in engineering, including testing, observability, security, and robust documentation, you'll play a key role in ensuring Axon's platforms are reliable, maintainable, and prepared to scale. Work Location: This role is based out of … an in-house L7 (reverse) proxy that allows users to securely access parts of the data platform directly Drive best practices around production data systems, including performance, testing, security, observability, and documentation. Troubleshoot and resolve issues in production environments to ensure data integrity and platform reliability What you bring: Bachelor's Degree in Computer Science, Engineering, or related field 3+ More ❯
London, South East, England, United Kingdom Hybrid / WFH Options
Lorien
growing to meet our business needs. What you'll lead: Shape and evolve the backend technical architecture to support product scale and complexity Identify and drive improvements in performance, observability, and infrastructure Lead the design of domain models aligned with evolving business needs Be a go-to person for backend excellence, and improve code quality Engineering centric requirement definition (user More ❯