Portsmouth, Hampshire, United Kingdom Hybrid / WFH Options
Checkatrade
Hybrid working. Where do you fit in? We're seeking a Senior Platform Engineer with a strong background in cloud-native technologies and a passion for automation, DevOps, and observability practices. You'll be at the forefront of building and maintaining our infrastructure using tools like Kubernetes, Terraform, Helm, and Datadog. You will drive the adoption of infrastructure-as-code … AWS is also valuable, with a willingness to work within a GCP environment. Experience with programming languages such as Golang, Python, and JavaScript. Passion for automation, DevOps, SRE, and observability practices. Proven leadership, management skills, and excellent communication abilities. We are an equal opportunities employer committed to diversity and inclusion in the workplace. About us We're Checkatrade, the UK More ❯
Monitor, troubleshoot, and optimize data pipelines to ensure performance and cost efficiency. Implement data governance, access controls, and security measures in line with best practices and regulatory standards. Develop observability and anomaly detection tools to support Tier 1 systems. Work with engineers and business teams to gather requirements and translate them into technical solutions. Maintain documentation, follow coding standards, and … to work across technical and non-technical teams. Additional Strengths Experience with orchestration tools like Apache Airflow. Knowledge of real-time data processing and event-driven architectures. Familiarity with observability tools and anomaly detection for production systems. Exposure to data visualization platforms such as Tableau or Looker. Relevant cloud or data engineering certifications. What we offer: A collaborative and transparent … ELT workflows with Apache Airflow (or similar) and integrating them into containerised CI/CD pipelines (Docker, GitHub Actions, Jenkins, etc.)? Select Which option best describes your experience building observability and automated anomaly detection tooling for data pipelines? Select What best describes your current location and working rights status? Select By submitting your application, you confirm that you have read More ❯
Role Title: Azure Cloud Architect Location: London Hybrid - 2-3 days onsite, Canary Wharf Start Date: 08/09/2025 End Date: 27/03/2026 Rate: £650 per day - PAYE via Umbrella Only Role Overview: We are
Staff and Team Lead, Onyx Application Engineering The Onyx Research Data Tech organization represents a major investment by GSK R&D and Digital & Tech, designed to deliver a step-change in our ability to leverage data, knowledge, and prediction to
and advocating for the best solutions that improve developer productivity and system efficiency. Infrastructure Automation & Management: Use Terraform/OpenTofu and automation frameworks to provision and manage infrastructure. Monitoring & Observability: Configure and utilise observability tools like Datadog for performance monitoring, alerting, and visualisation, ensuring system reliability and quick identification of issues. Performance Optimisation: Continuously monitor the performance of the tools More ❯
Google Cloud, and Alibaba Cloud. This role ensures consistent, reliable, and secure connectivity for all workloads, services, and users across the multi-cloud landscape. Leveraging SRE principles, automation, and observability, the Cloud Connectivity Director will drive a scalable, self-service-first experience while working closely with the broader network and cloud platform teams. Key Responsibilities: Cloud Network Strategy & Architecture: Define … application and platform teams. Implement policy-as-code frameworks for routing, firewalling, and access control. Partner with platform engineering teams to integrate network services into CI/CD workflows. Observability & Operational Reliability: Establish deep observability into cloud network paths, health indicators, and latency measurements. Apply SRE practices to ensure uptime, fast incident response, and continuous improvement. Drive performance optimization and … DNS, VPC/VNet peering, private link, VPN, and transit gateway solutions. Expertise in Infrastructure as Code (e.g., Terraform, CloudFormation) and network automation. Demonstrated success implementing SRE principles and observability in large-scale environments. Preferred Qualifications: Certifications such as AWS Advanced Networking, Google Cloud Network Engineer, Azure Network Engineer Associate, or equivalent. Familiarity with service mesh, API gateways, and microservices More ❯
area of the product component or the system in aggregate and at scale. Specific domains include Workload Management (Kubernetes, Ray, and so on); Cloud Development (Cloud Infrastructure Automation); Management & Observability (open source and commercial monitoring, observability and DCIM solutions) Skills and Experience Essential Strong relevant programming experience Python/Go/C infrastructure-as-code scripting or related to the … of the products under test: Containerisation (e.g. Docker), Virtualisation and Provisioning, Workload and job scheduling (e.g. Kubernetes, Ray) on high core-count machines and rack-scale installations, Management and Observability (e.g. Prometheus, OpenTelemetry, DataDog, Splunk, etc.). 10+ years of relevant experience related to quality assurance/testing teams. Experience with the Atlassian suite and CI/CD platforms such More ❯
in implementing good practice with regard to accessibility (keyboard support, screen readers, form usability). Knowledge of various front-end architectural patterns. E2E testing experience (Cypress/Playwright). Experience with observability as a practice (logging, GA tagging, TrackJS, App Insights). If you would be interested, please apply below! INDMANS
with our sales team to help achieve our company pipeline and revenue goals. We're excited about you because you have: A deep desire to help build the data observability category. A company-first attitude in your decision-making. Comfort with ambiguity, change, and an incredibly fast-paced working environment. A bias towards speed. 1+ years
Chronosphere Chronosphere is the observability platform built for control in the modern, containerized world. Chronosphere empowers customers to focus on the data and insights that matter by reducing data complexity, optimizing costs, and remediating issues faster. The observability platform reduces data volumes and associated costs by 60% on average while saving developers thousands of hours. Chronosphere's Fluent Bit-based … Telemetry Pipeline optimizes and simplifies observability and security log data. The product transforms logs at the source and routes them to any destination without lock-in. Recognized as a leader by major analyst firms, Chronosphere is trusted by the world's most innovative brands, including Snap, Robinhood, DoorDash, and Zillow. Learn more at Chronosphere.io . Follow at LinkedIn and X … years of experience in an engineering or support related role. Experience working directly with customers in a technical setting (preferably SaaS). Customer support experience ideally in the monitoring, observability, or data pipeline space. Experience with Kubernetes, Terraform, and significant consideration if you also have Prometheus experience. Technical understanding and experience with: Coding/SDLC, Linux, Cloud providers (AWS, GCP More ❯
our north star. Current initiatives include working towards a unified build and release path, a developer portal, AI powered automation experiments, value stream diagnostics and a company wide Honeycomb observability migration. You'll work across AWS, Kubernetes, ArgoCD, Terraform, Jenkins, GitHub Actions, Go and more. A day in the life: As a DevX Engineer at Zopa, your week blends hands … on engineering with real customer impact. You'll spend most of your time building and evolving internal delivery and observability tools - writing Go code, shaping CI/CD pipelines, and refining Terraform infrastructure that supports every product team at Zopa. Each day kicks off with team stand-ups, followed by focused coding time or pairing with colleagues to deliver features … across the SDLC, observability, or developer productivity initiatives. You might jump into a platform tribe sync, a whiteboarding session where you or another member would share a new idea or collaborate with a squad to identify friction points using real data and developer feedback. There's space for outreach too - whether it's supporting engineers through our internal forum, contributing More ❯
demanding and regulated environment Foster a culture of engineering excellence, cross-functional collaboration, and high accountability Partner closely with Product and Regulatory experts to align strategy and execution Uplevel observability, DevOps, and data pipeline maturity Languages: Python, Scala, Clojure Tooling: Strong focus on observability, CI/CD, infrastructure-as-code What We're Looking For: 10+ years of engineering experience More ❯
ll dig into logs, traces and code to explain behaviour, patch bugs or raise backlog stories when deeper product work is needed. Often these investigations will result in improving observability or stability of the platform. High-impact feature work. Between investigations we deliver focused enhancements and platform improvements that don't slot neatly into long-term road-maps. Because our … team's workload is unpredictable, delivery dates are flexible and scoped by the team. Platform observability & performance. Your team members continually raise the bar on monitoring, metrics and efficiency. Joining as our newest engineer, you'll pair with seasoned Go/TypeScript/Python devs, owning real tasks from week one. Expect a dynamic mix of bug hunting, green field … and basic cloud/Linux fundamentals. Curiosity and the confidence to ask questions in a fast-moving team. Nice-to-haves Exposure to Kubernetes, Docker or Terraform. Experience with observability stacks (Grafana, Prometheus, OpenTelemetry). Familiarity with Postgres. Interest in data-privacy, AdTech/MarTech or large-scale data processing. Familiarity with Kafka, gRPC or Apache Spark. As well as More ❯
staying at the forefront of modern full-stack technologies and frameworks Product Ownership: Taking a leading role in designing, developing, and supporting impactful projects, ensuring seamless user experiences with observability baked in. You will champion a strong product mindset with a keen eye for user experience and system design Passionate about code quality, maintainability, observability and best practices across the More ❯
and reliability across FX and Fixed Income platforms. Location: Hybrid, Central London Your mission: Deliver and maintain a world-class, low-latency trading environment - managing messaging middleware, network connectivity, observability tooling, and capacity planning to meet front-office demands. What you'll do: Connectivity & Monitoring Deploy and support global market connectivity (Solace/Tibco, multicast, TCP/IP) across 30+ … 29West, Tibco, LBM) in performance-sensitive environments. Hands-on packet analysis using Wireshark, tcpdump, Corvil (custom decoder skills a bonus). Scripting/automation with Python, Bash, or PowerShell. Familiarity with observability platforms (ITRS Geneos, Prometheus, Grafana). Experience supporting real-time trading applications, feed handlers, matching engines. Excellent communication, able to translate technical metrics into business insights. Nice to have: Exposure to FIX
with event sourcing. All of our systems are on Kubernetes and using the Google Cloud Platform. This role comes with the opportunity to take ownership of our GCP infrastructure, observability, and platform reliability, with a focus on ensuring our systems remain secure, scalable, and well maintained. We encourage collaboration and our engineers are involved in the full development lifecycle, from … engineers in building new APIs and data contracts to support new functionality Maintaining and evolving our cloud infrastructure (GCP, Kubernetes) to ensure high availability, security, and performance Managing service observability and reliability, including logging, metrics and alerting (we use Prometheus and Grafana) Handling database and service upgrades (e.g. MySQL, Kubernetes), secrets management and security best practices Taking ownership of platform … Solid understanding of security best practices across infrastructure and applications, including secrets management and credential rotation. Familiarity with infrastructure-as-code or automation tools is a plus Experience with observability tools (such as Prometheus and Grafana), service monitoring, and debugging in production environments A demonstrated interest in staying up-to-date with new technology, new frameworks, new languages and other More ❯
with React & Material UI, Postgres, Hasura and AWS Serverless Technologies such as Lambda, DynamoDB and EventBridge - all managed via AWS CDK & SST. We use Sentry, Lumigo and LogRocket for observability and GitHub Actions for automated testing and deployment. End-to-end Ownership. You will be entrusted with end-to-end ownership of your projects. From product, design and architectural decisions … ideally AWS). You focus on having a high impact. You've spearheaded the engineering of critical systems before, working with best-in-class tooling in AWS, IaC, observability and quality assessments. You want to discover the best ways to bring this to an early-stage startup. You know what good can look like. You understand what it … takes to build highly reliable & well architected products. You build with quality, observability & redundancy at the forefront. You're ready to get a lot done. You enjoy all aspects of building a product and are comfortable moving across the stack when necessary. You enjoy problem solving and thinking from first principles. You're ready to pick up new skills and
while ensuring seamless coordination with the DevOps group. Spearhead the development, integration, and maintenance of CI/CD data pipelines for automated deployments. Integrate best practices for monitoring and observability to proactively detect, analyse, and resolve issues. Enforce robust data governance and security protocols through tools like Azure Key Vault, ensuring compliance with standards such as GDPR, and other regulatory … Python, SQL, Bash, and PySpark for automation. Strong aptitude for data pipeline monitoring and an understanding of data security practices such as RBAC and encryption. Implemented data and pipeline observability dashboards, ensuring high data quality, and improving the efficiency of data workflows. Experience ensuring compliance with regulatory frameworks and implementing robust data governance measures. Demonstrated ability to implement Infrastructure as More ❯
that will shape the future of global banking. The Public Cloud Engineer will implement cloud architecture (on AWS, GCP and Azure) that enables the infrastructure (compute, database, network, storage, observability) required for application hosting in public cloud using Citi's engineering processes and best practices, with particular emphasis on automation and security by design. Experience essentials (non-negotiable): Developer experience … analytics Messaging Pub/Sub services such as Kafka, RabbitMQ Cache/search technologies such as ElastiCache/Elasticsearch API services such as Apigee/Kong Elementary, Observability and monitoring 9. Identity and access management, dynamic keys and certificates Test-driven development More than 5-7 years of real-world cloud experience anywhere, not just at a bank Characteristics
Product, Design, and GTM teams to ensure fast, high-quality product delivery. Champion modern engineering practices, agile methodologies, and code quality standards. Infrastructure & Security: Ensure platform scalability, reliability, and observability through robust tooling and cloud-native practices. Implement and maintain infrastructure best practices with a focus on performance, monitoring, and security. Stakeholder Collaboration: Work closely with cross-functional stakeholders to …/ML integration, scalable platform systems, and API development. Experience with no-code/low-code platforms or developer ecosystems. Skills: Strong understanding of distributed systems, data pipelines, and observability tools. Hands-on experience with languages and tools like Ruby, Java, React, GraphQL, AWS/GCP, and Kubernetes. Bonus: Experience with vector databases, LLM frameworks (e.g., LangChain, RAG), and open More ❯
HCLTech is a global technology company, home to more than 220,000 people across 60 countries, delivering industry-leading capabilities centered around digital, engineering, cloud and AI, powered by a broad portfolio of technology services and products. We work with