automation, scalability, and high reliability. A strong working knowledge of Microsoft Azure is essential. The role involves daily coding, technical leadership across orchestration, CI/CD pipelines, cloud services, observability, and security-working alongside site reliability, onboarding, architecture, and delivery functions. You're expected to scale impact through others by upskilling team members, hiring where needed, and championing platform engineering More ❯
Milton Keynes, Buckinghamshire, South East, United Kingdom
Hays
automation, scalability, and high reliability. A strong working knowledge of Microsoft Azure is essential. The role involves daily coding, technical leadership across orchestration, CI/CD pipelines, cloud services, observability, and security-working alongside site reliability, onboarding, architecture, and delivery functions. You're expected to scale impact through others by upskilling team members, hiring where needed, and championing platform engineering More ❯
models. Desirable: Master's degree or DAMA certification (CDMP).Proven expertise with Workday product and data integrations. Experience across different database architectures (Lakehouse, Relational, Document Store). Familiarity with observability tools, BI platforms (Power BI), or Agile methods. Knowledge of advanced analytics, data science, or NLP techniques. Soft Skills Strong stakeholder management and communication skills, able to bridge technical and More ❯
Caldecotte, Milton Keynes, Buckinghamshire, England, United Kingdom
Connells Group HQ
day-to-day and strategic decision making.You will be a hands-on and customer focused engineering servant-leader. You will be comfortable moving across orchestration, automation, pipelines, cloud services, observability and security domains (even if you are not an expert in them all). A non-negotiable is experience and familiarity with Microsoft Azure.You will play your part in operating More ❯
Platform (ETP) team provides tailored infrastructure solutions for the firm's global markets businesses, we engineer and operate low latency trading environments globally. We leverage modern infrastructure, automation, and observability to optimize performance and manage risk on behalf of our clients. The team's areas of responsibility encompass server hardware, Linux, networking, configuration management and packet capture analytics. The team More ❯
to have: 4+ years working with an object-oriented programming language (C/C++, Python, Java, etc.) Strong communication skills and ability to collaborate with diverse stakeholders Familiarity with observability tools like Grafana, Splunk, or Humio Experience with CI/CD tooling and automation A degree in Computer Science, Computer Engineering, or equivalent practical experience We would love to see More ❯
SQL) and query optimization Stored procedures, functions, triggers, views, and complex queries ETL processes, backend reporting, and database automation scripts Database performance monitoring, troubleshooting, and optimization Log analytics and observability platforms, particularly Splunk Containerization and orchestration technologies (Docker, Kubernetes) Cloud platforms (AWS GovCloud, SC2S, C2S) Windows and Linux operating systems Strong understanding of forensic and investigative data requirements Demonstrated experience More ❯
technologies (listed below) is advantageous: Kotlin Cloud Technologies (Kubernetes, Open Shift) Messaging Technologies (Kafka, Solace, TIBCO) Database/Data Store/Data Query Technologies (SQL Server, Trino, Mongo, S3) Observability Technologies (OpenTelemetry, Elastic Stack/ELK, Grafana) This job description provides a high-level review of the types of work performed. Other job-related duties may be assigned as required. More ❯
mission requires Communicate work using SMART Understanding of forensic and investigative data requirements Data processing, automation and ETL workflow orchestration GraphQL Server side technologies (Apache Tomcat) Log analytics and observability platforms, particularly Splunk SAFe Agile methodologies Working in classified environments SECURITY CLEARANCE: US Citizenship required Active, current, and fully adjudicated (no interim) US Department of Defense Top Secret Clearance with More ❯
and CI/CD workflows (GitLab CI). Write clean, production-grade code in Python (Scala is a bonus). Build infrastructure using Terraform, AWS CloudFormation, or SAM. Drive observability across the platform using Datadog or CloudWatch. Actively mentor Data Engineers and Associates, and lead technical discussions and design sessions. Key requirements: Must-Have: Strong experience with AWS services: Glue More ❯
In this dynamic and fast-paced environment, your role will be essential to ensuring our infrastructure remains resilient, secure, and scalable. You'll work on automating operations, enhancing system observability, and driving continuous improvements that reduce downtime and improve efficiency. If you're motivated by solving, multi-layered problems and building systems that perform reliably amid shifting priorities, we encourage More ❯
Edinburgh, Midlothian, United Kingdom Hybrid / WFH Options
Aberdeen
Financial Services Cloud. Strong understanding of Salesforce security, data modelling, and governor limits. CI/CD pipeline implementation experience (Gearset, Copado, Azure DevOps). Familiarity with test automation and observability tools. Agile delivery experience and strong communication skills. Salesforce certifications (eg, Platform Developer II, Application Architect) are preferred. We are proud to be a Disability Confident Committed employer. If you More ❯
us to silently drop spans. - An enthusiasm for both software development and systems engineering. - A high bar for code and configuration quality and readability. - A good understanding of current observability and reliability practices. - Experienced and comfortable in running incident response. - Big picture thinking - you can make trade offs on technical work streams against business impact. - Fantastic communication skills. You're More ❯
Financial Services Cloud. Strong understanding of Salesforce security, data modelling, and governor limits. CI/CD pipeline implementation experience (Gearset, Copado, Azure DevOps). Familiarity with test automation and observability tools. Agile delivery experience and strong communication skills. Salesforce certifications (e.g., Platform Developer II, Application Architect) are preferred. We are proud to be a Disability Confident Committed employer. If you More ❯
managers, and support teams to evolve our cloud-first DevOps practice. You will lead critical projects involving migration of legacy systems to AWS, design and optimise deployment pipelines, improve observability and incident response, and drive process alignment across teams and platforms. This is a hands-on leadership role ideal for someone who thrives on solving complex problems, influencing architecture, and … contribute to migration projects, helping move legacy, customerhosted systems into cloud-native architectures. Develop and maintain infrastructure-as-code using Terraform. Guide improvements to system architecture, focusing on reliability, observability, and cost-efficiency. Contribute to modernising our CI/CD practices, enabling safe, rapid delivery of services across environments. Improve operational visibility through monitoring, logging, and tracing, and support incident … DNS, security, and CDN layers. Familiarity with PostgreSQL or MySQL in production environments. Experience leading or participating in cloud migration efforts. Exposure to multi-tenant SaaS architectures. Familiarity with observability platforms (e.g., Datadog, Splunk, ELK). Knowledge of modern networking practices in AWS. Interest in the golf industry or experience working on platforms related to booking, memberships, or golf course More ❯
California, with additional locations across the globe. What you'll do: As a Site Reliability Engineer at Zefr, you'll apply your expertise in cloud infrastructure, CI/CD, Observability, and core SRE concepts, to deliver high-quality, reliable, and scalable solutions. A significant aspect of this role involves working closely with Zefr's Engineering and Data Science teams ensuring … EKS expected), Helm, Kustomize Service Mesh: Istio CI/CD & Automation: CI/CD Pipelines: GitHub Actions GitOps/Continuous Delivery: Argo CD Primary Scripting/Automation Language: Python Observability & Monitoring: Monitoring & Alerting: Prometheus, Datadog, Pagerduty Telemetry Standards: OpenTelemetry Application & Data Ecosystem (Supporting): Application Languages/Frameworks: Python, FastAPI, Flask, Node.js, React Data Streaming: Apache Kafka Data Processing/Transformation … CircleCI, Argo CD, Flux) Knowledge of IaC and configuration management tools (Terraform, OpenTofu, Crossplane, Pulumi, Ansible, CloudFormation) Strong problem-solving experience, focusing on automation Production experience with Monitoring and Observability tools (Prometheus, Grafana, Datadog, Thanos, New Relic, Open Telemetry) Understanding of Cloud Networking concepts (Mesh Networking, NAT, Load Balancers, SSL Certificates and TLS termination, API Gateways, proxies, etc) Strong written More ❯
for leading and executing the migration of data, dashboards, alerts, and configurations from Splunk systems to Elasticsearch. This role involves deep technical expertise in Splunk architecture, data ingestion, and observability tools, along with strong project management and stakeholder communication skills. Must have skills: -Splunk -ELK Stack -Kibana Nice to have skills: -stakeholder communication skills -strong project management Responsibilities: Minimum number More ❯
Chester, Cheshire West and Chester, Cheshire, United Kingdom
Ascendion
product teams, infrastructure, and DevOps to address platform issues and implement improvements. Architect and develop resilient backend systems primarily using Java, Spring, Kafka, and Oracle. Implement best practices for observability, incident response, and operational excellence in line with SRE principles. Drive automation and self-healing mechanisms across platform components. Provide technical leadership and hands-on coding as needed. Monitor, troubleshoot More ❯
able to build new DevOps pipelines AWS S3 RDS Route 53 IAM EKS Secrets Manager ECR Kubernetes Helm Kops Ingress/Egress Terraform Deployment of AWS Resources Pipelines OCI Observability ELK Dynatrace Prometheus Others Vault RedHat Skills working in a secure environment and ability to adhere to security principles Experience in support organisation DevOps Engineer - HLC DV UK wide (Manchester More ❯
recovery procedures to ensure system availability and data integrity. o Monitor and troubleshoot system resources in the AWS environment, ensuring modern Site Reliability Engineering best practices and client approved observability tools, such as OpenTelemetry, Dynatrace, Elastic, etc. • Collaboration and Security: o Work closely with development, operations, and security teams to ensure cloud solutions align with organizational goals and security requirements. More ❯
small team Data Engineers. Define and deliver the data engineering roadmap aligned with business priorities. Own and evolve the data platform architecture from ingestion and transformation through to governance, observability, and real time processing Drive adoption of best practices (CI/CD, testing, infra as code) and introduce new technologies where appropriate. Collaborate with stakeholders across Engineering, Product, and Analytics More ❯
neuroscience, and clinical datasets Build a unified feature store to serve ML training and downstream biological analysis Develop scalable storage, ingestion, and validation systems with a focus on robustness, observability, and versioning Collaborate with ML researchers and biologists to translate raw data into actionable insights and high-quality training data Scale distributed systems using Kubernetes, Terraform, and orchestration tools such More ❯
City of London, London, United Kingdom Hybrid / WFH Options
Hlx Technology
neuroscience, and clinical datasets Build a unified feature store to serve ML training and downstream biological analysis Develop scalable storage, ingestion, and validation systems with a focus on robustness, observability, and versioning Collaborate with ML researchers and biologists to translate raw data into actionable insights and high-quality training data Scale distributed systems using Kubernetes, Terraform, and orchestration tools such More ❯
london, south east england, united kingdom Hybrid / WFH Options
Hlx Technology
neuroscience, and clinical datasets Build a unified feature store to serve ML training and downstream biological analysis Develop scalable storage, ingestion, and validation systems with a focus on robustness, observability, and versioning Collaborate with ML researchers and biologists to translate raw data into actionable insights and high-quality training data Scale distributed systems using Kubernetes, Terraform, and orchestration tools such More ❯
london (city of london), south east england, united kingdom Hybrid / WFH Options
Hlx Technology
neuroscience, and clinical datasets Build a unified feature store to serve ML training and downstream biological analysis Develop scalable storage, ingestion, and validation systems with a focus on robustness, observability, and versioning Collaborate with ML researchers and biologists to translate raw data into actionable insights and high-quality training data Scale distributed systems using Kubernetes, Terraform, and orchestration tools such More ❯