Key Responsibilities As an ITS Application Services (ENET) Engineer , you will: Provide technical ownership and support for the infrastructure powering electronic trading systems. Architect, deploy, and manage high-performance observability solutions using the Elastic Stack (ELK) . Optimize Elasticsearch index lifecycle, shard configurations, and cluster performance. Develop and maintain data pipelines using Logstash , Beats , and custom ingestion strategies. Integrate ELK … promote knowledge sharing, and drive continuous improvements. Skills & Experience Required Elastic Stack (Elasticsearch, Logstash, Kibana, Beats): Advanced-level knowledge. Strong experience with data ingestion , index management , and real-time observability . Experience with Solace , 29West , Tibco , or similar messaging platforms is a plus. Working knowledge of trading protocols (e.g., FIX, market data, order entry) and low-latency systems . Hands More ❯
quality KPIs that drive accountability and continuous improvement Act as a mentor and coach for engineers and QA professionals, upskilling the org in modern testing practices Improve visibility and observability of test execution and failures Support initiatives to enhance our staging and test environments for reliable internal testing A third level degree in an Information Technology or Computer Science related … a testing or quality engineering capacity Experience with performance and load testing frameworks (e.g., k6, JMeter) Familiarity with cloud-based test environments and infrastructure (AWS preferred) Working knowledge of observability and test reporting tools (e.g., Datadog, Grafana) Experience improving test data strategies and test isolation techniques Contributions to internal tooling or open-source testing frameworks Background in building out quality More ❯
DevOps, or Infrastructure Engineering roles. Deep Linux systems knowledge and strong scripting skills (Python, Bash, etc.). Experience with containerisation (Docker) and orchestration tools (Kubernetes preferred). Familiarity with observability stacks (e.g. Prometheus, Grafana, ELK). Prior experience in trading, low-latency, or crypto systems is a strong plus. Comfortable working independently in a fully remote environment. More ❯
Bristol, Avon, England, United Kingdom Hybrid / WFH Options
interAct Consulting Limited
as-Code (IaC). Experience of Configuration-as-Code, Containerisation and Orchestration, CI/CD. Proficiency with Kubernetes, Docker and AKS. Familiarity with Azure cloud-native services. Knowledge of observability and site-reliability engineering principles. Proficiency in SQL and experience working with relational databases. This is a fully remote (UN only) position within a fabulous team. Lots of flexibility, opportunity More ❯
and shared infrastructure Identify and resolve architectural bottlenecks in the current data platform and propose improvements that reduce complexity and boost performance Drive initiatives that improve data quality, lineage, observability, and system reliability Influence and Collaborate Across Teams Act as a technical liaison between engineering, product, and analytics teams, ensuring alignment on architecture and data strategy Provide technical leadership and … workloads Familiarity with data governance, privacy, and compliance frameworks Background in customer-centric or product-driven environments (e.g., digital, eCommerce, SaaS) Experience with infrastructure-as-code and data platform observability (e.g., Terraform) What You Can Expect Interesting work - working in a fast-paced and ever-changing industry, new problems and exciting solutions are never too far away. There are always More ❯
we provide world-class resilience and performance across the platform. The remit and focus of the role is to advise on all aspects of site reliability including availability, scalability, observability and capacity planning. It's a broad and exciting role, so we're looking for someone up for a challenge - if you're an energetic and a collaborative Site Reliability … Core responsibilities Proactively monitor and analyse platform performance. Collaborate with engineering teams to address performance bottlenecks and ensure scalability. Assist engineering teams with implementing and reviewing SLOs Continually improve observability through monitoring and alerting, and dashboards, using tools such as DataDog or Prometheus for example. Work with other teams to ensure it is effective and provides full coverage. Ensure the More ❯
other teams' working lives through automation of manual tasks Lead in making the deployment of Dayshape more scalable Increase our knowledge sharing of SRE across the organisation Improve the observability of Dayshape through reporting and tool creation Drive improvements to our Cost of Giving Service (COGS) Diagnose and triage escalated issues Support, mentor, and coach other members of the team … CI/CD etc. Writing and debugging Python code One or more IaC toolset proficiency e.g. Pulumi or Terraform. Designed and built infrastructure using Azure which takes into consideration: observability, alerting, uptime SLA's and SLO's and Azure DevOps pipelines. Be able to collaborate well with both engineering teams and colleagues in customer-facing teams. Be an excellent communicator More ❯
DevOps Manager responsibilities: Oversee the team that provides operational support for Linux servers, networks, and AWS cloud infrastructure. Manage security vulnerabilities and implement mitigations. Implement and maintain monitoring and observability solutions. Provision infrastructure for new projects and products. Support project delivery and provide infrastructure design expertise. Maintain and improve configuration management (Puppet) and DevOps processes. Lead the transition to containerization More ❯
Newcastle Upon Tyne, Tyne And Wear, United Kingdom
Strive Gaming
in between - ensuring our platform is resilient, efficient, secure and developer-friendly. Key Responsibilities: Design, build, and maintain platform services and infrastructure used by product engineering teams. Improve reliability, observability, and scalability of existing systems. Develop and maintain CI/CD pipelines to support software delivery. Build tooling and automation that supports self-service infrastructure and deployment. Ensure security best More ❯
Milton Keynes, Buckinghamshire, England, United Kingdom
Noir
financial institution with soaring profits - my client is modernising platforms, embracing AI, and driving automation at scale. We're hiring a Lead Site Reliability Engineer (SRE) to drive reliability, observability, and performance across our Azure cloud infrastructure. You'll work in a modern engineering environment where we live by "you build it, you run it", focused on automation, scale, and More ❯
years in platform/SRE/DevOps roles * Strong Kubernetes experience (config and deployment) * Deep CI/CD experience - Jenkins, GitLab CI/CD or similar * Skilled with infra observability tooling (Prometheus, Grafana, etc.) * Confident with Git and repo management workflows * Strong automation mindset - reducing manual intervention wherever possible * Cloud experience (AWS, Azure or GCP) * Must be a sole UK More ❯
Leeds, West Yorkshire, England, United Kingdom Hybrid / WFH Options
Anson McCade Ltd - IT and Finance Recruitment
CI/CD pipelines (e.g., GitHub Actions, CircleCI). Bonus Skills That Impress Delivery into cloud platforms (AWS, Azure, GCP). Familiarity with relational and NoSQL databases. Experience with observability tools (e.g., DataDog, Prometheus). Test automation know-how. Exposure to open-source tools and community practices. Ready to build what matters? Apply now to shape the future of digital More ❯
Reigate, Surrey, South East, United Kingdom Hybrid / WFH Options
Client Server
of IaC principles and tools such as Terraform and Pulumi You have experience of building and improving CI/CD pipelines for product teams You have experience with cloud observability (logging, tracing, metrics, monitoring and alerting) You have experience with Containerisation - Azure Container Apps preferred You have strong scripting skills with PowerShell and/or C# .Net coding You enjoy More ❯
to development and operational teams Assist in leading design efforts for new ICAM technologies and services Reduce and limit technical debt through improved automated processes Increase GEOAxIS availability and observability Basic Qualifications STEM Bachelors degree plus 2 years, or AS degree plus 4 years of experience in Software Development, DevOps, or Systems Engineering Active TS/SCI with ability to More ❯
Caldecotte, Milton Keynes, Buckinghamshire, England, United Kingdom
Connells Group HQ
day-to-day and strategic decision making.You will be a hands-on and customer focused engineering servant-leader. You will be comfortable moving across orchestration, automation, pipelines, cloud services, observability and security domains (even if you are not an expert in them all). A non-negotiable is experience and familiarity with Microsoft Azure.You will play your part in operating More ❯
/IP, VLANs, routing). You will bring some of these skills, but more importantly you're interested in learning these things: • Hardware & physical infrastructure. • Data-driven monitoring and observability (Grafana, InfluxDB, Prometheus, Elastic). • Exposure to configuration management (Puppet, Ansible, Terraform). • Some exposure to scripting (Bash, Python). • Supporting CI/CD delivery pipelines (GitLab, GitHub). More ❯
experience in Azure and GCP) Kubernetes (AWS EKS) and container infrastructure IAM and managing cloud identities at-scale Secure development and application of IAC solutions (Terraform, Helm) Cloud-native observability and management tools Development experience in Go, Python and Rust PREFERRED QUALIFICATIONS Bachelor's degree in computer science or a related field and/or candidates with equivalent job experience More ❯
variety of open-source databases (MySQL, Postgres, Redis, etc.) -?Experience with DevOps engineering and working with container orchestration, such as with Docker or Kubernetes -?Experience with log monitoring and observability via platforms like Sumologic or Cloudwatch -?Experience automating infrastructure, testing, and deployments using tools like CircleCI Configuration management tooling and infrastructure as code knowledge is preferred but not required -?Experience More ❯
Pipelines is a plus. Experience with multi-cloud and hybrid cloud environments. Experience with Elastic (or OpenSearch) and Grafana Knowledge of ServiceNOWfor change management and incident management. Familiarity with observability tools and practices for 24x7x365 monitoring and alerting. Identity and Access Management experience is a plus for this role LI RB1 LI Remote LI Hybrid About Bentley Systems Bentley Systems More ❯
enable continuous integration and delivery (CI/CD). Make data-guided decisions that impact core business metrics and processes. Solid understanding of platform and reliability engineering approaches, including observability, performance optimisation, capturing analytics, and security best practices. Drive the adoption of new technologies like Go and Python. Facilitate collaboration between teams and build a culture of continuous improvement. Mentor More ❯
across web access, Zero Trust frameworks, WAFs, and secure mainframe connectivity. Embedding modern engineering practices such as Infrastructure as Code, Policy as Code, CI/CD, automated testing, and observability into all solutions. Simplifying and modernising our technology estate through smart design and automation, reducing manual processes and operational complexity. Guiding the adoption of Zero Trust by applying identity-first More ❯
with cross-functional stakeholders including the Data Platform team and Engineering teams. Design and maintain reliable, scalable cloud infrastructure (primarily AWS). Drive key initiatives involving container orchestration (Kubernetes), observability, security, and CI/CD. Establish best practices in platform engineering and foster a servant-leadership culture focused on empathy, empowerment, and collaboration. Work with your peers and colleagues at More ❯
with cross-functional stakeholders including the Data Platform team and Engineering teams. Design and maintain reliable, scalable cloud infrastructure (primarily AWS). Drive key initiatives involving container orchestration (Kubernetes), observability, security, and CI/CD. Establish best practices in platform engineering and foster a servant-leadership culture focused on empathy, empowerment, and collaboration. Work with your peers and colleagues at More ❯
with cross-functional stakeholders including the Data Platform team and Engineering teams. Design and maintain reliable, scalable cloud infrastructure (primarily AWS). Drive key initiatives involving container orchestration (Kubernetes), observability, security, and CI/CD. Establish best practices in platform engineering and foster a servant-leadership culture focused on empathy, empowerment, and collaboration. Work with your peers and colleagues at More ❯
methodologies, and tools such as GitLab CI and Jenkins Experience with Git Source Control System Desired Skills Familiar with Datacenter Infrastructure Management (DCIM) tools such as Netbox Familiar with Observability and Analytics platform solutions such as Splunk Familiar with Identity and Access Management (IAM) solutions such as Keycloak Familiar with Secret Management tools such as HashiCorp Vault Experience with the More ❯