Portsmouth, England, United Kingdom Hybrid / WFH Options
Trust In SODA
through the entire development life cycle. Infrastructure-as-code Bash Delivery methods and techniques, including agile scrum experience. Desirable Skills: RedHat OpenShift HashiCorp (such as Terraform, Packer, Vault) Ansible Observability (such as Prometheus, Grafana, Splunk) Containerised services (such as Postgres, Redis, Kafka, Keycloak, ELK) Experience of doing all the above at OS or S level YAML based pipelines. Immutable infrastructure More ❯
managing CI/CD pipelines, Docker containers, and security-first deployment pipelines. Implement high-availability systems and disaster recovery for business continuity across time zones and territories. Maintain system observability and monitoring to proactively identify issues and optimize system health. Ensure compliance with security standards and data privacy regulations across regions. Manage third-party vendors, licenses, and infrastructure budgets. Required More ❯
experience leading enterprise backup and disaster recovery initiatives. Working knowledge of cloud-native storage solutions such as Longhorn. Strong Linux administration skills, particularly with RHEL environments. Experience implementing comprehensive observability solutions using Prometheus, Grafana, Loki, and related tools. Ability to establish and enforce security policies through tools like Open Policy Agent. Knowledge of identity management solutions such as Keycloak. Experience More ❯
London, England, United Kingdom Hybrid / WFH Options
Future Talent Group
resilient, observable, and maintainable features. Promote DevOps culture by leading knowledge-sharing sessions and supporting issue resolution. Skills Strong grounding in SRE principles and operational best practices. Proficient with observability tools (Prometheus, Grafana, OTEL, CloudWatch) and telemetry pipelines. Solid programming skills in Python and/or Go; Java experience a plus. Strong Linux and networking fundamentals (TCP, DNS, TLS, HTTP More ❯
GCP cloud platforms Working knowledge of CI/CD tooling and practices (GitHub Actions, Jenkins, etc.) Experience with Infrastructure as Code (Terraform, CloudFormation) preferred Understanding of monitoring, logging, and observability tools Solid grasp of software development best practices (testing, code quality, documentation) Experience with modern frontend frameworks (React, Vue, Angular) a plus Background with AI/ML systems integration preferred More ❯
and postmortems to learn from system failures and prevent recurrence. Participate in on-call rotations and respond to incidents, minimising downtime and customer impact. Continuously improve deployment, configuration, and observability processes. Qualifications: Bachelor’s degree in Computer Science, Engineering, or a related field, or equivalent experience. Strong experience with Linux/Unix systems administration. Proficient in scripting and programming languages More ❯
e.g., Slackbots and integrations) to streamline IT operations and business processes. Monitoring and Maintenance: Manage and maintain network security systems through system patches and periodic maintenance tasks. Establish comprehensive observability and proactive issue-resolution strategies using tools like SNMP, Syslog, NetFlow, Elasticsearch (ELK Stack), and Grafana. Collaboration and Communication: Work with CyberEnergia teams to identify functional needs, develop secure architectures, and More ❯
Experience working in Agile teams using Git, Jira, and Confluence Eligible for SC and NPPV3 clearance Container orchestration with Kubernetes Experience with HashiCorp tools: Vault, Consul, Packer Monitoring and observability with Grafana, Prometheus, or similar Knowledge of cloud networking, VPCs, NAT Gateways, security groups, etc. Personal Attributes: Proactive, self-driven, and passionate about technology Strong problem-solving skills Collaborative team More ❯
recognize road blocks and demonstrates interest in learning technology that facilitates innovation Experience with continuous integration and continuous delivery tools like Jenkins, GitLab, Terraform Experience in at least one observability tool such as Dynatrace, Datadog, New Relic, CloudWatch, AppDynamics, Splunk. Preferred Qualification: Experience a plus in common SRE toolchains: Grafana, Prometheus, Elasticsearch, Kibana, Jaeger. More ❯
the business succeed. Provide timely and effective technical support for end users of a designated set of DevOps tools, encompassing traditional tools (e.g., CI/CD platforms, monitoring and observability tools, source code management systems) and GenAI-powered tools. Troubleshoot and resolve complex technical issues involving in-depth analysis of logs, configurations, system behaviour. Proactively monitor the health, performance, and More ❯
Bristol, Gloucestershire, United Kingdom Hybrid / WFH Options
Curo Resourcing Ltd
domain adjacent technologies/services, such as: Docker, OpenShift, Kubernetes etc. Infrastructure as Code and CI/CD paradigms and systems such as: Ansible, Terraform, Jenkins, Bamboo, Concourse etc. Observability - SRE Big Data solutions (ecosystems) and technologies such as: Apache Spark and the Hadoop Ecosystem Excellent knowledge of YAML or similar languages The following Technical Skills & Experience would be desirable More ❯
architectures, as described by thought leaders like Martin Fowler. Hands-on experience building and maintaining complex CI/CD pipelines, preferably with GitHub Actions. Familiarity with monitoring and observability tools (e.g., Prometheus, Grafana, Google Cloud's operations suite). A solid understanding of networking principles and cloud security best practices. Experience with other cloud platforms like Amazon Web Services More ❯
London, England, United Kingdom Hybrid / WFH Options
9fin
as possible. Designing and implementing a developer portal (e.g. Backstage) to provide a service catalog to the engineering team, and authoring many other useful DevOps plugins. Contributing to observability best practices and providing key SLI/SLO metric reporting, so that the engineering team can balance velocity and reliability. Develop inner/open source projects to help provide a More ❯
London, England, United Kingdom Hybrid / WFH Options
Deutsche Bank
services environment Strong technical skills in Linux/Unix systems, SQL, and scripting Strong experience with a programming language such as Python, Java, etc. Strong experience with monitoring and observability tools (Prometheus, Grafana, Splunk, Geneos, OpenTelemetry, Corvil) Familiarity with cloud platforms, containerization (e.g., Kubernetes, Docker), and CI (Continuous Integration)/CD (Continuous Delivery) pipelines Strong understanding of the trade lifecycle More ❯
Belfast, Northern Ireland, United Kingdom Hybrid / WFH Options
CME Group Inc
both independently and collaboratively. Key Responsibilities: Collaborate with senior SREs and Product engineering teams to monitor, maintain, and troubleshoot our Markets systems. Collaborate with Product teams to continuously improve observability and alerting of our applications to enable data-driven business decisions, faster issue detection and incident resolution. Take accountability for delivery of moderately complex features. Lead technical discussions for own More ❯
and their associated data services. Hands-on experience with continuous integration and deployment systems (e.g. Jenkins, Tekton). Practical experience with containerization and orchestration technologies, particularly Kubernetes. Familiarity with observability tools such as Prometheus and Grafana, the ELK stack, or similar managed service. Strong problem-solving skills and attention to detail. A demonstrated eagerness to learn and grow in a More ❯
Belfast, Northern Ireland, United Kingdom Hybrid / WFH Options
CME Group
both independently and collaboratively. Key Responsibilities: Collaborate with senior SREs and Product engineering teams to monitor, maintain, and troubleshoot our Markets systems. Collaborate with Product teams to continuously improve observability and alerting of our applications to enable data-driven business decisions, faster issue detection and incident resolution. Take accountability for delivery of moderately complex features. Lead technical discussions for own More ❯
automated deployments Familiarity with Helm charts Experience with Infrastructure as Code (IaC) tools like Terraform Knowledge of container build and deployment automation using CI/CD pipelines Experience in observability tools for both MSK and Kubernetes, including Prometheus, Grafana, and AWS CloudWatch for metrics and logs Deep understanding of Kafka and Kubernetes security practices, including network policies and IAM roles More ❯
to ask for help when needed. Strong proficiency in AWS, Azure, or GCP, with hands-on experience with Terraform. Experience with configuration management tools like Ansible or Puppet, and observability tools like Prometheus, Grafana, and Datadog. Design, develop, automate testing, and deploy custom tools using languages like Python or C#. Version control administration (e.g. GitHub, Perforce) Great to Have More ❯
Bristol, Avon, South West, United Kingdom Hybrid / WFH Options
Hargreaves Lansdown
Experience with unit, integration, and end to end testing tools and practices (e.g. Jest, Cypress, Backstop, Playwright). Experience with CI/CD and Trunk Based Development. Experience with observability tools and practices, including monitoring, logging, and tracing to ensure system reliability and performance. Understanding of Microservices & principles of RESTful API development, including structuring, documenting, versioning, testing and stubbing/ More ❯
mentoring engineers and collaborating with stakeholders. Proven ability to resolve technical incidents in unfamiliar production systems. Technical and process documentation champion. Experience of operationally managing production software components, including observability, logging, metrics, error reporting, debugging, and live incident management. Your time will be spent roughly as follows: 60% - Proactive technical work (e.g. migrating DB hosting provider, new message bus system More ❯
Python/Java/C Go), with practical experience in automating tasks, building scripts, and solving problems. - Proven track record of successful implementation of DevOps practices and culture (think observability, code reviews, source control management, building feedback loops, removing organization silos...). - Basic understanding of Machine Learning concepts related to model training, deployment, inference, MLOps, etc. PREFERRED QUALIFICATIONS - Experience designing More ❯
FX or crypto trading; front-end experience with React or similar frameworks is a plus. Collaborate with the team to implement, configure, and manage comprehensive monitoring, logging, alerting, and observability solutions - advocating for security best practices. Deploy, manage, operate, and scale applications and services on AWS - whilst troubleshooting performance issues across the stack. Collaborative, agile approach, passionate about clean architecture More ❯