AWS infrastructure (EC2, S3, IAM) for scalable cloud solutions Test and debug APIs using Postman Work with graph databases (Neo4j) to enhance platform functionality Contribute to platform monitoring and observability solutions What's in it for You Gain hands-on experience with enterprise-level Java and Spring Boot development Contribute to an EdTech and Tech for Good start-up that More ❯
Knutsford, Cheshire, North West, United Kingdom Hybrid / WFH Options
Experis
resource is required to assist in upgrading the Elastic DP estate to Kubernetes thereby moving away from Obsolete technology (Cloudera), uplifting to RHEL 8, contributing towards improving stability and observability of the platform and providing advanced analytics tooling and services for modelling analytics Role purpose/summary Working across continuous integration, development, build and deployment using automation & cloud technologies to More ❯
Solve performance, concurrency, and scalability challenges in a multithreaded environment Help drive the move to Linux and containerised workloads using Docker, AKS Support modernisation efforts including GPU computation and observability Tech Stack C#/.NET Azure : Batch, Blob/Table/Queue Storage, AKS, CosmosDB Distributed event-driven architecture (CQRS/event sourcing) Docker, Linux Frontend: minimal exposure to JavaScript More ❯
to-day and strategic decision making. You will be a hands-on and customer focused engineering servant-leader. You will be comfortable moving across orchestration, automation, pipelines, cloud services, observability and security domains (even if you are not an expert in them all). A non-negotiable is experience and familiarity with Microsoft Azure. You will play your part in More ❯
and architecture through to production deployment and support. You'll work closely with experienced engineers and domain experts to deliver mission-critical services with a strong focus on scalability, observability (DataDog), and quality. You'll also contribute to architectural design, sequence diagrams, and flow mapping, ensuring robust documentation and testing standards are met. This is a full Agile environment, and More ❯
and architecture through to production deployment and support. You'll work closely with experienced engineers and domain experts to deliver mission-critical services with a strong focus on scalability, observability (DataDog), and quality. You'll also contribute to architectural design, sequence diagrams, and flow mapping, ensuring robust documentation and testing standards are met. This is a full Agile environment, and More ❯
and architecture through to production deployment and support. You'll work closely with experienced engineers and domain experts to deliver mission-critical services with a strong focus on scalability, observability (DataDog), and quality. You'll also contribute to architectural design, sequence diagrams, and flow mapping, ensuring robust documentation and testing standards are met. This is a full Agile environment, and More ❯
Employment Type: Contract
Rate: £500 - £700/day Day Rate Contract | 6 months
be: Comfortable working autonomously and taking independent decisions as well as having the ability to work cooperatively within a team, Experience working with microservice architectures and building monitoring/observability metrics, Understanding of cloud native landscapes (AWS or Azure or GCP), Knowledgeable of containerized environments would be beneficial(Docker or Kubernetes). Benefits we offer: 23 days' holiday + all More ❯
variety of open-source databases (MySQL, Postgres, Redis, etc.) -?Experience with DevOps engineering and working with container orchestration, such as with Docker or Kubernetes -?Experience with log monitoring and observability via platforms like Sumologic or Cloudwatch -?Experience automating infrastructure, testing, and deployments using tools like CircleCI Configuration management tooling and infrastructure as code knowledge is preferred but not required -?Experience More ❯
aligned to Public Cloud Lab/s and will work with the relevant Product Owners and Engineering Leads, using data, to balance product improvements covering aspects such as reliability, observability and performance, with new feature development Key Responsibilities You will help improve the SRE framework and principles to strengthen focus, behaviours, and culture You will support the POs and ELs More ❯
aligned to Public Cloud Lab/s and will work with the relevant Product Owners and Engineering Leads, using data, to balance product improvements covering aspects such as reliability, observability and performance, with new feature development Key Responsibilities You will help improve the SRE framework and principles to strengthen focus, behaviours, and culture You will support the POs and ELs More ❯
Pipelines is a plus. Experience with multi-cloud and hybrid cloud environments. Experience with Elastic (or OpenSearch) and Grafana Knowledge of ServiceNOWfor change management and incident management. Familiarity with observability tools and practices for 24x7x365 monitoring and alerting. Identity and Access Management experience is a plus for this role LI RB1 LI Remote LI Hybrid About Bentley Systems Bentley Systems More ❯
or similar) Significant experience with CI/CD tools (Jenkins, GitHub Actions, CircleCI or similar) Preferred Experience: Experience with real time data pipelines (AWS Kinesis, Kafka, Spark) Experience with observability tools (Metaplane, MonteCarlo, Datadog or similar) Experience within FinTech/Finance/FX preferred More ❯
Hart, Yorkshire, United Kingdom Hybrid / WFH Options
RVU Co UK
enable continuous integration and delivery (CI/CD). Make data guided decisions that impact core business metrics and processes. Solid understanding of platform and reliability engineering approaches, including observability, performance optimisation, capturing analytics and security best practices. Drive the adoption of new technologies like Go and Python. Facilitate collaboration between teams and build a culture of continuous improvement. Mentor More ❯
Product Manager and stakeholders to refine requirements, scope technical feasibility, and ensure user and business needs are met. Drive DevOps & Modern Practices: Promote CI/CD, infrastructure as code, observability, and automated testing to improve delivery speed and stability. Develop the Team: Support engineers of varying experience levels, identifying opportunities for development and knowledge-sharing. Top Skills/Experience Required More ❯
enable continuous integration and delivery (CI/CD). Make data-guided decisions that impact core business metrics and processes. Solid understanding of platform and reliability engineering approaches, including observability, performance optimisation, capturing analytics, and security best practices. Drive the adoption of new technologies like Go and Python. Facilitate collaboration between teams and build a culture of continuous improvement. Mentor More ❯
with business requirements and scalability. Code Development & Review : Write, review, and maintain production-quality Python code for NLP applications, ensuring high-quality, reliable, and efficient code. Enhance Scalability and Observability : Optimize NLP solutions to be more scalable, observable, and resilient, with a focus on improving performance and monitoring in production environments. Stakeholder Communication : Serve as the main point of contact More ❯
with cross-functional stakeholders including the Data Platform team and Engineering teams. Design and maintain reliable, scalable cloud infrastructure (primarily AWS). Drive key initiatives involving container orchestration (Kubernetes), observability, security, and CI/CD. Establish best practices in platform engineering and foster a servant-leadership culture focused on empathy, empowerment, and collaboration. Work with your peers and colleagues at More ❯
members Participate in code reviews to develop clean, secure, testable and maintainable code Gain exposure to DevOps tools and processes, including containerisation (e.g., Docker), CI/CD pipelines, and observability platforms (e.g., Datadog, Grafana) Troubleshoot and resolve technical issues in collaboration with your team Stay curious and proactive, learning new technologies and contributing ideas to innovate our platform What We More ❯
standards, architectural principles, and development best practices to ensure a high-quality, maintainable, and efficient codebase. Taking ownership of key performance indicators (KPIs), you will drive improvements in operability, observability, and scalability. Beyond technical leadership, you will foster a culture of innovation, continuous learning, and knowledge sharing within the team. Encouraging an inclusive environment, you will promote new ideas, experimentation More ❯
and prior hands-on experience using AWS services at the DevOps Engineer level. Previous experience with incidents, change and problem management. Strong background in setup and operation of enterprise observability tooling, specifically Prometheus, Grafana and Splunk, including usage of PromQL. Proficient in one or more languages of Python, Go, Bash, SQL. Familiar with GitHub, GitOps, container orchestration, and Kubernetes operations. More ❯
and prior hands-on experience using AWS services at the DevOps Engineer level. Previous experience with incidents, change and problem management. Strong background in setup and operation of enterprise observability tooling, specifically Prometheus, Grafana and Splunk, including usage of PromQL. Proficient in one or more languages of Python, Go, Bash, SQL. Familiar with GitHub, GitOps, container orchestration, and Kubernetes operations. More ❯
infra-as-code (Terraform), rollout, and user training. Collaborate directly with executives & operators - run white-boarding sessions, turn ambiguous requirements into concrete specs, demo weekly, and iterate fast. Champion observability & reliability - instrument services with OpenTelemetry, define SLIs/SLOs, and automate incident response. Contribute across the stack - build lightweight front-ends when needed and pair with ML engineers on inference More ❯
infra-as-code (Terraform), rollout, and user training. Collaborate directly with executives & operators - run white-boarding sessions, turn ambiguous requirements into concrete specs, demo weekly, and iterate fast. Champion observability & reliability - instrument services with OpenTelemetry, define SLIs/SLOs, and automate incident response. Contribute across the stack - build lightweight front-ends when needed and pair with ML engineers on inference More ❯
infra-as-code (Terraform), rollout, and user training. Collaborate directly with executives & operators - run white-boarding sessions, turn ambiguous requirements into concrete specs, demo weekly, and iterate fast. Champion observability & reliability - instrument services with OpenTelemetry, define SLIs/SLOs, and automate incident response. Contribute across the stack - build lightweight front-ends when needed and pair with ML engineers on inference More ❯