solutions from the ground up? Our client is seeking a talented and motivated Senior Software Engineer to lead the development of our next-generation observability platform. THIS IS NOT A DEVOPS ROLE. Responsibilities Collaborate within a dynamic software engineering team to architect and build a new cloud-native IaC platform. More ❯
orchestration (ECS/Fargate). Good understanding of CI/CD pipelines and DevOps workflows. Solid grasp of SRE principles – SLIs, SLOs, error budgets, observability, etc. Familiarity with Datadog, Prometheus, Grafana, or similar tools. Experience with Nix is a plus (or curiosity to learn it). Bonus if you’ve More ❯
understanding of distributed systems , microservices architecture , and event-driven design Experience working with cloud platforms (e.g., AWS, GCP, Azure) Familiarity with monitoring, logging, and observability tools Bonus Points For Contributions to open-source projects Contributions to an AI product ⚙️ Tech Stack: Golang, GCP, microservices, Kubernetes, Kafka, MongoDB, Prometheus If scalability More ❯
performance, and reliability. Write clean, maintainable code and contribute actively to the codebase. Define and uphold engineering best practices (code quality, CI/CD, observability, etc.). Collaborate closely with the CTO and product team to align technical delivery with strategic goals. Continuously improve team operations, development workflows, and developer More ❯
Oracle, SQL, InfluxDB). Knowledge of containerisation tools (Docker, Kubernetes). Any experience of the following would be advantageous: BrowserStack tools e.g. Live, Automate, Observability Concourse/Jenkins Docker JMeter Karate/Rest API Postman/SOAP UI Selenium Grid SQL Unix based systems XML Benefits Alongside your salary of More ❯
including Azure Functions, LogicApps, and container-based services. Drive cost-effective use of cloud services and performance optimization across environments. Ensure cloud architectures support observability, fault tolerance, and high availability. Data Engineering & Intelligent Systems Guide the engineering team in data modelling, data access patterns, and efficient use of databases (MSSQL More ❯
technical decisions, whether individually or with a team Extensive knowledge of different data stores, and formats Solid understanding of software engineering fundamentals, version control, observability, unit and integration testing Our ideal candidate will champion engineering excellence, open source, enabling internal users and creating delightful user experiences. We are looking for More ❯
practices, Experience with Public and Private Cloud solution design, Experience with Kubernetes and CICD, Experience in designing and implementing Infrastructure components encompassing Databases, Monitoring, Observability, Orchestration, Networking, IAM, Message Broker technologies, Exposure to API Management, Experience with Architecture frameworks and best practices. Familiarity with CRM tools (e.g., Salesforce, Hubspot). More ❯
AWS/Azure)/on-premise ecosystem and seek a Network Engineer for design, implementation, and stewardship of our hosting environments. This role emphasizes observability, high-availability, security, and infrastructure-as-code, involving collaboration with TechOps, DevOps, and InfoSec, and support of existing/acquired assets. Responsibilities Implement technical solutions. More ❯
/GCP Infrastructure as code (we use Pulumi) Relational databases such as MySQL/PostgreSQL Proficiency in writing and maintaining test suites Monitoring and observability tools, for example Grafana/Crashlytics What we offer A competitive salary and benefits package (depending on experience). Holidays: 32 days paid leave including More ❯
Cardiff, South Glamorgan, United Kingdom Hybrid / WFH Options
RVU Co UK
cost optimised Cloud platforms (preferably Azure) from the ground up, following well architected principles Solid understanding of platform and reliability engineering approaches (SRE), including observability, performance optimisation, capturing analytics and security best practices Experience implementing Service Level Objectives and using them to drive error budgets, risk management and alerting Knowledge More ❯
product-centric operating models. Solid understanding of hybrid/multi-cloud environments, DevOps, CI/CD, SRE, DevSecOps models, DevX, build and deployment pipelines, observability, and ITIL. Proven experience leading/managing/mentoring a team of DevOps/SRE/Platform professionals. Presales/Sales experience would also be More ❯
best practices in cloud-native application deployment on Azure . Work closely with developers to enhance build automation, release management, and deployment strategies. Optimise observability, logging, and monitoring within development environments. Troubleshoot and resolve build, deployment, and performance issues. Skills & Experience Required Strong hands-on experience in DevOps and CI More ❯
product-centric operating models. Solid understanding of hybrid/multi-cloud environments, DevOps, CI/CD, SRE, DevSecOps models, DevX, build and deployment pipelines, observability, and ITIL. Proven experience leading/managing/mentoring a team of DevOps/SRE/Platform professionals. Presales/Sales experience would also be More ❯
monitoring, and incident response Ability to debug and resolve issues directly in production code Solid experience with Kubernetes , AWS , CI/CD pipelines , and observability tools A proactive, collaborative approach with a strong engineering mindset Desirable Experience Background in regulated or high-availability environments such as Payments or FinTech Understanding More ❯
What You’ll Do Set technical direction and provide architectural guidance across multiple teams. Champion best practices: cloud-native design, DevOps, CI/CD, observability, TDD, and Pair Programming. Collaborate with product, data, UX, and platform teams to build scalable, event-driven systems. Mentor and grow senior engineers, promoting a More ❯
creation and management of Service Level Indicators (SLIs) and Service Level Objectives (SLOs), ensuring reliability and performance. An understanding of Microservices & container orchestration Strong Observability & Monitoring experience (preferably tools such as Dynatrace, Prometheus or OpenTelemetry) Experience delivering DevOps/SRE Best Practices and cost optimisation proposals Experience in Multi-Cloud More ❯
and performance management Solid commercial C++ experience on complex systems Proven experience with large, multi-component systems and distributed team practices Strong background in observability and logging Familiarity with infrastructure-as-code and automated deployments (Terraform, Helm or Flux) We make an active choice to be inclusive towards everyone every More ❯
and performance management Solid commercial C++ experience on complex systems Proven experience with large, multi-component systems and distributed team practices Strong background in observability and logging Familiarity with infrastructure-as-code and automated deployments (Terraform, Helm or Flux) We make an active choice to be inclusive towards everyone every More ❯
to build scalable, low-latency systems. Collaborate on every part of the stack—from data ingestion and indexing to query planning and optimisation. Build observability, reliability, and scalability into a product being designed for some of the most demanding data applications in the world. Operate in a high-agency environment More ❯
and AI goals. 🛠️ Tech Stack : React, .NET 🧠 You’ll bring : Proven expertise in React and .NET in production environments Strong experience with system design, observability, and performance optimisation Great communication and stakeholder engagement skills Background in financial, payroll, or reporting systems is a bonus 💼 Contract Details Day rate : TBC Length More ❯
and performance management Solid commercial C++ experience on complex systems Proven experience with large, multi-component systems and distributed team practices Strong background in observability and logging Familiarity with infrastructure-as-code and automated deployments (Terraform, Helm or Flux) We make an active choice to be inclusive towards everyone every More ❯
and spike solutions to evaluate technologies, libraries, and approaches for improving system reliability, auditing, and financial reconciliation accuracy. Open Standards: Support our commitment to observability and open standards. Contribute to initiatives around OpenTelemetry, OpenAPI, and other tools that improve transparency and traceability across services. About you At least 5 years More ❯
deployment, monitoring, and incident response Tune performance across OS, network, and cloud layers — this role is hands-on and detail-oriented Improve system resilience, observability, and security in a high-stakes production environment Requirements: Fluent in Linux — not just using it, but understanding how it works under the hood Advanced … Docker (Kubernetes is a plus), infrastructure-as-code, and CI/CD tooling Strong scripting and automation experience in Python and Bash Familiarity with observability stacks (Prometheus, OpenTelemetry, eBPF) Cloud infrastructure experience (AWS/GCP/Azure), with attention to IAM and software supply chain security Curious, persistent, and comfortable More ❯
Platform for high-throughput, low-latency workloads. Implement infrastructure-as-code (Terraform/Bicep) and automated release workflows that enable true continuous delivery. Drive observability: log aggregation, metrics, distributed tracing and on-call runbooks. Champion security, cost-efficiency and performance tuning across our services. Collaborate with product and platform teams … Excellent communication skills and a track record of cross-team collaboration. Nice to have: Kubernetes expertise (GKE/AKS/EKS) and container-native observability stacks (Prometheus/Grafana). NoSQL experience (Firestore, Cosmos DB, DynamoDB, MongoDB). Experience with game-backend scales, real-time services or hybrid cloud/… PostgreSQL, MS SQL Server, Redis. Messaging: Pub/Sub, RabbitMQ, Azure Service Bus. Infra & Ops: Docker, Kubernetes, Terraform/Bicep, GitHub Actions, Cloud Build. Observability: OpenTelemetry, Grafana, Elastic. More ❯