london (city of london), south east england, united kingdom
Scrumconnect Consulting
Cloud Infrastructure: AWS (EKS, RDS, Aurora, ElastiCache, Kafka, IAM) Secure Hosting: Experience working with air-gapped or government-secure environments Container & Cluster Management: Docker, Kubernetes, Rancher, Jenkins, Helm Monitoring & Observability: Prometheus, Grafana, ELK Stack, Dynatrace Secrets & Identity Management: HashiCorp Vault, Keycloak CI/CD & DevOps Tooling: Jenkins, Git, ServiceNow, Trivy, Terraform Streaming & Messaging: Apache Kafka (including Kafka Replication) Data Layers … tooling and self-service developer pipelines for tenant teams. Proactively manage and resolve tech debt by working with central governance bodies and ensure visibility to the board. Increase automation, observability, and testing coverage across the platform components while enabling data-driven decision-making. Align delivery with the product roadmap, collaborating with internal/external platform and infrastructure teams to support More ❯
point for production issues in client reporting systems. Conduct real-time incident troubleshooting, root cause analysis, and postmortems. Collaborate with product and operations teams to address reliability risks. Implement observability tools (metrics, logging, tracing) for actionable insights. Automate deployment, monitoring, and incident response using tools like Ansible, Terraform, Python etc. Translate financial domain requirements into technical reliability strategies. Apply knowledge … with AWS and Kubernetes for infrastructure management. Proficiency with monitoring tools such as ELK stack and New Relic. Experience in CI/CD pipeline support and management. Understanding of observability principles and OpenTelemetry. Knowledge of ITIL practices and support processes. Strong collaboration, communication, and problem-solving skills. Experience with Power Platform tools (Power BI, Power Automate, Power Apps) to enhance More ❯
mesh integrations. Automation & Scripting: Develop and maintain Ansible playbooks and Python automation for configuration management, provisioning, and compliance, including working against API interfaces of network equipment. Monitoring & Telemetry: Implement observability tools using SNMP, sFlow, and gRPC to detect and address network bottlenecks at scale. Incident Management: Lead L3/L4 network incident response, escalation management and root cause analysis in … JNCIE, CCIE, or equivalent. Familiarity with network security frameworks and best practices. Experience with hybrid cloud and cloud connectivity solutions (e.g. AWS/Azure Direct Connect). Exposure to observability platforms and time-series databases (e.g. Grafana, Prometheus, InfluxDB). Qualities we look for: Set the standard : Every single day, you spot opportunities to constructively shake things up. Inspire the More ❯
contributing to system design and architectural evolution. Leading by example in trunk-based development, automated testing, CI/CD, and infrastructure-as-code principles. Taking ownership of performance, resilience, observability, maintainability, security, and accessibility. Building and operating a suite of Node.js backend services, React-based web apps, and React Native mobile experiences that form the backbone of our mental health … appropriately Strong communication skills, builds great colleague relationships across disciplines Desired Deep experience with React or React Native in production environments. Experience operating production systems and improving them through observability, testing, and thoughtful design. A track record of delivering product improvements that tie into measurable business and user outcomes. Familiarity with cross-platform design, mobile/web accessibility, and monorepos. More ❯
Salary banding: £90,000 - £110,000 dependent on experience Working pattern: 1-2 days per week in office Location: London About our Engineering Team As a business which has AI at its core, we need to have a reliable, scalable More ❯
Job Title Observability Engineer Location Asda House Employment Type Full time Contract Type Permanent Hours Per Week 37.5 Salary Competitive salary plus benefits Category Software Engineering Closing Date 29 August 2025 We are looking for an Observability Engineer who will report into the Engineering Manager and contribute to the delivery of our mission through a combination of design, build & implementation … configuration and support, and over time evolve to include the wider goals of the team. What You'll Love Design, build, and evolve core features of New Relic's observability platform (APM, logs, traces, infrastructure monitoring) for high throughput and scalability Configure New Relic dashboards, alerts, synthetic monitoring, distributed tracing, and log management Collaborate with cross-functional teams (product, SRE … UX) to translate requirements into resilient, cost-effective observability solutions Implement observability-as-code: define dashboards, alerts, synthetic monitors, notification channels & tags using New Relic Integrate instrumentation standards like OpenTelemetry across distributed systems Actively mentor Associate engineers, and lead incident response and analysis Work with stakeholders to understand problems, analyse requirements, develop ideas and design & deliver solutions that enhance engineering More ❯
technical proficiency in: Languages: Java 17+ (Java 21 preferred) Frameworks: Micronaut (preferred), Spring Boot Testing: JUnit, Mockito Build Tools: Gradle Data & Messaging: Kafka, MongoDB APIs: GraphQL Federation, REST Infrastructure & Observability: Terraform, OpenTelemetry, Dynatrace Please get in touch asap for a chance to work on this amazing project. More ❯
Cambridge, Cambridgeshire, East Anglia, United Kingdom Hybrid / WFH Options
La Fosse
infrastructure platform with AI-operable capabilities Oversee key infrastructure components such as data centre expansion, programmable compute, and software-defined network/storage Enable automation-first delivery models with observability, self-healing, and policy-driven control Implement and mature GitOps workflows, IaC pipelines, and CI/CD processes across engineering teams Lead programme governance, risk management, and stakeholder engagement Partner More ❯
function integrated throughout the software development lifecycle. Partnering closely with product and engineering teams, you will help scope and estimate strategic work, align on tooling, and drive improvements in observability, automation, and testing. Ideal Experience & Skills Demonstrated technical leadership across diverse skillsets, including Site Reliability Engineering (SRE), DevOps, and Quality Assurance (QA) Proven track record of aligning and integrating cross More ❯
and industry security standards (e.g. OWASP CI/CD, SAMM) are adhered to across systems Managing and improving cloud security posture (Azure Defender, Prisma Cloud etc) Implementing and optimising observability platforms for holistic system monitoring Supporting and securing software delivery lifecycle, from development to deployment and ongoing operations The successful Security Engineer's essential skills will include: Demonstrated experience in More ❯
Tunbridge Wells, Kent, Royal Tunbridge Wells, United Kingdom Hybrid / WFH Options
FPSG
and industry security standards (e.g. OWASP CI/CD, SAMM) are adhered to across systems Managing and improving cloud security posture (Azure Defender, Prisma Cloud etc) Implementing and optimising observability platforms for holistic system monitoring Supporting and securing software delivery lifecycle, from development to deployment and ongoing operations The successful Security Engineer's essential skills will include: Demonstrated experience in More ❯
Glasgow, Lanarkshire, Scotland, United Kingdom Hybrid / WFH Options
Circle Group
solid understanding of quality engineering principles. Ability to work autonomously and collaboratively in a fast-paced, cross-functional environment. A holistic view of quality , considering everything from testability and observability to scalability and resilience. Ideal Background Degree in Computer Science, Engineering, or a related field. Proven experience in quality engineering roles with a focus on continuous improvement and cross-team More ❯
Manchester, Lancashire, England, United Kingdom Hybrid / WFH Options
Circle Recruitment
solid understanding of quality engineering principles. Ability to work autonomously and collaboratively in a fast-paced, cross-functional environment. A holistic view of quality , considering everything from testability and observability to scalability and resilience. Ideal Background Degree in Computer Science, Engineering, or a related field. Proven experience in quality engineering roles with a focus on continuous improvement and cross-team More ❯
Cardiff, South Glamorgan, Wales, United Kingdom Hybrid / WFH Options
Circle Recruitment
solid understanding of quality engineering principles. Ability to work autonomously and collaboratively in a fast-paced, cross-functional environment. A holistic view of quality , considering everything from testability and observability to scalability and resilience. Ideal Background Degree in Computer Science, Engineering, or a related field. Proven experience in quality engineering roles with a focus on continuous improvement and cross-team More ❯
London, South East, England, United Kingdom Hybrid / WFH Options
Circle Recruitment
solid understanding of quality engineering principles. Ability to work autonomously and collaboratively in a fast-paced, cross-functional environment. A holistic view of quality , considering everything from testability and observability to scalability and resilience. Ideal Background Degree in Computer Science, Engineering, or a related field. Proven experience in quality engineering roles with a focus on continuous improvement and cross-team More ❯
best practices It will also help you to have Experience establishing and enforcing data governance standards through technical architecture (not just documentation) Familiarity with data cataloging, metadata management, and observability tools A systems-thinking mindset-you understand the full data lifecycle and how to maintain integrity from source to dashboard At Booksy, we believe in the power of well-structured More ❯
and know your way around Node.js backend frameworks You have solid experience designing and maintaining APIs , background workers, or async processing systems You have experience with performance optimization and observability You're comfortable working with infra basics (Docker, GCP, CI/CD) You care about code quality and testing What we offer Monthly subsidy programme: Different people have different needs More ❯
environments Real world experience delivering data quality management and data profiling Broad understanding of database designs, schema designs and data mapping Experience with tools supporting data management (governance, quality, observability, analytics) Excellent written and verbal communication, with the ability to develop clear requirements and specifications and communicate complex technical information to both technical and non-technical colleagues Excellent people skills More ❯
prototype new applications of AI for the construction domain, pushing the boundaries of what's possible Build core infrastructure that allows us to build LLM apps quickly - this includes observability, how we work with several LLM providers + our own fine tuned models Work with ML engineers and data scientists in our research team to bring new models and applications More ❯
innovative retail environments. With the ability to leverage the latest Cloud technology, techniques and thinking to build our business. Supporting the d esign and evolution of Asda's enterprise observability solutions including Application Performance Monitoring, Best Practice Logging, Monitoring and Alerting, Enterprise Dashboarding, Defining, incident management processes from a technical perspective - tooling, integrations and automation, including evolving the current solution More ❯
on, contributing production code while guiding architectural decisions and mentoring the team Mentor and elevate: Grow engineering maturity through technical coaching, thoughtful code reviews, and driving best practices in observability, reliability, and scale Shape product direction: Work cross-functionally with product managers, researchers, and designers to translate customer problems into impactful technical solutions Scale voice infrastructure: Build systems that meet More ❯
skills and experience (ideally Python, and/or Rust, Go, Kotlin, Java, etc) Sound technical knowledge, ideally across multiple technical competencies and levels (e.g APIs, networking, databases, security, compliance, observability, architecture) Excellent communication skills (written, graphical, remote, in-person, presentation, one:one, one:many) with the ability to engage, influence, and inspire stakeholders and colleagues to drive collaboration and alignment More ❯
They will be a strong communicator, and may have previously worked in an SRE role, a software engineering role or a systems engineering role. Key Responsibilities: Participate in building observability, monitoring and alerting for key services - continuously improving our SLI & SLOs and observability data enabling faster issue detection and incident resolution Collaborate with senior engineers and product teams to ensure More ❯
a hybrid multi-cloud (AWS, Azure, GCP) and on-premises ecosystem. This is your opportunity to drive modernisation, steer architecture, and be a hands-on force across infrastructure, automation, observability, and security. What You'll Do Architect and build modern hosting environments from the ground up, with a focus on observability, security, and infrastructure-as-code. Lead infrastructure design & technical More ❯
to deployment and maintenance. • Ensure on-time delivery by identifying and mitigating risks early. • Champion CI/CD practices and ensure smooth, automated deployment pipelines. 5. Reliability, Security, and Observability • Own the uptime, latency, and performance SLAs of financial APIs and services. • Proactively monitor risk vectors and enforce observability via metrics, logging, and alerting. • Work with DevSecOps to embed security More ❯