London, South East, England, United Kingdom Hybrid / WFH Options
Robert Half
Java, Python, or similar). Hands-on experience building and maintaining CI/CD pipelines (Azure DevOps, GitHub Actions, Jenkins, or similar). Strong understanding of monitoring, logging, and observability tools (e.g., AppInsights, ELK, Prometheus, Grafana). Solid knowledge of test-driven development and experience embedding TDD in automated delivery workflows. Experience working directly within software development teams to support More ❯
design and evolution of our API schemas, ensuring they meet the complex demands of a rapidly growing platform. Champion best practice in code quality, automated testing (Vitest, Playwright) and observability to deliver resilient, maintainable, and production-ready business logic. Drive DevOps excellence by collaborating on CI/CD pipelines (Jenkins, Concourse), containerisation (Docker) and Kubernetes deployments. Mentor and empower fellow More ❯
through coaching, recruitment, and career development aligned with DDaT frameworks. Excellent development skills, with a depth of experience including C#, Java (Spring Boot, JPA/Hibernate), REST API's, observability and monitoring, queue technologies and security. Detailed knowledge of best practices such as SOLID principles Experience of building new and evolving microservices with emphasis on high availability and data integrity. More ❯
London, South East, England, United Kingdom Hybrid / WFH Options
Oscar Technology
. Lead migration projects and modernisation of legacy systems. Ensure compliance with security and regulatory standards in sensitive industries. Provide senior-level support, troubleshooting, and performance monitoring with modern observability tools. Mentor and support junior engineers, collaborating with global teams. Required: Proven full-stack development expertise Node.js, TypeScript, JavaScript, and C#, .NET. Hands-on experience with cloud platforms (Azure preferred More ❯
models. Desirable: Master's degree or DAMA certification (CDMP).Proven expertise with Workday product and data integrations. Experience across different database architectures (Lakehouse, Relational, Document Store). Familiarity with observability tools, BI platforms (Power BI), or Agile methods. Knowledge of advanced analytics, data science, or NLP techniques. Soft Skills Strong stakeholder management and communication skills, able to bridge technical and More ❯
mindset, from commit to production Collaborate directly with end-users and internal teams to understand needs and deliver value Operate across multi-cloud environments (AWS, GCP, Azure) Drive system observability and reliability with tools like Datadog Help shape our engineering culture by mentoring, sharing knowledge, and encouraging best practices Push boundaries, challenge assumptions, and ensure delivery of meaningful solutions Tech More ❯
using GCP-native tools and technologies. * Develop capabilities which allow Platform Engineering teams to operate with a DevOps ethos. * Collaborate with development teams to optimize application performance, reliability, and observability on GCP. * Implement and enforce Service Level Objectives (SLOs) and Error Budgets to ensure a balance between reliability and feature development. * Develop and maintain a comprehensive monitoring and alerting platform More ❯
using GCP-native tools and technologies.* Develop capabilities which allow Platform Engineering teams to operate with a DevOps ethos.* Collaborate with development teams to optimize application performance, reliability, and observability on GCP.* Implement and enforce Service Level Objectives (SLOs) and Error Budgets to ensure a balance between reliability and feature development.* Develop and maintain a comprehensive monitoring and alerting platform More ❯
communication skills, able to engage both technical and non-technical stakeholders Leadership experience within data teams Desirable DAMA certified (CDMP) Knowledge of Lakehouse and other database architectures Familiarity with observability principles and BI tools (e.g. Power BI) Experience working in Agile environments More ❯
an initial 6 month contract. You'll be primarily responsible for working in a team that designs, builds, and maintains the organisations cloud infrastructure, with a focus on automation, observability and scalability. Essential skills/experience required: AWS Infrastructure as code using Terraform Cloudflare Developing CI/CD pipelines Incredibly beneficial: Snowflake MLOps Security best practices The role is confirmed More ❯
of student lifecycle processes in Higher Education and relevant data domains. Knowldge of event-driven and message-based architectures (Event Hub, Kafka, or Service Bus) Experience with monitoring and observability tools like Azure Monitor, Application Insights, and Log Analytics. Awareness of data security, GDPR, and compliance in educational or public sector environments. Exposure to OpenAPI/Swagger, API lifecycle management More ❯
and analysis. Dashboarding with Looker Studio or AWS QuickSight . Proven delivery in agile , multi-disciplinary teams; translating insight into roadmap decisions. Desirable Python for analysis; dbt or similar. Observability for LLM/agentic systems (prompt/version telemetry, HITL oversight). Solid statistics for experimentation and practical significance. Please apply for further details. More ❯
Selenium, Puppeteer). Orchestrate pipelines using Airflow, and manage data quality workflows. Model and transform data in SQL and Snowflake to create clean, analytics-ready datasets. Ensure data quality, observability, and governance across workflows. Collaborate closely with product managers, analysts, and engineers to deliver high-quality data products for dashboards and reporting. Technical Skills & Experience We're looking for candidates More ❯
London, South East, England, United Kingdom Hybrid / WFH Options
Lorien
growing to meet our business needs. What you'll lead: Shape and evolve the backend technical architecture to support product scale and complexity Identify and drive improvements in performance, observability, and infrastructure Lead the design of domain models aligned with evolving business needs Be a go-to person for backend excellence, and improve code quality Engineering centric requirement definition (user More ❯
London, South East, England, United Kingdom Hybrid / WFH Options
Harnham - Data & Analytics Recruitment
solutions Essential Skills & Experience: Proven experience in Python/Django Solid frontend skills with React & TypeScript Exposure to Ruby on Rails (nice to have) Experience with distributed systems and observability tools Ability to design solutions across multiple services Strong analytical problem-solving skills and clear communication Nice-to-Have Skills: Next.js Internal tooling for operational teams Experience in highly collaborative More ❯
technical proficiency in: Languages: Java 17+ (Java 21 preferred) Frameworks: Micronaut (preferred), Spring Boot Testing: JUnit, Mockito Build Tools: Gradle Data & Messaging: Kafka, MongoDB APIs: GraphQL Federation, REST Infrastructure & Observability: Terraform, OpenTelemetry, Dynatrace Please get in touch asap for a chance to work on this amazing project. More ❯
Lead Developer 6 Months Hybrid -1/3 days a month in office, either London or Bristol £750 Overview: Working within an agile digital delivery team developing and supporting a mission critical application for the UK client , with instances hosted More ❯
We are seeking a highly experienced Splunk ITSI Expert with 10+ years in observability to enhance our monitoring and analytics capabilities. Key Responsibilities: Design and implement advanced monitoring strategies using Splunk IT Service Intelligence (ITSI). Create service models, define KPIs, and build glass tables to visualize key business services. Utilize Splunk ES for security event monitoring and correlation searches. … Automate tasks and integrate systems using Python, Shell, or Perl scripting. Perform root cause analysis and anomaly detection by analyzing complex log data. Requirements: 10+ years experience in observability, with deep expertise in Splunk, especially ITSI. Proficiency in Scripting (Shell/PowerShell/Python). Strong understanding of Load Balancers such as F5, Netscaler, and AWS ELB. Hands-on experience More ❯
to support low latency applications. * Practical and working knowledge of IP networking and data flow within distributed systems. * Experience integrating ELK with packet capture/analysis tools to enhance observability of real-time systems. * Hands-on expertise in data pipeline creation, ingestion strategies, and performance tuning of Logstash and Beats for scalable telemetry. * Knowledge of Corvil and/or Pico … a deep understanding of the demands of high-frequency and algorithmic trading environments. * Working knowledge of security best practices, including RBAC, TLS, and audit logging in Elasticsearch. * Experience with observability platforms such as ITRS Geneos and their integration with ELK is a strong plus. * Comfortable with multi-site replication, cross-cluster search, and disaster recovery configurations for global deployment. * Strong … understanding of Linux systems, containers, and cloud-native observability stacks. * Organized, self-driven, and able to manage priorities in a dynamic, high-performance trading environment. More ❯
with the Enterprise Monitoring & Alerting (EMAS) team to deliver a transformative initiative aimed at maximising Dynatrace capabilities. We are looking for a skilled Dynatrace Admin/Consultant to enable observability across complex, hybrid cloud environments. Essential Skills: To thrive in this role, you must demonstrate extensive experience in designing and configuring within Dynatrace, including: Application Performance Monitoring Proficiency in Dynatrace … assets for monitoring. Work with EMAS to analyse Dynatrace coverage of these critical assets. Identify opportunities for enhancement in monitoring configurations across crucial applications. Review roles and responsibilities concerning observability and propose improvements focused on Operational Resilience. Contribute to establishing an automated end-to-end business flow for key business processes within the Dynatrace toolset. Ensure optimal alerting configurations in … collaboration with Application Stewards and SREs. Participate in workshops with third-party software suppliers to review observability standards. Bonus Points: Skills in correlating events across the full stack for root cause analysis. Key Attributes: Ability to manage competing priorities in a fast-paced environment. Flexibility and a pragmatic approach to problem-solving. A delivery-oriented mindset coupled with a can More ❯
Per Day Inside IR35 To apply, email: THE OPPORTUNITY We are looking for a Dynatrace Subject Matter Expert (Data Resilience). You'll play a critical role in improving observability, resilience, and performance monitoring across hybrid cloud environments using the Dynatrace platform. THE ROLE Collaborate with Application Stewards & Site Reliability Engineers (SREs) to confirm monitoring requirements for critical assets. Analyse … optimise, and automate monitoring within the Dynatrace toolset. Provide AI-driven insights (via Davis AI) for anomaly detection, root cause analysis, and proactive recommendations. Support operational resilience by embedding observability standards and best practices. Engage in workshops with third-party suppliers to review and improve observability standards. TECH STACK/REQUIREMENTS Strong expertise in Dynatrace (SaaS & On-Premises) including: Application … Anomaly Detection Profiles Alerting Rules & Profiles Synthetic & Log Monitoring Real User Monitoring (RUM) DQL & Grail for advanced data analytics API integrations with complex systems Experience working in resilience/observability engineering. Strong communication skills and ability to work in high-pressure environments. Flexible, pragmatic, and delivery-focused with a can-do attitude. Experience in Financial Services highly desirable Hands on More ❯
London, South East, England, United Kingdom Hybrid / WFH Options
Harnham - Data & Analytics Recruitment
do Deliver full-stack features from concept to deployment using Python/Django and React/TypeScript Build and enhance dashboards, admin panels, APIs, and backend processes Improve system observability, monitoring, and debugging capabilities Work in a collaborative, agile team with 2-week sprints and daily stand-ups Engage in discovery work and adapt quickly to shifting priorities Tech stack … Must have: Python, Django, React, TypeScript Nice to have: Ruby on Rails, Next.js, distributed systems experience, observability tooling What we're looking for Strong experience delivering both frontend and backend features Ability to translate requirements into clean, maintainable, and scalable code Excellent communication skills in a remote-first setting Analytical approach to solving technical problems Contract details Length: 3 months More ❯
goal is to ensure comprehensive End-to-End Monitoring coverage across the Group's critical applications. As a Senior Dynatrace Consultant, you will play a vital role in enabling observability within complex hybrid cloud environments. Key Responsibilities: Collaborate with Application Stewards and Site Reliability Engineers (SREs) to identify critical assets for monitoring verification and enhancement. Analyze Dynatrace coverage of critical … assets in partnership with the EMAS team. Identify and recommend enhancements to monitoring configuration and capabilities across key applications. Review roles and responsibilities among teams for observability and suggest improvements with a focus on Operational Resilience. Facilitate the establishment of an automatically maintained end-to-end business flow for significant processes within the Dynatrace toolset. Ensure optimal and purpose-fit … alerting configuration in collaboration with Application Stewards and SREs. Engage in workshops with third-party software suppliers to assess observability standards. Proven experience in designing and configuring: Anomaly Detection Profiles Application Performance Monitoring Alerting Rules and Profiles Synthetic Monitoring Log Monitoring Real User Monitoring (RUM) for web and mobile applications Utilization of Dynatrace Query Language (DQL) and Grail for advanced More ❯