and a strong belief in quality-first development. CI/CD Experience: Youve shipped code using GitHub Actions (or similar), and you know how to keep the pipeline green. Observability & Reliability: Experience with monitoring and alerting tools like DataDog. Proactive Problem-Solver: You take initiative, work independently, and thrive on solving complex technical challenges. Interview Process We want our process More ❯
Group's data strategy , enabling the intelligent use of mobility, behavioural, and payment data to unlock new product and commercial opportunities. Ensure platform reliability, performance, and scalability through robust observability, incident response processes, performance testing, and fault-tolerant architecture. Partner with Security, Compliance, and Infrastructure teams to meet regulatory and certification standards (e.g., PCI DSS, TISAX, ISO 27001), and embed More ❯
also play a part in mentoring other developers including more junior colleagues to impart knowledge and build their skills. Write high quality documentation and implement user & system metrics and observability as you go to continually learn, assess and improve 9fin's platforms and products. Our Tech Stack React via Typescript & Vite React query Jest, React Testing Library, Playwright Production workloads More ❯
the Groups data strategy, enabling the intelligent use of mobility, behavioural, and payment data to unlock new product and commercial opportunities. Ensure platform reliability, performance, and scalability through robust observability, incident response processes, performance testing, and fault-tolerant architecture. Partner with Security, Compliance, and Infrastructure teams to meet regulatory and certification standards (e.g., PCI DSS, TISAX, ISO 27001), and embed More ❯
standards and regulatory requirements. This includes secure coding practices, dependency management, access controls, and auditing where required. DevOps and Tooling Strategy - integrating development with operational considerations, including automation, monitoring, observability, and incident response. Guides selection and configuration of tools supporting the build-test-release cycle. Technology Strategy and Stack Selection - evaluating and selecting appropriate technologies, platforms, and frameworks. Aligns choices More ❯
CI/CD pipelines (Jenkins, GitHub Actions) Define and enforce platform standards across environments (dev, staging, prod) Collaborate with developers and DevOps on deployment tooling and security Enable platform observability using tools like Datadog, Prometheus, and CloudWatch Maintain Helm charts and Terraform modules for shared infrastructure Contribute to onboarding documentation and platform adoption practices Participate in incident response and postmortem … containerisation using Docker and secure image management Scripting or programming experience in Bash, Python, or TypeScript Strong understanding of GitOps practices and infrastructure lifecycle management Desirable Skills Experience with observability tooling (Datadog, Prometheus, Fluent Bit) Knowledge of admission controllers, OPA/Gatekeeper (optional for governance) Familiarity with cloud cost optimisation and Kubernetes scaling strategies Exposure to security scanning tools (tfsec More ❯
shift-left and introduce automation. You will represent the support function on Change Advisory Boards and incident management calls. You will be introducing single-pane-of-glass and transparent observability, monitoring and alerting in a continual improvement regime If Customer orientated, cloud first, cutting edge true IAC, automated deployment, detection and reporting is your wheel-house - this is the role … terraform, and other tools. Essential Knowledge & Experience: - 5 years of cloud engineering experience - Fluency with Kubernetes, Docker, Terraform, Python - Strong CI/CD and Linux administration skills - Experience in observability and performance tuning - Pragmatic, team-first approach - Confidence inworking with stakeholders (Engineering, Test and Project Management teams) - Able to build and represent ITIL changes at CABs - Ability to collaborate and More ❯
Milton Keynes, Buckinghamshire, England, United Kingdom
IT Talent Solutions Ltd
Architect Expert, DevOps Engineer Expert). Experience in enterprise-scale environments or regulated industries. Exposure to hybrid cloud models, legacy system integration, and cloud migrations. Familiarity with monitoring and observability tools such as Azure Monitor, Application Insights, or Log Analytics. More ❯
Employment Type: Full-Time
Salary: £50,000 - £70,000 per annum, Negotiable, Inc benefits
Telford, Shropshire, United Kingdom Hybrid / WFH Options
Experis - ManpowerGroup
proactive incident management. Key Skills/requirements Translate high-level monitoring non-functional requirements (NFRs) into actionable configurations across tools such as Splunk, Dynatrace, and AppDynamics. Deliver full-stack observability solutions, including application-aware network performance monitoring (NPM), synthetics, log analytics, and infrastructure metrics. Provide live support for monitoring technologies and assist with live service support, including key business events More ❯
Telford, Shropshire, United Kingdom Hybrid / WFH Options
Experis
proactive incident management. Key Skills/requirements Translate high-level monitoring non-functional requirements (NFRs) into actionable configurations across tools such as Splunk, Dynatrace, and AppDynamics. Deliver full-stack observability solutions, including application-aware network performance monitoring (NPM), synthetics, log analytics, and infrastructure metrics. Provide live support for monitoring technologies and assist with live service support, including key business events More ❯
Wellington, Shropshire, United Kingdom Hybrid / WFH Options
Experis
proactive incident management. Key Skills/requirements Translate high-level monitoring non-functional requirements (NFRs) into actionable configurations across tools such as Splunk, Dynatrace, and AppDynamics. Deliver full-stack observability solutions, including application-aware network performance monitoring (NPM), synthetics, log analytics, and infrastructure metrics. Provide live support for monitoring technologies and assist with live service support, including key business events More ❯
processes to facilitate the work of the development team Build continuous integration, continuous delivery, and continuous deployment (CI/CD) pipelines for software delivery Set up operational tooling and observability infrastructure Identify security risks and deploy measures to mitigate them Identify problems and take steps to resolve and remove them Automate and improve development and release processes Your skills and More ❯
London, South East, England, United Kingdom Hybrid / WFH Options
Lorien
growing to meet our business needs. What you'll lead: Shape and evolve the backend technical architecture to support product scale and complexity Identify and drive improvements in performance, observability, and infrastructure Lead the design of domain models aligned with evolving business needs Be a go-to person for backend excellence, and improve code quality Engineering centric requirement definition (user More ❯
London, South East, England, United Kingdom Hybrid / WFH Options
Harnham - Data & Analytics Recruitment
teams Nice to Have Kubernetes experience for deploying scalable ML services Data visualisation skills for presenting model outputs Knowledge of authentication/security best practices Experience with monitoring and observability tools for distributed systems Interested? If you want to join a world-class AI team at an early stage-where your work will directly shape the future of enterprise-scale More ❯
F# are welcome) Proven track record of building and scaling distributed backend systems Solid understanding of infrastructure-as-code and cloud orchestration (AWS, Terraform, Docker) Familiarity with queue management, observability tooling, and shipping in fast-paced environments Awareness of GenAI and prompt engineering, or a keen interest to develop expertise in this area A self-starter attitude, with a strong More ❯
control (Git/GitHub/GitLab) Experience with modern CI/CD tools and techniques Desirable skills: Experience in BDD and creating tests using Gherkin syntax Experience working with observability and monitoring tools such as Firebase or similar Experience of non-functional testing in a mobile environment Equal Opportunities Statement At AND Digital we embrace diversity and are committed to More ❯
London, South East, England, United Kingdom Hybrid / WFH Options
Become
Experience working within Agile delivery frameworks and high-accountability environments. Desirable Attributes Experience with design systems and component libraries (e.g., Material, Storybook). Familiarity with performance monitoring and frontend observability tools. Exposure to DevOps practices and CI/CD pipelines. Prior consulting experience or experience in client-facing roles. Engagement Model Initial contract engagement with a clear pathway to permanent More ❯
London, South East, England, United Kingdom Hybrid / WFH Options
Reed.co.uk
Model Deployment: MLOps, scaling, data engineering, and AI ethics awareness. Strategic Mindset: Ability to align technology with business objectives and cost efficiency. Security & Compliance Knowledge: GDPR, API authentication, and observability best practices. Big data processing: Understanding data lakes, warehouses, and tools like Spark, Kafka, and Airflow. ETL Pipelines: Ability to evaluate data ingestion, transformation, and cleaning processes. DevOps & CI/ More ❯
Only) - Architect and Develop Internal Security Tooling: Design, implement, and evolve secure systems architecture that supports the Regulated Industries internal tooling platform. Leverage automation and intelligent workflows to enhance observability, policy enforcement, and team productivity. - Implement AI-Powered Security Workflows: Integrate AI/ML agents-including Microsoft Security Copilot and LLM-based copilots-into operational pipelines for tasks such as More ❯
Telford, Shropshire, West Midlands, United Kingdom
LA International Computer Consultants Ltd
insight, and proactive incident management. Key Responsibilities: * Translate high-level monitoring non-functional requirements (NFRs) into actionable configurations across tools such as Splunk, Dynatrace, and AppDynamics. * Deliver full-stack observability solutions, including application-aware network performance monitoring (NPM), synthetics, log analytics, and infrastructure metrics. * Provide live support for monitoring technologies and assist with live service support, including key business events More ❯
security, reliability, and compliance of our solutions, focusing on both functional and non-functional needs. • work closely with the global team to further develop and innovate the platform - building observability into our solutions, monitor production health, help to resolve incidents, and remediate the root cause of risks and issues. • share knowledge and expertise with colleagues, help with hiring, and contribute More ❯
best practices. Ensure solutions align with technical strategy and business goals. Work in small, valuable increments using agile practices and modern tools. Monitor, debug, and improve system performance with observability tools. You'll love this role if You have 2+ years of Ruby development experience, with good knowledge of Ruby on Rails in production, and want to grow your expertise. More ❯
build-and-load programs up to browser extensions and web applications. Develop software to analyse and interpret cryptocurrency usage behaviours and trends on the clear and dark web Implement observability mechanisms (we use DataDog) to detect problems in your environment(s), and run the associated business processes to resolve Work with the existing engineers on your team to foster their More ❯
build-and-load programs up to browser extensions and web applications. Develop software to analyse and interpret cryptocurrency usage behaviours and trends on the clear and dark web Implement observability mechanisms (we use DataDog) to detect problems in your environment(s), and run the associated business processes to resolve Work with the existing engineers on your team to foster their More ❯
Maidenhead, Berkshire, United Kingdom Hybrid / WFH Options
dynaTrace software GmbH
required to accommodate global time zones Hybrid or remote considered Package to include bonus and equity Why you will love being a Dynatracer Dynatrace is a leader in unified observability and security. We provide a culture of excellence with competitive compensation packages designed to recognize and reward performance. Our employees work with the largest cloud providers, including AWS, Microsoft, and More ❯