using modern, agile development practices like code review, TDD, CI/CD and pairing using tools like Git and GitHub. Experience of operationally managing software components once live, including observability, logging, metrics, error reporting, debugging and live incident management. Experience of working with sensitive personal data. Competitive salary starting from £85,000 Generous Pension Scheme - We invest in your future More ❯
influencing at all levels. A mindset focused on long-term sustainability and strategic technical thinking. Bonus Points For Fintech or regulated environment experience, particularly investment platforms. Familiarity with modern observability stacks and incident response processes. Experience with security-first architecture and data protection best practices. Why Join? Well-Backed & Ambitious: Backed by a globally recognised financial group with significant investment More ❯
Ansible, Puppet, Chef, or Chocolatey for application-level configuration management. Writing scripts and pipelines to support lift-and-shift deployments and prepare environments for future use. Supporting monitoring and observability using tools like Splunk, Cloudability, Wiz, AWS Systems Manager, and CloudFormation. Collaborating with cloud infrastructure and OT stakeholders to ensure on-premise workloads are mirrored effectively in AWS. Contributing to More ❯
solutions meet business needs. Experience with data ingestion tools, like Fivetran. Advantageous Exposure to deploying applications with Kubernetes. Experience with Data Orchestrator tools (Airflow, Prefect, etc.) Experience with Data Observability tools (Monte Carlo, Great Expectations, etc.) Experience with Data Catalog tools (Amundsen, OpenMetadata, etc.) Interview Process Call with the talent team Take home task Tech interview CPTO interview Life at Lendable More ❯
Exposure to site reliability engineering: root cause analysis, in-production troubleshooting, on-call rotations. • Exposure to infrastructure management: CI/CD, containerization, orchestration, infra-as-code, monitoring, logging, alerting, observability. • Technical product mindset (e.g. understanding how to debug poor adoption). • Excellent problem-solving and communication skills (ability to contextualize, gauge risks and get buy-in for high stakes More ❯
with cross-functional stakeholders including the Data Platform team and Engineering teams. Design and maintain reliable, scalable cloud infrastructure (primarily AWS). Drive key initiatives involving container orchestration (Kubernetes), observability, security, and CI/CD. Establish best practices in platform engineering and foster a servant-leadership culture focused on empathy, empowerment, and collaboration. Work with your peers and colleagues at More ❯
Engineers to understand problems, analyse requirements & deliver solutions that enhance engineering productivity Write code for low latency, highly available and scalable solutions Contribute to delivering initiatives to improve system observability, incident response processes and operational efficiency Continually update technical knowledge and skills using internal training as well as taking time to self-develop utilising external sources Champion a culture of continuous More ❯
ElastiCache Familiarity with modern CI/CD platforms – ideally GitLab, but GitHub Actions or CircleCI also welcome Proficiency in testing frameworks like JUnit and RestAssured A passion for monitoring, observability, and maintaining resilient systems Desirable Skills: Experience with monitoring and alerting tools like Datadog, Prometheus, Grafana, or PagerDuty Exposure to Python scripting Familiarity with deployment platforms such as Kubernetes and More ❯
learning, knowledge sharing and continuous improvement. You have a passion for DevOps and Platform as a Service. Understanding of security and compliance requirements related to platform infrastructure. Experience with observability practices and tooling, incident management processes and driving operational excellence. Diversity, Equity and Inclusion If you're excited about this role but your experience doesn't align perfectly, we encourage More ❯
expertise across Postgres, Redis, InfluxDB, and ClickHouse: schema design, indexing, and caching for sub-second reads. Experience deploying microservices in production using Docker and Kubernetes. Skilled in setting up observability and alerting pipelines (Prometheus, Grafana), including model drift detection. Experience with real-time ML inference and model serving frameworks (e.g., TorchServe, Triton, BentoML) for low-latency applications. Experience designing feedback More ❯
scripting skills in SQL, Python, or Bash for automation and tooling. Solid understanding of Snowflake security features such as data masking, encryption, identity federation, and network policies. Familiarity with observability practices, including query profiling, usage tracking, and integration with monitoring tools. Demonstrated ability to optimise performance and manage costs in large-scale Snowflake environments. Excellent collaboration and communication skills, with More ❯
environment. Deep expertise in backend development (Python preferred), cloud infrastructure (GCP/AWS), and system design. Strong understanding of modern software development best practices, including CI/CD, containerization, observability, and microservices. Experience working closely with Product teams to align technical decisions with business priorities. Excellent communication and stakeholder management skills, with the ability to translate technical complexity into business More ❯
and/or React Native mobile apps. Regularly releasing working software, using trunk-based development, automated test suites, and infrastructure-as-code principles. Incorporating requirements such as performance, resilience, observability, maintainability, security and accessibility. Collaborating with other disciplines, building effective working relationships. With your team, achieving a shared understanding of user needs, Kooth commercial and operational goals. Deepening knowledge of More ❯
Snowflake. Understanding of testing strategies, including unit, integration, and system testing (TDD/BDD is a plus). Experience with CI/CD pipelines, monitoring tools, and production-grade observability practices. Strong problem-solving skills, especially when dealing with data integrity, scale, and operational complexity. Comfortable working independently and navigating ambiguity, especially when translating regulatory or compliance needs into technical More ❯
infra-as-code (Terraform), rollout, and user training. Collaborate directly with executives & operators: run white-boarding sessions, turn ambiguous requirements into concrete specs, demo weekly, and iterate fast. Champion observability & reliability: instrument services with OpenTelemetry, define SLIs/SLOs, and automate incident response. Contribute across the stack: build lightweight front-ends when needed and pair with ML engineers on inference More ❯
native architecture patterns Expertise in Kubernetes (specifically EKS) and K8s-native tooling (e.g. Helm) Comfortable in a coding and scripting language Proficiency with modern tracing, metrics and related observability tooling, e.g. OpenTelemetry Expertise in creating strong CI/CD pipelines and adapting to the feedback you gain from them, working with GitOps Experience working in a security-conscious More ❯
running on Java 21. We're in the process of moving our backend services to Spring Boot. We've invested heavily in our Datadog integration to bring world-class observability and monitoring to our systems. We've recently moved to GitLab and are currently building out our next generation of automated deployment pipelines. We've incorporated some of the best More ❯
Experience with major public cloud platforms, including Google Cloud Platform (GCP), AWS, and Azure Strong understanding of networking technologies, such as LAN, WAN, firewalls, and related infrastructure Proficient with observability and monitoring tools, e.g. Grafana, SolarWinds, Prometheus, AWS CloudWatch, Splunk Familiarity with DevOps practices, including CI/CD pipelines, is beneficial If you would be interested in having a further More ❯
using GCP-native tools and technologies. * Develop capabilities which allow Platform Engineering teams to operate with a DevOps ethos. * Collaborate with development teams to optimize application performance, reliability, and observability on GCP. * Implement and enforce Service Level Objectives (SLOs) and Error Budgets to ensure a balance between reliability and feature development. * Develop and maintain a comprehensive monitoring and alerting platform More ❯
product teams to define and deliver integration solutions - Troubleshoot and resolve issues such as data inconsistencies, auth errors, and performance bottlenecks - Monitor integration performance and implement logging, alerting, and observability - Document architecture, workflows, and integration processes - Contribute to continuous improvement of integration tools and practices What You Bring: - Proven experience building backend services using Node.js, Python, Java, or similar - Strong More ❯
secure, scalable cloud data solutions, aligning with business and compliance needs. Key Responsibilities Design, build, and maintain cloud-native data pipelines (Azure/GCP) Implement scalable data management frameworks: observability, validation, lineage Translate business needs into effective technical prototypes and solutions Collaborate with stakeholders, data teams, and service partners Ensure data security, governance, and regulatory compliance Monitor and optimise cloud More ❯
London, South East, England, United Kingdom Hybrid / WFH Options
INTEC SELECT LIMITED
with infrastructure teams to ensure system reliability and operational efficiency Integrate monitoring and logging solutions (e.g., Prometheus, Grafana, ELK) Define strategies for disaster recovery, scaling, and infrastructure resilience Improve observability by enhancing visibility into performance and error metrics Skills and Experience Required 10+ years of backend development experience, including 5+ years in an architectural or engineering leadership role Proven experience More ❯