technical leader responsible for the reliability, scalability, and security of the entire GEEIQ platform. You'll tackle our biggest infrastructure challenges, from scaling our Kubernetes clusters to maturing our observability stack and refining our deployment pipelines. We are looking for an experienced and pragmatic engineer who is passionate about building robust, automated, and secure systems. You will work alongside our … CD pipelines in GitHub Actions to make them faster, more reliable, and more secure. Champion developer productivity by building tools, automating workflows, and reducing friction in the development lifecycle. Observability & Reliability (SRE) Lead the charge on improving our observability strategy. Design and implement a robust monitoring, logging, and alerting framework using tools like Grafana, Prometheus, and native AWS services. Enhance … and security. Demonstrated ability to design, build, and significantly improve CI/CD pipelines, with specific experience in GitHub Actions. A strong track record of building out and improving observability stacks (monitoring, logging, tracing). Experience implementing security controls and working within compliance frameworks (experience with SOC2 is a major plus). Proven ability to mentor and collaborate with other More ❯
architectures across Azure, AWS, and Google Cloud Leading platform engineering squads using DevSecOps, Kubernetes, and automation tooling Enabling edge and private cloud capabilities (e.g., Azure Stack, AWS Outposts) Implementing observability and governance tooling to support modern operations Supporting Agile and product-based delivery using SRE, CI/CD, and Infrastructure as Code Advising clients on architecture optimisation, security, cost control More ❯
automation, scalability, and high reliability. A strong working knowledge of Microsoft Azure is essential. The role involves daily coding and technical leadership across orchestration, CI/CD pipelines, cloud services, observability, and security, working alongside site reliability, onboarding, architecture, and delivery functions. You're expected to scale impact through others by upskilling team members, hiring where needed, and championing platform engineering More ❯
new infrastructure and services in line with internal security, operational, and performance standards Automate recurring tasks and develop tooling that improves visibility and consistency across environments Manage monitoring and observability tooling to ensure proactive incident response Participate in an on-call rota to support incident handling and resolution Produce high-quality documentation and technical diagrams Requirements: Strong experience administering Linux More ❯
Reigate, Surrey, South East, United Kingdom Hybrid / WFH Options
Client Server
of IaC principles and tools such as Terraform and Pulumi You have experience of building and improving CI/CD pipelines for product teams You have experience with cloud observability (logging, tracing, metrics, monitoring and alerting) You have experience with Containerisation - Azure Container Apps preferred You have strong scripting skills with PowerShell and/or C# .Net coding You enjoy More ❯
through coaching, recruitment, and career development aligned with DDaT frameworks. Excellent development skills, with a depth of experience including C#, Java (Spring Boot, JPA/Hibernate), REST APIs, observability and monitoring, queue technologies and security. Detailed knowledge of best practices such as SOLID principles. Experience of building new and evolving microservices with emphasis on high availability and data integrity. More ❯
workflows. Implement robust monitoring, alerting, and incident response processes to maintain high levels of system reliability and uptime. Continuously assess and integrate new tools and technologies to enhance automation, observability, and scalability. Drive platform automation across provisioning, deployments, security controls, and operational workflows Proven experience in a DevOps or platform engineering role, ideally within a fast-paced or regulated environment. More ❯
Fi authentication systems, CRMs and partnered PropTech tools Continually hone and perfect our homegrown DevOps and CI/CD processes by further developing GitHub Actions pipelines, Terraform definitions and observability integrations. Ensure quality & reliability: establish testing best practices (unit, integration, end-to-end), conduct code reviews and demand high quality standards Shape and refine our cloud-native platform to optimise More ❯
such as Docker, ECS, or Kubernetes Solid programming skills in one or more languages (e.g., Java, Python, TypeScript) Experience in designing and implementing CI/CD pipelines Familiar with observability tools, logging frameworks, and performance monitoring Background in serverless technologies (e.g., Lambda, Step Functions, API Gateway) Experience with data tools like EMR, Glue, or Apache Spark Understanding of event-driven More ❯
Caldecotte, Milton Keynes, Buckinghamshire, England, United Kingdom
Connells Group HQ
day-to-day and strategic decision making. You will be a hands-on and customer-focused engineering servant-leader. You will be comfortable moving across orchestration, automation, pipelines, cloud services, observability and security domains (even if you are not an expert in them all). A non-negotiable is experience and familiarity with Microsoft Azure. You will play your part in operating More ❯
technical considerations related to the rapid developments in tech Ensure high-quality code and best practices. Write clean, maintainable and efficient code and ensure code quality through TDD and observability practices Develop RESTful APIs using FastAPI and Pydantic Work with SQL and NoSQL databases, as well as ORM tools like SQLAlchemy and SQLModel Participate in Agile XP methodologies like pair More ❯
/IP, VLANs, routing). You will bring some of these skills, but more importantly you're interested in learning these things: • Hardware & physical infrastructure. • Data-driven monitoring and observability (Grafana, InfluxDB, Prometheus, Elastic). • Exposure to configuration management (Puppet, Ansible, Terraform). • Some exposure to scripting (Bash, Python). • Supporting CI/CD delivery pipelines (GitLab, GitHub). More ❯
using modern, agile development practices like code review, TDD, CI/CD and pairing using tools like Git and GitHub. Experience of operationally managing software components once live, including observability, logging, metrics, error reporting, debugging and live incident management. Experience of working with sensitive personal data. Competitive salary starting from £85,000 Generous Pension Scheme - We invest in your future More ❯
influencing at all levels. A mindset focused on long-term sustainability and strategic technical thinking. Bonus Points For Fintech or regulated environment experience, particularly investment platforms. Familiarity with modern observability stacks and incident response processes. Experience with security-first architecture and data protection best practices. Why Join? Well-Backed & Ambitious: Backed by a globally recognised financial group with significant investment More ❯
solutions meet business needs. Experience with data ingestion tools, like Fivetran. Advantageous Exposure to deploying applications with Kubernetes. Experience with Data Orchestrator tools (Airflow, Prefect, etc.) Experience with Data Observability tools (Monte Carlo, Great Expectations, etc.) Experience with Data Catalog tools (Amundsen, OpenMetadata, etc.) Interview Process Call with the talent team Take home task Tech interview CPTO interview Life at Lendable More ❯
Exposure to site reliability engineering: root cause analysis, in-production troubleshooting, on-call rotations. • Exposure to infrastructure management: CI/CD, containerization, orchestration, infra-as-code, monitoring, logging, alerting, observability. • Technical product mindset (e.g. understanding how to debug poor adoption). • Excellent problem-solving and communication skills (ability to contextualize, gauge risks and get buy-in for high stakes More ❯
with cross-functional stakeholders including the Data Platform team and Engineering teams. Design and maintain reliable, scalable cloud infrastructure (primarily AWS). Drive key initiatives involving container orchestration (Kubernetes), observability, security, and CI/CD. Establish best practices in platform engineering and foster a servant-leadership culture focused on empathy, empowerment, and collaboration. Work with your peers and colleagues at More ❯
ElastiCache Familiarity with modern CI/CD platforms – ideally GitLab, but GitHub Actions or CircleCI also welcome Proficiency in testing frameworks like JUnit and RestAssured A passion for monitoring, observability, and maintaining resilient systems Desirable Skills: Experience with monitoring and alerting tools like Datadog, Prometheus, Grafana, or PagerDuty Exposure to Python scripting Familiarity with deployment platforms such as Kubernetes and More ❯
available deployments. Responsibilities Mentor and lead a team of DevOps specialists, promoting best practices, documentation, and knowledge sharing. Collaborate cross-functionally (Dev, QA, Management etc.) to enhance deployment quality, observability, and stability. Build monitoring, logging, and alerting into systems to proactively detect issues and maintain system health. Design the architecture, implementation, and management of end-to-end CI/CD pipelines More ❯
learning, knowledge sharing and continuous improvement. You have a passion for DevOps and Platform as a Service. Understanding of security and compliance requirements related to platform infrastructure. Experience with observability practices and tooling, incident management processes and driving operational excellence. Diversity, Equity and Inclusion If you're excited about this role but your experience doesn't align perfectly, we encourage More ❯
Snowflake. Understanding of testing strategies, including unit, integration, and system testing (TDD/BDD is a plus). Experience with CI/CD pipelines, monitoring tools, and production-grade observability practices. Strong problem-solving skills, especially when dealing with data integrity, scale, and operational complexity. Comfortable working independently and navigating ambiguity, especially when translating regulatory or compliance needs into technical More ❯