Kubernetes: Workload orchestration and container management CI/CD: GitHub Actions or Azure DevOps pipelines with end-to-end automation Event-Driven Architecture: Kafka or similar messaging systems Monitoring & Observability: Azure Monitor, Open Telemetry, Prometheus etc. Secure-by-Design Practices: Policy as Code, automated validation, compliance controls Nice to Haves Experience in regulated environments (banking, fintech, healthcare) Background in Site More ❯
or ARM templates. Experience with automation and scripting using PowerShell, Bash, or Python. Strong knowledge of cloud security practices and governance models within Azure environments. Experience with monitoring and observability tools such as Azure Monitor, Log Analytics, Prometheus, or Grafana. Strong troubleshooting and analytical skills, particularly in complex cloud and networked environments. If you're interested in the role, please More ❯
Kubernetes: Workload orchestration and container management CI/CD: GitHub Actions or Azure DevOps pipelines with end-to-end automation Event-Driven Architecture: Kafka or similar messaging systems Monitoring & Observability: Azure Monitor, Open Telemetry, Prometheus etc. Secure-by-Design Practices: Policy as Code, automated validation, compliance controls Nice to Haves Experience in regulated environments (banking, fintech, healthcare) Background in Site More ❯
London, South East, England, United Kingdom Hybrid/Remote Options
Salt Search
within distributed system architectures Knowledge of Agile delivery methods , including CI/CD pipelines and test automation frameworks Strong understanding of programming best practices around security, availability, performance, and observability Excellent problem-solving and collaboration skills, with the ability to work effectively in a cross-functional, Agile environment A passion for clean code, scalability, and continuous learning Why Join? You More ❯
run queries. Strong experience with cloud platforms such as AWS, Azure, or Google Cloud, including services like EC2, S3, RDS, and Kubernetes. Expertise in implementing and managing monitoring and observability tools like Prometheus, Grafana, ELK stack, or similar. Experienced with infrastructure as code (IaC) tools like Terraform, Ansible, or Puppet. Extensive experience with automation tools and scripting languages to streamline More ❯
offs of architectural and design decisions. Experience with Sequelize or similar tools Knowledge of security, accessibility and performance best practices. Exposure to agile or lean delivery environments. Familiarity with observability tools. More ❯
City of London, London, United Kingdom Hybrid/Remote Options
Experis
and cloud deployment workflows. Practical experience with MongoDB or similar NoSQL databases. Problem-solver with strong communication skills and a collaborative mindset. Bonus: experience with message queues (Kafka, RabbitMQ), observability tooling, or DevOps-style environments. 🌟 Why You’ll Love It Here Impact & Ownership — You’ll own core backend systems and shape architectural decisions. Modern Tech Stack — Node.js, TypeScript, NestJS, MongoDB More ❯
and cloud deployment workflows. Practical experience with MongoDB or similar NoSQL databases. Problem-solver with strong communication skills and a collaborative mindset. Bonus: experience with message queues (Kafka, RabbitMQ), observability tooling, or DevOps-style environments. 🌟 Why You’ll Love It Here Impact & Ownership — You’ll own core backend systems and shape architectural decisions. Modern Tech Stack — Node.js, TypeScript, NestJS, MongoDB More ❯
offs of architectural and design decisions. Experience with Sequelize or similar tools Knowledge of security, accessibility and performance best practices. Exposure to agile or lean delivery environments. Familiarity with observability tools. Contract Philip Boltt at Lorien Global IND_PC1 Guidant, Carbon60, Lorien & SRG - The Impellam Group Portfolio are acting as an Employment Business in relation to this vacancy. More ❯
Neptune) and property graph modelling. ● Data engineering: ETL pipelines, document processing, schema design for AI applications. ● Cloud platforms (GCP preferred, AWS/Azure also relevant) and containerisation (Docker). ● Observability and monitoring for LLM applications (tracing, metrics, cost tracking). ● Secure coding practices for regulated industries and sensitive data handling. More ❯
South East London, London, United Kingdom Hybrid/Remote Options
Stepstone UK
Familiarity with deploying and scaling ML models in the cloud, particularly with AWS and SageMaker Understanding of DevOps processes and tools: CI/CD, Docker, Terraform, and monitoring/observability Bonus: experience with vector databases, semantic search, or event-driven systems like Kafka Additional Information Were a community here that cares as much about your life outside work as how More ❯
that automate their processes. Contribute to the development of our Virtual Agent development platform that scales with our product strategy. Ensure our AI services maintain high standards of reliability, observability, availability, and performance. Participate in our machine learning community to influence how we implement machine learning and computer vision technologies, shaping Unitary's future. Take ownership of customer outcomes with More ❯
and deployment automation. Architect and manage scalable and secure cloud infrastructure (AWS, Azure, or GCP). Collaborate with data science and engineering teams for AI/ML workloads. Implement observability practices (monitoring, logging, alerting). Drive SRE best practices and disaster recovery strategies. Leadership & Strategy Act as the guardian of guardrails while empowering product squads to move fast. Partner with More ❯
City of London, London, United Kingdom Hybrid/Remote Options
Tata Consultancy Services
stability, and ROI. Manage and own the technical debt process, ensuring long-term maintainability and scalability. Collaborate with ML and GenAI teams to operationalize models into production pipelines. Ensure observability, data quality, lineage, and governance across the data lifecycle. Mentor and lead a team of engineers, fostering a culture of innovation and technical excellence. Your Profile Essential skills/knowledge More ❯
stability, and ROI. Manage and own the technical debt process, ensuring long-term maintainability and scalability. Collaborate with ML and GenAI teams to operationalize models into production pipelines. Ensure observability, data quality, lineage, and governance across the data lifecycle. Mentor and lead a team of engineers, fostering a culture of innovation and technical excellence. Your Profile Essential skills/knowledge More ❯
that automate their processes. Contribute to the development of our Virtual Agent development platform that scales with our product strategy. Ensure our AI services maintain high standards of reliability, observability, availability, and performance. Participate in our machine learning community to influence how we implement machine learning and computer vision technologies, shaping Unitary's future. Take ownership of customer outcomes with More ❯
Nottingham, Nottinghamshire, United Kingdom Hybrid/Remote Options
London Stock Exchange Group
not static and must evolve over time as technology and standards change. You are not afraid to dive deep - writing code, defining standards around CI/CD, maximizing automation, observability and supportability whilst making sure solutions are cost effective. A confident communicator, you will lead with data when collaborating with stakeholders. You will lead by example and mentor more junior More ❯
We're Hiring: Site Reliability Engineer (SRE) Fully Remote (UK-based candidates) | Permanent Role Supporting our US office Join a high-impact SRE team focused on automation, observability, and scaling infrastructure to support millions of users. Tech Stack Highlights Java | Kotlin | C++ | Postgres AWS (EC2, ECS, Fargate, Route53) New Relic | Splunk | DataDog Terraform | Helm | Kubernetes | Microservices What Were Looking For More ❯
SQL queries for relational databases. Integrate and manage applications in AWS cloud environments. Collaborate with cross-functional teams to ensure smooth delivery and integration of features. Implement monitoring and observability solutions (e.g., Datadog) for system health and performance tracking. Maintain high standards of code quality, reliability, and security. Primary Skills Strong programming skills in Java and Spring Boot. Hands-on More ❯
deliver high-quality, scalable solutions. Building and maintaining services using Python, TypeScript, and cloud platforms such as AWS or GCP. Working with serverless architectures, containerisation, and NoSQL databases. Championing observability, data-driven decision making, and continuous improvement. Influencing product direction by questioning, pushing back, and ensuring features align with user needs. A natural curiosity about the product and a willingness More ❯
and automation are at the heart of technology delivery. Responsibilities include: Designing and enforcing SLOs, SLIs, and SLAs to ensure high reliability and performance. Building and maintaining monitoring/observability solutions (Datadog, Grafana, Azure Application Insights, Log Analytics). Managing Infrastructure as Code (Terraform, Pulumi, CloudFormation) for scalable, repeatable deployments. Automating with PowerShell, Python, or Bash to drive efficiency. Supporting More ❯
City of London, London, United Kingdom Hybrid/Remote Options
N Consulting Global
CD, modular design, and automated testing • Contributing to the development of a lakehouse architecture using Apache Iceberg • Collaborating with business teams to translate requirements into data-driven solutions • Building observability into data flows and implementing basic quality checks • Participating in code reviews, pair programming, and architecture discussions • Continuously learning about the financial indices domain and sharing insights with the team More ❯
CD, modular design, and automated testing • Contributing to the development of a lakehouse architecture using Apache Iceberg • Collaborating with business teams to translate requirements into data-driven solutions • Building observability into data flows and implementing basic quality checks • Participating in code reviews, pair programming, and architecture discussions • Continuously learning about the financial indices domain and sharing insights with the team More ❯
with React, Vue, or Blazor Integrate LLMs and GenAI features into core product experiences Lead technical decision-making and mentor engineers within your squad Ensure best practices across testing, observability, and code quality What We’re Looking For Proven experience delivering AI/ML-powered production systems (not prototypes) Strong full-stack capability – C# .NET + modern JavaScript frameworks Solid More ❯
and automation are at the heart of technology delivery. Responsibilities include: Designing and enforcing SLOs, SLIs, and SLAs to ensure high reliability and performance. Building and maintaining monitoring/observability solutions (Datadog, Grafana, Azure Application Insights, Log Analytics). Managing Infrastructure as Code (Terraform, Pulumi, CloudFormation) for scalable, repeatable deployments. Automating with PowerShell, Python, or Bash to drive efficiency. Supporting More ❯