Basingstoke, England, United Kingdom Hybrid / WFH Options
JR United Kingdom
native Infrastructure-as-Code (IaC) solutions from the ground up? Our client is seeking a talented and motivated Senior Software Engineer to lead the development of our next-generation observability platform. THIS IS NOT A DEVOPS ROLE. Responsibilities Collaborate within a dynamic software engineering team to architect and build a new cloud-native IaC platform. Develop software using technologies such More ❯
Hull, England, United Kingdom Hybrid / WFH Options
JR United Kingdom
native Infrastructure-as-Code (IaC) solutions from the ground up? Our client is seeking a talented and motivated Senior Software Engineer to lead the development of our next-generation observability platform. THIS IS NOT A DEVOPS ROLE. Responsibilities Collaborate within a dynamic software engineering team to architect and build a new cloud-native IaC platform. Develop software using technologies such More ❯
Bristol, Gloucestershire, United Kingdom Hybrid / WFH Options
Twinstream Limited
Socials & Events Cycle to Work Scheme & Life Assurance Key Responsibilities of the Site Reliability Engineer: Work closely with engineers and sysadmins to increase performance and reduce toil Advance system observability, monitoring and alerting Automate, troubleshoot, and proactively resolve issues before they escalate Improve development environments to meet delivery and quality targets Research and evaluate tools and platforms to support scale More ❯
BS1, Bristol, City of Bristol, United Kingdom Hybrid / WFH Options
Twinstream Limited
Socials & Events Cycle to Work Scheme & Life Assurance Key Responsibilities of the Site Reliability Engineer: Work closely with engineers and sysadmins to increase performance and reduce toil Advance system observability, monitoring and alerting Automate, troubleshoot, and proactively resolve issues before they escalate Improve development environments to meet delivery and quality targets Research and evaluate tools and platforms to support scale More ❯
Employment Type: Permanent
Salary: £80000 - £110000/annum Hybrid, Great Benefits
Bristol, Avon, South West, United Kingdom Hybrid / WFH Options
Twinstream Limited
Socials & Events Cycle to Work Scheme & Life Assurance Key Responsibilities of the Site Reliability Engineer: Work closely with engineers and sysadmins to increase performance and reduce toil Advance system observability, monitoring and alerting Automate, troubleshoot, and proactively resolve issues before they escalate Improve development environments to meet delivery and quality targets Research and evaluate tools and platforms to support scale More ❯
multi-tenant SaaS or large enterprise application. Certifications: AWS Certified Solutions Architect, Google Professional Cloud Architect, Azure Solutions Architect Expert. Experience in data architecture, AI/ML integration, and observability frameworks . More ❯
multi-tenant SaaS or large enterprise application. Certifications: AWS Certified Solutions Architect, Google Professional Cloud Architect, Azure Solutions Architect Expert. Experience in data architecture, AI/ML integration, and observability frameworks . More ❯
some experience in a leadership or managerial position. Strong knowledge of cloud platforms (AWS, GCP, Azure) and modern infrastructure technologies (Kubernetes, Docker, Terraform). Expertise in monitoring, logging, and observability tools (e.g., Prometheus, Grafana, Datadog, Splunk). Proficiency in at least one programming or scripting language (e.g., Python, Go, Bash). Deep understanding of networking, databases, and distributed systems. Strong More ❯
Terraform). Experience in software development in general, with skills in a high-level language (e.g., Python, JavaScript, TypeScript, Java) and familiarity with modern development practices Understanding of Cloud Observability, Monitoring, and Tracing tools (Datadog, CloudWatch, Jaeger, ELK) and how best to leverage to support effective MTTR and mitigate high CFR Our UK benefits: Stock Options Annual Performance Bonus or More ❯
such as Python, Java Spring Boot, Unix Shell. Deep knowledge of software applications and technical processes with emerging depth in one or more technical disciplines Proficiency and experience in observability such as white and black box monitoring, SLO alerting, and telemetry collection using tools such as Grafana, Geneos, Dynatrace, Prometheus, Datadog, Splunk, etc. Proficiency in continuous integration and continuous delivery More ❯
optimize data flow, connectivity, and interoperability Help to implement best practices and process improvements to enhance delivery efficiency and team performance Work with various internal teams to continuously improve observability and supportability capabilities of the company platform Key Requirements A highly motivated, technical and detail-oriented support engineer, able to work autonomously with minimal direction, passionate about learning new things More ❯
optimize data flow, connectivity, and interoperability Help to implement best practices and process improvements to enhance delivery efficiency and team performance Work with various internal teams to continuously improve observability and supportability capabilities of the company platform Key Requirements A highly motivated, technical and detail-oriented support engineer, able to work autonomously with minimal direction, passionate about learning new things More ❯
London, England, United Kingdom Hybrid / WFH Options
Ten Lifestyle Group
cost optimisation). Experience with cloud platforms (AWS, GCP, Azure) and infrastructure-as-code (Terraform). Familiarity and hands-on with DevOps practices (CI/CD, Docker, K8s) and observability tools (Prometheus, Grafana, Datadog). Experience in distributed systems and scaling. Knowledge and hands-on experience with multiple data stores (both SQL and NoSQL). Desired experience in building agentic More ❯
Enterprise level Cloud & DevOps standards and best practices in the areas of cloud infrastructure, infrastructure as code, and DevOps toolchain Provide thought leadership, design and implementation roadmap for Continuous Observability platform to maintain overall services & infrastructure health along with automated remediation and disaster recovery failovers capabilities Team Leadership: Leadership & Mentoring: As an Azure Sr Architect, you will be responsible to More ❯
an on-call rota Working with the Vault squad & wider Technology stakeholders, you will support the continuous improvement of our services through the development of automation scripting and effective observability solutions. Enforce adherence to architectural standards/principles, global product-specific guidelines, usability design standards, etc. Working with Technology teams, you will onboard new tools to support business needs and More ❯
or GCP): Migration and operation of cloud environments, including compute and storage scalability Containerisation & Virtualisation: Familiarity with virtual and physical server provisioning, especially in strategic data centres Platform Resilience & Observability: Designing for uptime, performance, and root cause analysis. Web Services & APIs: Used for Integration with 24+ LBGI systems Batch Processing: Understanding of batch suite performance and scheduling constraints RPA & Automation More ❯
position will align to a discipline where you will be expected to build and support solutions aligned with SDLC principles, providing technical excellence with a focus on scripting and observability coupled with a security mindset. What will you be doing day-to-day? Automation and Orchestration: Streamline the delivery and support processes by leveraging automation and IaC principles. Support and More ❯
optimize data flow, connectivity, and interoperability Help to implement best practices and process improvements to enhance delivery efficiency and team performance Work with various internal teams to continuously improve observability and supportability capabilities of the company platform Key Requirements A highly motivated, technical and detail-oriented support engineer, able to work autonomously with minimal direction, passionate about learning new things More ❯
Cardiff, Wales, United Kingdom Hybrid / WFH Options
ZipRecruiter
/CD pipelines using GitHub Actions, AWS CodePipeline, Jenkins, and other tools, with an emphasis on reliability, reusability, and performance. Contribute to the design and integration of monitoring and observability solutions (CloudWatch, Prometheus, Grafana) to ensure infrastructure and model health. Champion software engineering excellence through Test-Driven Development (TDD), rigorous test automation, and continuous quality assurance practices. Support architectural decisions More ❯
a bias for Infrastructure (Python, Go, C#) • IAM Policy and Authentication/Authorization schemes • Web Services and REST API • Databases and Storage Systems • Development Build, Test, and Deployment Pipelines • Observability and Monitoring (Open Telemetry, TIG and ELK stacks) #LI-JS2 Together, as owners, let's turn meaningful insights into action. Life at CGI is rooted in ownership, teamwork, respect and More ❯
of reliability and automation. Help the team provide guidelines and blueprints on the DevOps lifecycle of applications. Maintain our internal tooling and automation to improve the reliability, scalability, and observability of our services. Proactively identify and solve issues across the whole stack, together with the rest of the infrastructure and engineering teams. Contribute to raising awareness in the security and More ❯
and the ability to balance multiple priorities in a fast-paced environment. DESIRED SKILLS AND EXPERIENCE: Financial Services experience Real-time and low latency market data experience Service orchestration, observability and monitoring platform experience Solid understanding of a Programming Language (preferably Python) Agile tools (Jira, GIT among other DevOps principles) AWS, Azure, GCP Certification LSEG is a leading global financial More ❯
adaptability and a commitment to continuous learning. Understanding of Continuous Integration and Continuous Deployment (CI/CD) principles and their role in efficient software development. Awareness of monitoring and observability practices to support system reliability and performance. A proactive mindset, eagerness to learn, and a collaborative approach to solving engineering challenges. Why Proofpoint Protecting people is at the heart of More ❯
London, England, United Kingdom Hybrid / WFH Options
Smarkets
designing, developing, and implementing distributed systems Can demonstrate deep knowledge in running services in cloud microservice environments and hands-on experience with Kubernetes Familiarity with AWS cloud Familiarity with observability principles and tools (Grafana, Prometheus, Sentry Elasticsearch, Jaeger) Excellent planning and communications skills and able to lead conversations with development and product teams Preferred Skills and Experience 6-8+ More ❯
transactional operations and columnar formats for efficient large-scale analytical querying. Support DevOps practices including CI/CD, infrastructure-as-code, automated testing, release and version control and system observability for data pipelines. Establish metrics and KPIs and identify and deploy tools to measure data pipeline health, data quality, timeliness and accuracy, team performance, cost-effectiveness, and business impact. Actively More ❯