AWS , Kubernetes , and associated DevOps practices. Champion DevOps culture by integrating CI/CD pipelines using Jenkins, GitLab, or similar tools. Leverage monitoring and observability tools like Grafana and Prometheus for system reliability. Enhance security practices and ensure compliance with stringent security and accreditation standards. What You’ll Bring: Active More ❯
SR2 | Socially Responsible Recruitment | Certified B Corporation™
Core Responsibilities Streamlining infrastructure setup and maintenance to support fast, scalable software delivery. Owning, managing, and enhancing our Continuous Integration (CI) infrastructure. Enhancing system observability with effective diagnostic and tracking solutions. Delivering automation solutions for end-users across the business. Ideal Skill Set Technical Competencies Confident with automation scripting , ideally More ❯
APIs Working with databases like PostgreSQL, DynamoDB and MongoDB Deploying cloud-native services using tools like Terraform, Docker and Kubernetes Improving performance, security and observability in backend systems Collaborating with product managers, cloud engineers and designers What they’re looking for: Proven experience writing Python APIs (Django, Flask or FastAPI More ❯
APIs Working with databases like PostgreSQL, DynamoDB and MongoDB Deploying cloud-native services using tools like Terraform, Docker and Kubernetes Improving performance, security and observability in Back End systems Collaborating with product managers, cloud engineers and designers What they're looking for: Proven experience writing Python APIs (Django, Flask or More ❯
in Java 7 + years in Linux/sysadmin AWS (EKS, EC2, CDK, VPC), Kubernetes, CI/CD IaC: Terraform, Helm, Ansible Experience with observability (New Relic, Splunk, DataDog) On-call rotation & agile/SCRUM experience Must be based in the UK or Germany with no need for sponsorship . More ❯
be able to build new DevOps pipelines AWS S3 RDS Route 53 IAM EKS Secrets Manager ECR Terraform Deployment of AWS Resources Pipelines OCI Observability ELK Dynatrace Prometheus Others Vault RedHat As an equal opportunities employer, we welcome applications from individuals of all backgrounds. However, for you to be eligible More ❯
be able to build new DevOps pipelines: AWS S3, RDS, Route 53, IAM, EKS, Secrets Manager, ECR Terraform Deployment of AWS Resources, Pipelines, OCI Observability: ELK, Dynatrace, Prometheus Others: Vault, RedHat As an equal opportunities employer, we welcome applications from individuals of all backgrounds. However, for you to be eligible More ❯
be able to build new DevOps pipelines AWS S3 RDS Route 53 IAM EKS Secrets Manager ECR Terraform Deployment of AWS Resources Pipelines OCI Observability ELK Dynatrace Prometheus Others Vault RedHat As an equal opportunities' employer, we welcome applications from individuals of all backgrounds. However, for you to be eligible More ❯
pipelines AWS S3 RDS Route 53 IAM EKS Secrets Manager ECR Kubernetes Helm Kops Ingress/Egress Terraform Deployment of AWS Resources Pipelines OCI Observability ELK Dynatrace Prometheus Others Vault RedHat Skills working in a secure environment and ability to adhere to security principles Experience in support organisation More ❯
including .NET, Node.js, JavaScript frameworks (React, Vue.js), Kotlin, Swift, and API integrations. Skilled in building cloud-native solutions on Azure or AWS, with IaC, observability, and HA/DR patterns in mind. Strong knowledge of CMS platforms such as Sitecore, plus experience with modern auth frameworks (OAuth2, OIDC, SAML). More ❯
West Midlands, United Kingdom Hybrid / WFH Options
Halian Technology Limited
including .NET, Node.js, JavaScript frameworks (React, Vue.js), Kotlin, Swift, and API integrations. Skilled in building cloud-native solutions on Azure or AWS, with IaC, observability, and HA/DR patterns in mind. Strong knowledge of CMS platforms such as Sitecore, plus experience with modern auth frameworks (OAuth2, OIDC, SAML). More ❯
cause analyses. Ability to work effectively with other technical teams such as DevOps to support deployments and troubleshoot issues. Knowledge of DevOps practices (CICD, observability, automation) is a bonus. Knowledge of AWS and logging tools such as Kibana and Datadog is an advantage Continuous learning mindset and a passion for More ❯
Smart Edge’s Client is looking for an individual to help with their Observability Subject Matter Expert (SME)@ Telford, UK Experince Required : • Observability Architecture & Strategy: Design and implement observability pipelines using AWS-native and third-party tools. Define telemetry standards (metrics, logs, traces) across microservices, APIs, and data pipelines. Establish … SLIs/SLOs and integrate them into service health dashboards. AWS Workload Monitoring: Implement observability for AWS Connect (contact flows, agent metrics, call quality). Monitor AWS Data Services (Glue, Redshift, Athena, S3, Lake Formation) for performance, throughput, and data lineage. Integrate AWS Integration Services (API Gateway, EventBridge, Step Functions … Lambda) with distributed tracing and structured logging. Tooling & Automation: Deploy and manage observability tools: CloudWatch, X-Ray, OpenTelemetry, Prometheus, Grafana, Datadog, Splunk, ELK. Automate alerting, anomaly detection, and incident correlation using AI/ML-based tools. Integrate observability into CI/CD pipelines and Infrastructure-as-Code (IaC) workflows. Incident More ❯
Smart Edge’s Client is looking for an individual to help with their Observability Subject Matter Expert (SME)@ Telford, UK Experince Required : • Observability Architecture & Strategy: Design and implement observability pipelines using AWS-native and third-party tools. Define telemetry standards (metrics, logs, traces) across microservices, APIs, and data pipelines. Establish … SLIs/SLOs and integrate them into service health dashboards. AWS Workload Monitoring: Implement observability for AWS Connect (contact flows, agent metrics, call quality). Monitor AWS Data Services (Glue, Redshift, Athena, S3, Lake Formation) for performance, throughput, and data lineage. Integrate AWS Integration Services (API Gateway, EventBridge, Step Functions … Lambda) with distributed tracing and structured logging. Tooling & Automation: Deploy and manage observability tools: CloudWatch, X-Ray, OpenTelemetry, Prometheus, Grafana, Datadog, Splunk, ELK. Automate alerting, anomaly detection, and incident correlation using AI/ML-based tools. Integrate observability into CI/CD pipelines and Infrastructure-as-Code (IaC) workflows. Incident More ❯
Code, Git). A great candidate will have good knowledge of Infrastructure as Code with Azure CI/CD integration, and the use of observability tools like Prometheus or Loki. Excellent skills in creating clear, detailed High Level & Low Level documentation with technical diagrams. A strong understanding of Microsoft Azure More ❯
Nottingham, England, United Kingdom Hybrid / WFH Options
EMBS Technology
Code, Git). A great candidate will have good knowledge of Infrastructure as Code with Azure CI/CD integration, and the use of observability tools like Prometheus or Loki. Excellent skills in creating clear, detailed High Level & Low Level documentation with technical diagrams. A strong understanding of Microsoft Azure More ❯
Nottingham, England, United Kingdom Hybrid / WFH Options
Digital Waffle
Python and TypeScript Work with product and engineering teams to shape models, services, and system behaviour Contribute to system architecture and infrastructure for scale, observability, and performance Explore and implement LLMs, prompt engineering, and AI orchestration frameworks Take ownership of features end-to-end, from design to deployment and monitoring More ❯
to interact with our systems and code Regularly contribute openly to team meetings (and encouraging others to do so) Champion the health, quality and observability of our projects Ship to production on your first day! Who you are Experience using React in production Experience in unit testing Knowledge of optimising More ❯
to interact with our systems and code Regularly contribute openly to team meetings (and encouraging others to do so) Champion the health, quality and observability of our projects Ship to production on your first day! Who you are Experience using React in production Experience in unit testing Knowledge of optimising More ❯
Nottingham, Nottinghamshire, East Midlands, United Kingdom
Microlise
Promote a culture of delivery excellence, engineering discipline, and personal accountability Encourage cross-team alignment on shared technical goals, such as implementation of DSC, observability standards, and automation Support long-term platform value creation by ensuring projects are not only delivered but deliver measurable benefit to the organisation What we More ❯
implementing data governance best practices, including data quality, compliance, and stewardship Familiarity with governance tools such as data catalogues, data dictionaries, lineage mapping, and observability tools In-depth understanding of data compliance and regulatory requirements (e.g., GDPR) Undergraduate degree (BA, BSc); Masters degree is a plus UK work authorisation is More ❯
Hereford, Herefordshire, West Midlands, United Kingdom Hybrid / WFH Options
Twinstream Limited
the Site Reliability Engineer: Partner with developers to improve performance and reliability across systems Automate toil and reduce unnecessary alerts with smart tooling Evolve observability so we can prevent issues before they become incidents Improve CI/CD pipelines and support development teams in delivering quality faster Explore new technologies … modern IaC practices Hands-on with Docker and orchestration tools (Kubernetes, OpenShift, or Docker Swarm) CI/CD experience (Jenkins or equivalent) Monitoring/observability tools: Grafana , Prometheus , or InfluxDB Event-driven messaging: RabbitMQ or similar Strong Linux skills, scripting, and understanding of network security protocols Experience with AWS: EC2 … S3, RDS, Lambda Desirable: Experience coding in Python, Java, or Go Exposure to cross-domain solutions Experience in a service management environment Observability best practices and metric-driven reliability improvement Security Requirements Due to the sensitive nature of our work, candidates must be eligible for Developed Vetting (DV) clearance. All More ❯
Infrastructure services; Review, advise and assure resilience of our critical platforms and services from non-functional perspective ensuring design and engineering KPIs like maintainability, observability, availability, recoverability and Security are improved and sustained. Developing and establishing high technical and design standard to deliver change that meets departmental policies and outcomes More ❯
Hereford, Herefordshire, United Kingdom Hybrid / WFH Options
TwinStream
with multiple feature development teams and the BAU/Support team to define and evolve our cloud & on-prem infrastructure & delivery pipelines, improving system observability, demonstrating performance and capacity improvements and proactively identifying and mitigating reliability risks. Key Responsibilities of the Site Reliability Engineer: Collaborate with Software Engineers to improve … reliability and performance in their subsystems Partner with System Administrators in automating toil and eliminating alerts Evolve observability and monitoring capabilities to identify and solve problems before they impact the business Support development environments to help us achieve our delivery and quality goals Research and evaluate technologies, tools and services … languages (Java, Go, Python or similar) Knowledge of cross-domain principles & technologies Experience of working in a service management environment Practical applications of using observability patterns in previous systems Creating and monitoring system availability metrics and using those to drive work that reduces downtime There are many great reasons to More ❯