pipelines with Jenkins, GitLab CI/CD or similar - Containerize applications using Docker and orchestrate with Kubernetes - Monitor systems with Prometheus, Grafana and apply observability best practices - Automate deployment processes and improve DevOps workflows - Ensure high availability, fault tolerance and disaster recovery of cloud infrastructure - Collaborate with developers and operational more »
for mentoring and technical coaching of developers Excellent verbal and written communication skills The following skills and experience are nice to have: Experience with observability and monitoring tools such as Datadog Familiarity with polyglot persistence and experience with a NoSql DB especially DocumentDB Experience with Kafka, HiveMQ, ElasticSearch Infrastructure as more »
services and technologies, particularly AWS, to optimize the performance and security of our cloud infrastructure. Monitor system health, performance, and availability using monitoring and observability tools, proactively identifying and resolving issues. Collaborate with cross-functional teams to troubleshoot and resolve complex infrastructure issues, minimizing downtime and improving system reliability. Mentor more »
Guildford, Surrey, South East, United Kingdom Hybrid / WFH Options
Allianz Insurance Plc
Proficiency in programming languages such as Python, and knowledge of PySpark and Spark pool clusters as well as ML libraries and frameworks. Proficiency with observability tools, such as: Prometheus and Grafana. Infrastructure-as-Code(IaC)Terraform and/or Bicep (for Azure resource provisioning). Extensive experience of Apigee, API more »
infrastructure needs, ideally in a self-service environment. WHAT YOU’LL DO: Build and provide platform features for compute, authentication, service discovery and calls, observability and redundancy Lead architectural approaches with the right tradeoffs between scale, cost and maintenance Coach junior engineers about design, development, testing and deployment best practices more »
serverless function runtime version upgrades) Infrastructure drift monitoring & management Softwaremaintenance (e.g. language/framework/package version upgrades) Key rotation management Tooling maintenance (e.g. observability stack) Performance and Load testing Role Requirements Strong knowledge of Microsoft technologies. Active Directory, Entra, SharePoint, 365, Windows 10/11, Intune, Application packaging (Intune more »
in GitHub as a Source Code Management (SCM) platform, with hands-on experience in version control and collaborative coding. Experience with Kubernetes Experience with observability tools such as Datadog, Grafana, and Elastic for monitoring, analytics, and proactive issue identification. Experience in incident management, showcasing the ability to respond promptly and more »
normal—and that’s where you come in! We are seeking a skilled Site Reliability Engineer (SRE) with experience in AWS, Serverless, Monitoring, and Observability to join our team. Responsibilities: Design, build and maintain scalable, and reliable cloud infrastructure in AWS Monitor and manage the performance, reliability, and security of more »
pipelines • AWS S3 RDS Route 53 IAM EKS Secrets Manager ECR • Kubernetes Helm Kops Ingress/Egress • Terraform Deployment of AWS Resources Pipelines OCI • Observability ELK Dynatrace Prometheus • Others Vault RedHat As an equal opportunities’ employer, we welcome applications from individuals of all backgrounds. However, for you to be eligible more »
/scripting languages such as Python, Java, or Go, utilized for automating deployment, configuration, management, and monitoring processes. Strong understanding and practical experience with observability/monitoring tooling like New Relic, Dynatrace, or Splunk. Ability to define and create monitors/alerts at both infrastructure and application layers. Demonstrated automation more »
Solid understanding of the processes involved in release, integration, and deployment, along with an awareness of the promotion pathways within these processes. Expertise in observability principles and practices, encompassing monitoring, logging, tracing, and alerting systems to ensure transparency and actionable insights into system performance and health. Tools Dynatrace, Datadog Familiarity more »
Solid understanding of the processes involved in release, integration, and deployment, along with an awareness of the promotion pathways within these processes. Expertise in observability principles and practices, encompassing monitoring, logging, tracing, and alerting systems to ensure transparency and actionable insights into system performance and health. Tools Dynatrace, Datadog Familiarity more »
essential Experience diagnosing and troubleshooting connectivity issues Well-versed in security by design and security by default. Familiarity with SOC compliance Strong command of observability and monitoring in a multi-tenant cloud Experience working in high-performing Agile teams Proficient development in a major programming language such as C#, Python more »
City of London, London, United Kingdom Hybrid / WFH Options
SKY
Working closely with the Platform Enablement team and the SRE team to drive new ideas into the product roadmaps around the deployment self-service, observability, security, and reliability Team Management Define and track key performance indicators (KPIs) to measure the success and impact of Linux initiatives. Fostering a team culture more »
Middlesex, South East, United Kingdom Hybrid / WFH Options
SKY
Working closely with the Platform Enablement team and the SRE team to drive new ideas into the product roadmaps around the deployment self-service, observability, security, and reliability Team Management Define and track key performance indicators (KPIs) to measure the success and impact of Linux initiatives. Fostering a team culture more »
Birmingham, West Midlands, United Kingdom Hybrid / WFH Options
Interact Consulting Limited
on experience with Terraform for Infrastructure as Code (IaC). Proficiency with Docker, Kubernetes, and AKS. Familiarity with Azure cloud-native services. Knowledge of observability and site-reliability engineering principles. Proficiency in SQL and experience working with relational databases. more »
solutions including for compute, storage, messaging and networking Good understanding of Docker and experience with CI/CD tooling Good understanding of security and observability best practices and tooling Effective communication and collaboration skills What else about you would excite us? Experience building and maintaining high-traffic server-side web more »
Take ownership of delivery, embracing a DevOps culture to own the lifecycle of software that powers our products, along with release automation and production observability Own one or more core areas of Oddschecker’s product and engineering stacks, iteratively delivering value Coach and mentor others in your squad and wider more »
Northern Ireland, United Kingdom Hybrid / WFH Options
SoftwareOne
CI/CD pipeline development IaC, specifically Terraform or a wrapper Kubernetes, or a Cloud-managed container orchestrator, e.g. AKS, EKS Experience operating an observability stack preferred Bonus points for being security-minded About You: You have experience operating a mature, high volume stack in the cloud You have opinions more »
CI/CD pipeline development IaC, specifically Terraform or a wrapper Kubernetes, or a Cloud-managed container orchestrator, e.g. AKS, EKS Experience operating an observability stack preferred Bonus points for being security-minded About You: You have experience operating a mature, high volume stack in the cloud You have opinions more »
using Python and TypeScript/JavaScript. Strong API design skills and a familiarity with building web applications. A proponent of great testing, first-class observability and automating everything. Familiarity with security principles and practices, including authentication, authorization, and encryption. Experience with cloud technologies (e.g. Docker, Kubernetes, Helm & Terraform) and platforms more »
Manchester, Lancashire, United Kingdom Hybrid / WFH Options
Confidential
skills, and the capacity to work well in a team environment are often considered essential. Understanding of DevOps principles and practices, including monitoring, logging, observability and infrastructure management. Why Join Clarus Software? Here are some of the reasons you will love working here: A team of amazing people. We are more »
Greater Manchester, England, United Kingdom Hybrid / WFH Options
Clarus WMS
skills, and the capacity to work well in a team environment are often considered essential. Understanding of DevOps principles and practices, including monitoring, logging, observability and infrastructure management. Why Join Clarus Software? Here are some of the reasons you will love working here: A team of amazing people. We are more »
cloud platform engineering, Transforming to a multi-cloud operating model, multi-cloud data platform, multi-cloud container platform, multi-cloud AI platform, SRE, Business observability , etc. Ability to translate strategic business priorities into supporting technology programs Exposure to some of these industry sectors (Banking, Financial Services, Retail sectors, Energy & Resources more »
primarily in an AWS Cloud environment using approaches/tooling similar to ours (see technical skills), delivering scalable, performant, reliable solutions. Data reliability/observability, data governance and information security credentials What technical skills are required Ability to implement and use modern data platform concepts, frameworks and tooling, covering all more »