offline deployments. - MLOps experience (Sagemaker, Kubeflow, ZenML). - Experience building RESTful services around AI pipelines. - ISO 27001, NIST SSDF, OWASP SAMM, or GDPR compliance literacy. - Experience with AWS Karpenter, Prometheus, or similar observability stacks. Soft Skills Research-driven mindset, eager to experiment and iterate. Able to bridge the gap between cutting-edge AI research and practical deployment. Strong communicator with More ❯
Confluent Platform, Confluent Cloud managed services, multi-cloud deployments, and Confluent for Kubernetes • Knowledge of data mesh architectures, KRaft migration, and modern event streaming patterns • Exposure to monitoring tools (Prometheus, Grafana, Splunk) • Experience with data lakes, data warehouses, or big data ecosystems Personal Besides the professional qualifications of the candidates we place great importance in addition to various forms personality More ❯
Confluent Platform, Confluent Cloud managed services, multi-cloud deployments, and Confluent for Kubernetes • Knowledge of data mesh architectures, KRaft migration, and modern event streaming patterns • Exposure to monitoring tools (Prometheus, Grafana, Splunk) • Experience with data lakes, data warehouses, or big data ecosystems Personal Besides the professional qualifications of the candidates we place great importance in addition to various forms personality More ❯
communication and collaboration skills, with a commitment to mentoring and team development. Preferred skills Understanding of observability practices, including logging, metrics, and tracing. Experience with monitoring tools such as Prometheus and Grafana. Awareness of cloud security best practices, including IAM policies and secret management. Exposure to Agile methodologies and cross-functional team collaboration. Passion for continuous learning and driving technical More ❯
improvement. Champion IaC, CaC, and DevSecOps best practices, providing mentorship and technical leadership to other engineers. Monitoring and Operational Excellence: Implement and manage monitoring, logging, and alerting solutions (e.g., Prometheus, Grafana, Stackdriver) to ensure high availability and proactive incident response for all hybrid infrastructure. Required Skills & Qualifications Experience: 5+ years of experience in a DevOps, SRE, or Platform Engineering role. More ❯
. Familiarity with configuration management tools (e.g., Ansible, Puppet, Chef). Knowledge of infrastructure as code (IaC) tools (e.g., Terraform, CloudFormation). Experience with monitoring and logging tools (e.g., Prometheus, ELK Stack, Datadog). Passion for continuous learning and professional development Desirable Certifications AWS Certified Solutions Architect - Professional AWS Certified Security - Specialty HashiCorp Certified: Terraform Associate ABOUT BUSINESS UNIT IBM More ❯
low latency Deploy and operate services on Kubernetes and Docker, leveraging AWS infrastructure such as EC2, S3, Lambda, and RDS Implement monitoring and observability using tools like Grafana and Prometheus to track system performance Collaborate with product, frontend, and analytics teams to deliver features that make a tangible impact on user experience Contribute to CI/CD pipelines, automated testing More ❯
Networking, and PaaS modernisation Microsoft Entra ID/Azure AD , Conditional Access, and hybrid identity integration Automation & Scripting : PowerShell, Bash, and Python Monitoring & Observability : Azure Monitor, Log Analytics, Grafana, Prometheus Governance & Security : Policy-as-Code, Zero Trust, and CAF-aligned Landing Zones 🎯 What we’re looking for Strong background in Azure cloud engineering and DevOps delivery Proven experience with Terraform More ❯
applications using Docker and Kubernetes (including EKS and AKS). * CI/CD for Data: Implement and maintain automated pipelines for data applications. * Monitoring & Observability: Deploy solutions using Grafana, Prometheus, and other tools to ensure data quality and system health. * Infrastructure as Code: Use Terraform and Ansible to provision and manage data infrastructure. * Performance Optimization: Enhance data processing for speed More ❯
CI/CD pipelines. Excellent communication and stakeholder management skills. Nice to Have eCommerce domain experience. Experience in building headless commerce systems. Knowledge of observability tools (e.g., Application Insights, Prometheus, Grafana). Familiarity with containerization (Docker, Kubernetes). To arrange a Teams-based interview, please send your CV in the first instance to: WA Consultants is an Employment Business and More ❯
Belfast, Northern Ireland, United Kingdom Hybrid / WFH Options
MK Recruit
Strong background in server monitoring, maintenance, and automation 5–7+ years relevant industry experience in a similar senior role Excellent communication and client-facing skills Experience with tools like Prometheus, Grafana, or Ansible is a plus Cloud experience (AWS/Azure), scripting (Bash/Python), or containerization (Docker/Kubernetes) desirable Why Beonic Hybrid working – 2–3 days in office More ❯
HEVC, MXF, ProRes, CMAF). Good knowledge of streaming protocols (RTMP, SRT, HLS, DASH, WebRTC) Familiarity with DevOps practices: Docker/Kubernetes, CI/CD pipelines, monitoring tools (Grafana, Prometheus, CloudWatch, etc.). Strong expertise with debugging and troubleshooting stream interruptions, latency spikes, and playback issues. Strong leadership and communication skills for cross-functional collaboration. Experience with broadcast/MAM More ❯
HEVC, MXF, ProRes, CMAF). Good knowledge of streaming protocols (RTMP, SRT, HLS, DASH, WebRTC) Familiarity with DevOps practices: Docker/Kubernetes, CI/CD pipelines, monitoring tools (Grafana, Prometheus, CloudWatch, etc.). Strong expertise with debugging and troubleshooting stream interruptions, latency spikes, and playback issues. Strong leadership and communication skills for cross-functional collaboration. Experience with broadcast/MAM More ❯
Nice to Have Experience in e-commerce , fintech, or other large-scale transactional systems. Exposure to event-driven architecture, streaming technologies, or data-intensive applications. Familiarity with observability tools (Prometheus, Grafana, OpenTelemetry). What’s on Offer Competitive day rate Opportunity to work on high-traffic, mission-critical systems at scale. Initial 6-month contract with strong potential for extension. More ❯
preferred) or infrastructure engineering. Beneficial Experience Exposure to Azure services: Cosmos DB, PostgreSQL, Redis Cache, Service Bus, Key Vault, Storage Accounts and PrivateLink integrations. Experience with monitoring tools (Grafana, Prometheus) and RESTful APIs (Postman). Awareness of cloud security tools such as CSPM, workload protection and WAFs. Knowledge of .NET Core/.NET 8 environments and modern single-page application More ❯
preferred) or infrastructure engineering. Beneficial Experience Exposure to Azure services: Cosmos DB, PostgreSQL, Redis Cache, Service Bus, Key Vault, Storage Accounts and PrivateLink integrations. Experience with monitoring tools (Grafana, Prometheus) and RESTful APIs (Postman). Awareness of cloud security tools such as CSPM, workload protection and WAFs. Knowledge of .NET Core/.NET 8 environments and modern single-page application More ❯
City of London, London, United Kingdom Hybrid / WFH Options
Lorien
cloud-native storage: GCS, S3 and filer solutions. Exposure to data services: Cloud SQL, managed databases, MongoDB. Messaging systems: Kafka, RabbitMQ and EMS; API gateways like Apigee. Observability tools: Prometheus, Grafana, Cloud Monitoring. IAM and secrets management: dynamic secrets, Vault. Test-driven development and automated testing frameworks. Several years of real-world cloud engineering experience. Personal Attributes Curious and growth More ❯
cloud-native storage: GCS, S3 and filer solutions. Exposure to data services: Cloud SQL, managed databases, MongoDB. Messaging systems: Kafka, RabbitMQ and EMS; API gateways like Apigee. Observability tools: Prometheus, Grafana, Cloud Monitoring. IAM and secrets management: dynamic secrets, Vault. Test-driven development and automated testing frameworks. Several years of real-world cloud engineering experience. Personal Attributes Curious and growth More ❯
GitLab CI Advanced networking: BGP, OSPF, VLANs, VPNs, MPLS, SD-WAN, and cross-region connectivity Firewall administration (Cisco ASA or similar) and network access control Monitoring, logging, and observability (Prometheus, Grafana, CloudWatch, ECS, etc.) Linux and Windows systems administration IT infrastructure management: Active Directory, Microsoft services, networking, and on-prem systems Security, compliance, and disaster recovery in cloud and hybrid More ❯
GitLab CI Advanced networking: BGP, OSPF, VLANs, VPNs, MPLS, SD-WAN, and cross-region connectivity Firewall administration (Cisco ASA or similar) and network access control Monitoring, logging, and observability (Prometheus, Grafana, CloudWatch, ECS, etc.) Linux and Windows systems administration IT infrastructure management: Active Directory, Microsoft services, networking, and on-prem systems Security, compliance, and disaster recovery in cloud and hybrid More ❯