pipelines Familiarity with regulated workflows: ISO27001, SOC2, GDPR aren't just abbreviations, and don't fill you with dread Observability skills: Well familiar with Open Telemetry, Prometheus, Loki and Grafana CI/CD pipeline skills: You know what it takes to build templates and guardrails to allow the most junior developers to confidently push code, safely knowing that the computer More ❯
of their peers. Nice to have Building CI/CD pipelines. Knowledge of deployment, rollout, rollback strategies. Knowledge of observability practices (logging, metrics, tracing) and monitoring tools (e.g. Prometheus, Grafana). Understanding of cloud security best practices, including IAM policies and secret management. Time Off & Work-Life Balance 25 Days Annual Leave + bank holidays - plus the option to buy More ❯
Knowledge of cost optimisation strategies for cloud resources Experience with advanced Azure services such as Azure Kubernetes Service (AKS). Trade certifications Experience with infrastructure monitoring tools like Prometheus, Grafana, or Azure Monitor at scale Background in maintain disaster recovery and high-availability solutions for critical systems What We Offer At Netcompany, we believe in empowering our senior engineers to More ❯
various methods such as unit, integration, contract and E2E testing. You have a high degree of experience in observing the performance and health of applications via tools such as Grafana, Prometheus, Data Dog, Sentry, etc. You have a strong desire and are an advocate for performant applications. Proactive in solving problems simply and effectively, with an eye for pragmatic solutions. More ❯
Programming languages: Java, Python, Go Lang Container orchestration/Cloud platform: RedHat Openshift/AWS/Azure DevOps tools - Ansible, Chef, Kubernetes, GitLab SRE logging & Monitoring Tools - ELK stack, Grafana, Prometheus, Open Telemetry Other highly valued skills include: Strong understanding of Agile application development methodology. Strong knowledge of API development/principles Collaborating with the development teams to build scalable More ❯
Ability to balance technical depth with strategic thinking and business alignment. Tools Development & Deployment:GitHub, Docker, Kubernetes AI/ML:Azure AI, OpenAI, and similar Observability:Dynatrace, New Relic, Grafana, or similar QA & Testing:Selenium, Playwright, Postman, Cucumber, or similar Automation & IaC:Terraform, Ansible, Bicep, or similar Incident Management:PagerDuty, Opsgenie, or similar Security & Compliance:Snyk, SonarQube, or similar Collaboration More ❯
programming skills with experience in various libraries Experience with AWS Lambda functions and serverless architectures Knowledge of REST APIs, JSON/XML, and web services integration Familiarity With Cribl, Grafana, Logic Monitor, Datadog, Newrelic or comparable monitoring & APM solutions is a plus. Exposure to SIEM and Service Management toolsets like ServiceNow would be advantageous. Nice to have UNIX/RHEL More ❯
GitLab , GitHub Actions, or CircleCI Strong testing capabilities using JUnit , RestAssured , or similar frameworks Proactive with monitoring, observability, and system health Desirable Skills: Exposure to monitoring platforms like Datadog, Grafana, Prometheus , or PagerDuty Familiarity with Python scripting Experience with Kubernetes and deployment tools such as Helm Why Join H&B Tech? Help define the future of digital health & wellness in More ❯
AWS Lambda, Azure Functions); Experience with CI/CD pipelines and automation tools (e.g., GitHub Actions,Azure DevOps); Ability to switch between multiple languages and paradigms effectively; Familiarity with Grafana is a plus, as monitoring will become a growing focus in the project. You'll be: Self-motivated, proactive and continually looking for ways to improve and develop yourself; A More ❯
cloud architecture IoT 'smart' edge devices (using nVidia AI chips) Linux-based embedded OS on our Edge devices Continuous Integration and Delivery using Jenkins, SonarQube Terraform for infrastructure management Grafana, Elasticsearch, Kibana & New Relic for metrics, logs and monitoring In the company we also use: VueJS, MySQL, Spring Boot, Apache Camel, AWS Redshift, AWS SageMaker, Pentaho, Balena, Serverless functions Winnow More ❯
Ontario, California, United States Hybrid / WFH Options
annex it solutions
/GCP) with a focus on scalability, security, and cost optimization. Automate configuration management and deployments using Terraform, Ansible, or Chef. Implement monitoring, alerting, and logging solutions using Prometheus, Grafana, ELK, or equivalent. Troubleshoot complex production issues and ensure high system availability. Collaborate with development, QA, and security teams to improve software reliability and performance. Provide technical guidance and mentorship … ARM templates, Ansible. Strong scripting skills in Python, Bash, or PowerShell. Solid understanding of networking, security, and Linux/Windows system administration. Experience with monitoring/logging solutions: Prometheus, Grafana, ELK Stack. Excellent problem-solving, communication, and collaboration skills. Preferred Qualification Azure DevOps Engineer (AZ-400) or AWS DevOps certification. Experience with microservices architecture and serverless technologies Familiarity with Agile More ❯
mid to senior level candidate) o Docker/Kubernetes, etc. o Linux o Scripting - any Linux language o CI/CD tools (GitLab, Jenkins, etc.) o Storage provisioning (Prometheus, Grafana, etc. More ❯
Sheffield, South Yorkshire, Yorkshire, United Kingdom Hybrid / WFH Options
VANLOQ LIMITED
Required Skills: Proven experience in Python development & FastAPI Strong knowledge of PostgreSQL database administration Excellent problem-solving, debugging, and analytical skills Nice to Have: Exposure to observability tools ( Prometheus, Grafana, OpenTelemetry ) Experience with enterprise tools (Control M, True Sight, Guardium, Tenable Nessus, Delinea) Understanding of security and software development in highly regulated environments End-to-end experience with CI/ More ❯
Newcastle Upon Tyne, Tyne and Wear, England, United Kingdom Hybrid / WFH Options
Lorien
experience with Azure or AWS. Solid background with Terraform and IaC. Proven use of CI/CD tools (Jenkins, GitHub Actions, GitLab CI, etc.). Knowledge of Prometheus and Grafana for monitoring. Familiarity with collaboration tools like Slack. Either: Prior management/team lead experience, or A Senior DevOps engineer ready to progress into a managerial role. (Bonus) Background in More ❯
London, South East, England, United Kingdom Hybrid / WFH Options
Harnham - Data & Analytics Recruitment
up Experience Strong cloud skills (AWS, GCP, Azure) and containerisation (Docker, Kubernetes) Experience in automating deployments and orchestrating cloud environments Nice to have: Python (Jupyter, PyTorch), monitoring tools (Prometheus, Grafana), cloud databases (RDS, Aurora, Spanner), CI/CD tools (CircleCI), and data visualisation experience. This is a unique opportunity to join a visionary team redefining AI in 3D , with the More ❯
Employment Type: Full-Time
Salary: £140,000 - £160,000 per annum, Inc benefits
Micro Frontends and BFFs Hands-on expertise in React and TypeScript development with an eye for performance and resilience Proven ability to implement observability practices using tools like Prometheus, Grafana, or Azure Monitor Proficiency in containerisation and orchestration (Docker, Kubernetes - ideally AKS or GKE) Experience building and maintaining CI/CD pipelines for frontend applications (e.g. Azure DevOps, GitHub Actions More ❯
Docker, Kubernetes, Terraform). Skilled in cloud platforms (AWS, GCP, or Azure). Bonus points for: Familiarity with ML frameworks (PyTorch, Jupyter). Knowledge of cloud monitoring tools (Prometheus, Grafana). Experience with cloud-based databases (RDS, Aurora, Redshift, Spanner, etc.). Exposure to CI/CD pipelines (e.g., CircleCI). What's on offer: The chance to shape the More ❯
of Kubernetes and GPU scheduling, including setup of GPU-enabled clusters and deployment of GPU workloads in Kubernetes. Familiarity with GPU monitoring and observability, using tools such as Prometheus, Grafana, NVIDIA Data Center GPU Manager (DCGM), or custom scripts. Proven ability to analyze deployment approaches for GPU-accelerated serving frameworks and deliver reference implementations. Experience implementing software quality engineering practices More ❯
bash/python) and configuration management (Ansible, Salt) experience building tooling and drive automation with an IAC mindset Experience with open source monitoring and metrics collections (Nagios, TICK, Prometheus, Grafana, etc.) Comfortable operating in Linux based environments Production operations: incident response, triaging and troubleshooting Excellent verbal and written communication skills preferred as the team interfaces directly with senior stakeholders, external More ❯
Birmingham, West Midlands, United Kingdom Hybrid / WFH Options
DWP Digital
and skill at identifying performance bottlenecks cross application and infrastructure layers. Strong working knowledge and practical experience of performance testing tools (JMeter and K6) and performance monitoring tools like Grafana, Kibana, Kiali, Prometheus and AWS CloudWatch. Detailed working knowledge and familiarity with IaC and modern DevOps practices You and your role If you're passionate about making sure systems run More ❯
Leeds, West Yorkshire, Yorkshire, United Kingdom Hybrid / WFH Options
DWP Digital
and skill at identifying performance bottlenecks cross application and infrastructure layers. Strong working knowledge and practical experience of performance testing tools (JMeter and K6) and performance monitoring tools like Grafana, Kibana, Kiali, Prometheus and AWS CloudWatch. Detailed working knowledge and familiarity with IaC and modern DevOps practices You and your role If you're passionate about making sure systems run More ❯
Sheffield, South Yorkshire, Yorkshire, United Kingdom Hybrid / WFH Options
DWP Digital
and skill at identifying performance bottlenecks cross application and infrastructure layers. Strong working knowledge and practical experience of performance testing tools (JMeter and K6) and performance monitoring tools like Grafana, Kibana, Kiali, Prometheus and AWS CloudWatch. Detailed working knowledge and familiarity with IaC and modern DevOps practices You and your role If you're passionate about making sure systems run More ❯
Newcastle Upon Tyne, Tyne and Wear, North East, United Kingdom Hybrid / WFH Options
DWP Digital
and skill at identifying performance bottlenecks cross application and infrastructure layers. Strong working knowledge and practical experience of performance testing tools (JMeter and K6) and performance monitoring tools like Grafana, Kibana, Kiali, Prometheus and AWS CloudWatch. Detailed working knowledge and familiarity with IaC and modern DevOps practices You and your role If you're passionate about making sure systems run More ❯
BA, BS, MS, PHD, in Computer Science, Electrical Engineering or related field Ability to develop and maintain comprehensive monitoring, alerting systems and incident management using tools such as Prometheus, Grafana, OTEL and other observability stacks Ability to optimize, scale, and secure our infrastructure and Kubernetes environments, using deep Kubernetes and cloud platform experience Ability to Implement and maintain network policies More ❯