relevant tools. Security Best Practices: IAM, MFA, data encryption, firewall configurations. Programming/Scripting: Python, Terraform, or similar languages. Event-Driven Architectures: Kafka. Monitoring and Logging: Datadog, ELK Stack, Prometheus, etc. Experience in agile methodologies and DevOps practices. Location: Hybrid. Office located in London. (Hayes area). Office presence required: Yes. Frequency: 2-3 times a week at the office. More ❯
London, England, United Kingdom Hybrid / WFH Options
MrQ
Terraform creating Infrastructure as Code. Strong scripting skills in Bash, Python, or similar languages. Deep understanding of CI/CD tools like Jenkins. Solid knowledge of system monitoring tools (Prometheus, Grafana, etc.). What We Offer At MrQ, we take pride in providing an array of fantastic benefits to our valued team members. Enjoy a competitive salary package that recognizes More ❯
consistency and scalability. Monitoring and Alerting: Set up monitoring and logging systems to proactively detect and address potential issues, ensuring optimal performance and reliability, in environments like on-prem Prometheus/Thanos, as well as Grafana Cloud and Loki. Database Management: Manage hundreds of on-prem PostgreSQL databases, including performance tuning, backups, disaster recovery strategies, and their active/passive More ❯
Lisburn, Northern Ireland, United Kingdom Hybrid / WFH Options
Camlin Energy
in Computer Science, Management Information Systems, or related is desirable but not essential. Nice to have but not essential: Container Orchestration (Kubernetes, Docker Swarm) Service monitoring and graphing tools (Prometheus + Grafana, Nagios + Munin) Elastic stack Infrastructure as Code (Terraform) Repository solutions (Jfrog Artifactory, Jfrog Bintray, Reprepro) Lets Encrypt/ACME OpenVPN Apache Tomcat Messaging streams or communication platforms More ❯
City of London, England, United Kingdom Hybrid / WFH Options
Parser Limited
relevant tools. Security Best Practices: IAM, MFA, data encryption, firewall configurations. Programming/Scripting: Python, Terraform, or similar languages. Event-Driven Architectures: Kafka. Monitoring and Logging: Datadog, ELK Stack, Prometheus, etc. Experience in agile methodologies and DevOps practices. Location: Hybrid. Office located in London. (Hayes area). Office presence required: Yes. Frequency: 2-3 times a week at the office. More ❯
must have . Monitoring and Alerting: Set up monitoring and logging systems to proactively detect and address potential issues, ensuring optimal performance and reliability, in environments like on-prem Prometheus/Thanos, as well as Grafana Cloud and Loki. Database Management: Manage hundreds of on-prem PostgreSQL databases, including performance tuning, backups, disaster recovery strategies, and their active/passive More ❯
Bromley, England, United Kingdom Hybrid / WFH Options
CACI Ltd
and automated deployment processes. Excellent communication and leadership skills, with the ability to collaborate effectively with cross-functional teams and provide technical mentorship. Experience with monitoring and logging tools (Prometheus, Grafana, ELK Stack) and cloud cost optimisation strategies. Familiarity with serverless architectures and AWS Lambda. AWS certification(s) (AWS Certified Solutions Architect, AWS Certified Security – Specialty) preferred. Understanding of architectural More ❯
in one or more technical disciplines Proficiency and experience in observability such as white and black box monitoring, SLO alerting, and telemetry collection using tools such as Grafana, Dynatrace, Prometheus, Datadog, Splunk, etc. Proficiency in continuous integration and continuous delivery tools (e.g., Jenkins, GitLab, Terraform, etc.) Experience with container and container orchestration (e.g., ECS, Kubernetes, Docker, etc.) Expertise in SRE More ❯
in one or more technical disciplines Proficiency and experience in observability such as white and black box monitoring, SLO alerting, and telemetry collection using tools such as Grafana, Dynatrace, Prometheus, Datadog, Splunk, etc. Proficiency in continuous integration and continuous delivery tools (e.g., Jenkins, GitLab, Terraform, etc.) Experience with container and container orchestration (e.g., ECS, Kubernetes, Docker, etc.) Expertise in SRE More ❯
Container orchestration with Kubernetes. Front-end state management with Redux. TypeScript for enhanced JavaScript development. Back-end development with Node.js and Express.js. Monitoring and logging tools like ELK Stack, Prometheus, or Grafana. Automated UI testing with Selenium or Cypress. End-to-end testing with Mocha/Chai. Team communication with Slack or Microsoft Teams. Infrastructure as Code with Terraform or More ❯
it would be a plus: Strong understanding of modern infrastructure and site reliability engineering practice, including Infrastructure-as-code tools (e.g. Terraform, Ansible ) and metrics and observability tools (e.g. Prometheus, Grafana ). Strong understanding of modern DevOps practice, including DevOps stacks (e.g. Jenkins, GitLab, CircleCI ). Cloud experience (e.g. AWS, Google Cloud, Azure, Kubernetes). Familiar with software engineering ways More ❯
London, England, United Kingdom Hybrid / WFH Options
BBC
provision and manage cloud environments. Build and maintain CI/CD pipelines using GitHub Actions , AWS CodePipeline , CodeBuild , Jenkins . Integrate monitoring and observability tools such as AWS CloudWatch, Prometheus , Grafana for infrastructure and model health tracking. Ensure software quality through Test-Driven Development (TDD) , unit testing frameworks (e.g., pytest , unittest ), and automated integration tests. Conduct regular code reviews, participate More ❯
London, England, United Kingdom Hybrid / WFH Options
9fin
a great fit if you: Good working knowledge of AWS services including ECS, EC2, Lambda, VPC, IAM, Route53, CloudFront, S3, RDS Good understanding of monitoring and logging solutions, e.g. Prometheus, AWS Cloudwatch, Grafana, OpenTelemetry, Honeycomb, ELK etc. Basic SRE knowledge, and experience in alerting and incident management platforms (eg. Opsgenie, Pagerduty) Proven ability to provide and support strong and scalable More ❯
growth or large-scale environments. Strong expertise in cloud platforms (AWS, GCP, Azure) and container orchestration tools (Kubernetes, Docker). Deep knowledge of monitoring, logging, and observability tools (e.g., Prometheus, Grafana, Datadog, Splunk). Proficiency in programming or scripting languages (e.g., Python, Go, Bash). Experience with incident management, post-mortems, and implementing preventative measures. Solid understanding of networking, databases More ❯
London, England, United Kingdom Hybrid / WFH Options
Circadia Technologies Ltd
SaMD (Software as a Medical Device) product. Instrumentation: Develop and maintain instrumentation infrastructure to ensure system health. Build instrumentation systems to provide timely system health checks and alerts using Prometheus and Grafana. Implement and maintain automated alerting and incident response systems for quick issue identification and resolution. DevOps Support: Automate routine tasks and processes to improve efficiency and reduce manual More ❯
a plus. Knowledge of server-side technologies such as JBose, NodeJS, Jetty Monitoring and analyzing PMI data Hands on experience on enterprise tools set such as Grafana, Dynatrace, AppDynamics,Prometheus etc. Knowledge on Unix shell scripting, PERL or Python programming is preferred Experience in handling DDoS/BOT attack and different security remediations Working experience with Network load balancers, Global More ❯
GCP or multi-cloud environments. Exposure to GitOps workflows and tools like ArgoCD or Kustomize. Knowledge of .NET applications in cloud settings. Familiarity with observability stacks (e.g., Grafana, ELK, Prometheus). Understanding of compliance frameworks like SOC 2 or ISO 27001. Use of AI tools for enhancing operational efficiency. Experience with SIEM integration and incident response tooling. Comfort with remote More ❯
Salford, England, United Kingdom Hybrid / WFH Options
BBC Group and Public Services
to provision and manage cloud environments. Build and maintain CI/CD pipelines using GitHub Actions, AWS CodePipeline, CodeBuild, Jenkins. Integrate monitoring and observability tools such as AWS CloudWatch, Prometheus, Grafana for infrastructure and model health tracking. Ensure software quality through Test-Driven Development (TDD), unit testing frameworks (e.g., pytest, unittest), and automated integration tests. Conduct regular code reviews, participate More ❯
Glasgow, Scotland, United Kingdom Hybrid / WFH Options
BBC Group and Public Services
to provision and manage cloud environments. Build and maintain CI/CD pipelines using GitHub Actions, AWS CodePipeline, CodeBuild, Jenkins. Integrate monitoring and observability tools such as AWS CloudWatch, Prometheus, Grafana for infrastructure and model health tracking. Ensure software quality through Test-Driven Development (TDD), unit testing frameworks (e.g., pytest, unittest), and automated integration tests. Conduct regular code reviews, participate More ❯
London, England, United Kingdom Hybrid / WFH Options
BBC Group and Public Services
to provision and manage cloud environments. Build and maintain CI/CD pipelines using GitHub Actions, AWS CodePipeline, CodeBuild, Jenkins. Integrate monitoring and observability tools such as AWS CloudWatch, Prometheus, Grafana for infrastructure and model health tracking. Ensure software quality through Test-Driven Development (TDD), unit testing frameworks (e.g., pytest, unittest), and automated integration tests. Conduct regular code reviews, participate More ❯
Cardiff, Wales, United Kingdom Hybrid / WFH Options
BBC Group and Public Services
to provision and manage cloud environments. Build and maintain CI/CD pipelines using GitHub Actions, AWS CodePipeline, CodeBuild, Jenkins. Integrate monitoring and observability tools such as AWS CloudWatch, Prometheus, Grafana for infrastructure and model health tracking. Ensure software quality through Test-Driven Development (TDD), unit testing frameworks (e.g., pytest, unittest), and automated integration tests. Conduct regular code reviews, participate More ❯