Kubernetes; experienced in scalable, portable BI and data environments. Environment Management: Managed Dev/QA/UAT freshness, data synchronisation, and Jira-integrated release workflows. Observability & Monitoring: Implemented CloudWatch, Datadog, Prometheus, and Grafana for logging, metrics, and alerting. Troubleshooting & Problem Solving: Strong analytical and cross-functional collaboration skills; effective under pressure. Project Delivery: Managed multiple concurrent BI and data releases More ❯
pipelines (GitHub Actions, GitLab CI, Azure DevOps, Jenkins) Experience withconfiguration managementtools such asChef/Puppet Strong proficiency in scripting/programming (Python, Go, or similar) Experience with observability platforms (Datadog, New Relic, Prometheus/Grafana) Knowledge of microservices architecture and service mesh technologies Understanding of security best practices and compliance frameworks Comfortable with asynchronous collaboration tools (Slack, Teams) Agile mindset More ❯
. Strong scripting skills in Python , Bash , or similar. Familiarity with Linux administration , networking, and system security. Experience with monitoring, logging, and observability tools (e.g., Prometheus, Grafana, ELK stack, Datadog). Desirable Skills Exposure to infrastructure security best practices (e.g., CIS Benchmarks, AWS Well-Architected Framework). Knowledge of configuration management (Ansible, Chef, or Puppet). Experience with serverless architectures More ❯
City of London, London, United Kingdom Hybrid/Remote Options
Signify Technology
. Strong scripting skills in Python , Bash , or similar. Familiarity with Linux administration , networking, and system security. Experience with monitoring, logging, and observability tools (e.g., Prometheus, Grafana, ELK stack, Datadog). Desirable Skills Exposure to infrastructure security best practices (e.g., CIS Benchmarks, AWS Well-Architected Framework). Knowledge of configuration management (Ansible, Chef, or Puppet). Experience with serverless architectures More ❯
testing, and incident management. Hands on experience with Databricks , MLflow , or similar ML/ETL platforms is a plus. Bonus: Experience with container orchestration (Kubernetes) and observability tools like Datadog, Prometheus, or Grafana. Passion for building tools and platforms that empower teams and improve developer velocity. Excitement, passion and curiosity about our mission of connecting the world's health data More ❯
Kubernetes, Docker Knowledge of networking fundamentals (TCP/IP, DNS, load balancing Proficiency in Linux/Unix administration, scripting (Python, Bash, or similar Experience with monitoring tools (Prometheus, Grafana, DataDog Familiarity with containerization (Docker, Kubernetes) and cloud services. Experience with CI/CD systems (Jenkins, GitHub Actions, GitLab CI Strong analytical and problem-solving skills. Knowledge of security practices (IAM More ❯
Edinburgh, Midlothian, United Kingdom Hybrid/Remote Options
Aberdeen
tech talks to share knowledge and promote adoption of tools and practices. About the Candidate The ideal candidate will possess the following: Experience with observability tools (eg, Grafana, Prometheus, Datadog). Background in DevOps, SRE, or platform engineering with a security first mindset. Strong programming skills in languages such as .Net, JavaScript, Python or similar. Experience with CI/CD More ❯
deploying AI/ML workloads, particularly LLMs and vector-based apps. Comfortable with containerisation, Git workflows, and scripting (Bash, Python, etc.). Exposure to observability tools like Prometheus, Grafana, Datadog, or ELK. A proactive, detail-oriented approach with strong documentation and communication skills. Comfortable working with JIRA for tickets and sprint rituals. 🌟 Why Join Us? Join a team pushing the More ❯
deploying AI/ML workloads, particularly LLMs and vector-based apps. Comfortable with containerisation, Git workflows, and scripting (Bash, Python, etc.). Exposure to observability tools like Prometheus, Grafana, Datadog, or ELK. A proactive, detail-oriented approach with strong documentation and communication skills. Comfortable working with JIRA for tickets and sprint rituals. 🌟 Why Join Us? Join a team pushing the More ❯
or Windows administration, with the ability to architect secure, performant, and highly available cloud solutions. Proficiency with monitoring and log analytics tools such as AWS CloudWatch, ELK Stack, Prometheus, Datadog, or New Relic, to maintain observability and ensure operational excellence. Demonstrated leadership skills in managing complex, high-pressure situations and guiding teams through incident resolution. Exceptional communication and presentation skills More ❯
City of London, London, United Kingdom Hybrid/Remote Options
Advanced Resource Managers
or Windows administration, with the ability to architect secure, performant, and highly available cloud solutions. Proficiency with monitoring and log analytics tools such as AWS CloudWatch, ELK Stack, Prometheus, Datadog, or New Relic, to maintain observability and ensure operational excellence. Demonstrated leadership skills in managing complex, high-pressure situations and guiding teams through incident resolution. Exceptional communication and presentation skills More ❯
YAML, JSON Build Tools: Maven, Gradle, NPM, Bazel, Go Databases: RDS, SQL, MySQL, Postgres, RedShift, MongoDB, DynamoDB Security Scans: SAST, Secrets, Container, DAST, Xray, Prisma Cloud Logging and Monitoring: DataDog, Splunk, App Dynamics, ELK, Grafana About PROLIM Corporation PROLIM is a leading provider of end-to-end IT, PLM and Engineering Services and Solutions for Global 1000 companies. They understand More ❯
based architectures, and queuing technologies, i.e. RabbitMQ Experience of REST and/or GraphQL APIs Knowledge of the core AWS services: i.e. EC2/ECS, RDS, S3 Experience using DataDog or similar observability tools Knowledge of containerisation: Docker, Kubernetes, AWS Fargate etc Any experience of front-end or fullstack development using TypeScript & React Experience building software for financial services and More ❯
City of London, London, United Kingdom Hybrid/Remote Options
TreasurySpring
based architectures, and queuing technologies, i.e. RabbitMQ Experience of REST and/or GraphQL APIs Knowledge of the core AWS services: i.e. EC2/ECS, RDS, S3 Experience using DataDog or similar observability tools Knowledge of containerisation: Docker, Kubernetes, AWS Fargate etc Any experience of front-end or fullstack development using TypeScript & React Experience building software for financial services and More ❯
Strong IaC (CloudFormation), Docker, CI/CD Experience with large data migrations (PostgreSQL/DynamoDB) Strong scripting + automation skills Nice to have: Terraform, Kubernetes, geospatial data, SRE experience, DataDog/Grafana, FinOps. Join to lead DevOps strategy, automate everything, and build reliable infrastructure for a platform used by half of UK cities. Lead DevOps Engineer - Climate Tech - London/ More ❯
Strong IaC (CloudFormation), Docker, CI/CD Experience with large data migrations (PostgreSQL/DynamoDB) Strong scripting + automation skills Nice to have: Terraform, Kubernetes, geospatial data, SRE experience, DataDog/Grafana, FinOps. Join to lead DevOps strategy, automate everything, and build reliable infrastructure for a platform used by half of UK cities. Lead DevOps Engineer - Climate Tech - London/ More ❯
scripting for automation Git version control Desirable (Future-Facing Skills): Infrastructure as Code (Terraform, Pulumi, Ansible) Container orchestration (Kubernetes) Go development for microservice utilities Modern observability tools (Prometheus, Grafana, Datadog) CI/CD pipeline management (GitHub Actions, GitLab CI, Jenkins) Firewall-as-a-Service solutions (e.g., Cloudflare) Endpoint/device management (e.g., Intune, NinjaOne) Exposure to ML Ops (deployment, scaling More ❯
/CD pipelines (e.g., Jenkins, TeamCity, Concourse). Familiarity with web/application servers such as NGINX, Apache, or JBoss. Exposure to monitoring and logging tools (ELK, Nagios, Splunk, DataDog, New Relic, etc.). Understanding of security and identity management (OAuth2, SSO, ADFS, Keycloak, etc.). Experience with version control systems (Git, Bitbucket, Subversion). Working knowledge of database technologies More ❯
platforms (GCP, AWS, or Azure), containerization, CI/CD, and infrastructure-as-code Docker; Kubernetes (EKS, GKE, AKS); Jenkins, GitLab CI, or GitHub Actions; Terraform or CloudFormation; Prometheus, Grafana, Datadog, or New Relic; Slurm, Torque, LSF; MPI; Hadoop or Spark;Director of In Experience with high-performance computing, distributed systems, and observability tools Strong communication and executive presence, with the More ❯
City of London, London, United Kingdom Hybrid/Remote Options
Harnham
platforms (GCP, AWS, or Azure), containerization, CI/CD, and infrastructure-as-code Docker; Kubernetes (EKS, GKE, AKS); Jenkins, GitLab CI, or GitHub Actions; Terraform or CloudFormation; Prometheus, Grafana, Datadog, or New Relic; Slurm, Torque, LSF; MPI; Hadoop or Spark;Director of In Experience with high-performance computing, distributed systems, and observability tools Strong communication and executive presence, with the More ❯
design (REST, GraphQL) Experience with containerization (Docker, Kubernetes) and cloud-native development patterns DevOps & SRE Practices Experience implementing CI/CD pipelines and DevOps methodologies Knowledge of infrastructure monitoring (Datadog), log aggregation, and incident management Understanding of SLO/SLA definition and observability best practices Strategic & Business Acumen Ability to align technical initiatives with business objectives and articulate ROI Experience More ❯
KPIs (observability, alerting, SLAs) Hands on experience with CI/CD, containerization and orchestration tools (Docker, Kubernetes ) Knowledge of monitoring, logging, alerting and observability tools (Prometheus, Grafana, ELK Stack, Datadog ) Familiarity with infrastructure as code tools like Terraform or CloudFormation Proficiency in scripting languages (Python, Go, Bash ) and knowledge of software development best practices Strong understanding of networking, security, and More ❯
deployments (Kubernetes, Docker). Hands-on experience with data and model pipelines (feature stores, registries, distributed training, inference scaling). Knowledge of observability and monitoring stacks (Prometheus, Grafana, ELK, Datadog) for ML system performance. Experience collaborating with cross-functional teams in regulated industries (finance, insurance, health) with compliance and governance needs. Exceptional communication and leadership skills, with the ability to More ❯
complex issues to senior stakeholders and technical teams. Implementation of highly available and reliable systems, using multi-AZ and multiregional approaches Expertise with monitoring and observability tools (e.g. SolarWinds, Datadog, Azure/AWS native tools) Expertise with SLI/SLO management tools such as (ServiceNow) Expertise with Incident ticketing and change management systems such as (ServiceNow, Ivanti) Expertise with automated More ❯