OpenShift, Google Anthos, AWS EKS Anywhere, AWS Outposts A strong background in Go, Python or Java Experience with Postgres Experience with observability tools, e.g. Prometheus, Grafana Benefits Highly competitive salary Pension plan (match up to 5%) Life insurance - three times annual salary Competitive maternity (six months fully paid) and paternity More ❯
Knowledge of CI/CD best practices and tools (e.g. GitHub Actions, Jenkins, CodePipeline) Exposure to monitoring and observability tools for ML systems (e.g. Prometheus, Grafana, DataDog, WhyLabs, Evidently, etc.) Experience in building parallelised or distributed model inference pipelines Nice-to-Have Skills Familiarity with feature stores and model registries More ❯
AWS, such as S3, FSX, EKS, SQS, SNS, Kinesis, AmazonMQ, DynamoDB, GKE, CloudStorage, PubSub, Filestore Knowledge of modern observability technologies such as ELK, Splunk, Prometheus, Grafana, Micrometer "What-if" thinking while designing or reviewing solutions, to foresee or catch potential problems as early in the development process as possible More ❯
About Us WorkForce Software, an ADP Company, is the first global provider of workforce management solutions with integrated employee experience capabilities. The company's WorkForce Suite adapts to each organization's needs, no matter how unique their pay rules, labor More ❯
as Code tools. Strong understanding of CI/CD practices, with experience setting up and managing pipelines. Familiarity with monitoring, alerting, and logging tools (Prometheus, Grafana, etc.). Comfortable working in agile teams and across multiple cloud platforms. Strong communication skills and a collaborative, problem-solving mindset. Willingness to take More ❯
london, south east england, United Kingdom Hybrid / WFH Options
RED Global
as Code tools. Strong understanding of CI/CD practices, with experience setting up and managing pipelines. Familiarity with monitoring, alerting, and logging tools (Prometheus, Grafana, etc.). Comfortable working in agile teams and across multiple cloud platforms. Strong communication skills and a collaborative, problem-solving mindset. Willingness to take More ❯
and trading opportunities. Experience - 8+ years in Python (or Golang) in a DevOps or SRE capacity. Strong Linux experience Understanding of Kubernetes, Public Cloud, Prometheus, Grafana, Telemetry and general Observability Experience with Gitlab, Bitbucket and CI (GitHub/CI/Bamboo) Willingness to engage in technical discussion and commit to More ❯
RDS Route 53 IAM EKS Secrets Manager ECR Kubernetes Helm Kops Ingress/Egress Terraform Deployment of AWS Resources Pipelines OCI Observability ELK Dynatrace Prometheus Others Vault RedHat Skills working in a secure environment and ability to adhere to security principles Experience in support organisation More ❯
london (city of london), south east england, United Kingdom
Ncounter Technology Recruitment
and trading opportunities. Experience - 8+ years in Python (or Golang) in a DevOps or SRE capacity. Strong Linux experience Understanding of Kubernetes, Public Cloud, Prometheus, Grafana, Telemetry and general Observability Experience with Gitlab, Bitbucket and CI (GitHub/CI/Bamboo) Willingness to engage in technical discussion and commit to More ❯
as a software engineer. Over 5 years in data engineering and pipeline development in high-volume production environments. Experience with monitoring systems such as Prometheus, Grafana, Zabbix, or Datadog. Experience in fintech or trading industries. Strong object-oriented development skills and software engineering fundamentals. Hands-on experience with cloud data More ❯
complex technical information to diverse stakeholders. Strong presentation abilities to establish credibility with executives. Preferred Qualifications: Certifications in AWS, Kubernetes, or observability platforms (e.g., Prometheus Certified). Experience in a data-driven or SaaS environment. MBA or relevant leadership experience. This role is located in London and is a hybrid More ❯
london, south east england, united kingdom Hybrid / WFH Options
ITR Partners
complex technical information to diverse stakeholders. Strong presentation abilities to establish credibility with executives. Preferred Qualifications: Certifications in AWS, Kubernetes, or observability platforms (e.g., Prometheus Certified). Experience in a data-driven or SaaS environment. MBA or relevant leadership experience. This role is located in London and is a hybrid More ❯
and business stakeholders to align technical solutions with strategic goals and provide guidance to junior engineers. Light DevOps & Infrastructure: Contribute to operational monitoring (e.g., Prometheus), CI/CD pipelines, and other essential tooling. Performance Optimization: Identify and fix performance bottlenecks in multi-threaded systems, ensuring high availability and low latency. More ❯
systems design and share responsibility with them in diagnosing, resolving, and preventing production issues. What We Value Experience with monitoring systems using tools like Prometheus and writing health checks Interest in learning and managing technologies like Spark, Hadoop, Elasticsearch, and Cassandra Familiarity with deploying GPUs Moderate experience with TCP/ More ❯
Search, Discovery & Insights, Company Profiles, Workflow & Efficiency, and many more. Our stack Java 17/21, Spring Boot (MVC, JDBC, Security). Postgres, Docker, Prometheus, K8s, Elastic. Team Stream Development Lead, 2 BE, 1 FE, 1 SDET As a qualified expert, You will Help architect, design, and develop complex, large More ❯
network configurations and deployments using infrastructure-as-code (IaC) tools e.g. Ansible, Terraform, or Python scripts. Monitoring and logging network performance using tools like Prometheus, Grafana, or ELK stack. Experience with developing and maintaining air gapped networks. Experience with Voice over IP (VoIP) technologies including SIP, RTP protocols, and implementation More ❯
CDP/LLDP) and network engineering, management, and operations. Experience with search and analytics engines/big data tools (OpenSearch, Kafka, Kibana, Telegraf, InfluxDB, Prometheus). Our Preferred Qualifications for this role: Basic understanding of AI and ML algorithms, including model training, testing, and deployment. Hands-on project experience in More ❯
leeds, west yorkshire, yorkshire and the humber, United Kingdom Hybrid / WFH Options
KPMG UK
GCP) Knowledge of database systems and models. Ability to use a wide variety of open-source technologies. Experience with logging/monitoring tools (DataDog, StackDriver, Prometheus, etc.). Knowledge of test automation frameworks. To discuss this or wider Technology roles with our recruitment team, all you need to do is apply, create More ❯
functional teams, including DevOps, Engineering, Service Reliability, and Service Delivery teams. Technical Expertise: In-depth knowledge of open-source and commercial observability tools (e.g., Prometheus, Grafana, NewRelic). Expertise in cloud environments (e.g., AWS, Azure) and infrastructure as code (IaC) tools like Terraform. Monitoring and Observability: Experience in creating and More ❯
build tools; Ansible or other IT Automation/software provisioning tools; JIRA, Confluence; * Experience in monitoring/reporting tools such as Splunk, Grafana/Prometheus etc * Experience in Agile practices * Working knowledge of environment monitoring tools such as GCO, NewRelic, Prometheus, Grafana. * Collaboration Skills: Proactive can-do attitude; A creative More ❯
Terraform or CloudFormation, and manage resources for optimal performance. Monitor, troubleshoot, and resolve incidents, optimizing systems to ensure reliability and minimize downtime. Implement monitoring (Prometheus, Grafana, Datadog) and set up alerting systems to proactively address issues and ensure scalability. Work with DevOps, engineering, and security teams to improve application deployment … networking services. Proficiency in using Terraform, CloudFormation, Ansible, or similar tools for automating infrastructure. Strong experience in monitoring and incident response using tools like Prometheus, Grafana, and ELK Stack. Strong scripting skills in Python, Bash, Go, or Ruby for automating tasks and building custom tools. Experience with CI/CD More ❯