or all of the following: configuration management, orchestration, CI/CD, infrastructure monitoring and telemetry. Experience using Agile (e.g. Kanban or Scrum). Familiarity with telemetry tools such as Splunk, Grafana. Experience with Web frameworks (BENTO, REACT, Angular, DJANGO). Bloomberg is an equal opportunity employer and we value diversity at our company. We do not discriminate on the basis of age
level of competency. Understanding of configuration management, orchestration, CI/CD, infrastructure monitoring and telemetry. Experience using Agile (e.g. Kanban or Scrum). Familiarity with telemetry tools such as Splunk, Grafana. Experience with Web frameworks (BENTO, REACT, Angular, DJANGO). Bloomberg is an equal opportunity employer and we value diversity at our company. We do not discriminate on the basis of age
Python, Go, or similar languages for automation and scripting. Expert-level knowledge of AWS Networking, TLS, and security best practices. Experience with container orchestration (Kubernetes, EKS) and observability tools (Grafana, ELK). A passion for innovation, problem-solving, and delivering high-impact solutions. Experience leading/managing junior engineers. Significant experience with Control Tower and deploying landing zones. For this
leading SRE, DevOps, or infrastructure teams. Strong background in systems engineering, cloud platforms (AWS, Azure), and container orchestration (Kubernetes). Proficiency in monitoring, alerting, and incident management tools (Prometheus, Grafana, PagerDuty). Solid understanding of networking, distributed systems, and performance tuning. Excellent communication, leadership, and stakeholder management skills. Preferred Qualifications: Experience in a high-scale, high-availability SaaS environment. Familiarity
field. Preferred Qualifications: Proven expertise and experience with database technologies including NoSQL databases like MongoDB, RDBMS such as Postgres and MySQL. Exposure to Docker, Kubernetes, AWS, Helm, Terraform, Vault, Grafana, ELK Stack, New Relic. Relevant experience in the maintenance of data APIs and data lake architectures, including experience with Apache Iceberg, Trino/Presto, Clickhouse, Snowflake, BigQuery. Master's degree
Python, Go, or similar languages for automation and scripting. Expert-level knowledge of AWS Networking, TLS, and security best practices. Experience with container orchestration (Kubernetes, EKS) and observability tools (Grafana, ELK). A passion for innovation, problem-solving, and delivering high-impact solutions. Why Work For Us? 25 days holiday + bank holidays. Up to 5% employer pension contribution. Educational
of version management software (e.g. Git, Mercurial) and issue tracking systems (e.g. Jira). Experience or understanding of common operations tooling such as Elastic Stack, StatsD/Graphite/Grafana, Sensu, Jenkins, or other CI/CD tooling. An understanding of the responsibilities of a system administrator (e.g. networking, filesystems, environment management). Relevant qualifications: e.g. a degree in Computer
to managing our infrastructure, using Terraform.
- We follow a GitOps approach to managing our Kubernetes configuration, using ArgoCD and Helm.
- We manage a high-availability metrics collection system using Grafana, Thanos & Prometheus. We're in the process of transitioning to OpenTelemetry and Honeycomb for our application telemetry (traces and metrics).
- We manage a data pipeline using Pub/Sub
applications (Flux, risk/PnL). Act as subject matter expert for trading floor systems (Cloud 9, CME, TT, ICE), ensuring minimal downtime. Drive operational excellence through automation, monitoring (Grafana), process improvements, and cross-team collaboration. Communicate effectively with traders, tech leadership, and vendors, delivering clear updates and incident reports. Lead Support Engineer Key Skills Required: Advanced desktop support with
IPoE 13. Proven ability to work independently & collaboratively in a fast-paced technical environment. 14. Demonstrable knowledge of the telecommunications industry & technologies. 15. Experience of working with Prometheus and Grafana
s building something exceptional. Tech Snapshot (don't worry if you don't know it all): Kotlin, TypeScript, Terraform, Azure/AWS/GCP, Temporal, Postgres, graph databases, OpenTelemetry, Grafana, containerised dev environments, CI/CD pipelines. Perks & Culture: Competitive salary + EMI share options. Breakfast and dinner on tap, plus snacks that raise the bar. Regular socials + full
Technologies (Kubernetes, OpenShift). Messaging Technologies (Kafka, Solace, TIBCO). Database/Data Store/Data Query Technologies (SQL Server, Trino, Mongo, S3). Observability Technologies (OpenTelemetry, Elastic Stack/ELK, Grafana). This job description provides a high-level review of the types of work performed. Other job-related duties may be assigned as required. What we'll provide: By joining Citi
or Ansible is considered advantageous.
* Comprehensive understanding of virtualisation platforms and container orchestration tools enables you to propose scalable solutions confidently.
* Familiarity with monitoring stacks such as Prometheus or Grafana allows you to provide valuable insights into system performance for clients.
* Exceptional interpersonal skills empower you to build rapport with stakeholders at all levels while communicating complex ideas clearly.
* Outstanding
Willingness to tackle challenging problems and make meaningful contributions to the success of both the team and the organization. Nice to Have: Experience with Docker and Kubernetes. Familiarity with Grafana and other monitoring tools. Prior experience with Scala and Java is an advantage. What we offer: You will have the chance to be involved in something impactful, large-scale, and
London, Bloomsbury, United Kingdom Hybrid / WFH Options
IntaPeople
GitHub Actions, or AWS CodePipeline. Support and train technical staff in upskilling necessary for ongoing operations. Monitor and ensure system reliability, availability, and performance using tools like CloudWatch, Prometheus, Icinga2, Grafana, and Datadog. Automate deployment, scaling, and management of containerized applications using Docker and Kubernetes. Desirable skills: Travis CI. Monitoring – Grafana, Icinga, Prometheus. RabbitMQ/AMQP. Working knowledge of security
EKS (Elastic Kubernetes Service) and Azure Kubernetes Service (AKS), ensuring their reliability, availability, and performance. Monitoring and Alerting: Monitor application performance and system health through observability tools (e.g., Prometheus, Grafana, ELK stack), proactively identifying and resolving issues to ensure high availability and rapid incident response. Security and IAM: Implement security best practices, managing Identity and Access Management (IAM) policies across … container orchestration technologies, particularly Kubernetes. Familiarity with version control systems (e.g., Git) and CI/CD pipelines for efficient code deployment. Knowledge of monitoring and logging tools (e.g., Prometheus, Grafana, ELK stack) to ensure system observability. Strong experience with SQL databases and AWS DynamoDB, focusing on performance tuning and optimization. Proven ability to design and manage RESTful APIs, ensuring their
and deploying services with Java and Spring Boot. Comfort working in a cloud-native environment - Kubernetes (EKS), containers, scaling etc. An interest in observability, using tools like Prometheus and Grafana to keep services healthy and understand usage patterns. Familiarity with AWS services and how to integrate them into modern applications. A keen focus on quality and security, baking testing and
Private Networks, DWDM and Optical Networking, Data Centre builds and design fundamentals, etc. Experience with network modelling. Eagerness to learn new technologies and mentor others. Experience with Telemetry: Splunk, Grafana, Humio. Experience with continuous integration and deployment tools. Experience implementing, maintaining and troubleshooting MPLS, BGP, OSPF, IGMP, PIM related internal and external network routing issues in a production environment. Knowledge
Private Networks, DWDM and Optical Networking, Data Centre builds and design fundamentals, etc. Experience with network modelling. Eagerness to learn new technologies and mentor others. Experience with Telemetry: Splunk, Grafana, Humio. Experience with continuous integration and deployment tools. Experience implementing, maintaining and troubleshooting MPLS, BGP, OSPF, IGMP, TCP, HTTPS, DNS, PIM related internal and external network routing issues in a
in virtualisation platforms as well as container orchestration technologies, including associated tooling, to optimise resource utilisation across diverse workloads.
* Hands-on knowledge of monitoring stacks such as Prometheus or Grafana along with log management solutions like ELK/EFK or their equivalents.
* Proven ability to diagnose intricate technical problems using structured troubleshooting methodologies that minimise disruption to business operations.
* Exceptional
building and operating systems at scale. Advanced knowledge of configuration management systems, such as: Puppet, Chef, Ansible, or related systems. Significant experience of monitoring frameworks (such as CloudWatch, Datadog, Grafana, Elastic or similar). Amazon is an equal opportunities employer. We believe passionately that employing a diverse workforce is central to our success. We make recruiting decisions based on your experience
London, South East, England, United Kingdom Hybrid / WFH Options
Eligo Recruitment
ll Bring: Strong experience with GCP, Terraform, and Infrastructure-as-Code. Deep knowledge of cloud networking, security automation, and compliance standards. Proficiency in CI/CD pipelines, monitoring tools (Grafana, Datadog), and scripting. A collaborative mindset with excellent communication and mentoring skills. Why Join? Shape a next-gen AI infrastructure with autonomy and purpose. Hybrid working with regular meetups in
Unix based systems. Experience with cloud-based platforms (e.g. AWS). Experience with real-time data messaging (e.g. Redis, websockets). Experience with deployment and monitoring tools, e.g. supervisor, Docker, Grafana, Nagios, etc. Excellent problem-solving skills and attention to detail. Preferred/Desirable Experience: Experience in the finance or cryptocurrency sectors. Experience with exchange connectors implementation. Familiarity with low latency