At Cloud Bridge , we transform how businesses use AWS cloud services. We specialise in Consultancy, Managed Services, Cloud Governance, FinOps, and AI/ML to unlock AWS's full potential. Recognised as AWS's Rising Star Partner of the Year More ❯
About Us Babylon is a top VC-backed blockchain project founded by Claude. E. Shannon Award recipient, fellow of U.S. National Academy of Engineering, Professor David Tse from Stanford University. The project's mission is to bring Bitcoin's unparalleled More ❯
production and non-production environments. You will work across real-time incidents and projects, including capacity planning, WAN, and system observability using tools like Prometheus and Grafana. Requirements: Strong experience administering Solace PubSub+ messaging across environments (on-prem and Cloud) Strong knowledge of production support Configure and optimise Solace across … WAN environments, networking and latency Strong knowledge of tools such as Grafana and Prometheus Understanding of DevOps tooling and CI/CD pipelines desirable Proficiency in troubleshooting message delivery, persistence, and topic routing etc Good Linux/Unix knowledge and scripting (Bash, Python) Excellent communication and interpersonal skills skills If More ❯
production and non-production environments. You will work across real-time incidents and projects, including capacity planning, WAN, and system observability using tools like Prometheus and Grafana. Requirements: Strong experience administering Solace PubSub+ messaging across environments (on-prem and Cloud) Strong knowledge of production support Configure and optimise Solace across … WAN environments, networking and latency Strong knowledge of tools such as Grafana and Prometheus Understanding of DevOps tooling and CI/CD pipelines desirable Proficiency in troubleshooting message delivery, persistence, and topic routing etc Good Linux/Unix knowledge and scripting (Bash, Python) Excellent communication and interpersonal skills skills If More ❯
business is investing heavily in performance, uptime, and scalability. SRE is a key part of that strategy. Tech stack includes: AWS, Azure, Docker, Kubernetes, Prometheus, Grafana, Linux, and Cloudflare – but there’s full freedom to bring in new ideas and better tools if they help. What you'll be doing … re looking for: Solid experience with cloud platforms (AWS or Azure) Strong with containers and orchestration (Docker, Kubernetes) Good knowledge of monitoring and observability (Prometheus, Grafana, etc.) Strong Linux background Bonus: experience with Cloudflare or web security Flat structure. No micromanagement. Just a smart team, moving fast, and building things More ❯
to optimize data retrieval, caching, and indexing for fast responses. • Design fault-tolerant and resilient distributed systems using Kubernetes and cloud-native technologies. • Utilize Prometheus, Grafana, and Kibana for monitoring and observability of backend systems. • Optimize API performance and response times for a seamless user experience. Data Analytics & User Insights … driven architectures. • Deep understanding of data processing, analytics, and real-time event streaming. • Expertise in PostgreSQL, AWS and Kubernetes. • Proficiency in monitoring tools like Prometheus, Grafana, and Kibana. • Knowledge of security best practices, including OAuth, JWT, and data encryption. • Fluent in English with strong communication and collaboration skills. Preferred Qualifications More ❯
london (hounslow), south east england, united kingdom
eTeam
to optimize data retrieval, caching, and indexing for fast responses. • Design fault-tolerant and resilient distributed systems using Kubernetes and cloud-native technologies. • Utilize Prometheus, Grafana, and Kibana for monitoring and observability of backend systems. • Optimize API performance and response times for a seamless user experience. Data Analytics & User Insights … driven architectures. • Deep understanding of data processing, analytics, and real-time event streaming. • Expertise in PostgreSQL, AWS and Kubernetes. • Proficiency in monitoring tools like Prometheus, Grafana, and Kibana. • Knowledge of security best practices, including OAuth, JWT, and data encryption. • Fluent in English with strong communication and collaboration skills. Preferred Qualifications More ❯
environment rationalization to reduce duplication and inefficiency. Define and implement observability standards, including logging, metrics, tracing, and alerting . Use tools like New Relic , Prometheus , and Grafana , alongside building custom instrumentation for key platform services. Drive incident readiness and operational resilience by enabling actionable monitoring and alerting. Drive cloud cost … enablement frameworks. Experience with cloud-native technologies, Kubernetes, and Infrastructure as Code (Terraform, Helm, etc.). Strong understanding of observability tooling (especially New Relic, Prometheus, Grafana) and incident response best practices. Familiarity with FinOps, platform cost tracking, and infrastructure efficiency techniques. Excellent communication, leadership, and stakeholder management skills. Attract, hire More ❯
Uxbridge, Middlesex, United Kingdom Hybrid / WFH Options
Avature
You'll join a new team creating a greenfield product that will fundamentally change how giffgaff and our members (that's what we call our lovely customers) interact with each other and in the telecommunications industry-a brand new area More ❯
in London. As well as the day to day, you’ll have the opportunity to work on expanding monitoring with projects across Grafana and Prometheus, data centre and hardware projects and even networking and latency. This is the perfect opportunity for an experienced Solace engineer to take the next step … a 24/7 enterprise environment. Experience working with distributed systems over WAN. A good understanding of networking, latency, and failover strategy. Experience of Prometheus and Grafana for monitoring. Experience with capacity management, performance tuning, and system scaling. Familiarity with Linux and scripting (Bash, Python, etc.) Any knowledge of DevOps More ❯
in London. As well as the day to day, you’ll have the opportunity to work on expanding monitoring with projects across Grafana and Prometheus, data centre and hardware projects and even networking and latency. This is the perfect opportunity for an experienced Solace engineer to take the next step … a 24/7 enterprise environment. Experience working with distributed systems over WAN. A good understanding of networking, latency, and failover strategy. Experience of Prometheus and Grafana for monitoring. Experience with capacity management, performance tuning, and system scaling. Familiarity with Linux and scripting (Bash, Python, etc.) Any knowledge of DevOps More ❯
london, south east england, united kingdom Hybrid / WFH Options
Oliver Bernard
in London. As well as the day to day, you’ll have the opportunity to work on expanding monitoring with projects across Grafana and Prometheus, data centre and hardware projects and even networking and latency. This is the perfect opportunity for an experienced Solace engineer to take the next step … a 24/7 enterprise environment. Experience working with distributed systems over WAN. A good understanding of networking, latency, and failover strategy. Experience of Prometheus and Grafana for monitoring. Experience with capacity management, performance tuning, and system scaling. Familiarity with Linux and scripting (Bash, Python, etc.) Any knowledge of DevOps More ❯
london, south east england, united kingdom Hybrid / WFH Options
Oliver Bernard
in London. As well as the day to day, you’ll have the opportunity to work on expanding monitoring with projects across Grafana and Prometheus, data centre and hardware projects and even networking and latency. This is the perfect opportunity for an experienced Solace engineer to take the next step … a 24/7 enterprise environment. Experience working with distributed systems over WAN. A good understanding of networking, latency, and failover strategy. Experience of Prometheus and Grafana for monitoring. Experience with capacity management, performance tuning, and system scaling. Familiarity with Linux and scripting (Bash, Python, etc.) Any knowledge of DevOps More ❯
issues Performance tuning of JVM apps Nice to have Not vital, but you'll have the edge if you also have experience with: Kotlin Prometheus Query Language (PromQL) Grafana Prometheus or have worked in: an eCommerce organisation a shipping/logistics/exports organisation What you bring Agile: Test-Driven More ❯
Strong aptitude with SQL, Python and Airflow; Experience in Kubernetes, Docker, Django, Spark and related monitoring tools for DevOps a big plus (e.g. Grafana, Prometheus); Experience with dbt for pipeline modeling also beneficial; Skilled at shaping needs into a solid set of requirements and designing scalable solutions to meet them … DAGs with Python for ETL running on Kubernetes and Docker Django for custom app/database development Kubernetes for container management, with Grafana/Prometheus for monitoring Hugo/Markdown for data documentation More ❯
Site Reliability Engineer (SRE) - Data Platform London, England, United Kingdom Software and Services At Apple, we believe that innovation flourishes in an environment where ideas are challenged, collaboration is encouraged, and technology is pushed to its limits. This environment is More ❯
to join the team on a long term programme of work. Key Skills/Experience: * AWS services such as :- AWS Systems Manager, CloudWatch, Managed Prometheus, S3, Backup etc * MySQL MEM, * FortiGate Manager, * Wintel CA (Certificate Authority), * Trend Deep Secure AV, * ELC (CP) * Gitlab, * Log Shipper (Logstash/Kafka), * Windows Jump … to new staff * Risk/Issue identification * CSW Jira Technical skills required for this role: AWS services such as :- AWS Systems Manager, CloudWatch, Managed Prometheus, S3, Backup etc, MySQL MEM, FortiGate Manager, Wintel CA (Certificate Authority), Trend Deep Secure AV, ELC (CP), CICD, Gitlab, Log Shipper (Logstash/Kafka), Windows More ❯
to join the team on a long term programme of work. Key Skills/Experience: * AWS services such as :- AWS Systems Manager, CloudWatch, Managed Prometheus, S3, Backup etc * MySQL MEM, * FortiGate Manager, * Wintel CA (Certificate Authority), * Trend Deep Secure AV, * ELC (CP) * Gitlab, * Log Shipper (Logstash/Kafka), * Windows Jump … to new staff * Risk/Issue identification * CSW Jira Technical skills required for this role: AWS services such as :- AWS Systems Manager, CloudWatch, Managed Prometheus, S3, Backup etc, MySQL MEM, FortiGate Manager, Wintel CA (Certificate Authority), Trend Deep Secure AV, ELC (CP), CICD, Gitlab, Log Shipper (Logstash/Kafka), Windows More ❯