PyTorch, LangChain, or similartechnologies Demonstrated ability to leadcross-functional teams and operate within complex enterpriseecosystems Familiarity with monitoring,observability, and platform telemetry tools (e.g., Prometheus,Grafana, Azure Monitor) Exceptionalcommunication and stakeholder engagement skills to partner withbusiness, technical, and governanceteams Experience managing platform SLAs,incident management, and continuous improvement cycles inhigh More ❯
build tools; Ansible or other IT Automation/software provisioning tools; JIRA, Confluence; * Experience in monitoring/reporting tools such as Splunk, Grafana/Prometheus etc * Experience in Agile practices * Working knowledge of environment monitoring tools such as GCO, NewRelic, Prometheus, Grafana. * Collaboration Skills: Proactive can-do attitude; A creative More ❯
and reliability across production and non-production environments. This includes working on incident response, capacity planning, WAN optimization, and system observability using tools like Prometheus and Grafana . Key Responsibilities: Administer and maintain Solace PubSub+ appliances and software brokers across environments (on-prem and cloud). Provide production support for … messaging-related incidents, including root cause analysis and resolution. Monitor system performance and health using Prometheus and Grafana ; proactively identify and address anomalies. Configure and optimize Solace across WAN environments , ensuring low-latency, secure, and reliable messaging. Collaborate with development and application support teams to troubleshoot message flow issues and … in a 24x7 enterprise environment. Experience working with distributed systems over WAN , with an understanding of networking, latency, and failover strategies. Solid experience with Prometheus and Grafana for system monitoring and alerting. Proficiency in troubleshooting message delivery, persistence, and topic routing. Experience with capacity management , performance tuning, and system scaling. More ❯
3.10+), JavaScript and TypeScript for frontend Tools: RabbitMQ and ZeroMQ for messaging; PostgreSQL for data storage; Websockets for frontend communication Environment: Linux servers Observability: Prometheus, Grafana, Zabbix Benefits: Working alongside other extremely talented and driven engineers Extremely lucrative salary, bonus upto 30% and excellent benefits Greenfield Python/Golang work More ❯
years of professional work experience It would be a plus if the applicant had e xperience with production monitoring and logging tools (i.e. CloudWatch, Prometheus, OpenSearch/Elasticsearch, ELF More ❯
big fans of Azure Pipelines! Some of our services are migrating away from TeamCity and Octopus Deploy Our observability stack is Splunk, Grafana and Prometheus You As a software engineer, you will be: Part of a cross-functional team working with Product Managers, Testers and DevOps engineers Writing well-tested More ❯
london, south east england, United Kingdom Hybrid / WFH Options
CipherTek Recruitment
data points, either via APIs or other appropriate methods, to ensure real-time decision-making capabilities. Integrate with Instrumentation Platforms : Integrate the platform with Prometheus and Geneos for continuous monitoring, diagnostics, and system health checks. Desired technical Skills: Java Expertise: Extensive experience with Core Java , focusing on low-level performance More ❯
london, south east england, United Kingdom Hybrid / WFH Options
Tenth Revolution Group
experience of 5+ years DevOps expertise of 5+ years GenAI pilots (CICD pipelines, install LLM) LLM applications, Langchain, Conda Kubernetes CICD Pipeline build experience Prometheus To find out more apply with job post or contact o.king@tenthrevolution.com More ❯
london (city of london), south east england, United Kingdom Hybrid / WFH Options
Tenth Revolution Group
experience of 5+ years DevOps expertise of 5+ years GenAI pilots (CICD pipelines, install LLM) LLM applications, Langchain, Conda Kubernetes CICD Pipeline build experience Prometheus To find out more apply with job post or contact o.king@tenthrevolution.com More ❯
london (west end), south east england, United Kingdom Hybrid / WFH Options
Tenth Revolution Group
experience of 5+ years DevOps expertise of 5+ years GenAI pilots (CICD pipelines, install LLM) LLM applications, Langchain, Conda Kubernetes CICD Pipeline build experience Prometheus To find out more apply with job post or contact o.king@tenthrevolution.com More ❯
Be a key player involved in decision-making and collaborating with other stakeholders in the team 📈 💻 Tech Stack: React, TypeScript, Go, GCP, Docker, Terraform, Prometheus You’ll be a great fit if you have: A degree in Computer Science (or related field) from a top global university Strong fullstack experience More ❯
TeamCity, or similar tools. Establish and maintain key performance metrics through baseline vs. benchmarking analysis. Conduct in-depth performance analysis using tools like Grafana, Prometheus, AWS CloudWatch, and Azure Insights. Work closely with engineers and operational teams to diagnose and troubleshoot performance issues. What We Offer As well as a … an equal opportunity employer. Desired Experience Expertise in performance testing tools, such as JMeter, Gatling, LoadRunner, etc. Hands-on use of monitoring tools, i.e. Prometheus, Grafana, AWS CloudWatch, or similar Experience with Jenkins, TeamCity, Kubernetes, Docker, RabbitMQ (or similar) Ability to script and modify solutions in Python, Bash, Java, etc. More ❯
TeamCity, or similar tools. Establish and maintain key performance metrics through baseline vs. benchmarking analysis. Conduct in-depth performance analysis using tools like Grafana, Prometheus, AWS CloudWatch, and Azure Insights. Work closely with engineers and operational teams to diagnose and troubleshoot performance issues. What We Offer As well as a … an equal opportunity employer. Desired Experience Expertise in performance testing tools, such as JMeter, Gatling, LoadRunner, etc. Hands-on use of monitoring tools, i.e. Prometheus, Grafana, AWS CloudWatch, or similar Experience with Jenkins, TeamCity, Kubernetes, Docker, RabbitMQ (or similar) Ability to script and modify solutions in Python, Bash, Java, etc. More ❯
london, south east england, United Kingdom Hybrid / WFH Options
Maxwell Bond
Senior Developer, you’ll work with Java (17+), SpringBoot, Kotlin and NoSQL databases . They’ve got their own internal cloud and toolings using Prometheus and Grafana. As their Senior Developer, you’ll be: A key advocate for best coding practice, standards, and innovation. TDD approach is key for this More ❯
Operations function supporting custom applications. Desirable Requirements: Knowledge of service management processes (Incident, Problem, Change Management). Performance monitoring tools (Azure Monitor, App Insights, Prometheus & Grafana, SolarWinds). Network and firewall configuration. Understanding of Azure data services. Experience with identity management and RBAC. Understanding of Azure security principles. Knowledge of More ❯
and applications across their entire IT estate. You’ll help drive the vision, design and implementation of monitoring and observability systems including OpenTelemetry, Grafana, Prometheus and Splunk etc. Working side by side with DevOps teams you’ll also have the chance to work with containers and Kubernetes, OpenShift, Docker and … monitoring, DevOps and automation tools. Requirements: Excellent previous experience in a similar Observability/Monitoring role. Experience of engineering and supporting solutions (OpenTelemetry, Grafana, Prometheus, Splunk etc) Experience with tools such as Jenkins, Ansible or Puppet Good knowledge of Linux and infrastructure support Experience of CI/CD, Cloud (AWS More ❯
and applications across their entire IT estate. You’ll help drive the vision, design and implementation of monitoring and observability systems including OpenTelemetry, Grafana, Prometheus and Splunk etc. Working side by side with DevOps teams you’ll also have the chance to work with containers and Kubernetes, OpenShift, Docker and … monitoring, DevOps and automation tools. Requirements: Excellent previous experience in a similar Observability/Monitoring role. Experience of engineering and supporting solutions (OpenTelemetry, Grafana, Prometheus, Splunk etc) Experience with tools such as Jenkins, Ansible or Puppet Good knowledge of Linux and infrastructure support Experience of CI/CD, Cloud (AWS More ❯
Farnborough, midlands, United Kingdom Hybrid / WFH Options
Searchability NS&D
INNOVATIVE, SECURE PLATFORMS FOR DEFENCE & NATIONAL SECURITY! Work on secure-by-design platforms in a mission-led environment Tech stack: Kubernetes, Terraform, Jenkins, Git, Prometheus, Python Salary up to £95,000 DOE + Benefits Farnborough-based – Hybrid working model Must hold active SC or DV Clearance (Eligibility) To apply, email … best practices Develop scalable, secure infrastructure using Terraform and Ansible Evangelise GitOps and support deployment automation Monitor and improve platform performance using tools like Prometheus and Grafana Provide technical oversight and guidance to cross-functional teams Stay ahead of emerging tech trends to enhance platform capabilities WHAT I'M LOOKING … CI/CD tooling (e.g., Jenkins, GitLab CI/CD) Solid understanding of Git and version control best practices Experience with monitoring tools like Prometheus and Grafana Comfortable in fast-paced, agile environments Excellent communication and problem-solving skills Active SC or DV clearance required NICE TO HAVE Experience with More ❯
Farnborough, south east england, United Kingdom Hybrid / WFH Options
Searchability NS&D
INNOVATIVE, SECURE PLATFORMS FOR DEFENCE & NATIONAL SECURITY! Work on secure-by-design platforms in a mission-led environment Tech stack: Kubernetes, Terraform, Jenkins, Git, Prometheus, Python Salary up to £95,000 DOE + Benefits Farnborough-based – Hybrid working model Must hold active SC or DV Clearance (Eligibility) To apply, email … best practices Develop scalable, secure infrastructure using Terraform and Ansible Evangelise GitOps and support deployment automation Monitor and improve platform performance using tools like Prometheus and Grafana Provide technical oversight and guidance to cross-functional teams Stay ahead of emerging tech trends to enhance platform capabilities WHAT I'M LOOKING … CI/CD tooling (e.g., Jenkins, GitLab CI/CD) Solid understanding of Git and version control best practices Experience with monitoring tools like Prometheus and Grafana Comfortable in fast-paced, agile environments Excellent communication and problem-solving skills Active SC or DV clearance required NICE TO HAVE Experience with More ❯
HAProxy, Nginx) and network monitoring tools. Experience in DNS management and troubleshooting. Experience in network security best practices. Proficiency in monitoring and observability tools (Prometheus, Grafana, Splunk). Proficiency in at least one scripting language (Python, Bash) for automation. Experience with CI/CD pipeline management and DevOps practices. Strong … system performance. Experience in tools like df, du, lsblk, and fdisk for managing and troubleshooting file systems and disk partitions. Familiarity with tools like Prometheus and Grafana for monitoring and observability More ❯
dev, test, and production environments ✅ Provide hands-on support for production incidents, including root cause analysis and resolution ✅ Monitor performance and system health using Prometheus and Grafana ✅ Optimise Solace messaging across WAN environments for low-latency, secure data flow ✅ Work with application and development teams to resolve integration and message … of experience working with Solace PubSub+ in an enterprise setting 🔹 Strong background in supporting distributed systems, ideally in 24/7 environments 🔹 Proficiency with Prometheus , Grafana , and system observability practices 🔹 Solid understanding of WAN messaging, latency management, and failover strategies 🔹 Scripting skills (e.g., Bash, Python) and experience on Linux/ More ❯
Southampton, Hampshire, United Kingdom Hybrid / WFH Options
LA International Computer Consultants Ltd
to join the team on a long term programme of work. Key Skills/Experience: AWS services such as :- AWS Systems Manager, CloudWatch, Managed Prometheus, S3 click apply for full job details More ❯
london, south east england, United Kingdom Hybrid / WFH Options
Maxwell Bond
their Senior Frontend Developer, you’ll work with React and Javascript/Typescript. They’ve got their own internal cloud and tooling's, using Prometheus and Grafana. As their Senior Frontend Developer, you’ll be: A key advocate of best coding practice and standards with a TDD approach as you More ❯
You'll be responsible for ensuring world-class production environment reliability while implementing sophisticated monitoring solutions through their technology stack, including Splunk, Telegraf/Prometheus, Grafana, and PagerDuty. Role Impact: You'll drive excellence across production and non-production environments, optimizing trading data management, service delivery, and server operations. Your … operations experience in financial technology or similar industry Strong AWS cloud architecture expertise and advanced Linux systems administration Demonstrated success with monitoring solutions (Grafana, Prometheus) Experience optimizing build and release processes in trading environments Networking and troubleshooting capabilities Advanced Python and Bash scripting for automation Extensive experience with Docker and More ❯
ideally GCP or AWS. Deployment of cloud resources using Infrastructure-as-code such as Terraform. Any experience with dashboards and alerting with Grafana using Prometheus metrics, Loki logging, and Tempo tracing to monitor and debug services would be an advantage. Strong communication skills and team player who can discuss complex … by working with other engineering, product, and support teams. Set up and monitor GCP resources using Infrastructure as code with Terraform. Working with Grafana, Prometheus and Loki to create dashboards and alerting. Be proactive in identifying and making improvements to our existing code base. Actively participate in resolving incidents that More ❯