and troubleshooting Proficiency with using Puppet for configuration management, automation and system provisioning Hands-on experience in monitoring and observability platforms such as Grafana, Prometheus, Elasticsearch, Jaeger Experience with cloud architectures such as GCP or AWS Familiarity with SQL databases and broker systems such as Kafka You are a solution More ❯
level (Ubuntu/RHEL). Working knowledge of at least one scripting language, e.g. Python, Bash. Monitoring and logging of systems using tools like Prometheus, Grafana, or ELK. Experience with IaC tools (ideally Ansible). Working knowledge of version control methodologies and practices. Desirable: Ideally a minimum of 5 years More ❯
containerised workloads, and Kubernetes clusters. Familiarity with teleoperation systems, game streaming, or low-latency video/control pipelines. Experience monitoring infrastructure with tools like Prometheus, Grafana, Netdata, or similar. Ability to write basic scripts in Python, Bash, or similar for automation and monitoring. Strong documentation and communication skills for cross More ❯
containerised workloads, and Kubernetes clusters. Familiarity with teleoperation systems, game streaming, or low-latency video/control pipelines. Experience monitoring infrastructure with tools like Prometheus, Grafana, Netdata, or similar. Ability to write basic scripts in Python, Bash, or similar for automation and monitoring. Strong documentation and communication skills for cross More ❯
as back-end levels. Working experience with various types of databases (MongoDB, Oracle, SQL, InfluxDB). Familiarity with monitoring tools like New Relic, Grafana, Prometheus, or Datadog. Knowledge of AWS cloud environments and performance testing in cloud-based architectures. Knowledge of containerisation tools (Docker, Kubernetes). Experience with CI/ More ❯
e.g. NMS applications, controllers, orchestrators, supervisory systems, etc.). Experience and understanding of Kafka messaging bus. Experience in using monitoring tools like Nagios, Grafana, Prometheus and Kibana is desired. Deployment environment: Kubernetes, Docker, microservices. Experience on Talos Kubernetes is an advantage. Deployment experience in cloud-based environment AWS/Azure More ❯
plus. Proven delivery of secure, scalable web apps with backend-for-frontend architecture and CDN integration. Proficiency with monitoring and observability tools such as Prometheus, Grafana, and OpenSearch. Deep understanding of CI/CD practices, GitLab pipelines, infrastructure as code, and centralized monitoring. Track record of mentoring and coaching engineers More ❯
ensuring reliability and performance. Experience in implementing observability, instrumenting applications to provide insights into system performance. Hands-on experience with tools such as Dynatrace, Prometheus and OpenTelemetry for monitoring, tracing, and real-time alerting is highly sought after. An understanding of microservices and container orchestration with the ability to optimise More ❯
Desired: Experience in AWS DevOps Experience of working with Spring Cloud Experience of working with Postfix Experience of using with monitoring tools - Grafana and Prometheus Creates a cohesive working environment and build high performing teams Strong stakeholder management Expertise in Risk management More ❯
Collaboration: Work closely with cross-functional teams (Development, Operations, and Security) to streamline processes and improve system reliability. Performance & Security: Implement robust monitoring (CloudWatch, Prometheus) and ensure systems are secure, compliant, and optimized for performance. Cost Management: Work on cost-efficient infrastructure strategies without compromising performance. What's in this More ❯
will have strong knowledge of Python, Golang or similar programming and scripting languages. You will have strong knowledge of Infrastructure metric visualisation using Splunk, Prometheus and Grafana. You will preferably have expertise with container technologies like Docker and orchestration platforms like Kubernetes. QRT is an equal opportunity employer. We welcome More ❯
switch configuration and patching. Experience with MS Active Directory configuration and management. Hardware asset management, including printer configuration. Nexus security scanning. Monitoring using Grafana, Prometheus, Alert Manager and Node exporter. Back-up and restore monitoring/validation. Problem diagnosis, troubleshooting, and fixing through log aggregation and analysis. Experience with Git More ❯
or Unix Shell. Deep understanding of software applications and technical processes, with emerging expertise in specific disciplines. Experience with observability tools like Grafana, Dynatrace, Prometheus, Datadog, Splunk, including monitoring, SLO alerting, and telemetry collection. Knowledge of CI/CD tools such as Jenkins, GitLab, Terraform. Experience with containers and orchestration More ❯
Git). Excellent problem-solving skills and attention to detail. Strong communication and teamwork abilities. Preferred Qualifications: Experience with monitoring and logging tools (e.g., Prometheus, Grafana, ELK Stack). Familiarity with Agile methodologies and DevOps practices. Benefits: Enhanced leave - 38 days inclusive of 8 UK Public Holidays Private Health Care More ❯
etc.) * Experience in observability such as white and black box monitoring, service level objective alerting, and telemetry collection using tools such as Grafana, Dynatrace, Prometheus, Datadog, Splunk, and others * Experience with continuous integration and continuous delivery tools like Jenkins, GitLab, or Terraform * Familiarity with container and container orchestration such as More ❯
Belfast, City of Belfast, County Antrim, United Kingdom Hybrid / WFH Options
Cala Consulting
build automation tools, build orchestration and environment automation. e.g. Jenkins, GitHub, GitLab, CloudFormation, Others Experience in implementing tools for logging, monitoring and alerting. e.g. Prometheus, Splunk, CloudWatch, Nagios Experience in creating and automating virtual machines in public and private clouds An understanding or experience of high availability, business continuity and More ❯
Employment Type: Permanent
Salary: £40000 - £60000/annum pension, share options, health
should have experience in setting up monitoring, logging, and alerting for improved system observability. Tech Stack: GitHub, Docker, Kubernetes, Ansible, Terraform, Gitlab, Synk, Vault, Prometheus, Grafana, Elk, Splunk What's in it for you At Accenture in addition to a competitive basic salary, you will also have an extensive benefits More ❯
skills and a track record of cross-team collaboration. Nice to have: Kubernetes expertise (GKE/AKS/EKS) and container-native observability stacks (Prometheus/Grafana). NoSQL experience (Firestore, Cosmos DB, DynamoDB, MongoDB). Experience with game-backend scales, real-time services or hybrid cloud/bare-metal More ❯
/service once live, including: observability best practises, logging best practises, error reporting, debugging and live incident management. Experience using tools such as Grafana, Prometheus, New Relic etc. Highly proficient in Python. Experience in data modelling and design patterns; in-depth knowledge of relational databases (PostgreSQL) and familiarity with data More ❯