Glasgow, Scotland, United Kingdom Hybrid / WFH Options
JR United Kingdom
like GitHub Actions, CircleCI, or similar. ● Experience with messaging systems (RabbitMQ, Kafka, etc.) and event-driven ● Proficiency in infrastructure as code (Terraform preferred). ● Familiarity with monitoring stacks (Prometheus, Grafana, ELK, etc.) and system tuning. ● Security-conscious mindset; experience implementing controls in regulated or financial environments is a plus. ● Excellent problem-solving skills and a proactive attitude. ● Strong communication and More ❯
Glasgow, Scotland, United Kingdom Hybrid / WFH Options
Waracle
containerization tools and platforms such as Docker, Kubernetes, EKS, AKS, etc Experienced in scripting languages such as Bash, Python, Powershell or similar. Skilled in monitoring/logging tools (Prometheus, Grafana, ELK, Azure Monitor). Jira administration skills, including workflow and integration management. Proactive individual who actively seeks to expand knowledge and keep up with industry trends. Excellent communication skills with More ❯
like Jenkins, GitLab, Terraform Experience in at least one observability tool such as Dynatrace, Datadog, New Relic, CloudWatch, AppDynamics, Splunk., Preferred Qualification Experience a plus in common SRE toolchains: Grafana, Prometheus, Elasticsearch, Kibana, Jaeger. #J-18808-Ljbffr More ❯
Enterprise repository management systems (Artifactory) Workflow management and collaboration tools (Jira, Confluence, Google Suite) Cloud infrastructure (AWS) Monitor, debug and improve system performance and reliability using tools like Datadog, Grafana, or OpenSearch Update and maintain the development experience via automated pipelines that includes timely feedback and a seamless path to release-level quality. Maintain and administer cloud infrastructure. Troubleshoot issues More ❯
Glasgow, Scotland, United Kingdom Hybrid / WFH Options
Sonos, Inc
Enterprise repository management systems (Artifactory) Workflow management and collaboration tools (Jira, Confluence, Google Suite) Cloud infrastructure (AWS) Monitor, debug and improve system performance and reliability using tools like Datadog, Grafana, or OpenSearch Own and update the development experience via automated pipelines that includes timely feedback and a seamless path to release-level quality. Maintain and administer cloud infrastructure. Research, implement More ❯
tools like Jenkins, GitLab, Terraform Experience in at least one observability tool such as Dynatrace, Datadog, New Relic, CloudWatch, AppDynamics, Splunk., Qualification Experience a plus in common SRE toolchains: Grafana, Prometheus, Elasticsearch, Kibana, Jaeger. About Us J.P. Morgan is a global leader in financial services, providing strategic advice and products to the world’s most prominent corporations, governments, wealthy individuals More ❯
Glasgow, Scotland, United Kingdom Hybrid / WFH Options
BBC Group and Public Services
provision and manage cloud environments. Build and maintain CI/CD pipelines using GitHub Actions, AWS CodePipeline, CodeBuild, Jenkins. Integrate monitoring and observability tools such as AWS CloudWatch, Prometheus, Grafana for infrastructure and model health tracking. Ensure software quality through Test-Driven Development (TDD), unit testing frameworks (e.g., pytest, unittest), and automated integration tests. Conduct regular code reviews, participate in More ❯
Terraform, Bicep, CloudFormation). Experience with CI/CD tools and containerization (Docker, Kubernetes, EKS, AKS). Scripting skills in Bash, Python, PowerShell, etc. Familiarity with monitoring tools (Prometheus, Grafana, ELK, Azure Monitor). Jira administration skills. Proactive learning attitude and excellent communication skills. We offer flexible working, 35 days holiday, pension, health support, development courses, and other benefits. We More ❯
Glasgow, Scotland, United Kingdom Hybrid / WFH Options
iO Associates - UK/EU
with internal back-end systems (data). Perform root cause analysis of production errors and resolve technical issues. Develop scripts to automate visualisation (management tools such as Sentry and Grafana). Design procedures for system troubleshooting and maintenance. Required Skills & Experience Strong experience with AWS services , particularly EKS, Kafka, Redshift, and CloudWatch . Hands-on experience with Kubernetes and container More ❯
like Docker, Kubernetes, AKS, and Helm. Programming skills in Python, Java, PowerShell, or Go, with understanding of REST APIs. Experience with observability tools such as DataDog, Prometheus, Splunk, Elasticsearch, Grafana, Azure Monitor. Experience with CI/CD tools like Git, Terraform, Jenkins. Azure cloud expertise in mission-critical environments. Additional qualifications Azure cloud certification. Understanding of operating systems such as More ❯
programming language, preferably python. Practitioner of unit testing, performance testing and BDD/acceptance testing. Understanding of OAuth 2.0 protocol for secure authorization. Proficiency with Open Telemetry tools including Grafana, Loki, Prometheus, and Cortex. Demonstrated experience in DevOps, understanding of CI/CD (Jenkins) and GitOps. Ability to articulate technical concepts effectively to diverse audiences. Strong desire and ability to More ❯
GitHub Actions, AWS CodePipeline, Jenkins, and other tools, with an emphasis on reliability, reusability, and performance. Contribute to the design and integration of monitoring and observability solutions (CloudWatch, Prometheus, Grafana) to ensure infrastructure and model health. Champion software engineering excellence through Test-Driven Development (TDD), rigorous test automation, and continuous quality assurance practices. Support architectural decisions for scalable and maintainable More ❯
language such as Python, Java Spring Boot, or .Net. Deep knowledge of software applications and technical processes, with emerging expertise in specific technical disciplines. Experience with observability tools like Grafana, Dynatrace, Prometheus, Datadog, Splunk, including monitoring, SLO alerting, and telemetry collection. Proficiency with CI/CD tools such as Jenkins, GitLab, Terraform. Experience with containerization and orchestration tools like Docker More ❯
emerging depth in one or more technical disciplines Proficiency and experience in observability such as white and black box monitoring, SLO alerting, and telemetry collection using tools such as Grafana, Dynatrace, Prometheus, Datadog, Splunk, etc. Proficiency in continuous integration and continuous delivery tools (e.g., Jenkins, GitLab, Terraform, etc.) Experience with container and container orchestration (e.g., ECS, Kubernetes, Docker, etc.) Expertise More ❯
be a plus: Experience with Observability across multiple domains (APM, Infrastructure, Synthetics, Logs, etc ) within cloud and on-premise environments using Datadog, Azure Monitor and Application Insights. NewRelic and Grafana Experience working in B2B SaaS companies Experience with cloud containers, specifically Kubernetes Responsibilities & Duties Develop: Architecture, strategy and implementations to enable or enhance the Observability and Reliability of applications and More ❯
Familiar with site reliability concepts, principles, and practices Familiar with observability such as white and black box monitoring, service level objective alerting, and telemetry collection using tools such as Grafana, Dynatrace, Prometheus, Datadog, Splunk, New Relic, CloudWatch, AppDynamics. Familiarity with containers or a common Server OS such as Linux and Windows Emerging knowledge of software, applications and technical processes within More ❯
Glasgow, Scotland, United Kingdom Hybrid / WFH Options
Rocket Lab
for customers Tools and Technologies: AWS (Kubernetes/EKS and ECS, WAF, NLB, ACM, Network Firewall, API Gateway, ACM, Secrets Manager), Terraform, Argo CD, GitHub and GitHub Actions, Prometheus, Grafana, Alertmanager, Calico, Hashicorp Vault Spire operates a hybrid work model, and this position will require you to work a minimum of three business days per week in the office. Access More ❯
least one programming language such as Python or Java. Experience maintaining a cloud-based infrastructure. Familiarity with site reliability principles, concepts, and practices. Knowledge of observability tools such as Grafana, Dynatrace, Prometheus, Datadog, Splunk, New Relic, CloudWatch, or AppDynamics. Familiarity with containers or common server operating systems like Linux and Windows. Emerging knowledge of software, applications, and technical processes within More ❯
Experience in deploying to cloud platforms (e.g., AWS, GCP or Azure), an understanding of containerisation (e.g., Docker), infrastructure-as-code software (e.g., Terraform), and observability platforms (e.g., Datadog or Grafana). Curiosity : A hunger to learn and grow your skills. Problem solving: Strong analytical problem-solving skills and attention to detail. You have the ability to break down a task More ❯
Glasgow, Scotland, United Kingdom Hybrid / WFH Options
JR United Kingdom
if you have: Previous experience on a cloud migration project ( AWS ? Azure) Familiarity with Azure DevOps and IaC tools like ARM/Bicep Monitoring/logging tools experience ( Prometheus, Grafana, ELK, etc.) #J-18808-Ljbffr More ❯
Automate software operations across clouds, considering distributed system complexities Maintain responsibility for core services, networks, and infrastructure Develop troubleshooting, capacity planning, and performance monitoring skills using tools like Prometheus, Grafana, Elasticsearch Collaborate with distributed engineering, operations, and support teams Focus on larger automation projects during dedicated development time Share expertise through design sessions, mentorship, and collaborative work Handle time-critical More ❯
Glasgow, Scotland, United Kingdom Hybrid / WFH Options
JR United Kingdom
experience. Working knowledge of HPC container runtimes (e.g., Singularity, Apptainer). Exposure to provisioning and automation tools (e.g., Ansible, PXE, Terraform). Experience with monitoring tools such as Prometheus, Grafana, and DCGM. Understanding of GPU/accelerator toolchains like CUDA or ROCm. A proactive, customer-first mindset with strong communication skills. Ability to work effectively in both individual and team More ❯
including versioning, packaging, and delivering. Good knowledge of SQL, relational databases like PostgreSQL Technical Skills nice to have: Kotlin, Typescript Angular GitHub Action pipelines REST, gRPC Elastic Stack, Prometheus, Grafana Additional Information Evooq is a global provider of technology-driven solutions for wealth managers. We are building an ecosystem that combines data, technology and investment expertise to make personalised investment More ❯
Glasgow, Scotland, United Kingdom Hybrid / WFH Options
Rocket Lab
Infrastructure as Code tools such as Terraform and Ansible Familiarity with Python data visualization libraries Experience with Typescript and React Experience implementing monitoring and alerting system using systems like Grafana, Prometheus, or Nagios Spire operates a hybrid work model, and this position will require you to work a minimum of three days per week in the office. Access to US More ❯
Glasgow, Scotland, United Kingdom Hybrid / WFH Options
JR United Kingdom
the Android SDK Infrastructure as code (we use Pulumi) Relational databases such as MySQL/PostgreSQL Proficiency in writing and maintaining test suites Monitoring and observability tools, for example Grafana/Crashlytics What we offer A competitive salary and benefits package (depending on experience). Holidays: 32 days paid leave including public holidays. Office located in the National Robotarium Edinburgh. More ❯