Southampton, Hampshire, South East, United Kingdom Hybrid / WFH Options
Spectrum It Recruitment Limited
Out If You Have: Practical experience managing large-scale Kubernetes clusters; certifications in Kubernetes are a strong bonus Hands-on familiarity with the Grafana Observability Suite, including tools like Loki, Mimir, and Tempo Background in administering or developing with popular monitoring and automation tools such as Splunk, Datadog, PagerDuty, or Rundeck Experience using configuration management platforms like Ansible, Puppet More ❯
Baltimore, Maryland, United States Hybrid / WFH Options
Archesys Inc
maintaining complex Grafana dashboards. Strong proficiency in at least one backend programming language (e.g., Python, Go, Java, Node.js). Extensive experience with various data sources for Grafana (e.g., Prometheus, Loki, Splunk, SQL databases, CloudWatch). Deep hands-on experience with AWS cloud services, including but not limited to EC2, ECS/EKS, Lambda, S3, RDS, CloudWatch, Kinesis, DynamoDB. Proven More ❯
CD systems (Jenkins, Atlassian Bitbucket cloud, GitLab, Azure DevOps); Experience with GitOps tools (ArgoCD, Flux); Knowledge of the network stack; Experience with virtualization systems; Experience in using logging systems: Loki, ELK-stack; Understanding of basic software development processes; Knowledge and practice with applying DevSecOps methodologies; Knowledge of JSON, XML, Yaml formats; Experience with Git; Understanding of building tools for More ❯
London, England, United Kingdom Hybrid / WFH Options
Circadia Health
Experience orchestrating GPU/AI workloads , MLops, or large‐language‐model serving. Knowledge of edge/IoT deployments and over‐the‐air update strategies. Exposure to observability stacks (OpenTelemetry, Loki) and security tooling (Falco, Aqua, Wiz). What We Offer Base salary £100,000 – £170,000 plus meaningful equity. Gym membership Comprehensive health, dental & vision coverage (UK & global travel More ❯
Experience orchestrating GPU/AI workloads , MLops, or large‐language‐model serving. Knowledge of edge/IoT deployments and over‐the‐air update strategies. Exposure to observability stacks (OpenTelemetry, Loki) and security tooling (Falco, Aqua, Wiz). What We Offer Base salary £100,000 – £170,000 plus meaningful equity. Gym membership Comprehensive health, dental & vision coverage (UK & global travel More ❯
United, Pennsylvania, United States Hybrid / WFH Options
Escape Velocity Entertainment Inc
in a Site Reliability, Devops, or Platform engineering role 5+ years of experience with observability, application monitoring and alerting, telemetry collection and data visualization using common tools (Prometheus, Grafana, Loki) Experience with GitOps workflows and Helix Core/Perforce versioning system Experience implementing and maintaining CI/CD systems - Buildkite, Github or Gitlab runners Expertise in IaC design using More ❯
Python. • Experience with containerization technologies (Docker, Helm, etc.). • Strong knowledge of CI/CD pipelines (Jenkins, ArgoCD, GitHub Actions). • Hands-on experience with observability tools (Prometheus, Grafana, Loki, Jaeger). • Understanding of networking, service meshes (Istio/Linkerd), and security best practices in Kubernetes. • Experience with multi-cluster and hybrid cloud Kubernetes deployments. More ❯
and build a new cloud-native IaC platform. Develop software using technologies such as Docker Compose, Terraform, Kubernetes (K8s), Python, and Go. Provision and orchestrate open-source services including Loki, Redis, Grafana, Authentik, Netbird, among others. Design and implement CI/CD pipelines to streamline deployment processes. Initially focus on AWS environments, with the goal of creating a solution More ❯
London, England, United Kingdom Hybrid / WFH Options
Bright Purple
and build a new cloud-native IaC platform. Develop software using technologies such as Docker Compose, Terraform, Kubernetes (K8s), Python, and Go. Provision and orchestrate open-source services including Loki, Redis, Grafana, Authentik, Netbird, among others. Design and implement CI/CD pipelines to streamline deployment processes. Initially focus on AWS environments, with the goal of creating a solution More ❯
Testing Azure Application Insights Azure Kubernetes Service • Platform tuning experience Beneficial skills • Bicep • CloudFlare • ARM Templates • Familiar with Octopus Deploy • Knowledge of C# .NET • Prometheus/Grafana dashboards • Seq, Loki or other application logging software • VM's Company benefits • Full private health insurance through our healthcare partner, Vitality Health • Group Life Insurance and Income Protection • BUPA Dental Insurance More ❯
City of London, London, United Kingdom Hybrid / WFH Options
Stealth AI Startup
CI/CD, building pipelines in GitHub Actions, GitLab CI or CircleCI with automated tests and security gates. An observability and SRE mindset, using tools such as Prometheus, Grafana, Loki or ELK and OpenTelemetry. A security-first but pragmatic approach, covering secrets management, image provenance and zero-trust networking. Proficiency in at least one systems language (Go, Python or More ❯
CI/CD, building pipelines in GitHub Actions, GitLab CI or CircleCI with automated tests and security gates. An observability and SRE mindset, using tools such as Prometheus, Grafana, Loki or ELK and OpenTelemetry. A security-first but pragmatic approach, covering secrets management, image provenance and zero-trust networking. Proficiency in at least one systems language (Go, Python or More ❯
United, Pennsylvania, United States Hybrid / WFH Options
Akamai
Rust, or similar Have proficiency with a configuration management tool such as Ansible, SaltStack, Chef, Puppet, or similar Possess previous experience with observability tools such as Prometheus, Nagios, Grafana, Loki, ELK, New Relic, or similar Be familiar with containerization technologies such as Docker or Podman and container orchestration (Kubernetes, Nomad) Work in a way that works for you FlexBase More ❯
United, Pennsylvania, United States Hybrid / WFH Options
Akamai
CloudBees or similar Have hands-on mastery in Linux administration and container-based platforms like Docker Have hands on experience with monitoring and logging tools, such as Prometheus, Grafana, Loki or similar, as well as APMs such as Sentry and NewRelic. Possess an understanding of best practices related to systems reliability, observability and monitoring incl. adherence to SLOs. Have More ❯
security best practices and the ability to implement security controls at the infrastructure level Experience with monitoring and logging tools like DataDog or Grafana's observability stack (Prometheus, Tempo, Loki, Grafana) Familiarity with the open standard OpenTelemetry Excellent written and verbal communication skills, we're a collaborative team! PLEASE NOTE: Our engineering teams work fully remotely across Europe but More ❯
Seattle, Washington, United States Hybrid / WFH Options
Georgia IT Inc
with Chef and Ansible required Strong leadership, initiative taking, and capacity for decision making Expert knowledge in any or all of these is a huge plus: Prometheus Operator, Grafana, Loki, ELK Stack, OpenTelemetry, Jaeger/OpenTracing (and yes, we use ALL of these!) Participate in the on-call rotation for Operations support Bachelor's degree in CS or a More ❯
United, Pennsylvania, United States Hybrid / WFH Options
Akamai
such as Python or Bash. Have experience using automation tools such as Terraform, Ansible, Jenkins, or Salt Stack Possess an understanding of following monitoring and logging tools: Prometheus, Grafana, Loki or similar Work in a way that works for you FlexBase, Akamai's Global Flexible Working Program, is based on the principles that are helping us create the best More ❯
Scottsdale, Arizona, United States Hybrid / WFH Options
Saxon Global
Change Management and Problem Management. 1-2 years of Experience in Infrastructure Support, Configuration and Release Management. 2-3 years of hands on experience with Tools including Splunk, Grafana, Loki, APPDynamics or other APM solutions 2+ years of Experience with Application support built on-prem and native cloud environments Able to code - Java, SQL, PromSQL, Shell and Python. Root More ❯
monthly worldwide, requiring robust, scalable infrastructure solutions. You will be responsible for developing scalable solutions that handle billions of monthly requests, building and optimizing monitoring systems with Grafana and Loki, maximizing system uptime, and implementing redundancy across all architectural layers. Daily tasks involve configuring and maintaining servers, networks and applications (Kubernetes, Linux, Docker, S3 & Trino), proactively resolving infrastructure bottlenecks … product owners. Requirements: 4+ years Linux experience Experience building CI/CD with Github/Argo CD or similar tools Kubernetes Experience Experience with monitoring tools like Sentry, Grafana, Loki, Prometheus Salary up to €6,500 Hybrid working, one day in the office in Heerenveen. More ❯
Herndon, Virginia, United States Hybrid / WFH Options
TalentRemedy
with Keycloak, Okta or other OIDC/SAML-based SSO & Auth services Experience building and updating secure Docker images to deploy custom applications Familiarity Metrics & Monitoring tooling (Grafana, Mimir, Loki, Tempo, or similar) Experience with Tableau and Snowflake Base Salary Range : $170,000 - $200,000 annually plus 25% annual bonus More ❯
the Grafana LGTM Stack, which can be run fully managed with Grafana Cloud or self-managed with the Grafana Enterprise Stack, both featuring scalable metrics (Grafana Mimir), logs (Grafana Loki), and traces (Grafana Tempo). Benefits: For more information about the perks and benefits of working at Grafana, please check out our careers page. Equal Opportunity Employer: At Grafana More ❯