plus; Technical experience with: Docker, gRPC/protobuf, REST, WebSockets, OAuth2, MQTT; Cloud experience: AWS, serverless, DynamoDB, CDK, IoT Core; Framework experience: React, Django, CUDA, PyTorch, TensorFlow, NumPy, GoogleTest, Grafana, Embassy, Tokio; Knowledge of or affinity with hardware control at embedded or PC level (I2C, SPI, GPIO, etc.); Knowledge of cloud (AWS) or front-end (React); Curious and independent mindset
Edinburgh, Midlothian, United Kingdom Hybrid / WFH Options
Parity Network
CDK, S3, Aurora), TypeScript, React, and modern JavaScript. Proven experience in leading agile software teams. Experience with infrastructure-as-code, DevOps tooling, testing frameworks (e.g. Cypress), and monitoring (e.g. Grafana). Background in digital public services, accessibility, and GDS standards is highly desirable. Why Apply? This is a pivotal role supporting the digital infrastructure for our client. You will shape innovative
Edinburgh, City of Edinburgh, United Kingdom Hybrid / WFH Options
Parity Network
CDK, S3, Aurora), TypeScript, React, and modern JavaScript. Proven experience in leading agile software teams. Experience with infrastructure-as-code, DevOps tooling, testing frameworks (e.g. Cypress), and monitoring (e.g. Grafana). Background in digital public services, accessibility, and GDS standards is highly desirable. Why Apply? This is a pivotal role supporting the digital infrastructure for our client. You will shape innovative
a must. Educational background in Computer Science (BSc with professional experience, or MSc with internship experience). Exposure to distributed systems and WebSockets is a plus. Familiarity with Postgres, Grafana, and handling terabytes of data in high-scale, high-uptime environments. Preference for candidates who've worked on large, business-critical systems, not just small-scale, non-continuous uptime projects.
London, England, United Kingdom Hybrid / WFH Options
Cryptio
technical and non-technical audiences. An analytical mindset, a passion for solving complex problems, and an interest in the crypto and fintech industry. Bonus: Temporal.io, gRPC. Tools: Docker, GitLab, Grafana stack. Infra: AWS, S3, Kubernetes. Interview Process: Talent Screen - 45 min; Technical Interview - 90 min; Team Interview - 45 min; Leadership Interview - 45 min. Benefits: Fully-Remote/Hybrid – The choice
London, England, United Kingdom Hybrid / WFH Options
Mustard Systems
systems with a focus on flexibility, speed, and accuracy. Our Core Tech Stack Tools: RabbitMQ and ZeroMQ for messaging; PostgreSQL for data storage; WebSockets for frontend communication. Observability: Prometheus, Grafana, Zabbix. Requirements Must-Haves: At least 5 years’ experience in a Software Development role, with solid Python expertise and a track record of building robust, high-quality solutions. At least
as RabbitMQ. Hands-on experience with databases like ClickHouse/InfluxDB and MariaDB. Proficiency in integration testing to ensure robust system interactions. Familiarity with monitoring tools like Grafana and CI/CD systems such as GitHub Actions. Experience with secure authentication workflows, including protocols like OAuth2 and OpenID Connect, and token-based authentication (e.g., JWT). Proven
mandatory, proficiency in German can be beneficial for seamless communication within the team. Tools & Platforms: Jira, Wiki, CASM (ticketing tool), Zabbix (monitoring tool), Icinga (monitoring tool), Jumpserver, Redfish API, Grafana, NetBox. Additional Information: What do we offer you? International, positive, dynamic and motivated work environment. Hybrid work model (teleworking/on-site). Flexible schedule. Continuous training: Preparation for certifications
London, England, United Kingdom Hybrid / WFH Options
Gaia
PostgreSQL and SQS. Our production services run on AWS and Vercel. We automate infrastructure and configuration using Pulumi and GitHub Actions. Our observability stack is made up of CloudWatch, Grafana and Sentry. Throughout the interview process, we are looking for... You can take on big, loosely defined projects, break them down, and deliver them in an intuitive and digestible way
London, England, United Kingdom Hybrid / WFH Options
SEFE Marketing & Trading Ltd
troubleshoot issues in production for delivered items, ensuring timely resolution and minimal business impact (using tools such as ELK: Elasticsearch, Kibana). Experience with data visualization tools (e.g., Grafana, Power BI, Tableau). Proficient in data analysis for various data formats (e.g., JSON, CSV, SQL). A strong understanding of the Commodities Trading landscape (both physical and financial).
Livingston, New Jersey, United States Hybrid / WFH Options
CoreWeave
000+ switches or nodes). Familiarity with Ansible. Understanding of Redfish API for system management. Experience with NVUE (NVIDIA User Experience) or a similar network-based CLI. Experience with Grafana/PromQL. Proficiency in at least one language (e.g., Python, Go). "The base pay and target total cash for this position range from $109,000 to $145,000 and
Formulate technical solutions aligned with business goals. Willingness to learn new technologies and solve problems in uncertain conditions. Our technologies: Cloud Provider: Amazon AWS; Monitoring & Logging: ELK (EFK), Prometheus, Grafana. Why joining Smartcat might be your best move so far: Fully remote team. We are a global team of 200+ enthusiastic people spread across 30+ countries. We have been fully
London, England, United Kingdom Hybrid / WFH Options
incident.io
to help us ship quickly and safely; Google Cloud Platform (GKE, Cloud SQL), with BigQuery, PubSub, Cloud Storage and more; Terraform for infrastructure, CircleCI for testing and deployment; Sentry, Grafana + Prometheus, and Kibana for observability; Anthropic, Vertex, and OpenAI along with our own internal AI tooling for building AI features. We choose tools carefully and optimise for developer experience
roles, with strong RCA skills. Familiarity with ITIL platforms like ServiceNow or similar problem and incident management tools. Good understanding of MS SQL is a plus. Good understanding of Grafana to support data-driven RCA. Technical expertise: a deep understanding of the product they handle as well as the processes behind it. Strong analytical, problem solving and prioritisation skills.
New York City (Manhattan), New York, United States Hybrid / WFH Options
DeepL
preventative measures to avoid similar issues in the future. Qualities we look for: You have deep domain knowledge in Kubernetes and logging and monitoring tools such as Prometheus and Grafana. You have a background in software development, ideally in backend applications; we are currently using C# but are hiring on a language-agnostic basis. We'd especially love to talk
London, England, United Kingdom Hybrid / WFH Options
Smartcat Platform Inc
strong sense of ownership, and determination. Openness to constructive feedback and value the ideas and opinions of others. Our technologies: Cloud Provider: Amazon AWS; Monitoring & Logging: ELK (EFK), Prometheus, Grafana. Why joining Smartcat might be your best move so far: Fully remote team. We are a global team of 200+ enthusiastic people spread across 30+ countries. We have been fully
London, England, United Kingdom Hybrid / WFH Options
Proton
and Linux. Background in performance tuning and problem diagnosis at the OS, database and application levels. Experience with software configuration management tools (e.g. Puppet) and observability solutions (e.g. Prometheus, Grafana, Loki, PMM, Icinga). Basic knowledge of networking, server hardware, storage and operating systems relevant for databases. Willingness and ability to learn quickly. Readiness to share knowledge and collaborate with
City of London, London, United Kingdom Hybrid / WFH Options
Captur
model explainability, gathering and sharing insights from existing images, running experiments with new data sources and answering client queries about model performance with data-driven insights. Create and maintain Grafana dashboards to visualise key performance metrics. Support the Customer Success team with data-driven solutions. Required Qualifications: SQL (BigQuery and PostgreSQL) proficiency and Python programming skills. Experience with Google Cloud … working with machine learning teams; understanding of the core concepts of model evaluation techniques and metrics, and when each is appropriate. Experience working with data visualisation tools, particularly Grafana. Strong communication skills for client-facing interactions. Collaborative mindset for cross-functional work. Nice to Have: Experience with image-based machine learning. Experience with AI/ML deployment workflows. Experience
London, England, United Kingdom Hybrid / WFH Options
Captur
model explainability, gathering and sharing insights from existing images, running experiments with new data sources and answering client queries about model performance with data-driven insights. Create and maintain Grafana dashboards to visualise key performance metrics. Support the Customer Success team with data-driven solutions. Required Qualifications: SQL (BigQuery and PostgreSQL) proficiency and Python programming skills. Experience with Google Cloud … working with machine learning teams; understanding of the core concepts of model evaluation techniques and metrics, and when each is appropriate. Experience working with data visualisation tools, particularly Grafana. Strong communication skills for client-facing interactions. Collaborative mindset for cross-functional work. Nice to Have: Experience with image-based machine learning. Experience with AI/ML deployment workflows. Experience
Slough, England, United Kingdom Hybrid / WFH Options
JR United Kingdom
experience with cutting-edge infrastructure, tooling, and observability platforms. What You'll Work With: Bash & Python scripting; Networking fundamentals (switches, routers, packet analysis with tcpdump, Wireshark); InfluxDB, Kapacitor, Telegraf, Grafana (desirable but not essential). What You'll Get: Exposure to real-time, mission-critical systems; A chance to grow in a high-performance, collaborative engineering environment; Mentorship from experienced engineers