native applications working in a continuous delivery environment Modern observability practices Nice to have Not vital, but you'll have the edge if you also have experience with: Grafana Prometheus Kotlin or a least the willingness to learn it or have worked in: an eCommerce organisation a shipping/logistics/exports organisation What you bring Agile : Test-Driven Development More ❯
native applications Working in a Continuous Delivery environment Modern observability practices Nice to have Not vital, but you'll have the edge if you also have experience with: Grafana Prometheus Kotlin or a least the willingness to learn it Batch processing data pipelines or have worked in: an eCommerce organisation a shipping/logistics/exports organisation What you bring More ❯
availability and security. Automation & CI/CD: Implement and manage CI/CD pipelines for efficient deployment, testing, and monitoring of applications. Observability & Monitoring: Develop comprehensive monitoring solutions using Prometheus, Grafana, ELK stack, or similar tools to improve system reliability. Security & Compliance: Apply best practices for cloud security, IAM policies, and compliance frameworks (SOC2, ISO 27001, etc.). Incident Response … clusters). Proficiency in scripting and automation using Python, Bash, or Go. Experience with Infrastructure as Code (Terraform, CloudFormation, or Ansible). Familiarity with monitoring, logging, and observability tools (Prometheus, Grafana, Datadog, ELK, etc.). Strong understanding of networking concepts (VPC, Load Balancers, DNS, Firewalls). Experience with DevOps methodologies, CI/CD pipelines, and GitOps practices. Experience with high More ❯
RabbitMQ Azure DevOps. We are big fans of Azure Pipelines! Some of our services are migrating away from TeamCity and Octopus Deploy Our observability stack is Splunk, Grafana and Prometheus You As a software engineer, you will be: Part of a cross-functional team working with Product Managers, Testers and DevOps engineers Writing well-tested and maintainable code Getting involved More ❯
stack: Languages: TypeScript, Javascript Libraries and frameworks: gRPC, Redux, React Native, React, Next.js Datastores: Vitess, MySQL, CockroachDB, BigQuery, Redis Infrastructure: Google Cloud Platform, Kubernetes, Docker, PubSub, Terraform Monitoring: Grafana, Prometheus, Sentry, Metabase About you: You are a frontend developer with at least 5 years' experience You are fast and love to deliver incredible code You can reduce complex problems to More ❯
stack: Languages: TypeScript, Javascript Libraries and frameworks: gRPC, Redux, React Native, React, Next.js Datastores: Vitess, MySQL, CockroachDB, BigQuery, Redis Infrastructure: Google Cloud Platform, Kubernetes, Docker, PubSub, Terraform Monitoring: Grafana, Prometheus, Sentry, Metabase About you: You are a frontend developer with at least 2 years' experience You are fast and love to deliver incredible code You can reduce complex problems to More ❯
APIs, accessibility-minded experiences, and working in a continuous delivery environment. Nice-to-have Not vital, but you'll have the edge if you also have experience with: Grafana, Prometheus/Loki, Kubernetes, Sentry, and Storybook. or have worked in: an eCommerce organisation, or a shipping/logistics/exports organisation. Location This is a hybrid role. Our offices are More ❯
React on the Frontend. Tech & Data Science stack: Kubernetes & Docker on Google Cloud Python 3: Pandas, RabbitMQ, Celery, Flask, SciPy, NumPy, Dash, Plotly, Matplotlib Javascript, React, Redux PostgreSQL, Redis Prometheus, Alert Manager, DataDog If you joined the company in a Data Science role you would be working on sophisticated pricing algorithms which would enable companies in the entertainment industry to More ❯
Computer Science, Engineering, or related field. Strong programming skills in Go (ideally) Rust or C++. Solid experience in building and supporting complex backend systems at scale. Experience with Elasticsearch, Prometheus, Grafana and/or Datadog. Exposure either AWS or GCP plus IaC, (Terraform or similar) would be beneficial. Knowledge with open-source storage tools (Ceph, Minio, JuiceFS or Fuse) and More ❯
Site Reliability Engineering function they're building from scratch. They talked about production infrastructure, optimisation, automation and focusing on the deployment process rather than the build. We discussed Kubernetes, Prometheus and API Gateways. Most importantly, they spoke like they knew what the hell they were on about. Not just about SRE, but on the whole Engineering process. This is a More ❯
RabbitMQ Azure DevOps. We are big fans of Azure Pipelines! Some of our services are migrating away from TeamCity and Octopus Deploy Our observability stack is Splunk, Grafana and Prometheus You As a software engineer, you will be: Part of a cross-functional team working with Product Managers, Testers and DevOps engineers Writing well-tested and maintainable code Getting involved More ❯
reporting. Develop and implement TOC strategy, staffing models, and documentation standards. Participate in systems architecture, new tech evaluation, and vendor selection. Manage operational workflows, reporting systems (e.g., Zabbix, Grafana, Prometheus), and support international broadcast teams. Collaborate with leadership on technical direction and TOC transformation. Skills/Must Have: 5-7+ years in a technical leadership role within a TOC More ❯
Real Time data, designing systems that can elastically scale to handle surges in throughput and demand. Hands-on experience with modern technologies such as Kubernetes, Kafka, RocksDB, MongoDB, MemSQL, Prometheus, Tempo, and Snowflake is highly desirable. Exposure to cloud-native tooling and practices, with an emphasis on DevOps, cloud computing, Kubernetes, and stream processing is a strong advantage. Comfortable working More ❯
projects and other activities as required. Experience and Skills Essential Experience and demonstratable knowledge of SRE best practices Expert in Git and Gitops Expert in logging and monitoring solutions (Prometheus, Grafana etc.) Demonstratable knowledge of Cloud Expert knowledge of Kubernetes Proficient ability to communicate in English (Written and Verbal) Understanding of non-functional testing Significant DevOps experience Desirable Proven ability More ❯
evaluate and implement new technologies, and oversee their integration. Collaborate with external vendors and partners to ensure high-quality service delivery. Utilise and develop monitoring systems (e.g., Zabbix, Grafana, Prometheus) and oversee client reporting systems. Skills and Qualifications 5-7+ years' experience in a technical leadership role within a 24/7 broadcast, network operations centre (NOC), or Master More ❯
concurrent users (e.g., multi-tenant PostgreSQL, sharded MySQL). Strong backend fundamentals around concurrency, caching, indexing and distributed systems trade-offs. Proven track record of setting SLOs, building dashboards (Prometheus/Grafana, OpenTelemetry, etc.) and tuning alerts. Comfort with Kubernetes , IaC and cloud-native patterns; can debug from network to application layer. Start-up bias for action: you prioritise high More ❯
to work effectively with internal teams and customer-facing stakeholders. Technologies we use Golang AWS, CDK (TypeScript), Lambda, SQS, EventBridge, RDS, DynamoDB, OpenSearch Github, Github Actions Loki, Tempo, Grafana, Prometheus Event-driven architecture and domain-driven design How we reward our team Dynamic working environment with a diverse and driven team Huge opportunity for learning in a high growth environment More ❯
such as Python, Bash or Shell Develop and implement CI/CD pipelines for application deployment on Kubernetes Monitor the health of the platform and applications using tools like Prometheus, Grafana or ELK stack Assist with capacity planning and load testing of the platform and applications Develop and enforce best practices for building container-based applications Troubleshoot issues within the … Experience with Azure cloud platform Experience with Infrastructure as Code (IaC) tools like Terraform Familiarity with CI/CD tools like Argocd , jenkins etc Experience with monitoring tools like Prometheus , Grafana , ELK stack etc Strong scripting skills (Python, Bash, etc.) Ability to troubleshoot complex networking issues BS degree in Computer Science, Engineering or a related field Additional requirements Work experience More ❯
data. • Technically sound experience of Unix environments. • Good understanding of Networking principles. Desirable • Proficient in scripting and automation (preferably Shell and Python). • Familiarity with monitoring tools (e.g. Grafana, Prometheus, Elastic). Diversity & Inclusion Nomura is an equal opportunity employer. We value diversity and are committed to creating an inclusive environment for all our employees. We do not discriminate on More ❯
identified and progressed to resolution. Responsible for generating, developing and curating high quality Networks focused reports and KPIs using the local reporting systems of the platforms including Grafana and Prometheus Who we are The UK's fastest broadband network. The nation's best-loved mobile brand. And, one of the UK's biggest companies too. Diverse, high performing teams - jam More ❯
a strong sense of ownership, and determination. Openness to constructive feedback and value the ideas and opinions of others. Our technologies Cloud Provider: Amazon AWS Monitoring & Logging: ELK (EFK), Prometheus, Grafana Why joining Smartcat might be your best move so far Fully remote team We are a global team of 200+ enthusiastic people spread across 30+ countries. We have been More ❯
years of experience with containerization and orchestration (Docker + Kubernetes) and confidence operating cloud infrastructures Front-end development experience a plus DevOps skills, especially leveraging open source tools (Kibana, Prometheus, Grafana) a plus Sound understanding of agile software development best practices including CI/CD, testing, monitoring, alerting and documentation Being Cloud agnostic means not being able to use any … managed Kubertnes service, so therefore build own Kubernete - experience with only managed Kubernetes would not be applicable for the role Kubernetes experience on at least one cloud Prometheus stack (Grafana, Prometheus, alertmanager Kubernetes upgrade and maintenance experience Any logging infrastructure experience Terraform Ansible Shell/Python Scripting Gitlab pipelines (or any other CI/CD) Desirable experience: Kubernetes security Kubernetes More ❯