roads to help teams get their apps up and running quickly in a consistent manner. Event-Driven: We share data through an event-driven system powered by MSK. Observability: Datadog is used for comprehensive logging and monitoring. Databases: We use a combination of MongoDB and AWS relational databases. Automation and CI/CD: Deployments are highly automated using Jenkins pipelines and GitHub
App Services) Java or Microsoft technologies, Azure platform and .NET ecosystem. Database technologies and platforms like SQL, NoSQL, Data Lake, Snowflake, Databricks, MongoDB, Oracle. Observability platforms like Splunk, Dynatrace, Datadog, Grafana. About Royal London: We're the UK's largest mutual life, pensions and investment company, offering protection, long-term savings and asset management products and services. Our People Promise to
Docker, Helm, Python and Bash scripting. Supporting developers and other engineers with any pipeline issues. General management and operations of GitLab source control system. Monitoring and management of our Datadog instance, including log management and APM
Newcastle Upon Tyne, Tyne And Wear, United Kingdom
Strive Gaming
in on-call rotations and help troubleshoot production issues. Tech Requirements (must have): IaC - Infrastructure as Code (Terraform), AWS, Argo, strong Linux skills, ELK/LGTM stack knowledge, Prometheus, Datadog, Grafana, Kubernetes, Helm, Docker, Bash/shell scripting, Git, strong security mindset. Tech (nice to have): CrowdStrike, on-prem/ESXi, Windows Server, Entra ID
for rapid application development. Experience in DevOps practices, including infrastructure-as-code (IaC), monitoring, alerting, and incident management. Familiarity with observability tools (Grafana, Prometheus) and APM tools (New Relic, Datadog). Knowledge of microservices architecture, event-driven design, and scalability best practices. Experience implementing data compliance standards (GDPR, ISO 27001). Find.co is an Equal Opportunity/Affirmative Action employer
experience at leading tech companies, startups, and the enterprise software sphere. Our backers include Y Combinator, Index Ventures, and stellar angels such as the founders of Looker, GitHub, MuleSoft, Datadog and UiPath.
improvement. Familiarity with some of our tech stack: PostgreSQL, or a similar RDBMS, particularly in Amazon RDS at scale; Docker and Kubernetes (we use Amazon EKS in production); Python; Datadog, or a similar logging/monitoring tool; messaging queues, event-driven async processing or similar technologies (we use RabbitMQ); Terraform, or a similar infrastructure-as-code tool. Experience with a
of the React framework, related patterns and best practices. Good understanding of UI/UX best practices and considerations. Understanding of front-end observability with tools like Sentry, LogRocket, Datadog, or New Relic. Experience with CI/CD pipelines, like GitHub Actions, ArgoCD. Awareness of common front-end security risks (e.g., XSS, CSRF). Passion for writing clean, modular, scalable
services. Familiarity with modern CI/CD pipelines such as GitLab, GitHub Actions, or CircleCI. Comfortable working with shell scripting and command line tools. Awareness of monitoring tools like Datadog. Exposure to CDNs, Kubernetes, and deployment tools like Helm. Why Join H&B Tech? Be part of a team shaping the future of health & wellness through technology. Work in a
management tools (Ansible preferred). Experience with container orchestration and management platforms (Kubernetes, Helm). Solid understanding of networking concepts. Solid understanding of security concepts. Experience with monitoring tools (Datadog, Stackdriver). Experience with at least one programming and/or scripting language (Go, Python or Bash). Skilled in presentation and documentation. Troubleshooting and analytical skills. Time and project management skills.
re looking for someone with deep expertise in: o Infrastructure as Code: Terraform, CloudFormation o Security best practices: IAM, KMS, encryption in transit/at rest, DevSecOps o Monitoring & observability: Datadog, Prometheus, Grafana, ELK, or similar What You Bring o 6+ years in DevOps or platform engineering, with experience in a technical lead role. o Proven experience designing and operating cloud
teams to address performance bottlenecks and ensure scalability. Assist engineering teams with implementing and reviewing SLOs. Continually improve observability through monitoring, alerting, and dashboards, using tools such as Datadog or Prometheus. Work with other teams to ensure it is effective and provides full coverage. Ensure the service is highly available and resilient. Champion best practices in design
Reigate, Surrey, South East, United Kingdom Hybrid / WFH Options
Client Server
in Azure (will also consider AWS or GCP experience). You have a deep understanding of cloud infrastructure and services including best practices around monitoring, scaling and security tools e.g. Datadog. You have strong scripting skills with PowerShell (or Python). You have a good knowledge of basic networking, TCP/IP. You have a good understanding of IaC, they use Pulumi
Cloud DevOps, SaaS, or observability, with 5+ years in leadership roles. Strong hands-on experience with AWS, GCP, Azure, K8s, Terraform and observability tools: Prometheus, Grafana, OpenTelemetry, ELK, Splunk, Datadog, and similar. Proficiency with metrics, logs, traces and APM. Leadership & Global Operations: Proven success leading multi-regional or global technical teams with direct management of managers. Demonstrated ability to build
should have experience with: the ability to lead and scale technical teams in multi-faceted governance environments; AWS/Azure cloud platforms and enterprise observability tools (Elastic, Grafana, Splunk, Datadog, or similar); SRE/DevOps methodologies with Python proficiency for automation and infrastructure-as-code practices. Some other highly valued skills may include: AWS or Azure cloud certifications; experience implementing
WordPress security, user roles, and performance optimisation. Desirable: past experience implementing a login/subscription service; familiarity with monorepos; experience with monitoring and logging using tools like Sentry or Datadog; experience working with design systems; DevOps knowledge, CI/CD, AWS; experience working with Docker containers; experience using generative AI frameworks such as OpenAI models and APIs or any AI
technical, ambiguous domains. Strong knowledge of REST APIs, distributed system design, and performance optimization. Experience with both SQL and NoSQL data stores, caching layers, and observability tooling (e.g., Prometheus, Datadog). Nice to have: Experience deploying or integrating LLMs or NLP models in production systems. Comfortable balancing short-term execution with long-term architectural thinking. Passion for building highly
years of professional experience, some of which should have a focus on observability. Excellent knowledge and hands-on experience with monitoring, logging, and tracing tools such as Prometheus, VictoriaMetrics, Grafana, Datadog, New Relic, OpenTelemetry, ELK Stack, or similar. Experience with high-volume data storage (structured and unstructured). A strong technical background, with current capabilities and willingness to get hands on
stack; Experience with AWS Cloud services; Experience with Bash or Python scripting; Experience with CI/CD tools such as GitLab CI; Familiar with application performance monitoring tools like Datadog, New Relic; Familiar with Docker orchestrators such as Amazon ECS or Kubernetes; Familiar with Git; Ability to solve issues with clear methods while knowing when to take intuitive leaps. Nice
teams to enhance stored procedures, functions, and triggers. Automation & Scripting: Write SQL scripts and automate routine DBA tasks using Bash or Python. Learn and utilize monitoring tools such as Datadog and Grafana. Your Education & Qualifications: Bachelor’s degree in Computer Science, Information Technology, or a related field (or equivalent experience). Minimum 2 years of experience in PostgreSQL database administration
infrastructure and deployment workflows (PowerShell is preferred). Collaborate with engineering teams to support platform reliability and enable delivery. Maintain visibility and awareness through monitoring and logging tools such as Datadog, Azure Monitor, App Insights, etc. Support incident resolution and participate in an on-call rota to help maintain service uptime. Qualifications. The Requirements: Essential Experience: Proven experience in a Platform
Reigate, Surrey, England, United Kingdom Hybrid / WFH Options
Client Server Ltd
in Azure (will also consider AWS or GCP experience). You have a deep understanding of cloud infrastructure and services including best practices around monitoring, scaling and security tools e.g. Datadog. You have strong scripting skills with PowerShell (or Python). You have a good knowledge of basic networking, TCP/IP. You have a good understanding of IaC, they use Pulumi
etc.) At least 4 years of experience in industry working with the above tools and technologies. Nice to Have: Experience with other parts of our tech stack (ClickHouse, Postgres, Datadog, Dagster, Temporal, AWS, GitLab and RunAI). Any domain knowledge within Life Sciences and Biotech/Techbio, specifically around high-throughput sequencing, ML models, and bioinformatics. What we can offer in