An existing DoD 8570 Baseline IAT Certification; higher levels preferred. • Experience with serverless computing (AWS Lambda, Azure Functions, etc.). • Familiarity with logging and monitoring platforms like CloudWatch, Prometheus, Datadog, or Splunk. • Experience with CI/CD tools like Jenkins, GitHub Actions, or GitLab CI. • An adjudicated Counterintelligence Polygraph. Soft Skills: • Self-driven • Strong communication and interpersonal skills. • Ability to More ❯
Who we are We are a London tech startup on the lookout for bright, motivated and self-driven individuals to join the team. Who you are You are a DevOps/Site Reliability Engineer with experience managing complex infrastructure and More ❯
Manage cloud infrastructure (OCI, AWS, Azure, or GCP) using Infrastructure as Code tools like Terraform or Serverless Functions. Monitor system health and performance using tools like Prometheus, Grafana, or Datadog or NewRelic. Collaborate closely with development teams to automate builds, performance tests, and deployments. Ensure system security, compliance, and best practices are followed in deployment pipelines. Ensure network security with More ❯
such as Azure, AWS or GCP Proficiency using Infrastructure as Code (IaC) tools such as Terraform (preferred), Ansible, or CloudFormation. Experience with monitoring, observability and logging tools such as DataDog, Prometheus, Grafana, or similar. Proven track record of maintaining highly-available and performant production environments. Ability to identify and implement effective mitigation strategies and operational playbooks. Useful/Bonus Skills More ❯
. Preferred Qualifications Experience in hybrid cloud environments and integration with on-premise systems. Background in DevOps, SRE, or Infrastructure Engineering. Knowledge of monitoring/logging tools (e.g., CloudWatch, Datadog, Prometheus, ELK). Experience with enterprise security and compliance frameworks (e.g., ISO 27001, SOC 2, GDPR). Familiarity with cost modeling and optimization strategies in AWS. More ❯
and other relevant tools. Security Best Practices: IAM, MFA, data encryption, firewall configurations. Programming/Scripting: Python, Terraform, or similar languages. Event-Driven Architectures: Kafka. Monitoring and Logging: Datadog, ELK Stack, Prometheus, etc. Experience in agile methodologies and DevOps practices. Location: Hybrid. Office located in London. (Hayes area). Office presence required: Yes. Frequency: 2-3 times a week at More ❯
development in general, with skills in a high-level language (e.g., Python, JavaScript, TypeScript, Java) and familiarity with modern development practices Understanding of Cloud Observability, Monitoring, and Tracing tools (Datadog, CloudWatch, Jaeger, ELK) and how best to leverage to support effective MTTR and mitigate high CFR Our UK benefits: Stock Options Annual Performance Bonus or Commissions Pension matched up to More ❯
Release and Deployment Manager Rockville, MD Public Trust Required Our Clientis seeking qualified, creative, and customer-focused technical representative to provide Release and Deployment Management support remotely and at our customer site as the Release Manager. You will work in More ❯
invites you to shape the future of parking systems, where you'll be at the forefront of developing a cutting-edge backend, optimizing CI pipelines, and enhancing monitoring with Datadog, all while ensuring the seamless operation of thousands of parking lots. If you're a passionate engineer with a hunger for continuous improvement and proficiency in a variety of programming … Experience with Terraform or similar tools for infrastructure automation. High-Throughput Systems: Strong experience in real production projects handling large-scale data flows. Monitoring & Observability: Proficiency in tools like Datadog, Prometheus, and Grafana. Security & Networking: Solid understanding of networking principles, security best practices, and cloud security. Agile & Fast-Paced Environments: Experience in agile teams, working in dynamic, fast-growing tech More ❯
tools and container orchestration (Docker, ECS, or Kubernetes) Solid understanding of system/network security, IAM, VPC, and secure cloud configurations Familiarity with monitoring and logging tools (e.g., CloudWatch, Datadog, Prometheus, Sentry) Experience with Postgres, Redis, and scalable backend systems Bonus: Exposure to fintech or regulated environments, GDPR/data compliance, or SOC2 setup A little about us Our founders More ❯
Proficiency with cloud providers like Azure, AWS, or GCP. Experience with Infrastructure as Code tools such as Terraform (preferred), Ansible, or CloudFormation. Experience with monitoring and logging tools like DataDog, Prometheus, Grafana, or similar. A track record of maintaining highly available and performant production environments. Ability to develop effective mitigation strategies and operational playbooks. Useful/Bonus Skills: Experience with More ❯
using Kubernetes or similar tools in production deployments Experience with: AWS security best practices including IAM, security groups, encryption, and compliance frameworks Monitoring tools such as CloudWatch, Prometheus, Grafana, DataDog, or NewRelic Infrastructure as Code using Terraform Containerised CI/CD solutions Linux system administration, including shell scripting and system optimisation Desirable Skills Experience with AWS services such as SQS More ❯
Familiarity with Infrastructure as Code and DevOps practices. Knowledge of Hyper-V management. Understanding of networking, security, and system administration (Linux/Windows). Experience with monitoring tools (e.g., DataDog, CloudWatch, Azure Monitor). Strong communication and collaboration skills. Responsibilities: Deploying and managing Kubernetes clusters, including networking, storage, and security. Collaborating with development and platform teams to deliver scalable, secure More ❯
GitLab CI). Write clean, production-grade code in Python (Scala is a bonus). Build infrastructure using Terraform, AWS CloudFormation, or SAM. Drive observability across the platform using Datadog or CloudWatch. Actively mentor Data Engineers and Associates, and lead technical discussions and design sessions. Key requirements: Must-Have: Strong experience with AWS services: Glue, Lambda, S3, Athena, Step Functions … operate services in production. Good to Have: Experience with Scala for data applications. Familiarity with serverless/event-driven architectures. Experience designing scalable, low-latency data services. Exposure to Datadog or CloudWatch monitoring tools. Nice to Have: Experience with LLM-powered applications or OpenAI APIs . Professional experience in a similar environment or high-scale system. Key Roles and Responsibilities More ❯
needed About You 5+ years' experience in Site Reliability Engineer roles Expert+ level Linux administration, scripting, and troubleshooting Demonstrable knowledge of Observability tools (Prometheus/Grafana, New Relic, Splunk, DataDog) Comprehensive experience with AWS (Amazon Web Services) and its core capabilities (VPC, EC2, ECS, Route53, Fargate, ALB/NLB distributions, etc) Extensive experience with cloud automation and infrastructure-as-code More ❯
needed About You 5+ years' experience in Site Reliability Engineer roles Expert+ level Linux administration, scripting, and troubleshooting Demonstrable knowledge of Observability tools (Prometheus/Grafana, New Relic, Splunk, DataDog) Comprehensive experience with AWS (Amazon Web Services) and its core capabilities (VPC, EC2, ECS, Route53, Fargate, ALB/NLB distributions, etc) Extensive experience with cloud automation and infrastructure-as-code More ❯
client portals, CMS-driven corporate sites, or digital channels for financial institutions Strong expertise in Cloud platforms (Azure preferred), CI/CD pipelines (Jenkins, GitLab CI, etc.), Monitoring & alerting (Datadog, Prometheus, ELK, Grafana), Infrastructure as Code (Terraform, CloudFormation), and Scripting (Python, Bash, or similar) Experience with Salesforce Experience Cloud, Kaltura, SiteCore or similar systems used for customer-facing web platforms More ❯
client portals, CMS-driven corporate sites, or digital channels for financial institutions Strong expertise in Cloud platforms (Azure preferred), CI/CD pipelines (Jenkins, GitLab CI, etc.), Monitoring & alerting (Datadog, Prometheus, ELK, Grafana), Infrastructure as Code (Terraform, CloudFormation), and Scripting (Python, Bash, or similar) Experience with Salesforce Experience Cloud, Kaltura, SiteCore or similar systems used for customer-facing web platforms More ❯
Arlington, Texas, United States Hybrid / WFH Options
Fisher Investments
client portals, CMS-driven corporate sites, or digital channels for financial institutions Strong expertise in Cloud platforms (Azure preferred), CI/CD pipelines (Jenkins, GitLab CI, etc.), Monitoring & alerting (Datadog, Prometheus, ELK, Grafana), Infrastructure as Code (Terraform, CloudFormation), and Scripting (Python, Bash, or similar) Experience with Salesforce Experience Cloud, Kaltura, SiteCore or similar systems used for customer-facing web platforms More ❯
Garland, Texas, United States Hybrid / WFH Options
Fisher Investments
client portals, CMS-driven corporate sites, or digital channels for financial institutions Strong expertise in Cloud platforms (Azure preferred), CI/CD pipelines (Jenkins, GitLab CI, etc.), Monitoring & alerting (Datadog, Prometheus, ELK, Grafana), Infrastructure as Code (Terraform, CloudFormation), and Scripting (Python, Bash, or similar) Experience with Salesforce Experience Cloud, Kaltura, SiteCore or similar systems used for customer-facing web platforms More ❯
distributed systems, microservices architecture, and RESTful API design. Hands-on experience with Kubernetes and container orchestration. Familiarity with monitoring, alerting, and logging tools (e.g., Prometheus, Grafana, ELK stack, or Datadog). Experience with Elastic will be highly helpful with this position. Hands-on experience with incident response, including designing and improving incident management processes. Expertise in Observability practices, including metrics More ❯
Gresham, Oregon, United States Hybrid / WFH Options
Fisher Investments
client portals, CMS-driven corporate sites, or digital channels for financial institutions Strong expertise in Cloud platforms (Azure preferred), CI/CD pipelines (Jenkins, GitLab CI, etc.), Monitoring & alerting (Datadog, Prometheus, ELK, Grafana), Infrastructure as Code (Terraform, CloudFormation), and Scripting (Python, Bash, or similar) Experience with Salesforce Experience Cloud, Kaltura, SiteCore or similar systems used for customer-facing web platforms More ❯
Portland, Oregon, United States Hybrid / WFH Options
Fisher Investments
client portals, CMS-driven corporate sites, or digital channels for financial institutions Strong expertise in Cloud platforms (Azure preferred), CI/CD pipelines (Jenkins, GitLab CI, etc.), Monitoring & alerting (Datadog, Prometheus, ELK, Grafana), Infrastructure as Code (Terraform, CloudFormation), and Scripting (Python, Bash, or similar) Experience with Salesforce Experience Cloud, Kaltura, SiteCore or similar systems used for customer-facing web platforms More ❯