Oxford, Oxfordshire, United Kingdom Hybrid / WFH Options
Nominet
control (Git) and testing practices (integration, automation). Problem-solving, collaboration, and growth mindset. Nice to have: Containerisation and orchestration (Docker, Kubernetes). Infrastructure as Code (Terraform, Ansible). Observability tools (Prometheus, Grafana, Databricks). What To Expect Next: 1st stage: Introduction call with a member of the TA team (30 mins) 2nd stage: Hiring manager interview (60 mins) What More ❯
development methodology Experience with container tools (Docker, Podman) and container orchestration Experience delivering microservices in Kubernetes-based systems Experience with infrastructure as code tools (Terraform, CloudFormation) Basic understanding of observability tools like Prometheus, Grafana or similar Location: Herndon, VA Herndon offers a charming blend of small-town ambiance and modern conveniences. In historic Herndon you'll find small town charm More ❯
and also with another public cloud provider such as AWS or GCP Proficiency using Infrastructure as Code (IaC) tools such as Terraform (preferred), Ansible, or CloudFormation. Experience with monitoring, observability and logging tools such as DataDog, Prometheus, Grafana, or similar. Proven track record of maintaining highly-available and performant production environments. Ability to identify and implement effective mitigation strategies and More ❯
to design and manage secure, scalable infrastructure. Proficiency in advanced scripting and automation using Python or similar languages to create robust tools and workflows. Familiarity with logging, monitoring, and observability tools, such as ELK Stack, Prometheus, or Grafana, to ensure system reliability and performance. CompTIA Security+ or similar DoD 8570 Certification Experience developing software as part of a team. Security More ❯
of resilient, secure infrastructure for applications hosted on AWS or other FedRAMP-authorized platforms. Build and maintain CI/CD pipelines with automated testing, scanning, and deployment gates. Champion observability best practices, including infrastructure and application monitoring, logging, and alerting. Drive incident response and postmortem culture, triaging critical issues and leading root cause analysis with delivery teams. Author and maintain More ❯
and MinIO in production environments. Familiarity with infrastructure-as-code and automation using cloud-init or Terraform. Experience with CI/CD pipelines and Git-based workflows. Background in observability (Prometheus, Grafana, or similar). Experience with Rancher Suite (Harvester, Longhorn, KubeVirt). Prior work with AWS (EKS, S3, Lambda, RDS) or other cloud platforms. Maintain and improve documentation and More ❯
and MinIO in production environments. Familiarity with infrastructure-as-code and automation using cloud-init or Terraform. Experience with CI/CD pipelines and Git-based workflows. Background in observability (Prometheus, Grafana, or similar). Experience with Rancher Suite (Harvester, Longhorn, KubeVirt). Prior work with AWS (EKS, S3, Lambda, RDS) or other cloud platforms. Maintain and improve documentation and More ❯
and MinIO in production environments. Familiarity with infrastructure-as-code and automation using cloud-init or Terraform. Experience with CI/CD pipelines and Git-based workflows. Background in observability (Prometheus, Grafana, or similar). Experience with Rancher Suite (Harvester, Longhorn, KubeVirt). Prior work with AWS (EKS, S3, Lambda, RDS) or other cloud platforms. Maintain and improve documentation and More ❯
Infrastructure as Code and automation (e.g., CloudFormation, Terraform, Ansible, Python, Bash) 3) DevOps pipelines, CI/CD tooling, and containerization (e.g., GitLab, Jenkins, Docker, Kubernetes) 4) Monitoring and observability in production environments (e.g., CloudWatch, Splunk, Prometheus) 5) Security, cost optimization, and disaster recovery in cloud environments Ideal Experience: 1) Experience in managing live production workloads in AWS 5) Experience deploying More ❯
Franklin, Wisconsin, United States Hybrid / WFH Options
Genesis10
Terraform for AWS infrastructure management. Knowledge of database monitoring tools (ELK stack, Dynatrace, DBI). Experience with CyberArk and Hashicorp vaulting solutions. Site Reliability Engineering (SRE) experience. Experience with observability and dashboard tools such as CloudWatch, Grafana, and Power BI. Knowledge of cloud security platforms like WIZ. Only candidates available and ready to work directly as Genesis10 employees will be More ❯
Edinburgh, Midlothian, United Kingdom Hybrid / WFH Options
Aberdeen Group
internal workshops, brown bags, or tech talks to share knowledge and promote adoption of tools and practices. About the Candidate The ideal candidate will possess the following: Experience with observability tools (e.g., Grafana, Prometheus, Datadog). Background in DevOps, SRE, or platform engineering with a security first mindset. Strong programming skills in languages such as .Net, JavaScript, Python or similar. More ❯
Derbyshire, Burton upon Trent, Staffordshire, United Kingdom
Amtis Professional Ltd
CloudFormation or ARM templates Scripting & Automation - Proficient in PowerShell, Bash, or Python Infrastructure as Code (IaC) - Hands-on experience with Terraform, Bicep, or ARM Certified: Terraform Associate preferred Monitoring & Observability - Familiarity with tools like Azure Monitor, AWS CloudWatch, Prometheus, Grafana Security & Compliance - Strong understanding of IAM, cloud security, compliance frameworks Cloud Platform Expertise: Proven experience with AWS and Azure cloud More ❯
Burton-On-Trent, Staffordshire, West Midlands, United Kingdom
Amtis Professional Ltd
CloudFormation or ARM templates Scripting & Automation - Proficient in PowerShell, Bash, or Python Infrastructure as Code (IaC) - Hands-on experience with Terraform, Bicep, or ARM Certified: Terraform Associate preferred Monitoring & Observability - Familiarity with tools like Azure Monitor, AWS CloudWatch, Prometheus, Grafana Security & Compliance - Strong understanding of IAM, cloud security, compliance frameworks Cloud Platform Expertise: Proven experience with AWS and Azure cloud More ❯
experience leading enterprise backup and disaster recovery initiatives. Working knowledge of cloud-native storage solutions such as Longhorn. Strong Linux administration skills, particularly with RHEL environments. Experience implementing comprehensive observability solutions using Prometheus, Grafana, Loki, and related tools. Ability to establish and enforce security policies through tools like Open Policy Agent. Knowledge of identity management solutions such as Keycloak. Experience More ❯
Linux administration skills (Red Hat/Ubuntu or equivalent) Desired Qualifications: • Experience with AWS, Azure, or GCP cloud environments • Knowledge of Infrastructure as Code (Terraform, Helm, CloudFormation) • Familiarity with observability tools (Prometheus, Grafana) Benefits: Exempt hourly position. 11 paid holidays, minimum of 3 weeks PTO, company sponsored group medical plan, company paid dental, vision, life insurance, and STD/LTD More ❯
effectively. A leadership mindset capable of mentoring team members, driving best practices, and influencing architectural decisions. Bonus Skills: Experience with event-driven architectures (RabbitMQ, Kafka). Advanced knowledge of observability (monitoring, logging, tracing) and performance tuning at scale. Familiarity with security best practices (OAuth2, JWT, encryption, etc.). Thank You Imtiaz Senior Recruiter, Marks IT Solutions More ❯
ML frameworks (e.g., scikit-learn, TensorFlow, PyTorch) and NLP libraries (e.g., spaCy, Hugging Face Transformers) • Hands-on with MLOps or model-serving tools (e.g., MLflow, SageMaker, Kubeflow) • Familiarity with observability stacks (Prometheus/Grafana preferred; CloudWatch, ELK/EFK acceptable) • Experience with event-driven and streaming systems (Kafka, Kinesis, SQS/SNS, AWS Step Functions) • Knowledge of Infrastructure as Code More ❯
verbal communication skills necessary to perform job duties and collaborate with team members You may excel in this role if you have the following skills: • Familiarity with monitoring and observability stacks such as Prometheus/Grafana (preferred), CloudWatch, or ELK/EFK • Contributions to open-source libraries or community projects or personal projects • Experience with search technologies such as OpenSearch More ❯
degree in Computer Science, Engineering, or a related field (or equivalent experience). Desired Skills: Experience with cross-platform mobile development frameworks (e.g., Flutter, React Native). Familiarity with observability tools (e.g., Prometheus, Grafana, ELK stack) for monitoring distributed systems. Contributions to open-source projects or a strong portfolio showcasing relevant technical expertise. W2 Status: Only candidates available and ready More ❯
Edinburgh, Midlothian, United Kingdom Hybrid / WFH Options
Aberdeen
internal workshops, brown bags, or tech talks to share knowledge and promote adoption of tools and practices. About the Candidate The ideal candidate will possess the following: Experience with observability tools (eg, Grafana, Prometheus, Datadog). Background in DevOps, SRE, or platform engineering with a security first mindset. Strong programming skills in languages such as .Net, JavaScript, Python or similar. More ❯
and subject matter expert in cloud-native technologies. Continuous Innovation and Optimization Identify opportunities for innovation in processes, tools, and technologies to maintain a competitive edge. Implement monitoring and observability solutions (e.g., Prometheus, Grafana, ELK Stack) to ensure system health and performance. Optimize cost, performance, and scalability of cloud-native solutions. What You Bring: Skills and Expertise Core Requirements Required More ❯
DRM, ad insertion. Define data models, database schemas, and caching strategies to support queries at scale. Instrument services with monitoring, metrics, tracing, alerting, and health checks; own reliability and observability for your services. Collaborate with mobile/front-end/hybrid app engineers to define API contracts, error modes, versioning, and graceful degradation. Participate in system design, architectural decisions, and More ❯
or higher required Hands-on expertise with Infrastructure as Code and CI/CD tools. Deep knowledge of DevSecOps practices, containerization and secure software development. Experience with monitoring and observability tools such as CloudWatch, Datadog, or Prometheus. Proficient in one or more backend languages and frontend technologies. Familiarity with NIST 800-53, FedRAMP, Zero Trust, and federal ATO processes. Excellent More ❯