Chantilly, Virginia, United States Hybrid / WFH Options
Aeyon
Java, Terraform, or Groovy. • Familiarity with cloud platforms like AWS, GCP, or Azure and their integration with OpenShift. • Experience with monitoring tools like Prometheus, Grafana, and logging tools like ELK stack or Splunk. • Proficiency in Git and experience with version control practices in a team environment. • Knowledge of container security More ❯
at least one programming language that compiles to machine code such as Rust, C++, or Go. Expert knowledge of monitoring technologies such as Prometheus, Grafana, and PagerDuty. Expert knowledge of deployment technologies such as Pulumi or Terraform. Expert knowledge of Kubernetes. Responsibilities: Improving our observability by adding/adjusting metrics. More ❯
Cambridge, Gloucestershire, UK Hybrid / WFH Options
AI Tech Suite
at least one programming language that compiles to machine code such as Rust, C++, or Go. Expert knowledge of monitoring technologies such as Prometheus, Grafana, and PagerDuty. Expert knowledge of deployment technologies such as Pulumi or Terraform. Expert knowledge of Kubernetes. Responsibilities: Improving our observability by adding/adjusting metrics. More ❯
Cambridge, Cambridgeshire, United Kingdom Hybrid / WFH Options
AI Tech Suite
at least one programming language that compiles to machine code such as Rust, C++, or Go. Expert knowledge of monitoring technologies such as Prometheus, Grafana, and PagerDuty. Expert knowledge of deployment technologies such as Pulumi or Terraform. Expert knowledge of Kubernetes. Responsibilities: Improving our observability by adding/adjusting metrics. More ❯
Washington, Washington DC, United States Hybrid / WFH Options
Agile Defense, Inc
cloud platforms, especially AWS. • Familiarity with containerization and orchestration tools such as Docker, Kubernetes, and OpenShift. • Experience with monitoring and logging tools like Prometheus, Grafana, ELK stack, or Splunk. • Strong problem-solving skills and the ability to troubleshoot complex system issues. WORKING CONDITIONS Environmental Conditions • Traditional office setting. Strength Demands More ❯
or PowerShell for automation. Understanding of AWS networking concepts, including VPCs, subnets, and security groups. Experience with monitoring and logging solutions such as Prometheus, Grafana, ELK Stack, or AWS CloudWatch. Familiarity with Zero Trust security models and best practices for securing cloud workloads. Ability to troubleshoot complex infrastructure issues and More ❯
London, South East England, United Kingdom Hybrid / WFH Options
LHH
or PowerShell for automation. Understanding of AWS networking concepts, including VPCs, subnets, and security groups. Experience with monitoring and logging solutions such as Prometheus, Grafana, ELK Stack, or AWS CloudWatch. Familiarity with Zero Trust security models and best practices for securing cloud workloads. Ability to troubleshoot complex infrastructure issues and More ❯
end to end delivery of solutions. Expert knowledge of SRE fundamentals and a commitment to best practice. Fluency with common observability tooling like Prometheus, Grafana, OTEL and CloudWatch. Experience analysing and building data telemetry, querying (PromQL), modelling, pipelines and dashboards to provide concise, focused insights and alerts for distributed systems More ❯
Bristol, Gloucestershire, United Kingdom Hybrid / WFH Options
Searchability
/CD) and automation tools like Terraform and Ansible. Programming: Proficiency in Python, Go, or Ruby. Monitoring and Observability: Hands-on experience with Prometheus, Grafana, ELK Stack, or similar technologies. Core Attributes: A passion for solving complex technical challenges in high-availability production environments. Strong communication and collaboration skills, able More ❯
Greater Bristol Area, United Kingdom Hybrid / WFH Options
Searchability NS&D
CI/CD) and automation tools like Terraform and Ansible. Programming: Proficiency in Python, Go, or Ruby. Monitoring & Observability: Hands-on experience with Prometheus, Grafana, ELK Stack, or similar technologies. Core Attributes: A passion for solving complex technical challenges in high-availability production environments. Strong communication and collaboration skills, able More ❯
. Understanding of CI/CD pipelines. Familiarity with scripting languages like Python, Bash, or Go. Experience with monitoring tools such as Datadog, Prometheus, Grafana, or ELK stack. Strong problem-solving, communication skills, and ability to work independently or in teams. Additional notes We value diverse backgrounds and perspectives. Even More ❯
their applications in AWS. Setting best practices and policies, especially around microservice architecture. Developing, maintaining and operating complex operational tooling (e.g. Kubernetes, OpenSearch, Prometheus, Grafana, GitHub or equivalent alternative technologies). Assessing customer technical capabilities and upskilling for reduced friction and increased platform adoption. Enhancing operational reliability and scalability of existing More ❯
using Infrastructure as Code (IaC) tools such as Terraform (preferred), Ansible, or CloudFormation. Experience with monitoring, observability and logging tools such as Datadog, Prometheus, Grafana, or similar. Proven track record of maintaining highly-available and performant production environments. Ability to identify and implement effective mitigation strategies and operational playbooks. Useful More ❯
as code (IaC) tools such as Terraform, Ansible, or Chef for automation and configuration management. Strong understanding of monitoring and observability tools like Prometheus, Grafana, Azure App Insights for proactive system monitoring and troubleshooting. Knowledge of networking, security principles, and best practices in a cloud environment. Demonstrated experience of CI More ❯
Annapolis Junction, Maryland, United States Hybrid / WFH Options
Lockheed Martin
for DevOps engineers include Python, Ansible, and Terraform. • Linux: RHEL, CentOS, and Amazon Linux experience. • Applications: Kubernetes, Docker Compose, Telegraf, Elasticsearch, Logstash, Kibana, and Grafana are all Linux-based tools. DevOps engineers need to be familiar with Linux in order to install, configure, and manage these tools. Must be fluent More ❯
by leading knowledge-sharing sessions and supporting issue resolution. Skills Strong grounding in SRE principles and operational best practices. Proficient with observability tools (Prometheus, Grafana, OTEL, CloudWatch) and telemetry pipelines. Solid programming skills in Python and/or Go; Java experience a plus. Hands-on AWS expertise (EKS, SQS, RDS More ❯
City of London, London, United Kingdom Hybrid / WFH Options
Future Talent Group
by leading knowledge-sharing sessions and supporting issue resolution. Skills Strong grounding in SRE principles and operational best practices. Proficient with observability tools (Prometheus, Grafana, OTEL, CloudWatch) and telemetry pipelines. Solid programming skills in Python and/or Go; Java experience a plus. Hands-on AWS expertise (EKS, SQS, RDS More ❯
East London, London, United Kingdom Hybrid / WFH Options
Future Talent Group
by leading knowledge-sharing sessions and supporting issue resolution. Skills Strong grounding in SRE principles and operational best practices. Proficient with observability tools (Prometheus, Grafana, OTEL, CloudWatch) and telemetry pipelines. Solid programming skills in Python and/or Go; Java experience a plus. Hands-on AWS expertise (EKS, SQS, RDS More ❯
Central London / West End, London, United Kingdom Hybrid / WFH Options
Future Talent Group
by leading knowledge-sharing sessions and supporting issue resolution. Skills Strong grounding in SRE principles and operational best practices. Proficient with observability tools (Prometheus, Grafana, OTEL, CloudWatch) and telemetry pipelines. Solid programming skills in Python and/or Go; Java experience a plus. Hands-on AWS expertise (EKS, SQS, RDS More ❯
Bury, Greater Manchester, United Kingdom Hybrid / WFH Options
Future Talent Group
by leading knowledge-sharing sessions and supporting issue resolution. Skills Strong grounding in SRE principles and operational best practices. Proficient with observability tools (Prometheus, Grafana, OTEL, CloudWatch) and telemetry pipelines. Solid programming skills in Python and/or Go; Java experience a plus. Hands-on AWS expertise (EKS, SQS, RDS More ❯