Chantilly, Virginia, United States Hybrid / WFH Options
Aeyon
Python, Java, Terraform, or Groovy. • Familiarity with cloud platforms like AWS, GCP, or Azure and their integration with OpenShift. • Experience with monitoring tools like Prometheus, Grafana, and logging tools like ELK stack or Splunk. • Proficiency in Git and experience with version control practices in a team environment. • Knowledge of container
Southampton, Hampshire, United Kingdom Hybrid / WFH Options
NICE
GitLab CI/CD, or CircleCI. Strong knowledge of containerization technologies (e.g., Docker, Kubernetes) and microservices architecture. Experience with monitoring and observability tools (e.g., Prometheus, Grafana, ELK stack, CloudWatch). Excellent problem-solving skills and the ability to troubleshoot complex issues in distributed systems. Experience of incident management and blameless
in at least one programming language that compiles to machine code such as Rust, C++, or Go. Expert knowledge of monitoring technologies such as Prometheus, Grafana, and PagerDuty. Expert knowledge of deployment technologies such as Pulumi or Terraform. Expert knowledge of Kubernetes. Responsibilities: Improving our observability by adding/adjusting
Cambridge, Cambridgeshire, United Kingdom Hybrid / WFH Options
AI Tech Suite
in at least one programming language that compiles to machine code such as Rust, C++, or Go. Expert knowledge of monitoring technologies such as Prometheus, Grafana, and PagerDuty. Expert knowledge of deployment technologies such as Pulumi or Terraform. Expert knowledge of Kubernetes. Responsibilities: Improving our observability by adding/adjusting
Chicago, Illinois, United States Hybrid / WFH Options
Synergy Interactive
Kubernetes. Knowledge of configuration management tools (e.g., Ansible, Puppet, Chef). Understanding of network and security principles. Familiarity with monitoring and logging tools (e.g., Prometheus, ELK stack). What We Offer: Competitive salary and benefits package. Opportunities for professional growth and development. A collaborative and inclusive work environment. Flexible work
Bash, or PowerShell for automation. Understanding of AWS networking concepts, including VPCs, subnets, and security groups. Experience with monitoring and logging solutions such as Prometheus, Grafana, ELK Stack, or AWS CloudWatch. Familiarity with Zero Trust security models and best practices for securing cloud workloads. Ability to troubleshoot complex infrastructure issues
London, South East England, United Kingdom Hybrid / WFH Options
LHH
Bash, or PowerShell for automation. Understanding of AWS networking concepts, including VPCs, subnets, and security groups. Experience with monitoring and logging solutions such as Prometheus, Grafana, ELK Stack, or AWS CloudWatch. Familiarity with Zero Trust security models and best practices for securing cloud workloads. Ability to troubleshoot complex infrastructure issues
own end-to-end delivery of solutions. Expert knowledge of SRE fundamentals and a commitment to best practice. Fluency with common observability tooling like Prometheus, Grafana, OTEL and CloudWatch. Experience analysing and building data telemetry, querying (PromQL), modelling, pipelines and dashboards to provide concise, focused insights and alerts for distributed
Bristol, Gloucestershire, United Kingdom Hybrid / WFH Options
Searchability
CI/CD) and automation tools like Terraform and Ansible. Programming: Proficiency in Python, Go, or Ruby. Monitoring and Observability: Hands-on experience with Prometheus, Grafana, ELK Stack, or similar technologies. Core Attributes: A passion for solving complex technical challenges in high-availability production environments. Strong communication and collaboration skills
Greater Bristol Area, United Kingdom Hybrid / WFH Options
Searchability NS&D
GitLab CI/CD) and automation tools like Terraform and Ansible. Programming: Proficiency in Python, Go, or Ruby. Monitoring & Observability: Hands-on experience with Prometheus, Grafana, ELK Stack, or similar technologies. Core Attributes: A passion for solving complex technical challenges in high-availability production environments. Strong communication and collaboration skills
Kubernetes). Understanding of CI/CD pipelines. Familiarity with scripting languages like Python, Bash, or Go. Experience with monitoring tools such as Datadog, Prometheus, Grafana, or ELK stack. Strong problem-solving, communication skills, and ability to work independently or in teams. Additional notes We value diverse backgrounds and perspectives.
infrastructure as code (IaC) tools such as Terraform, Ansible, or Chef for automation and configuration management. Strong understanding of monitoring and observability tools like Prometheus, Grafana, and Azure App Insights for proactive system monitoring and troubleshooting. Knowledge of networking, security principles, and best practices in a cloud environment. Demonstrated experience of
Bethesda, Maryland, United States Hybrid / WFH Options
Absolute Business Solutions Corp
other message passing systems. Experience on a production/enterprise system. Cloud infrastructure experience. Experience with any of the following technologies: Kubernetes monitoring, e.g., Prometheus/Grafana; GPU-based Kubernetes; SALT for deployment automation; Elasticsearch, Kibana, and Logstash, specifically admin experience; Helm and Helmfile; Cloudera usage, including Kafka topic creation
Proficiency using Infrastructure as Code (IaC) tools such as Terraform (preferred), Ansible, or CloudFormation. Experience with monitoring, observability and logging tools such as Datadog, Prometheus, Grafana, or similar. Proven track record of maintaining highly available and performant production environments. Ability to identify and implement effective mitigation strategies and operational playbooks.
culture by leading knowledge-sharing sessions and supporting issue resolution. Skills: Strong grounding in SRE principles and operational best practices. Proficient with observability tools (Prometheus, Grafana, OTEL, CloudWatch) and telemetry pipelines. Solid programming skills in Python and/or Go; Java experience a plus. Hands-on AWS expertise (EKS, SQS
City of London, London, United Kingdom Hybrid / WFH Options
Future Talent Group
culture by leading knowledge-sharing sessions and supporting issue resolution. Skills: Strong grounding in SRE principles and operational best practices. Proficient with observability tools (Prometheus, Grafana, OTEL, CloudWatch) and telemetry pipelines. Solid programming skills in Python and/or Go; Java experience a plus. Hands-on AWS expertise (EKS, SQS
East London, London, United Kingdom Hybrid / WFH Options
Future Talent Group
culture by leading knowledge-sharing sessions and supporting issue resolution. Skills: Strong grounding in SRE principles and operational best practices. Proficient with observability tools (Prometheus, Grafana, OTEL, CloudWatch) and telemetry pipelines. Solid programming skills in Python and/or Go; Java experience a plus. Hands-on AWS expertise (EKS, SQS
Central London / West End, London, United Kingdom Hybrid / WFH Options
Future Talent Group
culture by leading knowledge-sharing sessions and supporting issue resolution. Skills: Strong grounding in SRE principles and operational best practices. Proficient with observability tools (Prometheus, Grafana, OTEL, CloudWatch) and telemetry pipelines. Solid programming skills in Python and/or Go; Java experience a plus. Hands-on AWS expertise (EKS, SQS
Leeds, West Yorkshire, United Kingdom Hybrid / WFH Options
Future Talent Group
culture by leading knowledge-sharing sessions and supporting issue resolution. Skills: Strong grounding in SRE principles and operational best practices. Proficient with observability tools (Prometheus, Grafana, OTEL, CloudWatch) and telemetry pipelines. Solid programming skills in Python and/or Go; Java experience a plus. Hands-on AWS expertise (EKS, SQS
Bury, Greater Manchester, United Kingdom Hybrid / WFH Options
Future Talent Group
culture by leading knowledge-sharing sessions and supporting issue resolution. Skills: Strong grounding in SRE principles and operational best practices. Proficient with observability tools (Prometheus, Grafana, OTEL, CloudWatch) and telemetry pipelines. Solid programming skills in Python and/or Go; Java experience a plus. Hands-on AWS expertise (EKS, SQS
Altrincham, Greater Manchester, United Kingdom Hybrid / WFH Options
Future Talent Group
culture by leading knowledge-sharing sessions and supporting issue resolution. Skills: Strong grounding in SRE principles and operational best practices. Proficient with observability tools (Prometheus, Grafana, OTEL, CloudWatch) and telemetry pipelines. Solid programming skills in Python and/or Go; Java experience a plus. Hands-on AWS expertise (EKS, SQS
Bolton, Greater Manchester, United Kingdom Hybrid / WFH Options
Future Talent Group
culture by leading knowledge-sharing sessions and supporting issue resolution. Skills: Strong grounding in SRE principles and operational best practices. Proficient with observability tools (Prometheus, Grafana, OTEL, CloudWatch) and telemetry pipelines. Solid programming skills in Python and/or Go; Java experience a plus. Hands-on AWS expertise (EKS, SQS
Leigh, Greater Manchester, United Kingdom Hybrid / WFH Options
Future Talent Group
culture by leading knowledge-sharing sessions and supporting issue resolution. Skills: Strong grounding in SRE principles and operational best practices. Proficient with observability tools (Prometheus, Grafana, OTEL, CloudWatch) and telemetry pipelines. Solid programming skills in Python and/or Go; Java experience a plus. Hands-on AWS expertise (EKS, SQS