in at least one programming language that compiles to machine code such as Rust, C++, or Go. Expert knowledge of monitoring technologies such as Prometheus, Grafana, and PagerDuty. Expert knowledge of deployment technologies such as Pulumi or Terraform. Expert knowledge of Kubernetes. Responsibilities: Improving our observability by adding/adjusting More ❯
AWS cloud services to implement highly efficient architecture. Ability to analyze infrastructure and implement security best practices. Experience with infrastructure monitoring tools like Nagios, Prometheus, Grafana. Expertise in containerization platforms like Docker and container orchestration platforms like Kubernetes and Rancher. Familiarity with infrastructure as code tools such as Terraform, CloudFormation More ❯
Bash, or PowerShell for automation. Understanding of AWS networking concepts, including VPCs, subnets, and security groups. Experience with monitoring and logging solutions such as Prometheus, Grafana, ELK Stack, or AWS CloudWatch. Familiarity with Zero Trust security models and best practices for securing cloud workloads. Ability to troubleshoot complex infrastructure issues More ❯
london, south east england, United Kingdom Hybrid / WFH Options
LHH
Bash, or PowerShell for automation. Understanding of AWS networking concepts, including VPCs, subnets, and security groups. Experience with monitoring and logging solutions such as Prometheus, Grafana, ELK Stack, or AWS CloudWatch. Familiarity with Zero Trust security models and best practices for securing cloud workloads. Ability to troubleshoot complex infrastructure issues More ❯
own end to end delivery of solutions. Expert knowledge of SRE fundamentals and a commitment to best practice Fluency with common observability tooling like Prometheus, Grafana, OTEL and Cloudwatch Experience analysing and building data telemetry, querying (PromQL), modelling, pipelines and dashboards to provide concise, focused insights and alerts for distributed More ❯
networking security (NSGs, ASGs), and governance policies to ensure compliance and risk mitigation. Monitoring & Logging : Experience with Azure Monitor, Application Insights, Log Analytics, and Prometheus/Grafana for observability and performance monitoring. Scripting & Automation : Strong scripting skills in PowerShell, Bash, and Python , along with automation frameworks like Ansible . Collaboration More ❯
or Google Cloud Platform. • Security: Experience with tools for delivering SCA, SAST, DAST capabilities. • Monitoring and Logging: Proficiency with tools like Splunk, Dynatrace, Datadog, Prometheus, Grafana. • Version Control: Strong understanding of Git and version control practices. • Scripting: Skills in scripting languages like Bash, PowerShell, or Perl. • Containerization: Familiarity with Docker More ❯
or Google Cloud Platform. • Security: Experience with tools for delivering SCA, SAST, DAST capabilities. • Monitoring and Logging: Proficiency with tools like Splunk, Dynatrace, Datadog, Prometheus, Grafana. • Version Control: Strong understanding of Git and version control practices. • Scripting: Skills in scripting languages like Bash, PowerShell, or Perl. • Containerization: Familiarity with Docker More ❯
infrastructure as code (IaC) tools such as Terraform, Ansible, or Chef for automation and configuration management. Strong understanding of monitoring and observability tools like Prometheus, Grafana, Azure App Insights for proactive system monitoring and troubleshooting. Knowledge of networking, security principles, and best practices in a cloud environment. Demonstrated experience of More ❯
/IP, DNS, HTTP Experience of deploying Continuous Integration solutions An awareness of security considerations in web application deployment Monitoring/Logging aka ELK, Prometheus/Grafana etc Strong AWS knowledge - EC2, EKS, RDS, Aurora, networking, cost management If you'd like to discuss this DevOps Engineer in more detail More ❯
Experience with TDD, BDD, and automated testing frameworks (PyTest, Selenium). Familiarity with security best practices in software development. Knowledge of observability tools like Prometheus, Grafana, and ELK stack. More ❯
Deploy, GitHub Actions, TeamCity, Jenkins, Azure DevOps). Monitoring and Observability Familiarity with observability concepts and ensuring system reliability. Experience with monitoring tools like Prometheus, Grafana and Sumo Logic. System Configuration and Documentation Experience designing, documenting, amending and refactoring moderately complex software configurations for deployment and system components. Server Administration More ❯
Deploy, GitHub Actions, TeamCity, Jenkins, Azure DevOps). Monitoring and Observability Familiarity with observability concepts and ensuring system reliability. Experience with monitoring tools like Prometheus, Grafana and Sumo Logic. System Configuration and Documentation Experience designing, documenting, amending and refactoring moderately complex software configurations for deployment and system components. Server Administration More ❯
in Java and C++. We use Airflow for workflow management, Kafka for data pipelines, Bitbucket for source control, Jenkins for continuous integration, Grafana + Prometheus for metrics collection, ELK for log shipping and monitoring, Docker and Kubernetes for containerisation, OpenStack for our private cloud, Ansible and Terraform for architecture automation More ❯
culture by leading knowledge-sharing sessions and supporting issue resolution. Skills Strong grounding in SRE principles and operational best practices. Proficient with observability tools (Prometheus, Grafana, OTEL, Cloudwatch) and telemetry pipelines. Solid programming skills in Python and/or Go; Java experience a plus. Hands-on AWS expertise (EKS, SQS More ❯
culture by leading knowledge-sharing sessions and supporting issue resolution. Skills Strong grounding in SRE principles and operational best practices. Proficient with observability tools (Prometheus, Grafana, OTEL, Cloudwatch) and telemetry pipelines. Solid programming skills in Python and/or Go; Java experience a plus. Hands-on AWS expertise (EKS, SQS More ❯
culture by leading knowledge-sharing sessions and supporting issue resolution. Skills Strong grounding in SRE principles and operational best practices. Proficient with observability tools (Prometheus, Grafana, OTEL, Cloudwatch) and telemetry pipelines. Solid programming skills in Python and/or Go; Java experience a plus. Hands-on AWS expertise (EKS, SQS More ❯
City of London, London, United Kingdom Hybrid / WFH Options
Future Talent Group
culture by leading knowledge-sharing sessions and supporting issue resolution. Skills Strong grounding in SRE principles and operational best practices. Proficient with observability tools (Prometheus, Grafana, OTEL, Cloudwatch) and telemetry pipelines. Solid programming skills in Python and/or Go; Java experience a plus. Hands-on AWS expertise (EKS, SQS More ❯
East London, London, United Kingdom Hybrid / WFH Options
Future Talent Group
culture by leading knowledge-sharing sessions and supporting issue resolution. Skills Strong grounding in SRE principles and operational best practices. Proficient with observability tools (Prometheus, Grafana, OTEL, Cloudwatch) and telemetry pipelines. Solid programming skills in Python and/or Go; Java experience a plus. Hands-on AWS expertise (EKS, SQS More ❯
Central London / West End, London, United Kingdom Hybrid / WFH Options
Future Talent Group
culture by leading knowledge-sharing sessions and supporting issue resolution. Skills Strong grounding in SRE principles and operational best practices. Proficient with observability tools (Prometheus, Grafana, OTEL, Cloudwatch) and telemetry pipelines. Solid programming skills in Python and/or Go; Java experience a plus. Hands-on AWS expertise (EKS, SQS More ❯
london, south east england, united kingdom Hybrid / WFH Options
Future Talent Group
culture by leading knowledge-sharing sessions and supporting issue resolution. Skills Strong grounding in SRE principles and operational best practices. Proficient with observability tools (Prometheus, Grafana, OTEL, Cloudwatch) and telemetry pipelines. Solid programming skills in Python and/or Go; Java experience a plus. Hands-on AWS expertise (EKS, SQS More ❯
london (city of london), south east england, united kingdom Hybrid / WFH Options
Future Talent Group
culture by leading knowledge-sharing sessions and supporting issue resolution. Skills Strong grounding in SRE principles and operational best practices. Proficient with observability tools (Prometheus, Grafana, OTEL, Cloudwatch) and telemetry pipelines. Solid programming skills in Python and/or Go; Java experience a plus. Hands-on AWS expertise (EKS, SQS More ❯
london (west end), south east england, united kingdom Hybrid / WFH Options
Future Talent Group
culture by leading knowledge-sharing sessions and supporting issue resolution. Skills Strong grounding in SRE principles and operational best practices. Proficient with observability tools (Prometheus, Grafana, OTEL, Cloudwatch) and telemetry pipelines. Solid programming skills in Python and/or Go; Java experience a plus. Hands-on AWS expertise (EKS, SQS More ❯
Kubernetes). Understanding of CI/CD pipelines. Familiarity with scripting languages like Python, Bash, or Go. Knowledge of monitoring tools such as Datadog, Prometheus, Grafana, or ELK stack. Excellent problem-solving and communication skills, with the ability to work independently or in teams. Additional notes: We value diverse backgrounds More ❯