understanding of CI/CD pipelines and tools (e.g., Github CI, GitLab CI, CircleCI, Jenkins). Familiarity with monitoring and logging tools (e.g., Prometheus, Grafana, ELK stack). Strong scripting skills (e.g., Bash, Python) for automation tasks. Excellent problem-solving skills and attention to detail. Strong communication and collaboration skills. More ❯
understanding of CI/CD pipelines and tools (e.g., Github CI, GitLab CI, CircleCI, Jenkins). Familiarity with monitoring and logging tools (e.g., Prometheus, Grafana, ELK stack). Strong scripting skills (e.g., Bash, Python) for automation tasks. Excellent problem-solving skills and attention to detail. Strong communication and collaboration skills. More ❯
understanding of CI/CD pipelines and tools (e.g., Github CI, GitLab CI, CircleCI, Jenkins). Familiarity with monitoring and logging tools (e.g., Prometheus, Grafana, ELK stack). Strong scripting skills (e.g., Bash, Python) for automation tasks. Excellent problem-solving skills and attention to detail. Strong communication and collaboration skills. More ❯
understanding of CI/CD pipelines and tools (e.g., Github CI, GitLab CI, CircleCI, Jenkins). Familiarity with monitoring and logging tools (e.g., Prometheus, Grafana, ELK stack). Strong scripting skills (e.g., Bash, Python) for automation tasks. Excellent problem-solving skills and attention to detail. Strong communication and collaboration skills. More ❯
AWS Lambda, Google Cloud Functions, Azure Functions) Containerisation technologies (e.g. Docker, Kubernetes, OpenShift) Tools for logging, monitoring, alerting and observability (e.g. ELK, Splunk, Prometheus, Grafana) Working knowledge of operating systems including CLI experience, deploying and configurating application or web servers We are currently operating a discretionary hybrid working model which More ❯
end to end delivery of solutions. Expert knowledge of SRE fundamentals and a commitment to best practice Fluency with common observability tooling like Prometheus, Grafana, OTEL and Cloudwatch Experience analysing and building data telemetry, querying (PromQL), modelling, pipelines and dashboards to provide concise, focused insights and alerts for distributed systems More ❯
NSGs, ASGs), and governance policies to ensure compliance and risk mitigation. Monitoring & Logging : Experience with Azure Monitor, Application Insights, Log Analytics, and Prometheus/Grafana for observability and performance monitoring. Scripting & Automation : Strong scripting skills in PowerShell, Bash, and Python , along with automation frameworks like Ansible . Collaboration & Problem-Solving More ❯
london, south east england, United Kingdom Hybrid / WFH Options
LHH
or PowerShell for automation. Understanding of AWS networking concepts, including VPCs, subnets, and security groups. Experience with monitoring and logging solutions such as Prometheus, Grafana, ELK Stack, or AWS CloudWatch. Familiarity with Zero Trust security models and best practices for securing cloud workloads. Ability to troubleshoot complex infrastructure issues and More ❯
Implement comprehensive monitoring, logging, and alerting systems to proactively identify and address performance issues, errors, and security threats. Use tools like Azure Monitor, Prometheus, Grafana, or similar to collect and analyse metrics, logs, and traces. Configure alerts and notifications to ensure timely responses to critical events. Security & Compliance: Implement security More ❯
Implement comprehensive monitoring, logging, and alerting systems to proactively identify and address performance issues, errors, and security threats. Use tools like Azure Monitor, Prometheus, Grafana, or similar to collect and analyse metrics, logs, and traces. Configure alerts and notifications to ensure timely responses to critical events. Security & Compliance: Implement security More ❯
Implement comprehensive monitoring, logging, and alerting systems to proactively identify and address performance issues, errors, and security threats. Use tools like Azure Monitor, Prometheus, Grafana, or similar to collect and analyse metrics, logs, and traces. Configure alerts and notifications to ensure timely responses to critical events. Security & Compliance: Implement security More ❯
Implement comprehensive monitoring, logging, and alerting systems to proactively identify and address performance issues, errors, and security threats. Use tools like Azure Monitor, Prometheus, Grafana, or similar to collect and analyse metrics, logs, and traces. Configure alerts and notifications to ensure timely responses to critical events. Security & Compliance: Implement security More ❯
or other build tools; Ansible or other IT Automation/software provisioning tools; JIRA, Confluence; * Experience in monitoring/reporting tools such as Splunk, Grafana/Prometheus etc * Experience in Agile practices * Working knowledge of environment monitoring tools such as GCO, NewRelic, Prometheus, Grafana. * Collaboration Skills: Proactive can-do attitude More ❯
using Infrastructure as Code for configuration management and code implementation - Terraform etc. Experience setting up and using monitoring and alerting tools such as Dynatrace, Grafana, Cloudwatch etc. Experience using Configuration management tools like Puppet, Ansible, Packer, Chef. Experience with various testing tooling - Selenium, Cucumber etc Experience in scripting - bash/ More ❯
analytics in C#. We use Airflow for workflow management, Kafka for data pipelines, Bitbucket for source control, Jenkins for continuous integration, ELK for logs, Grafana, Prometheus &InfluxDb for metrics, Docker and Kubernetes for containerisation, OpenStack for our private cloud, Ansible and Terraform for architecture automation, and Slack for internal communication. More ❯
containerization for applications and their subsequent orchestration within Kubernetes environments. Experience working on at least one monitoring/observability stack (Datadog, ELK, Splunk, Loki, Grafana). Strong knowledge of Unix or Linux Strong communication skills to collaborate with various stakeholders Able to work independently in a fast-paced environment Detail More ❯
Experience with messaging systems (RabbitMQ, Kafka, etc.) and event-driven architectures. ● Proficiency in infrastructure as code (Terraform preferred). ● Familiarity with monitoring stacks (Prometheus, Grafana, ELK, etc.) and system tuning. ● Security-conscious mindset; experience implementing controls in regulated or financial environments is a plus. ● Excellent problem-solving skills and a More ❯
Experience with messaging systems (RabbitMQ, Kafka, etc.) and event-driven architectures. ● Proficiency in infrastructure as code (Terraform preferred). ● Familiarity with monitoring stacks (Prometheus, Grafana, ELK, etc.) and system tuning. ● Security-conscious mindset; experience implementing controls in regulated or financial environments is a plus. ● Excellent problem-solving skills and a More ❯
Experience with messaging systems (RabbitMQ, Kafka, etc.) and event-driven architectures. ● Proficiency in infrastructure as code (Terraform preferred). ● Familiarity with monitoring stacks (Prometheus, Grafana, ELK, etc.) and system tuning. ● Security-conscious mindset; experience implementing controls in regulated or financial environments is a plus. ● Excellent problem-solving skills and a More ❯
london, south east england, United Kingdom Hybrid / WFH Options
Aimhire
Experience with messaging systems (RabbitMQ, Kafka, etc.) and event-driven architectures. ● Proficiency in infrastructure as code (Terraform preferred). ● Familiarity with monitoring stacks (Prometheus, Grafana, ELK, etc.) and system tuning. ● Security-conscious mindset; experience implementing controls in regulated or financial environments is a plus. ● Excellent problem-solving skills and a More ❯
data throughput in Java and C++. We use Airflow for workflow management, Kafka for data pipelines, Bitbucket for source control, Jenkins for continuous integration, Grafana + Prometheus for metrics collection, ELK for log shipping and monitoring, Docker and Kubernetes for containerisation, OpenStack for our private cloud, Ansible and Terraform for More ❯