Support Kubernetes/OpenShift environments and application deployments Enable developers through onboarding and technical support Maintain and improve CI/CD pipelines (Tekton, Argo CD) Monitor systems using Prometheus, Grafana, Splunk, Loki, and EFK Automate infrastructure provisioning using scripting and IaC tools Collaborate with vendors and internal teams for issue resolution What You'll Bring Strong Linux (Red Hat) and More ❯
Cambridge, Cambridgeshire, United Kingdom Hybrid / WFH Options
Arm Limited
withcontainerizationtechnologies (Docker) and deployment methodologies Familiarity withSQLfor building and querying relational databases. Clear technical writing todocumentdata schemas, APIs, and dashboard usage. "Nice to Have" Skills and Experience Experience with Grafana, Prometheus, or similar observability platforms. Familiarity with SAST and SCA tools (e.g., Coverity, Black Duck) and experience understanding their findings. Experience defining and visualizing key security and performance metrics within More ❯
Technologies (Kubernetes, Open Shift) Messaging Technologies (Kafka, Solace, TIBCO) Database/Data Store/Data Query Technologies (SQL Server, Trino, Mongo, S3) Observability Technologies (OpenTelemetry, Elastic Stack/ELK, Grafana) This job description provides a high-level review of the types of work performed. Other job-related duties may be assigned as required. Job Family Group: Technology Job Family: Applications More ❯
cloud/Linux fundamentals. Curiosity and the confidence to ask questions in a fast-moving team. Nice-to-haves Exposure to Kubernetes, Docker or Terraform. Experience with observability stacks (Grafana, Prometheus, OpenTelemetry). Familiarity with Postgres. Interest in data-privacy, AdTech/MarTech or large-scale data processing. Familiarity with Kafka, gRPC or Apache Spark. As well as working as More ❯
for someone with deep expertise in: oInfrastructure as Code: Terraform, CloudFormation o Security best practices: IAM, KMS, encryption in transit/at rest, DevSecOps o Monitoring & observability: Datadog, Prometheus, Grafana, ELK, or similar What You Bring o 6+ years in DevOps or platform engineering, with experience in a technical lead role. o Proven experience designing and operating cloud-native platforms More ❯
e.g. Make), code security (repository partitioning and automatic obfuscation), testing (e.g. GoogleTest, FFF), debugging (gdb, J-Link), scripting (Python), CI (Github CI), containerisation (Docker), profiling, monitoring and logging (e.g. Grafana) Experience with software development on both Linux and Windows Strong coding background, in C and other languages like Python, C++ or C#, with a quality mindset Strong analytical and problem More ❯
and one dynamic programming. ideally Rust & nodeJS Experience with Public Cloud providers, ideally AWS Experience with CI/CD tooling and pipelines Any experience with Observability platforms such as Grafana would be advantageous. Our Commitment to Diversity, Equity, Inclusionand Belonging (DEIB) We believe attracting and retaining the best talent and fostering an inclusive culture strengthens our business. DEIB improves our More ❯
London, South East, England, United Kingdom Hybrid / WFH Options
INTEC SELECT LIMITED
Conduct architecture reviews, technical audits, and drive adoption of best practices Partner with infrastructure teams to ensure system reliability and operational efficiency Integrate monitoring and logging solutions (e.g., Prometheus, Grafana, ELK) Define strategies for disaster recovery, scaling, and infrastructure resilience Improve observability by enhancing visibility into performance and error metrics Skills and Experience Required 10+ years of backend development experience More ❯
weeks are ever the same. Essential Skills Solid Unix/Linux skills Experience with Bash, SQL, PHP Comfortable with Apache/Nginx, load balancers (HAProxy), and monitoring tools (Nagios, Grafana, Prometheus) Knowledge of log management (Graylog, Elasticsearch) Familiar with Ansible and Gitlab CI/CD Experience using Git/SVN What Sets You Apart Passionate self-starter who loves problem More ❯
Manchester, Lancashire, United Kingdom Hybrid / WFH Options
DCS Recruitment
weeks are ever the same. Essential Skills Solid Unix/Linux skills Experience with Bash, SQL, PHP Comfortable with Apache/Nginx, load balancers (HAProxy), and monitoring tools (Nagios, Grafana, Prometheus) Knowledge of log management (Graylog, Elasticsearch) Familiar with Ansible and Gitlab CI/CD Experience using Git/SVN What Sets You Apart Passionate self-starter who loves problem More ❯
Chester, Cheshire, England, United Kingdom Hybrid / WFH Options
Robert Walters
APIs , CI/CD pipelines , and test-driven development using tools like Jest, Cypress, Playwright, or Pact. Proficiency with HTML5, CSS3, Redux, Docker, GitHub , and monitoring tools such as Grafana, Dynatrace or ELK . Experience managing and mentoring software engineers in Agile teams. A passion for engineering quality, scalability, and security. Bonus Points If You Have Experience building containerised applications More ❯
Python, Go, or similar languages for automation and scripting. Expert-level knowledge of AWS Networking, TLS, and security best practices. Experience with container orchestration (Kubernetes, EKS) and observability tools (Grafana, ELK). A passion for innovation, problem-solving, and delivering high-impact solutions. Why Work For Us? 25 days holiday + bank holidays Up to 5% employer pension contribution Educational More ❯
Leeds, Yorkshire, United Kingdom Hybrid / WFH Options
Yorkshire Building Society Group
excellent knowledge of Java and Microservices. Desirable skills include: Restful Design, working with API Gateway products (Google Apigee or equivalent) Containers and Container Orchestration - Kubernetes, OpenShift Logging, Monitoring & Alerting - Grafana, Prometheus, Dynatrace or equivalent CI/CD Pipelines - Gitlab, Artifactory. Security (OWASP, Certificates, JWT, Encryption techniques) Have experience of working within IT support environments About us At YBS we don More ❯
several microservices, also written in Python, utilising frameworks and libraries such as Celery, Eventlet, SQLAlchemy, etc. Additionally, GOV.UK Notify utilises AWS RDS (Postgres), AWS SQS, AWS ElastiCache, OpenTelemetry, Prometheus, Grafana and other related services. Concourse CI and Terraform are used to run build-pipelines and manage our infrastructure. For the frontend, we follow theGOV.UK Design System , making use of GOV.UK More ❯
several microservices, also written in Python, utilising frameworks and libraries such as Celery, Eventlet, SQLAlchemy, etc. Additionally, GOV.UK Notify utilises AWS RDS (Postgres), AWS SQS, AWS ElastiCache, OpenTelemetry, Prometheus, Grafana and other related services. Concourse CI and Terraform are used to run build-pipelines and manage our infrastructure. For the frontend, we follow theGOV.UK Design System , making use of GOV.UK More ❯
Technologies (Kubernetes, Open Shift) Messaging Technologies (Kafka, Solace, TIBCO) Database/Data Store/Data Query Technologies (SQL Server, Trino, Mongo, S3) Observability Technologies (OpenTelemetry, Elastic Stack/ELK, Grafana) Desirable Criteria: Background of delivering software using an iterative Agile approach. Previous experience and knowledge of Swaps or Prime Brokerage offerings This job description provides a high-level review of More ❯
Manchester, Lancashire, United Kingdom Hybrid / WFH Options
Jane's Group
Zscaler ZPA, AWS Identity Centre, Microsoft Entra AWS Backup, AWS Systems Manager, Security Hub, Guard Duty, Microsoft Defender Windows/Linux Operating Systems and Active Directory Cloudwatch, Cloudtrail, OpenSearch, Grafana, Kinesis Lambda, EKS, ECS and EC2 DocumentDB, DynamoDB, RDS, S3, OpenSearch EventBridge, Steps, Glue This position offers a hybrid working model , allowing for flexibility in your work environment. We are More ❯
Manchester, Lancashire, England, United Kingdom Hybrid / WFH Options
Clarke Recruitment Solutions
and container orchestration (ECS/EKS, Helm) Streamlining and optimising CI/CD pipelines (GitHub Actions/GitLab CI) Setting up and managing observability with Datadog, CloudWatch, Prometheus/Grafana Automating deployments and improving recovery, redundancy, and capacity planning Supporting Linux environments (Ubuntu/CentOS) Getting involved in incident response and helping us prevent problems before they happen What we … Hands-on with containers and orchestration (Docker, ECS/EKS, Helm) Experience with CI/CD pipelines (GitHub Actions or GitLab CI) Familiarity with monitoring tools (Datadog, CloudWatch, Prometheus, Grafana) Confident scripting in Python and Bash Strong communication skills and collaborative mindset Nice to have (not essential): Experience with Azure or GCP Knowledge of networking (VPC Peering, Direct Connect) Exposure … A collaborative, supportive team environment where your input matters Tech stack you’ll work with AWS | Terraform | Ansible | Docker | ECS/EKS | GitHub Actions | GitLab CI | Datadog | CloudWatch | Prometheus | Grafana | Linux | Python | Bash If you’re passionate about automation, thrive on solving complex problems, and want your work to make a genuine difference when it matters most, we’d love More ❯
operations, and scripting. Experience in supporting real-time or mission-critical systems (security, IoT, or similar sectors). Familiarity with log aggregation, monitoring, and alerting tools (e.g., ELK, Prometheus, Grafana). Good understanding of networking, VPNs, load balancing, DNS, and firewalls. Comfortable with Git and CI/CD workflows. Excellent troubleshooting skills and structured problem-solving approach. Strong written and … plus. Experience with Kubernetes or OpenShift for container orchestration. Familiarity with CI/CD pipelines and automation tools (e.g., GitHub Actions, Jenkins). Exposure to monitoring tools like Prometheus, Grafana, or ELK stack. Experience supporting enterprise customers in a B2B SaaS or software product company. Experience with access control and intrusion detection systems. Familiarity with virtualization technologies (e.g., VMware, Hyper More ❯
Hertfordshire office. In this role, you'll take ownership of the end-to-end monitoring and alerting stack, designing and maintaining infrastructure and alert configurations (e.g., with Prometheus/Grafana or equivalent), and building dashboards that clearly communicate metrics to business stakeholders. You'll drive system automation and integration, crafting scripts and workflows-primarily in Python-to onboard new services More ❯
London, South East, England, United Kingdom Hybrid / WFH Options
Rise Technical Recruitment Limited
Comfortable managing deployments using CI/CD pipelines (Github Actions, Jenkins, etc.)*Solid understanding of cloud infrastructure including AWS, Kubernetes, and contect delivery*Exposure to observability tooling (Datadog, Sentry, Grafana) and performance tuning best practice Reference Number: BBBH259301 To apply for this role or for to be considered for further roles, please click "Apply Now" or contact Tommy Williams at More ❯
with performance and load testing frameworks (e.g., k6, JMeter) Familiarity with cloud-based test environments and infrastructure (AWS preferred) Working knowledge of observability and test reporting tools (e.g., Datadog, Grafana) Experience improving test data strategies and test isolation techniques Contributions to internal tooling or open-source testing frameworks Background in building out quality initiatives at the org level EverQuote Can More ❯
production environment runs smoothly. Develops maintenance requirements and procedures. Monitoring and Observability: Monitors servers, applications and clusters for failures, system crashes and resource usage, etc using tools like Prometheus, Grafana or Elastic Stack (Elastic Search, Logstash and Kibana). FURTHER DUTIES WILL INCLUDE: Improve monitoring on our application servers which we are currently lacking. Re-implement high availability on database … CI/CD tools such as Version Control Systems (SVN or Git), Jira, GitLab, or Jenkins Experience in using configuration management, monitoring and logging tools such as Ansible, Prometheus, Grafana or Elastic Stack (Elastic Search, Logstash and Kibana). Extensive experience with Windows and Linux operating system environments Experience with infrastructure scripting solutions such as Linux and/or PowerShell More ❯
NW10, Middlesex, Greater London, United Kingdom Hybrid / WFH Options
ITH Pharma
production environment runs smoothly. Develops maintenance requirements and procedures. Monitoring and Observability: Monitors servers, applications and clusters for failures, system crashes and resource usage, etc using tools like Prometheus, Grafana or Elastic Stack (Elastic Search, Logstash and Kibana). FURTHER DUTIES WILL INCLUDE: Improve monitoring on our application servers which we are currently lacking. Re-implement high availability on database … CI/CD tools such as Version Control Systems (SVN or Git), Jira, GitLab, or Jenkins Experience in using configuration management, monitoring and logging tools such as Ansible, Prometheus, Grafana or Elastic Stack (Elastic Search, Logstash and Kibana). Extensive experience with Windows and Linux operating system environments Experience with infrastructure scripting solutions such as Linux and/or PowerShell More ❯
Networking Experience with PTP/PPS platforms (Meinberg, FSMLabs, ADVA/Oscilloscope) Proficiency in using commercial NMS tools (e.g. Zabbix, Solarwinds, Nagios) and open-source tools (e.g. Prometheus, Alertmanager, Grafana) Basic programming skills in Python or GoLang Knowledge of Infrastructure as Code (IaC) tools such as Ansible or Terraform Familiarity with network analysis tools such as Wireshark, Splunk, and PromQL More ❯