improving CI/CD pipelines with a strong focus on automation and security. Key Skills Required: CI/CD automation, Jenkins, Ansible, Docker, Kubernetes, Terraform, Unix/shell scripting, Grafana, Azure. Strong understanding of software delivery and Agile methodologies. Desirable: Experience in the Power Transmission & Distribution domain. Eligibility: Candidates must hold UK or NATO nationality. This role offers a More ❯
Farnborough, England, United Kingdom Hybrid / WFH Options
Addition+
Platform or Site Reliability Engineering (5+ years ideally) Proven background with Kubernetes, CI/CD tooling (e.g. GitLab, Jenkins), and IaC (Terraform, Ansible) Confident with monitoring tools (e.g. Prometheus, Grafana) Git proficiency and solid repository management knowledge Comfortable leading technical decisions and collaborating with engineering teams What’s in It for You: A genuinely collaborative, no-blame engineering culture Interesting More ❯
Guildford, Surrey, United Kingdom Hybrid / WFH Options
Electronic Arts
e.g. Perforce, Git) Configuration management tools (e.g. Chef, Ansible, Terraform, Packer) Secrets management tools (e.g. Vault) Virtualization environments and tools (e.g. VMs, vSphere) Data and Observability tools (e.g. Splunk, Grafana, New Relic, OpenTelemetry) Growth-oriented mindset About Electronic Arts We're proud to have an extensive portfolio of games and experiences, locations around the world, and opportunities across EA. More ❯
Lexington, Massachusetts, United States Hybrid / WFH Options
Raft
DoD/Air Force AOC Weapon System and operating standards within cleared facilities (SIPR, IL6) - Familiarity with AWS and cloud technologies - Skill in operating observability tooling and alerting (Prometheus, Grafana, etc.) - Knowledge of Platform One Big Bang Clearance Requirements: Active Secret security clearance Work Type: Hybrid - Hanscom AFB, MA highly preferred (or local to Reston, VA or Hampton, VA or More ❯
Newport News, Virginia, United States Hybrid / WFH Options
Raft
DoD/Air Force AOC Weapon System and operating standards within cleared facilities (SIPR, IL6) - Familiarity with AWS and cloud technologies - Skill in operating observability tooling and alerting (Prometheus, Grafana, etc.) - Knowledge of Platform One Big Bang Clearance Requirements: Active Secret security clearance Work Type: Hybrid - Hanscom AFB, MA highly preferred (or local to Reston, VA or Hampton, VA or More ❯
Python, Go, or similar languages for automation and scripting. Expert-level knowledge of AWS Networking, TLS, and security best practices. Experience with container orchestration (Kubernetes, EKS) and observability tools (Grafana, ELK). A passion for innovation, problem-solving, and delivering high-impact solutions. Why Work For Us? Competitive base salary £90,000-£100,000 25 days holiday + bank holidays More ❯
Manchester, Lancashire, United Kingdom Hybrid / WFH Options
Suits Me Limited
to enable rapid and reliable delivery of services Contributing to the design of scalable and secure platform components that enable developer productivity Building and improving observability tooling (e.g. CloudWatch, Grafana) to support rapid detection and resolution of issues Collaborating with developers and stakeholders across squads to understand infrastructure needs and ensure best practices are applied Writing technical documentation and contributing More ❯
to read code and trace failures in one or more of the following application languages: Java, GoLang, React, .NET, Python. Solid understanding of modern observability tooling (e.g., Datadog, Loki, Grafana) Comfortable working on a shared on-call rotation Enhanced leave - 38 days inclusive of 8 UK Public Holidays Private Health Care including family cover Life Assurance - 5x salary Flexible working More ❯
various methods such as unit, integration, contract and E2E testing. You have a high degree of experience in observing the performance and health of applications via tools such as Grafana, Prometheus, Datadog, Sentry, etc. You have a strong desire for, and are an advocate of, performant applications. Proactive in solving problems simply and effectively, with an eye for pragmatic solutions. More ❯
Chester, Cheshire, United Kingdom Hybrid / WFH Options
Lloyds Banking Group
such as Jest, Enzyme, React Testing Library, Pact, Cypress and Playwright. DevOps: Familiarity with CI/CD and build pipelines, using tools such as GitHub, Harness, Jenkins, Docker, ELK, Grafana and Dynatrace. Take ownership and responsibility for the lifespan of the things you contribute to. A "you build it, you run it" attitude. ABOUT WORKING FOR US Our ambition is More ❯
field. Preferred Qualifications Proven expertise and experience with database technologies including NoSQL databases like MongoDB, RDBMS such as Postgres and MySQL Exposure to Docker, Kubernetes, AWS, Helm, Terraform, Vault, Grafana, ELK Stack, New Relic Relevant experience in the maintenance of data APIs and data lake architectures, including experience with Apache Iceberg, Trino/Presto, Clickhouse, Snowflake, BigQuery. Master's degree More ❯
distributed systems, and the challenges of running high-performance API gateways. Familiarity with GraphQL Federation is a significant plus. Experience building or managing modern observability stacks (e.g., OpenTelemetry, Prometheus, Grafana, ClickHouse). A self-starter attitude and a leader's mindset: you are comfortable with ambiguity, can identify and solve ill-defined problems, and don't need hand-holding. Excellent More ❯
InfluxDB, and ClickHouse: schema design, indexing, and caching for sub-second reads. Experience deploying microservices in production using Docker and Kubernetes. Skilled in setting up observability and alerting pipelines (Prometheus, Grafana), including model drift detection. Experience with real-time ML inference and model serving frameworks (e.g., TorchServe, Triton, BentoML) for low-latency applications. Experience designing feedback loops, active learning, or user More ❯
City of London, London, England, United Kingdom Hybrid / WFH Options
TalentTrade Recruitment Limited
secrets management. Good experience with continuous integration and continuous deployment (CI/CD) pipelines with GitHub Actions. Familiarity with monitoring and logging tools relevant to distributed systems (e.g., Prometheus, Grafana, ELK stack). Experience with scripting languages such as Bash or Python for automation tasks. More ❯
EC3N, Tower, Greater London, United Kingdom Hybrid / WFH Options
TalentTrade Recruitment Limited
secrets management. Good experience with continuous integration and continuous deployment (CI/CD) pipelines with GitHub Actions. Familiarity with monitoring and logging tools relevant to distributed systems (e.g., Prometheus, Grafana, ELK stack). Experience with scripting languages such as Bash or Python for automation tasks. More ❯
London, South East, England, United Kingdom Hybrid / WFH Options
INTEC SELECT LIMITED
Conduct architecture reviews, technical audits, and drive adoption of best practices Partner with infrastructure teams to ensure system reliability and operational efficiency Integrate monitoring and logging solutions (e.g., Prometheus, Grafana, ELK) Define strategies for disaster recovery, scaling, and infrastructure resilience Improve observability by enhancing visibility into performance and error metrics Skills and Experience Required 10+ years of backend development experience More ❯
weeks are ever the same. Essential Skills Solid Unix/Linux skills Experience with Bash, SQL, PHP Comfortable with Apache/Nginx, load balancers (HAProxy), and monitoring tools (Nagios, Grafana, Prometheus) Knowledge of log management (Graylog, Elasticsearch) Familiar with Ansible and GitLab CI/CD Experience using Git/SVN What Sets You Apart Passionate self-starter who loves problem More ❯
Manchester, Lancashire, England, United Kingdom Hybrid / WFH Options
DCS Recruitment
weeks are ever the same. Essential Skills Solid Unix/Linux skills Experience with Bash, SQL, PHP Comfortable with Apache/Nginx, load balancers (HAProxy), and monitoring tools (Nagios, Grafana, Prometheus) Knowledge of log management (Graylog, Elasticsearch) Familiar with Ansible and GitLab CI/CD Experience using Git/SVN What Sets You Apart Passionate self-starter who loves problem More ❯
support highly available telephony solutions using AudioCodes and Oracle SBCs Develop scripts, tools, and APIs to improve SIP routing, call flows, and automation Integrate telephony with monitoring platforms like Grafana and ThousandEyes Collaborate with carriers to support SIP infrastructure and hybrid voice networks Contribute to hybrid cloud telephony solutions across UCaaS and CCaaS platforms Participate in Agile sprints and support More ❯
multi-account AWS setups. Extensive experience with AWS Organisations Expert-level knowledge of AWS Networking, TLS, and security best practices. Experience with container orchestration (Kubernetes, EKS) and observability tools (Grafana, ELK). A passion for innovation, problem-solving, and delivering high-impact solutions. Working with Control Tower and Landing Zones Why Work For Us? Competitive base salary up to More ❯
Town Centre, Telford, Shropshire, England, United Kingdom Hybrid / WFH Options
TXP
progress and milestones. Skills & Experience Strong background in test management and Agile delivery. Hands-on experience with CI/CD (GitLab), automation tools (Playwright, OWASP Zap, Gatling), and monitoring (Grafana, Splunk). Familiarity with cloud (AWS/Azure), Kubernetes, and databases (Oracle RDS, SQL, MongoDB). Technical knowledge of Java 21 and Spring Boot. Deep understanding of all test levels More ❯
complex systems to mixed audiences, and build trust through technical credibility. Automation-first mindset: Skilled in infrastructure-as-code (Terraform or Pulumi), CI/CD workflows, observability stacks (Prometheus, Grafana, Loki), and scripting (Python, Bash). Bonus: Prior experience working with GPU capacity providers, hyperscaler partnerships, or AI infrastructure startups. Benefits: Competitive total compensation package. Retirement or pension plan, in More ❯
Kafka). Strong grasp of telemetry, observability, and performance monitoring in distributed systems. Track record of technical leadership and setting engineering standards. Nice to Have: Experience with OpenTelemetry, Prometheus, Grafana, or similar observability tooling. Exposure to hybrid-cloud or cloud migration strategies. Familiarity with performance optimisation in low-latency data pipelines. Contributions to DevOps-related communities, blogs, open source, or More ❯