at the heart of technology delivery. Responsibilities include: Designing and enforcing SLOs, SLIs, and SLAs to ensure high reliability and performance. Building and maintaining monitoring/observability solutions (Datadog, Grafana, Azure Application Insights, Log Analytics). Managing Infrastructure as Code (Terraform, Pulumi, CloudFormation) for scalable, repeatable deployments. Automating with PowerShell, Python, or Bash to drive efficiency. Supporting Kubernetes and AKS … Required: Proven Site Reliability Engineering background. Strong Terraform skills with live environment deployment. Kubernetes/AKS expertise. Scripting in PowerShell, Python, or Bash. Monitoring experience (Datadog preferred, Azure or Grafana considered). Background in web applications and distributed systems. Desirable Skills: Knowledge of Microservices Architecture. Familiarity with Kanban. Experience with Puppet or Chef. If you’re passionate about Site Reliability
Performance Test Engineer Performance Testing, Speed, scalability, reliability, System performance optimization, Performance test plans, Web and backend systems, Agile, JMeter, K6, LoadRunner, Gatling, Grafana, Prometheus, New Relic, CI/CD pipelines, Jenkins, Git, System architecture, APIs 6 months initial contract Remote with occasional travel to Bedford £425 - £475 – Inside IR35 We are seeking a skilled and detail-oriented Performance Test … Knowledge of ephemeral environments and their implications for performance testing. Strong proficiency with performance testing tools such as JMeter, K6, LoadRunner, Gatling, or similar. Experience with monitoring tools like Grafana, Prometheus, New Relic. Familiarity with CI/CD pipelines and version control systems (e.g., Jenkins, Git). Strong understanding of system architecture, APIs, databases, cloud infrastructure, and their implications on
City of London, London, United Kingdom Hybrid/Remote Options
Hargreaves Lansdown
operating in both on-prem and cloud-native environments, with working knowledge of AWS, Docker, and Kubernetes. Advocates for observability and service health, using tools like Prometheus and Grafana to ensure reliability and performance. Champions quality and security, embedding testing and scanning into CI/CD pipelines and engineering workflows. Passionate about mentoring engineers, conducting code reviews, and driving … RDBMS (Oracle, Sybase)/NoSQL (Document DB) AWS/Docker/Kubernetes CI/CD pipelines: GitHub Actions, CircleCI, Bitrise Testing frameworks: Jest, Cypress, XCTest, Espresso Observability tools: Prometheus, Grafana Interview Process 3 Stage Interview Stage 1 - Discussion with our Hiring Manager (30 mins): A chance to talk with our Hiring Manager in more detail about the role, our tech stack
Experienced Postgres Database Administrator to manage, maintain, and optimize PostgreSQL environments across on-premises and cloud platforms. The ideal candidate will have a strong understanding of PostgreSQL architecture, clustering technologies, and migration strategies, along with hands-on experience in
Knutsford, Cheshire, United Kingdom Hybrid/Remote Options
Experis
workflows across Linux, UNIX, and virtualized environments. Collaborate with infrastructure and operations teams to ensure secure and timely patching. Integrate patching with containerized platforms (Docker, Kubernetes) and dashboarding tools (Grafana, BI solutions). Develop APIs and system interfaces to support orchestration and reporting. Analyze patch data to identify trends, risks, and performance impacts. Maintain dashboards and documentation for compliance and … experience Experience in patch management, system administration, and IT operations. Strong knowledge of Linux, UNIX, virtualization, and containerization. Proficiency in Python, PHP, and web development. Familiarity with BI tools, Grafana, and data visualization. Background in data analysis, statistics, and mathematics. Experience in financial services IT environments is essential.
United Kingdom, United Kingdom Hybrid/Remote Options
WalletConnect
performant, and reusable components. Ensure reliability and quality through type-safe code, automated testing (unit/e2e), and CI/CD pipelines. Monitor, debug, and optimize production systems using Grafana, Sentry, and related tools. Drive performance optimizations: tree-shaking, code-splitting, bundle-size reduction, and build-time improvements. Collaborate cross-functionally, writing technical design docs and reviewing peers' work to … updates Comfortable working in a remote, autonomous, and async-first team. Nice to Have: Experience building developer tools, infrastructure products, or cloud environments (Cloudflare, AWS, Supabase). Familiarity with Grafana, Sentry, and observability pipelines. Experience with web components and framework-agnostic UI libraries. Domain knowledge in crypto/Web3, smart contracts, or wallets. Open-source contributions or SDKs published publicly.
London, South East, England, United Kingdom Hybrid/Remote Options
Harnham - Data & Analytics Recruitment
reports across teams, assess usage and objectives, and consolidate where possible. Design and develop a suite of standard Power BI reports and dashboards from scratch. Migrate existing reporting from Grafana to Power BI. Optimize reports for performance, usability, and ongoing maintenance. Collaborate with various teams and a dedicated data engineer to handle large volumes of data. Ensure clear documentation and … Clear communication skills to engage with multiple stakeholders. Preferred Background: Experience with survey data or fraud data in banking or research sectors. Projects involving migration of reporting tools (e.g., Grafana to Power BI). Track record of standardizing and optimizing reporting frameworks.