London, South East, England, United Kingdom Hybrid / WFH Options
Michael Page Technology
The role of a Platform Support Engineer involves providing excellent technical support and maintenance for platform solutions within the technology and telecoms industry. You will ensure the smooth operation of systems, troubleshoot issues, and deliver high-quality service to internal More ❯
City of London, London, United Kingdom Hybrid / WFH Options
Signify Technology
. Strong scripting skills in Python , Bash , or similar. Familiarity with Linux administration , networking, and system security. Experience with monitoring, logging, and observability tools (e.g., Prometheus, Grafana, ELK stack, Datadog). Desirable Skills Exposure to infrastructure security best practices (e.g., CIS Benchmarks, AWS Well-Architected Framework). Knowledge of configuration management (Ansible, Chef, or Puppet). Experience with serverless architectures More ❯
Lambda, DynamoDB). Drive automation across CI/CD pipelines using tools like GitHub Actions , Terraform , and Argo CD for seamless and secure deployments. Enhance observability using Prometheus , Grafana , Datadog , and CloudWatch , enabling proactive incident prevention. Own incident management and post-mortem practices — guiding the team through challenges calmly and driving meaningful improvement. Collaborate with global engineering and product teams More ❯
or Windows administration, with the ability to architect secure, performant, and highly available cloud solutions. Proficiency with monitoring and log analytics tools such as AWS CloudWatch, ELK Stack, Prometheus, Datadog, or New Relic, to maintain observability and ensure operational excellence. Demonstrated leadership skills in managing complex, high-pressure situations and guiding teams through incident resolution. Exceptional communication and presentation skills More ❯
City of London, London, United Kingdom Hybrid / WFH Options
RP International
Route53, S3, EC2, ECS, VPC, IAM, DynamoDB, CloudFront . Infrastructure as Code (IaC) using Terraform or CloudFormation. Experience with Jenkins, Git (Gitflow), Docker, and monitoring tools like CloudWatch or DataDog . The Offer: Initial 6-month contract with a high likelihood of extension. A key role in a high-profile, innovative project at the forefront of cloud and AI technology. More ❯
/CD pipelines (e.g., Jenkins, TeamCity, Concourse). Familiarity with web/application servers such as NGINX, Apache, or JBoss. Exposure to monitoring and logging tools (ELK, Nagios, Splunk, DataDog, New Relic, etc.). Understanding of security and identity management (OAuth2, SSO, ADFS, Keycloak, etc.). Experience with version control systems (Git, Bitbucket, Subversion). Working knowledge of database technologies More ❯
/CD pipelines (e.g., Jenkins, TeamCity, Concourse). Familiarity with web/application servers such as NGINX, Apache, or JBoss. Exposure to monitoring and logging tools (ELK, Nagios, Splunk, DataDog, New Relic, etc.). Understanding of security and identity management (OAuth2, SSO, ADFS, Keycloak, etc.). Experience with version control systems (Git, Bitbucket, Subversion). Working knowledge of database technologies More ❯
design (REST, GraphQL) Experience with containerization (Docker, Kubernetes) and cloud-native development patterns DevOps & SRE Practices Experience implementing CI/CD pipelines and DevOps methodologies Knowledge of infrastructure monitoring (Datadog), log aggregation, and incident management Understanding of SLO/SLA definition and observability best practices Strategic & Business Acumen Ability to align technical initiatives with business objectives and articulate ROI Experience More ❯
City of London, London, United Kingdom Hybrid / WFH Options
TreasurySpring
based architectures, and queuing technologies, i.e. RabbitMQ Experience of REST and/or GraphQL APIs Knowledge of the core AWS services: i.e. EC2/ECS, RDS, S3 Experience using DataDog or similar observability tools Knowledge of containerisation: Docker, Kubernetes, AWS Fargate etc Any experience of front-end or fullstack development using TypeScript & React Experience building software for financial services and More ❯
monitoring. Design and execute load, stress, endurance, and failover tests using industry-standard tools such as JMeter, LoadRunner, or ADS. Set up observability dashboards (Grafana, Splunk, Dynatrace, Kibana, or Datadog) to monitor test execution and system performance. Analyse results to identify performance bottlenecks, system vulnerabilities, and areas for optimisation. Report findings and recommendations clearly to both technical and non-technical More ❯
Your expertise will ensure seamless, reliable, and secure experiences for customers. Hands-on experience with AWS services (ECS, S3, RDS, Lambda, CloudFront, etc.). Proficiency with monitoring tools like DataDog, CloudWatch, and Grafana. Experience with Docker, ECS, Kubernetes, or similar containerisation technologies. Knowledge of scripting languages such as Bash, Python, NodeJS. Familiarity with Infrastructure as Code (IaC) tools like Terraform More ❯
pipelines through tools such as Azure DevOps, GitHub Actions, or Octopus. You’ll also be adept at automating workflows in Python or PowerShell and implementing modern observability solutions including DataDog, OpenSearch, and LogicMonitor. This is a rare opportunity to join a high-performing, global hedge fund where technology and engineering directly drive investment performance and operational scale. To ensure the More ❯
Familiarity with testing frameworks (Vitest, Playwright) for both API and end-to-end testing. Experience with Docker, Helm, YAML, Kubernetes, and cloud-native deployments. Telemetry tools; Prometheus, Grafana, OpenTelemetry, DataDog, APM tools Understanding of infrastructure-as-code and CI/CD pipelines. Ability to improve codebases and influence architectural direction. Please send updated CV quoting availability and outside IR35 day More ❯
Familiarity with testing frameworks (Vitest, Playwright) for both API and end-to-end testing. Experience with Docker, Helm, YAML, Kubernetes, and cloud-native deployments. Telemetry tools; Prometheus, Grafana, OpenTelemetry, DataDog, APM tools Understanding of infrastructure-as-code and CI/CD pipelines. Ability to improve codebases and influence architectural direction. Please send updated CV quoting availability and outside IR35 day More ❯
and fix any issues with Jenkins pipelines used for CI/CD. Implement/Deploy AWS in large, regulated enterprise environment Monitor the IT estate using tools such as Datadog/CloudWatch and proactively react to alerts, Perform patch management of the estate using tools including RedHat Satellite and Ansible Remediate any security vulnerabilities within a timely manner. Participate in More ❯
Python, JavaScript). Solid understanding and practical experience with Infrastructure as Code (IaC), CI/CD pipelines, and GitOps methodologies. Experience with monitoring and observability tools (e.g. Prometheus, Grafana, Datadog). Strong communication skills with a proven ability to collaborate with cross-functional teams (e.g. Data Scientists, Data Analysts, Product Managers, Product Engineers). Experience investigating and resolving operational incidents More ❯
and fix any issues with Jenkins pipelines used for CI/CD. Implement/Deploy AWS in large, regulated enterprise environment Monitor the IT estate using tools such as Datadog/CloudWatch and proactively react to alerts, Perform patch management of the estate using tools including RedHat Satellite and Ansible Remediate any security vulnerabilities within a timely manner. Participate in More ❯
analysing metrics and logs using KQL (Kusto Query Language). Skilled in performance troubleshooting, implementing Azure Service Health monitoring, and setting up distributed tracing. Ideally, knowledge and experience of Datadog Observability tooling. Security & Compliance - Strong understanding of Azure security best practises including Azure Security Center/Microsoft Defender for Cloud, encryption using Azure Key Vault, network security with NSGs and More ❯
City of London, London, United Kingdom Hybrid / WFH Options
Oliver Bernard
with Docker and Kubernetes Strong Infrastructure as Code experience with Terraform History of working across CI/CD pipelines Monitoring and Observability experience with Prometheus, Grafana, and/or DataDog Prior experience overseeing Change and Incident Management processes Previous work in an Architectural capacity is also a massive bonus This position is open to Lead level Engineers, able to offer More ❯
with Docker and Kubernetes Strong Infrastructure as Code experience with Terraform History of working across CI/CD pipelines Monitoring and Observability experience with Prometheus, Grafana, and/or DataDog Prior experience overseeing Change and Incident Management processes Previous work in an Architectural capacity is also a massive bonus This position is open to Lead level Engineers, able to offer More ❯
Familiarity with testing frameworks (Vitest, Playwright) for both API and end-to-end testing. Experience with Docker, Helm, YAML, Kubernetes, and cloud-native deployments. Telemetry tools; Prometheus, Grafana, OpenTelemetry, DataDog, APM tools Understanding of infrastructure-as-code and CI/CD pipelines. Ability to improve codebases and influence architectural direction. Experience mentoring or coaching engineers. Please send updated CV quoting More ❯
team leverages a modern and scalable technology stack: Backend: Python (FastAPI), Node.js Frontend: React, TypeScript Database: PostgreSQL Infrastructure: AWS, Docker, Terraform CI/CD: GitHub Actions, Pulumi Monitoring & Observability: DataDog, Sentry Data & Analytics: dbt, Metabase Internal Tools: Retool Collaboration: Linear, Slack, Notion Candidates are not expected to have experience with every tool listed, but should be enthusiastic about learning and More ❯
Azure). Experience with relational databases and data processing and query engines (Spark, Trino, or similar). Familiarity with monitoring, observability, and alerting systems for production ML (Prometheus, Grafana, Datadog, or equivalent). Understanding of ML concepts. You don't need to train models, but you should speak the language of Research Engineers and understand their constraints. A mindset that More ❯
team leverages a modern and scalable technology stack: Backend: Python (FastAPI), Node.js Frontend: React, TypeScript Database: PostgreSQL Infrastructure: AWS, Docker, Terraform CI/CD: GitHub Actions, Pulumi Monitoring & Observability: DataDog, Sentry Data & Analytics: dbt, Metabase Internal Tools: Retool Collaboration: Linear, Slack, Notion Candidates are not expected to have experience with every tool listed, but should be enthusiastic about learning and More ❯
and applications Cloud networking and security fundamentals Collaboration and communication skills Highly adaptable to changes in a technical environment Desirable Experience using monitoring and observer ability toolsets inc. Splunk, Datadog Experience using Github Actions Experience using AWS RDS/SQL based solutions Experience using containerization in AWS Working data warehouse knowledge Redshift and Snowflake preferred Working with IaC – Terraform and More ❯