jobs, retries, monitoring, automation). Work with S3-style object storage: efficient layouts, lifecycle, throughput, and cost awareness. Add tooling around pipelines (progress/health visualization, metrics, alerts) for observability and faster iteration. Collaborate closely with ML engineers to align datasets with training needs and accelerate experimentation. Requirements Must-have Strong Python fundamentals; you write clean, maintainable, production-ready code. More ❯
overseeing the full AI solution lifecycle — from concept to production. Mentor and guide Lead AI Engineers and technical contributors to raise the engineering bar. Champion best practices for governance, observability, and lifecycle management of AI systems. Evaluate and introduce emerging frameworks, including Autogen, LangGraph, and other agentic ecosystems. Partner with senior stakeholders to align AI strategy with business goals. Drive More ❯
other internal teams to fully understand client requirements and deliver tailored technical solutions. Design and implement scalable, future-proof architectures for new third-party connectors and integrations. Enhance system observability by improving diagnostics, logging, and tracing to aid technical support teams in resolving issues swiftly. Oversee the ongoing development and management of the public API, covering REST and event streaming More ❯
SQL databases and query optimisation RESTful API design and implementation Git – version control and collaborative workflows Backend testing strategies (unit, integration, end-to-end) Experience with monitoring, logging, and observability tools Knowledge of scalable system architecture and design patterns Highly desirable AdonisJS, NestJS or similar MVC frameworks AWS infrastructure and services Infrastructure as Code (Terraform or similar) Distributed systems experience More ❯
international markets Previous experience in the parking or mobility sector Experience with GraphQL and modern API integration patterns Knowledge of micro-frontend architectures Experience with advanced performance monitoring and observability tools Growth Opportunities Opportunity to shape the frontend strategy for a rapidly growing international company Increasing involvement in strategic technical decision-making Development of broader technology leadership skills Experience in More ❯
DevOps, infrastructure, and platform engineering. Tech Stack Cloud: AWS (EC2, RDS, S3, IAM, CloudWatch, Lambda) Infrastructure as Code: Terraform Containerisation & Orchestration: Docker, Kubernetes (EKS), Helm Configuration Management: Ansible Monitoring & Observability: Grafana, Prometheus CI/CD: GitHub Actions Automation & Scripting: Python, Bash, Go or Java What We’re Looking For Proven experience running AWS cloud infrastructure in a production or regulated … financial) environment. Hands-on experience managing Kubernetes clusters (preferably EKS). Strong understanding of Infrastructure as Code using Terraform. Familiarity with monitoring and observability stacks such as Prometheus and Grafana. Experience building and maintaining CI/CD pipelines (GitHub Actions or similar). Strong scripting or automation skills using Python, Bash, Go or Java . A collaborative mindset — comfortable working More ❯
AWS (Core Services – EC2, RDS, S3, IAM, Lambda, CloudWatch) Infrastructure as Code: Terraform Containerisation & Orchestration: Docker, Kubernetes (EKS), Helm Configuration Management: Ansible CI/CD Pipelines: GitHub Actions Monitoring & Observability: Grafana, Prometheus Scripting/Automation: Python or Java What We’re Looking For Proven experience managing and scaling AWS cloud environments , ideally supporting live software products or high-traffic platforms. … Strong background in Terraform and Infrastructure as Code best practices. Practical experience with Kubernetes (EKS) in production. Familiarity with monitoring and observability tools such as Grafana and Prometheus. Hands-on experience building CI/CD pipelines (GitHub Actions, Jenkins, CircleCI, etc.). Solid scripting and automation experience using Python or Java . A collaborative engineer who enjoys working closely with More ❯
london, south east england, united kingdom Hybrid/Remote Options
Black Pen Recruitment
tooling, systems design, and operational resilience. Their environment offers opportunities to work on everything from CI/CD pipelines and container orchestration to configuration management, infrastructure as code, and observability tooling. While you may bring experience in specific tools or platforms, you will be expected to contribute broadly across our infrastructure landscape. Our client's core product is a comprehensive … Solid Linux administration and general networking knowledge Understanding of infrastructure security best practices, including secure configuration, identity and access management, and compliance controls Experience with monitoring, alerting, and system observability Background in financial services infrastructure is advantageous but not required More ❯
pipelines, reducing deployment time and improving release reliability Strengthen system resilience through infrastructure improvements and scalability planning Work with Product Engineer's to enhance developer experience Drive automation and observability Requirements: Strong GCP experience Deep understanding of Terraform CI/CD pipelines Containerisation (Kubernetes, GKE) If you're interested get in touch ASAP More ❯
to build cost-effective solutions on Microsoft Azure while maintaining agility and fostering innovation. This position is perfect for engineers who are passionate about optimising cloud usage, enhancing cost observability, and championing a Fin Ops culture. Experience in some of the following would be ideal Partner with engineering, finance and product teams to drive cost-efficiency across Azure Clear understanding More ❯
Monitor and optimise network performance across cloud and on-premise environments Troubleshoot and resolve connectivity issues quickly and effectively Automate network configuration using Terraform, PowerShell and Azure CLI Maintain observability using Azure Monitor, Log Analytics and Network Watcher Ensure deployments align with security and compliance standards Produce technical documentation and support knowledge sharing Required Experience: Strong hands-on experience with More ❯
City of London, London, United Kingdom Hybrid/Remote Options
ARC IT Recruitment Ltd
/MTTR via automation, clear SLAs, and robust RCAs/post-mortems. Safer, faster releases (blue/green, canary, feature flags) in partnership with Trading, Quant, and Engineering. Mature observability (logs/metrics/traces), capacity planning, and performance tuning for low-latency flows. Strong production hygiene and controls aligned to MiFID II/MAR/best-ex. Leadership of More ❯
hands-on experience in Microsoft Azure ML Studio * Experience using business intelligence tools, preferably Power BI * Experience applying Generative AI and prompting techniques * Strong understanding of data governance, model observability, and compliance frameworks * Proven ability to deliver secure, scalable, and responsible data science solutions If this sounds like you and you are available on short notice, apply now More ❯
practices for automation tools such as Power Automate Desktop. * Build out robust ALM processes using Azure DevOps or GitHub - including pipelines, solution management, environment variables, and connection references. * Implement observability and monitoring through Application Insights, Azure Monitor, and alerting frameworks. * Design secure integration layers using Azure services such as API Management, Service Bus, Functions, Logic Apps, and Key Vault. * Lead More ❯
the rapid and efficient development of new third-party connectors. Ensure the system's interfaces, testing protocols, and designs are robust and future-proof. Monitoring and Diagnostics: Enhance system observability and streamline the diagnosis of technical issues through advanced logging and tracing capabilities, aiding front-line technical staff. API Development and Management: Oversee the development and maintenance of our public More ❯
london, south east england, united kingdom Hybrid/Remote Options
Fresha
projects autonomously. Developer Experience - Extend our local development experience offerings for engineers Knowledge Sharing - Enrich knowledge across the department by creating Documentation, SOPs, Runbooks and fascinating knowledge-sharing sessions Observability - Extend Monitoring & Observability capabilities. Accessibility - Simplifying the process for engineers to access this data Collaboration - Collaborate and enable engineers to do their jobs more efficiently Efficiency - Developing tools to maximise More ❯
optimise BI dashboards and data products using Tableau, translating business needs into visual insights. Orchestrate and monitor data pipelines, ensuring data quality and timely delivery. Implement data quality checks, observability, and maintain data cataloging and lineage. Drive CI/CD practices using GitHub Actions or similar tools. Collaborate with cross-functional teams to improve platform capabilities and analytics maturity. Requirements More ❯
optimise BI dashboards and data products using Tableau, translating business needs into visual insights. Orchestrate and monitor data pipelines, ensuring data quality and timely delivery. Implement data quality checks, observability, and maintain data cataloging and lineage. Drive CI/CD practices using GitHub Actions or similar tools. Collaborate with cross-functional teams to improve platform capabilities and analytics maturity. Requirements More ❯
London, South East, England, United Kingdom Hybrid/Remote Options
Addition
trusted C-level relationships. Confident presenter with a consultative approach tailored to enterprise telecom clients. Experience working with nearshore/offshore delivery models is a plus. Knowledge of AIOps, observability, network automation, or platform engineering is advantageous. What’s in It for You Join a global team of 6,000+ technologists, with autonomy to shape growth in a critical sector. More ❯
production setting. Strong proficiency in Python and TypeScript and experience with asynchronous programming and large-scale systems. Good understanding of modern software engineering practices: API design, modular architecture, testing, observability, continuous deployment. Comfortable diagnosing failures in deep technology stacks, using a variety of tools, logs, trace systems, metrics, etc. Preferred/Bonus Qualifications: Previous experience working with multi-modal models More ❯
aggregates). Collaborate with analysts, BI users, data scientists, and business stakeholders to translate data requirements into reliable data products (tables, views, metrics). Ensure data quality, consistency, and observability (tests, monitoring, alerting). Optimize SQL queries and transformations for performance in your data warehouse/lakehouse environment. Support or own CI/CD workflows around analytics (e.g. git, reviews More ❯
next-generation IT services and solutions. The role DXC Technology is seeking a dynamic and technically adept Solution Engineer to support our strategic partnership with Dynatrace, a leader in observability and application performance monitoring. This role is pivotal in driving technical sales engagements, crafting tailored observability solutions, and aligning Dynatraces capabilities with DXCs service offerings. As a trusted technical advisor … deployment strategies, and ongoing optimization of Dynatrace solutions. Solution Design & Presentation: Design tailored solutions leveraging the Dynatrace platform, ensuring scalability, security, and performance across diverse environments. Design and propose observability architectures across cloud-native, hybrid, and multi-cloud environments. Develop complex proposals and cost models aligned with customer goals and DXCs service offerings Deliver compelling product demonstrations and proof-of … Systems Integrator (GSI). Proven experience in pre-sales, solution architecture, or consulting roles. Strong understanding of cloud platforms (AWS, Azure, GCP), DevOps, and microservices. Hands-on experience with observability tools, APM, log analytics, and infrastructure monitoring. Proficiency in scripting or programming languages (e.g., Java, Python, JavaScript). Certifications in Dynatrace, AWS, Azure, or GCP. Familiarity with OpenTelemetry, FluentBit, Cribl More ❯