of our new GCP environment, working collaboratively with cross-functional teams. This role requires a proactive individual who excels at bridging the gap between development and operations while ensuring performance, reliability, and scalability during and after the migration. Key Responsibilities: Infrastructure Design & Implementation: Architect, design, and implement robust, secure, and scalable infrastructure on GCP using Terraform. Platform Reliability & Automation … response. Develop CI/CD pipelines, automated testing, and disaster recovery strategies. Container & Orchestration Management: Manage container orchestration using Kubernetes (GKE) and deploy containerized applications with a focus on performance, scalability, and security. Work closely with development, security, and product teams to ensure seamless integration, adherence to best practices, and successful migration execution. Monitoring, Logging, & Performance Tuning More ❯
Watford, England, United Kingdom Hybrid / WFH Options
JR United Kingdom
into our platform. Ensure smooth end-to-end integration of AI/ML functionality within broader product development efforts. Technical Leadership Define best practices for MLOps, model deployment, and performance tuning. Evaluate and implement new frameworks, tools, and cloud-native AI solutions (including LLMs). Make strategic technical decisions that align with broader business goals. Support the growth of … AWS (e.g., SageMaker, Bedrock) and infrastructure-as-code tools like Terraform. Solid understanding of large-scale data pipelines, distributed systems, and microservice architectures. Comfortable working with LLMs, model fine-tuning, and building features powered by generative AI. Experience setting up CI/CD pipelines for ML workflows and maintaining scalable systems in the cloud. A proven track record of More ❯
direction with business priorities to shape the roadmap. Technical Leadership Design, optimize, and govern modern, large-scale data architectures. Oversee architecture across applications, infrastructure, and ecosystem integrations. Apply advanced performance tuning, database optimization, and domain-driven design principles. Collaborate with and mentor engineers on system architecture, CI/CD workflows, and API integrations. Provide technical oversight in data … effectiveness of operations through thoughtful oversight. Required Experience Proven experience in large-scale data/content operations and platform engineering. Expertise in Microsoft SQL Server, including: Database design and performance tuning Transact-SQL (T-SQL), C# REST API integration Familiarity with Data Lakehouse architectures, especially Databricks. Strong understanding of data architecture, domain modeling, and system design in distributed More ❯
Lead the design, build, and maintenance of resilient and secure cloud-based solutions. Define and implement standardised patterns, practices, and tools to support the development lifecycle. Ensure platforms are optimized for performance, reliability, and scalability. Cloud Optimization Continuously review and optimize cloud resources for cost efficiency and performance improvements. Ensure solutions adhere to best practices for security, compliance, and operational efficiency. … practices such as Infrastructure as Code (IaC), CI/CD, and containerization. Experience managing and optimizing non-production environments to ensure stability and security. Expertise in cost optimization and performance tuning for cloud environments. Exceptional troubleshooting skills, with experience in 4th-line support for critical applications. Strong leadership and team-building skills, with experience in creating and growing More ❯
Watford, Hertfordshire, East Anglia, United Kingdom Hybrid / WFH Options
Queen Square Recruitment
through Large Language Models (LLMs), GenAI, and predictive analytics. Key Responsibilities: Design, develop, and deploy AI/ML models, including LLMs and GenAI. Conduct feature engineering, model optimization, and performance tuning. Implement and train supervised, unsupervised, and reinforcement learning models. Carry out advanced data exploration, analysis, and preprocessing. Deploy scalable models into production, with ongoing monitoring and tuning. Collaborate More ❯
now scaling our operations in London. Our work environment rewards innovation, speed, and bold thinking. The role We’re hiring Senior and Staff Software Engineers to build the high-performance computing infrastructure that powers our Optical Tensor Processing Units (OTPUs). This isn’t just about scaling models—it’s about rethinking how AI workloads are executed at speed … hardware and ML teams to optimise every layer of the stack—from model representation and execution to data movement and scheduling. Whether it’s through compiler techniques, systems-level tuning, or custom runtime design, you’ll play a critical role in shaping the performance layer of our AI platform. This is a role for engineers who think in … just model accuracy. If you’ve worked in HFT, large-scale scientific compute, or AI infrastructure at serious scale, we’d love to talk. Responsibilities Design and build high-performance systems for running AI/ML workloads across distributed compute clusters Optimise for ultra-low latency and real-time inference at scale—profiling, tuning, and rewriting critical systems More ❯
to be: Equally comfortable leading and coding Strong on integration patterns and XML tech Confident translating specs into clean, documented solutions Experienced with agile teams, CI/CD, and performance tuning If you're a strong technical lead who still loves writing great code, please apply More ❯
Leadership Lead the Web development team in the design, development, deployment, and maintenance of web applications (Mobile & Desktop). Oversee architecture and design decisions for new features, ensuring scalability, performance, and maintainability, working closely with the Architecture Team (Solution & Enterprise Architecture). Conduct code reviews to ensure adherence to best practices, coding standards, and technical guidelines to ensure high … improvements in the codebase. Code Coverage - The percentage of code covered by Unit/Automation tests. Post Release Defect - Reduction in the number of defects found post release. App Performance & Stability App Performance Metrics - user-facing metrics, e.g. application load times, responsiveness of features, memory usage, etc. Customer Satisfaction App Store Rating. User Retention & Engagement Metrics. Customer Issue … feature. Mentoring Effectiveness - Feedback from developers regarding the support and guidance they receive. Technical Innovation Regularly exploring new tools, libraries, and technologies that can improve the app's functionality, performance, or development efficiency. Risk & Technical Debt Management Reduction in Technical Debt. Risk Mitigation & Resolution Time - The number of high-risk technical issues identified and resolved within a defined period. More ❯
Watford, England, United Kingdom Hybrid / WFH Options
JR United Kingdom
to meet client-specific requirements. Provide technical configuration and customisation across SAP MDG modules. Lead integration activities between SAP MDG and external systems for smooth data workflows. Conduct troubleshooting, performance tuning, and technical assessments. Collaborate with business and functional stakeholders to translate needs into actionable technical deliverables. Key Skills Required Proven hands-on experience as an SAP More ❯