
101 to 120 of 120 Memory Management Jobs in London

AI Research Engineer (Model Serving & Inference)

London, England, United Kingdom

Tether Operations Limited

various applications and hardware environments. Responsibilities: Design and deploy high-performance, resource-efficient model serving architectures adaptable to diverse environments. Establish and track performance metrics like latency, throughput, and memory usage. Develop and monitor inference tests, analyze results, and validate performance improvements. Prepare realistic datasets and scenarios to evaluate model performance in low-resource settings. Identify bottlenecks and optimize … with a strong publication record. Proven experience in low-level kernel and inference optimizations on mobile devices, with measurable improvements. Deep understanding of model serving architectures, optimization techniques, and memory management in resource-constrained environments. Expertise in CPU/GPU kernel development for mobile platforms and deploying inference pipelines on such devices. Ability to apply empirical research to … overcome latency, bottleneck, and memory challenges, with experience in evaluation frameworks and iterative optimization. More ❯

Posted: Yesterday

C++ Developer - ETF Front Office Engineering

City Of London, England, United Kingdom

Selby Jennings

free data structures and event-driven architectures for sub-microsecond latency. Integrate with internal risk, compliance, and analytics systems to ensure safe and compliant trading. Optimise CPU cache usage, memory layout, and instruction pipelines for maximum throughput. Collaborate with traders to prototype and deploy new strategies in production with minimal latency overhead. Technical Requirements: Expert-level C++ (C++17/… with deep understanding of memory management, concurrency (lock-free, wait-free), and template metaprogramming. Proven experience in low-latency trading systems, preferably in ETFs, equities, or delta-one products. Strong knowledge of networking protocols (TCP/UDP, FIX, multicast), kernel bypass, and hardware timestamping. Experience with real-time market data processing, order book construction, and tick-to-trade More ❯

Posted: 4 days ago

Tech Lead, Inference Performance, Onboard

London, England, United Kingdom

Wayve

performance, ensuring efficient data transfer and low-latency processing across diverse hardware platforms. Define and oversee the design of robust, scalable systems that leverage advanced techniques in system and memory management to maximise real-time processing efficiency and minimise latency. Drive the identification and implementation of solutions to improve system performance, focusing on end-to-end data flow … tools and methodologies, guiding the team to identify bottlenecks and implement effective solutions. Act as a key liaison between software, hardware, and machine learning teams to ensure seamless data management, preprocessing, and optimal system utilisation, fostering an integrated approach to performance optimisation. Establish and champion industry best practices for high-performance, low-latency systems, setting a strong technical standard … systems. Strong background in developing software for embedded systems using high-performance system languages such as C++ and/or Rust. Deep understanding of system architecture, low-level programming, memory management, and resource utilisation, particularly for high-performance, real-time applications. Proven ability to lead and mentor engineers while working effectively across disciplines with hardware, software, and machine More ❯

Posted: Yesterday

Software Engineer (Compilers & Virtual Machines) - Full-time

London, England, United Kingdom
Hybrid / WFH Options

Parity Technologies

You have experience or are excited to learn about virtual machines, RISC-V, AMD64 assembly, dynamic recompilation, bytecode interpreters, compilers (especially code optimization and machine code generation), low-level memory management, low-level OS interfaces (Linux), security sandboxing, performance engineering, zero-knowledge proof systems, and zero-knowledge virtual machines You work autonomously and solve problems independently You work More ❯

Posted: 2 days ago

Staff AI Engineer

Hounslow, England, United Kingdom

JR United Kingdom

systems. Hands-on experience with advanced inference optimisation techniques, such as KVCache, MoE, adaptive batching, or gradient checkpointing. Strong understanding of low-level operating systems concepts, including multi-threading, memory management, and performance tuning. Knowledge of efficient training and inference evaluation strategies, with demonstrated success in improving model efficiency. In all we do, our core values guide us. More ❯

Posted: 2 days ago

Staff AI Engineer

London, England, United Kingdom

JR United Kingdom

Posted: 2 days ago

GPU Kernel Mode Driver Firmware Engineer

London, England, United Kingdom

Apple Inc

for the GPU. As a key member of our team, you will apply your knowledge of low-level system architecture to help the design and delivery of features in memory management, power management, asynchronous command processing, and error concealment for a wide variety of applications from games to machine learning, to image processing. You will partner with … knowledge of C/C++ Understanding of computer system architecture Understanding of driver development or operating system concepts Preferred Qualifications Excellent written and oral communication skills Experience with power management and control theory More ❯

Posted: 2 days ago

GCP AI Engineer

London Area, United Kingdom
Hybrid / WFH Options

Anson McCade

and LangFlow. Implement retrieval-augmented generation (RAG) pipelines using vector databases like Pinecone, FAISS, Chroma, or PostgreSQL. Fine-tune prompts to optimise performance, reliability, and alignment. Design and implement memory modules for short-term and long-term agent behaviours. Deploy models and orchestrate AI systems on cloud platforms such as AWS Bedrock, Google Vertex AI, and Azure AI Studio. … collaboratively with their teams, Google Premier Cloud Partner experience etc Proven track record designing and deploying agentic and generative AI prototypes. Deep understanding of semantic search, vector databases, and memory management strategies. Familiarity with cloud AI tools, observability platforms, and performance optimisation. This is an opportunity to work at the forefront of AI innovation, where your work will More ❯

Posted: Today

GCP AI Engineer

City of London, London, United Kingdom
Hybrid / WFH Options

Anson McCade

Posted: Today

GCP Data & AI Architect

City of London, London, United Kingdom
Hybrid / WFH Options

Anson McCade

and LangFlow. • Implement retrieval-augmented generation (RAG) pipelines using vector databases like Pinecone, FAISS, Chroma, or PostgreSQL. • Fine-tune prompts to optimise performance, reliability, and alignment. • Design and implement memory modules for short-term and long-term agent behaviours. • Deploy models and orchestrate AI systems on cloud platforms such as AWS Bedrock, Google Vertex AI, and Azure AI Studio. … collaboratively with their teams, Google Premier Cloud Partner experience etc • Proven track record designing and deploying agentic and generative AI prototypes. • Deep understanding of semantic search, vector databases, and memory management strategies. • Familiarity with cloud AI tools, observability platforms, and performance optimisation. This is an opportunity to work at the forefront of AI innovation, where your work will More ❯

Posted: Today

GCP Data & AI Engineer

City of London, London, United Kingdom
Hybrid / WFH Options

Anson McCade

Posted: Today

GCP Data & AI Architect

London Area, United Kingdom
Hybrid / WFH Options

Anson McCade

Posted: Today

GCP Data & AI Engineer

London, England, United Kingdom

Anson McCade

Posted: Yesterday

Agentic AI Architect

London, England, United Kingdom
Hybrid / WFH Options

Infosys

Responsibilities · Design, develop, and deploy autonomous AI agent ecosystems using frameworks such as LangChain, AutoGen, CrewAI, and Semantic Kernel. · Architect LLM-powered workflows involving multi-agent collaboration, decision logic, memory management, and external tool integration. · Collaborate with consulting teams to align AI agent solutions with business goals and industry use cases across sectors (FSI, Retail, Manufacturing, etc.). … agents · Strong experience with Python and orchestration libraries such as LangChain, LlamaIndex, Semantic Kernel, AutoGen, or similar. · Deep knowledge of LLMs (GPT, Claude, LLaMA, Mistral, etc.), prompt engineering, agent memory, tool calling, and autonomous task execution. · Experience with pre-sales, RFP/RFI support, and proposal creation in a consulting or enterprise services environment. · Understanding of enterprise solutioning with More ❯

Posted: 2 days ago

Agentic AI Architect

London, England, United Kingdom
Hybrid / WFH Options

Infosys Consulting - Europe

Responsibilities • Design, develop, and deploy autonomous AI agent ecosystems using frameworks such as LangChain, AutoGen, CrewAI, and Semantic Kernel. • Architect LLM-powered workflows involving multi-agent collaboration, decision logic, memory management, and external tool integration. • Collaborate with consulting teams to align AI agent solutions with business goals and industry use cases across sectors (FSI, Retail, Manufacturing, etc.). … agents • Strong experience with Python and orchestration libraries such as LangChain, LlamaIndex, Semantic Kernel, AutoGen, or similar. • Deep knowledge of LLMs (GPT, Claude, LLaMA, Mistral, etc.), prompt engineering, agent memory, tool calling, and autonomous task execution. • Experience with pre-sales, RFP/RFI support, and proposal creation in a consulting or enterprise services environment. • Understanding of enterprise solutioning with More ❯

Posted: Yesterday

AI RAG Engineer

City of London, Greater London, UK

Nihires

Ragas, TruLens, or your own scoring systems Experience tuning open-source models (e.g., Mistral, LLaMA, Falcon) or working with APIs (OpenAI, Anthropic, Cohere) Exposure to agentic systems, tools + memory management, or multi-step reasoning chains Experience in fast-paced, early-stage startup environments Why This Role Is Unique You’ll be engineering AI features that ship to More ❯

Posted: Today

AI RAG Engineer

London, England, United Kingdom

JR United Kingdom

Ragas, TruLens, or your own scoring systems Experience tuning open-source models (e.g., Mistral, LLaMA, Falcon) or working with APIs (OpenAI, Anthropic, Cohere) Exposure to agentic systems, tools + memory management, or multi-step reasoning chains Experience in fast-paced, early-stage startup environments Why This Role Is Unique You’ll be engineering AI features that ship to More ❯

Posted: 3 days ago

Language Content Specialist - German, Amazon Business Marketing

London, England, United Kingdom

Amazon

appropriate terminology. This role also involves evaluating and adapting source content for target markets, taking into consideration product availability, cultural sensitivities, and local business practices. The position includes terminology management (style guides, term bases and translation memory management) for the German market. Working within established standard operating procedures, you will exercise independent judgment to make critical decisions More ❯

Posted: 2 days ago

Language Content Specialist - Italian, Amazon Business Marketing

London, England, United Kingdom

Amazon

appropriate terminology. This role also involves evaluating and adapting source content for target markets, taking into consideration product availability, cultural sensitivities, and local business practices. The position includes terminology management (style guides, term bases and translation memory management) for the Italian market. Working within established standard operating procedures, you will exercise independent judgment to make critical decisions More ❯

Posted: 2 days ago

Language Content Specialist - Spanish, Amazon Business Marketing

London, England, United Kingdom

Amazon

appropriate terminology. This role also involves evaluating and adapting source content for target markets, taking into consideration product availability, cultural sensitivities, and local business practices. The position includes terminology management (style guides, term bases and translation memory management) for the Spanish market. Working within established standard operating procedures, you will exercise independent judgment to make critical decisions More ❯

Posted: 2 days ago


Salary Guide

Memory Management
London

10th Percentile: £84,500
25th Percentile: £90,000
Median: £150,000
75th Percentile: £165,000
90th Percentile: £167,000
