Related - C 894
- Java 4,013
- C++ 2,411
- Python 11,599
- React 2,805
- JavaScript 2,317
- AWS 9,156
- TypeScript 2,591
- C# 2,312
- GitHub 1,788
- PostgreSQL 1,627
- HTML 1,573
- CSS 1,358
- Relational Database 1,058
- AngularJS 774
More
|
101 to 120 of 120 Memory Management Jobs in London
London, England, United Kingdom Tether Operations Limited
various applications and hardware environments. Responsibilities: Design and deploy high-performance, resource-efficient model serving architectures adaptable to diverse environments. Establish and track performance metrics like latency, throughput, and memory usage. Develop and monitor inference tests, analyze results, and validate performance improvements. Prepare realistic datasets and scenarios to evaluate model performance in low-resource settings. Identify bottlenecks and optimize … with a strong publication record. Proven experience in low-level kernel and inference optimizations on mobile devices, with measurable improvements. Deep understanding of model serving architectures, optimization techniques, and memory management in resource-constrained environments. Expertise in CPU/GPU kernel development for mobile platforms and deploying inference pipelines on such devices. Ability to apply empirical research to … overcome latency, bottleneck, and memory challenges, with experience in evaluation frameworks and iterative optimization. #J-18808-Ljbffr More ❯
City Of London, England, United Kingdom Selby Jennings
free data structures and event-driven architectures for sub-microsecond latency. Integrate with internal risk, compliance, and analytics systems to ensure safe and compliant trading. Optimise CPU cache usage, memory layout, and instruction pipelines for maximum throughput. Collaborate with traders to prototype and deploy new strategies in production with minimal latency overhead. Technical Requirements: Expert-level C++ (C++17/… with deep understanding of memory management, concurrency (lock-free, wait-free), and template metaprogramming. Proven experience in low-latency trading systems, preferably in ETFs, equities, or delta-one products. Strong knowledge of networking protocols (TCP/UDP, FIX, multicast), kernel bypass, and hardware timestamping. Experience with real-time market data processing, order book construction, and tick-to-trade More ❯
London, England, United Kingdom Wayve
performance, ensuring efficient data transfer and low-latency processing across diverse hardware platforms. Define and oversee the design of robust, scalable systems that leverage advanced techniques in system and memory management to maximise real-time processing efficiency and minimise latency. Drive the identification and implementation of solutions to improve system performance, focusing on end-to-end data flow … tools and methodologies, guiding the team to identify bottlenecks and implement effective solutions. Act as a key liaison between software, hardware, and machine learning teams to ensure seamless data management, preprocessing, and optimal system utilisation, fostering an integrated approach to performance optimisation. Establish and champion industry best practices for high-performance, low-latency systems, setting a strong technical standard … systems. Strong background in developing software for embedded systems using high-performance system languages such as C++ and/or Rust. Deep understanding of system architecture, low-level programming, memory management, and resource utilisation, particularly for high-performance, real-time applications. Proven ability to lead and mentor engineers while working effectively across disciplines with hardware, software, and machine More ❯
London, England, United Kingdom Hybrid / WFH Options Parity Technologies
You have experience or are excited to learn about virtual machines, RISC-V, AMD64 assembly, dynamic recompilation, bytecode interpreters, compilers (especially code optimization and machine code generation), low-level memory management, low-level OS interfaces (Linux), security sandboxing, performance engineering, zero-knowledge proof systems, and zero-knowledge virtual machines You work autonomously and solve problems independently You work More ❯
Hounslow, England, United Kingdom JR United Kingdom
systems. Hands-on experience with advanced inference optimisation techniques, such as KVCache, MoE, adaptive batching, or gradient checkpointing. Strong understanding of low-level operating systems concepts, including multi-threading, memory management, and performance tuning. Knowledge of efficient training and inference evaluation strategies, with demonstrated success in improving model efficiency. In all we do, our core values guide us. More ❯
London, England, United Kingdom JR United Kingdom
systems. Hands-on experience with advanced inference optimisation techniques, such as KVCache, MoE, adaptive batching, or gradient checkpointing. Strong understanding of low-level operating systems concepts, including multi-threading, memory management, and performance tuning. Knowledge of efficient training and inference evaluation strategies, with demonstrated success in improving model efficiency. In all we do, our core values guide us. More ❯
London, England, United Kingdom Apple Inc
for the GPU. As a key member of our team, you will apply your knowledge of low-level system architecture to help the design and delivery of features in memory management, power management, asynchronous command processing, and error concealment for a wide variety of applications from games to machine learning, to image processing. You will partner with … knowledge of C/C++ Understanding of computer system architecture Understanding of driver development or operating system concepts Preferred Qualifications Excellent written and oral communication skills Experience with power management and control theory #J-18808-Ljbffr More ❯
London Area, United Kingdom Hybrid / WFH Options Anson McCade
and LangFlow. Implement retrieval-augmented generation (RAG) pipelines using vector databases like Pinecone, FAISS, Chroma, or PostgreSQL. Fine-tune prompts to optimise performance, reliability, and alignment. Design and implement memory modules for short-term and long-term agent behaviours. Deploy models and orchestrate AI systems on cloud platforms such as AWS Bedrock, Google Vertex AI, and Azure AI Studio. … collaboratively with their teams, Google Premier Cloud Partner experience etc Proven track record designing and deploying agentic and generative AI prototypes. Deep understanding of semantic search, vector databases, and memory management strategies. Familiarity with cloud AI tools, observability platforms, and performance optimisation. This is an opportunity to work at the forefront of AI innovation, where your work will More ❯
City of London, London, United Kingdom Hybrid / WFH Options Anson McCade
and LangFlow. Implement retrieval-augmented generation (RAG) pipelines using vector databases like Pinecone, FAISS, Chroma, or PostgreSQL. Fine-tune prompts to optimise performance, reliability, and alignment. Design and implement memory modules for short-term and long-term agent behaviours. Deploy models and orchestrate AI systems on cloud platforms such as AWS Bedrock, Google Vertex AI, and Azure AI Studio. … collaboratively with their teams, Google Premier Cloud Partner experience etc Proven track record designing and deploying agentic and generative AI prototypes. Deep understanding of semantic search, vector databases, and memory management strategies. Familiarity with cloud AI tools, observability platforms, and performance optimisation. This is an opportunity to work at the forefront of AI innovation, where your work will More ❯
City of London, London, United Kingdom Hybrid / WFH Options Anson McCade
and LangFlow. • Implement retrieval-augmented generation (RAG) pipelines using vector databases like Pinecone, FAISS, Chroma, or PostgreSQL. • Fine-tune prompts to optimise performance, reliability, and alignment. • Design and implement memory modules for short-term and long-term agent behaviours. • Deploy models and orchestrate AI systems on cloud platforms such as AWS Bedrock, Google Vertex AI, and Azure AI Studio. … collaboratively with their teams, Google Premier Cloud Partner experience etc • Proven track record designing and deploying agentic and generative AI prototypes. • Deep understanding of semantic search, vector databases, and memory management strategies. • Familiarity with cloud AI tools, observability platforms, and performance optimisation. This is an opportunity to work at the forefront of AI innovation, where your work will More ❯
City of London, London, United Kingdom Hybrid / WFH Options Anson McCade
and LangFlow. • Implement retrieval-augmented generation (RAG) pipelines using vector databases like Pinecone, FAISS, Chroma, or PostgreSQL. • Fine-tune prompts to optimise performance, reliability, and alignment. • Design and implement memory modules for short-term and long-term agent behaviours. • Deploy models and orchestrate AI systems on cloud platforms such as AWS Bedrock, Google Vertex AI, and Azure AI Studio. … collaboratively with their teams, Google Premier Cloud Partner experience etc • Proven track record designing and deploying agentic and generative AI prototypes. • Deep understanding of semantic search, vector databases, and memory management strategies. • Familiarity with cloud AI tools, observability platforms, and performance optimisation. This is an opportunity to work at the forefront of AI innovation, where your work will More ❯
London Area, United Kingdom Hybrid / WFH Options Anson McCade
and LangFlow. • Implement retrieval-augmented generation (RAG) pipelines using vector databases like Pinecone, FAISS, Chroma, or PostgreSQL. • Fine-tune prompts to optimise performance, reliability, and alignment. • Design and implement memory modules for short-term and long-term agent behaviours. • Deploy models and orchestrate AI systems on cloud platforms such as AWS Bedrock, Google Vertex AI, and Azure AI Studio. … collaboratively with their teams, Google Premier Cloud Partner experience etc • Proven track record designing and deploying agentic and generative AI prototypes. • Deep understanding of semantic search, vector databases, and memory management strategies. • Familiarity with cloud AI tools, observability platforms, and performance optimisation. This is an opportunity to work at the forefront of AI innovation, where your work will More ❯
London, England, United Kingdom Anson McCade
and LangFlow. • Implement retrieval-augmented generation (RAG) pipelines using vector databases like Pinecone, FAISS, Chroma, or PostgreSQL. • Fine-tune prompts to optimise performance, reliability, and alignment. • Design and implement memory modules for short-term and long-term agent behaviours. • Deploy models and orchestrate AI systems on cloud platforms such as AWS Bedrock, Google Vertex AI, and Azure AI Studio. … collaboratively with their teams, Google Premier Cloud Partner experience etc • Proven track record designing and deploying agentic and generative AI prototypes. • Deep understanding of semantic search, vector databases, and memory management strategies. • Familiarity with cloud AI tools, observability platforms, and performance optimisation. This is an opportunity to work at the forefront of AI innovation, where your work will More ❯
London, England, United Kingdom Hybrid / WFH Options Infosys
Responsibilities · Design, develop, and deploy autonomous AI agent ecosystems using frameworks such as LangChain, AutoGen, CrewAI, and Semantic Kernel. · Architect LLM-powered workflows involving multi-agent collaboration, decision logic, memory management, and external tool integration. · Collaborate with consulting teams to align AI agent solutions with business goals and industry use cases across sectors (FSI, Retail, Manufacturing, etc.). … agents · Strong experience with Python and orchestration libraries such as LangChain, LlamaIndex, Semantic Kernel, AutoGen, or similar. · Deep knowledge of LLMs (GPT, Claude, LLaMA, Mistral, etc.), prompt engineering, agent memory, tool calling, and autonomous task execution. · Experience with pre-sales, RFP/RFI support, and proposal creation in a consulting or enterprise services environment. · Understanding of enterprise solutioning with More ❯
London, England, United Kingdom Hybrid / WFH Options Infosys Consulting - Europe
Responsibilities • Design, develop, and deploy autonomous AI agent ecosystems using frameworks such as LangChain, AutoGen, CrewAI, and Semantic Kernel. • Architect LLM-powered workflows involving multi-agent collaboration, decision logic, memory management, and external tool integration. • Collaborate with consulting teams to align AI agent solutions with business goals and industry use cases across sectors (FSI, Retail, Manufacturing, etc.). … agents • Strong experience with Python and orchestration libraries such as LangChain, LlamaIndex, Semantic Kernel, AutoGen, or similar. • Deep knowledge of LLMs (GPT, Claude, LLaMA, Mistral, etc.), prompt engineering, agent memory, tool calling, and autonomous task execution. • Experience with pre-sales, RFP/RFI support, and proposal creation in a consulting or enterprise services environment. • Understanding of enterprise solutioning with More ❯
City of London, Greater London, UK Nihires
Ragas, TruLens, or your own scoring systems Experience tuning open-source models (e.g., Mistral, LLaMA, Falcon) or working with APIs (OpenAI, Anthropic, Cohere) Exposure to agentic systems, tools + memory management, or multi-step reasoning chains Experience in fast-paced, early-stage startup environments Why This Role Is Unique You’ll be engineering AI features that ship to More ❯
London, England, United Kingdom JR United Kingdom
Ragas , TruLens , or your own scoring systems Experience tuning open-source models (e.g., Mistral, LLaMA, Falcon) or working with APIs (OpenAI, Anthropic, Cohere) Exposure to agentic systems , tools + memory management, or multi-step reasoning chains Experience in fast-paced, early-stage startup environments ? Why This Role Is Unique You’ll be engineering AI features that ship to More ❯
London, England, United Kingdom Amazon
appropriate terminology. This role also involves evaluating and adapting source content for target markets, taking into consideration product availability, cultural sensitivities, and local business practices. The position includes terminology management (style guides, term bases and translation memory management) for the German market. Working within established standard operating procedures, you will exercise independent judgment to make critical decisions More ❯
London, England, United Kingdom Amazon
appropriate terminology. This role also involves evaluating and adapting source content for target markets, taking into consideration product availability, cultural sensitivities, and local business practices. The position includes terminology management (style guides, term bases and translation memory management) for the Italian market. Working within established standard operating procedures, you will exercise independent judgment to make critical decisions More ❯
London, England, United Kingdom Amazon
appropriate terminology. This role also involves evaluating and adapting source content for target markets, taking into consideration product availability, cultural sensitivities, and local business practices. The position includes terminology management (style guides, term bases and translation memory management) for the Spanish market. Working within established standard operating procedures, you will exercise independent judgment to make critical decisions More ❯
|
Salary Guide Memory Management London - 10th Percentile
- £84,500
- 25th Percentile
- £90,000
- Median
- £150,000
- 75th Percentile
- £165,000
- 90th Percentile
- £167,000
|