City of London, London, United Kingdom Hybrid / WFH Options
ViVA Tech Talent
of tick data , time-series databases , and real-time market data . Experience working with global market data feeds (e.g., Bloomberg, Reuters, FIX, Kafka). Proficiency in performance tuning, memorymanagement, and multi-threaded programming in KDB+ . Comfortable working in Agile/DevOps environments, including CI/CD pipelines. Familiarity with Linux systems , scripting (Python/Bash More ❯
features and enhancements into the trading platform. Code Review & Maintenance: Participate in code reviews, maintain high code quality standards, and contribute to the continuous improvement of development practices. Risk Management: Develop and integrate risk management tools to monitor and mitigate potential risks in trading strategies. Qualifications Educational Background: Bachelor's, Master's, or PhD in Computer Science, Engineering … Mathematics, Physics, or a related quantitative discipline. Programming Skills: Expertise in C++: Strong proficiency in C++ (11/14/17) with a focus on performance optimization, memorymanagement, and multithreading. Additional Languages: Proficiency in Python, R, or other scripting languages is a plus. Financial Knowledge: Market Expertise: In-depth knowledge of equities and futures markets, with a More ❯
routing and execution components that handle market volatility Apply advanced networking principles to minimise round-trip times and network jitter Continuously assess and remove inefficiencies in JVM performance and memorymanagement Strengthen operational resilience through improved monitoring, alerting, and fault tolerance SKILL/EXPERIENCE REQUIRED: Expertise in market-making or algorithmic trading, including deep understanding of order book … environments Proficiency in networking concepts, including TCP/UDP, multicast, and low-level optimizations like DPDK or kernel bypass Mastery of concurrent programming, including lock-free algorithms and shared memory architecture Practical experience with Linux performance tuning, including NUMA balancing and use of huge pages Background in high-frequency trading and real-time market operations Ability to design tools More ❯
City of London, London, United Kingdom Hybrid / WFH Options
Peaple Talent
Kubernetes. Excellent interpersonal and communication abilities, enabling effective teamwork and collaboration. Strong knowledge of both Linux and Windows operating systems. Proven experience working with multi-threaded applications and complex memorymanagement scenarios. Confident navigating across multiple technology domains and full software stacks. Passionate about learning and committed to continuous professional growth. Exposure to high-level languages; Python and More ❯
design patterns and performance optimisation. Experience building and supporting real-time or high-throughput systems. Familiarity with messaging protocols (e.g., FIX, MQ, Kafka). Good grasp of data structures, memorymanagement, and JVM tuning. Strong problem-solving and debugging skills in a time-sensitive environment. Desirable: Previous experience in an FX , equities , or electronic trading environment. Knowledge of More ❯
to optimise execution performance Build and maintain messaging infrastructure using technologies such as Aeron, SBE, Disruptor, Chronicle, and FIX Tune the JVM for microsecond-level performance (0GC, CPU affinity, memorymanagement) Contribute to architecture and design decisions for latency-sensitive services Ensure code quality through rigorous testing, benchmarking, and profiling Required Skills & Experience: Strong Core Java expertise (JVM … engines Familiarity with low-latency messaging libraries (e.g. Aeron, SBE, Disruptor) Solid understanding of network-level tuning and OS-level performance optimisation Desirable: Experience with 0GC and off-heap memorymanagement Exposure to cloud-based deployment and containerisation (Docker, Kubernetes, CI/CD) Background working in high-frequency trading or market-making environments More ❯
City of London, London, United Kingdom Hybrid / WFH Options
Insight Global
have the following: • Advanced proficiency in Java • Strong experience in low-latency, high-throughput systems, particularly within FX trading platforms (e.g. LSE, 360T) • Deep understanding of efficient garbage collection, memorymanagement, and object modeling • Unit testing experience with Grabler, JUnit, or similar frameworks • Proficient in Oracle or similar relational databases • Comfortable working in agile environments, especially those transitioning More ❯
team in London on a full-time basis What You’ll Do Architect and implement high-performance data processing systems in Rust Leverage Apache Arrow and Parquet for in-memory and on-disk data efficiency Integrate and extend systems like DataFusion, ClickHouse, and DuckDB Design low-latency pipelines for analytical workloads Collaborate with product and platform teams to expose … Fit If You: Have experience with Rust/C/C++ or systems-level languages Have worked with or contributed to analytical databases like ClickHouse, DuckDB, or DataFusion Understand memorymanagement, file formats, and query execution pipelines Are passionate about open-source and care about software craftsmanship Thrive in a startup environment Want to help reinvent how data More ❯
grade solutions Continuously profile and tune system performance to minimise latency and maximise throughput Required Skills & Experience: Strong proficiency in core Java , with a focus on performance, concurrency, and memorymanagement Proven experience building low-latency, high-throughput systems in trading or similarly demanding environments Deep understanding of multithreading , lock-free programming , and garbage collection tuning More ❯
with ML engineers to implement GPU-level optimizations for ML model training and inference, focusing on speed and efficiency. Profile and optimize ML workloads running on GPUs, focusing on memorymanagement, parallelization, and performance tuning. Develop and optimize custom GPU drivers and frameworks for ML-specific tasks, including model training, AI inference, and data preprocessing. Collaborate with data … to date with the latest GPU architecture and machine learning advancements, applying new techniques to optimize system performance. Skills and Experience: Proficiency in C++ with a strong focus on memorymanagement, multi-threading, and low-level performance optimizations. Experience with GPU architectures (e.g., NVIDIA, AMD) and programming frameworks like CUDA, OpenCL, and TensorFlow. Understanding of machine learning algorithms More ❯
City of London, London, United Kingdom Hybrid / WFH Options
Techfellow Limited
You Bring... 4-8 years' experience managing large-scale Linux infrastructure in high-performance, distributed, or AI-centric environments Deep technical fluency with GPU architecture, deployment, and tuning (e.g. memorymanagement, driver compatibility, hardware diagnostics) Strong scripting and automation skills, especially in Python, with infrastructure-as-code mindset Hands-on experience resolving GPU workload issues across compute clusters … and supporting technologies Familiarity with performance tooling and debugging in live production environments Practical experience with CUDA or systems-level programming in C/C++ Experience with config management frameworks like Salt, Ansible, or Puppet (Preferred) Experience with GPU communication and interconnect technologies (e.g. collective communication libraries such as NCCL, low-latency solutions like GPUDirect RDMA, or high-speed More ❯
Java. The Ideal Candidate: 8+ years engineering experience, ideally in a front-office trading context. Expert in Java with deep systems-level understanding. Strong grasp of low-latency architecture, memorymanagement, and optimisation. Exposure to crypto markets is a plus —but not a prerequisite. Please contact daniel.mclagan@stanfordblack.com for more information. If this role isn't right for More ❯
City of London, London, United Kingdom Hybrid / WFH Options
Annapurna
real-time decision-making in autonomous driving. What to Expect The successful candidate will focus on host-side software and hardware interactions to ensure optimal data transfer and resource management for efficient AI inference on GPUs. Key responsibilities include Developing and optimizing C++ code for efficient data transfer and latency management between the host and GPUs across diverse … vendor platforms. Working with low-level system and memorymanagement techniques to minimize latency and improve real-time inference performance. Utilizing and implementing GPU programming APIs (e.g., CUDA, OpenCL) to ensure high efficiency and compatibility across GPUs. Profiling and debugging system performance using tools like NVIDIA Nsight, Intel VTune, and vendor-specific profilers, identifying bottlenecks and implementing effective … modern C++ standards. Proven experience in GPU programming and optimization, with proficiency in CUDA, OpenCL, or other GPU programming frameworks. Strong knowledge of parallel computing concepts, including data locality, memory access patterns, and synchronization. Proficiency with performance profiling tools and techniques for identifying and resolving system bottlenecks. Experience in system-level programming, including memorymanagement, data alignment More ❯
more reliable with every release What we’re looking for: Strong experience with Python, particularly in embedded or hardware-heavy environments Solid grasp of systems-level concepts: concurrency, networking, memorymanagement Experience working with hardware integrations, serial protocols, or device control Confident debugging in real-world environments (scopes, logs, traces – whatever gets the job done) Bonus if you More ❯
end performance. Lead development of critical path systems including market gateways, matching engines, and order handlers. Tech Stack & Requirements: Core Java (11+), with expertise in low-level programming: threading, memorymanagement, lock-free design. Proven experience with low-latency techniques : Zero GC/off-heap memory Busy-spin/lock-free queues False sharing avoidance GC tuning More ❯
free data structures and event-driven architectures for sub-microsecond latency. Integrate with internal risk, compliance, and analytics systems to ensure safe and compliant trading. Optimise CPU cache usage, memory layout, and instruction pipelines for maximum throughput. Collaborate with traders to prototype and deploy new strategies in production with minimal latency overhead. Technical Requirements: Expert-level C++ (C++17/… with deep understanding of memorymanagement, concurrency (lock-free, wait-free), and template metaprogramming. Proven experience in low-latency trading systems, preferably in ETFs, equities, or delta-one products. Strong knowledge of networking protocols (TCP/UDP, FIX, multicast), kernel bypass, and hardware timestamping. Experience with real-time market data processing, order book construction, and tick-to-trade More ❯
City of London, London, United Kingdom Hybrid / WFH Options
Anson McCade
and LangFlow. Implement retrieval-augmented generation (RAG) pipelines using vector databases like Pinecone, FAISS, Chroma, or PostgreSQL. Fine-tune prompts to optimise performance, reliability, and alignment. Design and implement memory modules for short-term and long-term agent behaviours. Deploy models and orchestrate AI systems on cloud platforms such as AWS Bedrock, Google Vertex AI, and Azure AI Studio. … collaboratively with their teams, Google Premier Cloud Partner experience etc Proven track record designing and deploying agentic and generative AI prototypes. Deep understanding of semantic search, vector databases, and memorymanagement strategies. Familiarity with cloud AI tools, observability platforms, and performance optimisation. This is an opportunity to work at the forefront of AI innovation, where your work will More ❯
Ragas , TruLens , or your own scoring systems Experience tuning open-source models (e.g., Mistral, LLaMA, Falcon) or working with APIs (OpenAI, Anthropic, Cohere) Exposure to agentic systems , tools + memorymanagement, or multi-step reasoning chains Experience in fast-paced, early-stage startup environments 🧪 Why This Role Is Unique You’ll be engineering AI features that ship to More ❯