to think through client needs and incorporate end-user feedback Strong UI/UX instincts and an eye for visual design Understanding of client-side performance, including rendering optimizations, memorymanagement, and state management Positive attitude, sense of humor and creativity Strong analytical, project leadership and communication skills Team leadership and management skills You should have … a strong interest in web-based software development and additional experience in the financial services technology/asset management space would be a bonus. What we offer: Competitive compensation Challenging, engaging and rewarding work A culture of trust, ownership, responsibility and autonomy in your work An incredible team of smart and mission-driven people to work with Fun working More ❯
firm-wide initiatives, and we are seeking individuals who are motivated to take challenging tasks Team Background The Credit Risk Technology Team is responsible for delivering Counterparty Credit Risk Management software solutions. These tools allow salespeople, traders, operations staff and risk officers to manage Citi's exposure to financial institutions, governments and corporates that trade with Citi. The team … builds and maintains software that provides a complete platform for our users: limit monitoring, exception management, stress testing analysis and escalation workflow capabilities are some key areas of focus. Our users rely on the system for an integrated view of trades, collateral, market data, across many dozens of sources. Ideal Background Participates in architecture governance bodies. Provide technical leadership … automate remediation of recurring issues to improve overall operational stability of software applications and systems. Design & develop scalable micro-services to build a functional component that will support Risk Management functions. Partner with the product management team and other stakeholders in brainstorming sessions to identify solutions. Host sessions for design and code review at multiple levels of the More ❯
routing and execution components that handle market volatility Apply advanced networking principles to minimise round-trip times and network jitter Continuously assess and remove inefficiencies in JVM performance and memorymanagement Strengthen operational resilience through improved monitoring, alerting, and fault tolerance SKILL/EXPERIENCE REQUIRED: Expertise in market-making or algorithmic trading, including deep understanding of order book … environments Proficiency in networking concepts, including TCP/UDP, multicast, and low-level optimizations like DPDK or kernel bypass Mastery of concurrent programming, including lock-free algorithms and shared memory architecture Practical experience with Linux performance tuning, including NUMA balancing and use of huge pages Background in high-frequency trading and real-time market operations Ability to design tools More ❯
and paradigms used to ensure the system is easy to understand, performant, scalable, testable, robust, and observable. The role is a conjunction of technical and managerial roles, with line-management duties, while giving technical direction to a growing team of developers globally. The platform is a Greenfield build using standard modern technologies such as Java, Spring Boot, Kubernetes, Kafka … Experience: Expert knowledge of distributed systems including event-driven architecture; at-least-once messaging; CAP Theorem; horizontal and vertical scaling strategies; massively distributed architectures Expert knowledge of Java, JVM, memorymanagement, garbage collection Thorough understanding of multithreaded environment challenges Expert knowledge of Spring, SpringBoot framework and associated technologies Expert knowledge of test frameworks, such as Junit, Mockito, writing More ❯
various environments, including resource-limited devices and complex multi-modal systems. Your responsibilities include designing robust inference pipelines, establishing performance metrics, and troubleshooting bottlenecks to achieve low-latency, low-memory AI performance in real-world applications. Responsibilities : Design and deploy efficient model serving architectures optimized for diverse environments, including resource-constrained devices. Set and monitor performance targets such as … latency, throughput, and memory usage. Conduct inference testing in simulated and live environments, tracking key performance indicators and documenting results. Prepare high-quality datasets and scenarios for real-world deployment testing, focusing on low-resource devices. Analyze pipeline efficiency, diagnose bottlenecks, and optimize for scalability and reliability. Collaborate with cross-functional teams to integrate optimized frameworks into production, ensuring … related field; PhD preferred, with a strong publication record in AI R&D. Proven experience in kernel and inference optimization on mobile devices, with measurable improvements in latency and memory footprint. Deep understanding of model serving architectures, low-latency techniques, and memorymanagement in resource-constrained environments. Expertise in CPU/GPU kernel development for mobile platforms More ❯
S/4HANA performance tuningABAP, SQL, HANA DB, and application server metrics. Proficient with performance testing tools such as LoadRunner Enterpriseand Apache JMeter. Strong understanding of SQL query optimization, memorymanagement, and workload distributionin SAP HANA. Proven experience with performance trace toolslike ST12, ST05, SAT, HANA Studio PlanViz. Experience in analysing dump logs, database statistics, and job performance More ❯
and w/soft-skills interpersonally An open-minded and collaborative attitude Preferred technical and professional experience Expertise in Java and Scala programming on the JVM Experience with concurrency, memorymanagement and I/O Experience with Linux or other Unix-like systems Experience with distributed databases, DataStax Enterprise or Apache Cassandra in particular Experience with distributed computing More ❯
and w/soft-skills interpersonally An open-minded and collaborative attitude Preferred technical and professional experience Expertise in Java and Scala programming on the JVM Experience with concurrency, memorymanagement and I/O Experience with Linux or other Unix-like systems Experience with distributed databases, DataStax Enterprise or Apache Cassandra in particular Experience with distributed computing More ❯
Conduct rigorous testing and debugging of applications to ensure high-quality, bug-free software. Documentation Prepare detailed documentation for code, APIs, and system architecture. Collaboration Collaborate with the product management team to understand project requirements and deliver software that meets or exceeds user expectations. Optimisation Continuously optimise and improve application performance, taking into account memorymanagement and More ❯
training pipeline, integrate state-of-the-art MLLMs into the auto-labelling pipeline. - Collaborate with Applied Scientists to process massive data, scale machine learning models while optimizing GPU utilization, memorymanagement, and the training workflows (like kernel fusion, mixed-precision training, gradient accumulation, offloading optimizer states, massive parallelization, etc). - Design and maintain large-scale distributed training systems …/Scrum environment to move fast and deliver high quality software. BASIC QUALIFICATIONS - 3+ years of non-internship professional software development experience, including coding standards, code reviews, source control management, build processes, testing, and operations. - 2+ years of non-internship design or architecture (design patterns, reliability and scaling) of new and existing systems experience. - Proficient in Python or related … model training. Contributions to popular open-source LLM frameworks or research publications in top-tier AI conferences, such as CVPR, ECCV, ICCV, ICLR, etc. - Experience in GPU utilization and memory optimization techniques like kernel fusion and custom kernels, mixed precision training using lower precision and dynamic loss scaling, gradient (activation) checkpointing, gradient accumulation, offloading optimizer states, and smart prefetching More ❯
Investigate and resolve production issues as part of ongoing product support. Skills 3+ years of professional experience in software development. Proficient in C++11 or later, with strong understanding of memorymanagement, concurrency, and object-oriented design. Hands-on experience with AngularJS and front-end development (HTML, CSS, JavaScript). Familiarity with REST APIs, JSON, and web service integration. More ❯
Employment Type: Permanent
Salary: £50000 - £60000/annum Medical, pension & more
design the architecture of our next generation of models, deploy them into production and optimize their performance. Monitoring model deployments to anticipate and mitigate system performance issues (disk utilization, memory and CPU usage) Build and enhance tools for monitoring and measuring model accuracy, performance, and data quality Documenting procedures and guides to facilitate knowledge sharing and helping other engineers … to level up through pairing and mentoring Participating in hiring and onboarding new team members; taking on end-to-end project management responsibilities as we grow. Requirements Background in solving complex technical challenges at the intersection of Software Development, and Data/ML Engineering Proficiency with Python and databases in high scale environments. Hands-on experience with message queues … tools like K8s, Docker or others. Experience in designing, building, and maintaining data collection tools, caching/storage systems, queues and pipelines Knowledge of best practices for performance optimization, memorymanagement, model scalability, as well as data storage and quality tuning Strong analytical and troubleshooting skills - methodically decomposing systems to identify bottlenecks, determine root causes, and implement effective More ❯
City of London, London, United Kingdom Hybrid / WFH Options
Anson McCade
and LangFlow. Implement retrieval-augmented generation (RAG) pipelines using vector databases like Pinecone, FAISS, Chroma, or PostgreSQL. Fine-tune prompts to optimise performance, reliability, and alignment. Design and implement memory modules for short-term and long-term agent behaviours. Deploy models and orchestrate AI systems on cloud platforms such as AWS Bedrock, Google Vertex AI, and Azure AI Studio. … LLM frameworks such as LangChain, LangGraph, LangFlow, CrewAI, or Autogen. Proven track record designing and deploying agentic and generative AI prototypes. Deep understanding of semantic search, vector databases, and memorymanagement strategies. Familiarity with cloud AI tools, observability platforms, and performance optimisation. This is an opportunity to work at the forefront of AI innovation, where your work will More ❯
and LangFlow. Implement retrieval-augmented generation (RAG) pipelines using vector databases like Pinecone, FAISS, Chroma, or PostgreSQL. Fine-tune prompts to optimise performance, reliability, and alignment. Design and implement memory modules for short-term and long-term agent behaviours. Deploy models and orchestrate AI systems on cloud platforms such as AWS Bedrock, Google Vertex AI, and Azure AI Studio. … LLM frameworks such as LangChain, LangGraph, LangFlow, CrewAI, or Autogen. Proven track record designing and deploying agentic and generative AI prototypes. Deep understanding of semantic search, vector databases, and memorymanagement strategies. Familiarity with cloud AI tools, observability platforms, and performance optimisation. This is an opportunity to work at the forefront of AI innovation, where your work will More ❯