building complex relationship graphs with trillions of nodes and edges excite you? The AWS Resource Explorer team owns and operates highly scalable, highly available distributedsystems that are used by all AWS customers and internal service teams alike. We are pioneers in areas such as identity, access management … resource management, security and providing massively scaled, distributed platform capabilities to all of AWS. We enable our customers to control and secure the most sensitive data on the internet. We are building some of the largest and most complex distributedsystems in the world, and we need … Identity features such as access management and sign on, cryptography, console, builder & developer tools, and even projects like automating all of our contractual billing systems, AWS Platform is always innovating with the customer in mind. The AWS Platform team sustains over 750 million transactions per second. Security is the More ❯
will elevate engineering standards, inspire engineers and enable continuous improvements. You are hands on and know what it takes to build highly available, scalable, distributedsystems and you cherish every opportunity to share your experience and knowledge with the engineering community. You have an operator mindset. You understand … providing guidance and assistance in the career development of others. WHAT YOU’LL BRING: At least 12+ years’ experience in designing and developing complex distributedsystems with at least 5+ years of experience contributing to large scale projects as an architect or as a key individual contributor. Expert … level experience in C++ is required. Experience developing and evolving distributedsystems (e.g., consensus algorithms, networking, fault tolerance, observability). Experience with blockchain technology and protocols, and traditional financial systems and concepts. Experience crafting or contributing to the multi-year engineering roadmap of open source software. Experience More ❯
worldwide impact in robotics while solving challenges at the edge of robotics research? Our team in Amazon Robotics builds high-performance, real-time robotic systems that can perceive, learn, and act intelligently alongside humans - at Amazon scale. Our mission is to enable robots to interact safely, efficiently, and fluently … are seeking an experienced Robotics Control Engineer to join our team. In this role, you will leverage your experience and insights gained from delivering systems to help lead us in developing products that are both innovative and scalable. Working with a world-class team of experts in motion planning … Work closely with hardware and software engineering team members to drive scalable, real-time implementations. Partner closely with hardware engineering team members on developing systems from prototyping to production level. Launch, test, and debug our systems both in the lab and in the field. Develop tools and best More ❯
with experienced cross-disciplinary Amazonians to conceive, design, and bring to market innovative products and services. Design and build innovative technologies in a large distributed computing environment and help lead fundamental changes in the industry. Create solutions to run predictions on distributedsystems with exposure to innovative … technologies at incredible scale and speed. Build distributed storage, index, and query systems that are scalable, fault-tolerant, low cost, and easy to manage/use. Work in an agile environment to deliver high quality software. BASIC QUALIFICATIONS Graduated less than 24 months ago or about to complete … fields at time of application. Knowledge of Computer Science fundamentals. Programming experience in C or Java/Rust. Knowledge in databases. Experience with Operating systems and Kernel Internals, Rust programming. PREFERRED QUALIFICATIONS Previous technical internship(s) if applicable. Experience with distributed, multi-tiered systems, algorithms, and relational More ❯
with experienced cross-disciplinary Amazonians to conceive, design, and bring to market innovative products and services. Design and build innovative technologies in a large distributed computing environment and help lead fundamental changes in the industry. Create solutions to run predictions on distributedsystems with exposure to innovative … technologies at incredible scale and speed. Build distributed storage, index, and query systems that are scalable, fault-tolerant, low cost, and easy to manage/use. Work in an agile environment to deliver high quality software. BASIC QUALIFICATIONS Graduated less than 24 months ago or about to complete … fields at time of application. Knowledge of Computer Science fundamentals. Programming experience in C or Java/Rust. Knowledge in databases. Experience with Operating systems and Kernel Internals, Rust programming. PREFERRED QUALIFICATIONS Previous technical internship(s) if applicable. Experience with distributed, multi-tiered systems, algorithms, and relational More ❯
you will collaborate closely with product management and engineering, serving as a key advocate for Confluent's platform. This role demands expertise in Kafka, distributedsystems, and pre-sales engineering, while engaging with cross-functional teams to drive product success and customer outcomes. Ideal candidates may have backgrounds … Customers with Real-Time Architectures Understand customer challenges with traditional Data Warehouses, Data Lakes, and Batch Analytics workflows, and guide them toward real-time, distributed architectures using Kafka, Flink, Kafka Streams, and modern ETL/ELT frameworks. Help customers optimize their data platforms by focusing on early-stage data … and cost efficiency. Provide Technical Expertise Assist customers and sales teams in designing, deploying, and optimizing real-time data streaming platforms, integrating Kafka with distributed processing, and ensuring alignment with business goals. Architect solutions to unify operational and analytical workloads, enabling a data mesh or streaming-first architecture. Partner More ❯
Client's Security Team Overview The client's security team is the central engine driving data and systems security at the company, supporting it and all of its family of apps. The org is responsible for inhibiting malicious actors from compromising our environment, detecting, and responding to them before … teams or conducting Information Security consulting engagements Experience in writing custom scripts in Python/C++ and PHP/Hack Experience building large-scale distributedsystems or similar experience Experience in assessing security deficiencies in first-party/internal information systems and recommending mitigating controls Knowledge of … evaluating systems architectural designs, data-flow diagrams and technical security implementations, particularly in the context of access management in different geographical locations. Experience with developing security reporting and recommendations that are meaningful, defensible and actionable for a variety of audiences Ability to manage competing priorities and simultaneous projects in More ❯
Client’s Security Team Overview The client’s security team is the central engine driving data and systems security at the company, supporting it and all of its family of apps. The org is responsible for inhibiting malicious actors from compromising our environment, detecting, and responding to them before … teams or conducting Information Security consulting engagements Experience in writing custom scripts in Python/C++ and PHP/Hack Experience building large-scale distributedsystems or similar experience Experience in assessing security deficiencies in first-party/internal information systems and recommending mitigating controls Knowledge of … evaluating systems architectural designs, data-flow diagrams and technical security implementations, particularly in the context of access management in different geographical locations. Experience with developing security reporting and recommendations that are meaningful, defensible and actionable for a variety of audiences Ability to manage competing priorities and simultaneous projects in More ❯
Rate: £550 per day (umbrella) Our client, a forward-thinking organisation, is looking for a passionate and skilled DevOps Engineer to join their dynamic Systems Engineering team. Reporting to the Head of Systems Engineering, you will play a vital role in designing, implementing, and managing both cloud and … on-premise infrastructures, tooling, and systems that support our development teams. Key Responsibilities: Infrastructure Design & Deployment : Architect and deploy scalable, high-performance systems that align with the organisation's long-term operational and business objectives. Performance Monitoring : Continuously monitor, analyse, and enhance the reliability of large-scale distributedsystems to ensure seamless uptime and operational efficiency. Automation Solutions : Develop automation solutions for configuration management, system provisioning, and deployment, significantly streamlining operations and minimising manual processes. Security Compliance : Implement and maintain robust security protocols, ensuring that systems comply with regulatory standards and best practises, including data More ❯
in one or more programming languages. 5 years of experience with data structures or algorithms. 3 years of experience in designing, analyzing, and troubleshooting distributedsystems, and 2 years of experience leading projects and providing technical leadership. Experience in SRE or incident management/response environments. Preferred qualifications … Experience working in computing, distributedsystems, storage, or networking. Experience in telemetry systems, incident and risk management. Expertise in designing, analyzing, and troubleshooting large-scale distributed systems. Ability to debug, optimize code, and to automate routine tasks. Excellent problem-solving approach, with verbal and written communication … skills. About the job Site Reliability Engineering (SRE) combines software and systems engineering to build and run large-scale, massively distributed, fault-tolerant systems. SRE ensures that Google Cloud's services-both our internally critical and our externally-visible systems-have reliability, uptime appropriate to customer's More ❯
in one or more programming languages. 5 years of experience with data structures or algorithms. 3 years of experience in designing, analyzing, and troubleshooting distributedsystems, and 2 years of experience leading projects and providing technical leadership. Preferred Qualifications: Experience working in computing, distributedsystems, storage … or networking. Expertise in designing, analyzing, and troubleshooting large-scale distributed systems. Ability to debug, optimize code, and to automate routine tasks. Systematic problem-solving approach, coupled with effective verbal and written communication skills. About the Job Site Reliability Engineering (SRE) combines software and systems engineering to build … and run large-scale, massively distributed, fault-tolerant systems. SRE ensures that Google Cloud's services-both our internally critical and our externally-visible systems-have reliability, uptime appropriate to customer's needs and a fast rate of improvement. Additionally, SREs will keep an ever-watchful eye on More ❯
Join Team Purple. Join IFS. Job Description We are looking for a technical leader who is passionate about building scalable, resilient, and high-performance distributed systems. As the Lead Software Engineer, you will play a pivotal role in the design, development, and deployment of microservices architectures running on Kubernetes … Lead the architecture, deployment, and management of containerized applications using Kubernetes, ensuring they are highly available, scalable, and resilient. System Scalability : Ensure that the systems are designed for horizontal scalability and can handle large volumes of traffic by effectively utilizing Kubernetes and container orchestration tools. Continuous Improvement : Drive the … adoption of best practices and improve existing microservices, systems, and workflows to enhance system performance, reliability, and efficiency. Mentorship & Leadership : Provide leadership and mentorship to engineering teams, helping to develop junior and mid-level engineers by fostering a culture of learning and knowledge sharing. Collaboration : Work closely with product More ❯
Apache Pulsar , Apache Airflow , Temporal , and Apache Flink , sharing knowledge and suggesting improvements. Documentation: Contribute to clear and concise documentation for software, processes, and systems to ensure team alignment and knowledge sharing. Your Qualifications: Experience: Professional experience in Python development or related software engineering roles. Python Proficiency: Strong knowledge … Django , Flask , or FastAPI . Database Management: Solid experience with relational databases like PostgreSQL or MySQL and familiarity with NoSQL databases like Redis . DistributedSystems: A basic understanding of distributedsystems and microservices architecture, as well as cloud-based infrastructure (AWS, GCP, Azure), is a … Tools: Familiarity with containerization ( Docker ) and infrastructure automation tools like Terraform or Ansible . Real-time Data Streaming: Experience with Apache Pulsar or similar systems for real-time messaging and stream processing is a plus. Data Engineering: Experience with Apache Spark , Databricks , or similar big data platforms for processing More ❯
Apache Pulsar , Apache Airflow , Temporal , and Apache Flink , sharing knowledge and suggesting improvements. Documentation: Contribute to clear and concise documentation for software, processes, and systems to ensure team alignment and knowledge sharing. Your Qualifications: Experience: Professional experience in Python development or related software engineering roles. Python Proficiency: Strong knowledge … Django , Flask , or FastAPI . Database Management: Solid experience with relational databases like PostgreSQL or MySQL and familiarity with NoSQL databases like Redis . DistributedSystems: A basic understanding of distributedsystems and microservices architecture, as well as cloud-based infrastructure (AWS, GCP, Azure), is a … Tools: Familiarity with containerization ( Docker ) and infrastructure automation tools like Terraform or Ansible . Real-time Data Streaming: Experience with Apache Pulsar or similar systems for real-time messaging and stream processing is a plus. Data Engineering: Experience with Apache Spark , Databricks , or similar big data platforms for processing More ❯
Responsibilities: Collaborative engineering: Work within a larger team to rapidly develop proof-of-concept prototypes to validate research ideas and integrate them into production systems and infrastructure Performance Analysis: Conduct in-depth profiling and tuning of operating systems and large-scale distributedsystems, leveraging heterogeneous hardware … to do the job: Required: Bachelor's or Master's degree in Computer Science or a related technical field. A solid background in operating systems and/or distributedsystems and/or ML systems. Excellent programming skills, master of at least one language, such as C … communication and teamwork skills. Be comfortable with research methodology. Desired: Familiarity with current LLM architectures (e.g. Llama3, DeepSeek V3) Familiarity with production LLM serving systems and inference optimizations (e.g. VLLM) Experience with accelerator programming (e.g. CUDA, Triton) and communication libraries (e.g. NCCL More ❯
and builder tooling in AWS, including support for customers who require specialised security solutions for their cloud services. We are looking for an experienced Systems Engineering and Operations Leader to contribute to the growth of our organisation within Amazon Web Services. Key job responsibilities You love to work with … these services. Drive root cause analysis and coach others on performing them. Lead local development efforts to improve operational performance. Drive continual improvement in systems operations through the development of new tooling and automation. Define and drive projects and report on their progress. Meet with customers to develop relationships … and at home, there's nothing we can't achieve. BASIC QUALIFICATIONS - Relevant hands-on engineering and administrative experience in networking, network services, operating systems - Experience as a systems engineering and operations leader for an Internet service or leading edge IT organisation - Demonstrated success building and leading teams More ❯
TASTING - Data protection in the electricity grid distributed control infrastructure Post-doc Supervisor: Mathilde Arnaud (CEA - University Paris-Saclay) Starting Date: January 2025 (24 months) Location: Palaiseau (in Paris area) Context This offer is part of TASTING1, a four-year project that has received state funding managed by the … National Research Agency under the France 2030 program. This project aims to address the main challenges related to the modernization and security of electrical systems with an emphasis on the cloud/edge continuum. Its structure responds to the PEPR TASE call for "Technological solutions for the digitization of … intelligent energy systems," co-led by CEA and CNRS. The project focuses on scientific challenges surrounding ICT infrastructure, which is considered a crucial element in driving the significant transformations expected in energy infrastructures over the next decades. Seven academic laboratories are involved in TASTING: G2Elab, GeePs, IRIT, CEA-LIST More ❯
the future of cloud database technology? Join Amazon Aurora's Storage Fabric team in Bangalore as a Senior Manager and lead a breakthrough in distributed storage systems. This unique role offers a blend of technical innovation and site leadership, perfect for visionaries eager to reshape the cloud database landscape. … directly impact millions of global customers, pushing the boundaries of what's possible in cloud computing. As a technical leader, you'll solve complex distributedsystems challenges and mentor top-tier talent. As a site leader, you'll elevate Aurora Bangalore's status as a key innovation center … engineers in developing large-scale database or storage solutions, demonstrating a proven track record in managing complex, high-impact projects. A strong foundation in distributedsystems is essential, coupled with the ability to drive the team towards resolving intricate operational challenges. The right individual excels in cultivating a More ❯
to optimizing the cost of our network, and forecasting future capacity needs. NTE mission is to distribute quality traffic data to our customers through systems that provide visibility into the traffic utilization of the Amazon network. We are looking for an exceptional Software Development Engineer (SDE) to build and … automate our software systems that leverage data streaming solutions for devising the next generation of network monitoring systems. The ideal candidate will share our excitement about cloud computing and will be passionate about delivering high quality services spanning full SDLC lifecycle. You should have good knowledge of distributedsystems with design and implementation experience, as well as the ability to lead and coach engineers. You will be customer centric and enjoy working in a fast-paced environment that requires excellent technical, business and communication skills. NTE team is part of AWS Infrastructure Services which owns the More ❯
to optimizing the cost of our network, and forecasting future capacity needs. NTE mission is to distribute quality traffic data to our customers through systems that provide visibility into the traffic utilization of the Amazon network. We are looking for an exceptional Software Development Engineer (SDE) to build and … automate our software systems that leverage data streaming solutions for devising the next generation of network monitoring systems. The ideal candidate will share our excitement about cloud computing and will be passionate about delivering high quality services spanning full SDLC lifecycle. You should have good knowledge of distributedsystems with design and implementation experience, as well as the ability to lead and coach engineers. You will be customer centric and enjoy working in a fast-paced environment that requires excellent technical, business and communication skills. NTE team is part of AWS Infrastructure Services which owns the More ❯
invest in Rust. For front-end development we prefer React and Flutter. Golang is an essential language for our engineering teams, who build the systems that deliver Ubuntu to the world. From our software distribution systems, to those which build and test every possible kind of open source … on every architecture, from our systems management tools to our distributedsystems operations R&D, we count on Golang for its tasteful concurrency and developer ecosystem. Juju, Livepatch, LXD, MAAS, Microk8s, Snapd, Ubuntu Core, Ubuntu Pro, and many more Canonical offerings include Golang components. We also want … and enterprises around the world. We expect the highest engineering standards and strong motivation to get things done well in a fully remote and distributed environment. These roles require extensive personal experience with Linux - the more different versions of Linux the better! Location : we have open roles for Golang More ❯
Data Pipelines and API Optimisation, you’ll be central to our platform's performance and scalability. You’ll design, build, and iterate on the systems that handle data ingestion, transformation, and delivery - ensuring they are scalable, reliable, and performant. You should have a strong background in backend engineering, particularly … around distributedsystems, database optimisation, and API development. You’ll work with technologies like JavaScript and TypeScript to build robust services that meet high standards for performance and reliability. This role is ideal for someone experienced, self-driven, and excited about leading backend projects involving large-scale data … infrastructure and mentor other engineers as we scale. Skills, Knowledge, and Expertise Experience 5+ years in Backend Software Engineering, with a focus on complex systems, data pipelines, and API development. Expertise with backend languages such as JavaScript/TypeScript (Node.js); additional experience with Python, Go, or Java is a More ❯
Data Pipelines and API Optimisation, you’ll be central to our platform's performance and scalability. You’ll design, build, and iterate on the systems that handle data ingestion, transformation, and delivery - ensuring they are scalable, reliable, and performant. You should have a strong background in backend engineering, particularly … around distributedsystems, database optimisation, and API development. You’ll work with technologies like JavaScript and TypeScript to build robust services that meet high standards for performance and reliability. This role is ideal for someone experienced, self-driven, and excited about leading backend projects involving large-scale data … infrastructure and mentor other engineers as we scale. Skills, Knowledge, and Expertise Experience 5+ years in Backend Software Engineering, with a focus on complex systems, data pipelines, and API development. Expertise with backend languages such as JavaScript/TypeScript (Node.js); additional experience with Python, Go, or Java is a More ❯
Lead Software Engineer | Sustainable Cloud Native Platform️ | DistributedSystems Up to £120,000 per annum plus 20% bonus plus stock London office ️Golang, Rust, C, Kubernetes, Linux, Cloud Native, Distributed systems... Are you a Lead Software Engineer with considerable cloud/infrastructure experience? Looking to join a … experience working with Golang, Rust or C/C++ Experience working on software infrastructure or building cloud native software Experience working with high scale, distributedsystems Understanding of systems design and how different components interact If this sounds like you, feel free to apply and I can More ❯
Optimize Cloud Resources: Collaborate with development teams to optimize cloud resources and enhance system reliability. Monitoring and Alerting: Develop and maintain monitoring and alerting systems to proactively identify and resolve issues affecting the reliability of our writing solutions. Post-Mortem Analyses: Conduct post-mortem analyses of system failures to … identify root causes and implement preventive measures. Security and Compliance: Ensure the security and compliance of our systems, adhering to industry standards and regulations. Business Continuity: Develop and maintain business continuity plans to ensure system resilience. Troubleshooting and Performance Tuning: Utilize strong troubleshooting skills, log analysis, and performance tuning … and tools like Git, GitHub Actions, Jira, and Confluence. Programming Languages: Strong proficiency in programming languages such as Python, Bash, Java, Go, and Scala. DistributedSystems: Well-developed understanding of distributedsystems, cloud computing, and containerization technologies. Troubleshooting and Debugging: Excellent problem-solving and debugging skills. More ❯