ETL processes. Proficiency in Python. Experience with cloud platforms (AWS, Azure, or GCP). Knowledge of data modelling, warehousing, and optimisation. Familiarity with big data frameworks (e.g. Apache Spark, Hadoop). Understanding of data governance, security, and compliance best practices. Strong problem-solving skills and experience working in agile environments. Desirable: Experience with Docker/Kubernetes, streaming data (Kafka …
data modelling, warehousing, and performance optimisation. Proven experience with cloud platforms (AWS, Azure, or GCP) and their data services. Hands-on experience with big data frameworks (e.g. Apache Spark, Hadoop). Strong knowledge of data governance, security, and compliance. Ability to lead technical projects and mentor junior engineers. Excellent problem-solving skills and experience in agile environments. Desirable: Experience …
data modelling tools, data warehousing, ETL processes, and data integration techniques. · Experience with at least one cloud data platform (e.g. AWS, Azure, Google Cloud) and big data technologies (e.g. Hadoop, Spark). · Strong knowledge of data workflow solutions like Azure Data Factory, Apache NiFi, Apache Airflow, etc. · Good knowledge of stream and batch processing solutions like Apache Flink, Apache …
experience within either Flask, Tornado or Django, Docker Experience working with ETL pipelines is desirable e.g. Luigi, Airflow or Argo Experience with big data technologies, such as Apache Spark, Hadoop, Kafka, etc. Data acquisition and development of data sets and improving data quality Preparing data for predictive and prescriptive modelling Reporting tools (e.g. Tableau, PowerBI, Qlik) GDPR and Government …
communication skills We're excited if you have 7+ years of experience delivering multi-tier, highly scalable, distributed web applications Experience working with distributed computing frameworks: Hive/Hadoop, Apache Spark, Kafka, Airflow Working with programming languages Python, Java, SQL. Working on building ETL (Extraction, Transformation and Loading) solutions using PySpark Experience in SQL/NoSQL database design …
Crewe, Cheshire, United Kingdom Hybrid/Remote Options
Manchester Digital
AI model development. Expertise in Python, R, or Julia, with proficiency in pandas, NumPy, SciPy, scikit-learn, TensorFlow, or PyTorch. Experience with SQL, NoSQL, and big data technologies (Spark, Hadoop, Snowflake, Databricks, etc.). Strong background in statistical modelling, probability theory, and mathematical optimization. Experience deploying machine learning models to production (MLOps, Docker, Kubernetes, etc.). Familiarity with AWS …
Manchester, Lancashire, United Kingdom Hybrid/Remote Options
CHEP UK Ltd
such as Python, R, and SQL for data analysis and model development. Experience working with cloud computing platforms including AWS and Azure, and familiarity with distributed computing frameworks like Hadoop and Spark. Deep understanding of supply chain operations and the ability to apply data science methods to solve real-world business problems effectively. Strong foundational knowledge in mathematics and …
Lancashire, North West England, United Kingdom Hybrid/Remote Options
CHEP
such as Python, R, and SQL for data analysis and model development. Experience working with cloud computing platforms including AWS and Azure, and familiarity with distributed computing frameworks like Hadoop and Spark. Deep understanding of supply chain operations and the ability to apply data science methods to solve real-world business problems effectively. Strong foundational knowledge in mathematics and …
Banking/Financial Services domain is a plus. Preferred Qualifications Certifications in Pentaho, Big Data, or Cloud Platforms (AWS/GCP/Azure). Experience with Big Data technologies (Hadoop, Spark) and cloud data services.
and familiarity with data science libraries and frameworks. Strong expertise in statistical modelling, predictive analytics, and machine learning. In-depth understanding of data architecture and big data technologies (e.g., Hadoop, Spark, AWS). Exceptional problem-solving skills and ability to think strategically. Outstanding communication abilities to convey technical concepts to non-technical stakeholders. Experience in managing cross-departmental or …
success. The Skills You’ll Need: Experience in architecture & design and consulting services focused on enterprise solutions, data analytics platforms, lakehouses, data engineering, data processing, data warehousing, ETL, Hadoop & Big Data. Experience in defining and designing data governance, data management, and data security solutions for an enterprise across business verticals Experience with at least one of the …
Python) and other database applications; · Understanding of PC environment and related software, including Microsoft Office applications; · Knowledge of data engineering using data stores including MS SQL Server, Oracle, NoSQL, Hadoop or other distributed data technologies. Experience using data visualization tools is a plus; · Experienced with Excel to aggregate, model, and manage large data sets; · Familiar with Microsoft Power BI …
e.g. MS SQL, Oracle) NoSQL technologies skills (e.g. MongoDB, InfluxDB, Neo4J) Data exchange and processing skills (e.g. ETL, ESB, API) Development (e.g. Python) skills Big data technologies knowledge (e.g. Hadoop stack) Knowledge in NLP (Natural Language Processing) Knowledge in OCR (Optical Character Recognition) Knowledge in Generative AI (Artificial Intelligence) would be advantageous Experience in containerisation technologies (e.g. Docker) would …
utilising strong communication and stakeholder management skills when engaging with customers Significant experience of coding in Python and Scala or Java Experience with big data processing tools such as Hadoop or Spark Cloud experience; GCP specifically in this case, including services such as Cloud Run, Cloud Functions, BigQuery, GCS, Secret Manager, Vertex AI etc. Experience with Terraform Prior experience …
languages (Python, Bash) and programming languages (Java). Hands-on experience with DevOps tools: GitLab, Ansible, Prometheus, Grafana, Nagios, Argo CD, Rancher, Harbor. Deep understanding of big data technologies: Hadoop, Spark, and NoSQL databases. Nice to Have Familiarity with agile methodologies (Scrum or Kanban). Strong problem-solving skills and a collaborative working style. Excellent communication skills, with the …
commercial impact. Understanding of ML Ops vs DevOps and broader software engineering standards. Cloud experience (any platform). Previous mentoring experience. Nice to have: Snowflake or Databricks Spark, PySpark, Hadoop or similar big data tooling BI exposure (PowerBI, Tableau, etc.) Interview Process Video call - high-level overview and initial discussion In-person technical presentation - based on a provided example …
Stevenage, Hertfordshire, South East, United Kingdom Hybrid/Remote Options
MBDA
e.g. MS SQL, Oracle...) NoSQL technologies skills (e.g. MongoDB, InfluxDB, Neo4J...) Data exchange and processing skills (e.g. ETL, ESB, API...) Development (e.g. Python) skills Big data technologies knowledge (e.g. Hadoop stack) Knowledge in NLP (Natural Language Processing) Knowledge in OCR (Optical Character Recognition) Knowledge in Generative AI (Artificial Intelligence) would be advantageous Experience in containerisation technologies (e.g. Docker) would …
Docker and orchestration tools like Kubernetes. Familiarity with Infrastructure as Code (IaC) tools such as Terraform or CloudFormation. Knowledge of data engineering and experience with big data technologies like Hadoop, Spark, or Kafka. Experience with CI/CD pipelines and automation, such as using Jenkins, GitLab, or CircleCI. As an equal opportunities employer, we welcome applications from individuals of …
and social benefits (e.g. UK pension scheme) What do you offer? Strong hands-on experience working with modern Big Data technologies such as Apache Spark, Trino, Apache Kafka, Apache Hadoop, Apache HBase, Apache NiFi, Apache Airflow, OpenSearch Proficiency in cloud-native technologies such as containerization and Kubernetes Strong knowledge of DevOps tools (Terraform, Ansible, ArgoCD, GitOps, etc.) Proficiency in …
Preferred Qualifications Experience converting research studies into tangible real-world changes Knowledge of AWS platforms such as S3, Glue, Athena, Sagemaker Experience with big data technologies such as AWS, Hadoop, Spark, Pig, Hive etc. PhD in Industrial/Organizational Psychology or related field Amazon is an equal opportunities employer. We believe passionately that employing a diverse workforce is central …
Glasgow, Scotland, United Kingdom Hybrid/Remote Options
Caspian One
and cloud-native tooling Tech Stack Programming: Python (Java familiarity is a plus) AWS: S3, Kinesis, Glue, Lambda, Step Functions, SageMaker, and more On-Prem: Managed Kubernetes Platform and Hadoop ecosystem Why This Role is Different Direct Impact: Build AI tools that traders and quants use daily to optimize strategies Creative Freedom: Open collaboration and the chance to bring … ECS, IAM, KMS, API Gateway, Step Functions, MSK, CloudFormation) Passion for being hands-on while enabling and growing a small engineering team Nice to Have Experience with on-prem Hadoop and Kubernetes Familiarity with AWS cost management and optimisation tools Knowledge of front-office developer workflows in financial services
Data Solutions in Mission-Critical areas. WE NEED THE BIG DATA ENGINEER TO HAVE.... Current DV clearance - Standard or Enhanced Must have experience with big data tools such as Hadoop, Cloudera or Elasticsearch Experience with Palantir Foundry is preferred but not essential Experience working in an Agile Scrum environment Experience in design, development, test and integration of software IT …/DEVELOPPED VETTING/DEVELOPED VETTED/DEEP VETTING/DEEP VETTED/SC CLEARED/SC CLEARANCE/SECURITY CLEARED/SECURITY CLEARANCE/NIFI/CLOUDERA/HADOOP/KAFKA/ELASTIC SEARCH