Period: 6 months to 23 February 2018

The following table provides summary statistics for contract job vacancies advertised in England with a requirement for PySpark skills. Included is a benchmarking guide to the contractor rates offered in vacancies that have cited PySpark over the 6 months to 23 February 2018 with a comparison to the same period in the previous 2 years.

Note that daily and hourly rates are treated separately in these statistics. When calculating contractor rate percentiles, daily rates are never derived from quoted hourly rates or vice versa.
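
As a rough illustration of the rule above, the sketch below summarises daily and hourly rates in separate groups and never converts one into the other. The sample ads, the `summarise` helper, and the choice of Python's `statistics.quantiles` with the inclusive method are assumptions made for this example; the source does not say how its percentiles are interpolated.

```python
from statistics import median, quantiles

# Hypothetical sample ads (illustrative values only, not taken from the tables).
ads = [
    {"rate": 400, "unit": "day"},
    {"rate": 450, "unit": "day"},
    {"rate": 500, "unit": "day"},
    {"rate": 550, "unit": "day"},
    {"rate": 600, "unit": "day"},
    {"rate": 650, "unit": "day"},
    {"rate": 700, "unit": "day"},
    {"rate": 40, "unit": "hour"},
]


def summarise(ads, unit):
    """Summarise rates quoted in one unit only; no day/hour conversion."""
    rates = sorted(ad["rate"] for ad in ads if ad["unit"] == unit)
    if not rates:
        return None  # corresponds to a "-" cell in the tables
    summary = {"count": len(rates), "median": median(rates), "p10": None, "p90": None}
    if len(rates) >= 2:  # quantiles() needs at least two data points
        deciles = quantiles(rates, n=10, method="inclusive")
        summary["p10"], summary["p90"] = deciles[0], deciles[-1]
    return summary


daily = summarise(ads, "day")
hourly = summarise(ads, "hour")
print(daily)
print(hourly)
```

Filtering by unit before any statistic is computed keeps a single hourly quote from distorting the daily percentiles, which is the behaviour the note describes.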

PySpark - Spark Python API (UK > England)

| Metric | 6 months to 23 Feb 2018 | Same period 2017 | Same period 2016 |
| --- | --- | --- | --- |
| Rank | 726 | 810 | 851 |
| Rank change year-on-year | +84 | +41 | - |
| Contract jobs citing PySpark | 121 | 16 | 1 |
| As % of all contract IT jobs advertised in England | 0.13% | 0.018% | 0.001% |
| As % of the Libraries, Frameworks & Software Standards category | 0.40% | 0.057% | 0.003% |
| Number of daily rates quoted | 110 | 17 | 1 |
| England median daily rate | £550 | £525 | £275 |
| England median daily rate % change year-on-year | +4.76% | +90.91% | - |
| Daily rate 10th percentile | £411 | £481 | £238 |
| Daily rate 90th percentile | £741 | £690 | £313 |
| UK median daily rate | £550 | £525 | £275 |
| UK median daily rate % change year-on-year | +4.76% | +90.91% | - |
| Number of hourly rates quoted | 0 | 0 | 0 |
| England median hourly rate | - | - | - |
| UK median hourly rate | - | - | - |
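
The year-on-year figures in the table above can be reproduced from its own rows. The sketch below is a minimal check; the helper names are hypothetical, and the rank-change formula (previous rank minus current rank) is an assumption that happens to match the published values.

```python
def pct_change(current, previous):
    """Percentage change from the previous value to the current one."""
    return (current - previous) / previous * 100


def rank_change(current_rank, previous_rank):
    """Positive when the skill has moved up the ranking (smaller rank number)."""
    return previous_rank - current_rank


# Values copied from the table above, keyed by the year the period ends in.
median_daily_rate = {2018: 550, 2017: 525, 2016: 275}
rank = {2018: 726, 2017: 810, 2016: 851}

for year in (2018, 2017):
    rate_delta = pct_change(median_daily_rate[year], median_daily_rate[year - 1])
    rank_delta = rank_change(rank[year], rank[year - 1])
    print(year, f"{rate_delta:+.2f}%", f"{rank_delta:+d}")
# 2018 +4.76% +84
# 2017 +90.91% +41
```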

PySpark is in the Software Libraries and Frameworks category. The following table is for comparison with the above and provides summary statistics for all contract job vacancies advertised in England with a requirement for technical specification, industry standards, software libraries and framework skills.

All Software Libraries and Frameworks (England)

| Metric | 6 months to 23 Feb 2018 | Same period 2017 | Same period 2016 |
| --- | --- | --- | --- |
| Contract vacancies with a requirement for technical specification, industry standards, software libraries and framework skills | 30,118 | 28,294 | 29,111 |
| As % of all contract IT jobs advertised in England | 32.32% | 31.81% | 29.30% |
| Number of daily rates quoted | 21,643 | 19,709 | 20,345 |
| England median daily rate | £475 | £435 | £425 |
| England median daily rate % change year-on-year | +9.20% | +2.35% | +6.25% |
| Daily rate 10th percentile | £313 | £300 | £283 |
| Daily rate 90th percentile | £631 | £600 | £594 |
| UK median daily rate | £463 | £425 | £415 |
| UK median daily rate % change year-on-year | +8.82% | +2.41% | +3.75% |
| Number of hourly rates quoted | 327 | 324 | 510 |
| England median hourly rate | £40.31 | £40.00 | £38.50 |
| England median hourly rate % change year-on-year | +0.76% | +3.90% | +10.00% |
| Hourly rate 10th percentile | £20.00 | £15.75 | £16.23 |
| Hourly rate 90th percentile | £56.36 | £52.26 | £56.58 |
| UK median hourly rate | £40.31 | £40.00 | £38.50 |
| UK median hourly rate % change year-on-year | +0.76% | +3.90% | +10.00% |
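
The category totals make it possible to cross-check the "As % of the Libraries, Frameworks & Software Standards category" row of the PySpark table. The sketch below assumes that share is simply PySpark job count divided by the category job count for the same period; the `share` helper is a hypothetical name.

```python
# Job counts copied from the two tables, keyed by the year the period ends in.
pyspark_jobs = {2018: 121, 2017: 16, 2016: 1}
category_jobs = {2018: 30118, 2017: 28294, 2016: 29111}


def share(part, whole):
    """Return part as a percentage of whole."""
    return part / whole * 100


shares = {year: share(pyspark_jobs[year], category_jobs[year]) for year in pyspark_jobs}
for year, value in shares.items():
    print(year, f"{value:.3f}%")
# 2018 0.402%  (table: 0.40%)
# 2017 0.057%  (table: 0.057%)
# 2016 0.003%  (table: 0.003%)
```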

PySpark
Job Vacancy Trend in England

Job postings citing PySpark as a percentage of all IT jobs advertised in England.

[Chart: Job vacancy trend for PySpark in England]

PySpark
Contractor Daily Rate Trend in England

This chart provides the 3-month moving average for daily rates quoted in contract jobs citing PySpark in England.

[Chart: Contractor daily rate trend for PySpark in England]
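
For readers unfamiliar with the smoothing used in this chart, a 3-month moving average can be sketched as follows. The monthly values are hypothetical placeholders, and a simple trailing average over complete windows is assumed; the source does not state whether its average is trailing or centred.

```python
def moving_average(values, window=3):
    """Trailing moving average; emits one value per complete window."""
    return [
        sum(values[i - window + 1 : i + 1]) / window
        for i in range(window - 1, len(values))
    ]


# Hypothetical monthly daily-rate figures (not taken from the chart).
monthly_rates = [500, 520, 540, 560]
print(moving_average(monthly_rates))  # [520.0, 540.0]
```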

PySpark
Contractor Daily Rate Histogram in England

The daily rate distribution of IT jobs citing PySpark in England over the 6 months to 23 February 2018.

[Chart: Contractor daily rate histogram for PySpark in England]
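
A histogram like this one groups quoted daily rates into fixed-width bands and counts the ads in each band. The sketch below assumes £50 bands and uses hypothetical rates; the band width used for the original chart is not given in the source.

```python
from collections import Counter


def rate_histogram(rates, band_width=50):
    """Count rates per band, keyed by the lower edge of each band."""
    bands = Counter((rate // band_width) * band_width for rate in rates)
    return dict(sorted(bands.items()))


# Hypothetical daily rates in pounds (illustrative only).
sample_rates = [411, 450, 525, 550, 550, 741]
for lower, count in rate_histogram(sample_rates).items():
    print(f"£{lower}-£{lower + 49}: {'#' * count}")
```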

PySpark
Contract Job Locations in England

The table below shows demand and provides a guide to the median contractor rates quoted in IT jobs citing PySpark in England over the 6 months to 23 February 2018. The 'Rank Change' column indicates the change in demand within each location compared with the same 6-month period last year.

| Location | Rank Change on Same Period Last Year | Matching Contract IT Job Ads | Median Daily Rate Past 6 Months | Median Daily Rate % Change on Same Period Last Year | Live Job Vacancies |
| --- | --- | --- | --- | --- | --- |
| London | +50 | 102 | £550 | -8.33% | 9 |
| South East | - | 11 | £519 | - | |
| North of England | +38 | 6 | £550 | - | |
| North West | - | 6 | £550 | - | |
| East of England | - | 2 | £466 | - | |

PySpark
Co-occurring IT Skills in England

For the 6 months to 23 February 2018, IT contractor jobs citing PySpark also mentioned the following skills, listed in order of popularity. The figures show the absolute number of co-occurrences and their proportion of all contract job ads in England with a requirement for PySpark.

1 116 (95.87%) Python
2 108 (89.26%) Apache Spark
3 79 (65.29%) Hadoop
4 76 (62.81%) Big Data
5 67 (55.37%) Apache Hive
6 51 (42.15%) Data Science
7 46 (38.02%) Oozie
8 45 (37.19%) Java
9 44 (36.36%) SQL
10 43 (35.54%) Machine Learning
11 42 (34.71%) Kafka
12 41 (33.88%) HDFS
12 41 (33.88%) Apache Flume
13 39 (32.23%) Open Source
14 37 (30.58%) PostgreSQL
14 37 (30.58%) DevOps
14 37 (30.58%) Data Engineering
15 35 (28.93%) Ambari
16 31 (25.62%) MapReduce
17 29 (23.97%) GitHub
17 29 (23.97%) Business Intelligence
18 28 (23.14%) Data Warehouse
19 27 (22.31%) Shell Script
20 26 (21.49%) Scala
21 23 (19.01%) Tableau
21 23 (19.01%) Oracle
21 23 (19.01%) Hadoop YARN
22 22 (18.18%) Sqoop
22 22 (18.18%) Amazon AWS
23 21 (17.36%) Apache
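
The percentages and tied ranks in the list above follow from the raw counts and the 121 contract jobs citing PySpark. The sketch below reproduces the first 14 rows under the assumption that ties share a rank and the next distinct count takes the next rank (dense ranking); the function name is hypothetical.

```python
TOTAL_PYSPARK_JOBS = 121  # contract jobs citing PySpark in the period

# Counts copied from the first 14 rows of the list above.
counts = {
    "Python": 116, "Apache Spark": 108, "Hadoop": 79, "Big Data": 76,
    "Apache Hive": 67, "Data Science": 51, "Oozie": 46, "Java": 45,
    "SQL": 44, "Machine Learning": 43, "Kafka": 42, "HDFS": 41,
    "Apache Flume": 41, "Open Source": 39,
}


def rank_cooccurrences(counts, total):
    """Return (rank, count, percent, skill) rows using dense ranking."""
    rows, rank, previous = [], 0, None
    for skill, n in sorted(counts.items(), key=lambda item: -item[1]):
        if n != previous:  # only a new count advances the rank
            rank += 1
            previous = n
        rows.append((rank, n, round(n / total * 100, 2), skill))
    return rows


rows = rank_cooccurrences(counts, TOTAL_PYSPARK_JOBS)
for rank, n, pct, skill in rows:
    print(f"{rank} {n} ({pct:.2f}%) {skill}")
```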

PySpark
Co-occurring IT Skills in England by Category

The following tables expand on the table above by listing co-occurrences grouped by category. The same job type, locality and period are covered, with up to 20 co-occurrences shown in each of the following categories:

Application Platforms
1 108 (89.26%) Apache Spark
2 21 (17.36%) Apache
3 18 (14.88%) Apache Pig
4 16 (13.22%) Jupyter
5 3 (2.48%) Apache Airflow
Cloud Services
1 29 (23.97%) GitHub
2 22 (18.18%) Amazon AWS
3 20 (16.53%) Bluemix
3 20 (16.53%) IBM Cloud
4 10 (8.26%) Amazon EMR
4 10 (8.26%) Amazon S3
5 7 (5.79%) Amazon Kinesis
5 7 (5.79%) AWS Data Pipeline
5 7 (5.79%) Microsoft Azure
6 5 (4.13%) Amazon EC2
6 5 (4.13%) AWS Lambda
6 5 (4.13%) Serverless
7 1 (0.83%) Amazon CloudWatch
7 1 (0.83%) Cloud Computing
7 1 (0.83%) SaaS
Communications & Networking
1 1 (0.83%) Internet
1 1 (0.83%) SMS
Database & Business Intelligence
1 79 (65.29%) Hadoop
2 76 (62.81%) Big Data
3 67 (55.37%) Apache Hive
4 37 (30.58%) PostgreSQL
5 28 (23.14%) Data Warehouse
6 22 (18.18%) Sqoop
7 20 (16.53%) HBase
8 16 (13.22%) Mahout
8 16 (13.22%) MongoDB
9 15 (12.40%) Oracle Database
9 15 (12.40%) RDBMS
9 15 (12.40%) Relational Database
10 13 (10.74%) Geospatial Data
11 9 (7.44%) Amazon Redshift
12 6 (4.96%) Oracle ODI
13 5 (4.13%) OLAP
13 5 (4.13%) Star Schema
14 4 (3.31%) Impala
14 4 (3.31%) Pentaho
14 4 (3.31%) SQL Server
Development Applications
1 18 (14.88%) Jenkins
2 12 (9.92%) Git (software)
3 5 (4.13%) JIRA
3 5 (4.13%) TeamCity
4 1 (0.83%) JUnit
General
1 15 (12.40%) Finance
1 15 (12.40%) Retail
2 13 (10.74%) Telecoms
3 6 (4.96%) Electronics
3 6 (4.96%) Manufacturing
3 6 (4.96%) Marketing
4 3 (2.48%) Advertising
5 1 (0.83%) Banking
Job Titles
1 42 (34.71%) Data Engineer
2 33 (27.27%) Data Scientist
3 32 (26.45%) Big Data Engineer
4 17 (14.05%) Developer
5 11 (9.09%) Hadoop Engineer
6 8 (6.61%) Machine Learning Scientist
7 7 (5.79%) Machine Learning Engineer
7 7 (5.79%) Python Developer
8 5 (4.13%) Architect
8 5 (4.13%) Big Data Developer
8 5 (4.13%) Scala Developer
8 5 (4.13%) Senior Data Engineer
8 5 (4.13%) Software Developer
9 4 (3.31%) AWS Engineer
9 4 (3.31%) Big Data Architect
9 4 (3.31%) Data Architect
9 4 (3.31%) Senior Data Scientist
10 3 (2.48%) Apache Spark Developer
10 3 (2.48%) Java Engineer
11 2 (1.65%) Cloud Developer
Libraries, Frameworks & Software Standards
1 42 (34.71%) Kafka
2 18 (14.88%) Pandas
2 18 (14.88%) scikit-learn
3 16 (13.22%) Matplotlib
3 16 (13.22%) NumPy
3 16 (13.22%) Seaborn
4 12 (9.92%) Keras
4 12 (9.92%) TensorFlow
5 5 (4.13%) Spark Streaming
5 5 (4.13%) Theano
6 3 (2.48%) Apache Avro
7 2 (1.65%) JSON
7 2 (1.65%) React
7 2 (1.65%) XML
8 1 (0.83%) jQuery
8 1 (0.83%) Lucene
8 1 (0.83%) MLlib
Miscellaneous
1 12 (9.92%) Algorithms
2 8 (6.61%) Computer Science
3 5 (4.13%) Analytical Skills
4 4 (3.31%) User Experience
5 3 (2.48%) Data Centre
6 2 (1.65%) Smart Home
7 1 (0.83%) Fintech
7 1 (0.83%) Fixed Income
7 1 (0.83%) Greenfield Project
7 1 (0.83%) Online Banking
7 1 (0.83%) Social Media
Operating Systems
1 7 (5.79%) Linux
2 4 (3.31%) Unix
Processes & Methodologies
1 51 (42.15%) Data Science
2 43 (35.54%) Machine Learning
3 39 (32.23%) Open Source
4 37 (30.58%) Data Engineering
4 37 (30.58%) DevOps
5 31 (25.62%) MapReduce
6 29 (23.97%) Business Intelligence
7 18 (14.88%) ETL
7 18 (14.88%) Logistic Regression
8 17 (14.05%) Statistics
9 16 (13.22%) Bayesian Methods
9 16 (13.22%) Linear Regression
9 16 (13.22%) Trend Analysis
10 11 (9.09%) Agile Software Development
10 11 (9.09%) Deep Learning
11 10 (8.26%) Predictive Modelling
11 10 (8.26%) Stakeholder Management
12 9 (7.44%) Data Integration
12 9 (7.44%) Mathematics
13 8 (6.61%) Data Modelling
Programming Languages
1 116 (95.87%) Python
2 45 (37.19%) Java
3 44 (36.36%) SQL
4 27 (22.31%) Shell Script
5 26 (21.49%) Scala
6 15 (12.40%) PL/SQL
7 9 (7.44%) R
8 1 (0.83%) Bash Shell
Qualifications
1 11 (9.09%) PhD
2 4 (3.31%) Degree
Quality Assurance & Compliance
1 3 (2.48%) QA
System Software
1 41 (33.88%) Apache Flume
1 41 (33.88%) HDFS
2 16 (13.22%) Docker
3 1 (0.83%) Apache ZooKeeper
Systems Management
1 46 (38.02%) Oozie
2 35 (28.93%) Ambari
3 23 (19.01%) Hadoop YARN
4 11 (9.09%) Kubernetes
5 1 (0.83%) Apache Slider
Vendors
1 23 (19.01%) Oracle
1 23 (19.01%) Tableau
2 20 (16.53%) IBM
3 16 (13.22%) MapR
4 15 (12.40%) Cloudera
5 6 (4.96%) SAP
6 3 (2.48%) Omniture
6 3 (2.48%) Salesforce.com
7 2 (1.65%) Microsoft
8 1 (0.83%) Palantir