Period: 6 months to 11 December 2017

The following table provides summary statistics for contract job vacancies advertised in England with a requirement for PySpark skills. Included is a benchmarking guide to the contractor rates offered in vacancies that have cited PySpark over the 6 months to 11 December 2017 with a comparison to the same period in the previous 2 years.

Note that daily and hourly rates are treated separately in these statistics. When calculating contractor rate percentiles, daily rates are never derived from quoted hourly rates or vice versa.
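
As an illustration of the note above, the following minimal sketch summarises daily and hourly rates as two separate populations, with no conversion between units. The sample adverts, the `rate_summary` helper and the choice of the `inclusive` interpolation method are assumptions made for this example; the source does not state how its percentiles are interpolated.

```python
from statistics import median, quantiles

# Hypothetical sample data: each advert quotes a rate in exactly one unit.
adverts = [
    {"unit": "daily", "rate": 400.0},
    {"unit": "daily", "rate": 500.0},
    {"unit": "daily", "rate": 550.0},
    {"unit": "daily", "rate": 600.0},
    {"unit": "daily", "rate": 675.0},
    {"unit": "hourly", "rate": 35.0},
    {"unit": "hourly", "rate": 45.0},
]


def rate_summary(adverts, unit):
    """Summarise only the rates quoted in `unit`; nothing is converted."""
    rates = sorted(a["rate"] for a in adverts if a["unit"] == unit)
    if not rates:
        return None  # shown as "-" in the tables
    if len(rates) == 1:
        # A single quote is its own median and percentiles.
        p10 = p90 = rates[0]
    else:
        # Nine cut points for deciles: index 0 is the 10th percentile,
        # index 8 is the 90th. The interpolation method is an assumption.
        deciles = quantiles(rates, n=10, method="inclusive")
        p10, p90 = deciles[0], deciles[8]
    return {"count": len(rates), "median": median(rates), "p10": p10, "p90": p90}


daily = rate_summary(adverts, "daily")
hourly = rate_summary(adverts, "hourly")
```

Keeping the two units apart avoids having to assume a number of working hours per day, which is not stated in adverts.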

PySpark - Spark Python API
UK > England

Metric | 6 months to 11 Dec 2017 | Same period 2016 | Same period 2015
Rank | 740 | 848 | 868
Rank change year-on-year | +108 | +20 | -
Contract jobs citing PySpark | 111 | 9 | 2
As % of all contract IT jobs advertised in England | 0.12% | 0.009% | 0.002%
As % of the Libraries, Frameworks & Software Standards category | 0.36% | 0.031% | 0.007%
Number of daily rates quoted | 98 | 4 | 1
England median daily rate | £538 | £650 | £275
Median daily rate % change year-on-year | -17.34% | +136.45% | -
10th Percentile | £400 | £548 | £238
90th Percentile | £675 | £776 | £313
UK median daily rate | £538 | £650 | £275
% change year-on-year | -17.34% | +136.45% | -
Number of hourly rates quoted | 0 | 0 | 0
England median hourly rate | - | - | -
UK median hourly rate | - | - | -

PySpark is in the Software Libraries and Frameworks category. The following table is for comparison with the above and provides summary statistics for all contract job vacancies advertised in England with a requirement for technical specification, industry standards, software libraries and framework skills.

All Software Libraries and Frameworks
England

Metric | 6 months to 11 Dec 2017 | Same period 2016 | Same period 2015
Contract vacancies with a requirement for technical specification, industry standards, software libraries and framework skills | 30756 | 29401 | 30198
As % of all contract IT jobs advertised in England | 32.36% | 30.80% | 29.10%
Number of daily rates quoted | 21926 | 20473 | 21021
England median daily rate | £470 | £425 | £415
Median daily rate % change year-on-year | +10.59% | +2.41% | +3.75%
10th Percentile | £313 | £300 | £278
90th Percentile | £625 | £588 | £588
UK median daily rate | £450 | £425 | £413
% change year-on-year | +5.88% | +3.03% | +3.13%
Number of hourly rates quoted | 355 | 336 | 515
England median hourly rate | £40.00 | £39.00 | £37.50
Median hourly rate % change year-on-year | +2.56% | +4.00% | +7.14%
10th Percentile | £18.20 | £15.00 | £14.25
90th Percentile | £52.50 | £50.00 | £52.37
UK median hourly rate | £40.00 | £39.60 | £37.50
% change year-on-year | +1.01% | +5.60% | +7.14%
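
The year-on-year rows in both tables can be read as simple relative changes. The sketch below shows the arithmetic using figures from the tables; the helper names `pct_change` and `rank_change` are hypothetical, and the assumption is that each change is measured against the same period one year earlier.

```python
def pct_change(current, previous):
    """Relative change from `previous` to `current`, as a percentage to 2 d.p."""
    return round((current - previous) / previous * 100, 2)


def rank_change(current_rank, previous_rank):
    """Positive when the skill has moved up the ranking (a lower rank number)."""
    return previous_rank - current_rank


# England median daily rate, all Software Libraries and Frameworks: £470 vs £425.
libraries_daily = pct_change(470, 425)

# England median hourly rate: £40.00 vs £39.00.
libraries_hourly = pct_change(40.00, 39.00)

# PySpark rank: 740 in 2017 vs 848 in 2016.
pyspark_rank_move = rank_change(740, 848)

# PySpark median daily rate, using the rounded medians £538 and £650.
pyspark_daily = pct_change(538, 650)
```

The first three results match the published +10.59%, +2.56% and +108. The last evaluates to -17.23 rather than the published -17.34%; one possible explanation, not confirmed by the source, is that the published change is computed from medians before they are rounded to whole pounds.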

PySpark
Job Vacancy Trend in England

Job postings citing PySpark as a percentage of all IT jobs advertised in England.

Job vacancy trend for PySpark in England

PySpark
Contractor Daily Rate Trend in England

This chart provides the 3-month moving average for daily rates quoted in contract jobs citing PySpark in England.

Contractor daily rate trend for PySpark in England
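
For readers unfamiliar with the smoothing used in this chart, the following is a minimal sketch of a 3-month moving average. The monthly figures are hypothetical, and a trailing window over monthly values is an assumption; the source does not say whether the window is trailing or centred, or whether it averages monthly medians or individual quoted rates.

```python
def moving_average(values, window=3):
    """Trailing moving average; None until a full window is available."""
    smoothed = []
    for i in range(len(values)):
        if i + 1 < window:
            smoothed.append(None)
        else:
            chunk = values[i + 1 - window : i + 1]
            smoothed.append(sum(chunk) / window)
    return smoothed


# Hypothetical monthly daily-rate figures in pounds, oldest first.
monthly_rates = [500, 550, 600, 520, 540, 560]
trend = moving_average(monthly_rates)
```

Each plotted point then reflects three months of data, which damps the month-to-month swings that are likely when only a handful of rates are quoted.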

PySpark
Contractor Daily Rate Histogram in England

The daily rate distribution of IT jobs citing PySpark in England over the 6 months to 11 December 2017.

Contractor daily rate histogram for PySpark in England

PySpark
Contract Job Locations in England

The table below shows demand and provides a guide to the median contractor rates quoted in IT jobs citing PySpark across England over the 6 months to 11 December 2017. The 'Rank Change' column indicates the change in demand within each location compared with the same 6-month period last year.

Location | Rank Change on Same Period Last Year | Matching Contract IT Job Ads | Median Daily Rate Past 6 Months | Median Daily Rate % Change on Same Period Last Year | Live Job Vacancies
London | +60 | 98 | £550 | -21.43% | 5
South East | - | 8 | £519 | - | 1
North of England | +46 | 3 | £538 | -2.27% | 2
North West | - | 3 | £538 | - | 2
East of England | - | 2 | £466 | - | 1

PySpark
Co-occurring IT Skills in England

For the 6 months to 11 December 2017, IT contractor jobs citing PySpark also mentioned the following skills, listed in order of popularity. Each row gives the rank, the number of co-occurrences, and that number as a proportion of all contract ads across England with a requirement for PySpark.

1 109 (98.20%) Python
2 101 (90.99%) Apache Spark
3 86 (77.48%) Big Data
4 82 (73.87%) Hadoop
5 78 (70.27%) Apache Hive
6 65 (58.56%) Oozie
7 60 (54.05%) HDFS
7 60 (54.05%) Java
8 59 (53.15%) Kafka
8 59 (53.15%) Apache Flume
9 54 (48.65%) Open Source
10 52 (46.85%) DevOps
11 50 (45.05%) Ambari
11 50 (45.05%) PostgreSQL
12 48 (43.24%) Data Engineering
13 45 (40.54%) GitHub
14 42 (37.84%) Hadoop YARN
15 41 (36.94%) Sqoop
16 39 (35.14%) MapReduce
16 39 (35.14%) Tableau
17 36 (32.43%) IBM
18 35 (31.53%) Bluemix
18 35 (31.53%) IBM Cloud
18 35 (31.53%) Jenkins
19 34 (30.63%) Oracle
20 33 (29.73%) SQL
21 30 (27.03%) Data Warehouse
22 26 (23.42%) Business Intelligence
23 25 (22.52%) Shell Script
23 25 (22.52%) Amazon AWS
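
The counts and percentages in the list above can be reproduced in principle as follows. This is a minimal sketch only: the sample adverts and the `co_occurrences` helper are hypothetical, and the tie handling (equal counts share a rank, and the next rank follows on without a gap) is inferred from the ranks shown in the list.

```python
from collections import Counter


def co_occurrences(adverts, target):
    """Rank the skills that appear alongside `target` in the same advert."""
    matching = [skills for skills in adverts if target in skills]
    total = len(matching)
    counts = Counter(s for skills in matching for s in skills if s != target)
    # Highest count first; ties ordered alphabetically for a stable result.
    ordered = sorted(counts.items(), key=lambda kv: (-kv[1], kv[0]))
    rows, rank, previous = [], 0, None
    for skill, n in ordered:
        if n != previous:
            rank += 1
            previous = n
        rows.append((rank, skill, n, round(n / total * 100, 2)))
    return rows


# Hypothetical adverts, each reduced to the set of skills it cites.
adverts = [
    {"PySpark", "Python", "Apache Spark"},
    {"PySpark", "Python", "Hadoop"},
    {"PySpark", "Python", "Apache Spark"},
    {"Java", "Hadoop"},
    {"PySpark", "Apache Spark"},
]
table = co_occurrences(adverts, "PySpark")

# The published Python figure: 109 of the 111 PySpark adverts.
python_share = round(109 / 111 * 100, 2)
```

Here `python_share` evaluates to 98.2, in line with the 98.20% shown for Python.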

PySpark
Co-occurring IT Skills in England by Category

The following tables expand on the table above by listing co-occurrences grouped by category. The same job type, locality and period are covered, with up to 20 co-occurrences shown in each of the following categories:

Application Platforms
1 101 (90.99%) Apache Spark
2 11 (9.91%) Apache
3 9 (8.11%) Apache Pig
4 5 (4.50%) Jupyter
5 4 (3.60%) Apache Airflow
6 1 (0.90%) Apache Solr
6 1 (0.90%) Elasticsearch
6 1 (0.90%) JBoss
6 1 (0.90%) Mule
6 1 (0.90%) Tomcat
6 1 (0.90%) WebSphere
Cloud Services
1 45 (40.54%) GitHub
2 35 (31.53%) Bluemix
2 35 (31.53%) IBM Cloud
3 25 (22.52%) Amazon AWS
4 8 (7.21%) Amazon EMR
4 8 (7.21%) Amazon S3
5 7 (6.31%) AWS Data Pipeline
6 5 (4.50%) Amazon Kinesis
7 4 (3.60%) Amazon EC2
7 4 (3.60%) AWS Lambda
7 4 (3.60%) Microsoft Azure
7 4 (3.60%) Serverless
8 2 (1.80%) Cloud Computing
Communications & Networking
1 3 (2.70%) Network Security
2 2 (1.80%) SSL
3 1 (0.90%) Internet
3 1 (0.90%) SMS
Database & Business Intelligence
1 86 (77.48%) Big Data
2 82 (73.87%) Hadoop
3 78 (70.27%) Apache Hive
4 50 (45.05%) PostgreSQL
5 41 (36.94%) Sqoop
6 30 (27.03%) Data Warehouse
7 15 (13.51%) Oracle Database
7 15 (13.51%) RDBMS
7 15 (13.51%) Relational Database
8 14 (12.61%) HBase
8 14 (12.61%) Oracle ODI
9 10 (9.01%) Impala
10 8 (7.21%) Amazon Redshift
10 8 (7.21%) Geospatial Data
11 6 (5.41%) Pentaho
12 5 (4.50%) Mahout
12 5 (4.50%) MongoDB
12 5 (4.50%) Star Schema
13 4 (3.60%) NoSQL
13 4 (3.60%) SQL Server
Development Applications
1 35 (31.53%) Jenkins
2 11 (9.91%) Git (software)
3 3 (2.70%) JIRA
4 2 (1.80%) JUnit
4 2 (1.80%) TeamCity
5 1 (0.90%) Apache Continuum
5 1 (0.90%) Hudson
5 1 (0.90%) Subversion
General
1 12 (10.81%) Telecoms
2 11 (9.91%) Retail
3 9 (8.11%) Finance
4 6 (5.41%) Marketing
5 5 (4.50%) Electronics
5 5 (4.50%) Manufacturing
6 3 (2.70%) Banking
7 1 (0.90%) Publishing
Job Titles
1 44 (39.64%) Data Engineer
2 36 (32.43%) Big Data Engineer
3 19 (17.12%) Data Scientist
4 16 (14.41%) Developer
5 11 (9.91%) Hadoop Engineer
6 8 (7.21%) Machine Learning Scientist
7 7 (6.31%) Big Data Developer
7 7 (6.31%) Python Developer
8 5 (4.50%) Architect
9 4 (3.60%) Administrator
9 4 (3.60%) Apache Spark Developer
9 4 (3.60%) AWS Engineer
9 4 (3.60%) Senior Data Engineer
10 3 (2.70%) Big Data Architect
10 3 (2.70%) Cloud Administrator
10 3 (2.70%) Cloud Developer
10 3 (2.70%) Data Architect
10 3 (2.70%) Java Engineer
10 3 (2.70%) Senior Data Scientist
11 2 (1.80%) Software Developer
Libraries, Frameworks & Software Standards
1 59 (53.15%) Kafka
2 9 (8.11%) scikit-learn
3 8 (7.21%) Pandas
4 6 (5.41%) Apache Avro
4 6 (5.41%) Keras
4 6 (5.41%) Matplotlib
4 6 (5.41%) Spark Streaming
4 6 (5.41%) TensorFlow
5 5 (4.50%) NumPy
5 5 (4.50%) Seaborn
5 5 (4.50%) Theano
6 4 (3.60%) XML
7 3 (2.70%) JSON
8 2 (1.80%) .NET
8 2 (1.80%) jQuery
8 2 (1.80%) Lucene
8 2 (1.80%) MLlib
8 2 (1.80%) React
8 2 (1.80%) RESTful
8 2 (1.80%) Spring
Miscellaneous
1 12 (10.81%) Algorithms
2 9 (8.11%) Analytical Skills
3 4 (3.60%) User Experience
4 3 (2.70%) Data Centre
5 2 (1.80%) Smart Home
6 1 (0.90%) Computer Science
6 1 (0.90%) Data Structures
6 1 (0.90%) Distributed Denial-of-Service
6 1 (0.90%) Fintech
6 1 (0.90%) Fixed Income
6 1 (0.90%) Greenfield Project
6 1 (0.90%) Management Information System
6 1 (0.90%) Online Banking
6 1 (0.90%) Private Cloud
Operating Systems
1 6 (5.41%) Unix
2 5 (4.50%) Linux
3 2 (1.80%) Android
3 2 (1.80%) Apple iOS
3 2 (1.80%) Windows
4 1 (0.90%) Red Hat Enterprise Linux
Processes & Methodologies
1 54 (48.65%) Open Source
2 52 (46.85%) DevOps
3 48 (43.24%) Data Engineering
4 39 (35.14%) MapReduce
5 26 (23.42%) Business Intelligence
6 20 (18.02%) Machine Learning
7 17 (15.32%) ETL
8 13 (11.71%) Agile Software Development
9 10 (9.01%) Predictive Modelling
9 10 (9.01%) Stakeholder Management
10 8 (7.21%) Data Integration
10 8 (7.21%) Neural Network
11 7 (6.31%) Data Modelling
11 7 (6.31%) Logistic Regression
11 7 (6.31%) Statistics
12 6 (5.41%) Migration
13 5 (4.50%) Bayesian Methods
13 5 (4.50%) Digital Marketing
13 5 (4.50%) Programme Management
13 5 (4.50%) Trend Analysis
Programming Languages
1 109 (98.20%) Python
2 60 (54.05%) Java
3 33 (29.73%) SQL
4 25 (22.52%) Shell Script
5 15 (13.51%) PL/SQL
6 9 (8.11%) Scala
7 8 (7.21%) R
8 2 (1.80%) COBOL
9 1 (0.90%) JavaScript
9 1 (0.90%) T-SQL
Qualifications
1 10 (9.01%) PhD
2 1 (0.90%) CISA
2 1 (0.90%) CISM
2 1 (0.90%) CISSP
2 1 (0.90%) Degree
Quality Assurance & Compliance
1 3 (2.70%) QA
2 1 (0.90%) Data Quality
2 1 (0.90%) FedRAMP
2 1 (0.90%) PCI DSS
System Software
1 60 (54.05%) HDFS
2 59 (53.15%) Apache Flume
3 21 (18.92%) Docker
4 3 (2.70%) Apache ZooKeeper
Systems Management
1 65 (58.56%) Oozie
2 50 (45.05%) Ambari
3 42 (37.84%) Hadoop YARN
4 19 (17.12%) Kubernetes
5 2 (1.80%) Apache Slider
5 2 (1.80%) Control-M
5 2 (1.80%) Puppet
6 1 (0.90%) Ansible
6 1 (0.90%) Kibana
6 1 (0.90%) logstash
6 1 (0.90%) Nagios
Vendors
1 39 (35.14%) Tableau
2 36 (32.43%) IBM
3 34 (30.63%) Oracle
4 20 (18.02%) Cloudera
5 6 (5.41%) MapR
6 5 (4.50%) SAP
7 3 (2.70%) Microsoft
7 3 (2.70%) Omniture
7 3 (2.70%) Palantir
7 3 (2.70%) Salesforce.com
8 2 (1.80%) SaltStack
9 1 (0.90%) Hortonworks
9 1 (0.90%) Red Hat