Period
to 15 December 2017

The following table provides summary statistics for contract job vacancies advertised in the City of London with a requirement for PySpark skills. Included is a benchmarking guide to the contractor rates offered in vacancies that have cited PySpark over the 6 months to 15 December 2017 with a comparison to the same period in the previous 2 years.

Note that daily and hourly rates are treated separately in these statistics. When calculating contractor rate percentiles, daily rates are never derived from quoted hourly rates or vice versa.

PySpark - Spark Python API
Central London > City of London
6 months to
15 Dec 2017
Same period 2016 Same period 2015
Rank 222 - 256
Rank change year-on-year - +256 -
Contract jobs citing PySpark 33 0 1
As % of all contract IT jobs advertised in the City of London 0.46% - 0.012%
As % of the Libraries, Frameworks & Software Standards category 1.22% - 0.038%
Number of daily rates quoted 30 0 0
City of London median daily rate £563 - -
Central London median daily rate £563 - -
Number of hourly rates quoted 0 0 0
City of London median hourly rate - - -
Central London median hourly rate - - -

PySpark is in the Software Libraries and Frameworks category. The following table is for comparison with the above and provides summary statistics for all contract job vacancies advertised in the City of London with a requirement for technical specification, industry standards, software libraries and framework skills.

All Software Libraries and Frameworks
City of London
Contract vacancies with a requirement for technical specification, industry standards, software libraries and framework skills 2696 2339 2621
As % of all contract IT jobs advertised in the City of London 37.48% 35.77% 31.15%
Number of daily rates quoted 2281 1973 2185
City of London median daily rate £513 £475 £475
Median daily rate % change year-on-year +7.89% - +5.56%
10th Percentile £363 £325 £313
90th Percentile £675 £650 £638
Central London median daily rate £500 £475 £460
% change year-on-year +5.26% +3.26% +8.24%
Number of hourly rates quoted 8 4 10
City of London median hourly rate £25.79 £35.00 £38.75
Median hourly rate % change year-on-year -26.32% -9.68% +27.43%
10th Percentile £19.33 £19.00 £19.98
90th Percentile £41.25 £56.31 £68.88
Central London median hourly rate £21.50 £35.00 £38.75
% change year-on-year -38.57% -9.68% +27.43%

PySpark
Job Vacancy Trend in the City of London

Job postings citing PySpark as a percentage of all IT jobs advertised in the City of London.

Job vacancy trend for PySpark in the City of London

PySpark
Contractor Daily Rate Trend in the City of London

This chart provides the 3-month moving average for daily rates quoted in contract jobs citing PySpark in the City of London.

Contractor daily rate trend for PySpark in the City of London

PySpark
Contractor Daily Rate Histogram in the City of London

The daily rate distribution of IT jobs citing PySpark in the City of London over the 6 months to 15 December 2017.

Contractor daily rate histogram for PySpark in the City of London

For the 6 months to 15 December 2017, IT contractor jobs citing PySpark also mentioned the following skills in order of popularity. The figures indicate the number co-occurrences and its proportion to all contract ads across the City of London region with a requirement for PySpark.

1 33 (100.00%) Python
2 31 (93.94%) Hadoop
2 31 (93.94%) Apache Hive
3 30 (90.91%) Apache Flume
3 30 (90.91%) Oozie
3 30 (90.91%) HDFS
4 29 (87.88%) Apache Spark
4 29 (87.88%) Big Data
5 28 (84.85%) Kafka
6 26 (78.79%) Java
6 26 (78.79%) Oracle
7 25 (75.76%) Open Source
7 25 (75.76%) Data Engineering
8 24 (72.73%) DevOps
8 24 (72.73%) Ambari
8 24 (72.73%) PostgreSQL
9 16 (48.48%) Amazon AWS
9 16 (48.48%) GitHub
9 16 (48.48%) Tableau
9 16 (48.48%) Sqoop
9 16 (48.48%) Hadoop YARN
10 14 (42.42%) Docker
10 14 (42.42%) Kubernetes
10 14 (42.42%) Bluemix
10 14 (42.42%) IBM Cloud
10 14 (42.42%) Jenkins
10 14 (42.42%) Oracle ODI
10 14 (42.42%) MapReduce
10 14 (42.42%) Shell Script
10 14 (42.42%) IBM

PySpark
Co-occurring IT Skills in the City of London by Category

The follow tables expand on the table above by listing co-occurrences grouped by category. The same job type, locality and period is covered with up to 20 co-occurrences shown in each of the following categories:

Application Platforms
1 29 (87.88%) Apache Spark
2 2 (6.06%) Apache
2 2 (6.06%) Apache Pig
Cloud Services
1 16 (48.48%) Amazon AWS
1 16 (48.48%) GitHub
2 14 (42.42%) Bluemix
2 14 (42.42%) IBM Cloud
3 2 (6.06%) Amazon EMR
3 2 (6.06%) Amazon S3
3 2 (6.06%) AWS Data Pipeline
Communications & Networking
1 2 (6.06%) Network Security
1 2 (6.06%) SSL
Database & Business Intelligence
1 31 (93.94%) Apache Hive
1 31 (93.94%) Hadoop
2 29 (87.88%) Big Data
3 24 (72.73%) PostgreSQL
4 16 (48.48%) Sqoop
5 14 (42.42%) Oracle ODI
6 12 (36.36%) Data Warehouse
7 10 (30.30%) Oracle Database
7 10 (30.30%) RDBMS
7 10 (30.30%) Relational Database
8 4 (12.12%) HBase
9 3 (9.09%) Impala
10 2 (6.06%) Amazon RDS
10 2 (6.06%) Amazon Redshift
10 2 (6.06%) Apache Cassandra
10 2 (6.06%) Cloudera CDH
10 2 (6.06%) NoSQL
10 2 (6.06%) Pentaho
10 2 (6.06%) Power BI
10 2 (6.06%) SQLite
Development Applications
1 14 (42.42%) Jenkins
General
1 1 (3.03%) Retail
Job Titles
1 4 (12.12%) Data Engineer
2 2 (6.06%) Administrator
2 2 (6.06%) Cloud Administrator
3 1 (3.03%) Architect
3 1 (3.03%) Big Data Architect
3 1 (3.03%) Data Architect
3 1 (3.03%) Data Scientist
Libraries, Frameworks & Software Standards
1 28 (84.85%) Kafka
2 2 (6.06%) .NET
2 2 (6.06%) scikit-learn
3 1 (3.03%) Matplotlib
3 1 (3.03%) Pandas
3 1 (3.03%) SparkR
Miscellaneous
1 4 (12.12%) Analytical Skills
2 2 (6.06%) Data Centre
3 1 (3.03%) Algorithms
3 1 (3.03%) Computer Science
3 1 (3.03%) Data Structures
3 1 (3.03%) Management Information System
3 1 (3.03%) Smart Home
Operating Systems
1 4 (12.12%) Unix
2 2 (6.06%) Android
2 2 (6.06%) Apple iOS
2 2 (6.06%) Windows
Processes & Methodologies
1 25 (75.76%) Data Engineering
1 25 (75.76%) Open Source
2 24 (72.73%) DevOps
3 14 (42.42%) MapReduce
4 12 (36.36%) Business Intelligence
5 3 (9.09%) Agile Software Development
5 3 (9.09%) Analytics
5 3 (9.09%) Machine Learning
6 2 (6.06%) Collaborative Working
6 2 (6.06%) Configuration Management
6 2 (6.06%) Continuous Improvement
6 2 (6.06%) CRM
6 2 (6.06%) Data Modelling
6 2 (6.06%) Data Validation
6 2 (6.06%) ETL
6 2 (6.06%) Logistic Regression
6 2 (6.06%) Neural Network
6 2 (6.06%) Statistics
6 2 (6.06%) Validation
7 1 (3.03%) Mathematics
Programming Languages
1 33 (100.00%) Python
2 26 (78.79%) Java
3 14 (42.42%) Shell Script
4 13 (39.39%) SQL
5 10 (30.30%) PL/SQL
6 2 (6.06%) COBOL
6 2 (6.06%) Scala
Qualifications
1 1 (3.03%) Degree
Quality Assurance & Compliance
1 2 (6.06%) QA
2 1 (3.03%) Data Quality
System Software
1 30 (90.91%) Apache Flume
1 30 (90.91%) HDFS
2 14 (42.42%) Docker
3 2 (6.06%) Apache ZooKeeper
Systems Management
1 30 (90.91%) Oozie
2 24 (72.73%) Ambari
3 16 (48.48%) Hadoop YARN
4 14 (42.42%) Kubernetes
5 2 (6.06%) Control-M
5 2 (6.06%) Puppet
Vendors
1 26 (78.79%) Oracle
2 16 (48.48%) Tableau
3 14 (42.42%) IBM
4 2 (6.06%) Cloudera
4 2 (6.06%) Omniture
4 2 (6.06%) Palantir
4 2 (6.06%) Salesforce.com
4 2 (6.06%) SaltStack