data.
The January 2016 dataset comprises 509,478 trips (rows), while the July 2016 dataset comprises 1,380,110 trips (rows). We download the datasets from the AWS cloud and unzip the files into a separate PUIDATA directory, which keeps the data apart from the Python notebook code.
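The download-and-unzip step can be sketched roughly as follows. This is a minimal illustration, not the notebook's actual code: the helper name `fetch_and_unzip` is hypothetical, the archive URL must be supplied by the caller, and the assumption that PUIDATA is exposed as an environment variable is ours.

```python
import os
import urllib.request
import zipfile
from pathlib import Path


def fetch_and_unzip(url, data_dir):
    """Download a zip archive into data_dir and extract it there.

    Hypothetical helper: `url` is whatever archive location you use,
    and `data_dir` is the directory that holds the data (for example
    the PUIDATA directory). Returns the paths of the extracted files.
    """
    data_dir = Path(data_dir)
    data_dir.mkdir(parents=True, exist_ok=True)

    # Name the local archive after the last component of the URL.
    zip_path = data_dir / url.rsplit("/", 1)[-1]

    # Skip the download if the archive is already on disk.
    if not zip_path.exists():
        urllib.request.urlretrieve(url, str(zip_path))

    # Extract next to the archive, keeping data out of the code directory.
    with zipfile.ZipFile(zip_path) as zf:
        zf.extractall(data_dir)
        return [data_dir / name for name in zf.namelist()]
```

Assuming PUIDATA is set in the environment, a call such as `fetch_and_unzip(url, os.getenv("PUIDATA"))` would be made once per monthly archive. Checking for an existing archive before downloading avoids re-fetching large files each time the notebook is rerun.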