Introduction
Citi Bike is an initiative started in New York City to allow residents to use publicly accessible bicycles to travel throughout the five boroughs. There are stations throughout NYC, and it has recently expanded to Jersey City, New Jersey as well. Data is constantly collected on users, including travel time, location of bike retrieval and return, as well as some demographic information (age, gender) where available.
This data has been made publicly available, going back to when Citi Bike was first implemented in 2014. Taking an (admittedly) unorthodox perspective, I examined the data to see if there were any distinguishable characteristics among users of a certain age. Since age (or more specifically, birth year) data is collected, it is reasonable to expect that unexpected patterns may emerge.
Data
Using Citi Bike
public system data site, travel data was selected for January 2018. The Citi Bike data sets are dispersed monthly. The choice of month was arbitrary, as there were no reasons to suspect skewing of data among the various months or years (Further analysis may be done to determine if there is an ideal month/year for this analysis). After retrieval, data was filtered into subsets for users born before and after 1980. The year 1980 was chosen based on an examination of the data set. The year 1980 divided the data enough to extract two fairly equal sample sizes.
Methodology
Data was first plotted to determine the mean trip duration for the two samples.