The analysis is finding the citibike usage for weekdays and weekends. But there is no such column that has the day of the week. The day of the week therefore is extracted from the start time column and a column named dayofweek is added to the existing dataframe. Data cleaning is performed to get rid of unwanted columns. The only columns that are retianed and which are relevant for analysis are tripduration and day of week.
Methodology:
I have started by defining my null and alternate hypothesis which are as follows:
1. Null and Alternate hypothesis
The null hypothesis for citi bike usage is:
H0 : Average trip duration during weekends is same or less than weekdays
H0: (Avg. Trip Duration)weekends <= ( Avg. Trip Duration )weekdays
The alternate hypothesis is:
H1: Average trip duration during weekdays is more than weekends
H1: ( Avg. Trip Duration )weekends  >  ( Avg. Trip Duration )weekdays
The trip duration by weekday is as follows: