Fig. 1 shows that the cleaned dataset includes only the variables relevant to my research: 'tripduration' and 'usertype'. The 'usertype' column holds a numerical code for the user type: subscribers are represented by 1 and customers by 2.
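The cleaning step described here can be sketched in pandas as follows. This is only an illustration under stated assumptions, not the exact procedure used to produce Fig. 1: the sample rows, the extra 'starttime' column, the raw label spellings "Subscriber" and "Customer", and the names `USERTYPE_CODES` and `clean_trips` are all hypothetical.

```python
import pandas as pd

# Hypothetical raw sample; the real data has more rows and columns.
raw = pd.DataFrame({
    "tripduration": [634, 1547, 178],
    "starttime": ["2015-01-01 00:01", "2015-01-01 00:02", "2015-01-01 00:04"],
    "usertype": ["Subscriber", "Customer", "Subscriber"],
})

# Assumed spellings of the raw labels; the codes 1 and 2 follow the text.
USERTYPE_CODES = {"Subscriber": 1, "Customer": 2}


def clean_trips(df: pd.DataFrame) -> pd.DataFrame:
    """Keep only the two research variables and encode 'usertype' as 1 or 2."""
    out = df[["tripduration", "usertype"]].copy()
    # Labels missing from USERTYPE_CODES would become NaN instead of a code.
    out["usertype"] = out["usertype"].map(USERTYPE_CODES)
    return out


clean = clean_trips(raw)
print(clean)
```

Using an explicit dictionary keeps the coding scheme visible in one place, and any unexpected label shows up as a missing value rather than being silently assigned a code.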