Data to be examined:
  1. Yelp data: API data for all restaurants that are open at night. According to the documentation, the API returns a maximum of 150 restaurants for New York. This gives us the names of up to 150 restaurants in Manhattan, along with their cuisines and average Yelp ratings.
  2. Instagram hashtag data: Hashtag data for all 150 Yelp restaurants for July 2017, restricted to night-time posts between 9pm and 2am. This data will be scraped from an account, using each restaurant's name as the hashtag. Names are written without dashes or capital letters, to match how people usually write hashtags on Instagram.
  3. Taxi and Limousine Commission (TLC) data: Pickup and drop-off locations by taxi zone. All the data will be merged into one file to look for correlations. This data gives us the number of incoming and outgoing trips for each taxi zone ID.
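
The preprocessing described in items 2 and 3 could be sketched roughly as follows. This is a minimal illustration, not the project's actual code: the function names (`to_hashtag`, `is_night`, `trips_per_zone`) are hypothetical, and it assumes that spaces and punctuation are dropped from hashtags along with dashes, that "July 2017" means the calendar month, and that each trip is available as a (pickup zone ID, drop-off zone ID) pair.

```python
from collections import Counter
from datetime import datetime


def to_hashtag(name):
    """Turn a restaurant name into a hashtag-style string.

    Lowercases the name and keeps only letters and digits. Dropping spaces
    and punctuation (not just dashes) is an assumption of this sketch.
    """
    return "".join(ch for ch in name.lower() if ch.isalnum())


def is_night(ts):
    """Return True if ts falls in July 2017 between 9pm and 2am.

    The window wraps past midnight, so it covers hours 21-23 and 0-1.
    """
    in_july_2017 = ts.year == 2017 and ts.month == 7
    in_night_hours = ts.hour >= 21 or ts.hour < 2
    return in_july_2017 and in_night_hours


def trips_per_zone(trips):
    """Count incoming and outgoing trips per taxi zone ID.

    trips is an iterable of (pickup_zone_id, dropoff_zone_id) pairs.
    A pickup counts as an outgoing trip for its zone, and a drop-off
    counts as an incoming trip for its zone.
    """
    trips = list(trips)
    outgoing = Counter(pickup for pickup, _ in trips)
    incoming = Counter(dropoff for _, dropoff in trips)
    return incoming, outgoing


if __name__ == "__main__":
    # Made-up sample values, for illustration only.
    print(to_hashtag("Joe's-Pizza NYC"))
    print(is_night(datetime(2017, 7, 4, 22, 30)))
    incoming, outgoing = trips_per_zone([(1, 2), (1, 3), (2, 1)])
    print(dict(incoming), dict(outgoing))
```

Counting pickups and drop-offs separately keeps the incoming and outgoing totals for each zone apart, so either one can later be joined against the restaurant data by zone ID.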