Data to be looked at:
- Yelp Data : Yelp API data for all restaurants that are open at
nighttime. According to the documentation, the API returns a maximum of
150 restaurants for New York. This gives us the names of up to 150
restaurants in Manhattan, along with their cuisines and average Yelp
ratings.
- Instagram hashtag data : The Instagram hashtag data for all the
150 Yelp restaurants for July 2017, at nighttime from 9pm to 2am. This
data will be scraped from an account. Each hashtag is the name of a
restaurant written without any dashes or capital letters, to match how
people usually write hashtags on Instagram.
- Taxi and Limousine Commission Data : The TLC data for pickup and
drop-off locations by taxi zone. All the data will be merged into one
file to look for correlations. This data gives us the number of
incoming and outgoing trips for each taxi zone, identified by its zone
ID.
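
The preparation steps described in the list above could be sketched roughly as follows. This is only a minimal illustration under stated assumptions, not the actual pipeline: the function names, the example restaurant names, and the zone IDs are hypothetical. Dropping spaces and punctuation in addition to dashes is an assumption about how hashtags are usually written, and the 9pm to 2am window is assumed to include 9:00pm and exclude 2:00am.

```python
from collections import Counter
from datetime import datetime


def to_hashtag(name):
    """Lowercase a restaurant name and keep only letters and digits.

    Assumption: besides dashes and capitals, spaces and punctuation
    are dropped too, since hashtags cannot contain them.
    """
    return "".join(ch for ch in name.lower() if ch.isalnum())


def is_night(ts):
    """True if the timestamp is in the assumed 9pm (inclusive) to 2am (exclusive) window."""
    return ts.hour >= 21 or ts.hour < 2


def zone_trip_counts(trips):
    """Count incoming (drop-off) and outgoing (pickup) trips per taxi zone ID.

    `trips` is an iterable of (pickup_zone_id, dropoff_zone_id) pairs.
    """
    trips = list(trips)
    outgoing = Counter(pickup for pickup, _ in trips)
    incoming = Counter(dropoff for _, dropoff in trips)
    zones = sorted(set(outgoing) | set(incoming))
    # Counter returns 0 for zones that never appear on one side.
    return {z: {"incoming": incoming[z], "outgoing": outgoing[z]} for z in zones}


if __name__ == "__main__":
    # Hypothetical inputs, for illustration only.
    print(to_hashtag("Xi'an Famous-Foods"))
    print(is_night(datetime(2017, 7, 4, 23, 30)))
    print(zone_trip_counts([(161, 230), (230, 161), (161, 48)]))
```

Keeping the per-zone counts in a dictionary keyed by zone ID makes the later merge step simpler, because the Yelp and Instagram records can be joined on the same key once each restaurant has been assigned to a taxi zone.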