Data:
All data for this project can be found on the NYC Open Data Portal. This analysis uses two datasets: the
NYC Parks Capital Tracker data and the
NYC Parks Properties Information. There is limited data available to the public regarding the details of NYC Parks' construction projects. However, these two sets are adequate for this exercise.
NYC Parks Capital Tracker Data -
This set tracks all data related to capital contracts. It includes data related to funding, current phase of construction, proposed timelines for phase completions, as well as actual deadlines for construction. It also includes some information regarding the specific type of park, specifically the park identification number.
NYC Parks Properties Information -
This dataset is an inventory of all recorded elements for a park. It is the base layer for which all information rests. This dataset includes park identification numbers, borough, type of park, classification of park property, political boundary that the park falls into, and a variety of other park features.
The two datasets were merged using the common thread of park identification number. Extracting this from the Capital Tracker dataset required an extra step as some of the information, including the park identification number, was packaged in the data frame within a dictionary, rather than as a callable type such as a float.
The next step was creating new columns that quantified the amount of time between the proposed procurement duration and the actual procurement duration. This was a simple calculation done by subtracting the start date from both the proposed and actual completion dates. The function used to be able to manipulate the dates was: