I created four variables for my analysis:
- Eviction percentage: total evictions per zip code divided by total units per zip code, multiplied by 100.
- Litigations percentage: total litigations per zip code divided by total units per zip code, multiplied by 100.
- Income per person: income divided by total population per zip code.
- Non-white fraction: one minus the ratio of white households to total households per zip code.
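As a rough illustration, the four definitions above could be computed with pandas along the following lines. This is a minimal sketch, not the code used for the analysis: the column names (`evictions`, `units`, `white_households`, and so on) and the toy numbers are assumptions made up for the example.

```python
import pandas as pd

# Toy per-zip-code table; every column name and value here is hypothetical.
df = pd.DataFrame({
    "zipcode": ["10001", "10002"],
    "evictions": [50, 120],
    "litigations": [30, 60],
    "units": [10000, 20000],
    "income": [5.0e8, 6.0e8],
    "population": [20000, 40000],
    "white_households": [6000, 5000],
    "total_households": [10000, 20000],
})


def add_analysis_variables(table: pd.DataFrame) -> pd.DataFrame:
    """Return a copy of the table with the four derived variables added."""
    out = table.copy()
    # Evictions and litigations as a percentage of residential units.
    out["eviction_pct"] = out["evictions"] / out["units"] * 100
    out["litigation_pct"] = out["litigations"] / out["units"] * 100
    # Income divided by total population.
    out["income_per_person"] = out["income"] / out["population"]
    # One minus the share of white households.
    out["nonwhite_fraction"] = 1 - out["white_households"] / out["total_households"]
    return out


result = add_analysis_variables(df)
print(result[["zipcode", "eviction_pct", "litigation_pct",
              "income_per_person", "nonwhite_fraction"]])
```

Note that a zip code with zero units or zero households would produce infinite or missing values in this sketch, which is one reason such rows need separate handling.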
Missing Values
After first merging the eviction data with the zip code shapefile, I merged in the PLUTO data, then income, litigations, and finally the race data. This left me with 183 of the 263 NYC zip codes. Some zip codes had values of 0, or very low numbers, for income or units. I dropped any zip code with income below $500 or with fewer than 100 units. Most significantly, areas like the Rockaways are missing from my datasets, so that section of New York does not appear on my maps.
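The merge-then-filter step might look roughly like the sketch below. It makes several assumptions that the text does not confirm: plain pandas tables stand in for the shapefile and PLUTO data, the merges are inner joins on a hypothetical `zipcode` column (which is one way zip codes absent from any single dataset would drop out), and a zip code is removed if it fails either threshold.

```python
import pandas as pd

# Toy stand-ins for three of the datasets; names and values are hypothetical.
evictions = pd.DataFrame({"zipcode": ["10001", "10002", "10003", "10004"],
                          "evictions": [50, 10, 80, 5]})
units = pd.DataFrame({"zipcode": ["10001", "10002", "10003"],
                      "units": [5000, 50, 8000]})
income = pd.DataFrame({"zipcode": ["10001", "10002", "10003"],
                       "income": [30000, 40000, 0]})


def merge_and_filter(evictions, units, income, min_income=500, min_units=100):
    """Inner-join the tables on zip code, then drop low-income or low-unit rows."""
    merged = (
        evictions
        .merge(units, on="zipcode", how="inner")   # drops zip codes with no unit data
        .merge(income, on="zipcode", how="inner")  # drops zip codes with no income data
    )
    # Keep only zip codes that meet both thresholds.
    keep = (merged["income"] >= min_income) & (merged["units"] >= min_units)
    return merged[keep].reset_index(drop=True)


cleaned = merge_and_filter(evictions, units, income)
print(f"{len(cleaned)} of {len(evictions)} zip codes remain")
```

In this toy example, 10004 is lost in the merge because it has no unit data, 10002 is dropped for having fewer than 100 units, and 10003 is dropped for having zero income.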