Name: Srikanth
github ID: srikanth261
NYU ID: sn2495
PUI Extra credit project
Problem Description:
Predict the number of violent crimes, and specifically the number of murders, in metro areas using publicly available crime and census data and a statistical model.
Data:
The primary data sources are the crime statistics reported on the city data website and the US census data.
Analysis:
I am interested in understanding the factors that influence crime rates in cities, and I want to compare data across cities to identify the variables that affect crime.
Step 1: Scrape the data from the websites listed above.
Step 2: Clean the data. This might take significant effort, depending on the quality of the data on the websites. Predictors such as population, gender ratio, and poverty level need to be derived from the scraped raw data.
Step 3: Explore the data to see which variables might correlate with the crime rate.
Step 4: Build linear and polynomial regression models to obtain a good prediction model for city crime rates.
Step 5: Draw conclusions.
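As a rough illustration of Steps 2 and 4, the sketch below derives two per-city predictors from raw counts and then fits a linear and a quadratic least-squares model. Everything in it is an assumption made for illustration: the city records are made-up numbers (not real crime or census data), and the field names and the helpers `per_capita`, `fit_polynomial`, `predict`, and `r_squared` are hypothetical. It uses only the Python standard library so that it stays self-contained; the actual notebook would more likely rely on a numerical library.

```python
# Hypothetical, made-up records for illustration only (not real data).
cities = [
    {"population": 600_000, "below_poverty": 30_000, "violent_crimes": 1_800},
    {"population": 500_000, "below_poverty": 50_000, "violent_crimes": 2_000},
    {"population": 1_000_000, "below_poverty": 150_000, "violent_crimes": 5_500},
    {"population": 250_000, "below_poverty": 50_000, "violent_crimes": 1_900},
    {"population": 800_000, "below_poverty": 200_000, "violent_crimes": 8_000},
    {"population": 400_000, "below_poverty": 120_000, "violent_crimes": 5_200},
]


def per_capita(count, population, per=100_000):
    """Convert a raw count into a rate per `per` residents."""
    return count * per / population


def fit_polynomial(xs, ys, degree):
    """Least-squares polynomial fit; returns coefficients [c0, c1, ...]."""
    n = degree + 1
    # Normal equations A c = b with A[i][j] = sum(x^(i+j)), b[i] = sum(y * x^i).
    a = [[sum(x ** (i + j) for x in xs) for j in range(n)] for i in range(n)]
    b = [sum(y * x ** i for x, y in zip(xs, ys)) for i in range(n)]
    # Gaussian elimination with partial pivoting.
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(a[r][col]))
        a[col], a[pivot] = a[pivot], a[col]
        b[col], b[pivot] = b[pivot], b[col]
        for row in range(col + 1, n):
            factor = a[row][col] / a[col][col]
            for k in range(col, n):
                a[row][k] -= factor * a[col][k]
            b[row] -= factor * b[col]
    # Back substitution.
    coeffs = [0.0] * n
    for i in reversed(range(n)):
        tail = sum(a[i][j] * coeffs[j] for j in range(i + 1, n))
        coeffs[i] = (b[i] - tail) / a[i][i]
    return coeffs


def predict(coeffs, x):
    """Evaluate the fitted polynomial at x."""
    return sum(c * x ** i for i, c in enumerate(coeffs))


def r_squared(coeffs, xs, ys):
    """Coefficient of determination of the fit on (xs, ys)."""
    mean_y = sum(ys) / len(ys)
    ss_res = sum((y - predict(coeffs, x)) ** 2 for x, y in zip(xs, ys))
    ss_tot = sum((y - mean_y) ** 2 for y in ys)
    return 1 - ss_res / ss_tot


# Step 2: derive predictors and the target from the raw counts.
xs = [per_capita(c["below_poverty"], c["population"], per=100) for c in cities]
ys = [per_capita(c["violent_crimes"], c["population"]) for c in cities]

# Step 4: fit a linear (degree 1) and a quadratic (degree 2) model.
linear = fit_polynomial(xs, ys, degree=1)
quadratic = fit_polynomial(xs, ys, degree=2)

print("linear R^2:   ", round(r_squared(linear, xs, ys), 4))
print("quadratic R^2:", round(r_squared(quadratic, xs, ys), 4))
```

On the same training data the quadratic fit can never score a lower R^2 than the linear one, so the comparison in Step 4 should be made on held-out cities rather than on the cities used for fitting.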
Deliverables:
Below are the expected deliverables:
- An ipynb notebook showing the work/code
- A slide deck summarizing the approach and findings
- An algorithm that can be used by city agencies to predict crime in major metropolitan cities in the USA