You can access the data on Kaggle website: https://www.kaggle.com/harlfoxem/housesalesprediction/data. This dataset contains house sale prices for King County, which includes Seattle. It includes homes sold between May 2014 and May 2015. For each house, the data includes some attributes including the date and price the house was sold,  number of bedrooms, bathrooms, square footage of the home, basemen, and lot, number of floors, whether it has a view to waterfront, whether it has been viewed, overall condition and overall grade given to the housing unit, based on King County grading system, year built and renovated, zip code, longitude and latitude, living room area and lot size area in 2015. The problem we will be solving here is to predict the housing price using the labeled data. Hence, this is a supervised learning problem, where we could use a part of the data to build and train our model and a keep a part of the data unseen to the model for test purposes.