Picking A Model To Predict Future House Prices in the US
Real Estate has been fluctuating throughout the years.

Picking A Model To Predict Future House Prices in the US

Ladies and gentlemen, I have been well aware on the alleged great migration within the United States of America.

I've witnessed the Northeastern exodus to Florida.

I've seen videos of the California exoduses to Texas, Nevada, and Arizona.

The main reasons being are increased state taxes, increased costs of living, as well as state politics from their old places of living to evade.

Now, in recent years, the cost of living and other fees have risen everywhere, especially in the hottest destinations to move to:

  • Texas: house prices have risen dramatically due to high demand and low supply and labor; mainly in the Austin, Dallas-Fort Worth, & Houston metroplexes.
  • Georgia: house prices have risen in the Atlanta metroplex
  • Florida: house prices have risen statewide, especially when you factor in the global warming conditions there.

Around a month ago, I took the time to do some digging on this ordeal using data science and machine learning to pick which model is best suited for predicting future house prices in the country. Here's what I have found so far:

No alt text provided for this image
Horizantal - Year , Vertical - Housing Price Value

If you take a look at the graph to the left, you'll notice the value of American homes skyrocket over the past decade.

No alt text provided for this image
Horizantal - Year, Vertical - Adjacent Housing Price Value



If you take a look at this graph to the left, you'll notice the value of adjacent American homes also skyrocket over the past decade. Please take note that the adjacent housing price is the price of a neighboring home right next to the one that you're targeting.


I've tested ten machine learning models to see which one is best suitable for prediction.

For the first six models, I've utilized the following:

  • Logistic Regression
  • LASSO
  • Elastic Net
  • K-Nearest Neighbors
  • Classification & Regression Trees
  • Supporting Vector Machines

No alt text provided for this image
Algorithm Comparison on the first six models

Now let's scale them.

No alt text provided for this image
First six models scaled

All of the ten models are measured by a negative mean square error (MSE). The model with the least negative MSE will be the one that we'll use for a prediction. Now let's test out the remaining four ensemble models:

  • AdaBoost
  • Gradient Boosting Method
  • Random Forest
  • Extra Trees

No alt text provided for this image
All ensemble models scaled

Based on all of the data shown, the Extra Trees model will be used as a predictor.

For more information on this dataset, please visit my GitHub page below.

#inflation #datascience #machinelearning #realestate

要查看或添加评论,请登录

Robert Paul的更多文章

  • The Blowout Brush Bananza

    The Blowout Brush Bananza

    Ladies and gentlemen, I anticipate what you are probably thinking when you start reading this article, "Why is…

  • Playing Around With Deep Learning: The Iceland Version

    Playing Around With Deep Learning: The Iceland Version

    I hope everyone is having a happy weekend so far preparing for Football Sunday: The FIFA World Cup Final and Week 15 of…

  • Can FTX Recover?

    Can FTX Recover?

    Ladies and gentlemen, I took the time to run some machine learning models of the dataset of the FTT coin from the last…

  • Digging deep into the collapse of FTX

    Digging deep into the collapse of FTX

    Ladies and gentlemen, I want to write a quick excerpt based on the findings that I have found numbers wise. Below is a…

  • Ego + Hubris = Denial = Arrogance

    Ego + Hubris = Denial = Arrogance

    Ladies and gentlemen, if you were not aware of the cryptocurrency news by now, FTX, the company that handles and…

  • Wine Quality Dataset

    Wine Quality Dataset

    (Courtesy: M Yasser H of Kaggle (https://www.kaggle.

  • Sample Housing Market Problem

    Sample Housing Market Problem

    (Courtesy: M Yasser H of Kaggle (https://www.kaggle.

  • AMC Stock Data & Its History

    AMC Stock Data & Its History

    (Courtesy: Arpit Verma of Kaggle) https://www.kaggle.

  • Online Shoppers Purchasing Intention Dataset

    Online Shoppers Purchasing Intention Dataset

    (Courtesy: Akash Patel of Kaggle) Ladies and gentlemen, I did a thorough analysis on a dataset of multiple vectors…

社区洞察

其他会员也浏览了