AIS Logo
← Back to Library
Unveiling Location-Specific Price Drivers: A Two-Stage Cluster Analysis for Interpretable House Price Predictions

Unveiling Location-Specific Price Drivers: A Two-Stage Cluster Analysis for Interpretable House Price Predictions

Paul Gümmer, Julian Rosenberger, Mathias Kraus, Patrick Zschech, and Nico Hambauer
This study proposes a novel machine learning approach for house price prediction using a two-stage clustering method on 43,309 German property listings from 2023. The method first groups properties by location and then refines these groups with additional property features, subsequently applying interpretable models like linear regression (LR) or generalized additive models (GAM) to each cluster. This balances predictive accuracy with the ability to understand the model's decision-making process.

Problem Predicting house prices is difficult because of significant variations in local markets. Current methods often use either highly complex 'black-box' models that are accurate but hard to interpret, or overly simplistic models that are interpretable but fail to capture the nuances of different market segments. This creates a trade-off between accuracy and transparency, making it difficult for real estate professionals to get reliable and understandable property valuations.

Outcome - The two-stage clustering approach significantly improved prediction accuracy compared to models without clustering.
- The mean absolute error was reduced by 36% for the Generalized Additive Model (GAM/EBM) and 58% for the Linear Regression (LR) model.
- The method provides deeper, cluster-specific insights into how different features, like construction year and living space, affect property prices in different local markets.
- By segmenting the market, the model reveals that price drivers vary significantly across geographical locations and property types, enhancing market transparency for buyers, sellers, and analysts.
House Pricing, Cluster Analysis, Interpretable Machine Learning, Location-Specific Predictions