The Information Value of Property Description
Abstract
This paper employs a ML-Hedonic approach to quantify the value of uniqueness, a type of soft" information embedded in real estate advertisements. We rst propose an unsupervised learning algorithm to quantify levels of semantic deviation ("uniqueness") in descriptions, the textual portions of real estate advertisements. We then estimated the impact of description uniqueness on real estate transaction outcomes using linear hedonic pricing models. The results indicate textual data disseminate information that numerical data cannot capture, and property descriptions effectively narrow the information gap between structured real estate data and the houses by conveying "soft" information about unique house features. A one standard deviation (0.08) increase in description uniqueness compared to neighboring properties leads to a 5.6% increase in property sale prices and a 2.3{day delay in the closing time, controlling for house characteristics, transaction circumstances, and agent unobservables. This paper provides theoretical and empirical insights on how to utilize the emerging Machine Learningtools in economic research.