Deriving adequate sample sizes for ANN-based modelling of real estate valuation tasks by complexity analysis
Research output: Contribution to journal › Research article › Contributed › peer-review
Contributors
Abstract
Property valuation in areas with few transactions on basis of a linear regression fails due to a not sufficient number of purchasing cases. One approach which is enhancing the available data set is to evaluate these purchasing cases together with a neighbouring submarket. However, it leads to non-linearities. Consequently, non-linear models for a cross-submarket real estate valuation are required to obtain reasonable results. We focus in this contribution on non-linear modelling on basis of artificial neural networks (ANN). A prerequisite for these procedures is an adequate sample size. We present a new approach based on the aggregation of submarkets additional to the markets with few transactions at the expense of increasing complexity of the model required. The cross-submarket ANN estimation aims to reach accuracies comparable to local property valuation procedures in a first step and in further consequence to enable a reasonable estimation in areas with few transactions. We introduce an extended Kalman filter (EKF) estimation procedure for the ANN parameters and compare it to the standard optimisation procedure Levenberg Marquardt (LM) as well as to the multiple linear regression. Thus, German spatial and functional submarkets are aggregated. For the spatially aggregated data set, the ANN estimation leads to improved results. The ANN estimation of the functionally aggregated data appears deceptively simple due to too small samples not representing the sampling density. The question arises, what are adequate sample sizes regarding the complexity of the unknown relationship. We purpose a model complexity analysis procedure based on resampling and the structural risk minimisation theory and derive a minimum sample size for the spatially aggregated data. Only for the EKF computations, this minimum sample size is reached due to less variance of the ANN estimations. Generally, the EKF computation leads to a better ANN performance in contrast to LM. Finally, the spatial cross-submarket ANN estimation reaches accuracies of local property valuation procedures.
Details
Original language | English |
---|---|
Article number | 105475 |
Pages (from-to) | 1-16 |
Number of pages | 16 |
Journal | Land Use Policy |
Volume | 107 |
Issue number | 8 |
Publication status | Published - Aug 2021 |
Peer-reviewed | Yes |
External IDs
Scopus | 85105279645 |
---|---|
ORCID | /0000-0001-8962-1505/work/125727846 |
ORCID | /0000-0003-2742-5183/work/142252424 |
Keywords
Keywords
- Adequate sample size, Real estate valuation, Artificial neural network, Complexity analysis