Deriving adequate sample sizes for ANN-based modelling of real estate valuation tasks by complexity analysis

Sabine Horvath; Matthias Soot; Sebastian Zaddach; Hans Neuner; Alexandra Weitkamp

doi:10.1016/j.landusepol.2021.105475

Deriving adequate sample sizes for ANN-based modelling of real estate valuation tasks by complexity analysis

Research output: Contribution to journal › Research article › Contributed › peer-review

Contributors

Sabine Horvath - , Chair of Land Management (Author)
Matthias Soot - , Chair of Land Management (Author)
Sebastian Zaddach - (Author)
Hans Neuner - (Author)
Alexandra Weitkamp - , Chair of Land Management (Author)

Abstract

Property valuation in areas with few transactions on basis of a linear regression fails due to a not sufficient number of purchasing cases. One approach which is enhancing the available data set is to evaluate these purchasing cases together with a neighbouring submarket. However, it leads to non-linearities. Consequently, non-linear models for a cross-submarket real estate valuation are required to obtain reasonable results. We focus in this contribution on non-linear modelling on basis of artificial neural networks (ANN). A prerequisite for these procedures is an adequate sample size. We present a new approach based on the aggregation of submarkets additional to the markets with few transactions at the expense of increasing complexity of the model required. The cross-submarket ANN estimation aims to reach accuracies comparable to local property valuation procedures in a first step and in further consequence to enable a reasonable estimation in areas with few transactions. We introduce an extended Kalman filter (EKF) estimation procedure for the ANN parameters and compare it to the standard optimisation procedure Levenberg Marquardt (LM) as well as to the multiple linear regression. Thus, German spatial and functional submarkets are aggregated. For the spatially aggregated data set, the ANN estimation leads to improved results. The ANN estimation of the functionally aggregated data appears deceptively simple due to too small samples not representing the sampling density. The question arises, what are adequate sample sizes regarding the complexity of the unknown relationship. We purpose a model complexity analysis procedure based on resampling and the structural risk minimisation theory and derive a minimum sample size for the spatially aggregated data. Only for the EKF computations, this minimum sample size is reached due to less variance of the ANN estimations. Generally, the EKF computation leads to a better ANN performance in contrast to LM. Finally, the spatial cross-submarket ANN estimation reaches accuracies of local property valuation procedures.

Details

Original language	English
Article number	105475
Pages (from-to)	1-16
Number of pages	16
Journal	Land Use Policy
Volume	107
Issue number	8
Publication status	Published - Aug 2021
Peer-reviewed	Yes

External IDs

Scopus	85105279645
ORCID	/0000-0001-8962-1505/work/125727846
ORCID	/0000-0003-2742-5183/work/142252424

Keywords

Adequate sample size, Real estate valuation, Artificial neural network, Complexity analysis

Research Portal of the TU Dresden

Contributors

Abstract

Details

External IDs

Keywords

Keywords