The Interpretative Effects of Normalization Techniques on Complex Regression Modeling: An Application to Real Estate Values Using Machine Learning

The performance of machine learning models depends on several factors, including data normalization, which can significantly improve its accuracy. There are many standardization techniques, and none is universally suitable; the choice depends on the characteristics of the problem, the predictive tas...

Full description

Saved in:
Bibliographic Details
Main Authors: Debora Anelli, Pierluigi Morano, Francesco Tajani, Maria Rosaria Guarini
Format: Article
Language:English
Published: MDPI AG 2025-06-01
Series:Information
Subjects:
Online Access:https://www.mdpi.com/2078-2489/16/6/486
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:The performance of machine learning models depends on several factors, including data normalization, which can significantly improve its accuracy. There are many standardization techniques, and none is universally suitable; the choice depends on the characteristics of the problem, the predictive task, and the needs of the model used. This study analyzes how normalization techniques influence the outcomes of real estate price regression models using machine learning to uncover complex relationships between urban and economic factors. Six normalization techniques are employed to assess how they affect the estimation of relationships between property value and factors like social degradation, resident population, per capita income, green spaces, building conditions, and degraded neighborhood presence. The study’s findings underscore the pivotal role of normalization in shaping the perception of variables, accentuating critical thresholds, or distorting anticipated functional relationships. The work is the first application of a methodological approach to define the best technique on the basis of two criteria: statistical reliability and empirical evidence of the functional relationships obtainable with each standardization technique. Notably, the study underscores the potential of machine-learning-based regression to circumvent the limitations of conventional models, thereby yielding more robust and interpretable results.
ISSN:2078-2489