학술논문

Analysis and prediction of second-hand house price based on random forest
Document Type
article
Source
Applied Mathematics and Nonlinear Sciences, Vol 7, Iss 1, Pp 27-42 (2022)
Subject
data analysis
second-hand house
random forest
python
crawler technology
Mathematics
QA1-939
Language
English
ISSN
2444-8656
Abstract
Using Python language and combined with data analysis and mining technology, the authors capture and clean the housing source data of second-hand houses in Chengdu from Beike Network, and visually analyse the cleaned data. Then, a Random Forest (RF) model is established for 38,363 data elements. According to the visual analysis results, the model variables are revalued, the key factors affecting house prices are studied and the optimised model is used to predict house prices. The experiment shows that the deviation between the house price predicted by the RF model and that predicted by the real house price is small; it also indicates the accuracy of the RF model and demonstrates its good application value.