Sensitive Multispectral Variable Screening Method and Yield Prediction Models for Sugarcane Based on Gray Relational Analysis and Correlation Analysis

Sugarcane yield prediction plays a pivotal role in enabling farmers to monitor crop development and optimize cultivation practices, guiding harvesting operations for sugar mills. In this study, we established three experimental fields, which were planted with three main sugarcane cultivars in Guangx...

Full description

Saved in:
Bibliographic Details
Main Authors: Shimin Zhang, Huojuan Qin, Xiuhua Li, Muqing Zhang, Wei Yao, Xuegang Lyu, Hongtao Jiang
Format: Article
Language:English
Published: MDPI AG 2025-06-01
Series:Remote Sensing
Subjects:
Online Access:https://www.mdpi.com/2072-4292/17/12/2055
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Sugarcane yield prediction plays a pivotal role in enabling farmers to monitor crop development and optimize cultivation practices, guiding harvesting operations for sugar mills. In this study, we established three experimental fields, which were planted with three main sugarcane cultivars in Guangxi, China, respectively, implementing a multi-gradient fertilization design with 39 plots and 810 sampling grids. Multispectral imagery was acquired by unmanned aerial vehicles (UAVs) during five critical growth stages: mid-tillering (T1), late-tillering (T2), mid-elongation (T3), late-elongation (T4), and maturation (T5). Following rigorous image preprocessing (including stitching, geometric correction, and radiometric correction), 16 VIs were extracted. To identify yield-sensitive vegetation indices (VIs), a spectral feature selection criterion combining gray relational analysis and correlation analysis (GRD-r) was proposed. Subsequently, three supervised learning algorithms—Gradient Boosting Decision Tree (GBDT), Random Forest (RF), and Support Vector Machine (SVM)—were employed to develop both single-stage and multi-stage yield prediction models. Results demonstrated that multi-stage models consistently outperformed their single-stage counterparts. Among the single-stage models, the RF model using T3-stage features achieved the highest accuracy (R<sup>2</sup> = 0.78, <i>RMSE</i><sub>V</sub> = 7.47 t/hm<sup>2</sup>). The best performance among multi-stage models was obtained using a GBDT model constructed from a combination of DVI (T1), NDVI (T2), TDVI (T3), NDVI (T4), and SRPI (T5), yielding R<sup>2</sup> = 0.83 and <i>RMSE</i><sub>V</sub> = 6.63 t/hm<sup>2</sup>. This study highlights the advantages of integrating multi-temporal spectral features and advanced machine learning techniques for improving sugarcane yield prediction, providing a theoretical foundation and practical guidance for precision agriculture and harvest logistics.
ISSN:2072-4292