A Novel Approach to Pine Nut Classification: Combining Near-Infrared Spectroscopy and Image Shape Features with Soft Voting-Based Ensemble Learning

Bibliographic Details
Main Authors: Yueyun Yu, Xin Huang, Danjv Lv, Benjamin K. Ng, Chan-Tong Lam
Format: Article
Language: English
Published: MDPI AG 2025-06-01
Series: Mathematics
Online Access: https://www.mdpi.com/2227-7390/13/12/2009
Description
Summary: Pine nuts hold significant economic value due to their rich plant protein and healthy fats, yet precise variety classification has long been hindered by limitations of traditional techniques such as chemical analysis and machine vision. This study proposes a novel near-infrared (NIR) spectral feature selection algorithm, termed the improved binary equilibrium optimizer with selection probability (IBiEO-SP), which incorporates a dynamic probability adjustment mechanism to achieve efficient feature dimensionality reduction. Experimental validation on a dataset comprising seven pine nut varieties demonstrated that, compared to particle swarm optimization (PSO) and the genetic algorithm (GA), the IBiEO-SP algorithm improved average classification accuracy by 5.7% (p < 0.01, Student’s t-test) under four spectral preprocessing methods (MSC, SNV, SG1, and SG2). Remarkably, only 2–3 features were required to achieve optimal performance (MSC + random forest: 99.05% accuracy, 100% F1/precision; SNV + KNN: 97.14% accuracy, 100% F1/precision). Furthermore, a multimodal data synergy strategy integrating NIR spectroscopy with morphological features was proposed, and a classification model was constructed using a soft voting ensemble. The final classification accuracy reached 99.95%, representing a 2.9% improvement over single-spectral-mode analysis. The results indicate that the IBiEO-SP algorithm effectively balances feature discriminative power and model generalization needs, overcoming the contradiction between high-dimensional data redundancy and low-dimensional information loss. This work provides a high-precision, low-complexity solution for rapid quality detection of pine nuts, with broad implications for agricultural product inspection and food safety.
ISSN: 2227-7390
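
The summary names two generic building blocks, SNV preprocessing and soft voting, without giving implementation details. The sketch below illustrates the textbook form of each in plain Python. It is not the authors' code: the function names `snv` and `soft_vote`, the use of the population standard deviation in SNV, the equal default weights, and the probability values in the usage lines are all assumptions made for illustration.

```python
from statistics import mean, pstdev


def snv(spectrum):
    """Standard normal variate: centre one spectrum and scale it to unit spread.

    Assumption: population standard deviation; some tools use the sample form.
    """
    mu = mean(spectrum)
    sigma = pstdev(spectrum)
    if sigma == 0:
        raise ValueError("SNV is undefined for a flat spectrum")
    return [(x - mu) / sigma for x in spectrum]


def soft_vote(prob_rows, weights=None):
    """Soft voting: average per-class probabilities across classifiers.

    prob_rows holds one probability vector per classifier, all over the same
    classes. Returns (index of the winning class, averaged probabilities).
    """
    if weights is None:
        weights = [1.0] * len(prob_rows)  # assumption: equal weights
    total = sum(weights)
    n_classes = len(prob_rows[0])
    avg = [
        sum(w * row[k] for w, row in zip(weights, prob_rows)) / total
        for k in range(n_classes)
    ]
    best = max(range(n_classes), key=avg.__getitem__)
    return best, avg


# Hypothetical probabilities from three classifiers over three classes.
votes = [
    [0.9, 0.1, 0.0],
    [0.4, 0.6, 0.0],
    [0.4, 0.6, 0.0],
]
winner, averaged = soft_vote(votes)
scaled = snv([1.0, 2.0, 3.0])
```

In this made-up example two of the three classifiers rank class 1 first, so a majority (hard) vote would pick class 1, while averaging the probabilities favours class 0 because the first classifier is far more confident. That sensitivity to confidence is the usual reason for choosing soft over hard voting.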