Tomato detection in natural environment based on improved YOLOv8 network

In this paper, an improved lightweight YOLOv8 method is proposed to detect the ripeness of tomato fruits, given the problems of subtle differences between neighboring stages of ripening and mutual occlusion of branches, leaves, and fruits. The method replaces the backbone network of the original YO...

Full description

Saved in:
Bibliographic Details
Main Authors: Wancheng Dong, Yipeng Zhao, Jiaxing Pei, Zuolong Feng, Zhikai Ma, Leilei Wang, Simon Shemin Wang
Format: Article
Language:English
Published: PAGEPress Publications 2025-07-01
Series:Journal of Agricultural Engineering
Subjects:
Online Access:https://www.agroengineering.org/jae/article/view/1732
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:In this paper, an improved lightweight YOLOv8 method is proposed to detect the ripeness of tomato fruits, given the problems of subtle differences between neighboring stages of ripening and mutual occlusion of branches, leaves, and fruits. The method replaces the backbone network of the original YOLOv8 with a more lightweight MobileNetV3 structure to reduce the number of parameters of the model; at the same time, it integrates the convolutional attention mechanism module (CBAM) in the feature extraction network, which enhances the network's capability of extracting features of tomato fruits. At the same time, it introduces the SCYLLA-IoU (SIoU) as a bounded YOLOv8 frame regression loss function, effectively solving the mismatch problem between the predicted frame and the actual frame and improving recognition accuracy. Compared with the current mainstream models Resnet50, VGG16, YOLOv3, YOLOv5, YOLOv7, etc., the model is in an advantageous position regarding precision rate, recall rate, and detection accuracy. The research and experimental results show that the mean values of precision, recall rate, and average precision of the improved MCS-YOLOv8 model under the test set are 91.2%, 90.2%, and 90.3%, respectively. The detection speed of a single image is 5.4ms, and the model occupies less memory by 8.7 M. The model has a clear advantage in both detection speed and precision rate and also shows that the improved MCS-YOLOv8 model can provide strong technical support for tomato-picking robots in complex environments in the field.
ISSN:1974-7071
2239-6268