agronomy
Article
Attempting to Estimate the Unseen—Correction for Occluded
Fruit in Tree Fruit Load Estimation by Machine Vision with
Deep Learning
Anand Koirala * , Kerry B. Walsh and Zhenglin Wang
Citation: Koirala, A.; Walsh, K.B.;
Wang, Z. Attempting to Estimate the
Unseen—Correction for Occluded
Fruit in Tree Fruit Load Estimation by
Machine Vision with Deep Learning.
Agronomy 2021, 11, 347. https://
doi.org/10.3390/agronomy11020347
Academic Editor: Jung Eek Son
Received: 25 January 2021
Accepted: 10 February 2021
Published: 15 February 2021
Publisher’s Note: MDPI stays neutral
with regard to jurisdictional claims in
published maps and institutional affil-
iations.
Copyright: © 2021 by the authors.
Licensee MDPI, Basel, Switzerland.
This article is an open access article
distributed under the terms and
conditions of the Creative Commons
Attribution (CC BY) license (https://
creativecommons.org/licenses/by/
4.0/).
Institute for Future Farming Systems, Central Queensland University, Building 361, Bruce Highway,
Rockhampton, QLD 4701, Australia; k.walsh@cqu.edu.au (K.B.W.); z.wang@cqu.edu.au (Z.W.)
* Correspondence: anand.koirala@cqumail.com; Tel.: +61-411096239
Abstract: Machine vision from ground vehicles is being used for estimation of fruit load on trees, but
a correction is required for occlusion by foliage or other fruits. This requires a manually estimated
factor (the reference method). It was hypothesised that canopy images could hold information related
to the number of occluded fruits. Several image features, such as the proportion of fruit that were
partly occluded, were used in training Random forest and multi-layered perceptron (MLP) models
for estimation of a correction factor per tree. In another approach, deep learning convolutional neural
networks (CNNs) were directly trained against harvest count of fruit per tree. A R2 of 0.98 (n = 98
trees) was achieved for the correlation of fruit count predicted by a Random forest model and the
ground truth fruit count, compared to a R
2
of 0.68 for the reference method. Error on prediction
of whole orchard (880 trees) fruit load compared to packhouse count was 1.6% for the MLP model
and 13.6% for the reference method. However, the performance of these models on data of another
season was at best equivalent and generally poorer than the reference method. This result indicates
that training on one season of data was insufficient for the development of a robust model.
Keywords: fruit occlusion; deep learning; machine vision; yield estimation; fruit count; neural
network; CNN; tree crop; Mangifera indica; MLP; canopy
1. Introduction
1.1. In-Field Approches to the Estimation of Tree Fruit Load
For any crop, yield estimation aids harvest resourcing and market planning. Current
practice for tree fruit yield estimation is based on knowledge of previous yield history, visual
observation of tree condition and/or manual counting of fruit on trees. Manual counting of a
sample of trees is current best practice for fruit load estimation, but this is labour intensive
and can be unreliable. For example, the coefficient of variation (standard deviation on tree
fruit load divided by mean fruit load) for ten mango orchards was reported to vary between
27 and 93%, while for one orchard the prediction error of manual count of fruit in an orchard
relative to actual harvest count was 31 and 10% for counts based on 5 and 33%, respectively,
of the 469 trees in the orchard [1]. There is a need for an alternative estimation method, given
the workload for manual fruit counting of such numbers of trees.
Several researchers have reported on the use of machine vision for tree fruit detection
and counting. A recent review [2] reported high accuracies for deep learning methods used
in detection of fruit in canopy images, e.g., a F1 score of 0.968 was achieved for detection
of fruit in images of mango canopies using a customised deep learning YOLO model.
However, the proportion of fruit on a tree that are captured in images acquired from the
interrow depends on canopy foliage and fruit density. A “dense” canopy will have a higher
proportion of fruit hidden from camera view than a less dense canopy, with fruit occluded
by foliage or other fruit. In sparse canopies, more fruits are visible, but a given fruit may
be seen twice in images from both sides of the tree row, leading to a double count.
Agronomy 2021, 11, 347. https://doi.org/10.3390/agronomy11020347 https://www.mdpi.com/journal/agronomy