Back to Search Start Over

Spectral data augmentation for leaf nutrient uptake quantification.

Authors :
Martins, R.C.
Queirós, C.
Silva, F.M.
Santos, F.
Barroso, T.G.
Tosin, R.
Cunha, M.
Leão, M.
Damásio, M.
Martins, P.
Silvestre, J.
Source :
Biosystems Engineering. Oct2024, Vol. 246, p82-95. 14p.
Publication Year :
2024

Abstract

Data scarcity is a hurdle for physiology-based precision agriculture. Measuring nutrient uptake by visible-near infrared spectroscopy implies collecting spectral and compositional data from low-throughput, such as inductively coupled plasma optical emission spectroscopy. This paper introduces data augmentation in spectroscopy by hybridisation for expanding real-world data into synthetic datasets statistically representative of the real data, allowing the quantification of macronutrients (N, P, K, Ca, Mg, and S) and micronutrients (Fe, Mn, Zn, Cu, and B). Partial least squares (PLS), local partial least squares (LocPLS), and self-learning artificial intelligence (SLAI) were used to determine the capacity to expand the knowledge base. PLS using only real-world data (RWD) cannot quantify some nutrients (N and Cu in grapevine leaves and K, Ca, Mg, S, and Cu in apple tree leaves). The synthetic dataset of the study allowed predicting real-world leaf composition of macronutrients (N, P, K, Ca, Mg and S) (Pearson coefficient correlation (R) ∼ 0.61–0.94 and standard error (SE) ∼ 0.04–0.05%) and micro-nutrients (Fe, Mn, Zn, Cu and B) (R ∼ 0.66–0.91 and SE ∼ 0.88–3.98 ppm) in grapevine leaves using LocPLS and SLAI. The synthetic dataset loses significance if the real-world counterpart has low representativity, resulting in poor quantifications of macronutrients (R ∼ 0.51–0.72 and SE ∼ 0.02–0.13%) and micronutrients (R ∼ 0.53–0.76 and SE ∼ 8.89–37.89 ppm), and not allowing S quantification (R = 0.37, SE = 0.01) in apple tree leaves. Representative real-world sampling makes data augmentation in spectroscopy very efficient in expanding the knowledge base and nutrient quantifications. [Display omitted] • Data hybridisation expands the information of the original dataset. • Synthetic data reproduces spectral and compositional information of the samples. • Hybridisation creates samples that could have been physically sampled. • Representativity of real-world data is essential for information expansion. • Data augmentation as a diagnostic tool of knowledge base representativity. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
15375110
Volume :
246
Database :
Academic Search Index
Journal :
Biosystems Engineering
Publication Type :
Academic Journal
Accession number :
179396794
Full Text :
https://doi.org/10.1016/j.biosystemseng.2024.07.001