Publication Details

Category Text Publication
Reference Category Journals
DOI 10.3390/rs61212247
Title (Primary) Extraction of plant physiological status from hyperspectral signatures using machine learning methods
Author Doktor, D.; Lausch, A.; Spengler, D.; Thurner, M.
Source Titel Remote Sensing
Year 2014
Department CLE
Volume 6
Issue 12
Page From 12247
Page To 12274
Language englisch
Keywords hyperspectral data; vegetation status; random forest; PROSAIL; crop
UFZ wide themes RU1;
Abstract The machine learning method, random forest (RF), is applied in order to derive biophysical and structural vegetation parameters from hyperspectral signatures. Hyperspectral data are, among other things, characterized by their high dimensionality and autocorrelation. Common multivariate regression approaches, which usually include only a limited number of spectral indices as predictors, do not make full use of the available information. In contrast, machine learning methods, such as RF, are supposed to be better suited to extract information on vegetation status. First, vegetation parameters are extracted from hyperspectral signatures simulated with the radiative transfer model, PROSAIL. Second, the transferability of these results with respect to laboratory and field measurements is investigated. situ observations of plant physiological parameters and corresponding spectra are gathered in the laboratory for summer barley (vulgare). Field situ measurements focus on winter crops over several growing seasons. Chlorophyll content, Leaf Area Index and phenological growth stages are derived from simulated and measured spectra. RF performs very robustly and with a very high accuracy on PROSAIL simulated data. Furthermore, it is almost unaffected by introduced noise and bias in data. When applied to laboratory data, the prediction accuracy is still good (Cab: 2 = 0.94/ LAI: 2 = 0.80/BBCH (Growth stages of mono-and dicotyledonous plants) : 2 = 0.91), but not as high as for simulated spectra. Transferability to field measurements is given with prediction levels as high as for laboratory data (Cab: 2 = 0.89/LAI: 2 = 0.89/BBCH: 2 = Wavelengths for deriving plant physiological status based on simulated and measured hyperspectral signatures are mostly selected from appropriate spectral regions (both field and laboratory): 700–800 nm regressing on Cab and 800–1300 nm regressing on LAI. Results suggest that the prediction accuracy of vegetation parameters using RF is not hampered by the high dimensionality of hyperspectral signatures (given preceding feature reduction). Wavelengths selected as important for prediction might, however, vary between underlying datasets. The introduction of changing environmental factors (soil, illumination conditions) has some detrimental effect, but more important factors seem to stem from measurement uncertainties and plant geometries.
Persistent UFZ Identifier https://www.ufz.de/index.php?en=20939&ufzPublicationIdentifier=15559
Doktor, D., Lausch, A., Spengler, D., Thurner, M. (2014):
Extraction of plant physiological status from hyperspectral signatures using machine learning methods
Remote Sens. 6 (12), 12247 - 12274 10.3390/rs61212247