Publication Details

Category Text Publication
Reference Category Journals
DOI 10.1016/j.biocon.2014.11.012
Title (Primary) Uncertainty analysis of crowd-sourced and professionally collected field data used in species distribution models of Taiwanese moths
Author Lin, Y.-P.; Deng, D.; Lin, W.-C.; Lemmens, R.; Crossman, N.D.; Henle, K.; Schmeller, D.S.
Source Titel Biological Conservation
Year 2015
Department NSF
Volume 181
Page From 102
Page To 110
Language englisch
Supplements https://ars.els-cdn.com/content/image/1-s2.0-S0006320714004376-mmc1.docx
Keywords Social media; Citizen science; Volunteer survey; Prediction of species distribution; Uncertainty; Natural language; Large-scale monitoring program
UFZ wide themes RU1
Abstract The purposes of this study are to extract the names of species and places for a citizen-science monitoring program, to obtain crowd-sourced data of acceptable quality, and to assess the quality and the uncertainty of predictions based on crowd-sourced data and professional data. We used Natural Language Processing to extract names of species and places from text messages in a citizen science project. Bootstrap and Maximum Entropy methods were used to assess the uncertainty in the model predictions based on crowd-sourced data from the EnjoyMoths project in Taiwan. We compared uncertainty in the predictions obtained from the project and from the Global Biodiversity Information Facility (GBIF) field data for seven focal species of moth. The proximity to locations of easy access and the Ripley K method were used to test the level of spatial bias and randomness of the crowd-sourced data against GBIF data. Our results show that extracting information to identify the names of species and their locations from crowd-sourced data performed well. The results of the spatial bias and randomness tests revealed that the crowd-sourced data and GBIF data did not differ significantly in respect to both spatial bias and clustering. The prediction models developed using the crowd-sourced dataset were the most effective, followed by those that were developed using the combined dataset. Those that performed least well were based on the small sample size GBIF dataset. Our method demonstrates the potential for using data collected by citizen scientists and the extraction of information from vast social networks. Our analysis also shows the value of citizen science data to improve biodiversity information in combination with data collected by professionals.
Persistent UFZ Identifier https://www.ufz.de/index.php?en=20939&ufzPublicationIdentifier=15533
Lin, Y.-P., Deng, D., Lin, W.-C., Lemmens, R., Crossman, N.D., Henle, K., Schmeller, D.S. (2015):
Uncertainty analysis of crowd-sourced and professionally collected field data used in species distribution models of Taiwanese moths
Biol. Conserv. 181 , 102 - 110 10.1016/j.biocon.2014.11.012