2 The shiny app
2.1 Launching the app
library(bdDwC) # Laod package library
runDwC() # Launch the app
2.2 App overview
In the first screen, you’ll need to load your biodiversity data; choose dictionary and run the Darwinizer. There are two options, form a file on your computer, of fetch from a web based data provider.
2.3 Data upload
2.3.1 From a local file
A CSV file or a Darwin Core Archive (DwC-A) zip file can be uploaded.
2.3.2 From an online database
Also, data can be retrieved directly from various online biodiversity databases. You need only to:
- Select the database
- Specify the desired scientific name.
- Specify the number of records (upper limit of 50,000).
- Check the box if records must have coordinates.
- Wait for data to be downloaded.
2.4 Dictionaries
A dictionary is a key component when Darwinizing a dataset. It’s basically a lookup table that lists a possible variation of field name and it corresponding DwC name.
2.4.1 The Darwin Cloud dictionary
The Darwin Cloud dictionary (Wieczorek et al. 2017), is a lookup table that accumulates different variations in DwC field names from different publishers. This valuable and critical dictionary was created and is maintained by the Kurator project (http://kurator.acis.ufl.edu/kurator-web/), which provides workflow tools for data quality improvement of biodiversity data, via a user-friendly web interface. The development of bdDwC was inspired by Kurator’s own Darwinizer.
Updating the Darwin Cloud dictionary
It’s recommended to update the Darwin Cloud dictionary file. This can be done easily by clicking the Update DC button.
2.4.2 Custom dictionary
It’s also possible to add your own dictionary by creating a CSV file with two columns, one for the Field Names and one for the Standard Names. After uploading the custom disctionary, we need to specify which field denotes the ‘User fierld names’ and which is the ‘Standard (DwC) field names’.
2.5 Darwinizing your dataset
Once a dataset is uploaded, the ‘Submit to Darwinizer’ button is activated, Clicking it will begin the interactive ‘Darwinize the dataset’ process.
2.6 Darwinizer results
2.6.1 Results page overwiew
Manually renaming field names can be done very easily, just choose the two corresponding fields and click the Rename button.
Hovering over a DwC standard name will display its description.
2.7 Download your Darwinized data
2.8 Closing the app
Just close the app browser tab, and the R session will be terminated. To reopen it run in the R Console runDwC()
.
2.9 References
References
Wieczorek, John, Paul J. Morris, James Hanken, David B. Lowery, Bertram Ludäscher, James Macklin, Timothy McPhillips, Robert A. Morris, and Qian Zhang. 2017. “Darwin Cloud: Mapping Real-World Data to Darwin Core.” Biodiversity Information Science and Standards 1. Pensoft Publishers: e20486. doi:10.3897/tdwgproceedings.1.20486.