Go to the Bulk Uploader (https://data.pointblue.org/science/projectmanager/bulk_uploader) and select your project. You will only see projects that you have access to.
The Add Observations module will load both sampling events and observations into your project. Bulk uploading observations requires two files: a CSV file with observation data, and a YAML configuration file describing how the CSV file columns map into your project database. CSV and YAML templates are generated automatically based on your protocol selections, and three different formats are available to accommodate common data structures.
The CSV file you will load should have one observation type per line (see the three accepted format types below for specifics). Observations are organized into events as part of this process: all observations seen at a single Sampling Unit at a given date and time under a Protocol make up one sampling event. All event information, such as date and Sampling Unit name, should be repeated for each observation record in the CSV file (also known as "flattened" or denormalized).
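To illustrate what "flattened" means here, the following minimal Python sketch turns nested event data into one row per observation, repeating the event fields on every row. The column names (`sampling_unit`, `date`, `time`, `species`, `count`) and the sample values are hypothetical; the real headers come from the template generated for your protocol.

```python
import csv
import io

# Hypothetical nested data: one sampling event holding several observations.
# Column names are placeholders, not the headers used by the real template.
events = [
    {
        "sampling_unit": "PT01",
        "date": "2023-05-14",
        "time": "06:30",
        "observations": [
            {"species": "SOSP", "count": 2},
            {"species": "WREN", "count": 1},
        ],
    },
]


def flatten(events):
    """Yield one row per observation, repeating the event fields on each row."""
    for event in events:
        event_fields = {k: v for k, v in event.items() if k != "observations"}
        for obs in event["observations"]:
            yield {**event_fields, **obs}


rows = list(flatten(events))

# Write the flattened rows as CSV text (in memory, for demonstration only).
buffer = io.StringIO()
writer = csv.DictWriter(buffer, fieldnames=list(rows[0].keys()), lineterminator="\n")
writer.writeheader()
writer.writerows(rows)
flat_csv = buffer.getvalue()
print(flat_csv)
```

Each output row carries the same Sampling Unit, date, and time, which is how the rows can be regrouped into a single sampling event.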
To use the tool:
Choose the Add Observations tab.
If this is your first time using this tool to upload data of a particular type and protocol:
Select “Get a blank Template to fill out” and “Point Count”, then select a Protocol and Site Condition Protocol. Note: Protocols must already exist in the Project before they will show up in the list. If you need a new protocol added, please submit a request to support@pointbluehelp.zendesk.com.
Select a layout style for your observation data: Standard, By Distance Bins, or By Time Bins. See XXX for graphics that demonstrate the different format choices using the same dataset. All three format options result in the same data presentation in Biologists.
Standard is formatted as one record per unique combination of species, detection cue, distance bin, time bin, breeding status, and singing indicator.
Distance Bins is formatted as one record per unique combination of species, detection cue, time bin, breeding status, and singing indicator. Counts for each distance bin in the protocol are entered as separate fields in the same record.
Time Bins is formatted as one record per unique combination of species, detection cue, distance bin, breeding status, and singing indicator. Counts for each time bin in the protocol are entered as separate fields in the same record.
NOTE: Site condition protocol data are formatted and entered the same regardless of what layout style is selected for the observations.
Download CSV and YAML files. These files are custom-generated based on the Protocol and Site Condition Protocol you selected above.
Populate a CSV file with your observation data in one of the three accepted formats. The top row of headers in the template defines what should be in each column. The second row of headers is required for the YAML to process the file. If you wish to change the name of a header, you will need to change the corresponding line in the YAML file. However, in most cases, the YAML file should not need to be modified.
TODO: Create documentation for YAML modification.
Note: All entries must match the field definitions set by the protocol. See Protocol Search for valid codes to use for the protocol-specific fields. For example, for the protocol definition VCP25_150, the distance bin values you must enter in your CSV file are L25, L50, L75, G00, B00, B20, B50, or FLO.
Upload your data by:
Selecting “Bulk upload datafile.”
Selecting your populated CSV file to upload.
Selecting the YAML file that corresponds to your CSV file.
Selecting “Process” to upload the data.
Results will be displayed. Review these carefully to make sure both site conditions and observations loaded. You can also go into Biologists to confirm that the data uploaded and are correct.
Note: If your observation count shows 0 in Biologists or in the Results screen of Bulk Uploader, but the site conditions loaded, it may be easiest to delete the sampling event with the partial dataset from Biologists, and then upload files for just the observation protocol. This will force an informative error message identifying which field value is not being accepted by Bulk Uploader. If a partial upload succeeds, any error messages will be suppressed.
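One way to reduce the chance of a rejected field value is to check the CSV against the protocol's valid codes before uploading. The sketch below is not part of Bulk Uploader; it is a hypothetical pre-check that uses the VCP25_150 distance bin codes listed above. It assumes a column named `distance_bin`, a single header row, and exact (case-sensitive) matching, all of which are illustrative assumptions. The real template has two header rows, so the reader setup would need adjusting for an actual file.

```python
import csv
import io

# Distance bin codes given above for the VCP25_150 example protocol.
VALID_DISTANCE_BINS = {"L25", "L50", "L75", "G00", "B00", "B20", "B50", "FLO"}


def find_invalid_bins(csv_text, column="distance_bin", valid=VALID_DISTANCE_BINS):
    """Return (line_number, value) pairs for rows whose code is not in `valid`.

    Assumes one header row and a hypothetical column name; matching is exact.
    """
    reader = csv.DictReader(io.StringIO(csv_text))
    problems = []
    for line_number, row in enumerate(reader, start=2):  # line 1 is the header
        value = (row.get(column) or "").strip()
        if value not in valid:
            problems.append((line_number, value))
    return problems


# Made-up sample: the second and third data rows contain invalid codes.
sample = "species,distance_bin,count\nSOSP,L25,2\nWREN,l50,1\nSOSP,G100,1\n"
problems = find_invalid_bins(sample)
for line_number, value in problems:
    print(f"line {line_number}: unexpected distance bin {value!r}")
```

The same pattern could be repeated for other protocol-specific fields by swapping in the column name and the set of valid codes from Protocol Search.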
Data format examples:
1. Standard format example
2. Distance Bin format example
3. Time Bin format example
4. Results in Biologists (note, date was changed in these examples to allow loading all three formats)
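The relationship between the Standard and Distance Bins layouts can also be sketched in code. The Python example below groups Standard-style records by every field except the distance bin and spreads the counts into one column per bin. The field names, species codes, and the three-bin subset are hypothetical, and breeding status and singing indicator are omitted for brevity; this is an illustration of the idea, not the uploader's own logic.

```python
# Hypothetical Standard-layout records: one row per combination, including distance bin.
standard_rows = [
    {"species": "SOSP", "cue": "S", "time_bin": "3", "distance_bin": "L25", "count": 2},
    {"species": "SOSP", "cue": "S", "time_bin": "3", "distance_bin": "L50", "count": 1},
    {"species": "WREN", "cue": "V", "time_bin": "5", "distance_bin": "L25", "count": 1},
]

# Illustrative subset of distance bin codes; a real protocol defines its own list.
DISTANCE_BINS = ["L25", "L50", "L75"]


def to_distance_bin_layout(rows, bins):
    """Collapse Standard rows into one record per combination, with one count column per bin.

    Raises KeyError if a row uses a distance bin that is not in `bins`.
    """
    grouped = {}
    for row in rows:
        # Everything except the distance bin and the count identifies the record.
        key = tuple((k, v) for k, v in row.items() if k not in ("distance_bin", "count"))
        record = grouped.setdefault(key, {**dict(key), **{b: 0 for b in bins}})
        record[row["distance_bin"]] += row["count"]
    return list(grouped.values())


wide_rows = to_distance_bin_layout(standard_rows, DISTANCE_BINS)
for record in wide_rows:
    print(record)
```

The Time Bins layout follows the same pattern, with the time bin taking the place of the distance bin as the field that is spread into columns.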