Contributors | Affiliation | Role |
---|---|---|
Chan, Yvonne Ling-Hsiang | University of Hawaiʻi at Mānoa (HIMB) | Principal Investigator, Contact |
Selkoe, Kimberly | University of California-Santa Barbara (UCSB-NCEAS) | Co-Principal Investigator |
Copley, Nancy | Woods Hole Oceanographic Institution (WHOI BCO-DMO) | BCO-DMO Data Manager |
This dataset includes raw data for model simulations and scripts for analyzing the data and products of a Hierarchical Approximate Bayesian Computation analysis.
* Download the associated files (12.4 GB): http://dmoserv3.bco-dmo.org/data/toonen/Multispecies_Connectivity/Toonen_et_al_OCE-1260169_Data_Submission_2017-05-04.zip
hBayeSSC download available here: https://github.com/UH-Bioinformatics/hBayeSSC
hBayeSSC is a Python script that wraps around Serial SimCoal in order to simulate a multi-taxa community undergoing a coordinated demographic expansion.
Requirements:
The applications required to produce a set of simulations with multitaxa summary statistics for the hABC analysis described in Chan et al. 2014:
- BayeSSC - Serial Simcoal
- Python 2.x >= 2.4
- hBayeSSC.py
- msReject
Input files
The input files needed to produce a set of simulations with multitaxa summary statistics for the hABC analysis described in Chan et al. 2014:
- A table of observed summary statistics for each taxon in the community. (details)
- An input par file for serial simcoal. (details)
Observation summary statistics:
Sample observation file:
The table of observed summary statistics consists of columns with the following header names. hBayeSSC replaces the appropriate line in the par file with these values:
Column name = Description
species = Name of taxa
nsam = number of samples to be simulated
nsites = Number of base pairs
tstv = % transitions
gamma = Gamma shape parameter
gen = Numbers of years per generation
locuslow = Low estimate of the locus mutation rate per generation
locushigh = High estimate of the locus mutation rate per generation
Nelow = Low estimate for effective population size
Nehigh = High estimate for effective population size
SegSites = Segregating sites
nucdiv = Nucleotide diversity
Haptypes = Number of haplotypes
HapDiver = Haplotypic diversity
TajimasD = Tajima's D
F* = Fu's F
A more complete description of these values can be found on the BayeSSC website.
par file
The par file contains one prior which is not individually replaced, such as expansion magnitude (under historical events) and will apply to all populations.
Then we use the following R-script and abc.R to do the final 1,000 acceptance and parameter estimation using local linear regression.
BayeSSC (Bayesian Serial SimCoal) http://web.stanford.edu/group/hadlylab/ssc/
Data was processed using R-3.2.1.
BCO-DMO Processing Notes:
Compressed submitted files into a .zip file.
File |
---|
NWHI_HABC_Simulations.csv (Comma Separated Values (.csv), 218 bytes) MD5:bf755a4b61a9fffaa9bafa15248104c4 Primary data file for dataset ID 699774 |
Parameter | Description | Units |
description | name of dataset download | unitless |
link | download link for zipped scripts and simulations to perform HABC analysis | unitless |
Website | |
Platform | UHawaii_HIMB |
Start Date | 2013-03-01 |
Description | Modeling of Hawaiian coral reef taxa |
Description from NSF award abstract:
The exchange of individuals among populations, termed connectivity, is a central element of population persistence and maintenance of genetic diversity, and influences most ecological and evolutionary processes. To date, field studies of marine connectivity have necessarily focused on one or a few species at a time, providing little understanding of both the extent of variability in connectivity across a whole community and what factors drive that variability. This project will address these questions with population genetic datasets of a diverse marine fauna sampled across the Hawaiian Archipelago. By combining these genetic data with extensive oceanographic, ecological and historical data, this project can potentially transform our understanding of the basis of the genetic structure of populations and the processes influencing genetic patterns. This project will provide unique, and new, knowledge to basic marine ecology and the science of Ecosystem Based Management while incorporating the latest analytical and simulation approaches.
The results will be novel on several fronts: 1) advancing our understanding of community genetics and associated statistical techniques; 2) achieving true integration of genetic, ecological and oceanographic data over large spatial scales for many species simultaneously using a World Heritage Site; the Hawaiian Papahânaumokuâkea Marine National Monument; 3) factoring historical effects into connectivity studies; and 4) providing information on the location of barriers to connectivity, the sources and sinks of individuals and the physical processes influencing ecological patterns at a community level. This project will result in a quantum leap for both the conceptual and empirical understanding of marine connectivity and the utility of population genetic data in basic and applied marine science.
Funding Source | Award |
---|---|
NSF Division of Ocean Sciences (NSF OCE) |