Dataset: Metadata for longread sequencing of Carcinus maenas collected from Buzzards Bay, Massachusetts from May 2022 to Aug 2022

Validated

Final no updates expected

DOI: 10.26008/1912/bco-dmo.949666.1

Version 1 (2025-01-29)

Dataset Type: Other Field Results

Principal Investigator: Carolyn Tepolt (Woods Hole Oceanographic Institution)

BCO-DMO Data Manager: Audrey Mickle (Woods Hole Oceanographic Institution)


Project: Collaborative Research: Tracking fine-scale selection to temperature at the invasion front of a highly dispersive marine predator (West Coast Carcinus)


Abstract

This project explores genomic changes in the invasive European green crab (Carcinus maenas), including at a putative inversion polymorphism. To begin to explore structural variation without a reference genome, we conducted semi-targeted longread sequencing of the C. maenas genome using MinION sequencing. This dataset includes individual metadata for 6 raw MinION reads, archived at GenBank's SRA under BioProject PRJNA1171011. This sequencing was conducted using crabs from Massachusetts waters.

Samples of Carcinus maenas (urn:lsid:marinespecies.org:taxname:107381) were collected between May 2022 and Aug 2022 from Massachusetts waters. 

Extractions were performed with an NEB Monarch HMW DNA Extraction Kit for Cells & Blood (May runs; low success) or a Circulomics Nanobind Tissue Big DNA kit (June and August runs; good success). Library prep was performed with an Oxford Nanopore Cas9 sequencing kit and custom Cas9 probes targeting putative inversion regions. May and June runs used the same first-round set of probes, while August runs used an updated second-round set of probes. Probe sequences are included in the supplemental file and can be cross-referenced using the probe_set value. Targeting is imperfect, however, so much of the data simply reflect non-targeted genome sequencing. Libraries were sequenced on a single flowcell of an Oxford Nanopore MinION Mk1C. For each round of sequencing (May, June, and August), the same library was run twice for 24 hours each time, with a flowcell flush at 24 hours. The run_day value captures this, listing the pre- and post-flush runs as run_days 1 and 2, respectively, for each single sample.
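
As a rough illustration of how the sequencing rounds, probe sets, and run_day values described above relate to one another, the following Python sketch enumerates the runs implied by the text. It assumes one library per round, which would be consistent with the six records mentioned in the abstract. Only run_day and probe_set are field names taken from this dataset; the other keys (month, extraction_kit) and the probe set labels "round_1" and "round_2" are hypothetical placeholders, not actual column names or values.

```python
from itertools import product

# Per-round details as described in the methods text.
# NOTE: "extraction_kit" is a hypothetical key, and the "round_1"/"round_2"
# labels are assumed stand-ins for the actual probe_set values.
ROUNDS = {
    "May": {
        "extraction_kit": "NEB Monarch HMW DNA Extraction Kit for Cells & Blood",
        "probe_set": "round_1",
    },
    "June": {
        "extraction_kit": "Circulomics Nanobind Tissue Big DNA kit",
        "probe_set": "round_1",
    },
    "August": {
        "extraction_kit": "Circulomics Nanobind Tissue Big DNA kit",
        "probe_set": "round_2",
    },
}

# run_day 1 = first 24-hour run (pre-flush); run_day 2 = second run (post-flush).
RUN_DAYS = (1, 2)


def expected_runs():
    """Return one record per (round, run_day) combination."""
    runs = []
    for month, run_day in product(ROUNDS, RUN_DAYS):
        record = {"month": month, "run_day": run_day}
        record.update(ROUNDS[month])
        runs.append(record)
    return runs


if __name__ == "__main__":
    for run in expected_runs():
        print(run)
```

Grouping records by probe_set in this way separates the May and June runs (first-round probes) from the August runs (second-round probes), which is the comparison needed when cross-referencing probe sequences in the supplemental file.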


Related Datasets

References

Dataset: http://www.ncbi.nlm.nih.gov/bioproject/PRJNA1171011
Woods Hole Oceanographic Institution. Carcinus maenas longread sequencing. 2024/10. In: BioProject [Internet]. Bethesda, MD: National Library of Medicine (US), National Center for Biotechnology Information; 2011-. Available from: http://www.ncbi.nlm.nih.gov/bioproject/PRJNA1171011. NCBI:BioProject: PRJNA1171011.

Related Publications

Methods

MinKNOW Technical Document. Document version: MITD_5000_v1_revAJ_16May2016 (2016). Oxford Nanopore Technologies. https://nanoporetech.com/document/minknow-tech-doc