Contributors | Affiliation | Role |
---|---|---|
Saito, Mak A. | Woods Hole Oceanographic Institution (WHOI) | Principal Investigator |
Ake, Hannah | Woods Hole Oceanographic Institution (WHOI BCO-DMO) | BCO-DMO Data Manager |
York, Amber D. | Woods Hole Oceanographic Institution (WHOI BCO-DMO) | BCO-DMO Data Manager |
These data are part of the Ocean Protein Portal "MetZyme 0.2" dataset version 5 (https://proteinportal.whoi.edu/; Saito et al., 2020).
Ocean metaproteomes were sampled and extracted as described in Saito et al., 2014. The protocols are reproduced here:
Samples were collected during the KM1128 METZYME research expedition (Metals and Enzymes in the Pacific) on the R/V Kilo Moana October 1-25, 2011 from Oahu, Hawaii to Apia, Samoa, with Carl Lamborg and Mak Saito as Chief Scientists. Microbial biomass for protein analyses was collected on vertical profiles using in situ high volume particle filtration pumps with a focus on the North Pacific and Equatorial regions. Protein samples were collected by a suite of 4 L/min and 8 L/min McLane Pumps (WTS-LV; McLane Research Laboratories Inc., Falmouth MA, USA) outfitted with custom Mini-MULVS multiple filter head systems. The 0.2-3.0 micron size fraction was collected on 142 mm filters (Supor, Pall Corp.) for analyses used in this study. The volume of water was pumped until a minimal flow rate was achieved or the allotted cast time period expired, typically ~300-500 L. Filters were sectioned immediately after pump retrieval, and protein samples (¼ filter) were stored in RNAlater reagent (Ambion, Life Technologies), which has been shown to be an effective preservative for cyanobacterial biomass (Saito et al., 2011), frozen at -80C, transported back to the laboratory on dry ice, and stored at -80C until analysis.
Total microbial protein (0.2-3.0um fraction) was extracted using detergent-based methods, described below. Total protein showed enhanced concentrations in the photic zone, particularly in the Equatorial and South Pacific portions of the transect (Fig. S1F). For protein extraction, samples were thawed and the filter and RNAlater preservative (Ambion Life Technologies) were separated. The removed preservative was spin-concentrated by a 5K MWCO membrane (Sartorius Stedim Biotech 6 mL, 5 K MWCO Vivaspin units; Goettingen, Germany), and rinsed with 0.1M Tris buffer to recover, desalt, and concentrate any suspended material. The sample filter was unfolded and placed in a larger tube to which 1% SDS extraction buffer (1% SDS, 0.1M Tris/HCL pH 7.5, 10mM EDTA) and the rinsed/desalted RNAlater fraction was added back. Each sample was incubated at room temperature for 15 minutes, heated at 95C for 10 minutes, and shaken at room temperature for 350 rpm for 1 h. The protein extract was decanted and placed in a new tube and centrifuged for 30 min x 3220 g at room temperature. The supernatants were removed and filtered through a 5 um low protein binding syringe filter (Fisher Scientific), and the filter rinsed with extraction buffer. The extracts were concentrated by 5 kD membrane centrifugation to a small volume, washed with extraction buffer, and concentrated again. Each sample was precipitated with cold 50% methanol (MeOH) 50% acetone 0.5 mM HCL for 3 days at –20C, centrifuged at 14100 x g (14500 rpm) for 30 m at 4C, decanted and dried by vacuum concentration (Thermo Savant Speedvac) for 10 min or until dry. Pellets were resuspended in 1% SDS extraction buffer and left at room temperature (RT) for 1 h to redissolve. Total protein was quantified (Bio-Rad DC protein assay, Hercules, CA) with BSA as a standard.
Extracted proteins were purified from SDS detergent, reduced, alkylated, and trypsin digested, while embedded within a polyacrylamide tube gel, using a modified protocol from a previously published method (38). The tube gel approach allowed all proteins including membrane proteins to be solubilized by detergent and purified while immobilized in the gel matrix. A gel premix was made by combining 1 M Tris HCL (pH 7.5) and 40% Bis-acrylimide L 29:1 (Acros Organics) at a ratio of 1:3. The premix (103 ul) was combined with an extracted protein sample (usually 25 ug-200 ug), TE, 7 ul 1% APS and 3 ul of TEMED (Acros Organics) to a final volume of 200 ul. After 1 h of polymerization at RT, 200 ul of gel fix solution (50% ETOH, 10% acetic acid in LC/MS grade water) was added to the top of the gel and incubated at RT for 20 minutes. Liquid was then removed and the tube gel was transferred into a new 1.5 mL microtube.
Data Version 4 (2021-03-30) replaces data version 2 (2018-05-21). The intermediate version 3 was never made public.
Uses updated station information (dataset 730891 version 3) joined with the submitted file METZYME_V1_peptides_050917.xlsx. Also several transformation and find and replace operations performed as described below.
Sheet "Final" imported into the BCO-DMO data system from file Metzyme_Proteins_V1_targetdecoyPSMvalidator_030918.xlsx
N/A interpreted as no data value
skipped row that started "END"
Columns renamed to fit the OPP template (see https://github.com/oceanproteinportal/data-file-templates).
Added minimum_filter_size_microns column with value '0.2'
Added maximum_filter_size_microns with 'value 3.0'
Added station information from dataset 730891 version 3 using a join operation station and depth as keys. Added dates times (local and UTC), and lat, lon.
file saved as csv format.
{MS/MS sample name}
*Needed sample name with the number 0 padded to match other sample ids in station info and protein data. Created new column for sample_id.
v2 notes:
- Replaced blank cells with nd
- Replaced #N/A with NA
- Replaced commas in cells with ;
- Added cruise number column
Parameter | Description | Units |
cruise | Cruise deployment number | unitless |
station | Station number | unitless |
depth | Depth of sampling event | meters |
MS_MS_sample_name | RAW file name - crossreferences to proteomexchange or OPP repository | unitless |
protein_ID_num | Metagenome ID - needed to map to protein IDs. | unitless |
protein_accession_num | Duplicate of Protein ID but can contain multiple entries. | unitless |
protein_molecular_weight | Molecular weight | kDA |
peptide_sequence | Unique peptide sequence - most unique identifier in dataset | unitless |
best_peptide_identification_probability | Metric of peptide quality | unitless |
best_SEQUEST_Xcorr_score | Metric of peptide quality | unitless |
best_SEQUEST_DCn_score | Metric of peptide quality | unitless |
number_of_identified_pos2H_spectra | Number of identified +2H spectra | unitless |
number_of_identified_pos3H_spectra | Number of identified +3H spectra | unitless |
number_of_identified_pos4H_spectra | Number of identified +4H spectra | unitless |
sum_spectral_counts | Sum of +2+3 +4 data = total unnormalized spectral counts - Quantitative Value | unitless |
median_retention_time | Median retention time | minutes |
total_precursor_intensity | Quantitative Value (2 of 3) | unitless |
total_TIC | Quantitative Value (3 of 3) | unitless |
peptide_start_index | Data useful in creating peptide coverage map for WHO page | unitless |
peptide_stop_index | Data useful in creating peptide coverage map for WHO page | unitless |
lat | Latitude | decimal degrees |
lon | Longitude | decimal degrees |
time_local | Local time of sampling at station and depth (HST, UTC-10) | unitless |
min_filter_size | Minimum filter size | microns |
max_filter_size | Maximum filter size | microns |
Dataset-specific Instrument Name | 5 kD membrane centrifuge |
Generic Instrument Name | Centrifuge |
Dataset-specific Description | Used to concentrate the extracts |
Generic Instrument Description | A machine with a rapidly rotating container that applies centrifugal force to its contents, typically to separate fluids of different densities (e.g., cream from milk) or liquids from solids. |
Dataset-specific Instrument Name | McLane Pumps (WTS-LV; McLane Research Laboratories Inc., Falmouth MA, USA) |
Generic Instrument Name | McLane Pump |
Dataset-specific Description | Used to collect microbial biomass for protein analyses |
Generic Instrument Description | McLane pumps sample large volumes of seawater at depth. They are attached to a wire and lowered to different depths in the ocean. As the water is pumped through the filter, particles suspended in the ocean are collected on the filters. The pumps are then retrieved and the contents of the filters are analyzed in a lab. |
Website | |
Platform | R/V Kilo Moana |
Start Date | 2011-10-01 |
End Date | 2011-10-25 |
Description | This is a MetZyme project cruise. The original cruise data are available from the NSF R2R data catalog. |
MetZyme project researchers will determine the role of enzymatic activity in the cycling of trace metals. Specifically the research will address the following questions: (1) degradation of sinking particulate organic material in the Tropical North Pacific can be influenced by the ability of microbes to synthesize zinc proteases, which in turn is controlled by the abundance or availability of zinc, and (2) methylation of mercury is controlled, in part, by the activity of cobalt-containing enzymes, and therefore the supply of labile cobalt to the corrinoid-containing enzymes or co-factors responsible for methylation. To attain their goal, they will collect dissolved and particulate samples for trace metals and metalloenzymes from three stations along a biogeochemical gradient in the Tropical North Pacific (along 150 degrees West from 18 degrees North to the equator). Sinking particles from metal clean sediment traps will also be obtained. The samples will also be used to carry out shipboard incubation experiments using amendments of metals, metal-chelators, B12, and proteases to examine the sensitivity and metal limitation of heterotrophic, enzymatic degradation of organic matter within the oceanic "Twilight Zone" (100-500 m). This study will result in a novel metaproteomic/metalloenzyme datasets that should provide insights into the biogeochemical cycling of metals, as well as co-limitation of primary productivity and controls on the export of carbon from the photic zone. In addition to the final data being contributed to BCO-DMO, an online metaproteomic data server will be created so the community has access to the raw data files generated by this research.
Description from NSF award abstract:
This study will employ a novel combination of experimental evolution techniques and state-of-the-art molecular methods to yield unique insights into adaptive changes in the keystone marine cyanobacteria Trichodesmium and Crocosphaera in response to selection by high CO2. Several studies have suggested that N2-fixation rates of the biogeochemically-critical cyanobacteria Trichodesmium and Crocosphaera may increase dramatically in the future high CO2 ocean, but these have all used the same limited set of cultured isolates and considered cells only briefly acclimated to elevated CO2. The investigator's new results, however, demonstrate that a broad diversity of high- and low-CO2 adapted ecotypes exists within each diazotroph genus. Furthermore, in a preliminary four year experimental evolution study with Trichodesmium, the PIs observed large adaptive responses following 500-700 generations of selection by high CO2- but in a completely unexpected way. All of the six replicate high CO2-adapted cell lines exhibited strong constitutive up-regulation of N2 fixation rates. These very elevated N2 fixation rates continued, even though the cultures have were switched back to low-CO2 conditions for many months. Expression of the nif operon and N assimilatory genes was also up-regulated in these cell lines, as is expression of many intergenic regions of the genome.
The investigators hypothesize that constitutive up-regulation of cellular N2 fixation systems may be a common adaptive response of both Trichodesmium and Crocosphaera under extended selection by elevated CO2. This project will test this hypothesis in a four-year experimental evolution study to determine the adaptive responses of both high- and low-CO2 specialized ecotypes of these two diazotrophs to increased CO2.
The investigators will grow representative high- and low-CO2 adapted ecotypes from each genus in well-replicated cell lines at 380 ppm and 750 ppm CO2 for up to 1000 generations. Periodically, they will perform "switch" experiments to measure N2 and CO2 fixation rates and growth rates of high CO2-selected cell lines grown briefly (one week) at low CO2, and vice versa. These switch experiments will allow screening for cell lines which exhibit adaptive changes in phenotypically-expressed rate parameters, such as those observed in the preliminary Trichodesmium study. Evolutionary mechanisms in the CO2-selected cell lines will be examined by comparison of changes in their genomes, transcriptomes, and proteomes over time relative to reference genomes, using frozen samples archived monthly during the preceding selection period. Examination of these molecular and biochemical changes will be coordinated with an in-depth array of physiological and biogeochemical analyses. This combined approach will allow an evaluation of potential adaptive mechanisms in diazotrophic cyanobacteria ranging from indel, duplication, single nucleotide polymorphism, and transposition mutations to altered putative non-coding RNA expression, protein expression, and post-translational protein modifications, and then allow the investigators to link these mechanisms directly with their potential impacts on ecosystem-level biogeochemical processes like N2 and CO2 fixation. Finally, the research team will determine how long term selection by high CO2 affects the iron and phosphorus requirements of Trichodesmium and Crocosphaera, since constitutive up-regulation of N2 fixation would also have major implications for limitation of diazotrophs by these two critical nutrients in the future high CO2 ocean.
GEOTRACES is a SCOR sponsored program; and funding for program infrastructure development is provided by the U.S. National Science Foundation.
GEOTRACES gained momentum following a special symposium, S02: Biogeochemical cycling of trace elements and isotopes in the ocean and applications to constrain contemporary marine processes (GEOSECS II), at a 2003 Goldschmidt meeting convened in Japan. The GEOSECS II acronym referred to the Geochemical Ocean Section Studies To determine full water column distributions of selected trace elements and isotopes, including their concentration, chemical speciation, and physical form, along a sufficient number of sections in each ocean basin to establish the principal relationships between these distributions and with more traditional hydrographic parameters;
* To evaluate the sources, sinks, and internal cycling of these species and thereby characterize more completely the physical, chemical and biological processes regulating their distributions, and the sensitivity of these processes to global change; and
* To understand the processes that control the concentrations of geochemical species used for proxies of the past environment, both in the water column and in the substrates that reflect the water column.
GEOTRACES will be global in scope, consisting of ocean sections complemented by regional process studies. Sections and process studies will combine fieldwork, laboratory experiments and modelling. Beyond realizing the scientific objectives identified above, a natural outcome of this work will be to build a community of marine scientists who understand the processes regulating trace element cycles sufficiently well to exploit this knowledge reliably in future interdisciplinary studies.
Expand "Projects" below for information about and data resulting from individual US GEOTRACES research projects.
Funding Source | Award |
---|---|
NSF Division of Ocean Sciences (NSF OCE) | |
Gordon and Betty Moore Foundation: Marine Microbiology Initiative (MMI) |