Shotgun Protein Sequencing by Tandem Mass Spectra Assembly

Computer Science and Engineering Department, University of California, San Diego, Department 0114, 9500 Gilman Drive, La Jolla, California 92093-0114
Cite this: Anal. Chem. 2004, 76, 24, 7221–7233
Publication Date (Web):November 12, 2004
https://doi.org/10.1021/ac0489162
Copyright © 2004 American Chemical Society


    Abstract

    The analysis of mass spectrometry data is still largely based on identification of single MS/MS spectra and does not attempt to make use of the extra information available in multiple MS/MS spectra from partially or completely overlapping peptides. Analysis of MS/MS spectra from multiple overlapping peptides opens up the possibility of assembling MS/MS spectra into entire proteins, similarly to the assembly of overlapping DNA reads into entire genomes. In this paper, we present for the first time a way to detect, score, and interpret overlaps between uninterpreted MS/MS spectra in an attempt to sequence entire proteins rather than individual peptides. We show that this approach not only extends the length of reconstructed amino acid sequences but also dramatically improves the quality of de novo peptide sequencing, even for low mass accuracy MS/MS data.
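    The idea of detecting and scoring an overlap between two uninterpreted spectra can be illustrated with a toy sketch. This is not the authors' algorithm: it assumes each spectrum is reduced to an idealized list of prefix masses, and the names `count_matches`, `best_shift`, and the tolerance `tol` are hypothetical. It tries every mass shift implied by a pair of peaks and keeps the shift that aligns the most peaks.

```python
def count_matches(spec_a, spec_b, shift, tol=0.5):
    """Count peaks of spec_b that fall within tol of a peak of spec_a after shifting."""
    return sum(
        any(abs(a - (b + shift)) <= tol for a in spec_a)
        for b in spec_b
    )


def best_shift(spec_a, spec_b, tol=0.5):
    """Return (shift, matched) for the candidate shift aligning the most peaks.

    Candidate shifts are all pairwise differences a - b (a toy, quadratic search).
    """
    best = (0.0, 0)
    for a in spec_a:
        for b in spec_b:
            shift = a - b
            matched = count_matches(spec_a, spec_b, shift, tol)
            if matched > best[1]:
                best = (shift, matched)
    return best


# Idealized prefix masses (assumed, noise-free) for two overlapping peptides:
# peptide A = G-A-S-L, peptide B = S-L-V-K, sharing the residues S-L.
spec_a = [57.02, 128.06, 215.09, 328.17]
spec_b = [87.03, 200.11, 299.18, 427.27]

shift, matched = best_shift(spec_a, spec_b)
print(round(shift, 2), matched)
```

    In this toy case the best shift equals the mass of the residues that precede the overlap in peptide A, which is the kind of signal an assembly procedure could use to place one spectrum relative to another. Real spectra contain noise, missing peaks, and both prefix and suffix ions, so a practical scoring scheme would need to be considerably more careful.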


    * To whom correspondence should be addressed. E-mail: [email protected].

    Supporting Information Available

    Additional information as noted in text. This material is available free of charge via the Internet at http://pubs.acs.org.


