Shotgun Protein Sequencing by Tandem Mass Spectra Assembly
- Nuno Bandeira
- ,
- Haixu Tang
- ,
- Vineet Bafna
- , and
- Pavel Pevzner
Abstract
The analysis of mass spectrometry data is still largely based on identification of single MS/MS spectra and does not attempt to make use of the extra information available in multiple MS/MS spectra from partially or completely overlapping peptides. Analysis of MS/MS spectra from multiple overlapping peptides opens up the possibility of assembling MS/MS spectra into entire proteins, similarly to the assembly of overlapping DNA reads into entire genomes. In this paper, we present for the first time a way to detect, score, and interpret overlaps between uninterpreted MS/MS spectra in an attempt to sequence entire proteins rather than individual peptides. We show that this approach not only extends the length of reconstructed amino acid sequences but also dramatically improves the quality of de novo peptide sequencing, even for low mass accuracy MS/MS data.
*
To whom correspondance should be addressed. E-mail: [email protected].
Cited By
This article is cited by 41 publications.
- Kira Vyatkina, Si Wu, Lennard J. M. Dekker, Martijn M. VanDuijn, Xiaowen Liu, Nikola Tolić, Mikhail Dvorkin, Sonya Alexandrova, Theo M. Luider, Ljiljana Paša-Tolić, and Pavel A. Pevzner . De Novo Sequencing of Peptides from Top-Down Tandem Mass Spectra. Journal of Proteome Research 2015, 14 (11) , 4450-4462. https://doi.org/10.1021/pr501244v
- Xiaowen Liu, Lennard J. M. Dekker, Si Wu, Martijn M. Vanduijn, Theo M. Luider, Nikola Tolić, Qiang Kou, Mikhail Dvorkin, Sonya Alexandrova, Kira Vyatkina, Ljiljana Paša-Tolić, and Pavel A. Pevzner . De Novo Protein Sequencing by Combining Top-Down and Bottom-Up Tandem Mass Spectra. Journal of Proteome Research 2014, 13 (7) , 3241-3248. https://doi.org/10.1021/pr401300m
- Adrian Guthals, Karl R. Clauser, Ari M. Frank, and Nuno Bandeira . Sequencing-Grade De novo Analysis of MS/MS Triplets (CID/HCD/ETD) From Overlapping Peptides. Journal of Proteome Research 2013, 12 (6) , 2846-2857. https://doi.org/10.1021/pr400173d
- Yaoyang Zhang, Bryan R. Fonslow, Bing Shan, Moon-Chang Baek, and John R. Yates, III . Protein Analysis by Shotgun/Bottom-up Proteomics. Chemical Reviews 2013, 113 (4) , 2343-2394. https://doi.org/10.1021/cr3003533
- Maykel Cruz-Monteagudo, Humberto González-Díaz, Fernanda Borges, Elena Rosa Dominguez and M. Natália D.S. Cordeiro. 3D-MEDNEs: An Alternative “in Silico” Technique for Chemical Research in Toxicology. 2. Quantitative Proteome−Toxicity Relationships (QPTR) based on Mass Spectrum Spiral Entropy. Chemical Research in Toxicology 2008, 21 (3) , 619-632. https://doi.org/10.1021/tx700296t
- Yi Lu, Cheng Ge, Biao Cai, Qing Xu, Ren Kong, Shan Chang. Antibody sequences assembly method based on weighted de Bruijn graph. Mathematical Biosciences and Engineering 2023, 20 (4) , 6174-6190. https://doi.org/10.3934/mbe.2023266
- K.V. Vyatkina. De novo sequencing of proteins and peptides: algorithms, applications, perspectives. Biomedical Chemistry: Research and Methods 2018, 1 (1) , e00005. https://doi.org/10.18097/BMCRM00005
- Kira Vyatkina. De Novo Sequencing of Top-Down Tandem Mass Spectra: A Next Step towards Retrieving a Complete Protein Sequence. Proteomes 2017, 5 (4) , 6. https://doi.org/10.3390/proteomes5010006
- Natalie Castellana, Adrian Guthals. Antibody de novo Sequencing. 2017, 139-153. https://doi.org/10.1002/9781119384434.ch6
- Letu Qingge, Xiaowen Liu, Farong Zhong, Binhai Zhu. Filling a Protein Scaffold With a Reference. IEEE Transactions on NanoBioscience 2017, 16 (2) , 123-130. https://doi.org/10.1109/TNB.2017.2666780
- Xiaoyan Guan, Naomi C. Brownstein, Nicolas L. Young, Alan G. Marshall. Ultrahigh-resolution Fourier transform ion cyclotron resonance mass spectrometry and tandem mass spectrometry for peptide de novo amino acid sequencing for a seven-protein mixture by paired single-residue transposed Lys-N and Lys-C digestion. Rapid Communications in Mass Spectrometry 2017, 31 (2) , 207-217. https://doi.org/10.1002/rcm.7783
- Ngoc Hieu Tran, M. Ziaur Rahman, Lin He, Lei Xin, Baozhen Shan, Ming Li. Complete De Novo Assembly of Monoclonal Antibody Sequences. Scientific Reports 2016, 6 (1) https://doi.org/10.1038/srep31730
- Kira Vyatkina, Si Wu, Lennard J. M. Dekker, Martijn M. VanDuijn, Xiaowen Liu, Nikola Tolić, Theo M. Luider, Ljiljana Paša-Tolić, Pavel A. Pevzner. Top-down analysis of protein samples by de novo sequencing techniques. Bioinformatics 2016, 32 (18) , 2753-2759. https://doi.org/10.1093/bioinformatics/btw307
- Letu Qingge, Xiaowen Liu, Farong Zhong, Binhai Zhu. Filling a Protein Scaffold with a Reference. 2016, 175-186. https://doi.org/10.1007/978-3-319-38782-6_15
- Victor Corasolla Carregari, Jie Dai, Thiago Verano-Braga, Thalita Rocha, Luis Alberto Ponce-Soto, Sergio Marangoni, Peter Roepstorff. Revealing the functional structure of a new PLA2 K49 from Bothriopsis taeniata snake venom employing automatic “de novo” sequencing using CID/HCD/ETD MS/MS analyses. Journal of Proteomics 2016, 131 , 131-139. https://doi.org/10.1016/j.jprot.2015.10.020
- Adrian Guthals, Christina Boucher, Nuno Bandeira. The Generating Function Approach for Peptide Identification in Spectral Networks. Journal of Computational Biology 2015, 22 (5) , 353-366. https://doi.org/10.1089/cmb.2014.0165
- Adrian Guthals, Christina Boucher, Nuno Bandeira. The Generating Function Approach for Peptide Identification in Spectral Networks. 2014, 85-99. https://doi.org/10.1007/978-3-319-05269-4_7
- Adrian Guthals, Karl R. Clauser, Nuno Bandeira. Shotgun Protein Sequencing with Meta-contig Assembly. Molecular & Cellular Proteomics 2012, 11 (10) , 1084-1096. https://doi.org/10.1074/mcp.M111.015768
- Bin Ma, Richard Johnson. De Novo Sequencing and Homology Searching. Molecular & Cellular Proteomics 2012, 11 (2) , O111.014902. https://doi.org/10.1074/mcp.O111.014902
- Adrian Guthals, Jeramie D. Watrous, Pieter C. Dorrestein, Nuno Bandeira. The spectral networks paradigm in high throughput mass spectrometry. Molecular BioSystems 2012, 8 (10) , 2535. https://doi.org/10.1039/c2mb25085c
- Nuno Bandeira. Protein Identification by Spectral Networks Analysis. 2011, 151-168. https://doi.org/10.1007/978-1-60761-977-2_11
- Jian Wang, Josué Pérez-Santiago, Jonathan E. Katz, Parag Mallick, Nuno Bandeira. Peptide Identification from Mixture Tandem Mass Spectra. Molecular & Cellular Proteomics 2010, 9 (7) , 1476-1485. https://doi.org/10.1074/mcp.M000136-MCP201
- Bin Ma. Challenges in Computational Analysis of Mass Spectrometry Data for Proteomics. Journal of Computer Science and Technology 2010, 25 (1) , 107-123. https://doi.org/10.1007/s11390-010-9309-1
- Xiaowen Liu, Yonghua Han, Denis Yuen, Bin Ma. Automated protein (re)sequencing with MS/MS and a homologous database yields almost full coverage and accuracy. Bioinformatics 2009, 25 (17) , 2174-2180. https://doi.org/10.1093/bioinformatics/btp366
- Deborah Penque. Two‐dimensional gel electrophoresis and mass spectrometry for biomarker discovery. PROTEOMICS – Clinical Applications 2009, 3 (2) , 155-172. https://doi.org/10.1002/prca.200800025
- Nuno Bandeira, Jesper V. Olsen, Matthias Mann, Pavel A. Pevzner. Multi-spectra peptide sequencing and its applications to multistage mass spectrometry. Bioinformatics 2008, 24 (13) , i416-i423. https://doi.org/10.1093/bioinformatics/btn184
- Jainab Khatun, Eric Hamlett, Morgan C. Giddings. Incorporating sequence information into the scoring function: a hidden Markov model for improved peptide identification. Bioinformatics 2008, 24 (5) , 674-681. https://doi.org/10.1093/bioinformatics/btn011
- Nuno Bandeira, Julio Ng, Dario Meluzzi, Roger G. Linington, Pieter Dorrestein, Pavel A. Pevzner. De Novo Sequencing of Nonribosomal Peptides. 2008, 181-195. https://doi.org/10.1007/978-3-540-78839-3_16
- Jonas Grossmann, Bernd Fischer, Katja Baerenfaller, Judith Owiti, Joachim M. Buhmann, Wilhelm Gruissem, Sacha Baginsky. A workflow to increase the detection rate of proteins from unsequenced organisms in high‐throughput proteomics experiments. PROTEOMICS 2007, 7 (23) , 4245-4254. https://doi.org/10.1002/pmic.200700474
- Kristian Flikka, Jeroen Meukens, Kenny Helsens, Joël Vandekerckhove, Ingvar Eidhammer, Kris Gevaert, Lennart Martens. Implementation and application of a versatile clustering tool for tandem mass spectrometry data. PROTEOMICS 2007, 7 (18) , 3245-3258. https://doi.org/10.1002/pmic.200700160
- Nuno Bandeira, Karl R. Clauser, Pavel A. Pevzner. Shotgun Protein Sequencing. Molecular & Cellular Proteomics 2007, 6 (7) , 1123-1134. https://doi.org/10.1074/mcp.M700001-MCP200
- Lijuan Mo, Debojyoti Dutta, Yunhu Wan, Ting Chen. MSNovo: A Dynamic Programming Algorithm for de Novo Peptide Sequencing via Tandem Mass Spectrometry. Analytical Chemistry 2007, 79 (13) , 4870-4878. https://doi.org/10.1021/ac070039n
- Nuno Bandeira. Spectral networks: a new approach to de novo discovery of protein sequences and posttranslational modifications. BioTechniques 2007, 42 (6) , 687-695. https://doi.org/10.2144/000112487
- Nuno Bandeira, Dekel Tsur, Ari Frank, Pavel A. Pevzner. Protein identification by spectral networks analysis. Proceedings of the National Academy of Sciences 2007, 104 (15) , 6140-6145. https://doi.org/10.1073/pnas.0701130104
- Jainab Khatun, Kevin Ramkissoon, Morgan C. Giddings. Fragmentation Characteristics of Collision-Induced Dissociation in MALDI TOF/TOF Mass Spectrometry. Analytical Chemistry 2007, 79 (8) , 3032-3040. https://doi.org/10.1021/ac061455v
- Nuno Bandeira, Dekel Tsur, Ari Frank, Pavel Pevzner. A New Approach to Protein Identification. 2006, 363-378. https://doi.org/10.1007/11732990_31
- Yunhu Wan, Austin Yang, Ting Chen. PepHMM: A Hidden Markov Model Based Scoring Function for Mass Spectrometry Database Search. Analytical Chemistry 2006, 78 (2) , 432-437. https://doi.org/10.1021/ac051319a
- . Current literature in mass spectrometry. Journal of Mass Spectrometry 2005, 1110-1121. https://doi.org/10.1002/jms.809
- Jean-Philippe Lambert, Martin Ethier, Jeffrey C. Smith, Daniel Figeys. Proteomics: from Gel Based to Gel Free. Analytical Chemistry 2005, 77 (12) , 3771-3788. https://doi.org/10.1021/ac050586d
- D. Tsur, S. Tanner, E. Zandi, V. Bafna, P.A. Pevzner. Identification of post-translational modifications via blind search of mass-spectra. 2005, 157-166. https://doi.org/10.1109/CSB.2005.34
- Eckhard Nordhoff, Hans Lehrach. Identification and Characterization of DNA-Binding Proteins by Mass Spectrometry. , 111-195. https://doi.org/10.1007/10_2006_037