Volume 17, Issue 20 p. 2337-2342
Research Article

PEAKS: powerful software for peptide de novo sequencing by tandem mass spectrometry

Bin Ma

Corresponding Author

Bin Ma

Department of Computer Science, University of Western Ontario, London, ON N6A 5B7, Canada

Department of Computer Science, University of Western Ontario, London, ON N6A 5B7, Canada.Search for more papers by this author
Kaizhong Zhang

Kaizhong Zhang

Department of Computer Science, University of Western Ontario, London, ON N6A 5B7, Canada

Search for more papers by this author
Christopher Hendrie

Christopher Hendrie

Bioinformatics Solutions Inc., Waterloo, ON N2L 3L2, Canada

Search for more papers by this author
Chengzhi Liang

Chengzhi Liang

Bioinformatics Solutions Inc., Waterloo, ON N2L 3L2, Canada

Search for more papers by this author
Ming Li

Ming Li

Department of Computer Science, University of Waterloo, Waterloo, ON N2L 3G1, Canada

Search for more papers by this author
Amanda Doherty-Kirby

Amanda Doherty-Kirby

Department of Biochemistry, University of Western Ontario, London, ON N6A 5C1, Canada

Search for more papers by this author
Gilles Lajoie

Gilles Lajoie

Department of Biochemistry, University of Western Ontario, London, ON N6A 5C1, Canada

Search for more papers by this author
First published: 19 September 2003
Citations: 1,012

Abstract

A number of different approaches have been described to identify proteins from tandem mass spectrometry (MS/MS) data. The most common approaches rely on the available databases to match experimental MS/MS data. These methods suffer from several drawbacks and cannot be used for the identification of proteins from unknown genomes. In this communication, we describe a new de novo sequencing software package, PEAKS, to extract amino acid sequence information without the use of databases. PEAKS uses a new model and a new algorithm to efficiently compute the best peptide sequences whose fragment ions can best interpret the peaks in the MS/MS spectrum. The output of the software gives amino acid sequences with confidence scores for the entire sequences, as well as an additional novel positional scoring scheme for portions of the sequences. The performance of PEAKS is compared with Lutefisk, a well-known de novo sequencing software, using quadrupole-time-of-flight (Q-TOF) data obtained for several tryptic peptides from standard proteins. Copyright © 2003 John Wiley & Sons, Ltd.