The WWWH of remote homolog detection: The state of the art

OpenURL Placeholder Text

Baker

D

.

Prediction and design of macromolecular structures and interactions

,

Phil Trans R Soc B

,

2006

, vol.

361

(pg.

495

-

63

)

Crossref

PubMed

Ringo

J

. ,

Fundamental Genetics.

,

2005

Cambridge

Cambridge University Press

OpenURL Placeholder Text

Holm

L

,

Sander

C

.

Mapping the protein universe

,

Science

,

1996

, vol.

273

(pg.

595

-

603

)

Crossref

PubMed

Pearsnon

RW

,

Sierk

ML

.

The limits of protein sequence comparison?

,

Curr Opin Struct Biol

,

2005

, vol.

15

(pg.

254

-

60

)

Crossref

PubMed

Copley

RR

,

Russell

RB

,

Ponting

CP

.

Sialidase-like Asp-boxes: sequence-similar structures within different protein folds

,

Protein Sci

,

2001

, vol.

10

(pg.

285

-

292

)

Crossref

PubMed

Lupas

AN

,

Ponting

CP

,

Russell

RB

.

On the evolution of protein folds: are similar motifs in different protein folds the result of convergence, insertion, or relics of an ancient peptide world?

,

J Struct Biol

,

2001

, vol.

13

(pg.

191

-

203

)

Crossref

Andreeva

A

,

Howorth

D

,

Brenner

SE

, et al.

SCOP database in 2004: refinements integrate structure and sequence family data

,

Nucleic Acids Res

,

2004

, vol.

32

(pg.

D226

-

9

)

Crossref

PubMed

Lichtarge

O

.

Getting past appearances: the many-fold consequences of remote homology

,

Nat Struct Biol

,

2001

, vol.

8

(pg.

918

-

20

)

Crossref

PubMed

Dietmann

S

,

Fernandez-Fuentes

N

,

Holm

L

.

Automated detection of remote homology

,

Curr Opin Struct Biol

,

2002

, vol.

12

(pg.

362

-

7

)

Crossref

PubMed

Wan

XF

,

Xu

D

.

Computational methods for remote homolog identification

,

Curr Protien Pet Sci

,

2005

, vol.

6

(pg.

527

-

46

)

Crossref

Smith

TS

,

Waterman

MS

.

Identification of common molecular subsequences

,

J Mol Biol

,

1981

, vol.

147

(pg.

195

-

7

)

Crossref

PubMed

Needleman

SB

,

Wunsch

CD

.

A general method applicable to the search for similarities in the amino acid sequence of two proteins

,

J Mol Biol

,

1970

, vol.

48

(pg.

443

-

53

)

Crossref

PubMed

Altschul

SF

,

Gish

W

,

Miller

W

, et al.

Basic local alignment search tool

,

J Mol Biol

,

1999

, vol.

215

(pg.

403

-

10

)

Crossref

Pearson

WR

,

Lipman

DJ

.

Improved tools for biological sequence comparison

,

Proc Natl Acad Sci USA

,

1988

, vol.

85

(pg.

2444

-

8

)

Crossref

PubMed

Gribskov

M

,

McLachlan

AD

,

Eisenberg

D

.

Profile analysis: detection of distantly related proteins

,

Proc Natl Acad Sci USA

,

1987

, vol.

84

(pg.

4355

-

8

)

Crossref

PubMed

Schaffer

AA

,

Aravind

L

,

Madden

TL

, et al.

Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements

,

Nucleic Acids Res

,

2001

, vol.

29

(pg.

2994

-

3005

)

Crossref

PubMed

Aravind

L

,

Koonin

EV

.

Gleaning non-trivial structural, functional and evolutionary information about proteins by iterative database searches

,

J Mol Biol

,

1999

, vol.

287

(pg.

1023

-

40

)

Crossref

PubMed

Schaffer

AA

,

Wolf

YI

,

Ponting

CP

, et al.

IMPALA: matching a protein sequence against a collection of PSI-BLAST-constructed position-specific score matrices

,

Bioinformatics

,

1999

, vol.

15

(pg.

1000

-

11

)

Crossref

PubMed

Sadreyev

R

,

Grishin

NV

.

COMPASS: a tool for comparison of multiple protein alignments with assessment of statistical significance

,

J Mol Biol

,

2003

, vol.

326

(pg.

317

-

36

)

Crossref

PubMed

Pietrokovski

S

.

Searching databases of conserved sequence regions by aligning protein multiple-alignments

,

Nucleic Acids Res

,

1996

, vol.

24

(pg.

3836

-

45

)

Crossref

PubMed

Rychlewski

L

,

Jaroszewski

L

,

Li

W

,

Godzik

A

.

Comparison of sequence profiles. Strategies for structural predictions using sequence information’

,

Protein Sci

,

2000

, vol.

9

(pg.

232

-

41

)

Crossref

PubMed

Edgar

RC

,

Sjolander

K

.

COACH: profile-profile alignment of protein families using hidden Markov models

,

Bioinformatics

,

2004

, vol.

20

(pg.

1309

-

18

)

Crossref

PubMed

Yona

G

,

Levitt

M

.

Within the twilight zone: a sensitive profile-profile comparison tool based on information theory

,

J Mol Biol

,

2002

, vol.

315

(pg.

1257

-

75

)

Crossref

PubMed

Capriotti

E

,

Fariselli

P

,

Rossi

I

, et al.

A Shannon entropy-based filter detects high-quality profile-profile alignments in searches for remote homologues

,

Proteins

,

2004

, vol.

54

(pg.

351

-

60

)

Crossref

PubMed

Sadreyev

RI

,

Baker

D

,

Grishin

NV

.

Profile-profile comparisons by COMPASS predict intricate homologies between protein families

,

Protein Sci

,

2003

, vol.

12

(pg.

2262

-

72

)

Crossref

PubMed

Bowie

JU

,

Luthy

R

,

Eisenberg

D

.

A method to identify protein sequences that fold into a known three-dimensional structure

,

Science

,

1991

, vol.

253

(pg.

164

-

70

)

Crossref

PubMed

Rost

B

,

Schneider

R

,

Sander

C

.

Protein fold recognition by prediction-based threading

,

J Mol Biol

,

1997

, vol.

270

(pg.

471

-

80

)

Crossref

PubMed

Wallqvist

A

,

Fukunishi

Y

,

Murphy

LR

, et al.

Iterative sequence/secondary structure search for protein homologs

,

Bioinformatics

,

2000

, vol.

16

(pg.

988

-

1002

)

Crossref

PubMed

Geourjon

C

,

Combet

C

,

Blanchet

C

, et al.

Identification of related proteins with weak sequence identity using secondary structure information

,

Protein Sci

,

2001

, vol.

10

(pg.

788

-

97

)

Crossref

PubMed

Yet Another Alignment Program (Pairwise Sequence Alignment Using Secondary Structures) http://gpcr.biocomp.unibo.it/oldpredictors/prototypes.html

Sippl

MJ

,

Weitckus

S

.

Detection of native-like models for amino acid sequences of unknown three-dimensional structure in a data base of known protein conformations

,

Proteins

,

1992

, vol.

13

(pg.

258

-

71

)

Crossref

PubMed

Jones

DT

,

Taylor

WR

,

Thornton

JM

.

A new approach to protein fold recognition

,

Nature

,

1992

, vol.

358

(pg.

86

-

9

)

Crossref

PubMed

Xu

Y

,

Xu

D

,

Uberbacher

EC

.

An efficient computational method for globally optimal threading

,

J Comput Biol

,

1998

, vol.

5

(pg.

597

-

614

)

Crossref

PubMed

Kelley

LA

,

MacCallum

RM

,

Sternberg

MJ

.

Enhanced genome annotation using structural profiles in the program 3D-PSSM

,

J Mol Biol

,

2000

, vol.

299

(pg.

499

-

520

)

Crossref

PubMed

Alexandrov

NN

,

Nussinov

R

,

Zimmer

RM

.

Hunter

Lawrence

,

Klein

Teri E

.

Fast protein fold recognition via sequence to structure alignment and contact capacity potentials

,

Pacific Symposium on Biocomputing ‘96.

,

1996

Singapore

World Scientific Publishing Co

(pg.

53

-

72

)

OpenURL Placeholder Text

Ginalski

K

,

Pas

J

,

Wyrwicz

LS

, et al.

ORFeus: Detection of distant homology using sequence profiles and predicted secondary structure

,

Nucleic Acids Res

,

2003

, vol.

31

(pg.

3804

-

7

)

Crossref

PubMed

Smith

TF

,

Lo Conte

L

,

Bienkowska

J

, et al.

Current limitations to protein threading approaches

,

J Comput Biol

,

1997

, vol.

4

(pg.

217

-

25

)

Crossref

PubMed

Xu

J

,

Li

M

,

Kim

D

, et al.

RAPTOR: Optimal Protein Threading by Linear Programming

,

J Bioinform Comput Biol

,

2003

, vol.

1

(pg.

95

-

117

)

Crossref

PubMed

Vapnik

V

. ,

Statistical Learning Theory.

,

1998

New York

Wiley

OpenURL Placeholder Text

Eddy

SR

.

Profile hidden Markov models

,

Bioinformatics

,

1998

, vol.

14

(pg.

755

-

63

)

Crossref

PubMed

Krogh

A

,

Brown

M

,

Mian

IS

, et al.

Hidden Markov models in computational biology. Applications to protein modelling

,

J Mol Biol

,

1994

, vol.

235

(pg.

1501

-

31

)

Crossref

PubMed

Karplus

K

,

Barrett

C

,

Hughey

R

.

Hidden Markov models for detecting remote protein homologies

,

Bioinformatics

,

1998

, vol.

14

(pg.

846

-

56

)

Crossref

PubMed

Loytynoja

A

,

Milinkovitch

MC

.

A hidden Markov model for progressive multiple alignment

,

Bioinformatics

,

2003

, vol.

19

(pg.

1505

-

513

)

Crossref

PubMed

Wistrand

M

,

Sonnhammer

EL

.

Improving profile HMM discrimination by adapting transition probabilities

,

J Mol Biol

,

2004

, vol.

338

(pg.

847

-

54

)

Crossref

PubMed

Finn

RD

,

Mistry

J

,

Schuster-Bockler

B

, et al.

Pfam: clans, web tools and services

,

Nucleic Acids Res

,

2006

, vol.

34

(pg.

D247

-

51

)

Crossref

PubMed

McGuffin

LJ

,

Jones

DT

.

Improvement of the GenTHREADER method for genomic fold recognition

,

Bioinformatics

,

2003

, vol.

19

(pg.

874

-

81

)

Crossref

PubMed

Huang

YM

,

Bystroff

C

.

Improved pairwise alignments of proteins in the Twilight Zone using local structure predictions

,

Bioinformatics

,

2006

, vol.

22

(pg.

413

-

22

)

Crossref

PubMed

Karplus

K

,

Katzman

S

,

Shackleford

G

, et al.

SAM-T04: what is new in protein-structure prediction for CASP6

,

Proteins

,

2005

, vol.

61

(pg.

135

-

42

)

Crossref

PubMed

Jaakkola

T

,

Diekhans

M

,

Haussler

D

.

A discriminative framework for detecting remote protein homologies

,

J Comput Biol

,

2000

, vol.

7

(pg.

95

-

114

)

Crossref

PubMed

Leslie

C

,

Eskin

E

,

Noble

WS

.

The spectrum kernel: a string kernel for SVM protein classification

,

Pac Symp Biocomput

,

2002

, vol.

7

(pg.

564

-

575

)

OpenURL Placeholder Text

Ben-Hur

A

,

Brutlag

D

.

Remote homology detection: a motif based approach

,

Bioinformatics

,

2003

, vol.

19

(pg.

i26

-

33

)

Crossref

PubMed

Liao

L

,

Noble

WS

.

Combining pairwise sequence similarity and support vector machines for detecting remote protein evolutionary and structural relationships

,

J Comput Biol

,

2003

, vol.

10

(pg.

857

-

68

)

Crossref

PubMed

Hou

Y

,

Hsu

W

,

Lee

ML

, et al.

Remote homolog detection using local sequence-structure correlations

,

Proteins

,

2004

, vol.

57

(pg.

518

-

30

)

Crossref

PubMed

Saigo

H

,

Vert

JP

,

Ueda

N

, et al.

Protein homology detection using string alignment kernels

,

Bioinformatics

,

2004

, vol.

20

(pg.

1682

-

9

)

Crossref

PubMed

Rangwala

H

,

Karypis

G

.

Profile-based direct kernels for remote homology detection and fold recognition

,

Bioinformatics

,

2005

, vol.

21

(pg.

4239

-

47

)

Crossref

PubMed

Kinch

LN

,

Wrabl

JO

,

Krishna

SS

, et al.

CASP5 assessment of fold recognition target predictions

,

Proteins

,

2003

, vol.

53

(pg.

395

-

409

)

Crossref

PubMed

Cheng

J

,

Baldi

P

.

A machine learning information retrieval approach to protein fold recognition

,

Bioinformatics

,

2006

, vol.

22

(pg.

1456

-

63

)

Crossref

PubMed

Wang

G

,

Jin

Y

,

Dunbrack

RL

Jr

.

Assessment of fold recognition predictions in CASP6

,

Proteins

,

2005

, vol.

61

(pg.

46

-

66

)

Crossref

PubMed

Wallner

B

,

Elofsson

A

.

Pcons5: combining consensus, structural evaluation and fold recognition scores

,

Bioinformatics

,

2005

, vol.

21

(pg.

4248

-

54

)

Crossref

PubMed

Ginalski

K

,

Elofsson

A

,

Fischer

D

, et al.

3D-Jury: a simple approach to improve protein structure predictions

,

Bioinformatics

,

2003

, vol.

19

(pg.

1015

-

8

)

Crossref

PubMed

Kurowski

MA

,

Bujnicki

JM

.

GeneSilico protein structure prediction meta-server

,

Nucleic Acids Res

,

2003

, vol.

31

(pg.

3305

-

7

)

Crossref

PubMed

Fischer

D

.

3DS3 and 3DS5 3D-SHOTGUN Meta-Predictors in CAFASP3

,

Proteins

,

2003

, vol.

53

(pg.

517

-

23

)

Crossref

PubMed

Bujnicki

JM

,

Elofsson

A

,

Fischer

D

, et al.

Structure prediction meta server

,

Bioinformatics

,

2001

, vol.

17

(pg.

750

-

1

)

Crossref

PubMed

Alexandrov

NN

.

SARFing the PDB

,

Protein Eng

,

1996

, vol.

9

(pg.

727

-

32

)

Crossref

PubMed

Shindyalov

IN

,

Bourne

PE

.

Protein structure alignment by incremental combinatorial extension (CE) of the optimal path

,

Protein Eng

,

1998

, vol.

11

(pg.

739

-

47

)

Crossref

PubMed

Holm

L

,

Sander

C

.

Protein folds and families: sequence and structure alignments

,

Nucleic Acids Res

,

1999

, vol.

27

(pg.

244

-

7

)

Crossref

PubMed

Levitt

M

,

Gerstein

M

.

A unified statistical framework for sequence comparison and structure comparison

,

Proc Natl Acad Sci USA

,

1998

, vol.

95

(pg.

5913

-

20

)

Crossref

PubMed

Gibrat

JF

,

Madej

T

,

Bryant

SH

.

Surprising similarities in structure comparison

,

Curr Opin Struct Biol

,

1996

, vol.

6

(pg.

377

-

85

)

Crossref

PubMed

Zhu

J

,

Weng

Z

.

FAST: a novel protein structure alignment algorithm

,

Proteins

,

2005

, vol.

58

(pg.

618

-

27

)

Crossref

PubMed

Lupyan

D

,

Leo-Macias

A

,

Ortiz

AR

.

A new progressive-iterative algorithm for multiple structure alignment

,

Bioinformatics

,

2005

, vol.

21

(pg.

3255

-

63

)

Crossref

PubMed

Siew

N

,

Elofsson

A

,

Rychlewski

L

, et al.

MaxSub: an automated measure for the assessment of protein structure prediction quality

,

Bioinformatics

,

2000

, vol.

16

(pg.

776

-

85

)

Crossref

PubMed

Karlin

S

,

Altschul

SF

.

Methods for assessing the statistical significance of molecular sequence features by using general scoring schemes

,

Proc Natl Acad Sci USA

,

1990

, vol.

87

(pg.

2264

-

8

)

Crossref

PubMed

Theobald

DL

,

Wuttke

DS

.

Divergent evolution within protein superfolds inferred from profile-based phylogenetics

,

J Mol Biol

,

2005

, vol.

354

(pg.

722

-

37

)

Crossref

PubMed

Torres

J

,

Stevens

TJ

,

Samso

M

.

Membrane proteins: the ‘Wild West’ of structural biology

,

Trends Biochem Sci

,

2003

, vol.

28

(pg.

137

-

44

)

Crossref

PubMed

Casadio

R

,

Fariselli

P

,

Martelli

PL

.

In silico prediction of the structure of membrane proteins: Is it feasible?

,

Brief Bioinform

,

2003

, vol.

4

(pg.

341

-

8

)

Crossref

PubMed

Fariselli

P

,

Finelli

M

,

Rossi

I

, et al.

TRAMPLE: the transmembrane protein labelling environment

,

Nucl Acids Res

,

2005

, vol.

33

(pg.

W198

-

201

)

Crossref

PubMed

Casadio

R

,

Fariselli

P

,

Finocchiaro

G

, et al.

Fishing new proteins in the twilight zone of genomes: The test case of outer membrane proteins in Escherichia coli K12, Escherichia coli O157:H7, and other Gram-negative bacteria

,

Protein Sci

,

2003

, vol.

11

(pg.

1158

-

68

)

Crossref

Zhang

Y

,

Devries

ME

,

Skolnick

J

.

Structure Modeling of All Identified G Protein-Coupled Receptors in the Human Genome

,

PLoS Comput Biol

,

2006

, vol.

2

(pg.

89

-

99

)

Crossref