| References |
|
|
Abagyan RA and
Batalov S
(1997)
Do aligned sequences share the same fold?
Journal of Molecular Biology
273:
355–368.
|
|
|
Altschul SF and
Gish W
(1996)
"Local alignment statistics".
In: Doolittle RF (ed.)
Methods in Enzymology,
vol. 266,
pp. 460–480.
San Diego, CA: Academic Press.
|
|
|
Altschul SF,
Gish W,
Miller W,
Meyers EW and
Lipman DJ
(1990)
Basic local alignment search tool.
Journal of Molecular Biology
215:
403–410.
|
|
|
Altschul SF,
Madden TL,
Schäffer AA et al.
(1997)
Gapped BLAST and PSI-BLAST: a new generation of protein database search programs.
Nucleic Acids Research
25:
3389–3402.
|
|
|
Arratia R and
Waterman MS
(1994)
A phase transition for the score in matching random sequences allowing deletions.
Annals of Applied Probability
4:
200–225.
|
|
|
Cartwright RA
(2007)
Ngila: global pairwise alignments with logarithmic and affine gap costs.
Bioinformatics
23:
1427–1428.
|
|
|
Dembo A and
Karlin S
(1991)
Strong limit theorems of empirical functionals for large exceedances of partial sums of i.i.d. variables.
Annals of Probability
19:
1737.
|
|
|
Dembo A,
Karlin S and
Zeitouni O
(1994)
Limit distributions of maximal non-aligned two-sequence segmental score.
Annals of Probability
22:
2022.
|
|
|
Doolittle RF
(1981)
Similar amino acid sequences: chance or common ancestry?
Science
214:
149–159.
|
|
|
Eddy SR
(1998)
Profile hidden Markov models.
Bioinformatics
14:
755–763.
|
|
|
Edgar RC and
Sjölander K
(2004)
A comparison of scoring functions for protein sequence profile alignment.
Bioinformatics
20:
1301–1308.
|
|
|
George RA and
Heringa J
(2002)
Protein domain identification and improved sequence searching using PSI-BLAST.
Proteins: Structure, Function, and Genetics
48:
672–681.
|
|
|
Gumbel EJ
(1958)
Statistics of Extremes.
New York, NY: Columbia University Press.
|
|
|
Heringa J
(1999)
Two strategies for sequence comparison: profile-preprocessed and secondary structure-induced multiple alignment.
Computers and Chemistry
23:
341–364.
|
|
|
Heringa J
(2002)
Local weighting schemes for protein multiple sequence alignment.
Computers and Chemistry
26:
459–477.
|
|
|
Jaroszewski L,
Rychlewski L,
Li Z,
Li W and
Godzik A
(2005)
FFAS03: a server for profile–profile sequence alignments.
Nucleic Acids Research
33:
W284–W288.
|
|
|
Karlin S and
Altschul SF
(1990)
Methods for assessing the statistical significance of molecular sequence features by using general scoring schemes.
Proceedings of the National Academy of Sciences of the USA
87:
2264–2268.
|
|
|
Karplus K,
Barrett C and
Hughey R
(1998)
Hidden Markov models for detecting remote protein homologies.
Bioinformatics
14:
846–856.
|
|
|
Kent WJ
(2002)
BLAT: the BLAST-like alignment tool.
Genome Research
12:
656–664.
|
|
|
Karplus K,
Karchin R,
Barrett C et al.
(2001)
What is the value added by human intervention in protein structure prediction?
Proteins: Structure, Function, and Genetics
45(S5):
86–91.
|
|
|
Lawless JF
(1982)
Statistical Models and Methods for Lifetime Data,
pp. 141–202.
New York, NY: Wiley.
|
|
|
May AC
(2001)
Related problems.
Nature
413:
453.
|
|
|
Mott R
(1992)
Maximum-likelihood estimation of the statistical distribution of Smith–Waterman local sequence similarity scores.
Bulletin of Mathematical Biology
54:
59–75.
|
|
|
Needleman SB and
Wunsch CD
(1970)
A general method applicable to the search for similarities in the amino acid sequence of two proteins.
Journal of Molecular Biology
48:
443–453.
|
|
|
von Ohsen N,
Sommer I,
Zimmer R and
Lengauer T
(2004)
Arby: automatic protein structure prediction using profile–profile alignment and confidence measures.
Bioinformatics
20:
2228–2235.
|
|
|
Pascarella S and
Argos P
(1992)
A data bank merging related protein structures and sequences.
Protein Engineering
5:
121–137.
|
|
|
Pearson WR
(1996)
"Effective protein sequence comparison".
In: Doolittle RF (ed.)
Methods in Enzymology,
vol. 266,
pp. 227–258.
San Diego, CA: Academic Press.
|
|
|
Pearson WR
(1998)
Empirical statistical estimates for sequence similarity searches.
Journal of Molecular Biology
276:
71–84.
|
|
|
Pearson WR and
Lipman DJ
(1988)
Improved tools for biological sequence comparison.
Proceedings of the National Academy of Sciences of the USA
85:
2444–2448.
|
|
|
Przybylski D and
Rost B
(2007)
Consensus sequences improve PSI-BLAST through mimicking profile–profile alignments searches.
Nucleic Acids Research
35(7):
2238–2246.
|
|
|
Rost B
(2002)
Enzyme function is less conserved than anticipated.
Journal of Molecular Biology
318:
595–608.
|
|
|
Sander C and
Schneider R
(1991)
Database of homology derived protein structures and the structural meaning of sequence alignment.
Proteins: Structure, Function, and Genetics
9:
56–68.
|
|
|
Schäffer AA,
Aravind L,
Madden TL et al.
(2001)
Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements.
Nucleic Acids Research
29:
2994–3005.
|
|
|
Sharon I,
Birkland A,
Chang K,
El-Yaniv R and
Yona G
(2005)
Correcting BLAST e-values for low-complexity segments.
Journal of Computational Biology
12:
978–1001.
|
|
|
Simossis VA,
Kleinjung J and
Heringa J
(2005)
Homology-extended sequence alignment.
Nucleic Acids Research
33:
816–824.
|
|
|
Smith TF and
Waterman MS
(1981)
Identification of common molecular subsequences.
Journal of Molecular Biology
147:
195–197.
|
|
|
Smith TF,
Waterman MS and
Burks C
(1985)
The statistical distribution of nucleic acid similarities.
Nucleic Acids Research
13:
645.
|
|
|
Tomii K and
Akiyama Y
(2004)
FORTE: a profile–profile comparison tool for protein fold recognition.
Bioinformatics
20:
594–595.
|
|
|
Waterman MS and
Eggert M
(1987)
A new algorithm for best subsequences alignment with applications to the tRNA–rRNA comparisons.
Journal of Molecular Biology
197:
723–728.
|
|
|
Waterman MS and
Vingron M
(1994)
Rapid and accurate estimates of statistical significance for sequence data base searches.
Proceedings of the National Academy of Sciences of the USA
91:
4625.
|
|
|
Wootton JC and
Federhen S
(1996)
"Analysis of compositionally biased regions in sequence databases".
In: Doolittle RF (ed.)
Methods in Enzymology,
vol. 266,
pp. 554–571.
San Diego, CA: Academic Press.
|
|
|
Yona G and
Levitt M
(2002)
Within the twilight zone: a sensitive profile–profile comparison tool based on information theory.
Journal of Molecular Biology
315:
1257–1275.
|
|
|
Yu YK,
Wootton JC,
Altschul SF et al.
(2003)
The compositional adjustment of amino acid substitution matrices.
Proceedings of the National Academy of Sciences of the USA
100:
15688–15693.
|
| Further Reading |
|
|
Doolittle RF (ed.)
(1996)
Methods in Enzymology,
vol. 266,
p. 711.
San Diego, CA: Academic Press.
|
|
|
Higgins D and
Taylor WR (eds)
(2000)
Bioinformatics: Sequence, Structure and Databanks,
p. 249.
Oxford: Oxford University Press.
|