English
 
Help Privacy Policy Disclaimer
  Advanced SearchBrowse

Item

ITEM ACTIONSEXPORT
  Large Scale Hierarchical Clustering of Protein Sequences

Krause, A., Stoye, J., & Vingron, M. (2005). Large Scale Hierarchical Clustering of Protein Sequences. BMC Bioinformatics, 6, 15-15. doi:10.1186/1471-2105-6-15.

Item is

Files

show Files
hide Files
:
SYSTERS Large-scale Protein Clustering and Protein Family Database.htm (Any fulltext), 11KB
Name:
SYSTERS Large-scale Protein Clustering and Protein Family Database.htm
Description:
-
OA-Status:
Visibility:
Public
MIME-Type / Checksum:
text/html / [MD5]
Technical Metadata:
Copyright Date:
-
Copyright Info:
eDoc_access: PUBLIC
License:
-

Locators

show

Creators

show
hide
 Creators:
Krause, Antje1, Author
Stoye, Jens, Author
Vingron, Martin2, Author           
Affiliations:
1Max Planck Society, ou_persistent13              
2Gene regulation (Martin Vingron), Dept. of Computational Molecular Biology (Head: Martin Vingron), Max Planck Institute for Molecular Genetics, Max Planck Society, ou_1479639              

Content

show
hide
Free keywords: -
 Abstract: Background Searching a biological sequence database with a query sequence looking for homologues has become a routine operation in computational biology. In spite of the high degree of sophistication of currently available search routines it is still virtually impossible to identify quickly and clearly a group of sequences that a given query sequence belongs to. Results We report on our developments in grouping all known protein sequences hierarchically into superfamily and family clusters. Our graph-based algorithms take into account the topology of the sequence space induced by the data itself to construct a biologically meaningful partitioning. We have applied our clustering procedures to a non-redundant set of about 1,000,000 sequences resulting in a hierarchical clustering which is being made available for querying and browsing at http://systers.molgen.mpg.de/. Conclusions Comparisons with other widely used clustering methods on various data sets show the abilities and strengths of our clustering methods in producing a biologically meaningful grouping of protein sequences.

Details

show
hide
Language(s): eng - English
 Dates: 2005-01-22
 Publication Status: Issued
 Pages: -
 Publishing info: -
 Table of Contents: -
 Rev. Type: -
 Identifiers: eDoc: 265192
DOI: 10.1186/1471-2105-6-15
 Degree: -

Event

show

Legal Case

show

Project information

show

Source 1

show
hide
Title: BMC Bioinformatics
Source Genre: Journal
 Creator(s):
Affiliations:
Publ. Info: -
Pages: - Volume / Issue: 6 Sequence Number: - Start / End Page: 15 - 15 Identifier: ISSN: 1471-2105