Deutsch
 
Hilfe Datenschutzhinweis Impressum
  DetailsucheBrowse

Datensatz

DATENSATZ AKTIONENEXPORT
  The SYSTERS protein family database: taxon-related protein family size distributions and singleton frequencies

Meinel, T., Vingron, M., & Krause, A. (2003). The SYSTERS protein family database: taxon-related protein family size distributions and singleton frequencies. In H.-W. Mewes, D. Frishman, V. Heun, & S. Kramer (Eds.), Proceedings of the German Conference on Bioinformatics (GCB '03) (pp. 103-108).

Item is

Dateien

einblenden: Dateien
ausblenden: Dateien
:
gcb2003_meinel.pdf (beliebiger Volltext), 162KB
Name:
gcb2003_meinel.pdf
Beschreibung:
-
OA-Status:
Sichtbarkeit:
Öffentlich
MIME-Typ / Prüfsumme:
application/pdf / [MD5]
Technische Metadaten:
Copyright Datum:
-
Copyright Info:
eDoc_access: PUBLIC
Lizenz:
-

Externe Referenzen

einblenden:

Urheber

einblenden:
ausblenden:
 Urheber:
Meinel, Thomas1, Autor
Vingron, Martin2, Autor           
Krause, Antje1, Autor
Affiliations:
1Max Planck Society, ou_persistent13              
2Gene regulation (Martin Vingron), Dept. of Computational Molecular Biology (Head: Martin Vingron), Max Planck Institute for Molecular Genetics, Max Planck Society, ou_1479639              

Inhalt

einblenden:
ausblenden:
Schlagwörter: protein family; large scale clustering; taxonomy; taxon-related; cluster size distribution
 Zusammenfassung: Based on the SYSTERS protein family database, we present taxon-related protein family frequencies and distributions. A set of taxon-related protein families is a subset of the whole family set with respect to one taxon, where taxon is not restricted to the species level but may be any rank in the taxonomy. We examine eight ranks in the lineages of seven organisms. A strong linear correlation is observed between the total number of different families and the number of sequences in the data set under consideration. We fitted the generalised power-law function to protein family distributions in a least-squares sense excluding singleton frequencies. Taxon-related family distributions tend to have the same shape and a negative slope being not larger than -2.1 for large data sets. For smaller data sets, the slope is decreasing down to -3.7. Slopes of family distributions are found to be slowly increasing towards higher taxonomic ranks. Our observations lead to a new estimation of single sequence cluster frequencies. Data sets of various species are studied with respect to being complete or incomplete.

Details

einblenden:
ausblenden:
Sprache(n): eng - English
 Datum: 2003
 Publikationsstatus: Erschienen
 Seiten: -
 Ort, Verlag, Ausgabe: -
 Inhaltsverzeichnis: -
 Art der Begutachtung: -
 Identifikatoren: eDoc: 175889
 Art des Abschluß: -

Veranstaltung

einblenden:
ausblenden:
Titel: German Conference on Bioinformatics
Veranstaltungsort: Neuherberg/Garching near Munich
Start-/Enddatum: 2003-10-12 - 2003-10-14

Entscheidung

einblenden:

Projektinformation

einblenden:

Quelle 1

einblenden:
ausblenden:
Titel: Proceedings of the German Conference on Bioinformatics (GCB '03)
Genre der Quelle: Konferenzband
 Urheber:
Mewes, H.-W., Herausgeber
Frishman, D., Herausgeber
Heun, V., Herausgeber
Kramer, S., Herausgeber
Affiliations:
-
Ort, Verlag, Ausgabe: -
Seiten: - Band / Heft: - Artikelnummer: - Start- / Endseite: 103 - 108 Identifikator: -