Deutsch
 
Hilfe Datenschutzhinweis Impressum
  DetailsucheBrowse

Datensatz

 
 
DownloadE-Mail
  Alignment of 1000 Genomes Project Reads to Reference Assembly GRCh38

Zheng-Bradley, X., Streeter, I., Fairley, S., Richardson, D., Clarke, L., Flicek, P., et al. (2017). Alignment of 1000 Genomes Project Reads to Reference Assembly GRCh38. GigaScience, 6(7), 1-8. doi:10.1093/gigascience/gix038.

Item is

Basisdaten

einblenden: ausblenden:
Genre: Zeitschriftenartikel

Dateien

einblenden: Dateien
ausblenden: Dateien
:
Zheng-Bradley.pdf (Verlagsversion), 673KB
Name:
Zheng-Bradley.pdf
Beschreibung:
-
OA-Status:
Sichtbarkeit:
Öffentlich
MIME-Typ / Prüfsumme:
application/pdf / [MD5]
Technische Metadaten:
Copyright Datum:
-
Copyright Info:
© The Authors 2017. Published by Oxford University Press.

Externe Referenzen

einblenden:

Urheber

einblenden:
ausblenden:
 Urheber:
Zheng-Bradley, Xiangqun , Autor
Streeter, Ian, Autor
Fairley, Susan, Autor
Richardson, David , Autor
Clarke, Laura, Autor
Flicek, Paul, Autor
1000 Genomes Project , Consortium, Autor
Timmermann, Bernd1, Autor           
Affiliations:
1Sequencing (Head: Bernd Timmermann), Scientific Service (Head: Christoph Krukenkamp), Max Planck Institute for Molecular Genetics, Max Planck Society, ou_1479670              

Inhalt

einblenden:
ausblenden:
Schlagwörter: alignment; reference genome; GRCh38; sequence reads; read mapping
 Zusammenfassung: The 1000 Genomes Project produced more than 100 trillion basepairs of short read sequence from more than 2600 samples in 26 populations over a period of five years. In its final phase, the project released over 85 million genotyped and phased variants on human reference genome assembly GRCh37. An updated reference assembly, GRCh38, was released in late 2013, but there was insufficient time for the final phase of the project analysis to change to the new assembly. Although it is possible to lift the coordinates of the 1000 Genomes Project variants to the new assembly, this is a potentially error-prone process as coordinate remapping is most appropriate only for non-repetitive regions of the genome and those that did not see significant change between the two assemblies. It will also miss variants in any region that was newly added to GRCh38. Thus, to produce the highest quality variants and genotypes on GRCh38, the best strategy is to realign the reads and recall the variants based on the new alignment. As the first step of variant calling for the 1000 Genomes Project data, we have finished remapping all of the 1000 Genomes sequence reads to GRCh38 with alternative scaffold–aware BWA-MEM. The resulting alignments are available as CRAM, a reference-based sequence compression format. The data have been released on our FTP site and are also available from European Nucleotide Archive to facilitate researchers discovering variants on the primary sequences and alternative contigs of GRCh38.

Details

einblenden:
ausblenden:
Sprache(n): eng - English
 Datum: 2017-05-202017-07-01
 Publikationsstatus: Erschienen
 Seiten: 8
 Ort, Verlag, Ausgabe: -
 Inhaltsverzeichnis: -
 Art der Begutachtung: -
 Identifikatoren: DOI: 10.1093/gigascience/gix038
 Art des Abschluß: -

Veranstaltung

einblenden:

Entscheidung

einblenden:

Projektinformation

einblenden:

Quelle 1

einblenden:
ausblenden:
Titel: GigaScience
Genre der Quelle: Zeitschrift
 Urheber:
Affiliations:
Ort, Verlag, Ausgabe: London : BioMed Central
Seiten: - Band / Heft: 6 (7) Artikelnummer: - Start- / Endseite: 1 - 8 Identifikator: ISSN: 2047-217X
CoNE: https://pure.mpg.de/cone/journals/resource/2047-217X