Partially Blind Domain Adaptation for Age Prediction from DNA Methylation Data

Handl, Lisa; Jalali, Adrin; Scherer, Michael; Pfeifer, Nico

Local TagsRelease HistoryDetailsSummary

Partially Blind Domain Adaptation for Age Prediction from DNA Methylation Data

Handl, L., Jalali, A., Scherer, M., & Pfeifer, N. (2016). Partially Blind Domain Adaptation for Age Prediction from DNA Methylation Data. Retrieved from http://arxiv.org/abs/1612.06650.

Item is Released

show all hide all

Basic

show hide

Item Permalink: https://hdl.handle.net/11858/00-001M-0000-002C-4CDD-3 Version Permalink: https://hdl.handle.net/11858/00-001M-0000-002C-4CDE-1

Genre: Paper

Latex : Partially Blind Domain Adaptation for Age Prediction from {DNA} Methylation Data

Files

show Files

hide Files

:

arXiv:1612.06650.pdf (Preprint), 73KB

View Save

File Permalink:
https://hdl.handle.net/11858/00-001M-0000-002C-4CDF-0

Name:
arXiv:1612.06650.pdf

Description:
File downloaded from arXiv at 2017-01-27 09:28 NIPS 2016 Workshop on Machine Learning for Health, Barcelona, Spain

OA-Status:

Visibility:
Public

MIME-Type / Checksum:
application/pdf / [MD5]

Technical Metadata:

View

Copyright Date:
-

Copyright Info:
-

License:
http://arxiv.org/help/license

Locators

show

Creators

show

hide

Creators:
Handl, Lisa¹, Author
Jalali, Adrin¹, Author
Scherer, Michael¹, Author
Pfeifer, Nico¹, Author

Affiliations:
1Computational Biology and Applied Algorithmics, MPI for Informatics, Max Planck Society, ou_40046

Content

show

hide

Free keywords: Quantitative Biology, Quantitative Methods, q-bio.QM,Statistics, Machine Learning, stat.ML

Abstract: Over the last years, huge resources of biological and medical data have become available for research. This data offers great chances for machine learning applications in health care, e.g. for precision medicine, but is also challenging to analyze. Typical challenges include a large number of possibly correlated features and heterogeneity in the data. One flourishing field of biological research in which this is relevant is epigenetics. Here, especially large amounts of DNA methylation data have emerged. This epigenetic mark has been used to predict a donor's 'epigenetic age' and increased epigenetic aging has been linked to lifestyle and disease history. In this paper we propose an adaptive model which performs feature selection for each test sample individually based on the distribution of the input data. The method can be seen as partially blind domain adaptation. We apply the model to the problem of age prediction based on DNA methylation data from a variety of tissues, and compare it to a standard model, which does not take heterogeneity into account. The standard approach has particularly bad performance on one tissue type on which we show substantial improvement with our new adaptive approach even though no samples of that tissue were part of the training data.

Details

show

hide

Language(s): eng - English

Dates: Created: 2016-12-20Published Online: 2016

Publication Status: Published online

Pages: 6 p.

Publishing info: -

Table of Contents: -

Rev. Type: -

Identifiers: arXiv: 1612.06650
URI: http://arxiv.org/abs/1612.06650
BibTex Citekey: HandlarXiv2016

Degree: -

Event

show

Legal Case

show

Project information

show

Source

show