English
 
Help Privacy Policy Disclaimer
  Advanced SearchBrowse

Item

ITEM ACTIONSEXPORT
 
 
DownloadE-Mail
  On the Complexity of Gene Expression Classification Data Sets

Lorena, A. C., Costa, I. G., & de Souto, M. C. P. (2008). On the Complexity of Gene Expression Classification Data Sets. In Hybrid Intelligent Systems, 2008. HIS '08. Eighth International Conference on (pp. 825-830). IEEE.

Item is

Files

show Files
hide Files
:
04626733.pdf (Any fulltext), 210KB
 
File Permalink:
-
Name:
04626733.pdf
Description:
-
OA-Status:
Visibility:
Restricted (Max Planck Institute for Molecular Genetics, MBMG; )
MIME-Type / Checksum:
application/pdf
Technical Metadata:
Copyright Date:
-
Copyright Info:
eDoc_access: INSTITUT
License:
-

Locators

show

Creators

show
hide
 Creators:
Lorena, Ana C., Author
Costa, Ivan G.1, Author           
de Souto, Marcilio C. P., Author
Affiliations:
1Dept. of Computational Molecular Biology (Head: Martin Vingron), Max Planck Institute for Molecular Genetics, Max Planck Society, ou_1433547              

Content

show
hide
Free keywords: cancer classifier complexity gene expression
 Abstract: One of the main kinds of computational tasks regarding gene expression data is the construction of classifiers (models), often via some machine learning (ML) technique and given data sets, to automatically discriminate expression patterns from cancer (tumor) and normal tissues or from subtypes of cancers. A very distinctive characteristic of these data sets is its high dimensionality and the fewer number of data items. Such a characteristic makes the induction of accurate ML models difficult (e.g., it could lead to model overfitting). In this context, we present an empirical study on the complexity of the classification task of gene expression data sets, related to cancer, used for classification purposes. In order to do so, we measure the complexity of the ML models used to perform the tumors' classification. The results indicate that most of these data sets can be effectively discriminated by a simple linear function.

Details

show
hide
Language(s): eng - English
 Dates: 2008-09-19
 Publication Status: Issued
 Pages: -
 Publishing info: -
 Table of Contents: -
 Rev. Type: -
 Degree: -

Event

show
hide
Title: Eighth International Conference on Hybrid Intelligent Systems
Place of Event: Barcelona, Spain
Start-/End Date: 2008-09-10 - 2008-09-12

Legal Case

show

Project information

show

Source 1

show
hide
Title: Hybrid Intelligent Systems, 2008. HIS '08. Eighth International Conference on
Source Genre: Proceedings
 Creator(s):
Affiliations:
Publ. Info: IEEE
Pages: - Volume / Issue: - Sequence Number: - Start / End Page: 825 - 830 Identifier: ISBN: 978-0-7695-3326-1