English
 
Help Privacy Policy Disclaimer
  Advanced SearchBrowse

Item

ITEM ACTIONSEXPORT
  From Black and White to Full Colour: Extending Redescription Mining Outside the Boolean World

Galbrun, E., & Miettinen, P. (2012). From Black and White to Full Colour: Extending Redescription Mining Outside the Boolean World. Statistical Analysis and Data Mining, 5(4), 284-303. doi:10.1002/sam.11145.

Item is

Files

show Files
hide Files
:
Stat. Anal. Data Min. 2012 Galbrun.pdf (Any fulltext), 401KB
 
File Permalink:
-
Name:
Stat. Anal. Data Min. 2012 Galbrun.pdf
Description:
-
OA-Status:
Visibility:
Private
MIME-Type / Checksum:
application/pdf
Technical Metadata:
Copyright Date:
-
Copyright Info:
-
License:
-

Locators

show

Creators

show
hide
 Creators:
Galbrun, Esther1, Author           
Miettinen, Pauli1, Author           
Affiliations:
1Databases and Information Systems, MPI for Informatics, Max Planck Society, ou_24018              

Content

show
hide
Free keywords: -
 Abstract: Redescription mining is a powerful data analysis tool that is used to find multiple descriptions of the same entities. Consider geographical regions as an example. They can be characterized by the fauna that inhabits them on one hand and by their meteorological conditions on the other hand. Finding such redescriptors, a task known as niche-finding, is of much importance in biology. Current redescription mining methods cannot handle other than Boolean data. This restricts the range of possible applications or makes discretization a prerequisite, entailing a possibly harmful loss of information. In niche-finding, while the fauna can be naturally represented using a Boolean presence/absence data, the weather cannot. In this paper, we extend redescription mining to categorical and real-valued data with possibly missing values using a surprisingly simple and efficient approach. We provide extensive experimental evaluation to study the behaviour of the proposed algorithm. Furthermore, we show the statistical significance of our results using recent innovations on randomization methods.

Details

show
hide
Language(s): eng - English
 Dates: 20122012
 Publication Status: Issued
 Pages: -
 Publishing info: -
 Table of Contents: -
 Rev. Type: Peer
 Identifiers: eDoc: 647493
DOI: 10.1002/sam.11145
Other: Local-ID: C1256DBF005F876D-BE029674FDF3303BC1257AF300432F52-galbrun12black
BibTex Citekey: galbrun12black
 Degree: -

Event

show

Legal Case

show

Project information

show

Source 1

show
hide
Title: Statistical Analysis and Data Mining
Source Genre: Journal
 Creator(s):
Affiliations:
Publ. Info: Chichester : Wiley
Pages: - Volume / Issue: 5 (4) Sequence Number: - Start / End Page: 284 - 303 Identifier: ISSN: 1932-1872