日本語
 
Help Privacy Policy ポリシー/免責事項
  詳細検索ブラウズ

アイテム詳細

  Model Order Selection for Boolean Matrix Factorization

Miettinen, P., & Vreeken, J. (2011). Model Order Selection for Boolean Matrix Factorization. In Proceedings of the 17th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD'11) (pp. 51-59). New York, NY: ACM. doi:10.1145/2020408.2020424.

Item is

基本情報

表示: 非表示:
資料種別: 会議論文
LaTeX : Model Order Selection for {Boolean} Matrix Factorization

ファイル

表示: ファイル
非表示: ファイル
:
op0629-miettinen.pdf (全文テキスト(全般)), 2MB
 
ファイルのパーマリンク:
-
ファイル名:
op0629-miettinen.pdf
説明:
-
OA-Status:
閲覧制限:
非公開
MIMEタイプ / チェックサム:
application/pdf
技術的なメタデータ:
著作権日付:
-
著作権情報:
(c) ACM, 2011. This is the author's version of the work. It is posted here by permission of ACM for your personal use. Not for redistribution. The definitive version was published in Proceedings of the 17th ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD-2011), http://doi.acm.org/10.1145/2020408.2020424.
CCライセンス:
-

関連URL

表示:

作成者

表示:
非表示:
 作成者:
Miettinen, Pauli1, 著者           
Vreeken, Jilles2, 著者
所属:
1Databases and Information Systems, MPI for Informatics, Max Planck Society, ou_24018              
2External Organizations, ou_persistent22              

内容説明

表示:
非表示:
キーワード: -
 要旨: Matrix factorizations---where a given data matrix is approximated by a product of two or more factor matrices---are powerful data mining tools. Among other tasks, matrix factorizations are often used to separate global structure from noise. This, however, requires solving the `model order selection problem' of determining where fine-grained structure stops, and noise starts, i.e., what is the proper size of the factor matrices. Boolean matrix factorization (BMF)---where data, factors, and matrix product are Boolean---has received increased attention from the data mining community in recent years. The technique has desirable properties, such as high interpretability and natural sparsity. But so far no method for selecting the correct model order for BMF has been available. In this paper we propose to use the Minimum Description Length (MDL) principle for this task. Besides solving the problem, this well-founded approach has numerous benefits, e.g., it is automatic, does not require a likelihood function, is fast, and, as experiments show, is highly accurate. We formulate the description length function for BMF in general---making it applicable for any BMF algorithm. We extend an existing algorithm for BMF to use MDL to identify the best Boolean matrix factorization, analyze the complexity of the problem, and perform an extensive experimental evaluation to study its behavior.

資料詳細

表示:
非表示:
言語: eng - English
 日付: 20112011
 出版の状態: 出版
 ページ: -
 出版情報: -
 目次: -
 査読: -
 識別子(DOI, ISBNなど): eDoc: 618993
DOI: 10.1145/2020408.2020424
URI: http://doi.acm.org/10.1145/2020408.2020424
その他: Local-ID: C1256DBF005F876D-9C688EF8F47E45D4C125796C0046EFA8-miettinen11model
 学位: -

関連イベント

表示:
非表示:
イベント名: 17th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining
開催地: San Diego, CA, USA
開始日・終了日: 2011-08-21 - 2011-08-24

訴訟

表示:

Project information

表示:

出版物 1

表示:
非表示:
出版物名: Proceedings of the 17th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD'11)
  副タイトル : 17th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining
  省略形 : KDD 2011
種別: 会議論文集
 著者・編者:
所属:
出版社, 出版地: New York, NY : ACM
ページ: - 巻号: - 通巻号: - 開始・終了ページ: 51 - 59 識別子(ISBN, ISSN, DOIなど): ISBN: 978-1-4503-0813-7