Graph Mining with Variational Dirichlet Process Mixture Models

Tsuda, K; Zaki, M. J.

Item

ITEM ACTIONSEXPORT

DownloadE-Mail

Please note that a newer version of this item is available:
https://pure.mpg.de/pubman/item/item_1789990_2

DetailsSummary

Graph Mining with Variational Dirichlet Process Mixture Models

Tsuda, K. (2008). Graph Mining with Variational Dirichlet Process Mixture Models. Proceedings of the 8th SIAM International Conference on Data Mining, 432-442.

Item is Released

show all hide all

Basic

show hide

Item Permalink: https://hdl.handle.net/11858/00-001M-0000-0013-C9DD-B Version Permalink: https://hdl.handle.net/11858/00-001M-0000-0013-C9DE-9

Genre: Conference Paper

Files

show Files

Locators

show

Creators

show

hide

Creators:
Tsuda, K¹, Author
Zaki, M. J., Editor

Affiliations:
1Department Empirical Inference, Max Planck Institute for Biological Cybernetics, Max Planck Society, ou_1497795

Content

show

hide

Free keywords: -

Abstract: Graph data such as chemical compounds and XML documents are getting more common in many application domains. A main difficulty of graph data processing lies in the intrinsic high dimensionality of graphs, namely, when a graph is represented as a binary feature vector of indicators of all possible subgraph patterns, the dimensionality gets too large for usual statistical methods. We propose a nonparametric Bayesian method for clustering graphs and selecting salient patterns at the same time. Variational inference is adopted here, because sampling is not applicable due to extremely high dimensionality. The feature set minimizing the free energy is efficiently collected with the DFS code tree, where the generation of useless subgraphs is suppressed by a tree pruning condition. In experiments, our method is compared with a simpler approach based on frequent subgraph mining, and graph kernels.

Details

show

hide

Language(s):

Dates: Date issued: 2008-04

Publication Status: Issued

Pages: -

Publishing info: -

Table of Contents: -

Rev. Type: -

Identifiers: URI: http://www.siam.org/proceedings/datamining/2008/dm08.php
BibTex Citekey: 4950

Degree: -

Event

show

hide

Title: 8th 2008 SIAM International Conference on Data Mining

Place of Event: Atlanta, GA, USA

Start-/End Date: -

Legal Case

show

Project information

show

Source 1

show

hide

Title: Proceedings of the 8th SIAM International Conference on Data Mining

Source Genre: Journal

Creator(s):

Affiliations:

Publ. Info: Philadelphia, PA, USA : Society for Industrial and Applied Mathematics

Pages: - Volume / Issue: - Sequence Number: - Start / End Page: 432 - 442 Identifier: -