日本語
 
Help Privacy Policy ポリシー/免責事項
  詳細検索ブラウズ

アイテム詳細

  Efficient Index-Based Audio Matching

Kurth, F., & Müller, M. (2008). Efficient Index-Based Audio Matching. IEEE Transactions on Audio, Speech, and Language Processing, 16(2), 382-395. doi:10.1109/TASL.2007.911552.

Item is

基本情報

表示: 非表示:
資料種別: 学術論文

ファイル

表示: ファイル

関連URL

表示:

作成者

表示:
非表示:
 作成者:
Kurth, Frank, 著者
Müller, Meinard1, 著者           
所属:
1Computer Graphics, MPI for Informatics, Max Planck Society, ou_40047              

内容説明

表示:
非表示:
キーワード: -
 要旨: Given a large audio database of music recordings, the goal of classical audio identification is to identify a particular audio recording by means of a short audio fragment. Even though recent identification algorithms show a significant degree of robustness towards noise, MP3 compression artifacts, and uniform temporal distortions, the notion of similarity is rather close to the identity. In this paper, we address a higher level retrieval problem, which we refer to as audio matching: given a short query audio clip, the goal is to automatically retrieve all excerpts from all recordings within the database that musically correspond to the query. In our matching scenario, opposed to classical audio identification, we allow semantically motivated variations as they typically occur in different interpretations of a piece of music. To this end, this paper presents an efficient and robust audio matching procedure that works even in the presence of significant variations, such as nonlinear temporal, dynamical, and spectral deviations, where existing algorithms for audio identification would fail. Furthermore, the combination of various deformation- and fault-tolerance mechanisms allows us to employ standard indexing techniques to obtain an efficient, index-based matching procedure, thus providing an important step towards semantically searching large-scale real-world music collections.

資料詳細

表示:
非表示:
言語: eng - English
 日付: 2009-03-162008
 出版の状態: 出版
 ページ: -
 出版情報: -
 目次: -
 査読: 査読あり
 識別子(DOI, ISBNなど): eDoc: 428141
DOI: 10.1109/TASL.2007.911552
URI: http://dx.doi.org/10.1109/TASL.2007.911552
その他: Local-ID: C125756E0038A185-3AD2D78184F392C5C125753E0058C93D-Müller2008
 学位: -

関連イベント

表示:

訴訟

表示:

Project information

表示:

出版物 1

表示:
非表示:
出版物名: IEEE Transactions on Audio, Speech, and Language Processing
種別: 学術雑誌
 著者・編者:
所属:
出版社, 出版地: -
ページ: - 巻号: 16 (2) 通巻号: - 開始・終了ページ: 382 - 395 識別子(ISBN, ISSN, DOIなど): ISSN: 1558-7916