English
 
Help Privacy Policy Disclaimer
  Advanced SearchBrowse

Item

ITEM ACTIONSEXPORT
  Multiple Active Speaker Localization Based on Audio-visual Fusion in Two Stages

Li, Z., Herfet, T., Grochulla, M. P., & Thormählen, T. (2012). Multiple Active Speaker Localization Based on Audio-visual Fusion in Two Stages. In 2012 IEEE Conference on Multisensor Fusion and Integration for Intelligent Systems (pp. 262-268). Piscataway, NJ: IEEE. doi:10.1109/MFI.2012.6343015.

Item is

Files

show Files

Locators

show

Creators

show
hide
 Creators:
Li, Zhao1, Author
Herfet, Thorsten1, Author
Grochulla, Martin Peter2, Author           
Thormählen, Thorsten2, Author           
Affiliations:
1External Organizations, ou_persistent22              
2Computer Graphics, MPI for Informatics, Max Planck Society, ou_40047              

Content

show
hide
Free keywords: -
 Abstract: Localization of multiple active speakers in natural environments with only two microphones is a challenging problem. Reverberation degrades performance of speaker localization based exclusively on directional cues. The audio modality alone has problems with localization accuracy while the video modality alone has problems with false speaker activity detections. This paper presents an approach based on audiovisual fusion in two stages. In the first stage, speaker activity is detected based on the audio-visual fusion which can handle false lip movements. In the second stage, a Gaussian fusion method is proposed to integrate the estimates of both modalities. As a consequence, the localization accuracy and robustness compared to the audio/video modality alone is significantly increased. Experimental results in various scenarios confirmed the improved performance of the proposed system.

Details

show
hide
Language(s): eng - English
 Dates: 20122012
 Publication Status: Issued
 Pages: -
 Publishing info: -
 Table of Contents: -
 Rev. Type: -
 Identifiers: DOI: 10.1109/MFI.2012.6343015
BibTex Citekey: Grochulla2012a
Other: Local-ID: BC1B873FD9C3D529C1257B0C00586BF0-Grochulla2012a
 Degree: -

Event

show
hide
Title: IEEE Conference on Multisensor Fusion and Integration for Intelligent Systems
Place of Event: Hamburg, Germany
Start-/End Date: 2012-09-13 - 2012-09-15

Legal Case

show

Project information

show

Source 1

show
hide
Title: 2012 IEEE Conference on Multisensor Fusion and Integration for Intelligent Systems
  Abbreviation : MFI 2012
Source Genre: Proceedings
 Creator(s):
Affiliations:
Publ. Info: Piscataway, NJ : IEEE
Pages: - Volume / Issue: - Sequence Number: - Start / End Page: 262 - 268 Identifier: ISBN: 978-1-4673-2510-3
ISBN: 978-1-4673-2511-0