Deutsch
 
Hilfe Datenschutzhinweis Impressum
  DetailsucheBrowse

Datensatz

 
 
DownloadE-Mail
  Multiple Active Speaker Localization Based on Audio-visual Fusion in Two Stages

Li, Z., Herfet, T., Grochulla, M. P., & Thormählen, T. (2012). Multiple Active Speaker Localization Based on Audio-visual Fusion in Two Stages. In 2012 IEEE Conference on Multisensor Fusion and Integration for Intelligent Systems (pp. 262-268). Piscataway, NJ: IEEE. doi:10.1109/MFI.2012.6343015.

Item is

Externe Referenzen

einblenden:

Urheber

einblenden:
ausblenden:
 Urheber:
Li, Zhao1, Autor
Herfet, Thorsten1, Autor
Grochulla, Martin Peter2, Autor           
Thormählen, Thorsten2, Autor           
Affiliations:
1External Organizations, ou_persistent22              
2Computer Graphics, MPI for Informatics, Max Planck Society, ou_40047              

Inhalt

einblenden:
ausblenden:
Schlagwörter: -
 Zusammenfassung: Localization of multiple active speakers in natural environments with only two microphones is a challenging problem. Reverberation degrades performance of speaker localization based exclusively on directional cues. The audio modality alone has problems with localization accuracy while the video modality alone has problems with false speaker activity detections. This paper presents an approach based on audiovisual fusion in two stages. In the first stage, speaker activity is detected based on the audio-visual fusion which can handle false lip movements. In the second stage, a Gaussian fusion method is proposed to integrate the estimates of both modalities. As a consequence, the localization accuracy and robustness compared to the audio/video modality alone is significantly increased. Experimental results in various scenarios confirmed the improved performance of the proposed system.

Details

einblenden:
ausblenden:
Sprache(n): eng - English
 Datum: 20122012
 Publikationsstatus: Erschienen
 Seiten: -
 Ort, Verlag, Ausgabe: -
 Inhaltsverzeichnis: -
 Art der Begutachtung: -
 Identifikatoren: DOI: 10.1109/MFI.2012.6343015
BibTex Citekey: Grochulla2012a
Anderer: Local-ID: BC1B873FD9C3D529C1257B0C00586BF0-Grochulla2012a
 Art des Abschluß: -

Veranstaltung

einblenden:
ausblenden:
Titel: IEEE Conference on Multisensor Fusion and Integration for Intelligent Systems
Veranstaltungsort: Hamburg, Germany
Start-/Enddatum: 2012-09-13 - 2012-09-15

Entscheidung

einblenden:

Projektinformation

einblenden:

Quelle 1

einblenden:
ausblenden:
Titel: 2012 IEEE Conference on Multisensor Fusion and Integration for Intelligent Systems
  Kurztitel : MFI 2012
Genre der Quelle: Konferenzband
 Urheber:
Affiliations:
Ort, Verlag, Ausgabe: Piscataway, NJ : IEEE
Seiten: - Band / Heft: - Artikelnummer: - Start- / Endseite: 262 - 268 Identifikator: ISBN: 978-1-4673-2510-3
ISBN: 978-1-4673-2511-0