A two-pass approach for handling out-of-vocabulary words in a large vocabulary 
recognition task

Scharenborg, Odette; Seneff, S.; Boves, L.

doi:10.1016/j.csl.2006.03.003

DetailsSummary

A two-pass approach for handling out-of-vocabulary words in a large vocabulary recognition task

Scharenborg, O., Seneff, S., & Boves, L. (2007). A two-pass approach for handling out-of-vocabulary words in a large vocabulary recognition task. Computer, Speech & Language, 21, 206-218. doi:10.1016/j.csl.2006.03.003.

Item is Released

show all hide all

Basic

show hide

Item Permalink: https://hdl.handle.net/11858/00-001M-0000-0012-D1DC-3 Version Permalink: https://hdl.handle.net/11858/00-001M-0000-0012-D1DD-1

Genre: Journal Article

Files

show Files

hide Files

:

59725A2Fd01.pdf (Publisher version), 197KB

View Save

File Permalink:
https://hdl.handle.net/11858/00-001M-0000-0012-D1DB-5

Name:
59725A2Fd01.pdf

Description:
-

OA-Status:

Visibility:
Public

MIME-Type / Checksum:
application/pdf / [MD5]

Technical Metadata:

View

Copyright Date:
-

Copyright Info:
-

License:
-

Locators

show

Creators

show

hide

Creators:
Scharenborg, Odette¹, Author
Seneff, S.², Author
Boves, L.¹, Author

Affiliations:
1Centre for Language and Speech Technology, Radboud University Nijmegen, ou_55203
2Spoken Language System Group, Computer Science and Artificial Intelligence Laboratory, Massachusetts Institute of Technology, Cambridge, MA, USA, ou_persistent22

Content

show

hide

Free keywords: -

Abstract: This paper addresses the problem of recognizing a vocabulary of over 50,000 city names in a telephone access spoken dialogue system. We adopt a two-stage framework in which only major cities are represented in the first stage lexicon. We rely on an unknown word model encoded as a phone loop to detect OOV city names (referred to as ‘rare city’ names). We use SpeM, a tool that can extract words and word-initial cohorts from phone graphs from a large fallback lexicon, to provide an N-best list of promising city name hypotheses on the basis of the phone graph corresponding to the OOV. This N-best list is then inserted into the second stage lexicon for a subsequent recognition pass. Experiments were conducted on a set of spontaneous telephone-quality utterances; each containing one rare city name. It appeared that SpeM was able to include nearly 75% of the correct city names in an N-best hypothesis list of 3000 city names. With the names found by SpeM to extend the lexicon of the second stage recognizer, a word accuracy of 77.3% could be obtained. The best one-stage system yielded a word accuracy of 72.6%. The absolute number of correctly recognized rare city names almost doubled, from 62 for the best one-stage system to 102 for the best two-stage system. However, even the best two-stage system recognized only about one-third of the rare city names retrieved by SpeM. The paper discusses ways for improving the overall performance in the context of an application.

Details

show

hide

Language(s): eng - English

Dates: Date issued: 2007

Publication Status: Issued

Pages: -

Publishing info: -

Table of Contents: -

Rev. Type: -

Identifiers: DOI: 10.1016/j.csl.2006.03.003

Degree: -

Event

show

Legal Case

show

Project information

show

Source 1

show

hide

Title: Computer, Speech & Language

Source Genre: Journal

Creator(s):

Affiliations:

Publ. Info: Elsevier

Pages: - Volume / Issue: 21 Sequence Number: - Start / End Page: 206 - 218 Identifier: -