Detecting and extracting named entities from spontaneous speech in a mixed-initiative spoken dialogue context: How May I Help You? sm;tm Fr ederic Bechet a, * , Allen L. Gorin b , Jeremy H. Wright b , Dilek Hakkani Tur b a LIA, University of Avignon, BP1228, 84911 Avignon Cedex 09, France b AT&T Labs, 180 Park Avenue, Florham Park, NJ 07932, USA Received 17 October 2002; received in revised form 14 June 2003; accepted 1 July 2003 Abstract The understanding module of a spoken dialogue system must extract, from the speech recognizer output, the kind of request expressed by the caller (the call type) and its parameters (numerical expressions, time expressions or proper- names). Such expressions are called Named Entities and their definitions can be either generic or linked to the dialogue application domain. Detecting and extracting such Named Entities within a mixed-initiative dialogue context like How May I Help You? sm;tm (HMIHY) is the subject of this study. After reviewing standard methods based on hand-written grammars and statistical tagging, we propose a new approach, combining the advantages of both in a 2-step process. We also propose a novel architecture which exploits understanding to improve recognition accuracy: the output of the Automatic Speech Recognition module is now a word lattice and the understanding module is responsible for tran- scribing the word strings which are useful to the Dialogue Manager. All the methods proposed are trained and eval- uated on a corpus comprising utterances from live customer traffic. Ó 2003 Elsevier B.V. All rights reserved. Resume Les systemes automatiques de dialogue tel ephonique contiennent generalement un module de comprehension charge de traiter les sorties du module de reconnaissance automatique de parole. Ce traitement consiste a extraire non-seu- lement le type de requ^ ete exprimee par lÕutilisateur mais aussi les parametres de cette requ^ ete tels que les expressions numeriques, temporelles ou bien encore les noms propres. Ces expressions sont generalement appelees des Entites Nommees et leurs definitions peuvent ^ etre generiques ou bien liees a un domaine dÕapplication particulier. Detecter et extraire de telles entitesdanslecadredÕun systeme automatique de dialogue tel ephonique a initiative mixte tel que How May I Help You? sm;tm (HMIHY) est le sujet de cette etude. Apres avoir passe en revue les methodes habituelles basees sur des grammaires ecrites manuellement ou bien sur des etiqueteurs statistiques, nous proposons une nouvelle ap- proche permettant de combiner leurs avantages respectifs. Nous proposons egalement une nouvelle architecture, pour les systemes automatiques de dialogue tel ephonique, qui utilize les resultats du module de comprehension afin * Corresponding author. Tel.: +33-4-90-84-35-12; fax: +33-4-90-84-35-01. E-mail addresses: frederic.bechet@lia.univ-avignon.fr (F.Bechet), algor@research.att.com (A.L. Gorin), jwright@research.att.com (J.H. Wright), dtur@research.att.com (D. Hakkani Tur). 0167-6393/$ - see front matter Ó 2003 Elsevier B.V. All rights reserved. doi:10.1016/j.specom.2003.07.003 Speech Communication 42 (2004) 207–225 www.elsevier.com/locate/specom