Aural-perceptual speaker identification: problems with noncontemporary samples Harry Hollien* and Reva Schwartz *Institute for Advanced Study of the Communication Processes, University of Florida ABSTRACT The degree to which speech and/or speech samples are noncontemporary is considered important to the speaker identification process. There are two dimensions to the problem; the first relates to the listener and, especially, to earwitness lineups. Here, the subject or witness is asked to make identifications at various times after having heard (but not having seen, of course) the speaker. It has been found that a person’s memory for a voice decays over time. In the second case, it is the samples of the speaker’s utterances which are temporally displaced. The prevailing opinion here has been that the use of non- contemporary speech samples poses just as difficult a challenge to the speaker identifi- cation process as does the decaying memory of a witness. Accordingly, research was carried out to test this possibility (Hollien and Schwartz, in press); it was found that the overall drop in correct identification over latencies from four weeks to six years was only about 15–25 per cent. It was not until the greatest of the time separations was studied (i.e., twenty years) that a substantial drop occurred (to 31 per cent). At this juncture, a number of questions arose; and three of them have been investigated. First, is listener gender important to the process; second, are the identification levels affected by the type of lis- teners employed and, finally, can external factors serve to differentially degrade listener performance? It was found that the first question could be answered in the negative and the second two in the affirmative. These findings should aid in clarifying some of the rela- tionship between sample latency and identification accuracy. KEYWORDS speaker identification, auditory identification, noncomtempory speech, perceptual processing INTRODUCTION Most forensic approaches to speaker identification take one of two major forms (there are exceptions, of course). In one instance, a victim or witness is asked to identify the voice of another person; one who was heard but not seen. One example: a telephone call. A second example: a woman was raped but did not actually see the perpetrator – only heard his speech and voice. She is asked to make an identification (earwitness lineup) at some time after (often long after) having heard him speak. The second of the two procedures is one where a voice has been recorded (examples: death threats, bomb threats, sexual harassment) and attempts are made later to determine the identity of the speaker, often from a pool of suspects. Here, © University of Birmingham Press 2000 1350-1771 Forensic Linguistics 7(2) 2000