Aural-perceptual speaker identification:
problems with noncontemporary samples
Harry Hollien* and Reva Schwartz
*Institute for Advanced Study of the Communication Processes,
University of Florida
ABSTRACT The degree to which speech and/or speech samples are noncontemporary is
considered important to the speaker identification process. There are two dimensions to
the problem; the first relates to the listener and, especially, to earwitness lineups. Here, the
subject or witness is asked to make identifications at various times after having heard (but
not having seen, of course) the speaker. It has been found that a person’s memory for a
voice decays over time. In the second case, it is the samples of the speaker’s utterances
which are temporally displaced. The prevailing opinion here has been that the use of
noncontemporary speech samples poses just as difficult a challenge to the speaker
identification process as does the decaying memory of a witness. Accordingly, research was
carried out to test this possibility (Hollien and Schwartz, in press); it was found that the
overall drop in correct identification over latencies from four weeks to six years was only
about 15–25 per cent. It was not until the greatest of the time separations was studied (i.e.,
twenty years) that a substantial drop occurred (to 31 per cent). At this juncture, a number
of questions arose, and three of them have been investigated. First, is listener gender
important to the process? Second, are the identification levels affected by the type of
listeners employed? Finally, can external factors serve to differentially degrade listener
performance? It was found that the first question could be answered in the negative and
the other two in the affirmative. These findings should aid in clarifying some of the
relationship between sample latency and identification accuracy.
KEYWORDS speaker identification, auditory identification, noncontemporary speech,
perceptual processing
INTRODUCTION
Most forensic approaches to speaker identification take one of two major
forms (there are exceptions, of course). In one instance, a victim or witness
is asked to identify the voice of another person, one who was heard but
not seen. One example: a telephone call. A second example: a woman was
raped but did not actually see the perpetrator – only heard his speech and
voice. She is asked to make an identification (earwitness lineup) at some
time after (often long after) having heard him speak. The second of the
two procedures is one where a voice has been recorded (examples: death
threats, bomb threats, sexual harassment) and attempts are made later to
determine the identity of the speaker, often from a pool of suspects. Here,
© University of Birmingham Press 2000 1350-1771
Forensic Linguistics 7(2) 2000