D. Forsyth, P. Torr, and A. Zisserman (Eds.): ECCV 2008, Part II, LNCS 5303, pp. 568 – 581, 2008.
© Springer-Verlag Berlin Heidelberg 2008
Shape-Based Retrieval of Heart Sounds for Disease
Similarity Detection
Tanveer Syeda-Mahmood and Fei Wang
IBM Almaden Research Center, 650 Harry Road, San Jose, CA 95120
{stf,wangfe}@almaden.ibm.com
Abstract. Retrieval of similar heart sounds from a sound database has applica-
tions in physician training, diagnostic screening, and decision support. In this
paper, we exploit a visual rendering of heart sounds and model the morphologi-
cal variations of audio envelopes through a constrained non-rigid translation
transform. Similar heart sounds are then retrieved by recovering the correspond-
ing alignment transform using a variant of shape-based dynamic time warping.
Results of similar heart sound retrieval are demonstrated for various diseases
on a large database of heart sounds.
Keywords: Sound pattern analysis, audio retrieval, curve analysis, healthcare
application.
1 Introduction
The field of pattern recognition is becoming increasingly applicable to new modali-
ties. In this paper we explore the application of computer vision techniques to an
important class of audio signals, namely, heart sounds. Heart auscultation, i.e., listen-
ing to the sounds produced by the heart, is a common practice in the screening of
heart disease. Although different diseases produce characteristic sounds, forming a
diagnosis based on sounds heard through a stethoscope is a skill that takes years to
perfect. Numerous studies have shown that as much as 87% of patients referred to
cardiologists for evaluation are as a result of false alarms [2]. Thus software diagnos-
tic tools that aid physicians in their diagnosis of heart sounds are needed.
Auditory discrimination of heart sounds is inherently difficult as these sounds are
faint and lie at the lower end of the audible frequency range [1]. Although there are
tools to visually render the sounds [3], we recently observed that the visual represen-
tation of heart sounds actually brings out the differentiating characteristics of various
diseases more readily than the audio signal. Figure 1 illustrates this by recording the
visual appearance of the audio signal within a single heart beat duration, from patients
with various diseases. As can be seen, different diseases show characteristically dif-
ferent shape patterns. Further, it is also easier to spot the similarity in the sounds
across patients with similar diseases through their visual representations. Figure 2
illustrates this for different patients diagnosed with the same disease. From these
examples, it appears plausible that the disease similarity can be inferred by develop-
ing a measure for capturing the visual similarity of audio signals.