Observer Agreement With Laparoscopic Diagnosis of
Pelvic Inflammatory Disease Using Photographs
Pontus Molander, MD, Patrik Finne, MD, PhD, Jari Sjöberg, MD, PhD, John Sellors, MD, PhD, and
Jorma Paavonen, MD, PhD
OBJECTIVE: To test the intraobserver (are observers likely to
agree between themselves?) and interobserver (are observ-
ers likely to agree with other observers?) reproducibility
and the overall diagnostic accuracy of laparoscopic diagno-
sis of pelvic inflammatory disease (PID).
METHODS: Three senior consultants and three residents in
training in obstetrics and gynecology scored the four lapa-
roscopic images (adnexa, cul-de-sac, and pelvic panoramic
view) from each of 40 patients and repeated the process 2
days later after the order of presentation had been random-
ized. A standardized predesigned scoring form was used.
Histopathologically proven PID was used as the gold stan-
dard.
RESULTS: The overall accuracy of the laparoscopic diagno-
sis of PID was 78%, the sensitivity was 27%, and the speci-
ficity was 92%. The overall intraobserver reproducibility of
the diagnosis of PID was only fair ( 0.58), and it was
clearly better among the consultants than among the resi-
dents ( 0.76 and 0.39, respectively). The overall inter-
observer reproducibility was poor to fair ( 0.43), and it
was again better among the consultants than among the
residents ( 0.48 and 0.38, respectively). When specific
diagnostic features (including tubal erythema, edema, ad-
hesions, cul-de-sac fluid) were separately analyzed, the
results were no different suggesting only poor-to-fair repro-
ducibility.
CONCLUSION: Based on photographic images, the observer
reproducibility and the overall diagnostic accuracy of the
laparoscopic diagnosis of PID are unsatisfactory when his-
topathologically proven PID is used as the gold standard.
(Obstet Gynecol 2003;101:875– 80. © 2003 by The Amer-
ican College of Obstetricians and Gynecologists.)
Laparoscopy has been recognized as the gold standard
for the diagnosis of pelvic inflammatory disease (PID).
1
Many studies have been performed on the correlation
between the clinical and the laparoscopic diagnosis of
PID. Most studies have demonstrated a relatively low
sensitivity for the clinical diagnosis usually in the range
of 60 –70%.
2–4
Similarly, laboratory criteria seem to
have a low sensitivity in the diagnosis of PID.
2
Several
other diagnostic modalities including transvaginal ultra-
sound,
5
magnetic resonance imaging,
6
endometrial biop-
sy,
7
and evidence of lower genital tract infection
8
have
also been compared with laparoscopy in the diagnosis of
PID.
Laparoscopic criteria for the diagnosis of PID include
tubal erythema, tubal edema, and the presence of puru-
lent exudate.
1,9
In some studies, the acute salpingitis
score has been derived to improve laparoscopic diagno-
sis and also to determine the severity of PID.
10
However,
uniform classification systems for the laparoscopic diag-
nosis of PID have been difficult to develop and imple-
ment. In fact, laparoscopy has never been properly vali-
dated as the gold standard in the diagnosis of PID. An
expectation bias by the laparoscopist can be a major
problem.
11
Sellors et al
11
demonstrated that laparoscopy
was no better than chance in the diagnosis of salpingitis
when fimbrial minibiopsy showing histopathologic evi-
dence of salpingitis was used as the gold standard. This
study clearly demonstrated the inaccuracy of the laparo-
scopic diagnosis of PID. Surprisingly, only few studies
have tested observer reproducibility of specific laparo-
scopic findings.
12–15
Thus, there is a growing concern
about the use of laparoscopy as the gold standard.
Therefore, we decided to perform a systematic study of
the observer reproducibility and diagnostic accuracy of
the laparoscopic diagnosis of PID among unselected
patients referred for acute pelvic pain. We used the
histopathologic diagnosis of PID as the gold standard.
MATERIALS AND METHODS
Three senior consultants (experience in laparoscopic
surgery 6 –15 years, mean 12 years) and three residents
in training (third-year residents undergoing rotation in
gynecologic surgery) from the Department of Obstetrics
From the Departments of Obstetrics and Gynecology and Clinical Chemistry,
University of Helsinki, Helsinki, Finland; and Program for Appropriate Technol-
ogy in Health, Seattle, Washington.
This study was supported by grants from the Helsinki University Research Funds,
K. Albin Johansson Funds, and Medical Society of Finland.
875 VOL. 101, NO. 5, PART 1, MAY 2003 0029-7844/03/$30.00
© 2003 by The American College of Obstetricians and Gynecologists. Published by Elsevier. doi:10.1016/S0029-7844(03)00013-9