IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, VOL. ?, NO. ?, MONTH 2015 1 Detection of US Trafﬁc Signs Andreas Møgelmose, Dongran Liu, and Mohan M. Trivedi, Life Fellow, IEEE Abstract—This paper presents a comprehensive research study of the detection of US trafﬁc signs. Until now, the research in Trafﬁc Sign Recognition systems has been centered on European trafﬁc signs, but signs can look very different across different parts of the world, and a system which works well in Europe may indeed not work in the US. We go over the recent advances in trafﬁc sign detection and discuss the differences in signs across the world. Then we present a comprehensive extension to the publicly available LISA-TS trafﬁc sign dataset, almost doubling its size, now with HD-quality footage. The extension is made with testing of tracking sign detection systems in mind, providing videos of trafﬁc sign passes. We apply the Integral Channel Features and Aggregate Channel Features detection methods to US trafﬁc signs and show performance numbers outperforming all previous research on US signs (while also performing similarly to the state of the art on European signs). Integral Channel Features have previously been used successfully for European signs, while Aggregate Channel Features have never been applied to the ﬁeld of trafﬁc signs. We take a look at the performance differences between the two methods and analyze how they perform on very distinctive signs, as well as white, rectangular signs, which tend to blend into their environment. Index Terms—Advanced Driver Assistance, active safety, ma- chine vision, trafﬁc signs. I. I NTRODUCTION T RAFFIC sign detection has become an important topic of attention, not only for researchers in intelligent vehicles and driver assistance areas but also those active in the machine vision area. Trafﬁc Sign Recognition (TSR) generally consists of two layers, detection and classiﬁcation. With the German Trafﬁc Sign Recognition Benchmark (GTSRB) in 2011, the classiﬁcation problem was largely solved. To achieve a fully functional TSR system, the detection step needs to work as well. With the introduction of the German Trafﬁc Sign Detection Benchmark (GTSDB) competition, a good amount of work has been done to that effect, even with suggestions of the detection problem being solved [1]. We contend that while good progress has deﬁnitely been made, the research community is not quite there yet. Not all trafﬁc signs look the same, especially the US signs are signiﬁcantly different in appearance from those in Europe. Systems which do not consider them cannot be expected to perform in the same manner as for what they are designed for - namely almost exclusively European signs. We have taken a fresh look at the speciﬁc issues, challenges, features, and evaluation of US trafﬁc signs in a comprehensive manner. To do this in a systematic way, the very ﬁrst order of business is to draw out differences in how these signs appear. Given A. Møgelmose is with the Visual Analysis of People Lab, Aalborg Univer- sity, Denmark, and Laboratory for Intelligent and Safe Automobiles (LISA), UC San Diego, USA. E-mail: am@create.aau.dk. D. Liu and M. M. Trivedi are with Laboratory for Intelligent and Safe Automobiles (LISA), UC San Diego, USA. Figure 1. Countries which have ratiﬁed the Vienna Convention on Road Signs and Signals. Note that apart from these, Japan, Australia, and to a lesser extent China also follows it, even though they did not ratify it. Data source: [2] these rather stark appearance differences, we undertook a major database collection, annotation, organization, and public distribution effort. Secondly, we explored the overall landscape of appearance based object detection research - including European trafﬁc signs - and carefully selected the two most promising approaches, one (Integral Channel Features) which has offered very good results on European signs and another (Aggregate Channel Features) which was very recently intro- duced in the literature, but has never been applied to the trafﬁc sign case. TSR is becoming more and more relevant, as cars obtain better and better Advanced Driver Assistance Systems (ADAS) [3]–[8], and driving becomes more and more automated. While mapping-based indexing of trafﬁc signs can replace in-situ recognition to some extent, it will never be able to work in changing road conditions, such as road work, and furthermore the initial sign locations and types must be determined some- how in the ﬁrst place. Until the infrastructure is updated to include wireless transponders in all trafﬁc signs, TSR will have its place in cars. No matter the application, detecting and recognizing signs on individual images is not sufﬁcient. If every new detection in a video feed is treated as a new sign, the driver (human or not) will quickly be overwhelmed by notiﬁcations. Instead detections must be grouped so all detections pertaining to the same physical sign are treated like the single sign it is. The temporal grouping of detections may also have a positive impact on the classiﬁcation, since more than one image can be used to determine the sign type. For temporal grouping, tracking comes into play. There has been some research into tracking of trafﬁc signs [9], [10], but it is still in its infancy. One of the issues in trafﬁc sign tracking is that no suitable dataset exists for that purpose, something the dataset extension put forth in this paper addresses, even though we do not tackle that issue in the experiments here. The primary goal of this paper is to present the most com-