3D Scene Reconstruction from Reﬂection Images in a Spherical Mirror Masayuki Kanbara, Norimichi Ukita, Masatsugu Kidode and Naokazu Yokoya Graduate School of Information Science, Nara Institute of Science and Technology 8916-5 Takayama-cho, Ikoma-shi, Nara 630-0192, JAPAN {kanbara, ukita, kidode, yokoya } @is.naist.jp Abstract This paper proposes a method for reconstructing a 3D scene structure by using the images reﬂected in a spherical mirror. In our method, the mirror is moved freely within the ﬁeld of view of a camera in order to observe a surrounding scene virtually from multiple viewpoints. The observation scheme, therefore, allows us to obtain the wide-angle multi- viewpoint images of a wide area. In addition, the following characteristics of this observation enable multi-view stereo with simple calibration of the geometric conﬁguration be- tween the mirror and the camera; (1) the distance and direc- tion from the camera to the mirror can be estimated directly from the position and size of the mirror in the captured im- age and (2) the directions of detected points from each po- sition of the moving mirror can be also estimated based on reﬂection on a spherical surface. Some experimental results show the effectiveness of our 3D reconstruction method. 1. Introduction This paper proposes a method for 3D reconstruction of a wide area from an image sequence capturing a spherical mirror by a camera whose projection center is ﬁxed. In the ﬁeld of computer vision, 3D scene reconstruction using im- ages taken from different view points have attracted much attention [1, 2]. Especially, as computers and cameras have made remarkable progress in recent years, a large number of methods for reconstructing a 3D scene from multiple im- ages have been proposed[3]. One of the major approaches to 3D reconstruction from multiple images is to use a static stereo vision [4, 5]. To re- construct the whole 3D structure of a scene, a large number of cameras must be employed for wide observation. How- ever, conventional methods cannot employ a large number of images because it is difﬁcult to calibrate a large number of cameras accurately. These methods, therefore, are not appropriate for reconstructing the whole 3D structure of a scene. One of other approaches is to use an image sequence taken by a moving camera, which is called shape-from- motion [6, 7]. The method can recover extrinsic camera parameters and 3-D positions of natural features simultane- ously by tracking the 2D positions of the natural features in multiple images of the sequence. Therefore, the method allows us to move a camera freely and widely in order to observe multidirectional images of a scene. A factorization algorithm [8] is one of the well known shape from motion methods that can estimate a rough 3D scene model stably and efﬁciently by assuming an afﬁne camera model. How- ever, when the 3-D scene is not suitable for the afﬁne cam- era model, the estimated results (i.e., both of extrinsic cam- era parameters and 3-D positions of feature points) are not reliable. Therefore, this method is not suitable for recon- structing a dense 3D structure in general. Although, some other methods of shape-from-motion are based on a pro- jective reconstruction method [9], most of these methods reconstruct only a limited scene from a small number of images and are not designed to obtain a dense structure. We can summarize the above discussion as follows: The meth- ods of 3D reconstruction from many images need an accu- rate calibration of a large number of cameras or undesired assumptions regarding camera/scene models, and thus these methods become usually complex and/or unstable. On the other hand, 3D reconstruction methods using re- ﬂection images acquired by capturing a mirror surface by a camera have been also investigated [10, 11, 12]. Some of these methods reconstruct 3D information by using the re- lationship between real features, that are directly captured by a camera, and virtual features, that are points observed through a mirror surface [13, 14]. In these methods, both of the real and virtual features of a 3D point must be captured simultaneously by the camera in order to reconstruct its 3D position, and thus the reconstruction environment is limited to a local scene. This paper proposes a 3D reconstruction method using only a combination of a spherical mirror and a high res- The 18th International Conference on Pattern Recognition (ICPR'06) 0-7695-2521-0/06 $20.00 © 2006