© 2013, IJARCSSE All Rights Reserved Page | 196 Volume 3, Issue 3, March 2013 ISSN: 2277 128X International Journal of Advanced Research in Computer Science and Software Engineering Research Paper Available online at: www.ijarcsse.com SIFT Based Approach: Object Recognition and Localization for Pick-and-Place System Girish Patil * Department of Electronics and Telecommunication, Govt. College of Engineering, Amravati, India Devendra Chaudhari Department of Electronics and Telecommunication, Govt. College of Engineering, Amravati, India Abstract — Vision based pick and place robotic systems have been the focus of significant research in both academia and industry. The system typically employs machine vision to analyse the scene, identify and locate the specified object and provide feedback to the robot arm for subsequent operations. For successful picking, the vision system needs to recognize the position and the orientation of the objects, the Scale Invariant Feature Transform (SIFT) is used for this purpose. The basis of the proposed work is built around two major areas; object recognition for developing artificial vision system and the robotics for carrying out the specified task with the specified object. In this paper, such object recognition techniques are reviewed. Keywords — Object Recognition, Scale Invariant Feature Transform, Image Processing, Feature Extraction, Robotics. I. INTRODUCTION The pick-and-place processes are the primary requisite for many of the industrial and household application. For such applications, there is a need to automate the pick-and-place process basically comprising of picking the intended objects, possibly performing certain tasks and placing them to desired location. The automated pick-and-place systems mainly consist of robotic arms and sensors. The machine vision is used as sensor and the primary function of them is to drive the robotic arms to the right location of desired object for picking and placing according to the robot’s degrees of freedom. The placing location is prefixed in most applications hence the sensors are rarely used here and the placing phase found to be comparatively easier. In contrast, the picking phase becomes very complex process in the applications where the scene is occluded and constrained. In this phase the sensors plays most important role as it is responsible for correct movements of the robotic system [1]. Most of the picking system assumes the situations where the objects are well structured, ordered, aligned and synchronized grasping of the objects. For such cases, the use of simple photocells will be sufficient to accomplish the picking phase. However, this approach will not be adequate for several applications as the arrangement to keep the objects well-structured and well aligned results in wastage of time and space of the process. In addition to this, there are some applications where the objects need to be kept in bins for saving time and/or for hygienic and safety reasons as shown in Fig.1. In this case, cameras used must be high resolution along with appropriate machine vision algorithms. In the scene, the objects are positioned at random inside a bin, a container or even at random on a belt/shelf; this problem is addressed as bin picking. [2] Fig. 1: Examples of complex situations for multiple object segmentation. The pick-and-place systems with robotic vision present several challenges, like the system should be capable of overcoming the difficulties in the disposal of the objects such as order, structure and placing of grasping points, working with every type of object of different dimension and complexity, with reﬂective surfaces or semi -transparent parts, such as in the case of pharmaceutical and cosmetic objects, often reﬂective or included in transparent ﬂow packs, tackling the conditions of occlusion and clutter which make the object only partially visible, not only counting but also classifying the first instance of the object and also to identify all the duplicate objects with their orientation and dimensions, meeting the required working speed though having fast detection technique so that it works with several objects per minute.