http://www.iaeme.com/IJMET/index.asp 465 editor@iaeme.com International Journal of Mechanical Engineering and Technology (IJMET) Volume 10, Issue 05, May 2019, pp. 465-475, Article ID: IJMET_10_05_047 Available online at http://www.iaeme.com/ijmet/issues.asp?JType=IJMET&VType=10&IType=5 ISSN Print: 0976-6340 and ISSN Online: 0976-6359 © IAEME Publication MULTIPLE HUMAN TRACKING USING RETINANET FEATURES, SIAMESE NEURAL NETWORK, AND HUNGARIAN ALGORITHM Dina Chahyati, Aniati Murni Arymurthy Machine Learning and Computer Vision Laboratory Faculty of Computer Science, Universitas Indonesia, Depok, Indonesia ABSTRACT Multiple human tracking based on object detection has been a challenge due to its complexity. Errors in object detection would be propagated to tracking errors. In this paper, we propose a tracking method that minimizes the error produced by object detector. We use RetinaNet as object detector and Hungarian algorithm for tracking. The cost matrix for Hungarian algorithm is calculated using the RetinaNet features, bounding box center distances, and intersection of unions of bounding boxes. We interpolate the missing detections in the last step. The proposed method yield 43.2 MOTA for MOT16 benchmark. Key words: RetinaNet, tracking by detection, Hungarian algorithm, Siamese neural network, interpolation Cite this Article: Dina Chahyati, Aniati Murni Arymurthy, Multiple Human Tracking Using Retinanet Features, Siamese Neural Network, and Hungarian Algorithm, International Journal of Mechanical Engineering and Technology 10(5), 2019, pp. 465-475. http://www.iaeme.com/IJMET/issues.asp?JType=IJMET&VType=10&IType=5 1. INTRODUCTION Multiple object tracking has been a challenge for researchers over a decade. Vehicle and human tracking have been dominating this field since they are very important for surveillance system. Human tracking has its own challenges because of a wide variety of human appearance and severe occlusions in most of the scenes. In the beginning, researchers focused on tracking by trying to predict the path. Methods such as Kalman filter and optical flow [1][2][3] were commonly used in this approach. Nowadays many researchers change the approach to tracking by detection [4][6]. Detections by using HOG or its variation such as DPM are quite popular and has been used in multiple object tracking (MOT) benchmark [7]. However, DPM has limitation for recognizing more complex objects and its properties such as variation of clothes, bags, activities, gender, etc. Our future research topic is to track movements of people with a specific gender, therefore we cannot rely on DPM for the detection step. A more reasonable option would be to use deep learning approach for the detection step because they can be trained to detect more specific categories by using transfer learning.