Performance Evaluation of a Live, Crowdsensing Based Transit Feed Service Architecture K´ aroly Farkas * , R´ obert Szab´ o † and Bern´ at Wiandt * * Dept. of Networked Systems and Services, † HSNLab, Dept. of Telecommunications and Media Informatics, Budapest University of Technology and Economics, Budapest, Hungary Email: farkask@hit.bme.hu, robert.szabo@tmit.bme.hu, bwiandt@hit.bme.hu Abstract—Taking public transportation is an efﬁcient and environmentally sound way of traveling in most of the cities. Unfortunately, the static schedule information of the public transport vehicles, available at the stops, on the web or in some special format like GTFS (General Transit Feed Speciﬁcation), usually does not reﬂect the actual trafﬁc situation. However, real- time trafﬁc updates require the gathering of an immense amount of tracking data. Mobile crowdsensing, via the mobile devices of the crowd, can offer a cheap and efﬁcient way for collecting such data. Nonetheless, for motivating active participation some day zero service must be provided to the users. In this paper, we discuss how to realize a live public transport information service extending a static timetable based on GTFS data, as the day zero service, using the power of the crowd for data collection. We detail the design of such a service implemented by an XMPP (Extensible Messaging and Presence Protocol) based mobile crowdsensing architecture and evaluate its performance. We show how we can build a scalable service architecture even with commodity hardware to handle thousands of users. Keywords—Crowdsensing, Public transport, GTFS, Pub- lish/subscribe, XMPP I. I NTRODUCTION Commuting has become part of our daily routine which results in spending a signiﬁcant part of our time with traveling. Public transportation offers a viable alternative for commuters, especially in crowded cities, reducing private vehicle usage, fuel consumption, environmental pollution and alleviating traf- ﬁc congestion. However, it is important for the passengers to have accurate information about the arrival time at the stops of the public transit vehicles. Otherwise, unexpectedly long waiting times can frustrate passengers and divert them from using public transport. It is even better if some other extra information about the arriving vehicle, for instance the congestion level or baby buggy friendliness, is also provided, making travel planning easier. Fortunately, most transit operators of big cities make their timetables or transit feeds freely available for the passengers or third party application developers. The General Transit Feed Speciﬁcation (GTFS) [1] – originally developed by Google – evolved into the de-facto standard to exchange transit feeds publicly. However, the widely used GTFS format enables only the exchange of static transit information (e.g., departure schedules, travel time, operating hours), which does not reﬂect live trafﬁc conditions. Remedy to live updates is the relatively new real-time extension to GTFS [2], which relies on operators to invest into a real-time ﬂeet tracking and communication infrastructure to be able to provide live service updates. Unfortunately, nowadays very few operators offer such services due to the necessary investments. Another approach to live updates is using mobile crowd- sensing 1 [3] and let the crowd collect the information re- quired to extend the basic timetable service. In this case, voluntary passengers sense and send live service updates (e.g., delays, congestion, hazard, etc.) to a service provider via their smart-phones. The service provider then aggregates, cleans, analyzes the collected data, and derives and disseminates the live updates to the users. For sensing, the built-in and ubiquitous sensors of the mobile phones can be used either in a participatory or an opportunistic way depending on whether data collection happens with or without user involvement. The contribution of every traveler can be useful. Hence, passengers waiting for a trip can report the line number with a timestamp of every arriving public transport vehicle at a stop during the waiting period. On the other hand, on-board passengers can send actual location information of the moving vehicle periodically and report events of arrival at/departure from the stops. Although the participatory sensing based approach is a viable alternative, it faces several challenges. The basic chal- lenge, similarly to other crowdsensing based services, is how passengers can be motivated to participate in data collec- tion. We believe, that a day zero service, which is provided from the beginning and improved, based on the crowd-sensed data, following incremental service development, can be an appropriate incentive. Along this line, as a sequel work to [4] and [5], we propose and investigate a scenario, where a static transit feed as the day zero service is improved and updated with live service updates from participatory users. We show how such a scenario can be implemented by the Extensible Messaging and Presence Protocol (XMPP) [6]. We evaluate different XMPP server setups under load. We present measurement results showing that even commodity PCs are able to carry on with the load and handle several thousand users, and that XMPP server clustering can remedy service scaling. We show, that with our architecture and a GTFS emulator static and live service updates can easily be combined introducing incremental service improvements based on mobile participatory users. The rest of the paper is structured as follows. In Sec. II we 1 We use the terms crowdsensing, crowdsourcing and participatory sensing interchangeably in this paper. – 251 – INES 2014 • IEEE 18th International Conference on Intelligent Engineering Systems • July 3-5, 2014, Tihany, Hungary 978-1-4799-4615-0/14/$31.00 ©2014 IEEE