Expertise Discovery in Decentralised Online Social Networks Sana Showkat Ara Insight Centre For Data Analytics, NUI Galway Galway, Ireland sana.ara@insight-centre.org Subhasis Thakur Insight Centre For Data Analytics, NUI Galway Galway, Ireland Subhasis.thakur@insight-centre.org John G. Breslin Insight Centre For Data Analytics, NUI Galway Galway, Ireland John.breslin@insight-centre.org ABSTRACT Distributed Social Networks (DSNs) are the solution to the privacy and security problems of online social networks. In DSN, a user controls their own data as it chooses personal storage for its so- cial network data. In absence of a centralized entity with access to all social network data, information retrieval becomes dicult in DSNs. In this paper we propose to use crowd sourcing for informa- tion retrieval in a DSN. We analyze a popular information retrieval problem called expert search in a social network. In this paper, we present an algorithm for such a crowd sourcing based search process which includes solution for (a) the worker selection prob- lem (b) the task selection problem and (c) the reward distribution problem. Using experimental evaluation, we show that, the search algorithms proposed in this paper can be as ecient as a greedy search algorithm with access to entire social network information. KEYWORDS Expertise Discovery, Decentralized, Online Social Network ACM Reference format: Sana Showkat Ara, Subhasis Thakur, and John G. Breslin. 2017. Exper- tise Discovery in Decentralised Online Social Networks. In Proceedings of ASONAM ’17, Sydney, Australia, July 31-August 03, 2017, 6 pages. https://doi.org/10.1145/3110025.3110048 1 INTRODUCTION Social networking has become the most popular communication medium of human life. However, centralised social networks have security and privacy problems. In a social network, users store large amounts of private data and they have very limited authority to control their own data. Social network provider have immense opportunity to extract, analyze user’s data anonymously and they may handover user data to third parties for personalised advertising. Therefore, the absence of condentiality and data ownership is the most discussed drawback of a centralized social network. Distributed Social Network (DSN) can provide a potential mit- igation of the security and privacy problems of OSNs. In DSN, a user chooses the storage location of its data and it controls the access to such storage. This helps to eliminate certain privacy and security issues. The benets of distributed social networks are open source code, decentralized, not owned by any central authority ACM acknowledges that this contribution was authored or co-authored by an employee, contractor or aliate of a national government. As such, the Government retains a nonexclusive, royalty-free right to publish or reproduce this article, or to allow others to do so, for Government purposes only. ASONAM ’17, July 31-August 03, 2017, Sydney, Australia © 2017 Association for Computing Machinery. ACM ISBN 978-1-4503-4993-2/17/07. . . $15.00 https://doi.org/10.1145/3110025.3110048 and as a result it gives back control of the data to users. In some decentralized social networks, user data remain in user machine [1, 2]. Information retrieval and search is dicult in DSN compared with OSN. In OSN, the social network provider has access to the entire social network data. In OSN, a query from a user can be eciently answered with sophisticated indexing techniques on the entire social network data. But there is no centralized entity in a DSN with access to all the social network data. Asknext [10] developed a protocol to search for an agent in OSN and its requires complete data of the OSN. The algorithms developed in this paper are ecient for incomplete OSN, specially decentralized OSN. In this paper, we assume that a search process in DSN will use the following procedure: A user depends on its contacts in the DSN for an answer to its query, i.e., a user sends a query to its neighbours in a DSN who may have the answer and may know any other user who has the answer. So, it sends the query to its neighbours and who may forward it to their respective neighbours. For example, a user Sam is looking for a knowledgeable person in machine learning. Alice is another user of the social network who is unknown to Sam but they have a common friend Bob. Bob regards Alice as a machine learning expert though Alice’s activities in the social network. In this scenario, Bob will recommend Alice to Sam. Note that, in a OSN, this search could be answered by the OSN provider who has access to the entire social network information. But in a DSN, a user depends on its neighbors for such a search. Such a search initiated by a user may resemble a cascade on the network as the query spreads on the DSN. Such a search process can be very inecient. This is because, users may not participate in the search, users may not have the knowledge or expertise to answer the query or even route the query to the users who may have the answer. To overcome the above problems, in this paper we employ crowdsourcing. The proposed search process solves three problems. Worker selection problem: We need to select workers who are most likely to answer the query or route the query to the users who has the expertise to answer it. Task selection problem: We need tond a suitable subset of the recommended neighbours of the workers to whom the the workers will propagate the query. Reward distribution problem: We need tond a payment scheme for the workers who participate in the search. The payment scheme should encourage truthful behaviour of the users. 2017 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining 244