PHYSICAL REVIEW E 89, 042812 (2014) Intergroup networks as random threshold graphs Sudipta Saha, Niloy Ganguly, and Animesh Mukherjee Department of Computer Science and Engineering, Indian Institute of Technology, Kharagpur 721302, India Tyll Krueger Department of Computer Science and Engineering, Technical University of Wroclaw, Poland (Received 9 February 2014; published 24 April 2014) Similar-minded people tend to form social groups. Due to pluralistic homophily as well as a sort of heterophily, people also participate in a wide variety of groups. Thus, these groups generally overlap with each other; an overlap between two groups can be characterized by the number of common members. These common members can play a crucial role in the transmission of information between the groups. As a step towards understanding the information dissemination, we perceive the system as a pruned intergroup network and show that it maps to a very basic graph theoretic concept known as a threshold graph. We analyze several structural properties of this network such as degree distribution, largest component size, edge density, and local clustering coefficient. We compare the theoretical predictions with the results obtained from several online social networks (LiveJournal, Flickr, YouTube) and find a good match. DOI: 10.1103/PhysRevE.89.042812 PACS number(s): 89.75.Fb, 89.90.+n I. INTRODUCTION Group formation [13] is a very common and popular feature among humans where a user (human) can participate in multiple groups [4]. Consequently, many social network- ing sites like LiveJournal, 1 Flickr, 2 YouTube, 3 etc., provide explicit facilities to form, maintain, and communicate within social groups [5]. These groups can be deemed as a medium of mass communication among its participating users [69]; there is, however, an interesting side effect to this communication. Common users belonging to multiple groups pass information of one group to another. Hence, analyzing the extent of connectivity among the groups can shed light into the amount of information propagated from one group to another. This connectivity structure can be best estimated by analyzing the properties of an evolving intergroup network, which is the primary focus of this paper. Intergroup networks can be modeled as the one-mode projection of evolving user-group bipartite networks. In the bipartite process, the user partition grows with time, while, if we consider only the popular groups, the group partition remains fixed. Such bipartite networks where one partition remains fixed is termed a alphabetic-bipartite network (α-BiN) [10]. A projection on the groups allows us to obtain the group-group network (i.e., the intergroup network) where two groups are neighbors if they have at least one common user. However, it can be safely assumed that two groups will have high mutual interaction, thus allowing more information to propagate if there are at least a critical number of common users (determined by a threshold value) [5,11]. Therefore, an interesting structure to study is the “pruned intergroup network” where two nodes (groups) are connected if they have more than a threshold number of common users. The first attempt to understand the “pruned intergroup network” was made in Ref. [5], where by mapping the 1 LiveJournal: www.livejournal.com 2 Flickr: www.flickr.com 3 YouTube: www.youtube.com underlying evolution dynamics to a Polya urn model, the degree distribution of the pruned intergroup network is derived. However, in order to gain a better insight into the connectivity structure, the more relevant structural properties such as largest component size, edge density, and local clustering coefficient (assuming a large number of groups) under various possible threshold values need to be analyzed as a first step. In this paper, we identify the mathematical relationship between the weight of an edge and the weights of the associated nodes (later these node weights are referred to as “attractiveness parameters”), derived in Ref. [5], as special importance. We heavily leverage on this relationship to derive the formula for the above mentioned structural properties of the “pruned intergroup network.” We also show that this class of networks can be appropriately modeled by a special variant of “random threshold graph”[12], termed a “multiplicative random threshold graph.” We compare the theoretical predictions with the same obtained from the available real datasets, and in most of the cases, we obtain a significantly accurate match. Hence, the main contributions of this paper are twofold: (a) we show how the intergroup relationships in social systems and in a more general sense the pruned one-mode projection of a preferentially grown α-BiN can be studied as a special kind of random threshold graph model, and (b) we show how the mathematical analysis of various structural aspects (degree distribution, largest component, edge density, as well as local clustering coefficient) of this specific kind of multiplicative random threshold graphs can be done in a more transparent way. The rest of the paper is organized as follows. In the next section, we precisely describe the basic model of α-BiN and the special property of the intergroup networks that allows us to map it to multiplicative random threshold graphs. In Sec. III we present a detailed description of the mathematical analysis of degree distribution, edge density, largest component, and the local clustering coefficient. Next, in Sec. IV we compare the mathematical findings with the observations made from the real dataset. Finally, in Sec. V we present a brief review of the state-of-the-art before drawing the conclusion. 1539-3755/2014/89(4)/042812(11) 042812-1 ©2014 American Physical Society