- Research
- Open access
- Published:
Layer entanglement in multiplex, temporal multiplex, and coupled multilayer networks
Applied Network Science volume 5, Article number: 89 (2020)
Abstract
Complex networks, such as transportation networks, social networks, or biological networks, capture the complex system they model by often representing only one type of interactions. In real world systems, there may be many different aspects that connect entities together. These can be captured using multilayer networks, which combine different modalities of interactions in a single model. Coupling in multilayer networks may exhibit different properties which can be related to the very nature of the data they model (or to events in time-dependent data). We hypothesise that such properties may be reflected in the way layers are intertwined. In this paper, we investigated these through the prism of layer entanglement in coupled multilayer networks. We test over 30 real-life networks in 6 different disciplines (social, genetic, transport, co-authorship, trade, and neuronal networks). We further propose a random generator, displaying comparable patterns of elementary layer entanglement and transition coupling entanglement across 1,329,696 synthetic coupled multilayer networks. Our experiments demonstrate difference of layer entanglement across disciplines, and even suggest a link between entanglement intensity and homophily. We additionally study entanglement in 3 real world temporal datasets displaying a potential rise in entanglement activity prior to other network activity.
Introduction
A real world complex system often counts multiple interactions between multiple different entities. When these interactions are regrouped under multiple families of entities, multilayer network modelling becomes a tool of choice to capture the key components of the system. The use of this model emerges in all fields of science from social sciences to finances, logistics, biology, and many more (Kivelä et al. 2014).
With multilayer networks, the study of multiple viewpoints [or aspects (Kivelä et al. 2019)] on the same network data becomes possible. This is critical for example in social network analysis, to study the role of users in different networks, and compare them (for example the same individual may behave differently on LinkedIn, Twitter, or Facebook). These different networks form different types of links that may be overlaid.
Motivated by their practical interest, multilayer networks also show interesting structures (Battiston et al. 2014) that could be exploited to mine community structures or study the roles of nodes and edges through centrality, for example. These are also possible in a traditional network analysis standpoint but often requires some kind of simplification (such as one-mode projection) but recent advances show that interesting structures can be obtained directly from the multilayer networks (Gomez et al. 2013; Chen et al. 2018; Škrlj et al. 2019).
The key concept in multilayer networks are the layers themselves. Since the structure of such networks is driven by the layers and their aspect (Kivelä et al. 2014), understanding how the layers organise can reveal properties unique to a given multilayer network model (Renoust et al. 2015; Škrlj and Renoust 2019). Particularly, the intertwining of edges, or layer entanglement (Renoust et al. 2014, 2013), shows how layers overlap to form coherent structures and substructures.
Although recent works have focused on multilayer network analysis and description (Wang et al. 2018; Omodei et al. 2015), not many have focused on a large scale analysis of multilayer networks of different nature—and produced in different disciplines, while comparing them to synthetic models. One comparative study of flow analysis (De Domenico et al. 2015a) has particularly influenced this paper where emerging structures are described, albeit not compared to synthetic models.
In their seminal work, McPherson et al. (2001) discuss how ties emerge in social systems. They investigate how people similarity, i.e. homophily, is a strong driver to the formation of ties, with the addition to make them more durable in a dynamic system. They investigate social ties in a multilayer manner, and argue for further research: “in the impact of multiplex ties on the patterns of homophily; [and] the dynamic of network change over time [...]”. Our original work (Škrlj and Renoust 2019)—that we extend in this paper—particularly resonates with the first point of McPherson et al., in that we displayed a link between homophily (McPherson et al. 2001; Borgatti et al. 2009) in social networks and high entanglement intensity networks.
This paper extends (Škrlj and Renoust 2019), which originally contributed with an open source implementation of entanglement homogeneity and intensity for multiplex networks, while evaluating them over 30 real world networks. We proposed also a synthetic multiplex network generator. A generation of over 10k synthetic networks, and their comparison with the real world networks, displayed common patterns of entanglement homogeneity and intensity that could be specific to the families of applications that generated the networks. In this extended work, we contribute with:
-
The theoretical extension of the entanglement computation to a fully multiplex model that takes into account coupling edges;
-
The extension of our synthetic generator accordingly;
-
The computations on a wider range of real and synthetic networks (1,329,696 synthetic networks were considered);
-
The study of entanglement in large, temporal multiplex networks;
-
An open-source implementation of all conducted experiments.
Coupled multilayer and multiplex networks
A multilayer network can be defined as a sequence \(M = \{G_l\}_{l \in L} = \{(V_{l},E_{l})\}_{l \in L}\) where \(E_{l} \subseteq V_l \times V_l\) is a set of edges in one network \(l \in L\) of the sequence (Kivelä et al. 2014). Multilayer networks are commonly understood as layers comprised of interactions, where each layer corresponds to a specific aspect of the system. Coupling accounts for transitions between layers. Kivelä et al. (2014) consider a multiplex network as “diagonally coupled multilayer networks in which each layer shares at least one node with some other layer in the network”. They consider also node-aligned multiplex networks, which do not specifically address coupling of nodes, but assume that nodes are shared (and coupled) across all layers. In our context, we refer to coupled multilayer networks when we specifically consider networks with coupling between nodes across layers, and simply to multiplex networks when considering node-aligned multiplex networks. The difference between these two types of multiplex networks is only whether we consider or not the coupling between layers. In multiplex networks, nodes represent the same entity across all layers.
We represent a multiplex network as a structure \(M'=(V_M, E_M)\), where \(V_M\) is the set of nodes and \(E_M\) the set of all edges (in all layers). \(\mathbb {V}\) denotes the super set of all nodes, and \(\mathbb {E} = \mathbb {V} \times \mathbb {V}\) the super set of all edges, regardless of the layers. There may exist coupling edges connecting nodes through layers, forming transition coupling. This may concern, for example, coupled multilayer networks which are modelling transportation systems (Cozzo et al. 2015). In that case, we can differentiate the elementary layers (holding inner-layer edges) from the transition coupling (holding coupling edges). Each transition coupling \(t = (l, l')\) between layer l and \(l'\) can be modelled similarly to a layer, with a set of nodes and edges. If \(S \subset L\) represents the subset of all elementary layers, and \(T \subset L\) the subset of all transition coupling, we may define our coupled multilayer network M as the union. It combines a multilayer network with elementary layers only, and another multilayer network with transition coupling only \(M = \{G_l\}_{l \in L} = M_S \cup M_T = \{G_s\}_{s \in S} \cup \{G_t\}_{t \in T}\). The coupling can heavily influence the structural behaviour of multilayer networks (Cozzo and Moreno 2016). It can also influence the resilience of the network against failures (De Domenico et al. 2014) and naturally the diffusion phenomena (Tejedor et al. 2018) too.
Among other examples of coupled multilayer networks, a biological system can be studied at the protein, RNA, or gene level (Valdeolivas et al. 2018). Similarly, social networks can be studied by taking into account a person’s presence on multiple platforms (Mittal and Bhatia 2019). For computational purposes, such networks are commonly represented in the form of supra-adjacency matrices, where block-diagonal structures connect the same node across individual layers emerges (Cozzo et al. 2015). Algorithms can operate on such matrices directly, and thus exploit additional information representing multiple aspects.
Algorithms for analysis of multilayer networks can also operate on sparse adjacency data structure of the multilayer network directly. Yet, they need to take into account that a given node is present in multiple layers. Such representation is suitable for this work, as we are focused primarily on how edges co-occur across layers. Hence, this work focuses primarily on the relations between the layers of a given multilayer network. We next discuss the two measures we consider throughout this work.
Entanglement in multiplex networks
We briefly recall the definitions of entanglement measures from previous work (Renoust et al. 2014).
Layer interaction network
Recall the notion of a multiplex network \(M=(V_M, E_M)=\{G_l \}_{l \in L}\). As mentioned earlier, such a network really distinguishes itself from classical graphs through the use of different layers to connect nodes. These layers may have different patterns and may overlap together. There may even exist latent dependencies among these layers. To investigate this matter, each layer could be abstracted to one single node and form a new graph, the Layer Interaction Network (hereafter LIN) (Renoust et al. 2014). Visualizing the LIN is a key component for multiplex network visualization such as in Renoust et al. (2015). In the LIN, \(LIN=(L,F)\), each node \(u_{l}, u_{l'}, u_{l''} \ldots\) corresponds to a layer \(l, l', l'', \ldots \in L\) of the multiplex network M, and each edge \(f \in F\) captures when two layers overlap through edges. More formally, there exists an edge \(f=(u_l, u_{l'})\) whenever there exists at least two nodes \(v, v' \in V_M\) with the condition that there exists at least one edge connecting these two nodes on each layer \(e_M = (v,v') \in l\) and \(e'_M = (v,v') \in l'\). The LIN can be interpreted as an edge-layer co-occurrence graph, and the weight of an edge \(f = (u_l, u_{l'})\), denoted as \(n_{l, l'}\) equals the number of times layers l and \(l'\) co-occur. By extension, \(n_{l, l}\) is the number of edges on layer l. This process is illustrated in Fig. 1b.
Layer entanglement
The analysis of layer entanglement is inspired by the analysis of relation content in social networks (Burt and Schøtt 1985). The idea is to study the redundancy between relation content, each forming in our formalism a different layer. The layer entanglement measures the “influence” of a layer in its neighbourhood.
This measure is recursively defined: the entanglement \(\gamma _l\) of a layer l is defined upon the entanglement of the layers it is entangled with. Similarly to the eigen centrality (Wasserman and Faust 1994), this translates into the recursive equation:
The entanglement of a layer \(\gamma _l\) can be retrieved from a vector \(\mathbf {\gamma }\) which corresponds to the right eigenvector (associated to the maximum eigenvalue \(\lambda\)) of the layer overlap frequency matrix with corresponding overlap, defined as:
this metric was initially introduced in Burt and Schøtt (1985), then later constructed using the weights in the LIN (Renoust et al. 2014) (see Figs. 1 and 2).
Entanglement intensity and homogeneity
The layer entanglement \(\gamma _l\) measures the share of layer l overlapping with other layers. The more a group of layers interacts together, the more the nodes they connect will be cohesive in view of these layers, hence the more \(\gamma _l\; \forall l \in L\) values will be similar (their share of entanglement will be similar). This is captured by the entanglement homogeneity (Renoust et al. 2014) which is then defined as the following cosine similarity:
With \(\mathbf {\mathbf {e}_L}=[1,1, \ldots , 1]_L\) the vector of size L all filled with 1’s. Optimal homogenity is not necessarily reached only when all nodes are connected through all layers, but also when all nodes are connected in a very balanced manner between all layers (see Fig. 2). Homogeneity thus permits various symmetries in a given LIN.
When a maximum overlap is reached through all layers in the network, the frequencies in the matrix C (of size \(|L| \times |L|)\) are saturated with \(C_{i,j}=1\). This gives us a theoretical limit to measure the amount of layer overlap through the entanglement intensity (Renoust et al. 2014), defined as:
In practice, both entanglement intensity and homogeneity have been used to measure the coherence of clusters of documents (Renoust et al. 2013).
Transition coupling entanglement
We have defined the layer entanglement which measures overlap between layers of a multiplex network, but many multiplex networks include another critical parameter which is coupling edges (Battiston et al. 2014). The coupling often measures the transition of nodes between layers, hence the transitions of nodes are captured by edges connecting nodes across layers.
Recall our multiplex graph \(M = (V_M, E_M)\). Suppose S is the set of elementary layers, we can then have transitions between any pair of elementary layers \(l \in S\) and \(l' \in S\). Let \(u_l = (u,l), u \in V_M, l \in S\), the connection of a node u within a layer l. A transition coupling edge e can be defined as follows: \(e=(u_l,v_{l'}) \in E_M\) such that e connects nodes \(\{u, v\} \subseteq V_M\) across layers \(l \ne l', \{l, l'\} \subseteq S\). Coupling edges often connect a same node across two layers and may be used to model a physical transition, such as a change from subway to train in a station of a transportation network. As a consequence, a pair of layers \((l, l')=t\) forms a transition coupling \(t \in T\) when there exists at least one such edge \(e=(u_l,v_{l'}) \in E_M\). Note that taken together, these elementary and transition coupling subsets form the set of all layers \(S \cup T = L\), and that the size of T is bounded by the size of S such that \(|T| \le \frac{1}{2}|S|(|S|-1)\).
Now, given this definition, nothing limits the computation of entanglement (introduced in “Layer interaction network”, “Layer entanglement” and “Entanglement intensity and homogeneity” section) only to the elementary layers part of \(M_S\), as illustrated in Fig. 3. Entanglement can also be used to characterise the coupling between these elementary layers if applied only to the edges of the transition coupling \(M_T\).
The nature of coupling often captures a very distinct characteristic of the network in comparison to its elementary layers. A transition coupling edge mostly connects the same node across layers, while elementary layers do not always display loops. These cases may happen on rare occasions, one example being an underground path connecting subway stations being modelled as a transition coupling, but the literature is very poor regarding such examples. It is however technically possible to consider both elementary layers and transition coupling in one multiplex network M to compute entanglement (as shown in Fig. 4), but we keep this discussion for the “Appendix”. In practice, the intensity and homogeneity greatly differ between them, and often result in clearly separated components of the LIN.
A coupled multilayer network generator
In this section, we describe an algorithm which generates synthetic coupled multilayer networks, i.e. multilayer networks which share some nodes across some layers, but do not guarantee that all nodes are being shared between all layers. These kinds of networks make the link between general multilayer networks and node-aligned multiplex networks [for which the assumption is that all nodes are shared through all layers (Kivelä et al. 2014)].
The algorithm is based on the following observations. Let \(M = (V_M, E_M)\) represent a coupled multilayer network with layer set L. Each node is associated to a random number of layers \(\{l_1, l_2, \ldots , l_i\} \subseteq L\). Now for each layer \(l_i \in L\) there is a set of nodes \(V_{l_i} \subseteq V_M\) which forms a potential set of edges of size \(|E_{l_i}| = \frac{1}{2}|V_{l_i}|(|V_{l_i}|-1)\). We introduce o, a parameter determining the probability of a node occurring at a given layer. We then introduce the probability p of an edge to be created between any pair of nodes belonging to a layer so we may avoid cliques to form on each layer. We referred in our previous work to the edge dropout (Škrlj and Renoust 2019), which is \(d = 1-p\) as the share of links we drop from the clique model. Intuitively, the more similar a given random multiplex is to a clique over each layer, the higher its elementary layer intensity should be. Hence, high intensity implies larger probability that two given nodes will have an edge between them on more than one layer. The generator also accounts for coupling by adding transition coupling edges. These coupling edges are connecting nodes across two layers. We introduce q, the probability for a same node to be connected across two layers. The higher q, the more nodes will be connected through layers. Note that in our initial work (Škrlj and Renoust 2019), neither o nor q were considered (o was in fact picked uniformly).
The purpose of this generator is to offer a simple testbed for further exploration, as well as additional evidence of the relation between homogeneity and intensity on many random, synthetic networks. The Algorithm 1 represents the proposed procedure.
The generator first randomly assigns the same node index to the many layers (lines 2–5). Once assigned, the layers are processed by applying sampling on \(|V_{l_i}| \atopwithdelims ()2\) possible edges in layer \(l_i\). Note that in line 7, this whole clique is virtually generated. The global multiplex is updated during this process (lines 6–10). These steps are then repeated for each transition coupling i.e. pairs of elementary layers (lines 11–14). The implementation thus uses a generator, for which lazy evaluation avoids potential combinatorial explosion when considering a large number of nodes and low edge probability.
Some theoretical properties of the generator
In this section we show two properties of the proposed generator. We denote \(n = |V_M|\) the parameter setting the number of nodes of the network, \(m = |L|\) the parameter setting the number of edge layers in the network, and p the inner-layer edge probability. Let \(\phi \in \mathbb {N}^{+}\) represent the number of possible edges. Then \(\phi \le m \cdot \left( {\begin{array}{c}n\\ 2\end{array}}\right) .\) Let \(o=1\). Each layer can have at most n nodes. Assuming they form a clique, each layer is thus comprised of \(n \atopwithdelims ()2\) edges. As there are m layers, there can be at most \(m \cdot \left( {\begin{array}{c}n\\ 2\end{array}}\right)\) edges — a clique of n nodes in each layer (assuming \(p = 1\)). We refer to this bound as \(\phi \le m \cdot \left( {\begin{array}{c}n\\ 2\end{array}}\right)\).
In the limit, as \(p \rightarrow 1\), a full clique needs to be constructed, assuming each node is projected across all layers. The complexity w.r.t. the number of layers and edges is: \(\mathcal {O}(m \cdot \left( {\begin{array}{c}n\\ 2\end{array}}\right) ) = \mathcal {O}(|E_M|).\) Note that, even though theoretically, the proposed generator creates a clique and then samples from it, current, lazy implementation only generates the edges needed to satisfy a given p percentage. In practice, only when \(p \approx 1\), the generator needs larger portions of space (and time). As such, fully connected networks do not represent real systems, we were able to generate a multitude of very diverse networks. This generator-based implementation does not imply that large spatial overheads are not possible: such situations occur when very dense networks are considered.
We next discuss the impacts of q parameter. The number of coupling edges has a worst case complexity of \(\mathcal {O}(\left( {\begin{array}{c}m\\ 2\end{array}}\right) \cdot n)\) since q directly depends on the number of layers available. Let \(l_a\) and \(l_b\) represent a given pair of layers, where each layer consists of all n possible nodes. As each node couples only to itself, there are at most n edges between \(l_a\) and \(l_b\). As there are \(\left( {\begin{array}{c}m\\ 2\end{array}}\right)\) possible layer pairs, if nodes are in each pair fully coupled, the network can have at most \(\left( {\begin{array}{c}m\\ 2\end{array}}\right) \cdot n\) coupling edges.
However, is that also the case when considering only transition coupling? Consider the following example of a multiplex network without the coupling edges. No matter what p is employed, if \(q \approx 0\), coupling intensity will be low—very few coupling edges are introduced, the observed LIN will be very sparse. Hence, we posit that the distribution of intensity shall be constant with respect to a given p. The proof of this claim is by contradiction. We assume that p would indeed influence coupling entanglement intensity. Since transition coupling intensity is defined solely based on the coupling edges, this claim would imply a dependency between p and q, which is by the definition (and design) not the case. Even if the nodes are isolated in each layer, transition coupling intensity can be high. Note also that the node positioning, governed by o, directly impacts both elementary and transition coupling entanglement, since there is higher possibilities for edges to overlap when nodes belong to many layers. These points are illustrated in our “Empirical evaluation” section and further in the “Appendix”.
Layer entanglement in temporal multiplex networks
Analysis of temporal multiplex networks has shown promising results in multiple fields of science, such as for example healthcare and transportation (Sannino et al. 2017).
Since patterns of layer interaction networks result in typical entanglement values, considering temporal entanglement is related to sizing particular topologies of a temporal multiplex network. For example, a high intensity among members in a multiplex social network communicating through different social media corresponds to a synchronization of communications between them. When such a synchronization corresponds to the preparation of a particular event, understanding such synchronization could help forecast the event.
In this section, we first discuss how we define temporal multiplex networks and entanglement time series. We limit the following discussion to the consideration of entanglement between elementary layers only, i.e. only inner-layer edges.
Temporal multiplex networks and entanglement
Real-life networks often evolve over time, making them behave differently at different points. In our current setting, we define the temporal aspect of our network such as each edge \(e_t\) is defined at a specific time point t. A multiplex network \(M_{d}\) can then be defined for a given time window d. A time window \({d} = [t_0, t_f]\) covers a time frame (beginning at \(t_0\) and ending \(t_f\)), and the multiplex network \(M_d\) is defined such as each edge exists within the time window:
The second scenario we considered is that of moving time windows. Here, edges from the f past windows are considered when constructing a given network M, i.e.,
Our intuition is to compare the shape of a network at different moving time windows. For example, we could compare political social networks under different rulers of a country (Renoust et al. 2016a, b). To do so, we can simply compute entanglement homogeneity and intensity for each time window and compare them. Since our computation only focuses on edge, we consider the network as multiplex, the nodes are shared across all time frames.
Slicing the time windows is a very different topic and many options are open (Gomez et al. 2013; Beck et al. 2014). For example, it could be achieved manually, with equal time slices, moving window, or with volume of changes. In our context, we consider the identification of time window through slices of equal duration in time, but the principle can be extended. We refer to the duration r in time of the slices as time resolution.
We may now investigate entanglement homogeneity and intensity properties with respect to time resolution (r), and verify if patterns of intensity/homogeneity variation can be predicted. Note that one challenge of slice-based modelling of temporal multiplex networks is the problem of selecting the correct resolution r, i.e. how coarse (or fine)-grained the intervals must be in order to capture desired dynamics.
In a system covering a global period of D, once a slicing resolution is chosen, we can observe values of homogeneity and intensity at the time series level, i.e. for each slice \(d \in D\), and define the intensity time series \({\mathbb S}_{I} = \{I_{M_d}\}, \forall d \in D\) and the homogeneity time series as \({\mathbb S}_{H}= \{H_{M_d}\}, \forall d \in D\). These intensity and homogeneity time series can now feed further processing. Note that \(\mathbb {S}_{I_f}\) and \(\mathbb {S}_{H_f}\) are defined analogously (entanglement for the past f slices, moving in the increments of one slice). The whole processing from temporal edges to time series is illustrated in Fig. 5.
In our following evaluation (“Entanglement in temporal multiplex networks” section), we explore \(\mathbb {S}_I\) and \(\mathbb {S}_H\) when also considering a moving window of previous f time slices. The rationale for considering past f slices up to the considered time point is that such information only includes past data, and could indicate whether entanglement can be also used for forecasting purposes. The second option considered, where only the current time slice was plotted, can shed insight on whether online monitoring based on I or H is a sensible option.
Empirical evaluation
We now study entanglement intensity and homogeneity across different series of networks. We first investigate entanglement measures across different parameters of synthetic settings. We follow with investigations on a large panel of real world networks. We finish our study with the study of entanglement in temporal multiplex networks.
Entanglement in synthetic networks
In this first study, we compare entanglement measures over a series of synthetic multiplex networks, using our proposed generator.
We consider for all our generations, the following key parameters:
-
Number of nodes (n) from 10 to 200 in increments of 10.
-
Number of layers (m) in 1, 2, 3, 4, 6, 7, 9, 10.
-
Layer assignment probability (o), from 0 to 1 in increments of 0.05
-
Edge probability (p) from 0 to 1 in increments of 0.05.
-
Transition coupling edge probability (q) from 0 to 1 in increments of 0.05.
Multiplex networks without transition coupling
A first generation concerns multiplex networks settings in which transition coupling is not specified (for example, friendship over different social platforms), so we do not consider parameter q here.
We have generated in total 1,329,696 synthetic networks (a couple are illustrated in Fig. 6).
We measure entanglement intensity I and homogeneity H on each generated network (averaged over all connected components of the layer overlap frequency matrix). We investigate the role of the different parameters over the entanglement measures, as illustrated in Figs. 7, 8 and 9.
There is an obvious dependency between entanglement intensity and homogeneity since we cannot obtain low homogeneity with high intensity values (Fig. 7). This is due to the nature of both measures. With a high intensity, most of the layers are overlapping over most of the network. As a consequence, there is little space for permutations in the way layers overlap, this means the entanglement of all individual layers \(\gamma _l\) tends to align, hence resulting in high values of homogeneity. This leads to a denser production of high homogeneity networks as illustrated by the density lines in Fig. 7.
The number of nodes n and edges m do not show a strong dependency with homogeneity, but a slight one on intensity. Higher values of n and m make it easier to obtain sparser networks, with the consequence of resulting lower values of intensity. We further illustrate these in Fig. 8. This effect mitigates quickly with higher numbers of nodes and layers.
We further explore the layer assignment probability of a node o, and the inner-layer edge probability p in Fig. 9. There is a first dependency appearing on the layer assignment probability o, for which higher values tend to produce higher homogeneity (Fig. 9b). Higher homogeneity is reached when all layers contribute equally, meaning that a higher o shows more chances for each layer to contain most of the nodes. We may also observe apparent linear trend between the edge probability p (sparseness) and entanglement intensity (Fig. 9d). This trend confirms that sparser networks (i.e. lower p) are less “intensely” overlapping over edges. As intensity directly measures this property, this result outlines one of the desired properties of the proposed network generator.
Multiplex networks with transition coupling
A second experiment is focusing on multiplex graphs with transition coupling, i.e. considering only the coupling edges in our 1,329,696 generated networks (illustrated in Fig. 10). This experiment reproduces the previous one, but focusing on the transition coupling entanglement. Results are shown in Figs. 11 and 12, dependency on the number of nodes and layers is illustrated in “Appendix”. From Fig. 11, the shape is globally the same, with the difference in a skewed density of high-homogeneity without a dense production of very low intensity generated networks (from the density lines).
The profile is sensibly the same than that of the previous experiment, except that the layer assignment probability o appears to have a more diffuse impact, and the direct dependency is this time observed on the coupling edge probability q. Comparison with parameter p obviously does not influence entanglement, but can be found in “Appendix” for additional inspection.
Overall, the networks with transition coupling are more saturated when compared to the ones without transition. The reason may be that we only consider here transition coupling edges that only connect the same node across layers.
For the interested reader, we also illustrate in the “Appendix” the independence of parameters q over the elementary layer entanglement and p over the transition coupling entanglement. We also report there the computation of entanglement over the combined elementary layers and transition coupling, which displays a dependency on both p and q parameters. Finally, we have computed the layer correlation coefficient, as suggested in Nicosia and Latora (2015), confirming the role of the different parameters of our generator.
Multiplex network comparison across disciplines
We now consider real world static networks. All considered networks are summarised with their main characteristics in Table 1Footnote 1. Unfortunately, we have not found a real case with a large number of transition coupling edges, so we limit this evaluation to elementary layer entanglement. For each network, we computed elementary layer homogeneity and intensity for all connected components.
We first investigate individual results through the distributions of each metric across network types, Fig. 13. We then compare individual networks across entanglement intensity and homogeneity Fig. 14.
Two main observations are apparent when studying the results on real networks. First, the difference between social and genetic (biological) multiplex networks becomes obvious when both entanglement intensity and homogeneity are considered (Fig. 14). To confirm these differences, we further compare their distributions, i.e., the intensity and homogeneity of social versus genetic networks, in Fig. 15.
In addition, from Fig. 14, we may observe that many genetic networks sit in relatively low intensity/homogeneity places, whereas social networks sit in the top right corner: the high entanglement homogeneity of social networks is quite noticeable. This suggests a few interpretations:
-
genetic networks show in general very little layer overlap;
-
some genetic networks could be matched to synthetic networks of low inner-layer edge probability, especially when homogeneity is low, being very sparse, potentially pointing at low layer assignment probability too;
-
layers in social networks tend to overlap a lot;
-
social networks tend to be quite dense and may be simulated by synthetic networks with a high inner-layer edge probability;
The results on social networks indicate a high level of layer overlap and it may be due to the overall behaviour of people, which is rather similar across different networks, whatever their means of interaction. Simmelian ties, triadic closure, and homophily (which are well studied in social sciences) are probably strong drivers of this layer overlap.
Entanglement in temporal multiplex networks
In our last experiment, we investigate entanglement across time slices of three real-life temporal multiplex networks: MLKing2013, MoscowAthletics2013, and Cannes2013 [as found in Omodei et al. (2015)]. Each network consists in a collection of Twitter activity related to some event. The networks are comprised of three layers of connection, namely retweets, replies and comments. They can be summarised as follows. The MLKing2013 data set consists of 421,083 events covering a week of celebration of M.L. King’s speech “I have a dream” in 2013, forming 396,671 edges between 327,708 nodes. The MoscowAthletics2013 data set consists of 303,330 events covering two weeks of the World Championships of Athletics held in Moscow in 2013, forming 210,250 edges between 88,805 nodes. The Cannes2013 network consists of 1,297,545 events (temporal edges) covering a month of the 2013 Cannes Film Festival, together forming a network of 930,419 edges and 438,538 nodes. Note that the networks are not trivially small, offering additional evidence of the stability of the entanglement computation.
The networks were analysed following the methodology introduced in “Layer entanglement in temporal multiplex networks” section. We propose two experiments with regard to time segmentation.
The first experiment considers fixed time windows of sizes 1h, 3h, 6h, and 12h. We compare with the activity volume in form of a total number of tweets—as found in Omodei et al. (2015), Figure 1 for a 1h window size, here reported in Figs. 16a, 17a, and 18a. We normalise here this volume so values are in [0, 1].
We selected the coarse windows at their best readability for each dataset (3h for MLKing2013 in Fig. 16b, 6h for MoscowAthletics2013 in Fig. 17b, and 12h for Cannes2013 in Fig. 18b)—each coarsening is further illustrated in “Appendix”. A second experiment considers a moving window of the size corresponding to these best windows, sliding by the hours (Figs. 16c, 17c, and 18c).
In the MLKing2013 data set (Fig. 16), we can observe that spikes of intensity surround the main spike of volume activity. A smaller spike of intensity consistently coincides with a smaller spike of volume at the end of the main spike.
In the MoscowAthletics2013 data set (Fig. 17), the 1h-time window does not show a consistent behaviour. However, we can see that spikes in coarser time windows coincide with the spikes in volume. A larger spike in intensity appears before the final spike in volume.
In the Cannes2013 data set (Fig. 18), the 1h-time window shows some spikes in intensity, especially a major by the end of the period of activity in terms of volume. In coarser time windows, we can notice four main spikes: one before the beginning of volume of activity; the next two ones appear just before a slight increase in the daily volume; the last one appears the day before the last day of the volume activity. This last peak appears even more prominent from the sliding window example.
The volume captures Twitter activity, governed by the human activity following the day/night rhythm. Although entanglement intensity is also submitted to it, we see emerging patterns that seem proper to each type of event. The activity of entanglement shows definitely some relationship with volume while telling a different story. The sports event that is MoscowAthletics2013 may be much more subject to the day-by-day routine in which different disciplines are at play. On the other hand, the speech celebration in MLKing2013 has some very specific activity before (could it be anticipation?) and after (could it be ripples?) the event. The movie festival in Cannes2013 may be governed by sub-events of different importance in terms of networking activity.
In accordance with the position of social networks in our evaluation of real-world networks in “Multiplex network comparison across disciplines” section, we see a decrease in homogeneity whenever we see spiking of intensity. This may indicate that a lot of the network activity suddenly focuses on one specific modality of exchange (such as replies). Entanglement study may help in targeting when this is driven by a particular modality.
Further studies on the nature of the events, and the specific topologies of the LIN networks that gave rise to these entanglement values is necessary for a more in-depth analysis of each case. Since we see some spiking activity of entanglement before actual events took place, we may suspect that, beyond monitoring, there is a predictive power of modelling time series from entanglement in past data (sliding windows).
Discussion and conclusions
In this work, we have revisited the notion of layer entanglement and extended it to coupled multilayer networks and temporal networks. To investigate entanglement, we have proposed a random generator for coupled multilayer networks, and generated a large set of synthetic ones. We have evaluated entanglement intensity and homogeneity in all cases, and compared to static and temporal real world networks.
Our analysis of the synthetic networks outlined that entanglement intensity is directly correlated with edge probability parameter—the sparser the network, the lower the intensity. This result indicates the proposed generator indeed emits networks which adhere to this property. We have also observed that large parts of the generated networks are subject to high homogeneity with various degrees of entanglement intensity.
Entanglement in the synthetic networks appears very sensitive to the different probabilities characterising the model (o, p, and eventually q for the coupled multilayer networks). The influence of each parameter should also be investigated theoretically in future work.
The high homogeneity observed may be a byproduct of our computations. First, our random generation induces a lot of small connected components of the coupled multilayer networks, and small components tend to show higher homogeneity since there are not so many degrees of freedom for edges to overlap. Because we are averaging the entanglement intensity and homogeneity over all components, this may go in favour of high homogeneity. Understanding this effect deserves more investigation. Second, entanglement homogeneity is a cosine measure, and the observed values may suffer from the skewness of cosine values when distributed in a linear space, amplifying the effect of having large values. Furthermore, it might also suffer from the curse of dimensionality in the case of a high number of layers. It would be worth considering normalizing this homogeneity with respect to the number of layers involved and the number of edges they cover. Instead of cosine, a Shannon’s entropy measure may overcome some of these limitations.
One of the aspects that was not extensively evaluated as a part of this work is the processing of the repeated links in a given time slice. The current implementation considers, for each time slice, the collection of unique links, which are not weighted by their possible multiple occurrences. This way, the diversity of connections is emphasized, instead of the link frequency. A more detailed study of how the links can be re-weighted will be considered in future work.
We further demonstrated that the two measures offer interesting insights when computed across a wide array of real-world networks. The observed relationship between the intensity and homogeneity of layer entanglement with the family of dataset was previously reported for clusters of documents [in Renoust et al. (2013, Figure 5)]. In this previous experiments, clusters of documents were mostly located at the left frontier of high intensity for a varying homogeneity. Our current experiments showed that real networks cluster based on their type (e.g. biological vs. social), also close to this frontier. We have observed (from Fig. 14) that the set of genetic networks tend to sit in areas with low entanglement intensity, which could correspond to lower edge probability p, but they also tend to show a wider span of entanglement homogeneity including our lowest values measured (from Fig. 13), which could correspond to lower layer assignment probability o. Further work should be invested on finding the reason why genetic networks tend to show lower homogeneity. This is opposed to social networks which tend to find their way in the higher probability area. This should be further investigated, but this may be related to homophily (McPherson et al. 2001; Borgatti et al. 2009). Homophily is the implied similarity of two entities in a social network, and the property of entities to agglomerate when being similar. If the reason of ‘being similar’ could be modelled as a layer of interaction, the result of a group of entities in ‘being similar’ would lead to the formation of a clique in this layer, hence locating social networks in high probability areas.
The proposed work offers at least two prospects of multiplex network study which are in our belief worth exploring further. The difference between the genetic and social networks is possibly subject to very distinct topologies which emerge in individual layers. This claim may further be investigated via other measurements, such as graphlets, communities or other structures. Next, genetic networks are less homogeneous. Future work includes exploration of this fact, as it can be merely a property of the networks considered, empirical methodology used to obtain the networks or some other effect.
We believe that theoretical properties of the proposed network generator can also be further studied, offering potential insights into how multiplex networks behave and whether the human-made aspects are indeed representative of a given system’s state. The model that we are currently exploring only takes into account a probability of linkage through (or within) layers without guarantee of connectivity. We made this choice to be able to compare between different fields, without prior assumption which could, for example, rule in favour of similarity to social network. Our future work will investigate other generation models including Erdős-Rónyi-based (Caimo and Gollini 2020) or other with preferential attachment (Nicosia et al. 2014).
The analysis of real-life temporal networks offers cues on evolution in layer entanglement which can happen prior to some other events. We have tested multiple time scales. Too small time windows mostly result in noisy time series carrying low amounts of useful information, while higher coarsening shows activity related to volume, but with a different light on the events that are captured. Future work will dive deeper into these events, and consider testing entanglement as a predictor using approaches such as of Prophet (Taylor and Letham 2018).
When considering entanglement as a either a monitoring or a predictive variable, its utility largely depends on the time scale at which a given edge stream needs to be considered. We leave extensive, possibly automatic determination of a setting where entanglement would be of practical relevance for future work. To study the parameters driving the dynamics of entanglement in temporal networks, we will consider comparing entanglement measures with synthetic temporal networks in our future investigations.
Availability of data and materials
The code for reproduction of experiments is freely available at https://gitlab.com/skblaz/entanglement-multiplex.
Notes
The networks are hosted at https://comunelab.fbk.eu/data.php
Abbreviations
- LIN:
-
Layer interaction Network
References
Battiston F, Nicosia V, Latora V (2014) Structural measures for multiplex networks. Phys Rev E 89(3):032804
Beck F, Burch M, Diehl S, Weiskopf D (2014) The state of the art in visualizing dynamic graphs. In: EuroVis (STARs). Citeseer
Borgatti SP, Mehra A, Brass DJ, Labianca G (2009) Network analysis in the social sciences. Science 323(5916):892–895
Burt RS, Schøtt T (1985) Relation contents in multiple networks. Soc Sci Res 14(4):287–308
Caimo A, Gollini I (2020) A multilayer exponential random graph modelling approach for weighted networks. Comput Stat Data Anal 142:106825
Cardillo A, Gómez-Gardenes J, Zanin M, Romance M, Papo D, Del Pozo F, Boccaletti S (2013) Emergence of network features from multiplexity. Sci Rep 3:1344
Chen BL, Hall DH, Chklovskii DB (2006) Wiring optimization can relate neuronal structure and function. Proc Natl Acad Sci 103(12):4723–4728
Chen X, Wang R, Tang M, Cai S, Stanley HE, Braunstein LA (2018) Suppressing epidemic spreading in multiplex networks with social-support. New J Phys 20(1):013007
Coleman J, Katz E, Menzel H (1957) The diffusion of an innovation among physicians. Sociometry 20(4):253–270
Costanzo M, Baryshnikova A, Bellay J, Kim Y, Spear ED, Sevier CS, Ding H, Koh JL, Toufighi K, Mostafavi S et al (2010) The genetic landscape of a cell. Science 327(5964):425–431
Cozzo E, Moreno Y (2016) Characterization of multiple topological scales in multiplex networks through supra-Laplacian eigengaps. Phys Rev E 94(5):052318
Cozzo E, Kivelä M, De Domenico M, Solé-Ribalta A, Arenas A, Gómez S, Porter MA, Moreno Y (2015) Structure of triadic relations in multiplex networks. New J Phys 17(7):073029
De Domenico M, Solé-Ribalta A, Gómez S, Arenas A (2014) Navigability of interconnected networks under random failures. Proc Natl Acad Sci 111(23):8351–8356
De Domenico M, Lancichinetti A, Arenas A, Rosvall M (2015a) Identifying modular flows on multilayer networks reveals highly overlapping organization in interconnected systems. Phys Rev X 5(1):011027
De Domenico M, Nicosia V, Arenas A, Latora V (2015b) Structural reducibility of multilayer networks. Nat Commun 6:6864
Gomez S, Diaz-Guilera A, Gomez-Gardenes J, Perez-Vicente CJ, Moreno Y, Arenas A (2013) Diffusion dynamics on multiplex networks. Phys Rev Lett 110(2):028701
Kapferer B (1972) Strategy and transaction in an African factory: African workers and Indian management in a Zambian Town. University Press, Manchester
Kivelä M, Arenas A, Barthelemy M, Gleeson JP, Moreno Y, Porter MA (2014) Multilayer networks. J Complex Netw 2(3):203–271
Kivelä M, McGee F, Melançon G, Henry Riche N, von Landesberger T (2019) Visual analytics of multilayer networks across disciplines (seminar 19061). Schloss Dagstuhl-Leibniz-Zentrum fuer Informatik
Krackhardt D (1987) Cognitive social structures. Soc Netw 9(2):109–134
Lazega E et al (2001) The collegial phenomenon: the social mechanisms of cooperation among peers in a corporate law partnership. Oxford University Press, Oxford
Magnani M, Micenkova B, Rossi L (2013) Combinatorial analysis of multiple networks. arXiv preprint arXiv:1303.4986
McPherson M, Smith-Lovin L, Cook JM (2001) Birds of a feather: homophily in social networks. Annu Rev Sociol 27(1):415–444
Mittal R, Bhatia M (2019) Analysis of multiplex social networks using nature-inspired algorithms. In: Nature-inspired algorithms for big data frameworks. IGI Global, pp 290–318
Nicosia V, Latora V (2015) Measuring and modeling correlations in multiplex networks. Phys Rev E 92(3):032805
Nicosia V, Bianconi G, Latora V, Barthelemy M (2014) Nonlinear growth and condensation in multiplex networks. Phys Rev E 90(4):042807
Omodei E, De Domenico MD, Arenas A (2015) Characterizing interactions in online social networks during exceptional events. Front Phys 3:59
Padgett JF, Ansell CK (1993) Robust action and the rise of the medici, 1400–1434. Am J Sociol 98(6):1259–1319
Renoust B, Melançon G, Viaud M.-L (2013) Measuring group cohesion in document collections. In: 2013 IEEE/WIC/ACM international joint conferences on web intelligence (WI) and intelligent agent technologies (IAT), vol 1. IEEE, pp 373–380
Renoust B, Melançon G, Viaud M-L (2014) Entanglement in multiplex networks: understanding group cohesion in homophily networks. In: Missaoui R, Sarr I (eds) Social network analysis. Springer, Berlin, pp 89–117
Renoust B, Melancon G, Munzner T (2015) Detangler: visual analytics for multiplex networks. Computer Graphics Forum 34(3):321–330
Renoust B, Kobayashi T, Ngo TD, Le D-D, Satoh S (2016a) When face-tracking meets social networks: a story of politics in news videos. Appl Netw Sci 1(1):4
Renoust B, Le D-D, Satoh S (2016b) Visual analytics of political networks from face-tracking of news video. IEEE Trans Multimedia 18(11):2184–2195
Sannino S, Stramaglia S, Lacasa L, Marinazzo D (2017) Visibility graphs for fMRI data: multiplex temporal graphs and their modulations across resting-state networks. Netw Neurosci 1(3):208–221
Škrlj B, Kralj J, Lavrač N (2019) Cbssd: Community-based semantic subgroup discovery. J Intell Inf Syst 53(2):265–304
Škrlj B, Renoust B (2019) Patterns of multiplex layer entanglement across real and synthetic networks. In: International conference on complex networks and their applications. Springer, pp 671–683
Stark C, Breitkreutz BJ, Reguly T, Boucher L, Breitkreutz A, Tyers M (2006) Biogrid: a general repository for interaction datasets. Nucleic Acids Res 34(suppl|–1):535–539
Taylor SJ, Letham B (2018) Forecasting at scale. Am Stat 72(1):37–45
Tejedor A, Longjas A, Foufoula-Georgiou E, Georgiou TT, Moreno Y (2018) Diffusion dynamics and optimal coupling in multiplex networks with directed layers. Phys Rev X 8(3):031071
Valdeolivas A, Tichit L, Navarro C, Perrin S, Odelin G, Levy N, Cau P, Remy E, Baudot A (2018) Random walk with restart on multiplex and heterogeneous biological networks. Bioinformatics 35(3):497–505
Vickers M, Chan S (1981) Representing classroom social structure. Victoria Institute of Secondary Education, Melbourne
Wang W, Cai M, Zheng M (2018) Social contagions on correlated multiplex networks. Physica A 499:121–128
Wasserman S, Faust K (1994) Social Network analysis, methods and applications. Structural analysis in the social sciences. Cambridge University Press, Cambridge. https://doi.org/10.1017/CBO9780511815478
Acknowledgements
We acknowledge Dagstuhl seminar 19061 where many ideas implemented in this paper emerged.
Funding
The work of the first author was funded by the Slovenian Research Agency through a young researcher grant. The work of other authors was supported by the Slovenian Research Agency (ARRS) core research programme Knowledge Technologies (P2-0103) and ARRS funded research project Semantic Data Mining for Linked Open Data (financed under the ERC Complementary Scheme, N2-0078).
Author information
Authors and Affiliations
Contributions
Both authors have contributed equally to the theoretical background, design of the experiments, and the writing of the manuscript. BR contributed to experiments, but the most of the experiments was handled by BS. Both authors read and approved the final manuscript.
Corresponding author
Ethics declarations
Competing interests
The authors declare that they have no competing interests.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Appendix
Appendix
Dependency in synthetic networks over nodes and layers
One can predict of course a level of dependency over the number of nodes n and layers m for the transition coupling case too. The dependency tends towards lower entanglement values since when increasing the number of nodes and layers, we increase the degree of freedom for layers to overlap. This trend, first illustrated in Fig. 8, is confirmed in Fig. 19.
Independence of parameters
The distribution of parameters of transition coupling intensity and homogeneity over parameter p, and elementary layer intensity and homogeneity over parameter q, show no dependency as illustrated in Fig. 20.
Combining both elementary and transition coupling
As we mentioned in “Transition coupling entanglement” section, one can compute entanglement over all the network, combining elementary layers and transition coupling (as illustrated in Fig. 21). Although we have not identified practical use cases for this entanglement (often both categories of layers tell a different story), we report here the results over our synthetic networks in Figs. 22, 23, and 24. As expected we may observe a strong dependency over both p and q parameters combined (Fig. 24). Note that the current generator does not forbid the creation of loops enabling overlap between elementary layer and transition coupling. A generation of transition coupling edges that would connect different nodes between layers would create even more overlap between elementary layers and transition coupling. Such a parameter is actually available in the proposed code, but beyond the scope of this paper.
Choosing the right size of time window
Choosing the right size of time-window fundamentally depends on the dataset we observe. We report all variations of fixed time window coarsening we have explored, among 1h, 3h, 6h, and 12h-long windows for each of the MLKing2013 (Fig. 25), MoscowAthletics2013 (Fig. 26), and Cannes2013 (Fig. 27) events. Too fine selection displays a lot of noise, too coarse eludes most of the content.
Correlation analysis
For completeness, we also computed the correlation between the occurrence of a given pair of nodes, as discussed in Nicosia and Latora (2015). As a measure of correlations of layer activity, we have computed the pairwise multiplexity. Following the original notation, we computed, for each synthetic network:
where \(\alpha \in L\) and \(\beta \in L\) are two distinct layers. Here, \(b_i \in [0,1]\) represents the presence of a given node i, hence, the product equals zero if a given node is not co-present on both considered layers. Given the large space of synthetic multiplex networks, it is not sensible to analyse individual ones, as performed in Nicosia et al. (2014), hence we further computed:
i.e., the distribution of all possible pairwise correlations. In Fig. 28, we present the statistical properties of this distribution further segmented according to the number of layers.
We further compare this distribution with each parameter of our generator in Fig. 29. We first could expect some level of relationship with the number of layers and the node layer probability. This is confirmed, and we can observe a direct dependency with the number of layers m, and the node-layer assignment probability o. The relationship with the number of layers slowly decreases with a minimum of 5 layers, since with more layers, there are more degrees of freedom for nodes to be assigned on layers. The relationship with o is almost linear. There is almost no dependency with the number of nodes (once enough nodes are assigned). The inner-layer edge probability p does not influence this measure, although we may observe a sudden increase first, it is an artefact of our algorithm because nodes with degree zero in a layer are discarded from the layer to speed up computations. The transition coupling edge probability q shows no correlation because this algorithmic artefact does not apply to it.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Škrlj, B., Renoust, B. Layer entanglement in multiplex, temporal multiplex, and coupled multilayer networks. Appl Netw Sci 5, 89 (2020). https://doi.org/10.1007/s41109-020-00331-w
Received:
Accepted:
Published:
DOI: https://doi.org/10.1007/s41109-020-00331-w