
Convergence properties of optimal transport-based temporal hypergraphs


We present a method to extract temporal hypergraphs from sequences of 2-dimensional functions obtained as solutions to Optimal Transport problems. We investigate optimality principles exhibited by these solutions from the point of view of hypergraph structures. Discrete properties follow patterns that differ from those characterizing their continuous counterparts. Analyzing these patterns can bring new insights into the studied transportation principles. We also compare these higher-order structures to their network counterparts in terms of standard graph properties. We give evidence that some transportation schemes might benefit from hypernetwork representations. We demonstrate our method on real data by analyzing the properties of hypernetworks extracted from images of real systems.


Optimal Transport (OT) is a principled theory to compare probability distributions (Kantorovich 1942; Villani 2009; Santambrogio 2015; Peyré et al. 2019). Although this task is usually framed as an optimization problem, recent studies have mapped it within the framework of dynamic partial differential equations (Evans and Gangbo 1999; Facca et al. 2018, 2020, 2021; Tero et al. 2007, 2010). In this context, solutions to a transportation problem are often found as the convergent state of evolving families of functions.

In some scenarios, the steady states of these evolving families are supported in network-shaped structures (Xia 2003, 2014, 2015). Recently, this fact has called the attention of network scientists and graph theorists leading to the development of methods that convert the solutions of OT problems into actual graph structures (Baptista et al. 2020; Leite and De Bacco 2022). This has broadened the available set of tools to understand and solve these transportation problems. Recent studies have shown that common patterns can be unveiled in both the original mathematical setting and in the converted graph structures (Baptista and De Bacco 2021b).

Representations of these functions as sets of dyadic relations have proven meaningful in various applications (Baptista and De Bacco 2021a; Facca et al. 2021). Nonetheless, traditional dyadic representations may be limited in representing flows of quantities like mass or information as observed in real systems. Systems where interactions involve three or more individuals are observed in applications such as social contagion (de Arruda et al. 2020; Chowdhary et al. 2021), random walks (Carletti et al. 2020; Schaub et al. 2020) or non-linear consensus (Neuhäuser et al. 2020). Understanding the relation between the structure of higher-order systems and the dynamics taking place on them is an active field of research (Taylor et al. 2015; Patania et al. 2017). For instance, key elements controlling dynamics are linked to the heterogeneity of the hyperedge sizes present in their higher-order representations (Patania et al. 2017). These systems are hence best described by hypergraphs, generalizations of networks that encode structured relations among any number of individuals. With this in mind, a natural question to ask is how OT-based structures perform in terms of higher-order representations.

To help bridge this knowledge gap about higher-order properties of structures derived from OT solutions, we elaborate on the results observed in Baptista and De Bacco (2021b). Specifically, we propose a method to convert the families of 2-dimensional functions into temporal hypernetworks. We enrich the existing network structures associated with these functions by encoding the observed interactions into hyperedges. We study classic hypergraph properties and compare them to the predefined cost functional linked to the transportation problems. Finally, we extend this method and the analysis to study systems coming from real data. We build hypergraph representations of P. polycephalum (Westendorf et al. 2016) and analyze their topological features.


The Dynamical Monge-Kantorovich method

The Dynamical Monge-Kantorovich set of equations

We start by reviewing the basic elements of the mechanism chosen to solve the OT problems. As opposed to other standard optimization methods used to solve this problem (Cuturi 2013), we use an approach that turns it into a dynamical set of partial differential equations. In this way, initial conditions are updated until a convergent state is reached. The dynamical system of equations, as proposed by Facca et al. (2018, 2020, 2021), is presented as follows. We assume that the OT problem is set on a continuous 2-dimensional space \(\Omega \subset {\mathbb {R}}^{2}\), and at the beginning, no underlying network structure is observed. This gives us the freedom of exploring the whole space to design an optimal network topology, solution of the transportation problem. The main quantities that need to be specified in input are source and target distributions. We refer to them as sources and sinks, where a certain mass (e.g. passengers in a transportation network, water in a water distribution network) is injected and then extracted. We denote these with a “forcing” function \(f(x)=f^+(x)-f^-(x)\in {\mathbb {R}}\), describing the flow-generating sources \(f^+(x)\) and sinks \(f^-(x)\). To ensure mass balance we impose \(\int _\Omega f(x)dx = 0\). We assume that the flow is governed by a transient Fick–Poiseuille flux \(q=- \mu {\nabla }u\), where \(\mu\), u and q are called conductivity (or transport density), transport potential and flux, respectively. Intuitively, mass is injected through the source, moved based on the conductivity across space, and then extracted through the sink. The way mass moves determines a flux that depends on the pressure exerted on the different points in space; this pressure is described by a potential function.

The set of Dynamical Monge–Kantorovich (DMK) equations is given by:

$$\begin{aligned} -\nabla \cdot (\mu (t,x)\nabla u(t,x))&= f^+(x)-f^-(x), \end{aligned}\tag{1}$$
$$\begin{aligned} \frac{\partial \mu (t,x)}{\partial t}&= \left[ \mu (t,x)\,|\nabla u(t,x)|\right] ^{\beta } - \mu (t,x), \end{aligned}\tag{2}$$
$$\begin{aligned} \mu (0,x)&= \mu _0(x) > 0 , \end{aligned}\tag{3}$$

where \(\nabla =\nabla _{x}\). Equation (1) states the spatial balance of the Fick–Poiseuille flux and is complemented by no-flow Neumann boundary conditions. Equation (2) enforces the dynamics of this system, and it is controlled by the so-called traffic rate \(\beta\). It determines the transportation scheme, and it shapes the topology of the solution: for \(\beta <1\) we have congested transportation where traffic is minimized, whereas \(\beta >1\) induces branched transportation where traffic is consolidated into a smaller amount of space. The case \(\beta =1\) recovers shortest path-like structures. Finally, Eq. (3) constitutes the initialization of the system and can be thought of as an initial guess of the solution.
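To make the role of the update rule concrete, the following is a minimal sketch of a graph analog of Eqs. (1)–(2), not the continuous solver of the cited works. It assumes unit edge lengths, an explicit Euler step of size `dt`, and a toy three-node graph with a direct route and a two-hop detour; the helper names `solve_linear` and `dmk_step` are hypothetical.

```python
def solve_linear(A, b):
    """Solve A x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    M = [row[:] + [b[i]] for i, row in enumerate(A)]  # augmented matrix
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(M[r][col]))
        M[col], M[pivot] = M[pivot], M[col]
        for r in range(col + 1, n):
            factor = M[r][col] / M[col][col]
            for c in range(col, n + 1):
                M[r][c] -= factor * M[col][c]
    x = [0.0] * n
    for i in reversed(range(n)):
        s = sum(M[i][j] * x[j] for j in range(i + 1, n))
        x[i] = (M[i][n] - s) / M[i][i]
    return x


def dmk_step(edges, mu, f, beta, dt):
    """One explicit Euler step of a graph analog of Eqs. (1)-(2)."""
    n = len(f)
    # Weighted graph Laplacian: discrete counterpart of -div(mu grad u).
    lap = [[0.0] * n for _ in range(n)]
    for (i, j), m in zip(edges, mu):
        lap[i][i] += m
        lap[j][j] += m
        lap[i][j] -= m
        lap[j][i] -= m
    # The potential is defined up to a constant, so we fix u[0] = 0.
    reduced = [row[1:] for row in lap[1:]]
    u = [0.0] + solve_linear(reduced, f[1:])
    # Flux magnitude on each edge (unit lengths assumed).
    flux = [m * abs(u[i] - u[j]) for (i, j), m in zip(edges, mu)]
    # Conductivity update: d(mu)/dt = flux**beta - mu.
    return [m + dt * (q ** beta - m) for m, q in zip(mu, flux)]


edges = [(0, 2), (0, 1), (1, 2)]  # direct route, then a two-hop detour
f = [1.0, 0.0, -1.0]              # unit source at node 0, unit sink at node 2
mu = [1.0, 1.0, 1.0]              # uniform initial guess, as in Eq. (3)
for _ in range(300):
    mu = dmk_step(edges, mu, f, beta=1.5, dt=0.1)
```

In this toy setting with \(\beta =1.5\), the conductivity of the detour decays while that of the direct edge approaches 1, which illustrates the consolidation of traffic described above for \(\beta >1\).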

Solutions \((\mu ^*, u^*)\) of Eqs. (1)–(3) minimize the transportation cost function \({\mathcal {L}}(\mu ,u)\) (Facca et al. 2018, 2020, 2021), defined as:

$$\begin{aligned}&{\mathcal {L}}(\mu ,u) := {\mathcal {E}}(\mu ,u)+ {\mathcal {M}}(\mu ,u) \end{aligned}\tag{4}$$
$$\begin{aligned}&{\mathcal {E}}(\mu ,u) := \dfrac{1}{2}\int _{\Omega } \mu |{\nabla }u|^2 dx, \ \ {\mathcal {M}}(\mu ,u) := \dfrac{1}{2}\int _{\Omega } \dfrac{\mu ^{\frac{(2-\beta )}{\beta }}}{2-\beta } dx. \end{aligned}\tag{5}$$

\({\mathcal {L}}\) can be thought of as a combination of \({\mathcal {E}}\), the total energy dissipated during transport (or network operating cost), and \({\mathcal {M}}\), the cost to build the network infrastructure (or infrastructural cost). It is known that this functional’s convexity changes as a function of \(\beta\). Non-convex cases arise in the branched schemes, inducing fractal-like structures (Facca et al. 2021; Santambrogio 2007). This is the case that we consider in this work, and it is the only one where meaningful network structures, and thus hypergraphs, can be extracted (Baptista et al. 2020).

Hypergraph sequences

Hypergraph construction

We define a hypergraph (also, hypernetwork) as follows (Battiston et al. 2020): a hypergraph is a tuple \(H = (V, E),\) where \(V = \{v_1, ... ,v_n\}\) is the set of vertices and \(E = \{ e_1, e_2, ... , e_m\}\) is the set of hyperedges in which \(e_i\subset V, \forall i = 1,...,m,\) and \(|e_i|>1\). If \(|e_i|=2,\forall i\) then H is simply a graph. We call edges those hyperedges \(e_i\) with \(|e_i|=2\) and triangles, those with \(|e_i|=3\). We refer to the 1-skeleton of H as the clique expansion of H. This is the graph \(G=(V,E_{G})\) made of the vertices V of H, and of the pairwise edges built considering all the possible combinations of pairs that can be built from each set of nodes defining each hyperedge in E.
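As an illustration of the clique expansion just defined, here is a minimal sketch that assumes hyperedges are given as Python sets of node labels; the function name `clique_expansion` is ours and does not come from the cited works.

```python
from itertools import combinations


def clique_expansion(hyperedges):
    """Return the pairwise edges of the 1-skeleton as a set of frozensets."""
    return {
        frozenset(pair)
        for e in hyperedges
        for pair in combinations(e, 2)  # all pairs of nodes inside e
    }


# One triangle and one edge: the triangle contributes its three sides.
skeleton = clique_expansion([{0, 1, 2}, {2, 3}])
```

Storing edges as frozensets makes the result independent of node order, so a pair generated by several hyperedges is counted once.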

Let \(\mu\) be the conductivity found as a solution of Eqs. (1)–(3). As previously mentioned, \(\mu\) at convergence regulates where the mass should travel for optimal transportation. Similar to Baptista and De Bacco (2021b), we turn this 2-dimensional function into a different data structure, namely, a hypergraph. This is done as follows: consider \(G(\mu ) = (V_G,E_G)\), the network extracted using the method proposed in Baptista et al. (2020). We define \(H(\mu )\) as the tuple \((V_H,E_H)\) where \(V_H = V_G\) and \(E_H = E_G \cup T_G,\) s.t., \(T_G = \{(u,v,w): (u,v),(v,w),(w,u) \in E_G\}.\) In words, \(H(\mu )\) is the graph \(G(\mu )\) together with all of its triangles. This choice is motivated by the fact that the graph-extraction method proposed in Baptista et al. (2020) uses triangles to discretize the continuous space \(\Omega\), which can have a relevant impact on the extracted graph or hypergraph structures. Hence, triangles are the natural sub-structure for hypergraph constructions. The method proposed in this work is valid for higher-order structures beyond triangles. Exploring how these additional structures impact the properties of the resulting hypergraphs is left for future work.
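The construction \(E_H = E_G \cup T_G\) can be sketched as follows. This is only an illustration that assumes the graph \(G(\mu )\) has already been extracted and is given as a node list and an edge list; the extraction step itself is not reproduced, and `build_hypergraph` is a hypothetical name.

```python
def build_hypergraph(nodes, edges):
    """Return (V_H, E_H): the input graph plus all of its triangles."""
    adj = {v: set() for v in nodes}
    for u, v in edges:
        adj[u].add(v)
        adj[v].add(u)
    pairwise = {frozenset(e) for e in edges}
    # A triangle {u, v, w} exists when w is a common neighbor of an edge (u, v).
    triangles = {
        frozenset((u, v, w))
        for u, v in edges
        for w in adj[u] & adj[v]
    }
    return set(nodes), pairwise | triangles


# Toy graph: a triangle 0-1-2 with a pendant edge 2-3.
V_H, E_H = build_hypergraph([0, 1, 2, 3], [(0, 1), (1, 2), (2, 0), (2, 3)])
```

For the toy graph, `E_H` contains the four original edges and the single triangle on nodes 0, 1 and 2.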

Fig. 1

Hypernetwork construction. Higher order structures are built using edges and triangles as hyperedges. The leftmost panel shows one of the studied graphs together with the triangles (in red) used. The subsequent panels highlight different clusters of triangles that can be seen in the main hypergraph

Figure 1 shows an example of one of the studied hypergraphs. The red shapes represent the different triangles of \(H(\mu )\). Notice that, although we consider here the case where \(|e|\le 3\) for each hyperedge e—for the sake of simplicity—higher-order structures are also well represented by the union of these elements, as shown in the right panels of the figure.

Since this hypergraph construction method is valid for any 2-dimensional transport density, we can extract a hypergraph not only from the convergent \(\mu\) but also at any time step before convergence. This then allows us to represent optimal transport sequences as hypergraphs evolving in time, i.e. temporal hypernetworks.

Fig. 2

Temporal hypergraphs. Top row: different timestamps of the sequence \(\{\mu _t\}\); triangles are a discretization of \([0,1]^2\). Bottom row: hypergraphs extracted for \(\mu _t\) at the time steps displayed on the top row; triangles are highlighted in red. In both rows, filled and empty circles correspond to the support of \(f^+\) and \(f^-\), i.e. sources and sinks, respectively. This sequence is obtained for \(\beta = 1.5\)

Hypergraph sequences

Formally, let \(\mu (t,x)\) be a transport density (or conductivity) function of both time and space obtained as a solution of the DMK model. We denote it as the sequence \(\{\mu _t\}_{t=0}^T\), for some index T (usually taken to be that of the convergent state). Each \(\mu _{t}\) is the t-th update of our initial guess \(\mu _0\), computed by following the rules described in Eqs. (1)–(3). This determines a sequence of hypernetworks \(\{ H(\mu _t)\}_{t=0}^T\) extracted from \(\{\mu _t\}_{t=0}^T\) with the extraction method proposed in Baptista et al. (2020). Figure 2 shows three hypergraphs built from one of the studied sequences \(\{\mu _t\}\) using this method at different time steps. The corresponding OT problem is that defined by the (filled and empty) circles: mass is injected in the bottom left circle and must be extracted at the highlighted destinations. On the top row, different updates (namely, \(t=12, 18, 26\)) of the solution are shown. They are defined on a discretization of \([0,1]^2.\) The darkest colors represent their support. Hypergraphs extracted from these functions are displayed in the bottom row. As can be seen, only edges (in gray) and triangles (in red) are considered as part of \(H(\mu _t)\). Notice that the larger t is, the less dense the hypergraphs are, which is expected for a uniform initial distribution \(\mu _0\) and branched OT (\(\beta >1\)) (Facca et al. 2021).

Graph and hypergraph properties

We compare hypergraph sequences to their corresponding network counterparts (defined as described in the previous paragraph). We analyze the following main network and hypergraph properties for the different elements in the sequences and for different sequences. Denote with \(G = (V_G,E_G)\) and \(H = (V_H, E_H)\) one of the studied graphs and hypergraphs belonging to some sequence \(\{ G(\mu _t)\}_{t=0}^T\) and \(\{ H(\mu _t)\}_{t=0}^T\), respectively. We consider the following network properties:

  1. \(|E_G|\), total number of edges;

  2. Average degree d(G), the mean number of neighbors per node;

  3. Average closeness centrality c(G): let \(v\in V_G\), the closeness centrality of \(v\) is defined as \(\sum _{u\in V_G, u\ne v} 1/d(u,v),\) where \(d(u,v)\) is the shortest path distance between \(u\) and \(v\).

    Fig. 3

    Adjacency matrices and line graphs. Top: generalized node s-adjacency matrices for different values of s from a given toy graph G. Bottom, from left to right: reference network G, and s-line graphs for \(s=2,3,\) and 4
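The graph properties listed above can be sketched in a few lines. This is an illustrative implementation under two assumptions that the text does not state: distances are hop counts (unweighted edges), and unreachable nodes contribute zero to the sum. Function names are hypothetical.

```python
from collections import deque


def average_degree(nodes, edges):
    """Mean number of neighbors per node (each edge counts for two nodes)."""
    return 2 * len(edges) / len(nodes)


def closeness(nodes, edges, v):
    """Sum of 1/d(u, v) over nodes u != v reachable from v, d = hop distance."""
    adj = {x: set() for x in nodes}
    for a, b in edges:
        adj[a].add(b)
        adj[b].add(a)
    dist = {v: 0}
    queue = deque([v])
    while queue:  # breadth-first search from v
        x = queue.popleft()
        for y in adj[x]:
            if y not in dist:
                dist[y] = dist[x] + 1
                queue.append(y)
    return sum(1 / d for u, d in dist.items() if u != v)


def average_closeness(nodes, edges):
    return sum(closeness(nodes, edges, v) for v in nodes) / len(nodes)


# Path graph 0 - 1 - 2.
nodes, edges = [0, 1, 2], [(0, 1), (1, 2)]
c_mid = closeness(nodes, edges, 1)  # 1/1 + 1/1
c_end = closeness(nodes, edges, 0)  # 1/1 + 1/2
```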

Hypernetwork properties can be easily adapted from the previous definitions with the help of generalized adjacency matrices and line graphs (Aksoy et al. 2020). Let H be a hypergraph with vertex set \(V = \{1,..,n\}\) and edge set \(E = \{e_1, ... ,e_m\}\). We define the generalized node s-adjacency matrix \(A_s\) of H as the binary matrix of size \(n\times n\), s.t., \(A_s[i][j]=1\) if i and j are part of at least s shared hyperedges; \(A_s[i][j]=0,\) otherwise. We define the s-line graph \(L_s\) as the graph generated by the adjacency matrix \(A_s\). Notice that \(A_1\) corresponds to the adjacency matrix of H’s skeleton (which is \(L_1\)). Figure 3 shows a family of adjacency matrices together with the line graphs generated using them. We can then define hypergraph properties in the following way:

  1. \(|E_H|\), total number of hyperedges;

  2. \(|T| = |\{e \in E_H: |e|= 3\}|,\) total number of triangles;

  3. \(S = \sum _{t\in T} a(t),\) covered area, where a(t) is the area of the triangle t;

  4. Average degree \(d_s(H)\), the mean number of incident hyperedges of size greater than or equal to s per node;

  5. Average closeness centrality \(c_s(H)\): let \(v\in V_H\), the closeness centrality of \(v\) is defined as its closeness centrality in \(L_s\).
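A minimal sketch of the s-adjacency matrix, the s-degree and the covered area follows. It assumes nodes are labeled \(0,\dots ,n-1\), hyperedges are Python sets, and node coordinates are available in a dictionary `coords`; these data structures and all function names are illustrative choices, not those of the cited works.

```python
from itertools import combinations


def s_adjacency(n, hyperedges, s):
    """A_s[i][j] = 1 if distinct nodes i, j share at least s hyperedges."""
    shared = [[0] * n for _ in range(n)]
    for e in hyperedges:
        for i, j in combinations(sorted(e), 2):
            shared[i][j] += 1
            shared[j][i] += 1
    return [[int(shared[i][j] >= s) for j in range(n)] for i in range(n)]


def average_s_degree(n, hyperedges, s):
    """Mean number of incident hyperedges of size >= s per node."""
    # Each such hyperedge e is incident to exactly len(e) nodes.
    return sum(len(e) for e in hyperedges if len(e) >= s) / n


def triangle_area(p, q, r):
    """Area of a planar triangle from its vertex coordinates."""
    return abs((q[0] - p[0]) * (r[1] - p[1]) - (r[0] - p[0]) * (q[1] - p[1])) / 2


def covered_area(hyperedges, coords):
    """S: total area of the hyperedges of size 3."""
    triangles = [sorted(e) for e in hyperedges if len(e) == 3]
    return sum(triangle_area(*(coords[v] for v in t)) for t in triangles)


coords = {0: (0.0, 0.0), 1: (1.0, 0.0), 2: (0.0, 1.0), 3: (1.0, 1.0)}
hyperedges = [{0, 1}, {1, 2}, {0, 2}, {2, 3}, {0, 1, 2}]
A1 = s_adjacency(4, hyperedges, 1)
A2 = s_adjacency(4, hyperedges, 2)
S = covered_area(hyperedges, coords)
```

In this toy hypergraph, nodes 2 and 3 share a single hyperedge, so they are adjacent in \(A_1\) but not in \(A_2\), whereas any two nodes of the triangle share both an edge and the triangle. The closeness \(c_s(H)\) can then be obtained by running any graph closeness routine on the graph defined by \(A_s\).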

Fig. 4

Graph and Hypergraph properties. Top row: optimal hypernetworks obtained with different traffic rates. Center and bottom rows: degree distributions and closeness distributions for the hypernetworks shown on the top row, and their 1-skeletons. The node labels in the x-axis of the center and bottom rows are sorted by their degree of centrality values

S can be defined in terms of any other property of a hyperedge, e.g. a function of its size |e|. Here we consider the area covered by a hyperedge to keep a geometrical perspective. This area S can also be easily generalized to hyperedges with \(|e_{i}|>3\) by suitably changing the set T in the summation, e.g. by considering structures containing four nodes. As for the centrality measures, we focus on comparing the case \(s>1\) against \(s=1\), as the latter traces back to standard graph properties and we are interested instead in investigating which properties are inherent to hypergraphs. Figure 4 shows values of \(d_s(H)\) and \(c_s(H)\) for convergent hypergraphs H (obtained from different values of \(\beta\)) together with the degree and closeness centrality of their corresponding graph versions. The considered hypergraphs are displayed in the top row of the figure. As can be seen in the figure, patterns differ considerably for different values of \(\beta\). As s controls the minimum number of shared connections for different nodes in the networks, the higher this number, the more restrictive this condition becomes, thus leading to more disconnected line graphs. In the case of the s-degree centrality, we observe decreasing values for increasing s, with nodes with the highest centrality having much higher values than less central nodes. For both \(s=2,3\) we observe higher values than for nodes in G. This follows from the fact that once hyperedges are added to G, the number of incidences per node can only increase. Centrality distributions strongly depend on \(\beta\). For small values, i.e. more distributed traffic (\(\beta =1.1\)), the number of hyperedges per node remains larger than the number of regular edges connected to it. But if traffic is consolidated on less space (\(\beta =1.9\)), then very few hyperedges are found. This suggests that the information learned from hypergraphs that is distinct from that contained in the graph skeleton is influenced by the chosen traffic regime.

As for the closeness centrality distribution, this resembles that of G for small values of \(\beta\), regardless of s. For higher \(\beta\) it switches towards an almost binary signal. Thus, nodes tend to become more central as \(\beta\) increases, suggesting that adding hyperedges to networks G leads to shorter distances between nodes. The loss of information seen for the highest values of s is due to the fact that the line graphs \(L_s\) become disconnected with many small connected components. In these cases, the closeness centrality of a node is either 0 if it is isolated, or proportional to the diameter of the small connected component in which it lives.

Convergence criteria

Numerical convergence of the DMK Eqs. (1)–(3) is usually defined by fixing a threshold \(\tau\). The updates are stopped once the cost associated with them changes by at most \(\tau\) with respect to that of the previous time step. As is usually the case when this threshold is very small (\(\tau =10^{-12}\) in our experiments), the cost or the network structure may settle to a constant value earlier than algorithmic convergence. Similar to Baptista and De Bacco (2021b), to meaningfully establish when hypergraph optimality is reached, we consider as convergence time the first time step when the transport cost, or a given network property, reaches a value that is smaller than or equal to a certain factor p times the value reached by the same quantity at algorithmic convergence (in the experiments here we use \(p=1.05\)). We refer to \(t_{\mathcal {L}}\) and \(t_P\) as the convergence times in terms of the cost function and of a network property, respectively.
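The convergence-time criterion can be written compactly. The sketch below assumes the monitored quantity (cost or network property) is stored as a list of positive values indexed by time step, with the last entry being the value at algorithmic convergence; `convergence_time` is a hypothetical helper name.

```python
def convergence_time(values, p=1.05):
    """First time step t with values[t] <= p * values[-1]."""
    target = p * values[-1]
    for t, v in enumerate(values):
        if v <= target:
            return t
    return len(values) - 1  # not reached when values[-1] is positive


# Toy trajectory of a decreasing quantity; the final value is 2.0.
cost = [10.0, 5.0, 2.08, 2.02, 2.0]
t_conv = convergence_time(cost, p=1.05)  # threshold is 1.05 * 2.0
```

Applying the same function to the cost sequence and to a property sequence gives \(t_{\mathcal {L}}\) and \(t_P\), which can then be compared directly.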


To test the properties presented in the previous section and understand their connection to transportation optimality, we synthetically generate a set of optimal transport problems, determined by the configuration of sources and sinks. As done in Baptista and De Bacco (2021b), we fix a source’s location and sample several points in the set \([0,1]^2\) to be used as sinks’ locations. Let \(S = \{s_0,s_1,...,s_M\}\) be the set of locations in the space \([0,1]^2,\) and fix a positive number \(r>0\). We define the distributions \(f^+\) and \(f^-\) as \(f^+(x) \propto \mathbbm {1}_{R_0}(x),\) and \(f^-(x) \propto \sum _{i>0} \mathbbm {1}_{R_i}(x),\) where \(\mathbbm {1}_{R_i}(x) := 1,\) if \(x\in R_i\), and \(\mathbbm {1}_{R_i}(x) := 0\), otherwise; \(R_i = C(s_i,r)\) is the circle of center \(s_i\) and radius r. The value of r is chosen based on the discretization used, and the centers of the sinks are sampled uniformly at random. The symbol \(\propto\) stands for proportionality and is used to ensure that \(f^+\) and \(f^-\) are both probability distributions. The transportation cost is that of Eq. (4).
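As an illustration of these definitions, the sketch below evaluates \(f^+\) and \(f^-\) on a set of grid points and normalizes each so that its values sum to one. The uniform grid, the discrete (sum-based) normalization, the example sink locations and the function names are assumptions made for this example only.

```python
def disc_indicator(x, center, r):
    """1 if x lies in the disc of the given center and radius r, else 0."""
    inside = (x[0] - center[0]) ** 2 + (x[1] - center[1]) ** 2 <= r ** 2
    return 1.0 if inside else 0.0


def forcing(points, source, sinks, r):
    """Discrete f+ and f-, each normalized to sum to one over the points."""
    f_plus = [disc_indicator(x, source, r) for x in points]
    f_minus = [sum(disc_indicator(x, c, r) for c in sinks) for x in points]
    total_plus, total_minus = sum(f_plus), sum(f_minus)
    if total_plus == 0 or total_minus == 0:
        raise ValueError("r is too small for this discretization")
    f_plus = [v / total_plus for v in f_plus]
    f_minus = [v / total_minus for v in f_minus]
    return f_plus, f_minus


n = 21  # grid points per side, spacing 0.05
points = [(i / (n - 1), j / (n - 1)) for i in range(n) for j in range(n)]
f_plus, f_minus = forcing(points, (0.0, 0.0), [(0.5, 0.5), (1.0, 0.25)], r=0.07)
f = [a - b for a, b in zip(f_plus, f_minus)]  # forcing term f = f+ - f-
```

Because both parts are normalized to the same total mass, the discrete forcing sums to zero, which mirrors the mass balance condition \(\int _\Omega f(x)dx = 0\).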

Synthetic OT problems

The set of transportation problems considered in our experiments consists of 100 source-sink configurations. We place the source at \(s_0=(0,0)\) (i.e. the support of \(f^+\) is centered at (0, 0)), and sample \(M=15\) points \(s_1,s_2,...,s_M\) uniformly at random from a regular grid. By sampling them from the nodes of the grid, we ensure that two different locations are at a safe distance so they are considered different once the space is discretized. We initialize \(\mu _0(x)=1, \forall x\) to be a uniform distribution on \([0,1]^2\). This can be interpreted as a non-informative initial guess for the solution. Starting from \(\mu _0,\) we compute a maximum of 300 updates. Depending on the chosen traffic rate \(\beta\), more or fewer iterations may be needed. We say that the sequence \(\{\mu _t\}_{t=0}^T\) converges to a certain function \(\mu ^*\) at iteration T if either \(|\mu _T-\mu _{T-1} |<\tau ,\) for a tolerance \(\tau \in (0,1],\) or T reaches the mentioned maximum. For the experiments reported in this manuscript, the tolerance \(\tau\) is set to \(10^{-12}\). Given the dependence of the solution on the traffic constraints, a wide range of values of \(\beta\) is considered. Namely, we study solutions obtained from low traffic cases (\(\beta =1.1\), and thus, less traffic penalization) to large ones (\(\beta =1.9\)), all of them generating branched transportation schemes. Our 100 problems are linked to a total of 900 hypergraph sequences, each of them containing between 50 and 80 higher-order structures.
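The sampling of sink locations can be sketched as follows. The grid resolution (`grid_size`), the seed handling and the function name are assumptions for illustration; the text only specifies that 15 sinks are drawn uniformly at random from the nodes of a regular grid, with the source at (0, 0).

```python
import random


def sample_sinks(grid_size, n_sinks, source=(0.0, 0.0), seed=None):
    """Draw distinct sink locations from a regular grid on [0, 1]^2."""
    rng = random.Random(seed)
    step = 1.0 / (grid_size - 1)
    grid = [(i * step, j * step)
            for i in range(grid_size) for j in range(grid_size)]
    # Exclude the source node so that no sink coincides with it.
    candidates = [p for p in grid if p != source]
    # Sampling without replacement keeps all sinks on different grid nodes.
    return rng.sample(candidates, n_sinks)


sinks = sample_sinks(grid_size=21, n_sinks=15, seed=0)
```

Sampling without replacement from grid nodes is what guarantees that two sinks never collapse onto the same location after discretization.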

Convergence: transport cost vs hypernetwork properties

Fig. 5

Covered area and Lyapunov cost. Mean (markers) and standard deviations (shades around the markers) of the covered area S (top plots) and of the Lyapunov cost, energy dissipation \({\mathcal {E}}\) and structural cost \({\mathcal {M}}\) (bottom plots), as functions of time t. Means and standard deviations are computed on the set described in Paragraph Synthetic OT problems. From left to right: \(\beta =1.2, 1.5\) and 1.8. Red and blue lines denote \(t_P\) and \(t_{\mathcal {L}}\)

As presented in Baptista and De Bacco (2021b), we show a comparison between hypernetwork properties and the cost function minimized by the dynamics, where convergence times are highlighted (Fig. 5). We focus on the property S, the area of the surface covered by the triangles in H. This quantity is influenced by both the number of triangles (hence of hyperedges) and their distribution in space. Hence, it is a good proxy for how hypergraph properties change both in terms of iteration time and as we tune \(\beta\). We observe that \(t_P>t_{\mathcal {L}}\) in all the cases, i.e. convergence in terms of transportation cost is reached earlier than the convergence of the topological property. Similar behaviors are seen for other values of \(\beta \in [1.1,1.9]\) and other network properties (see “Appendix”). Similar to DMK-based network properties, the covered area’s decay is faster for the smallest values of \(\beta\). This is expected, given the convexity properties of \({\mathcal {L}}\) (Facca et al. 2018, 2020, 2021). However, the transport cost decays even faster, so that the value of S is still far away from convergence in the congested transportation case (small \(\beta\)).

Notice that S remains stable after the first few iterations, and then it starts decreasing at different rates (depending on \(\beta\)) until reaching the converged value. This suggests that the dynamics tend to develop thick branches, covering a large area, at the beginning of the evolution, and then slowly compress them until reaching the optimal topologies.

These different convergence rates for S and \({\mathcal {L}}\) may prevent construction of converged hypernetwork topologies: if the solver is stopped at \(t_{\mathcal {L}}< t_{P}\), the resulting hypergraphs \(H(\mu _t), \ t=t_{\mathcal {L}}\) would mistakenly cover a surface larger than that covered by the convergent counterpart (\(H(\mu _t),\) for \(t\ge t_P\)). This scenario is less impactful for larger values of \(\beta\), although in these scenarios H is much more similar to a regular graph, because of the small number of higher-order structures. Topological differences between converged hypernetworks can be seen in Fig. 4.

Finally, we observe that both \(t_{\mathcal {L}}(\beta )\) and \(t_P(\beta )\) are increasing functions of \(\beta\). This is expected since the larger the traffic rate is, the longer it takes for the sequences to converge. This particular behavior matches what is shown in Baptista and De Bacco (2021b) in the case of \(t_{\mathcal {L}}\), but this is not the case for \(t_P(\beta )\): a non-monotonic behavior was observed in the network case.

Convergence behavior of hypernetwork properties

Fig. 6

Evolution of hypernetwork properties. Mean (markers) and standard deviations (shades around the markers) of number of hyperedges \(|E_H|\) (upper left), number of triangles |T| (upper center), covered area S(H) (upper right), average 2-degree \(d_2(H)\) (lower left), average 1-closeness centrality \(c_1(H)\)(lower center) and 2-closeness centrality \(c_2(H)\)(lower right), computed for different values of \(\beta\) as a function of time

Figure 6 shows how the various network properties change depending on the traffic rate. Mean values and standard deviations are computed across times, for a fixed value of \(\beta\). As shown, the number of hyperedges, number of triangles, covered area, and average 1-degree exhibit decreasing patterns as functions of t. As a consequence, transport optimality can be thought of as reaching minimum states on the mentioned hypernetwork properties. Another clear feature of these functions is related to the actual converged values: the larger \(\beta\) is, the smaller these metrics become. This is explained by a cost function increasingly encouraging consolidations of paths on fewer edges. Notice also that the gap between these converged values signals a non-linear dependence on the outputs of the dynamics; e.g., a converged hypernetwork obtained for \(\beta =1.1\) loses many more hyperedges if the traffic rate is then set to 1.2, whereas this loss would not be that large if \(\beta =1.2\) is increased to 1.3. The nature of these gaps is substantially different depending on the property itself. This also shows that certain properties better reveal the distinction between different optimal traffic regimes.

The behavior of the closeness centralities is distinctly different from that of the other properties. While their initial values are the same for all values of \(\beta\) (similar to the previous properties), no clear trend can be found as time increases. For \(s=1\), on average \(\beta =1.1\) generates sequences that tend to recover initial values after first increasing and then decreasing. For the other traffic rates, we observe different patterns. Notice that the s-closeness centrality on the hypergraph for \(s=1\) is the same as the classic closeness centrality on its skeleton. Thus, these rather noisy patterns are not due to the addition of hyperedges. On the other hand, for \(s=2\) the average centrality shows increasing curves. This may be due to \(L_s\) getting increasingly disconnected with small connected components. Therefore, the larger s, the closer the nodes are seen (see Fig. 3). Moreover, in this case small values of \(\beta\) lead to more stable closeness centrality values, showing the impact of \(\beta\) in building higher-order structures. While different values of \(\beta\) lead to different behaviors of the hypergraph properties (e.g. decreasing degrees and number of hyperedges for increasing \(\beta\)), we remark that choosing the value of \(\beta\) should depend on the application at hand. The analysis performed here showcases how this choice may impact the resulting topologies. This can help practitioners visualize possible consequences in terms of downstream analysis on the transportation properties of the underlying infrastructure.

P. polycephalum hypernetworks

We now analyze hypernetworks extracted from images of real data. We are interested in the evolution of the area covered by triangles in the sequences \(\{ H(\mu _t)\}_{t=0}^T\) extracted from real images of the slime mold P. polycephalum. The behavior of this organism inspired the modeling ideas of the DMK equations described in Methods. It has been shown that these slime molds follow an optimization strategy similar to that captured by the DMK dynamics while foraging for food on 2D surfaces (Nakagaki et al. 2000; Tero et al. 2007, 2010). We extract hypernetworks from images using the idea described in Methods, but instead of applying the method of Baptista et al. (2020) to obtain the networks, we use the method proposed by Baptista and De Bacco (2021a), which takes images as input. This pipeline uses the color intensities of the different image pixels to build a graph, by connecting adjacent meaningful nodes. We dedicate our attention to 4 image sequences from the Slime Mold Graph Repository (Dirnberger et al. 2017). The sequences describe the evolution of a P. polycephalum placed in a rectangular Petri dish. Each image, and thus each hypernetwork, is a snapshot of the movement of this organism over periods of 120 seconds.

We study the covered area for every one of the 4 sequences, and plot the results for one of them (namely, image set motion12; see “Appendix”) in Fig. 7. We highlight 4 times along the property sequence to display the used images together with the corresponding hypernetworks. The lower leftmost plot shows a subsection of one of the studied snapshots. As can be seen there, this subhypernetwork topology exhibits a significant number of hyperedges of size 3, mainly around the thickest parts of the slime mold. In the lower rightmost plot, the evolution of S is overall decreasing in time (similar results are obtained for other sequences, as shown in the “Appendix”). This suggests that the thicker body parts tend to get thinner as the P. polycephalum evolves into a consolidated state. This pattern resembles what is shown above for the synthetic data, i.e. the covered area tends to decrease as time evolves, similar to the behavior of the DMK-based hypernetwork sequence. This suggests that the DMK model realistically mirrors a consolidation phase towards optimality of real slime molds (Dirnberger et al. 2017).

Fig. 7

P. polycephalum hypergraphs. On top: P. polycephalum images and hypernetworks extracted from them. Bottom left: a zoomed-in part of the hypergraph shown inside the red rectangle on top. Bottom right: covered area as a function of time. The red shade highlights a tentative consolidation phase towards optimality


We proposed a method to build higher-order structures from OT sequences. This method maps every member of the sequence into a hypergraph, outputting a temporal hypernetwork. We analyzed standard hypergraph properties on these temporal families and compared them to their continuous counterparts. We showed that convergence in terms of transportation cost tends to happen faster than that given by the covered area of the hypernetworks. This suggests that the dynamics used to solve the OT problems concentrates the displaced mass into main branches, and once this task is carried out, it slightly reduces the area covered by them. We studied this and other hypergraph properties, and compared them to those of their network versions. In some cases, hypernetworks reveal more information about the topology at convergence. This suggests that hypernetworks could be a better alternative representation to solutions of OT problems for some transportation schemes. The conclusions found in this work may further enhance our comprehension of OT solutions and the links between this field and that of hypergraphs.

Availability of data and materials

The code used for the analysis was taken from … The data used in the analysis were taken from Dirnberger et al. (2017).


  • Aksoy SG, Joslyn C, Marrero CO, Praggastis B, Purvine E (2020) Hypernetwork science via high-order hypergraph walks. EPJ Data Sci 9(1):16
  • Baptista D, De Bacco C (2021a) Principled network extraction from images. R Soc Open Sci 8:210025
  • Baptista D, De Bacco C (2021b) Convergence properties of optimal transport-based temporal networks. In: International conference on complex networks and their applications. Springer, pp 578–593
  • Baptista D, Leite D, Facca E, Putti M, De Bacco C (2020) Network extraction by routing optimization. Sci Rep 10(1):1–13
  • Battiston F, Cencetti G, Iacopini I, Latora V, Lucas M, Patania A, Young J-G, Petri G (2020) Networks beyond pairwise interactions: structure and dynamics. Phys Rep 874:1–92
  • Carletti T, Battiston F, Cencetti G, Fanelli D (2020) Random walks on hypergraphs. Phys Rev E 101:022308
  • Chowdhary S, Kumar A, Cencetti G, Iacopini I, Battiston F (2021) Simplicial contagion in temporal higher-order networks. J Phys Complex 2(3):035019
  • Cuturi M (2013) Sinkhorn distances: lightspeed computation of optimal transport. In: Advances in neural information processing systems, vol 26
  • de Arruda GF, Petri G, Moreno Y (2020) Social contagion models on hypergraphs. Phys Rev Res 2:023032
  • Dirnberger M, Mehlhorn K, Mehlhorn T (2017) Introducing the slime mold graph repository. J Phys D Appl Phys 50(26):264001
  • Evans LC, Gangbo W (1999) Differential equations methods for the Monge–Kantorovich mass transfer problem, vol 653. American Mathematical Society, Providence
  • Facca E, Cardin F, Putti M (2018) Towards a stationary Monge–Kantorovich dynamics: the Physarum polycephalum experience. SIAM J Appl Math 78(2):651–676
  • Facca E, Daneri S, Cardin F, Putti M (2020) Numerical solution of Monge–Kantorovich equations via a dynamic formulation. J Sci Comput 82(68):1–26
  • Facca E, Cardin F, Putti M (2021) Branching structures emerging from a continuous optimal transport model. J Comput Phys 447:110700
  • Kantorovich L (1942) On the transfer of masses. Dokl Akad Nauk 37:227–229 (in Russian)
  • Leite D, De Bacco C (2022) Revealing the similarity between urban transportation networks and optimal transport-based infrastructures. arXiv:2209.06751
  • Nakagaki T, Yamada H, Tóth Á (2000) Maze-solving by an amoeboid organism. Nature 407(6803):470–470
  • Neuhäuser L, Mellor A, Lambiotte R (2020) Multibody interactions and nonlinear consensus dynamics on networked systems. Phys Rev E 101:032310
  • Patania A, Vaccarino F, Petri G (2017) Topological analysis of data. EPJ Data Sci 6(1):1–6
  • Peyré G, Cuturi M et al (2019) Computational optimal transport: with applications to data science. Found Trends Mach Learn 11(5–6):355–607
  • Santambrogio F (2007) Optimal channel networks, landscape function and branched transport. Interfaces Free Bound 9(1):149–169
  • Santambrogio F (2015) Optimal transport for applied mathematicians, vol 55. Birkhäuser, New York, pp 58–63
  • Schaub MT, Benson AR, Horn P, Lippner G, Jadbabaie A (2020) Random walks on simplicial complexes and the normalized Hodge 1-Laplacian. SIAM Rev 62(2):353–391
  • Taylor D, Klimm F, Harrington HA, Kramár M, Mischaikow K, Porter MA, Mucha PJ (2015) Topological data analysis of contagion maps for examining spreading processes on networks. Nat Commun 6(1):1–11
  • Tero A, Kobayashi R, Nakagaki T (2007) A mathematical model for adaptive transport network in path finding by true slime mold. J Theor Biol 244(4):553–564
  • Tero A, Takagi S, Saigusa T, Ito K, Bebber DP, Fricker MD, Yumiki K, Kobayashi R, Nakagaki T (2010) Rules for biologically inspired adaptive network design. Science 327(5964):439–442
  • Villani C (2009) Optimal transport: old and new, vol 338. Springer, Berlin
  • Westendorf C, Gruber C, Grube M (2016) Quantitative comparison of plasmodial networks of different slime molds. In: Proceedings of the 9th EAI international conference on bio-inspired information and communications technologies (formerly BIONETICS), pp 611–612
  • Xia Q (2003) Optimal paths related to transport problems. Commun Contemp Math 5(02):251–279
  • Xia Q (2014) On landscape functions associated with transport paths. Discrete Contin Dyn Syst 34(4):1683–1700
  • Xia Q (2015) Motivations, ideas and applications of ramified optimal transportation. ESAIM Math Model Numer Anal 49(6):1791–1832



The authors thank the International Max Planck Research School for Intelligent Systems (IMPRS-IS) for supporting Diego Baptista.


Open Access funding enabled and organized by Projekt DEAL.

Author information




DB and CDB derived the model, analyzed results, and wrote the manuscript. DB conducted the experiments. Both authors read and approved the final manuscript.

Corresponding author

Correspondence to Diego Baptista.

Ethics declarations

Ethics approval and consent to participate

Not applicable.

Consent for publication

Not applicable.

Competing interests

The authors declare that they have no competing interests.




Covered area for other values of \(\beta\)

We present in this section a plot similar to that of Fig. 5, comparing the covered area and the cost function, for other values of \(\beta\). As mentioned there, S shows decreasing behaviors for which \(t_P>t_{\mathcal {L}}\) holds true (see Fig. 8).

Fig. 8

S and Lyapunov cost. First and second top-down rows: from left to right we see \(\beta =1.1, 1.3\) and 1.4. Third and fourth top-down rows: from left to right we see \(\beta =1.6, 1.7\) and 1.9. First and third top-down rows: mean and standard deviation of S as a function of time t; Second and fourth top-down rows: Mean and standard deviation of the Lyapunov cost \({\mathcal {L}}\), energy dissipation \({\mathcal {E}}\) and structural cost \({\mathcal {M}}\) of transport densities. Red and blue lines denote \(t_P\) and \(t_{\mathcal {L}}\) for \(p = 1.05\)
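The comparison between \(t_P\) and \(t_{\mathcal {L}}\) can be illustrated with a short sketch. The exact definition of these convergence times is given in the main manuscript and is not reproduced here; the code below assumes, for illustration only, that a convergence time is the first time step from which a positive, overall decreasing quantity stays below p times its final value. The function name `convergence_time` and the two toy series are hypothetical.

```python
from typing import Sequence


def convergence_time(values: Sequence[float], p: float = 1.05) -> int:
    """First index t such that values[s] <= p * values[-1] for every s >= t.

    Assumed definition, for positive quantities that decrease towards
    their final value; p >= 1 is the tolerance threshold.
    """
    if not values:
        raise ValueError("values must be non-empty")
    bound = p * values[-1]
    t = len(values) - 1
    # Walk backwards while the preceding value is still within the bound.
    while t > 0 and values[t - 1] <= bound:
        t -= 1
    return t


# Hypothetical series: the cost settles quickly, the covered area more slowly.
cost = [10.0, 4.0, 2.05, 2.0, 2.0, 2.0]
area = [8.0, 6.0, 5.0, 4.4, 4.1, 4.0]

t_L = convergence_time(cost, p=1.05)
t_P = convergence_time(area, p=1.05)
print(t_L, t_P)  # 2 4
```

With these toy values the cost reaches its tolerance band before the covered area does, which is the ordering \(t_P>t_{\mathcal {L}}\) marked by the red and blue lines in the figure.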

Additional hypernetwork properties

In this section we extend the comparison between the cost function minimized by the dynamics and other hypernetwork properties (see Fig. 9). As mentioned in the main manuscript, similar monotonic behaviors can be observed in these cases.

Fig. 9

Other hypernetwork properties and Lyapunov cost. From left to right: \(\beta =1.2, 1.5\) and 1.8. From top to bottom: Mean and standard deviation of the average degree \(d_1(H)\), number of hyperedges \(|E_H|,\) number of triangles |T|, and the Lyapunov cost \({\mathcal {L}}\), energy dissipation \({\mathcal {E}}\) and structural cost \({\mathcal {M}}\). Red and blue lines denote \(t_P\) and \(t_{\mathcal {L}}\) for \(p = 1.05\)
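As a rough illustration of how the three structural properties in this figure could be computed for a single snapshot, consider the sketch below. It is not the code used in this work. It assumes that the degree of a node is the number of hyperedges containing it, which may differ from the definition of \(d_1(H)\) in the main manuscript, and that triangles are the hyperedges with exactly three nodes. The function `hypergraph_summary` and the toy hypergraph are hypothetical.

```python
from collections import Counter
from typing import Dict, Sequence, Tuple

Hyperedge = Tuple[int, ...]


def hypergraph_summary(
    nodes: Sequence[int], hyperedges: Sequence[Hyperedge]
) -> Dict[str, float]:
    """Average node degree, number of hyperedges and number of triangles."""
    degree: Counter = Counter()
    for e in hyperedges:
        for v in set(e):  # count each hyperedge once per incident node
            degree[v] += 1
    n = len(nodes)
    return {
        "avg_degree": sum(degree[v] for v in nodes) / n if n else 0.0,
        "num_hyperedges": len(hyperedges),
        "num_triangles": sum(1 for e in hyperedges if len(e) == 3),
    }


# Toy hypergraph: three ordinary edges and one triangle (hypothetical data).
nodes = [0, 1, 2, 3]
hyperedges = [(0, 1), (1, 2), (0, 1, 2), (2, 3)]
summary = hypergraph_summary(nodes, hyperedges)
print(summary)
```

Applying such a function to every snapshot \(H_i\) of a sequence gives one time series per property, which can then be compared against the cost curves.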

Fig. 10

P. polycephalum S evolution. From top to bottom: motion24, motion40 and motion79. Plots are grouped in pairs. For every pair, the plots on top show both P. polycephalum images and the hypernetworks extracted from them. The hypernetwork in the lower leftmost plot is a subsection of the hypergraph shown inside the red rectangle on top. The plot at the bottom shows the covered area as a function of time. The red shade in this plot highlights a tentative consolidation phase towards optimality

P. polycephalum hypernetworks

Data information

We explain in this section further details about the analyzed real data.

The images are taken from the Slime Mold Graph Repository (Dirnberger et al. 2017), as mentioned in the main manuscript. We study 4 \(\{H_i\}_i^T\) sequences of different lengths. The length (T) depends on the number of images included in the sequence, since different experiments need more or fewer shots. These experiments, as mentioned in the repository’s documentation, consist of placing a slime mold inside a Petri dish with a thin sheet of agar where no food is provided. The slime mold’s exploration of the dish, as explained by the creators, is unbiased due to the lack of food. Given that this organism is initially placed along one of the short edges of the rectangular dish, the experiment is considered to be finished once the plasmodium reaches the other short side. No more pictures are taken after this happens (Fig. 10).

Hypergraph extraction

We used the image sets motion12, motion24, motion40 and motion79, located in the repository, to build the studied hypernetworks. These sets contain a number of images ranging from 60 to 150. Hypernetworks are then extracted using the Img2net algorithm described in Baptista and De Bacco (2021a) as mentioned in the main manuscript, using the same configuration described in Baptista and De Bacco (2021b).

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit


About this article


Cite this article

Baptista, D., De Bacco, C. Convergence properties of optimal transport-based temporal hypergraphs. Appl Netw Sci 8, 3 (2023).



  • Optimal transport theory
  • Hypergraph theory
  • Graph theory
  • Network structure
  • Network design