Revealing the component structure of the world air transportation network

Diop, Issa Moussa; Cherifi, Chantal; Diallo, Cherif; Cherifi, Hocine

doi:10.1007/s41109-021-00430-2

Research
Open access
Published: 24 November 2021

Revealing the component structure of the world air transportation network

Issa Moussa Diop¹,
Chantal Cherifi²,
Cherif Diallo¹ &
…
Hocine Cherifi³

Applied Network Science volume 6, Article number: 92 (2021) Cite this article

5436 Accesses
15 Citations
4 Altmetric
Metrics details

Abstract

Air transportation plays an essential role in the global economy. Therefore, there is a great deal of work to understand better the complex network formed by the links between the origins and destinations of flights. Some investigations show that the world air transportation network exhibits a community and a core-periphery structure. Although precious, these representations do not distinguish the inter-regional (global) web of connections from the regional (local) one. Therefore, we propose a new mesoscopic model called the component structure that decomposes the network into local and global components. Local components are the dense areas of the network, and global components are the nodes and links bridging the local components. As a case study, we consider the unweighted and undirected world air transportation network. Experiments show that it contains seven large local components and multiple small ones spatially well-defined. Moreover, it has a main global component covering the world. We perform an extensive comparative analysis of the structure of the components. Results demonstrate the non-homogeneous nature of the world air transportation network. The local components structure highlights regional differences, and the global component organization captures the efficiency of inter-regional travel. Centrality analysis of the components allows distinguishing airports centered on regional destinations from those focused on inter-regional exchanges. Core analysis is more accurate in the components than in the whole network where Europe dominates, blurring the rest of the world. Besides the world air transportation network, this paper demonstrates the potential of the component decomposition for modeling and analyzing the mesoscale structure of networks.

Introduction

Air transport plays an essential role in the current context of globalization by reducing the distance between countries. Whether it is for the movement of millions of people or goods, thousands of flights are made per day, impacting the global economy and even public health (Colizza et al. 2006). Subsequently, the spread of the COVID-19 pandemic is mainly due to the world air transportation network. This pandemic leads to the bankruptcy of several airlines and affects the tourism and trade industry. That is why researchers have been interested in the air transportation network for a long time to study its structure, dynamics, and robustness (Zanin and Lillo 2013).

One can consider three levels of analysis of a network (macroscopic, microscopic, mesoscopic). Macroscopic analysis characterizes the entire network topology through a set of global measures. Microscopic studies investigate the network properties at the node or link level. Mesoscopic analysis concerns groups of nodes or links sharing similar features. There are two popular mesoscopic structure models: the community structure and the core-periphery structure. Although there is no consensus on their definition, both are related to the non-homogeneous density observed in real-world networks. The common understanding is that communities are dense areas of the network sparsely connected (Fortunato and Hric 2016). The core-periphery structure considers that a network comprises a dense, cohesive core and a sparse, unconnected periphery (Borgatti and Everett 2000). These mesoscopic structures are observed in most real-world networks. While they find numerous applications and explain a broad range of phenomena in networked systems, none of them is well-suited to disentangle the local interactions of nodes within their dense area from their global interactions with the other dense areas of the network. Inspired by the works reported in Ghalmane et al. (2019), Guimera et al. (2005), we propose a new mesoscopic representation of a network, called the component structure. Indeed, the authors show that considering the interplay between intra-community links and inter-community links in a modular network allows defining effective centrality measures. While these works focus on computing centrality measures in modular networks, our proposition is more general. Our goal is to distinguish between the local and global influence of various groups of nodes or links. To this aim, the proposed model decomposes a network into local components and global components. The local components are isolated, dense parts of the network that can be uncovered using a community detection or a multicore detection technique. The global components are the subnetworks joining the local components. One extracts them easily, based on the links between local components. Although it also relies on dense areas of the network as the community or core-periphery structures, the component structure offers a complementary view of the network mesoscopic organization. Indeed, components are isolated networks that can be analyzed separately. Furthermore, as the overlapping community structure, it does not operate a partition of the network. Indeed, in overlapping communities, a node can belong to multiple communities. In the component structure, a node can belong to a local and a global component. It is the case of the nodes linked to the other groups.

Our work departs from recent studies focusing on robustness (Lordan and Sallan 2019), and multilayer modeling (Lordan and Sallan 2017; Dai et al. 2018) of the air transportation network. Our main concern is to use the world air transportation network as a case study to evaluate the ability of the component structure representation to get a better understanding of the network mesoscopic organization. Therefore, we concentrate on the monolayer network representation of the world air transportation network. However, one may note that the component structure can be interpreted as a multilayer representation where the layers are the local components. Nodes and links connecting the various local components form the global components. The main contributions of this paper are as follows:

1
We introduce an alternative mesoscopic network structure called the component structure where local components are dense subnetworks, and the global components account for their interactions.
2
We use the component structure to study the structure of the world air transportation network. The local components characterize its regional organization, and the subnetworks forming the global components represent their interactions.
3
The regional characterization of the world air transportation network is not based on geographical considerations, but it relies on the density of the interactions.
4
This representation allows us to distinguish the regional impact of an airport from its influence worldwide.
5
We perform an extensive topological analysis of the component highlighting the regional and inter-regional differences of the world air transportation network.

The rest of the paper is organized as follows. Section "Litterature review" reports a review of related studies of the air transportation network. Section "Component structure of a network" introduces the definition of the component structure, and it gives an algorithm to uncover it. Section "Uncovering the dense parts of the world air transportation network" describes the data used in the experiments and examines the network community structure. Section "Local component structure of the world air transportation network" reports the analysis of the local component structure and section "Global component structure of the world air transportation network" analyzes the global component structure. Comparisons with the whole world air transportation network are reported in section "Comparison of the world air transportation network with the large components". Section "Degree centrality analysis" presents the results of a comparative analysis of the degree centrality of the components and the world air transportation network. Section "Core analysis" discusses the results of the core structure analysis. Finally, we conclude in section "Conclusion".

Litterature review

One can distinguish three levels of study of the air transportation network: worldwide, regional and national. Based on this classification we present some influential contributions. For more information the reader can refer to the following surveys (Rocha 2017; Lordan et al. 2014).

Several studies have been devoted to the properties of the worldwide air transportation network. In Guimera and Amaral (2004) Guimera et al. conduct the first exhaustive analysis of the world air transportation network. Considering airports as nodes and direct connections as links, the authors show that the degree and betweenness centrality exhibit a power-law distribution. Their work also reveals that highly connected cities are not the ones with high betweenness centrality values. This feature contradicts the classical preferential attachment formation model with a geographical distance constraint (Yook et al. 2002). They propose a new model which generates nodes with large betweenness and small degree based on geopolitical considerations. Indeed, only a few airports in each country are permitted to connect to airports of other countries, regardless of the geographical distance.

In Guimera et al. (2005), the community structure is investigated. Results show that communities correspond to geographical areas. Seven city roles are defined based on the proportion of intra-community links and extra-community links. Three roles distinguish the highly connected nodes. “Provincial hubs connect cities in their community. The “Connector hubs” are in the majority linked to cities outside their community. Finally, “Kinless hubs” share their links uniformly with all the communities. In the four non-hub categories, the “Ultraperipherical” nodes have no inter-community links. “Peripherical” nodes share most of their links with nodes in their community. “Nonhub connector” nodes have many links with nodes outside their community, and “Nonhub kinless” nodes share their links uniformly with all the communities. The main contribution of these works is to highlight the importance of geopolitical considerations in the formation mechanism of the air transportation networks and the various role of cities.

Inspired by fractality analysis, in Sun et al. (2017) the authors consider six types of nodes from fine-grained to coarse-grained granularity: airport, city, spatial areas of 100 km diameter around hubs, spatial areas of 200 km diameter around hubs, sub-national territory, country. Links are direct connections between nodes. The network analysis shows that all networks are small-world and disassortative. Furthermore, the clustering coefficient increases and the average path length decreases as the aggregation level varies from fine to coarse. The community structures uncovered by Louvain are quite consistent. It contains about ten communities corresponding to different geographical boundaries.

Cheung et al. (2020) explores the evolution of the world air transportation network during the period 2006–2016. In this weighted network, nodes are airports, and the total number of passengers per year weights the direct fly links. The authors propose a new metric called Global Airport Connectivity Index, measuring the importance of airports in global passenger movements. It combines degree, closeness, eigenvector centrality measures, flow betweenness, and an indicator of regional importance. Building on the work of Guimera et al. (2005), they classify the airports into regional hubs or global hubs, depending on their embeddedness in their community and their Global Airport Connectivity Index. Results show that the average degree and the density increase over time. Furthermore, North America, Russia, and China focus on developing regional hubs, while West Europe and the Middle East concentrating on emerging global hubs.

Some studies focus on the regional air transportation network. In Lordan and Sallan (2017), the authors investigate the European airport network where nodes are European cities, and links account for direct flights. It appears that the degree follows a two-regime power-law distribution and that the network possesses the small-world property. Using the k-core, they decompose the network into three layers: the core (max k-core), the periphery (k-core of degree one), and the bridges between the core and the periphery. The analysis shows that the core contains the global European cities. The leisure air travel origins and destinations compose the bridges. Finally, the local destinations constitute the periphery. A robustness study shows that the network is more vulnerable to the isolation of a combination of core and bridge nodes.

In Dai et al. (2018), the authors investigate the evolution of Southeast Asian air transportation over the period 1979–2019. Results indicate that the number of hubs increases in this scale-free network. Disassortative behavior increases with time due to a more pronounced hub-and-spoke configuration of small airports for better accessibility. Decomposition of the network into a core-bridge-periphery structure shows that the core comprises the regional capital cities, the most economically vibrant secondary cities, and tourist destinations. The periphery cities are in remote areas with declining connectivity. High volatility over time characterizes the bridge nodes. The number of connections and passengers increases mainly in the core layer and the bridge layer at the end of the 20th century.

Lordan and Sallan (2019) use the Official Aviation Guide (OAG) subdivision to partition the world air transportation network into seven global regions (Africa, Asia, Europe, Latin America, Middle East, North America, Southwest Pacific). In these networks nodes are cities and links represent direct connections. Analysis shows that these small-world networks exhibit a two-regime power-law degree distribution. Targeted attack experiments based on the network’s decomposition into core, bridge, and peripheral nodes show that regional networks with a large core are more resilient than networks with a smaller core.

Much more works concern the national air transportation network of major countries (US, China, India, Brazil) Wandelt et al. (2019). We summarize the main findings of the following related studies where nodes are airports and links represent direct flights. The common characteristics of all these networks is that hey all share the small-world property and are disassortative. In Guida and Maria (2007) the authors investigate the Italian network during three non-overlapping periods (June 1, 2005, to May 31, 2006–July 16 to August 14, 2005–November 2005). Results are consistent across networks. Indeed, the degree distribution and the betweenness centrality follow a double Pareto law. Their findings also suggest that the networks exhibit a fractal structure. Furthermore, the clustering coefficients are comparable and lower than those observed in a corresponding random network. It also appears that some highly connected airports have a small betweenness centrality.

In Bagler (2008) the authors investigate the network of India. They consider an unweighted directed network and a network weighted by the number of flights by week. The unweighted network has a truncated power law degree distribution. It is disassortative with a clustering coefficient one order of magnitude higher than the corresponding random network. The weighted network presents a hierarchical structure. Its analysis shows that highly connected airports share almost all the traffic, forming high traffic corridors.

The Chinese network has been extensively studied (Du et al. 2017, 2017; Yang et al. 2021). In Wang et al. (2011) the authors show that its structure diverges from other national networks. Indeed, the exponential is a better fit than the power-law for the degree distribution. The explanation lies in the influence of the three main metropolises (Beijing, Shanghai, and Guangzhou). The network is disassortative with highly connected cities surrounded by poorly connected cities with direct links. This phenomenon gets more pronounced as the degree increases. Indeed, small airports in China tend to supply direct links to the top hubs bypassing the less developed regional ones.

Extensive research on the topology and the dynamics of the U.S. air transportation network have been performed (Jia et al. 2014; Xu and Harriss 2008). In Cheung and Gunes (2012) the authors analyze its evolution over the period 1991–2011, and the study reported in Siozos-Rousoulis et al. (2021) concerns the period 2001–2016. Overall, one does not observe considerable changes in the topological properties of the networks. The number of airports and flight routes has increased according to user demand. The network exhibits a truncated power law degree distribution. It is highly disassortative, and this trend grows with time, suggesting a pronounced evolution towards a hub and spoke structure over time. Similarly, with the world air transportation network, some high betweenness centrality nodes such as Anchorage cannot be considered hubs. The clustering coefficient decreases over time, and the average shortest path increases. It is in line with hub and spoke organization where peripherical airports connect to hubs providing long-distance flights.

Several papers are devoted to the analysis of the Brazilian air transportation network (da Rocha 2009; Costa et al. 2018; Oliveira et al. 2020). In Couto et al. (2015), the authors consider three networks: the network of national flight, the network of international flight and the network with both type of flights. The network is scale-free. Six communities corresponding to geographical areas (“North”, “Center/North”, “Northeast”, “Minas Gerals”, “Southeast”, “South/West”) are discovered by Louvain. The network is not resilient to targeted attack. Viracopos and Guarulhos are the key airports in the national network and for international connections, and they have the largest values of degree centrality and betweenness centrality. In addition, the number of routes decreases while the number of passengers increases, causing a higher level of occupation of aircrafts.

The analysis of the Australian network (Hossain and Alam 2017) shows that it is scale-free. Its clustering coefficient is higher than its random network version indicating a cohesive network where passengers can be easily rerouted. The average path length suggests that, on average, a passenger can reach every destination in 3 flights. Most of the traffic goes through an interconnected group of high-degree nodes surrounded by low-degree neighbors. Centrality analysis shows that the more connected nodes do not necessarily exhibit the largest betweenness and closeness centrality values.

This literature review’s main findings are that the topological properties are pretty consistent across the various levels of studies. Indeed, the degree and betweenness centrality exhibit a heavy tail distribution. Networks are small world and disassortative, with most nodes poorly connected to a few highly connected nodes. Nevertheless, the preferential attachment mechanism is not sufficient to explain the formation of the networks. One needs to consider geographical constraints (borders) and political and economic issues to get a better understanding of the network’s topology. The clustering coefficient is higher than in random networks, and the average path length allows to join any destination in few hops. Another critical finding concerns the mesoscopic properties of the world air transportation network. Some studies demonstrate that it exhibits a community and a core-periphery structure.

Component structure of a network

This section introduces the definition of the component structure and an algorithm to uncover it based on the community structure.

Definition

Community structure and core-periphery structure models assume that the density is not homogeneous in a network. Dense areas form either the communities or the core elements of the networks. These two mesoscopic representations share a different view of the remaining nodes or links. In the community structure approach, communities are supposed to be sparsely connected by inter-community links. In the core-periphery structure, peripherical nodes are poorly connected to each other and with core nodes. To define the component structure, we retain both approaches’ common points of view, i.e., the network contains dense areas. Those dense areas are localized in the network. Indeed, the vast majority of nodes interact with nodes contained in their community or core. That is the reason why we call them local components. Indeed, they share information with the rest of the network through a set of proxy links and nodes that have a more global view of their environment. These subnetworks tie together the local components. Consequently, the definition of the component structure is quite simple. A network contains two sets of subnetworks: 1) The dense parts of the network form the local components 2) Nodes and links shared by any two local components form the global components. Note that once the dense areas are extracted, global components identification is straightforward. Furthermore, one can exploit the various definition of dense areas proposed either in the community detection literature or the multi-core-periphery studies to extract the local components. In the following, we propose an algorithm that uses the community structure approach.

Component structure detection algorithm

Building on the work of Ghalmane et al. (2019), we propose a component structure extraction algorithm exploiting the community structure. Remember that a network is decomposed into local components and global components. The local components are isolated dense parts of the network. The global components are subnetworks joining the local components.

The algorithm to uncover the component structure of a network proceeds as follows:

1
Uncover the dense part of the network: Use a community detection algorithm to uncover the community structure.
2
Extract the local components: Remove the inter-community links from the community structure to form the local components.
3
Extract the global components: Remove the intra-community links from the community structure, and the subsequent isolated nodes.

Note that this representation is redundant. Indeed, a node can belong simultaneously to a local component and to a global component. Such nodes at the frontier of the communities are important locally and globally.

Figure 1 illustrates the decomposition process of a network into its components on a toy example. First, one uses a community detection algorithm to partition the network into a set of non-overlapping communities. Inter-community links joining nodes in different communities are black. Nodes and intra-community links that bind nodes in the same community share the same color. We observe three communities respectively colored in red, yellow, and green. Removing the inter-community links allows us to isolate the three local components. Each community forms a local component, and it carries only local information. One obtains the global components removing the colored links from the community structure (intra-community links) and the subsequently isolated nodes. Global components act as bridges between the dense areas of the network. They are important actors in the information diffusion between the various dense parts of the network. As components are isolated sub-networks of the initial network, one can proceed to any type of topological analysis.

Uncovering the dense parts of the world air transportation network

In this section, we present the data set used in the experiments. We perform a comparative analysis of the community structure uncovered by two popular community detection algorithms used to extract the dense parts of the network.

Data

Information on world flights has been collected from FlightAware. The data covers six days (between May 17, 2018, and May 22, 2018), ensuring the inclusion of less frequent connections (Alves et al. 2020). Nodes represent airports, and links represent direct flights between two airports. For the sake of simplicity, the network is undirected and unweighted. However, one can consider weighted and directed networks using the appropriate analysis tools. The network contains 2734 nodes and 16,665 links. Table 1 reports its basic topological properties.

Table 1 Basic topological properties of the world air transportation network

Revealing the component structure of the world air transportation network

Abstract

Introduction

Litterature review

Component structure of a network

Definition

Component structure detection algorithm

Uncovering the dense parts of the world air transportation network

Data

Community detection

Community structure analysis

Local component structure of the world air transportation network

Analysis of the large local components

Basic macroscopic topological properties

Degree distribution

Degree-degree distance distribution of the local components

Distribution of airports by country

Analysis of the small local components

Basic macroscopic topological properties

Global component structure of the world air transportation network

Analysis of the large global component

Basic macroscopic topological properties

Degree and degree-degree distance distribution

Distribution of the airports between regions

Comparison of the world air transportation network with the large components

Basic macroscopic topological properties

Degree and degree-degree distance distribution

Degree centrality analysis

Exploring the regional hubs in the large local components

Exploring the inter-regional hubs in the large global component

Comparison of the regional and inter-regional hubs with the hubs of the world air transportation network

Core analysis

Exploring the max k-core in the large local components

Exploring the max k-core in the large global components

Comparison of max k-core of the components with the max k-core of the world air transportation network

Conclusion

Availability of data and materials

Abbreviations

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Additional information

Publisher's Note

Appendix

Appendix

Rights and permissions

About this article

Cite this article

Share this article

Keywords