Skip to main content

Social network analysis of manga: similarities to real-world social networks and trends over decades

Abstract

Manga, Japanese comics, has been popular on a global scale. Social networks among characters, which are often called character networks, may be a significant contributor to their popularity. We collected data from 162 popular manga that span over 70 years and analyzed their character networks. First, we found that many of static and temporal properties of the character networks are similar to those of real human social networks. Second, the character networks of most manga are protagonist-centered such that a single protagonist interacts with the majority of other characters. Third, the character networks for manga mainly targeting boys have shifted to denser and less protagonist-centered networks and with fewer characters over decades. Manga mainly targeting girls showed the opposite trend except for the downward trend in the number of characters. The present study, which relies on manga data sampled on an unprecedented scale, paves the way for further population studies of character networks and other aspects of comics.

Introduction

The global comic market has been rapidly growing. The market size was valued at USD 14.7 billion in 2021 and is expected to expand at a compound annual growth rate of 4.8% from 2022 to 2029 (Fortune Business Insights 2022). A driving force underlying the rapid expansion of the market is the widespread availability of e-books, which enables us to easily access comics around the world. The COVID-19 pandemic also helped the growth of the comic market because the lockdown in many countries increased the demand for comics. Manga, Japanese comics, has been especially popular on a global scale. Many manga have been translated into different languages. For example, over 510 million copies of One Piece have circulated worldwide; One Piece is recorded in the Guinness Book of Records as the best-selling comic in the history (The Mainichi 2022). The history of the development of manga culture in Japan, its social background, and its global prevalence have been studied (Ito 2005; Brenner 2007; Schodt 2013).

Our social networks are complex but characterized by common properties such as heterogeneity and community structure (Barabási 2016; Newman 2018). Because literally all storylines of manga rely on interactions among characters, social networks among characters, which are often called character networks, may provide a backbone of the story of manga and influence its popularity. More generally, character networks have been analyzed for quantitatively characterizing fiction works (Labatut and Bost 2019; Perc 2020). A number of problems can be addressed through analyses of character networks, such as summarization (Tran et al. 2015; Bost et al. 2019), classification (Ardanuy and Sporleder 2014; Holanda et al. 2019), and role detection (Weng et al. 2007; Jung et al. 2013). Character networks have been studied for a variety of fiction such as novels (Elson et al. 2010; Ardanuy and Sporleder 2014; Bonato et al. 2016; Chaturvedi et al. 2016; Min and Park 2019; Gessey-Jones et al. 2020), plays (Stiller et al. 2003; Mutton 2004; Moretti 2011; Rieck and Leitte 2016), movies (Weng et al. 2007; Park et al. 2012; Agarwal et al. 2014; Yeh and Wu 2014), and TV series (Park et al. 2012; Tan et al. 2014; Nan et al. 2015; Chen and Choi 2016; Bost et al. 2019). However, character networks for comics have been rarely explored except for Marvel comics (Alberich et al. 2002; Gleiser 2007), a graphic novel (Labatut 2022), and a few manga titles (Murakami et al. 2020). One possible reason for this is the difficulty of automating data collection for comics (Labatut and Bost 2019).

In the present study, we analyze character networks of 162 popular Japanese manga that span decades and test the following two hypotheses. First, we expect that popular manga tend to produce realistic social networks. Therefore, we hypothesize that structural and temporal properties of the character networks of manga are similar to those of human social networks. If this hypothesis is supported, it could shed light on how to compose popular manga. Second, our data enables us to investigate how trends of manga have changed over approximately 70 years. Therefore, we hypothesize that character networks of modern manga and old ones are systematically different. We also examine differences in the network structure between manga mainly targeting boys versus girls. Through these analyses, we provide a new understanding of manga culture and its historical development.

Methods

Data collection

We focus on manga of which more than 10 million copies have been published in Japan as of January 2021 (mangazenkan 2021). Since the physical size of the manga may affect the number of panels per page and we adopted the page as the unit of analysis, we excluded the manga that had not been published in the paperback pocket edition (112 mm × 174 mm). Note that the paperback pocket edition is the most common for the boys’ and girls’ manga in Japan. As a result, we included 162 manga (see Additional file 1 for the list of the 162 manga).

We prepared a data table of time-stamped copresence of characters for each manga as follows. For each manga, we manually examined volumes one through three. Examining only three volumes is due to a logistic limitation. All the selected 162 manga had at least three volumes. Then, we first extracted all the characters whose name, blood relation to a named character, or job title that uniquely identifies the character, is known. To ensure the reproducibility of this work, we avoided to use any other information sources (e.g., other volumes of the same manga, the Internet, or anime) to attempt to identify more characters. In practice, such additional information sources would not contribute to identifying many more characters. Second, we recorded the copresence of characters on each page as interaction between the characters. Note that the copresence is the most common definition of interaction in constructing character networks in fictional works (Labatut and Bost 2019). We used copresence on a page rather than in a single panel within a page because it is common that characters appearing in different panels on the same page have some interactions (e.g., two characters talk to each other by alternately occupying successive panels).

Character networks

The original data from which we construct the temporal and static character networks are equivalent to a temporal bipartite graph in which the two types of nodes are characters and pages, and an edge connects a character and a page in which the character appears. We construct a character network by projecting the bipartite graph onto the space of character nodes. In other words, in the character network, we connect two characters by an edge if and only if they appear on the same page at least once (see Fig. 1). The weight of an edge is the number of pages on which the two characters are copresent. Because we are interested in interaction between the characters, we exclude the isolated nodes in the character networks from the analysis.

Fig. 1
figure 1

Schematic illustration of one-mode projection, with which we construct a character network from a temporal bipartite graph of a manga

Bipartite configuration model

To generate a randomized bipartite network, we choose a pair of edges in the original bipartite graph uniformly at random, denoted by (cp) and \((c', p')\), where c and \(c'\) are characters and p and \(p'\) are pages. If c and \(c'\) are the same or p and \(p'\) are the same, we discard the two edges and redraw them. Then, we rewire the two original edges to \((c, p')\) and \((c', p)\). We repeat this procedure 10, 000 times excluding the discarded edge pairs to generate a randomized temporal bipartite network. We construct a static randomized character network from the generated randomized bipartite graph by one-mode projection.

Coefficient of variation

To quantify the heterogeneity of a variable, we measure the coefficient of variation (CV), which is defined as the standard deviation divided by the average. If a variable takes only one value, the CV is 0. If a continuous-valued variable obeys an exponential distribution, the CV is 1. A power-law distribution has a CV value substantially larger than 1.

Degree assortativity coefficient

The degree assortativity coefficient measures the extent to which nodes with similar degrees tend to be adjacent to each other in a network (Newman 2002, 2003). It is defined by

$$\begin{aligned} r=\frac{\sum _{i=1}^{N}\sum _{j=1}^{N}(A_{ij} -k_i k_j/2M)k_i k_j}{\sum _{i=1}^{N}\sum _{j=1}^{N}(k_i \delta _{ij}-k_i k_j/2M)k_i k_j}, \end{aligned}$$
(1)

where N is the number of nodes, M is the number of edges, \(k_i\) is the degree of node i, and \(\delta _{ij}\) is the Kronecker delta. Tha value of r ranges between \(-1\) and 1. A positive value indicates an assortative network, in which nodes with similar degrees tend to be adjacent to each other. A negative value suggests a disassortative network, in which nodes with different degrees tend to be adjacent to each other.

Clustering coefficient

The clustering coefficient quantifies the amount of triangles in a network (Watts and Strogatz 1998). The local clustering coefficient for node i is defined by

$$\begin{aligned} C_i=\frac{\text {(number of triangles including node }i)}{k_i (k_i-1)/2}. \end{aligned}$$
(2)

The denominator gives the normalization such that \(0 \le C_i \le 1\). The clustering coefficient, denoted by C, is defined by the average of \(C_i\) over all the nodes in the network, i.e.,

$$\begin{aligned} C=\frac{1}{N}\sum _{i=1}^{N}C_i. \end{aligned}$$
(3)

Interevent time

An interevent time (IET) refers to the time between two consecutive events. It should be noted that we regard a page as the time unit to simplify the analysis although manga story does not always follow the chronological order. Copresence of two characters on a page defines a time-stamped event on the edge of the character network, where we identify the page number as discrete time for simplicity. Therefore, an IET on edge (ij) is the time between two consecutive copresence events of characters i and j. A time-stamped event for a given node is the presence of the character on a page. Therefore, an IET for node i is the time between two consecutive appearances of character i.

SI model

We use the susceptible-infectious (SI) model (Kermack and McKendrick 1927) to numerically investigate contagion. Because infection does not spread from one connected component to another, we run the SI model on the largest connected component of the character network (see Additional file 1 for the number of nodes in the largest connected component, denoted by \({\overline{N}}_{(L)}\), for the 162 manga). We assume that just one character is initially infectious and that the other \({\overline{N}}_{(L)}-1\) characters are initially susceptible. If a susceptible character co-appears with an infectious character on a page, the susceptible character becomes infectious with probability \(\beta\). Different infectious characters appearing on the same page independently attempt to infect each susceptible character on the page. Once a character contracts infection, it stays infectious forever. We run the SI dynamics with each of the \({\overline{N}}_{(L)}\) characters as the sole character that is initially infectious.

Temporal correlation coefficient

We quantify the persistence of edges over time by the temporal correlation coefficient (Nicosia et al. 2013; Thompson et al. 2017). First, we calculate the topological overlap for node i at time t by

$$\begin{aligned} O_{it} = \frac{\sum _{j=1}^{N} A_{ij}^{t} A_{ij}^{t+1}}{\sqrt{\sum _{j=1}^{N} A_{ij}^{t} \sum _{j=1}^{N} A_{ij}^{t+1}}}, \end{aligned}$$
(4)

where \(A_{ij}^t\) is the adjacency matrix of an unweighted network at time step t. In other words, \(A_{ij}^t = 1\) if characters i and j are copresent on page t, and \(A_{ij}^t = 0\) otherwise. We then define the average topological overlap for node i by

$$\begin{aligned} O_{i} = \frac{1}{T-1} \sum _{t=1}^{T-1} O_{it}, \end{aligned}$$
(5)

where T is the number of pages. The temporal correlation coefficient for the entire temporal network is given by

$$\begin{aligned} O = \frac{1}{N} \sum _{i=1}^{N} O_{i}. \end{aligned}$$
(6)

Partial correlation coefficient

The partial correlation coefficient measures the extent of association between two variables while controlling for the influence of one or more additional variables. The three-way partial correlation coefficient between \(x_i\) and \(x_j\) conditioned on \(x_k\) is given by

$$\begin{aligned} r_{ij|k}=\frac{r_{ij}-r_{ik}r_{jk}}{\sqrt{\smash [b]{1-r_{ik}^2}}\sqrt{\smash [b]{1-r_{jk}^2}}}, \end{aligned}$$
(7)

where \(r_{ij}\) is the Pearson correlation coefficient between \(x_i\) and \(x_j\).

Results

Similarities to empirical social networks

Fig. 2
figure 2

Degree distribution for the A character and B page nodes in the bipartite graph for One Piece. We also show the Poisson distributions with the same mean

Our original data are equivalent to a temporal bipartite graph in which the two types of nodes are characters and pages, and the edges connect characters to the pages in which they appear. The pages are ordered in time. We show the descriptive statistics of the bipartite networks for 162 manga in Additional file 1. We show the degree distribution for the character nodes for One Piece and the Poisson distribution with the same mean in Fig. 2A. The CV of the original degree distribution of the characters in One Piece is 1.66. This value is approximately 11.1 times larger than that of the Poisson distribution. Therefore, similar to human social networks (Barabási 2016; Newman 2018), the characters in manga have heterogeneous numbers of connections, and there are a small number of characters that appear disproportionately frequently on various pages. We obtained similar results for the other manga (see Additional file 1).

In contrast, the CV of the degree distribution for the page nodes is 0.478 for One Piece. The CV for the Poisson distribution with the same mean is 0.592. Therefore, we argue that the characters do not appear uniformly randomly over the pages. As we show in Fig. 2B, there are typically two or three characters on a page in One Piece, and pages containing none or just one character are relatively rare. We obtained similar results for the other manga (see Additional file 1). These results suggest that the high heterogeneity of the characters in terms of the frequency of appearance on pages and the tendency of a page typically containing two or three characters are two common properties of manga. Therefore, in the following analyses, we consider the bipartite configuration model, in which the degrees of all character and page nodes are preserved and the edges are otherwise randomly placed, as a null model. Then, we examine properties of character networks that we can explain by the null model versus those we cannot.

Fig. 3
figure 3

Character network for One Piece. The thickness of an edge is proportional to the weight of the edge. The node with an arrow represents the protagonist, Monkey D. Luffy

We show the weighted character network, which is the projection of the bipartite graph onto the space of character nodes, for One Piece in Fig. 3 (see Additional file 2: Fig. S1 for the networks for all 162 manga). By definition, the edge weight is equal to the number of pages in which the two characters simultaneously appear. We show a summary of descriptive statistics of the character networks over 162 manga in Table 1 (see Additional file 1 for the statistics for each manga). The character networks vary widely in size from manga to manga. Although multiple protagonists may exist in a manga, for simplicity, we define the protagonist as the node with the largest node strength (i.e., weighted degree) in the character network. The protagonist is the character that appears in the largest number of pages for most manga (\(97.5\%\)). In fact, the protagonist for One Piece, indicated by an arrow in Fig. 3, is Monkey D. Luffy, who is generally known as the most central character in One Piece.

Table 1 Structural properties of character networks for 162 manga

We compared various properties of character networks between the empirical character networks and their randomization obtained by the one-mode projection of the bipartite network generated by the bipartite configuration model. We first compare three strength-related indices, i.e., the average strength, \(\langle s \rangle\), the CV of the strength, \(s_{\mathrm{CV}}\), and the strength of the protagonist, \(s_{\text {P}}\), of the character networks for all 162 manga between the original and randomized networks in Fig. 4A–C. These figures suggest that randomized character networks well explain the strength-related indices of the original character networks.

Fig. 4
figure 4

Comparison between the original and randomized character networks for 162 manga. For the randomized networks, we show the mean and standard deviation on the basis of 1,000 realizations by the circle and error bar, respectively. A \(\langle s \rangle\): average node strength. B \(s_{\mathrm{CV}}\): CV of the node strength. C \(s_{\mathrm{p}}\): strength of the protagonist. D \(\langle k \rangle\): average degree. E \(k_{\mathrm{CV}}\): CV of the degree. F \(k_{\mathrm{p}}/(N-1)\): normalized degree of the protagonist. G \(\alpha\): coefficient in the relationships between the degree and strength, i.e., \(s_i \propto k_i^{\alpha }\). H r: degree assortativity coefficient. I C: clustering coefficient. J \(\tau _{\mathrm{CV}}^{\mathrm{node}}\): CV of IETs on nodes. K \(\tau _\mathrm{CV}^{\mathrm{edge}}\): CV of IETs on edges. L \(N_\mathrm{I}/{\overline{N}}_{(L)}\): final epidemic size (i.e., fraction of infected nodes at the last time step)

Figure 5A shows the strength distribution for One Piece. We find that the strength obeys a heavy-tailed distribution over some scale. This result is consistent with Fig. 4B, which shows that the CV of the strength for most manga is substantially larger than 1; the CV for One Piece is 1.48. The heavy-tailed strength distributions shown in Fig. 5A for One Piece and implied in Fig. 4B for a majority of manga are consistent with those for empirical social networks (Barrat et al. 2004; Wang et al. 2006).

In contrast to the strength-related indices, the randomized character networks do not explain the degree-related indices of the original character networks. The randomized networks tend to overestimate the average degree, \(\langle k \rangle\), of the original character networks (see Fig. 4D). This result suggests that characters tend to repeat interacting with relatively few characters. The CV of the degree, \(k_{\mathrm{CV}}\), for the original networks tends to be larger than that for the randomized networks (see Fig. 4E). Let \(k_{\mathrm{p}}\) denote the degree of the protagonist. The fraction of the other characters that the protagonist is adjacent to, \(k_\mathrm{P}/(N-1)\), is \(0.899 \pm 0.103 \, (\mathrm{mean} \pm \mathrm{standard \, deviation}\) based on the 162 manga) and \(0.971 \pm 0.0413\) for the original and randomized character networks, respectively (see Table 1 and Fig. 4F). Therefore, the protagonist is adjacent to most of the other characters in both original and randomized networks. Based on these results, we conclude that the character networks are strongly protagonist-centered, in which the protagonist interacts with most of the other characters, while other characters tend to interact only with fewer characters than expected for the randomized networks, but including the protagonist. This interpretation is consistent with the aforementioned observation that \(\langle k\rangle\) and \(k_{\mathrm{CV}}\) are smaller and larger for the original than randomized networks, respectively.

Fig. 5
figure 5

Strength and degree distributions for the characters in One Piece. A Strength distribution. B Degree distribution. C The relationship between the degree and strength. Each circle represents a character. The coefficient \(\alpha\) is determined by the linear regression \(\log s_i = \alpha \log k_i +b\), where b is an intercept; the solid line shows the linear regression. Variable \(R^2\) represents the coefficient of determination

We show the degree distribution for One Piece in Fig. 5B. The CV of the degree, \(k_{\mathrm{CV}}\), for One Piece is 0.77. Figure 4E shows that the CV of the degree for most of the manga is smaller than 1. In fact, this result is consistent with that for empirical social networks with similar number of nodes (Read 1954; Zachary 1977), while large empirical social networks tend to have heavy-tailed degree distributions implying a large CV (Barabási et al. 2002; Mislove et al. 2007).

We found that the strength is super-linearly scaled with the degree, i.e., \(s_i \propto k_i^{\alpha }\) with \(\alpha >1\), in the character networks, where \(\propto\) indicates “proportional to”. Note that the absence of correlation between the strength and degree would yield \(\alpha =1\) (Barrat et al. 2004). Specifically, in Fig. 5C, we show the relationship between \(k_i\) and \(s_i\) for One Piece. We obtained \(\alpha =1.94\) with the coefficient of determination \(R^2=0.847\). We obtained similar results for the other manga (see Fig. 4G). These results are consistent with power-law relationships between the strength and degree with \(\alpha > 1\), which has been observed in empirical social networks (Barrat et al. 2004; Wang et al. 2005). Note that the randomized character networks also yield \(\alpha > 1\) and that \(\alpha\) for the original and randomized networks are highly correlated with a small number of exceptions.

The character networks are disassortative. In fact, the degree assortativity coefficient, denoted by r, of the character network is \(-0.345\) on average (see Table 1). The values of r for all but two manga are negative (see Fig. 4H). This result is in stark contrast with various observations that social networks are more often than not assortative with positive r (Newman 2002; Newman and Park 2003). We argue that the character networks are disassortative because they are strongly protagonist-centered. In fact, star graphs are disassortative with large negative values of r (Piraveenan et al. 2008; Estrada 2011). The degree disassortativity is also observed for empirical egocentric social networks (Batagelj and Mrvar 2000; Gupta et al. 2015). Figure 4H indicates that r is also negative for the randomized character networks although the correlation between r for the original and randomized networks is low. Therefore, we conclude that the degree disassortativity in our character networks is a consequence of the protagonist-centered nature of the original character-page bipartite network.

The character networks are highly clustered. Specifically, the clustering coefficient, denoted by C, over the different manga is 0.790 with the minimum value of 0.598 (see Table 1). This result is consistent with the observations that empirical social networks have high clustering coefficients (Watts and Strogatz 1998; Saramäki et al. 2007). In fact, randomized character networks also have similarly large C although the spread is large between the empirical and randomized networks. Therefore, we conclude that a high clustering coefficient is a consequence of one-mode projection of the bipartite graph, which is known (Newman 2001; Ramasco et al. 2004).

Fig. 6
figure 6

Survival function, \(P(\tau )\), of IETs on A nodes and B edges for One Piece. Each line in A and B corresponds to a node and an edge, respectively. We only considered nodes that had at least 100 events and edges that had at least 50 events for A and B, respectively

We found that heavy-tailed distributions of IETs are simultaneously present for nodes and edges in the character networks. We show the survival functions of the IETs (i.e., probability that the IET, denoted by \(\tau\), is larger than the specified value) for individual nodes and edges in One Piece in Fig. 6A and B, respectively. The relatively slow decay in Fig. 6 suggests heavy-tailed distributions for both nodes and edges across some scales of \(\tau\). The CV values for IETs on nodes and edges for One Piece are \(1.72 \pm 0.697\) and \(1.92 \pm 0.535\), respectively. We obtained similar results for the other manga, as we show in Fig. 4J and K. These figures also indicate that the randomization of the bipartite network does not preserve this feature, yielding CV values close to 1 regardless of the CV value for the original temporal character networks. It should be noted that a Poisson process produces an exponential IET distribution, which yields CV = 1. These results support that IETs for nodes and edges in the original character networks are non-Poissonian and heterogeneously distributed, which is consistent with properties of empirical social networks (Fonseca dos Reis et al. 2020).

We simulated stochastic contagion processes on the character network, in which one character can infect another if and only if they appear on the same page. The purpose of this analysis is to examine whether contagion on the character network occurs in a manner similar to that on empirical temporal contact networks, which show different contagion patterns from the case of static contact networks. Related to heavy-tailed IET distributions, we found that epidemic spreading occurs more slowly in the temporal character networks than in randomized counterparts. We ran the SI model. By assumption, an infectious character independently infects each susceptible character coappearing on the same page with probability \(\beta =0.2\). For One Piece, we show in Fig. 7 the time course of the fraction of the infectious characters averaged over all the runs. We also show the corresponding averaged time courses for each of the 1,000 randomized temporal networks by the blue lines. Figure 7 indicates that the infection occurs more slowly in the original temporal network than in the randomized temporal networks. We obtained similar results for the other manga (see Fig. 4L). These results are qualitatively the same as those observed for empirical social temporal networks (Karsai et al. 2011; Masuda and Holme 2013).

Fig. 7
figure 7

Time courses of the fraction of infectious characters for the original temporal network and 1,000 randomized temporal networks for One Piece. We used the SI model starting from one initially infectious character. Each curve represents the average time course of the fraction of infectious characters over \({\overline{N}}_{(L)}\) runs. We set the infection probability per contact event to \(\beta =0.2\)

Trends of network structure over decades

The 162 manga span nearly 70 years, from Astro Boy with the first volume being published in 1952 to Jujutsu Kaisen in 2018 (see Additional file 1 for detailed information on the 162 manga). The trend of the structure of manga character networks may have changed over the \(\approx 70\) years, reflecting the transitions of Japanese society including in economics and fashion. In this section, we explore possible existence of such trends.

We first calculated the Pearson correlation coefficient between the year of publication of the first volume, denoted by y, and various indices of network structure on the basis of all 162 manga. We show the correlation coefficients and their 95% confidence intervals (CIs) in Fig. 8A. We find that the number of characters, N, tends to decrease over decades (i.e., negative correlation between N and y). The CV of the node degree, \(k_{\mathrm{CV}}\), and that of the node strength, \(s_{\mathrm{CV}}\), are negatively correlated with y with moderate effect sizes. In general, these and other network indices may be affected by the number of nodes, N (van Wijk et al. 2010). Therefore, we also examined the partial correlation coefficient between y and each index by partialing out the effect of N. The results of the partial correlation coefficient for \(k_{\mathrm{CV}}\) and \(s_{\mathrm{CV}}\), shown by the lines with squares in Fig. 8A, are consistent with those of the Pearson correlation coefficient, although the partial correlation is closer to 0 than the Pearson correlation. We also find that the average degree, \(\langle k \rangle\), is positively correlated with y in terms of the partial correlation. These results suggest that more recent manga tend to be denser and more homogeneous in the node’s connectivity, such that various characters, not just the protagonist, tend to have more connections in more recent manga.

Fig. 8
figure 8

Correlation between the year of publication of the first volume, y, and indices of network structure. For the partial correlation coefficient, we partial out the influence of the number of nodes. According to a standard, the effect size is said to be large, moderate, or small when the correlation coefficient is \(> 0.5\), \(> 0.3\), or \(> 0.1\), respectively (Cohen 1988). The horizontal lines indicate 95% confidence intervals. N: number of nodes, \(\langle k \rangle\): average degree, \(k_{\mathrm{CV}}\): CV of the degree, \(k_{\mathrm{p}}/(N-1)\): normalized degree of the protagonist, \(\langle s \rangle\): average strength, \(s_{\mathrm{CV}}\): CV of strength, \(s_{\mathrm{p}}\): strength of the protagonist, r: degree assortativity coefficient, C: clustering coefficient, and O: temporal correlation coefficient

How the trend of the character network changes over time may depend on genres of manga. A most major categorization of Japanese manga is the one based on the gender of their main readership (Toku 2007; Prough 2010). Therefore, we classify the 162 manga into 124 boys’ and 38 girls’ manga based on the genre section on their Wikipedia pages (see Additional file 1 for the genre of each manga), while their classification is necessarily subjective. We show the Pearson and partial correlation coefficients between each index and y, separately for the boys’ and girls’ manga in Fig. 8B and C, respectively.

We find that there tend to be less characters for more recent manga for both boys’ and girls’ manga. The boy’s and girls’ manga are different in the following aspects in terms of the partial correlation coefficients, i.e., when we control for the number of characters. The trends that we identified for all the 162 manga are even more pronounced in the boys’ manga. The effect sizes of the partial correlation for \(\langle k \rangle\), \(k_{\mathrm{CV}}\), and \(s_{\mathrm{CV}}\) for the boys’ manga are larger than those for the 162 manga. In addition, the degree assortativity coefficient, r, is positively correlated with y. Because r is negative for most manga, this result implies that r tends to be closer to 0 for more recent boys’ manga. On the other hand, the two protagonist-related indices, i.e., \(k_{\mathrm{P}}/(N-1)\) and \(s_{\mathrm{P}}\), are not correlated with y. These results suggest that the character networks in more recent boys’ manga tend to be less protagonist-centered, in which non-protagonist characters have more connections among them.

The trend for the girls’ manga is opposite to that for the boys’ manga except that the downward trend in the number of characters is common. The two indices for which the partial correlation was negative for the boys’ manga, i.e., \(k_{\mathrm{CV}}\) and \(s_{\mathrm{CV}}\), are positively correlated with y for the girls’ manga in terms of the partial correlation. On the other hand, the two indices for which the partial correlation was positive for the boys’ manga, i.e., \(\langle k \rangle\) and r, are negatively correlated with y for the girls’ manga. Furthermore, the normalized degree of the protagonist, \(k_{\mathrm{P}}/(N-1)\), for which we did not confirm a correlation for the boys’ manga, is positively correlated with y. Overall, these results indicate that the character networks in the girls’ manga have shifted toward more protagonist-centered networks.

Discussion

We have examined character networks of 162 Japanese manga that span over 70 years. Our main findings are as follows. First, the structural and temporal properties of the character networks of manga are similar to those of human social networks. Second, the trend of the manga character networks has shifted over decades. Third, this trend shift is substantially different between boys’ and girls’ manga.

Similarities between character networks and human social networks have been investigated for some fictions such as Marvel comics (Alberich et al. 2002; Gleiser 2007), a graphic novel (Labatut 2022), Shakespeare’s plays (Stiller et al. 2003), and myths (Mac Carron and Kenna 2012). Our results of the high clustering coefficient and the heavy-tailed distribution of the strength in the character networks are consistent with these previous studies. However, we ascribe the high clustering coefficient to the effect of one-mode projection rather than to similarity to empirical social networks because the randomized character networks also have high clustering. Our results of the disassortativity is also consistent with the results for Marvel comics (Gleiser 2007), a graphic novel (Labatut 2022), and myths (Mac Carron and Kenna 2012). Therefore, we infer that the heavy-tailed distribution of the strength and disassortativity in the character networks are common properties for various types of fictions. Although the aforementioned previous studies investigated only static properties of the character networks, we further revealed temporal properties of the character networks, such as the long-tailed IET distributions, which are in fact consistent with empirical social networks (Karsai et al. 2011; Masuda and Holme 2013; Fonseca dos Reis et al. 2020).

The bipartite configuration model, which is a standard random bipartite graph, has been shown to be reasonably accurate at explaining some features of the empirical character networks including statistics of the node’s strength, \(\alpha\) values, disassortativity, and the high clustering coefficients. In contrast, there are other features of the empirical character networks that deviate from the expectation from the bipartite configuration graph. Such features include statistics of the degree, the IET distributions, and speeds of epidemic spreading. We suggested that part of these differences originates from the protagonist-centered nature of the character networks. In other words, the protagonist interacts with most characters, while non-protagonist characters tend to interact only with the protagonist and a smaller number of other characters than expected by the configuration model. There are positive support of protagonist-centered social networks in the real world when the network is egocentric (Batagelj and Mrvar 2000; Gupta et al. 2015).

We also found that more modern manga tend to have fewer characters, be denser, and be less protagonist-centered for the boys’ manga, which may reflect a modern change in the society that places more emphasis on diversity and teamwork (Dell’Era and Verganti 2010; Curşeu and Pluut 2013). In contrast, the character networks in the girls’ manga have shifted toward more protagonist-centered, although the downward trend in the number of characters is common. In general, girls’ manga in Japan tend to revolve around issues of love and friendship with a focus on inner feelings of the protagonist (Prough 2010; Takahashi 2014). Our results suggest that more recent girls’ manga may describe the relationships between the protagonist and a few other characters in depth.

There are many future directions of investigation. First, we focused on 162 manga with high circulations. By analyzing manga with low circulation as well, we may be able to discover static and temporal properties of character networks that readers favor, contributing to understanding why some manga are more popular than others. Second, we analyzed only the first three volumes of each manga because of a logistic limitation. Examining all volumes would allow us to understand the narrative structure, which have been analyzed for novels (Gessey-Jones et al. 2020), movies (Weng et al. 2007), and TV series (Park et al. 2012). For this purpose, a wide variety of time series and temporal network analysis tools, such as change-point detection, temporal centrality, and temporal community structure, may be useful (Holme and Saramäki 2012, 2019; Masuda and Lambiotte 2020). Third, character networks are probably signed in most cases, connecting characters by positive or negative ties. It is worth to deploy sentiment analysis based on text (Min and Park 2019) and facial expressions (Soleymani et al. 2015) to construct and analyze signed networks of characters in manga. Temporal network analysis of signed character networks may also reveal complicated dynamics of relationships among characters (e.g., an enemy later becomes an ally). Fourth, manga in different genres such as action, adventure, sports, and comedy, may have different structures of character networks. In addition, we have only analyzed the manga that have been published in the paperback pocket edition, which is the most common for the boys’ and girls’ manga in Japan. Analyzing manga of other sizes may help us understand variation of character networks over a wider range of genres. Fifth, studying character networks in comics in countries other than Japan warrants future work. For example, character networks may reflect societal differences from country to country.

In conclusion, to the best of our knowledge, the present study of character networks for 162 manga titles is unprecedented in scale in quantitative studies of comics and has enabled us to discover their general features. We also introduced new tools and views to the analysis of fiction character network data, such as the bipartite configuration model, the concept of protagonist-centeredness, and temporal network analysis. We hope that this study triggers further quantitative studies of character networks in manga and other types of fictions.

Availability of data and materials

The datasets generated during and/or analysed during the current study are available in the GitHub repository, https://github.com/KS-92/Manga.

References

  • Agarwal A, Balasubramanian S, Zheng J, Dash S (2014) Parsing screenplays for extracting social networks from movies. In: Proceedings of the 3rd Workshop on Computational Linguistics for Literature (CLFL), pp 50–58

  • Alberich R, Miro-Julia J, Rosselló F (2002) Marvel Universe looks almost like a real social network. https://arxiv.org/abs/cond-mat/0202174

  • Ardanuy MC, Sporleder C (2014) Structure-based clustering of novels. In: Proceedings of the 3rd Workshop on Computational Linguistics for Literature (CLFL), pp 31–39

  • Barabási AL (2016) Network Science. Cambridge University Press, Cambridge

    MATH  Google Scholar 

  • Barabási AL, Jeong H, Néda Z, Ravasz E, Schubert A, Vicsek T (2002) Evolution of the social network of scientific collaborations. Physica A 311(3–4):590–614

    Article  MathSciNet  MATH  Google Scholar 

  • Barrat A, Barthélemy M, Pastor-Satorras R, Vespignani A (2004) The architecture of complex weighted networks. Proc Natl Acad Sci 101(11):3747–3752

    Article  MATH  Google Scholar 

  • Batagelj V, Mrvar A (2000) Some analyses of Erdös collaboration graph. Soc Netw 22(2):173–186

    Article  Google Scholar 

  • Bonato A, D’Angelo D.R, Elenberg E.R, Gleich D.F, Hou Y (2016) Mining and Modeling Character Networks. In: International Workshop on Algorithms and Models for the Web-Graph. Springer, pp 100–114

  • Bost X, Gueye S, Labatut V, Larson M, Linarès G, Malinas D, Roth R (2019) Remembering winter was coming. Multimedia Tools Appl 78(24):35373–35399

    Article  Google Scholar 

  • Brenner RE (2007) Understanding Manga and Anime. Greenwood Publishing Group

    Google Scholar 

  • Chaturvedi S, Srivastava S, Daume III H, Dyer C (2016) Modeling evolving relationships between characters in literary novels. In: Proceedings of the 30th AAAI Conference on Artificial Intelligence, pp 2704–2710

  • Chen YH, Choi JD (2016) Character identification on multiparty conversation. In: Proceedings of the 17th Annual Meeting of the Special Interest Group on Discourse and Dialogue, pp 90–100

  • Cohen J (1988) Statistical Power Analysis for the Behavioral Sciences. Lawrence Erlbaum Associations, Mahwah

    MATH  Google Scholar 

  • Curşeu PL, Pluut H (2013) Student groups as learning entities: The effect of group diversity and teamwork quality on groups’ cognitive complexity. Stud High Educ 38(1):87–103

    Article  Google Scholar 

  • Dell’Era C, Verganti R (2010) Collaborative strategies in design-intensive industries: Knowledge diversity and innovation. Long Range Plan 43(1):123–141

    Article  Google Scholar 

  • Elson DK, McKeown K, Dames NJ (2010) Extracting social networks from literary fiction. In: Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics, pp 138–147

  • Estrada E (2011) Combinatorial study of degree assortativity in networks. Phys Rev E 84(4):047101

    Article  Google Scholar 

  • Fortune Business Insights. Market research report. https://www.fortunebusinessinsights.com (Access Date: December 16, 2022)

  • Gessey-Jones T, Connaughton C, Dunbar R, Kenna R, MacCarron P, O’Conchobhair C, Yose J (2020) Narrative structure of A Song of Ice and Fire creates a fictional world with realistic measures of social complexity. Proc Natl Acad Sci 117(46):28582–28588

    Article  MathSciNet  Google Scholar 

  • Gleiser PM (2007) How to become a superhero. J Stat Mech: Theory Exp 2007(9):P09020

    Article  Google Scholar 

  • Gupta S, Yan X, Lerman K (2015) Structural Properties of Ego Networks. In: Proceedings of International Conference on Social Computing, Behavioral-Cultural Modeling, and Prediction. Springer, pp 55–64

  • Holanda AJ, Matias M, Ferreira SM, Benevides GM, Kinouchi O (2019) Character networks and book genre classification. Int J Mod Phys C 30(08):1950058

    Article  MathSciNet  Google Scholar 

  • Holme P, Saramäki J (2012) Temporal networks. Phys Rep 519:97–125

    Article  Google Scholar 

  • Holme P, Saramäki J (2019) Temporal Network Theory. Springer

    Book  MATH  Google Scholar 

  • Ito K (2005) A history of manga in the context of Japanese culture and society. J Pop Cult 38(3):456–475

    Article  Google Scholar 

  • Jung JJ, You E, Park SB (2013) Emotion-based character clustering for managing story-based contents: a cinemetric analysis. Multimedia Tools Appl 65(1):29–45

    Article  Google Scholar 

  • Karsai M, Kivelä M, Pan RK, Kaski K, Kertész J, Barabási AL, Saramäki J (2011) Small but slow world: How network topology and burstiness slow down spreading. Phys Rev E 83(2):025102

    Article  Google Scholar 

  • Kermack WO, McKendrick AG (1927) A contribution to the mathematical theory of epidemics. Proc R Soc Lond Ser A Math Phys Eng Sci 115:700–721

    MATH  Google Scholar 

  • Labatut V (2022) Complex network analysis of a graphic novel: The case of the bande dessinée Thorgal. Adv Complex Syst 25(05n06):2240003

    Article  Google Scholar 

  • Labatut V, Bost X (2019) Extraction and analysis of fictional character networks: A survey. ACM Comput Surv 52(5):1–40

    Article  Google Scholar 

  • Mac Carron P, Kenna R (2012) Universal properties of mythological networks. Europhys Lett 99(2):28002

    Article  Google Scholar 

  • mangazenkan.com. Manga historical circulation ranking. https://www.mangazenkan.com/r/rekidai/total/ (Access Date: January 25, 2021)

  • Masuda N, Holme P (2013) Predicting and controlling infectious disease epidemics using temporal networks. F1000Prime Rep 5:6

    Article  Google Scholar 

  • Masuda N, Lambiotte R (2020) A Guide to Temporal Networks, 2nd edn. World Scientific Publishing, Singapore

    Book  MATH  Google Scholar 

  • Min S, Park J (2019) Modeling narrative structure and dynamics with networks, sentiment analysis, and topic modeling. PLoS ONE 14(12):e0226025

    Article  Google Scholar 

  • Mislove A, Marcon M, Gummadi K.P, Druschel P, Bhattacharjee B (2007) Measurement and analysis of online social networks. In: Proceedings of the 7th ACM SIGCOMM conference on Internet measurement, pp 29–42

  • Moretti F (2011) Network theory, plot analysis. Stanford Lit Lab, 2

  • Murakami H, Nagaoka Y, Kyogoku R (2020) Creating character networks with kinship relations from comics. Int J Serv Knowl Manag 4(1):1–26

    Article  Google Scholar 

  • Mutton P (2004) Inferring and visualizing social networks on internet relay chat. In: Proceedings of the 8th International Conference on Information Visualisation, pp 35–43

  • Nan CJ, Kim KM, Zhang BT (2015) Social network analysis of TV drama characters via deep concept hierarchies. In: Proceedings of the IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining, pp 831–836

  • Newman MEJ (2001) Scientific collaboration networks. I. Network construction and fundamental results. Phys Rev E 64(1):016131

    Article  Google Scholar 

  • Newman MEJ (2002) Assortative mixing in networks. Phys Rev Lett 89(20):208701

    Article  Google Scholar 

  • Newman MEJ (2003) Mixing patterns in networks. Phys Rev E 67(2):026126

    Article  MathSciNet  Google Scholar 

  • Newman MEJ (2018) Networks, 2nd edn. Oxford University Press, Oxford

    Book  MATH  Google Scholar 

  • Newman MEJ, Park J (2003) Why social networks are different from other types of networks. Phys Rev E 68(3):036122

    Article  Google Scholar 

  • Nicosia V, Tang J, Mascolo C, Musolesi M, Russo G, Latora V (2013) Graph metrics for temporal networks. In: Temporal Networks. Springer, pp 15–40

    Chapter  Google Scholar 

  • Park SB, Oh KJ, Jo GS (2012) Social network analysis in a movie using Character-Net. Multimed Tools Appl 59(2):601–627

    Article  Google Scholar 

  • Perc M (2020) Beauty in artistic expressions through the eyes of networks and physics. J R Soc Interface 17(164):20190686

    Article  Google Scholar 

  • Piraveenan M, Prokopenko M, Zomaya A (2008) Local assortativeness in scale-free networks. EPL 84(2):28002

    Article  Google Scholar 

  • Prough J (2010) Shōjo manga in Japan and abroad. In: Johnson-Woods T (ed) Manga: An Anthology of Global and Cultural Perspectives. Continuum, New York, pp 93–106

    Google Scholar 

  • Ramasco JJ, Dorogovtsev SN, Pastor-Satorras R (2004) Self-organization of collaboration networks. Phys Rev E 70(3):036106

    Article  Google Scholar 

  • Read KE (1954) Cultures of the central highlands, New Guinea. Southwest J Anthropol 10(1):1–43

    Article  Google Scholar 

  • Fonseca dos Reis E, Li A, Masuda N (2020) Generative models of simultaneously heavy-tailed distributions of interevent times on nodes and edges. Phys Rev E 102(5):052303

    Article  MathSciNet  Google Scholar 

  • Rieck B, Leitte H (2016) ‘Shall I compare thee to a network?’: Visualizing the topological structure of Shakespeare’s plays. In: Proceedings of the 1st Workshop on Visualization for the Digital Humanities

  • Saramäki J, Kivelä M, Onnela JP, Kaski K, Kertesz J (2007) Generalizations of the clustering coefficient to weighted complex networks. Phys Rev E 75(2):027105

    Article  Google Scholar 

  • Schodt FL (2013) Dreamland Japan: Writings on Modern Manga. Stone Bridge Press, Berkeley

    Google Scholar 

  • Soleymani M, Asghari-Esfeden S, Fu Y, Pantic M (2015) Analysis of EEG signals and facial expressions for continuous emotion detection. IEEE Trans Affect Comput 7(1):17–28

    Article  Google Scholar 

  • Stiller J, Nettle D, Dunbar RI (2003) The small world of Shakespeare’s plays. Hum Nat 14(4):397–408

    Article  Google Scholar 

  • Takahashi M (2014) Opening the closed world of shōjo manga. In: Japanese Visual Culture. Routledge, pp 114–136

    Google Scholar 

  • Tan MS, Ujum EA, Ratnavelu K (2014) A character network study of two Sci-Fi TV series. In: Proceedings of the Frontiers in Physics (AIP Conference Proceedings), 1588, pp 246–251

  • The Mainichi. Japan’s 'One Piece’ manga hits over 510 mil. copies in print, boosts own world record. https://mainichi.jp/english/articles/20220804/p2a/00m/0et/007000c. Accessed 4 Sept 2022

  • Thompson WH, Brantefors P, Fransson P (2017) From static to temporal network theory: Applications to functional brain connectivity. Netw Neurosci 1(2):69–99

    Article  Google Scholar 

  • Toku M (2007) Shojo Manga! Girls’ Comics! A mirror of girls’ dreams. Mechademia 2(1):19–32

    Article  Google Scholar 

  • Tran QD, Hwang D, Jung JJ (2015) Movie summarization using characters network analysis. In: Proceedings of the 7th International Conference on Computational Collective Intelligence. Lecture Notes in Computer Science, 9329, Springer, pp 390–399

  • van Wijk BC, Stam CJ, Daffertshofer A (2010) Comparing brain networks of different size and connectivity density using graph theory. PLoS ONE 5(10):e13701

    Article  Google Scholar 

  • Wang WX, Hu B, Zhou T, Wang BH, Xie YB (2005) Mutual selection model for weighted networks. Phys Rev E 72(4):046140

    Article  Google Scholar 

  • Wang F, Moreno Y, Sun Y (2006) Structure of peer-to-peer social networks. Phys Rev E 73(3):036123

    Article  Google Scholar 

  • Watts DJ, Strogatz SH (1998) Collective dynamics of ‘small-world’ networks. Nature 393:440–442

    Article  MATH  Google Scholar 

  • Weng CY, Chu WT, Wu JL (2007) Movie analysis based on roles’ social network. In: Proceedings of the International Conference on Multimedia and Expo, pp 1403–1406

  • Yeh MC, Wu WP (2014) Clustering faces in movies using an automatically constructed social network. IEEE Multimedia 21(2):22–31

    Article  Google Scholar 

  • Zachary WW (1977) An information flow model for conflict and fission in small groups. J Anthropol Res 33(4):452–473

    Article  Google Scholar 

Download references

Acknowledgements

We thank Juyong Park for valuable discussion.

Funding

K.S. thanks the financial support by the Japan Society for the Promotion of Science (under Grant No. 19K23531). N.M. thanks the financial support by AFOSR European Office (under Grant No. FA9550-19-1-7024), the Japan Science and Technology Agency (JST) Moonshot R &D (under Grant No. JPMJMS2021), the Japan Society for the Promotion of Science (under Grant No. 21H04595 and 23H03414), and the National Science Foundation (under Grant No. 2052720 and 2204936).

Author information

Authors and Affiliations

Authors

Contributions

NM conceived the research; KS and NM designed the research. KS collected and analyzed the data; KS and NM discussed the results and wrote the paper.

Corresponding author

Correspondence to Naoki Masuda.

Ethics declarations

Competing interests

The authors declare no competing financial or non-financial interests.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Additional file 1

. Properties of the character networks and bipartite graphs for 162 manga.

Additional file 2

. Character networks for 162 manga.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Sugishita, K., Masuda, N. Social network analysis of manga: similarities to real-world social networks and trends over decades. Appl Netw Sci 8, 79 (2023). https://doi.org/10.1007/s41109-023-00604-0

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1007/s41109-023-00604-0

Keywords