Skip to main content

A Twitter network and discourse analysis of the Rana Plaza collapse


Ten years after the collapse of the Rana Plaza textile factory in Dhaka, Bangladesh that killed over 1000 factory workers, the event has become a symbol for the desolate working conditions in fast fashion producer countries in the global south. We analyze the global Twitter discourse on this event over a three week window around the collapse date over the years 2013–2022 by a mixture of network-theoretic quantitative and discourse-theoretic qualitative methods. In particular, key communicators and the community structure of the discourse participants are identified using a multilayer network modeling approach and the interpretative patterns of the key communicator’s tweets of all years are analyzed using the sociology of knowledge approach to discourse. This combination of quantitative and qualitative methods reveals that the discourse is separated into three phases: reporting, reprocessing, and commemoration. These phases can be identified by the temporal evolution, network-structural properties, and the contentual analysis of the discourse. After the negotiation of the interpretative framework in the reprocessing phase, subsequent years are characterized by its commemorative repetition as well as resulting demands by different international actor groups despite highly fluctuating participants.


The Rana Plaza factory collapse on April \(24\hbox {th}\) 2013 triggered major international repercussions including a large body of scholarly works on topics such as worker’s safety in the textile industry, international corporate law, or social movements for their improvement [1,2,3,4,5,6]. The online discourse on the Rana Plaza disaster remains largely unexplored although social media communication has become an important part of the general public discourse. Recent social media studies focus on the analysis of online social media platforms such as Twitter by employing quantitative and qualitative methods from various disciplines [7,8,9,10,11,12,13,14,15,16,17,18,19,20,21,22]. In this work, we combine quantitative and qualitative methodology to analyze the international Twitter discourse on the Rana Plaza factory collapse.

The quantitative analysis of this paper builds on tools from complex network science. Complex systems from various disciplines can effectively be modeled by networks recording pairwise interactions of their entities [23, 24]. In particular, they form the basis for the quantitative analysis of social networks [25,26,27]. In recent years, multilayer networks that separately record different types of interactions between the same entities by means of different layers have gained increased attention in many fields including the social sciences [10, 12, 28, 29].

Recent decades have witnessed an increased understanding of complex systems by structural and dynamical network measures. Among the most prominent structural network analysis techniques are centrality measures [30,31,32,33,34,35] and community detection methods [36,37,38,39]. In particular, current research efforts focusing on the analysis of social networks such as Twitter heavily rely on the detection of central users [9, 12, 13, 18] and community structure [8, 12, 16, 17].

Centrality measures identify and rank the most influential entities in complex networks and some of the earliest contributions in the field were made in the context of social networks [30, 31, 40]. These measures rely on the mathematical formulation of a network in terms of an adjacency matrix and well-studied linear algebraic objects such as eigenvectors are capable of revealing substantial insights into complex systems such as the internet [32,33,34,35]. More recently, the class of matrix function-based centrality measures has attracted significant attention [40,41,42,43,44]. One distinguishing feature of these measures is their flexibility to interpolate between the well-established local degree and global eigenvector centrality measures [45]. Recent contributions have generalized eigenvector and matrix function-based centrality measures to multiplex networks, a special class of multilayer networks in which edges across layers are only present between nodes representing the same entity in different layers, cf. e.g., [46,47,48].

Community detection methods, instead, aim at identifying groups of densely connected entities in complex networks that are sparsely connected to other communities [36,37,38,39]. One successful concept to obtain such partitions is the maximization of modularity, which compares a given connectivity structure with an idealized community structure described by some null model [36, 37, 49]. The first null model for multilayer networks has been proposed a number of years ago [50].

The qualitative analysis of this paper is based on Foucault’s discourse term of knowledge and power [51] as well as Berger and Luckmann’s concept of social constructivism [52]. The sociology of knowledge approach to discourse (SKAD) by Keller combines these two theoretical approaches into the methodological framework employed in this paper [53]. We analyze the content structure in terms of the phenomenal structure of the tweets as well as the subject relations in a chronological analysis [53]. Recent studies using discourse analysis approaches have examined political, corporate, and activist communication on other events such as COVID-19 [19], a Spanish journalist scandal [22], or the 2016 US election [14]. Further discourse analyses focus on different communication strategies of actor groups such as NGOs [7], activists [20], or brands [15].

With the theoretical background discussed above, we understand the quantitative analysis as the reconstruction of subject positions, their interactions, and community formation, serving as an example how qualitative and quantitative methodology, e.g., mixed methods complement each other.

In this work, we combine quantitative network-theoretic and qualitative discourse-theoretic methodology for the analysis of the global Twitter discourse on the Rana Plaza factory collapse. We separately analyze three week time windows around the collapse (anniversary) date over ten years, which allows for the detection of the temporal evolution of the discourse.

The remainder of this paper is organized as follows. Section “Data”  describes the data collection procedure. In Section “Methods” , we introduce the quantitative network-theoretic and qualitative discourse-theoretic methods employed in this work. Section “Results”  describes the results of the analysis before we discuss our findings in Section “Discussion”  and summarize in Section “Conclusion” .


The analysis of this study is based on tweet data of the Twitter discourse on the collapse of the Rana Plaza textile factory in Dhaka, Bangladesh on April \(24\hbox {th}\) 2013. The data was obtained from the Twitter API with a tweet text filter for the term “Rana Plaza”. The tweets come in json format containing tweet text, tweet time, tweet ID, user ID, and information on interactions such as retweets, replies, and mentions as mandatory platform-generated fields. Further fields such as user names, locations, descriptions, etc. are mostly optional and provided by the individual users.

We collected data for the ten years from 2013 in which the event took place until 2022. Each year covers the same time window from April \(17\hbox {th}\) to May \(10\hbox {th}\) around the (anniversary) date April \(24\hbox {th}\). We refrained from manual data processing, which leaves the possibility of encountering (in practice very rare) off-topic tweet content.

Our search procedure found a total number of \(101,103\) tweets within the time windows detailed above. The tweet numbers are inhomogeneously distributed across the ten years with \(15,522\) tweets in 2013 and \(22,322\) tweets in 2014 marking the two highest tweet numbers per year. The lowest number of \(4888\) tweets per year was encountered in year 2019. Figure 2 illustrates the total yearly tweet volumes as well as yearly tweet numbers for certain countries relevant to this study either due to their high tweet volumes or their geographical proximity to the Rana Plaza factory. In addition, Fig. 3 provides daily tweet volumes for all years over time intervals of two hours.

Fig. 1
figure 1

Sparsity structure of the intra-layer adjacency matrices of year 2014

The analysis of data from discourse participants from all over the world provides an overview on the international community structure, central users, and central, thus potentially contentually relevant tweets.


This section introduces existing methodology employed to obtain the results of this study. The first illustrations in Section “Tweet statistics”  for exploratory data analysis are obtained by straightforward aggregation of tweet numbers over varying time intervals. The word frequency analysis displayed in Table 2 and discussed in Section “Discussion”  is based on a word combination search with two to four words as well as a single word frequency search in MaxQDA, excluding words such as “a” or “the”. Section “Quantitative multiplex network analysis”  describes the quantitative network-theoretic and Section “Qualitative analysis by the sociology of knowledge approach to discourse”  the qualitative discourse-theoretic methods employed in this work before Section “Interplay of quantitative and qualitative methods”  describes how their interplay leads to the results presented in Section “Results” .

Quantitative multiplex network analysis

For the quantitative analysis of the network structure of Twitter user interactions we rely on a multiplex network model [28, 29]. We follow the modeling approach presented by [10] and identify layers with different types of user interactions: retweet, reply, and mention. We construct a separate multiplex network for each year, which allows a comparison of different structural network properties across the years.

Table 1 List of dimensions and codes identified in the coding process of the SKAD. Dimensions are marked in bold font. The three dimensions Demand for improved working conditions, Background information on the collapse, and Call to action of the responsible have corresponding codes
Table 2 Word (combination) frequencies in all tweets by the ten most central users across the ten years. Numbers in round bracket indicate the absolute number of word occurrences in all years. The bar charts denote yearly word frequencies, i.e., word occurrences in the commemoration phase are averaged over the years 2015 to 2022. Furthermore, each word (combination)’s bar charts are normalized by the respective maximum number of occurrences across the three phases

The basis of our network model is the identification of users with nodes and user interactions with edges between pairs of nodes. We use weighted directed edges as each interaction (retweet, reply, and mention) involves two users clearly distinguishable as initiator and recipient. Edge weights correspond to the numbers of each type of interaction between all pairs of users per year.

To construct the network, we define the node (or vertex) set \(\mathcal {V}\) that consists of all n users participating in the Twitter discourse of the respective year. Furthermore, we define three intra-layer edge sets \(\mathcal {E}^{(1)}, \mathcal {E}^{(2)},\) and \(\mathcal {E}^{(3)}\) denoting the retweet, reply, and mention interactions, respectively. Each of the \(L=3\) layers is represented by a generally nonsymmetric adjacency matrix \(\varvec{A}^{(l)}\in \mathbb {R}^{n \times n}\) with entries

$$\begin{aligned} \varvec{A}^{(l)}_{ij} = {\left\{ \begin{array}{ll} w^{(l)}_{ij} &{} \text {if interaction }l\text { took place between users }i\text { and }j,\\ 0 &{} \text {otherwise,} \end{array}\right. } \end{aligned}$$

for \(l=1, 2, 3,\) where \(w^{(l)}_{ij}\) denotes the number of distinct interactions l initiated by user i and received by user j, e.g., if user i retweets seven tweets by user j we have \(\varvec{A}^{(1)}_{ij} = 7\). The final ingredient that completes our multiplex network \(\mathcal {G} = (\mathcal {V}, \mathcal {E}^{(1)}, \mathcal {E}^{(2)}, \mathcal {E}^{(3)}, \tilde{\mathcal {E}})\) is the inter-layer edge set \(\tilde{\mathcal {E}}\). It represents interactions of nodes from different layers and in this work we use diagonal all-to-all coupling, i.e., each node is connected to itself in both other layers with weight 1.

For the quantitative analysis, we use the linear algebraic representation of the multiplex network \(\mathcal {G}\) in terms of the supra-adjacency matrix \(\varvec{A}\in \mathbb {R}^{nL \times nL}\) defined as

$$\begin{aligned} \varvec{A} = \varvec{A}_{\textrm{intra}} + \omega \varvec{A}_{\textrm{inter}} = \begin{bmatrix} \varvec{A}^{(1)} &{} \varvec{0} &{} \varvec{0}\\ \varvec{0} &{} \varvec{A}^{(2)} &{} \varvec{0}\\ \varvec{0} &{} \varvec{0} &{} \varvec{A}^{(3)} \end{bmatrix} + \omega \begin{bmatrix} \varvec{0} &{} \varvec{I} &{} \varvec{I}\\ \varvec{I} &{} \varvec{0} &{} \varvec{I}\\ \varvec{I} &{} \varvec{I} &{} \varvec{0} \end{bmatrix}, \end{aligned}$$

where \(\omega \in \mathbb {R}\) denotes a scalar layer-coupling parameter, \(\varvec{0}\in \mathbb {R}^{n \times n}\) the matrix of all zeros, and \(\varvec{I}\in \mathbb {R}^{n \times n}\) the identity matrix. Throughout our analysis, we choose the layer-coupling parameter \(\omega =1\). As an illustrative example, Fig. 1 shows the intra-layer sparsity structure of the three layers for the year 2014. The blank lower blocks in each layer indicate a set of users that receives user interaction without initiating it. The adjacency matrices for all ten years are publicly available in Matlab and python formats [54].

Matrix function-based centrality measures

The identification and ranking of influential individuals in networked complex systems is an important area of research in network science. Prominent examples of well-established centrality measures have been developed over past decades [30,31,32, 34] with PageRank at the heart of Google’s search engine being an example of a centrality measure impacting everyday life [33, 35].

A more recently studied class of centrality measures is the class of matrix function-based centrality measures [40,41,42,43,44]. These can be interpreted as walk-based measures taking all possible connections between all pairs of nodes of the network into account. Matrix function-based centrality measures possess the flexibility to interpolate between local degree and global eigenvector centrality [45] by means of a parameter called \(\alpha\) or \(\beta\) that controls how strongly longer walks are weighted.

In this work, we employ a recently introduced generalization of matrix function-based centrality measures to multiplex networks [47, 48] for the identification of key actors in the Twitter discourse under investigation. We restrict ourselves to the multiplex version of receiver total communicability [43] defined as

$$\begin{aligned} \textrm{TC}(i,\beta ) = \varvec{e}_i^T \exp (\beta \varvec{A}^T) \varvec{1} = \varvec{e}_i^T \left( \sum _{p=0}^\infty \frac{\beta ^p}{p!} (\varvec{A}^T)^p \right) \varvec{1}, \end{aligned}$$

and receiver Katz centrality [40] defined as

$$\begin{aligned} \textrm{KC}(i,\alpha ) = \varvec{e}_i^T (\varvec{I} - \alpha \varvec{A}^T)^{-1} \varvec{1} = \varvec{e}_i^T \left( \sum _{p=0}^\infty \alpha ^p (\varvec{A}^T)^p \right) \varvec{1}, \end{aligned}$$

where \(\beta > 0\) and \(0< \alpha < 1/\lambda _{\textrm{max}}\) with \(\lambda _{\textrm{max}}\) the largest eigenvalue of \(\varvec{A}\) denote the trade-off parameters mentioned above. Furthermore, \(\varvec{e}_i\in \mathbb {R}^{nL}\) denotes the ith canonical basis vector, \(\varvec{1}\in \mathbb {R}^{nL}\) the vector of all ones, and \(\exp (\beta \varvec{A}^T)\) and \((\varvec{I} - \alpha \varvec{A}^T)^{-1}\) defined by the above matrix power series denote the matrix exponential and resolvent function, respectively [55].

In terms of the Twitter interaction networks at hand, these measures can be interpreted as simulating the effectivity of the distribution of information along existing channels in the network. By the choice of total communicability and Katz centrality, (2) and (3) measure the impact that all information distributed by node i has on the entirety of the network. The choice of receiver centralities leads to high rankings of users being retweeted, being replied to, and being mentioned and causes the transpose of \(\varvec{A}\) in the above definitions.

For the computation of the vectors \(\exp (\beta \varvec{A}^T) \varvec{1}\) and \((\varvec{I} - \alpha \varvec{A}^T)^{-1} \varvec{1}\), which contain the measures (2) and (3) for all nodes \(i=1, \dots , nL\), we employ highly efficient methods from numerical linear algebra, which are described in detail in [48] and implemented in Matlab [56] and python [57]. The centrality results of this paper can be reproduced using publicly available codes [54].

The quantities ultimately used in our analysis and displayed in Fig. 4 are one TC and one KC centrality value for each Twitter user per year. These are easily obtained from (2) and (3) by marginal node centralities, i.e., the addition of each node’s centralities from the three layers [46,47,48]. Throughout our computations, we use the parameters \(\alpha =0.5/\lambda _{\textrm{max}}\) and \(\beta =2/\lambda _{\textrm{max}}\).

The Louvain community detection method

A second structural network feature of tremendous interest to network science is community detection [36,37,38,39]. Similar to the small-world effect, scale-freeness, or transitivity, the presence of a community structure is viewed as a universal property of networks originating from various disciplines including social networks [23, 24].

A community in a network is generally understood as a subset of nodes that is strongly connected internally but only sparsely connected to the remainder of the network. One widely acknowledged concept to quantify this property is network modularity [36, 37]. It compares a given distribution of edges in an observed network with an expected distribution according to some null model and defines a scalar measure Q of the quality of a given partition of the node set. Optimal partitions can hence be obtained by computational maximization of the modularity score Q.

One successful heuristic computational method for modularity maximization is the “Louvain method” [49]. It initially assigns every node its own community and proceeds by iteratively merging pairs of communities that lead to a maximal increase in the modularity Q. The method terminates if no further improvement of the partition is possible.

A generalization of the Louvain method to multilayer networks was proposed by [50]. It defines a multilayer null model that in our notation leads to the multilayer modularity function

$$\begin{aligned} Q = \frac{1}{2\mu } \sum _{ijlr} \left[ \left( \varvec{A}_{ij}^{(l)} - \gamma _l \frac{k_{il}k_{jl}}{2m_l} \right) \delta _{lr} + \omega \delta _{ij} \right] \delta (g_{il},g_{jr}) \end{aligned}$$

for a given node partition g, where \(g_{il}\) denotes the community affiliation of node i in layer l, \(\gamma _l\) denotes a resolution parameter, and \(\delta _{ij}, \delta _{lr},\) and \(\delta (g_{il},g_{jr})\) denote Kronecker deltas. Furthermore, the quantities \(2\mu = \sum _{ijl} \varvec{A}_{ij}^{(l)} + 6n\omega , k_{il}=\sum _j \varvec{A}_{ij}^{(l)}, k_{jl}=\sum _i \varvec{A}_{ij}^{(l)}\), and \(m_l = \sum _{ij} \varvec{A}_{ij}^{(l)}\) denote different strengths of the multiplex network.

We used the Matlab implementation [58] for obtaining the community structures reported in Section “Multiplex network analysis”  and illustrated in Fig. 4. The results can be reproduced with our code release [54] together with [58]. The output of the method consists of three community affiliations for each user corresponding to the three layers. Due to the inter-layer connectivity, the three affiliations mostly coincide. In case of deviations, a majority vote is performed over the layers, i.e., the most frequently encountered community determines the user’s community affiliation. Throughout our computations, we use the parameter \(\gamma _l=1\).

Qualitative analysis by the sociology of knowledge approach to discourse

The selection of tweets considered in the qualitative content analysis builds on the centrality measures introduced in Section “Matrix function-based centrality measures”.  as we consider all tweets by the union of the ten most central users identified by both centrality measures per year.

For the qualitative analysis of these tweets, we use the sociology of knowledge approach to discourse (SKAD), which is a methodological framework focusing on questions on public discourse arenas and knowledge power structures [53]. Building on the power and knowledge correlation described by Foucault [51] and the power structure in knowledge societies [52], the sociology of knowledge approach offers the opportunity to analyze knowledge politics. Foucault understands discourses as social practices that imply controversies and discoursive fights around “problemetizations” [51, 59]. In order to analyze discoursive formations such as the formation of concepts or speaker positions, concrete data, such as in our case tweets, have to be analyzed bottom up [53, 60].

In the qualitative analysis of the Twitter discourse, we focus on the phenomenal structure as a content-structuring framework. The anaysis of content structure allows us to determine the change of content in the tweets over the years and which dimensions of the event are communicated. Following the SKAD, we understand phenomenal structures as the construction of an issue and its dimensions such as the definition, causal relations, corresponding responsibilities, normative judgments, and possible courses of action [53]. As described by Berger and Luckmann [52], these dimensions are constructed from individual subject positions that do “not describe any essential qualities of a discourse topic” [53] but rather a communicative construction of reality [61].

The basis of the bottom-up analysis in the SKAD is a coding process, which builds on the methodology of the grounded theory devised by [62]. In the coding phase, the raw data is firstly broken down into codes to prepare it for the analysis. Each code is assigned to a specific passage in the data and multiple codes for a given passage are possible. In this work, each tweet is coded with at least one code, e.g., “8 years ago, the Rana Plaza collapse showed that voluntary agreements don’t keep workers safe. Then why haven’t brands comit[ted]...” is coded with both Memorial and Security standards/Worker’s rights. In the coding process, we used in-vivo codes, i.e., codes taken directly from the data as well as contrasting and association techniques to identify the initial codes. For the coding process and the schematic design of Fig. 6, we used MaxQDA 2020.

Secondly, the codes are revised for the dimensions introduced above as well as relationships between them, i.e., codes can be merged or diversified. To carve out the content structure, the interpretative analysis process is oriented towards the problem dimensions of the phenomenal structure, [53] describes.

The resulting dimensions of our study alongside corresponding codes (where applicable) are summarized in Table 1. Each code and dimension is accompanied by a short description as well as an example tweet. The links and relations of the dimensions identified in this work are depicted in Fig. 6. We particularly focus on the communicators’ subject positions in the discourse, which can be sub-divided into the user’s nationalities, which are summarized in Fig. 2, and their actor groups. We identified the five actor groups “Activist”, “Politician”, “News agency”, “Anonymous user”, and “Brand”. Due to the frequent occurrence of the actor group “Activist”, this group was sub-divided into “NGO” (Non-Governmental Organization), “Writer”, and “Celebrity”.

Interplay of quantitative and qualitative methods

The starting point of our analysis is the abstraction of Twitter interactions into multiplex networks as described in Section “Quantitative multiplex network analysis”  followed by the centrality analysis presented in Section “Matrix function-based centrality measures”. All tweets of the union of the ten most central users of both centrality measures per year present the basis for the qualitative analysis by the SKAD discussed in Section “Qualitative analysis by the sociology of knowledge approach to discourse” . During the coding process, each central user is manually reviewed and assigned to an actor group. These actor groups in turn serve as means to visualize the community detection results obtained by the multilayer Louvain method described in Section “The Louvain community detection method” As most communities are found to gather around one distinctly most central user we identify each community with the actor group affiliation of that most central user. This allows to highlight the differing composition and heterogeneous interconnectedness of communities illustrated in Fig. 4 through the years.


Tweet statistics

We start by reporting some basic numbers on the tweet volume obtained by our search queries described in Section “Data” . Figure 2 shows that the total number of tweets per year varies between 4900 in year 2019 and 22,300 in year 2014. Furthermore, we selected eight countries relevant either due to their high tweet volume or their geographical proximity to the Rana Plaza factory and report their tweet volumes in Fig. 2. We remark that the user location is a user-specified free text field that can not be expected to provide complete information. While the observed tweet volume in some countries behaves approximately proportionally to the total volume, we observe an increase in tweet volumes from Spain starting from 2017. Furthermore, Indonesia shows a remarkably high tweet volume in the year of the collapse, 2013, followed by a rapid decline and very low tweet volumes in subsequent years.

Fig. 2
figure 2

Yearly tweet volumes for selected countries and in total. User-specified locations were filtered by up to two search terms indicated in the legend. The total number corresponds to all tweets matching the search term “Rana Plaza” in the given time period, cf. Section “Data” 

Figure 3 additionally shows the distribution of the total tweet volume per year over the respective time interval from April \(17\hbox {th}\) to May \(10\hbox {th}\). Ordinate values indicate the number of tweets per two hour time interval. Naturally, the 2013 discourse starts with the collapse date April \(24\hbox {th}\). Interestingly, high peak tweet numbers are observed only three days later. In subsequent years, the discourse temporally closely centers around the anniversary date April 24th. The only distinct deviation is April 27th 2017, which marks a tweet by “EmmaWatson” causing increased user interaction and hence turns the overall downward trend of tweet volumes established between 2014 and 2016.

Fig. 3
figure 3

Tweet volume distribution over two hour time intervals of all considered time periods. Abscissa values indicate the time interval from April 17th to May \(10\hbox {th}\) for the respective year

In terms of the three types of interactions retweets, replies, and mentions, we observe across all years that retweets and mentions occur frequently while replies are rare. The retweet and mention layers in the multiplex framework described in Section “Quantitative multiplex network analysis” thus allow for an analysis of the encountered degree distributions: across all ten years and in line with results from different Twitter discourses on exceptional events [10], we observe a power-law degree distribution with an exponent of approximately \(-1.75\).

Multiplex network analysis

We separately apply multiplex matrix function-based centrality measures and the multilayer Louvain method described in Sects. “Matrix function-based centrality measures” and “The Louvain community detection method” to the user interaction networks of the ten different years, which allows a structural comparison of the recorded Twitter discourse across the years.

The tables in Fig. 4 show the ten most central users of each year according to receiver total communicability (TC) and receiver Katz centrality (KC). Albeit differences in the numerical values and slight deviations in rankings the results of both measures show a broad overlap. As is customary in centrality analysis, we observe a localization of both measures [63], i.e., high centrality values for a small portion of the nodes as well as small and similar values for the majority of nodes. The union of the ten most central nodes of both measures per year forms the basis of the qualitative analysis of this work.

Fig. 4
figure 4figure 4

Results of the quantitative multilayer network analysis. The left plots show all nodes (users) belonging to the ten largest communities of the respective year. Directed edges (interactions via retweets, replies, or mentions) within communities are marked black and edges across communities are marked gray. Nodes sizes are proportional to the receiver total communicability (TC) centrality measure. Color codes in the plots indicate the actor group of the most central node in each community. Each community is labeled by the actor group of its most central node as well as the most central node’s user name in round brackets. The tables show the top ten users of the respective year according to receiver total communicability (TC) and receiver Katz centrality (KC). Color codes in the tables identify the users with their community in the plot. Blank spots in the centrality tables indicate that the user does not belong to any of the ten largest communities

The multilayer Louvain method additionally provides a partition of each year’s set of users into densely connected communities of strongly varying sizes. For all years, we obtain a large number of communities (typically around \(1000\)). However, only around 50 of these communities consist of ten or more users. As we encounter many central nodes in the largest communities, we focus our attention on the ten largest communities per year. The attempt to identify recurring communities across the years proved difficult. One reason for this is the heterogeneous user base across the years: with 62,500, the sum of overall unique users over the ten years was only slightly below 74,000, the number of unique users summed up for each year separately. Exceptions of highly central recurring users actively forming the discourse across the years can be found in the centrality results. Furthermore, the assignment of communities to user nationalities proved difficult indicating a significant degree of internationality in the discourse.

We illustrate the ten largest communities per year in the graphics in Fig. 4. Edges within communities are marked black and edges across communities are marked gray. The connectivity structure within large communities often consists of one or few highly central users being reacted to by many non-central users. The amount of inter-community connectivity greatly varies across the years, with partial explanations being revealed by the qualitative content analysis in Section “Content analysis” . Furthermore, the color coding of the nodes stems from the user groups “Activist”, “Politician”, “News agency”, “Anonymous user”, and “Brand” identified in the qualitative analysis of the tweets of the ten most central users described in Section “Qualitative analysis by the sociology of knowledge approach to discourse” . We use different color shades per user group to allow the identification of each community with its most central user in the centrality results in the respective year. In an attempt to disclose the cohesion of structural network properties and our contentual analysis, we assign each community with a color code corresponding to the actor group of the most central user of that community. The communities’ actor groups as well as the most central nodes’ user names in round brackets are depicted in the plots. The same color codes in the tables in Fig. 4 encode the affiliation of the ten most central users to the ten largest communities. Blank spots in the centrality tables indicate that the user does not belong to any of the ten largest communities.

The prevalence of the five actor groups in the ten largest communities undergoes a significant temporal evolution. Unsurprisingly, the year 2013 is dominated by news agencies spreading information on the collapse. Subsequent years witness their diminution and the emergence of large communication sub-networks around NGOs with 2020 marking the record of eight out of ten communities classified as NGO. A noteworthy development in the years 2020 and 2022 is that anonymous, i.e., private Twitter users are able to generate enough interaction to gather the largest community of that year around them; a role that was previously filled by news agencies, NGOs, or celebrities.

Content analysis

The first result of the qualitative analysis by the sociology of knowledge approach to discourse (SKAD) described in Section “Qualitative analysis by the sociology of knowledge approach to discourse”  is a set of codes obtained in the coding process of all tweets by the ten most central users per year. Table 1 lists the identified dimensions alongside codes obtained in the coding process as well as a short description and one example tweet. Furthermore, Fig. 5 shows the relative prevalence of the six dimensions per year. Finally, Fig. 6 depicts the contentual relationships between the dimensions and codes identified in the coding step of the SKAD. It schematically summarizes the communicative pattern that the anniversary is taken as an opportunity to first raise awareness with memorials and then provide background information and describe working conditions, thus justify the demand for action.

Fig. 5
figure 5

Relative occurrences of the dimensions identified by the SKAD by year

Fig. 6
figure 6

Schematic illustration of contentual relations between the dimensions and codes as a result of the coding process. Codes are marked gray, the five dark blue dimensions induce the light blue dimension Ads for own causes

A detailed analysis of the individual years reveals a strong heterogeneity across the years. In 2013, we detect three main codes Security standards/worker’s rights, Death toll, and News about collapse belonging to the two dimensions Demand for improved working conditions and Background information on the collapse. The codes Security standards/worker’s rights and News about collapse are the most prevalent topics as both cover over \(50\%\) of their respective code occurrences among all years and are often published by Asian actors.

At the first anniversary in the year 2014, we observe a peak occurrence of eight codes and in comparison to other years a wide range of topics being discussed intensively. Especially compared to the news feed style of the discourse in 2013 we find more in-depth and complex content such as background analyses, individual stories, or reports about what has changed since the collapse. Figure 5 shows that while the dimensions Demand for improved working conditions and Background information on the collapse peak in 2013, all other dimensions start to emerge in 2014. Analysis of industrial practice as well as Survivor’s/victim’s relatives’ stories are mainly published in 2014 and 2015 indicating that in 2013 the news were too recent for such discussions even two weeks after the incidentFootnote 1. Last but not least, the dimension Memorial indicating the emergence of commemorative practices establishes itself in 2014 as the most frequent code in all and the most frequent dimension in almost all years following 2014.

The years 2015 to 2022 are characterized by the repetition of the interpretative dimensions established in 2014 with some variations of major topics emerging for individual years. In 2015, for example, we observe the peak of the code Monetary compensation/payment as the “Rana Plaza Fund” is publicly raising money in order to pay compensation to the victims. In total, the fund raised 30 mio. USD for Rana Plaza victims with 1.1 mio. USD donated by the brand “benetton” [64, 65]. This brand’s role is special in the sense that it took an active role in the Twitter discourse: 2015, “benetton” is the only brand appearing as a top ten central user across all years.

In 2016, the total number of tweets decreases and besides Memorial the main topics are the responsibility of Manufacturers and brands as well as the Analysis of industrial practice. In 2017, the total number of tweets slightly increases again and Consumer orientation reaches its peak – caused by the retweet of the Guardian article “Sustainable style: will Gen Z help the fashion industry clean up its act?” by “EmmaWatson” on April \(27\hbox {th}\) 2017. This retweet not only leads to the distinct peak in tweet volume that can be observed in Fig. 3; it also establishes “EmmaWatson” as the most central user in year 2017 and sets the agenda of topics discussed by other central users.

In the context of the COVID-19 pandemic and the withdrawal of the Bangladesh Accord agreement [66], the years 2020 and 2021 witness another rise of the code Analysis of industrial practice. In 2022, Survivor’s/victim’s relatives’ stories suddenly rise to their peak – a thematic emphasis that can be traced back to the activists behind the user “OhSoEthical”. Albeit frequent repetition of the code by that user, the community detection result in Fig. 4 reveals that only a relatively small community is reached by these tweets.

The codes Memorial and Death toll are identified as an incentive to spread further information on the background of the incident. Quantitatively, this trigger constitutes up to \(40\%\) of all codes, making it an important starting point in the Twitter discourse. The survivor stories and background information lead to a constant call for action to change the described circumstances in the fashion industry.


The results presented in the previous section suggest that the international Twitter discourse on the Rana Plaza collapse is composed of three phases: reporting, reprocessing, and commemoration. These phases follow the temporal unfolding of the discourse and can be identified from both a contentual qualitative as well as a structural quantitative viewpoint.

The first phase, reporting, starts with the collapse date in 2013 and is mainly formed by Asian and Western news agencies informing the public about the accident and its extent, e.g., the death toll. Furthermore, a comparatively small but active community of NGOs explicitly addresses manufacturers and brands demanding better working conditions of textile workers in Bangladesh. This is reflected by the demand for an “acuerdo de seguridad” (engl.: safety agreement) that a group of Spanish activists directs towards brands, cf. Table 2. Additionally, this is underpinned by the three main codes Security standards/worker’s rights, Death toll, and News about collapse obtained in the coding phase of the SKAD as well as by the prevalence of news agencies as most and NGOs as second most frequent actor group in the centrality analysis. The communities emerging around central users show remarkably little interaction indicating the formation of communication bubbles around different news agencies and NGOs. In terms of the involved user’s nationalities, a relatively large portion of users with a small physical proximity to the Rana Plaza factory such as Bangladesh or Indonesia shape the 2013 discourse.

The second phase, reprocessing, takes place around the first anniversary of the collapse in 2014. The overall tweet volume, the number of involved users, as well as the structural and contentual complexity of the discourse all reach their peaks. The active discussion of various social, political, and economical aspects of the collapse reflects the societal negotiation process establishing a collective interpretative framework of the event. This variety of topics is, for instance, reflected in the word frequency analysis in Table 2 showing that the most important word (combination) frequencies reach their peak in the reprocessing phase and prevail in the consecutive commemoration phase. In particular, the first anniversary witnesses the rise of the dimension Memorial as most frequent dimension, which establishes a pattern repeated in subsequent years. These simple commemorations serve as a ramp for other dimensions; Survivor’s/victim’s relatives’ stories and Background information on the collapse are taken as a peg to address responsibilities and to demand consequences. Most actor groups such as NGOs or writers accompany their contribution by Ads for own causes/petitions.

Regarding speaker positions, the centrality analysis reveals that in 2014 mainly NGOs seize control of the Twitter discourse by lobbying for consequences and changes in the fast fashion industry. Furthermore, the active dynamics of the 2014 discussion are reflected in the high degree of interconnectivity across different (relatively large) communities and actor groups. The prominence of the widely used hashtag #insideout is conjectured to additionally promote interactions between heterogeneous user groups. While the tweet volume of countries with close spatial proximity to the Rana Plaza factory decreases, 2014 witnesses a distinct rise of discourse participation by Western countries – suggesting that the interpretative momentum, i.e., the discursive power is over-proportionally seized by Western actors.

The third phase, commemoration, spans all subsequently investigated years from 2015 to 2022. It is mainly shaped by the repetition of the interpretative patterns negotiated in the reprocessing phase in 2014 and is complemented by uprising discussions on contemporary topics, which are discussed in detail in Section “Content analysis” . While, overall, we observe high fluctuations in the user base of the Rana Plaza remembrance community over the years, cf. Section “Multiplex network analysis” , the active NGO user group that formed in 2014 continues to dominate the discourse and inter-community connectivity tends to take place between NGO communities. More specifically, we detect few interconnected key actors such as “Fash_Rev” and “cleanclothes” re-appearing in the top ten most central users almost every year. While we focus on the Twitter discourse in this work, “Fash_Rev” (Fashion Revolution) also organizes offline activities such as the yearly Fashion Revolution weekFootnote 2 around the Rana Plaza anniversary, which in turn influences the online discourse by stimulating discussions on these offline activities.


In this work, we studied the global Twitter discourse on the Rana Plaza factory collapse by a mixture of established quantitative network-theoretic and qualitative discourse-theoretic methods. We analyzed structural network properties and the temporal evolution of interpretative patterns of the collapse over the years 2013 to 2022 and found a division of the discourse into three phases: reporting, reprocessing, and commemoration. Our analysis reveals, which actors and communities worldwide interact, build, and rebuild the Rana Plaza discourse on Twitter every year.

Further analysis could focus on how our findings relate to existing theories on key events [67] and digital memory culture [68, 69] or how the Twitter discourse is embedded into the general discourse across various communication channels. Additionally, further structural and dynamical multiplex network properties could be examined. For instance, it would be interesting to study centralities of the full ten year time horizon, which may be possible by a multilayer modeling approach taking a second aspect representing time into account. Studying the applicability of matrix function-based centralities to this type of network architecture would be an interesting road for future research. Furthermore, preliminary investigations of the years 2013 and 2014 indicate a strong multilayer core–periphery structure [70].

Availability of data and materials

The networks generated in this study are publicly available in Matlab and python format under The same repository contains publicly available codes to reproduce all results obtained by the computational methods described in “Matrix function-based centrality measures” Sects.  and “The Louvain community detection method” .


  1. We considered a time horizon of one week before and approximately two weeks after the anniversary date of each year, cf. Section “Data” 

  2. cf. e.g.,



Sociology of knowledge approach to discourse


Non-governmental organization


Application programming interface


Total communicability


Katz centrality


  1. Reinecke J, Donaghey J (2015) After Rana Plaza: building coalitional power for labour rights between unions and (consumption-based) social movement organisations. Organization 22(5):720–740.

    Article  Google Scholar 

  2. Siddiqui J, Uddin S (2016) Human rights disasters, corporate accountability and the state: lessons learned from Rana Plaza. Account Audit Account J.

    Article  Google Scholar 

  3. Barua U, Ansary MA (2017) Workplace safety in Bangladesh ready-made garment sector: 3 years after the Rana Plaza collapse. Int J Occup Saf Ergon 23(4):578–583.

    Article  Google Scholar 

  4. Chowdhury R (2017) The Rana Plaza disaster and the complicit behavior of elite NGOs. Organization 24(6):938–949.

    Article  Google Scholar 

  5. Bair J, Anner M, Blasi J (2020) The political economy of private and public regulation in post-Rana Plaza Bangladesh. ILR Rev 73(4):969–994.

    Article  Google Scholar 

  6. Rahman S, Rahman KM (2020) Multi-actor initiatives after Rana Plaza: factory managers’ views. Dev Change 51(5):1331–1359.

    Article  Google Scholar 

  7. Waters RD, Jamal JY (2011) Tweet, tweet, tweet: a content analysis of nonprofit organizations’ Twitter updates. Public Relat Rev 37(3):321–324.

    Article  Google Scholar 

  8. Papadopoulos S, Kompatsiaris Y, Vakali A, Spyridonos P (2012) Community detection in social media. Data Min Knowl Discov 24(3):515–554.

    Article  Google Scholar 

  9. Ch’ng E (2015) Local interactions and the emergence of a twitter small-world network. arXiv preprint arxiv:1508.03594

  10. Omodei E, De Domenico M, Arenas A (2015) Characterizing interactions in online social networks during exceptional events. Front Phys 3:59.

    Article  Google Scholar 

  11. Piña-García CA, Gershenson Carlos, Siqueiros-García J Mario (2016) Towards a standard sampling methodology on online social networks: collecting global trends on Twitter. Appl Netw Sci 1(1):1–19.

    Article  Google Scholar 

  12. Dickison ME, Magnani M, Rossi L (2016) Multilayer social networks. Cambridge University Press, Cambridge.

    Book  Google Scholar 

  13. Riquelme F, González-Cantergiani P (2016) Measuring user influence on Twitter: a survey. Inf Process Manag 52(5):949–975.

    Article  Google Scholar 

  14. Yaqub U, Chun SA, Atluri V, Vaidya J (2017) Analysis of political discourse on Twitter in the context of the 2016 US presidential elections. Gov Inf Q 34(4):613–626.

    Article  Google Scholar 

  15. Wu DD, Li C (2018) Emotional branding on social media: a cross-cultural discourse analysis of global brands on Twitter and Weibo. In: Intercultural communication in Asia: education, language and values, pp 225–240 .

  16. Hanteer O, Rossi L (2019) An innovative way to model Twitter topic-driven interactions using multiplex networks. Front Big Data.

    Article  Google Scholar 

  17. Rehman AU, Jiang A, Rehman A, Paul A, Sadiq MT et al (2020) Identification and role of opinion leaders in information diffusion for online discussion network. J Ambient Intell Humaniz Comput.

    Article  Google Scholar 

  18. Sadri AM, Hasan S, Ukkusuri SV, Cebrian M (2020) Exploring network properties of social media interactions and activities during Hurricane Sandy. Transp Res Interdiscip Perspect 6:100143.

    Article  Google Scholar 

  19. Wicke P, Bolognesi MM (2021) Covid-19 discourse on Twitter: how the topics, sentiments, subjectivity, and figurative frames changed over time. Front Commun 6:651997.

    Article  Google Scholar 

  20. Shea CS, Jiang Y, Leung WL (2022) David versus Goliath: transnational grassroots outreach and empirical evidence from the #HongKongProtests Twitter network. Rev Commun 22(3):193–212.

    Article  Google Scholar 

  21. Wiggins B (2022) ‘Nothing Can Stop What’s Coming’: an analysis of the conspiracy theory discourse on 4chan’s/Pol board. Discourse Soc.

    Article  Google Scholar 

  22. Reguero-Sanz I, Berdón-Prieto P, Herrero-Izquierdo J (2023) Journalism in democracy: a discourse analysis of Twitter posts on the Ferrerasgate scandal. Media Commun.

    Article  Google Scholar 

  23. Watts DJ, Strogatz SH (1998) Collective dynamics of ‘small-world’ networks. Nature 393(6684):440–442.

    Article  MATH  Google Scholar 

  24. Newman ME (2003) The structure and function of complex networks. SIAM Rev 45(2):167–256.

    Article  MathSciNet  MATH  Google Scholar 

  25. Scott J (2012) What is social network analysis? Bloomsbury Academic, UK

    Book  Google Scholar 

  26. Brandes U, Freeman LC, Wagner D (2013) Social networks. In: Handbook of graph drawing and visualization, Chapman & Hall, UK, pp 805–839

  27. Borgatti SP, Everett MG, Johnson JC (2018) Analyzing social networks. Sage, Thousand Oaks

    Google Scholar 

  28. Kivelä M, Arenas A, Barthélemy M, Gleeson JP, Moreno Y, Porter MA (2014) Multilayer networks. J Complex Netw 2(3):203–271.

    Article  Google Scholar 

  29. Boccaletti S, Bianconi G, Criado R, Del Genio CI, Gómez-Gardenes J, Romance M, Sendina-Nadal I, Wang Z, Zanin M (2014) The structure and dynamics of multilayer networks. Phys Rep 544(1):1–122.

    Article  MathSciNet  Google Scholar 

  30. Freeman LC (1977) A set of measures of centrality based on betweenness. Sociometry 40(1):35–41.

    Article  Google Scholar 

  31. Freeman LC (1978) Centrality in social networks conceptual clarification. Soc Netw 1(3):215–239.

    Article  Google Scholar 

  32. Bonacich P (1987) Power and centrality: a family of measures. Am J Sociol 92(5):1170–1182.

    Article  Google Scholar 

  33. Brin S, Page L (1998) The anatomy of a large-scale hypertextual web search engine. Comput Netw ISDN Syst 30(1–7):107–117.

    Article  Google Scholar 

  34. Kleinberg JM (1999) Authoritative sources in a hyperlinked environment. J ACM (JACM) 46(5):604–632.

    Article  MathSciNet  MATH  Google Scholar 

  35. Page L, Brin S, Motwani R, Winograd T (1999) The PageRank citation ranking: bringing order to the web. Technical report, Stanford InfoLab

  36. Girvan M, Newman ME (2002) Community structure in social and biological networks. Proc Natl Acad Sci 99(12):7821–7826.

    Article  MathSciNet  MATH  Google Scholar 

  37. Newman ME (2006) Modularity and community structure in networks. Proc Natl Acad Sci 103(23):8577–8582.

    Article  Google Scholar 

  38. Von Luxburg U (2007) A tutorial on spectral clustering. Stat Comput 17(4):395–416.

    Article  MathSciNet  Google Scholar 

  39. Fortunato S (2010) Community detection in graphs. Phys Rep 486(3–5):75–174.

    Article  MathSciNet  Google Scholar 

  40. Katz L (1953) A new status index derived from sociometric analysis. Psychometrika 18(1):39–43.

    Article  MATH  Google Scholar 

  41. Estrada E, Rodriguez-Velazquez JA (2005) Subgraph centrality in complex networks. Phys Rev E 71(5):056103.

    Article  MathSciNet  Google Scholar 

  42. Estrada E, Higham DJ (2010) Network properties revealed through matrix functions. SIAM Rev 52(4):696–714.

    Article  MathSciNet  MATH  Google Scholar 

  43. Benzi M, Klymko C (2013) Total communicability as a centrality measure. J Complex Netw 1(2):124–149.

    Article  Google Scholar 

  44. Benzi M, Boito P (2020) Matrix functions in network analysis. GAMM-Mitteilungen 43(3):202000012.

    Article  MathSciNet  Google Scholar 

  45. Benzi M, Klymko C (2015) On the limiting behavior of parameter-dependent network centrality measures. SIAM J Matrix Anal Appl 36(2):686–706.

    Article  MathSciNet  MATH  Google Scholar 

  46. Taylor D, Myers SA, Clauset A, Porter MA, Mucha PJ (2017) Eigenvector-based centrality measures for temporal networks. Multiscale Model Simul 15(1):537–574.

    Article  MathSciNet  MATH  Google Scholar 

  47. Bergermann K, Stoll M (2021) Orientations and matrix function-based centralities in multiplex network analysis of urban public transport. Appl Netw Sci 6(1):1–33.

    Article  Google Scholar 

  48. Bergermann K, Stoll M (2022) Fast computation of matrix function-based centrality measures for layer-coupled multiplex networks. Phys Rev E 105(3):034305.

    Article  MathSciNet  Google Scholar 

  49. Blondel VD, Guillaume J-L, Lambiotte R, Lefebvre E (2008) Fast unfolding of communities in large networks. J Stat Mech: Theory Exp 2008(10):10008.

    Article  MATH  Google Scholar 

  50. Mucha PJ, Richardson T, Macon K, Porter MA, Onnela J-P (2010) Community structure in time-dependent, multiscale, and multiplex networks. Science 328(5980):876–878.

    Article  MathSciNet  MATH  Google Scholar 

  51. Foucault M (1970) The archaeology of knowledge. Soc Sci Inf 9(1):175–185.

    Article  Google Scholar 

  52. Berger PL, Luckmann T (1967) The social construction of reality: a treatise in the sociology of knowledge. Anchor, Garden City

    Google Scholar 

  53. Keller R (2011) The sociology of knowledge approach to discourse (SKAD). Hum Stud 34:43–65.

    Article  Google Scholar 

  54. Bergermann K (2023) Code release: Twitter-Rana-Plaza. Available at

  55. Higham NJ (2008) Functions of matrices: theory and computation. SIAM, USA.

    Book  MATH  Google Scholar 

  56. Bergermann K (2021) Code release: Multiplex-matrix-function-centralities. Available at

  57. Bergermann K (2021) Code release: Urban-multiplex-networks. Available at

  58. Jeub LGS, Bazzi M, Jutla IS, Mucha PJ (2011-2019) Code release: a generalized Louvain method for community detection implemented in MATLAB. Available at

  59. Foucault M, Rabinow P, Hurley R (1997) Polemics, politics, and problematizations: an interview with Michel foucault. I: Ethics, subjectivity and truth. The New Press, New York

    Google Scholar 

  60. Foucault M (2005) The order of things. Routledge, Milton Park

    Book  Google Scholar 

  61. Knoblauch H (2019) The communicative construction of reality. Routledge, Milton Park.

    Book  Google Scholar 

  62. Glaser BG, Strauss AL (1967) The discovery of grounded theory: strategies for qualitative research. Routledge, Milton Park

    Google Scholar 

  63. Martin T, Zhang X, Newman ME (2014) Localization and centrality in networks. Phys Rev E 90(5):052808.

    Article  Google Scholar 

  64. The Rana Plaza Arrangement ((accessed on March 3rd, 2023))

  65. Hoskins T (2015) After two years, the Rana Plaza fund finally reaches its \$30m target. The Guardian

  66. Salminen J (2018) The accord on fire and building safety in Bangladesh: a new paradigm for limiting buyers’ liability in global supply chains? Am J Comp Law 66(2):411–451.

    Article  MathSciNet  Google Scholar 

  67. Brosius H-B, Eps P (1995) Prototyping through key events: news selection in the case of violence against aliens and asylum seekers in Germany. Eur J Commun 10(3):391–412.

    Article  Google Scholar 

  68. Garde-Hansen J, Hoskins A, Reading A (2009) Save As... Digital Memories. Springer, Germany.

    Book  Google Scholar 

  69. Pentzold C, Lohmeier C, Birkner T (2022) 2 Kommunikatives Erinnern. In: Handbuch Kommunikationswissenschaftliche Erinnerungsforschung, De Gruyter, Germany, pp 47–70.

  70. Bergermann K, Stoll M, Tudisco F (2023) A nonlinear spectral core-periphery detection method for multiplex networks. arXiv preprint arxiv:2310.19697

Download references


We thank the anonymous referees for their helpful comments.


Open Access funding enabled and organized by Projekt DEAL. The publication of this article was funded by Chemnitz University of Technology.

Author information

Authors and Affiliations



Both authors conceived and designed the study, collected and analyzed the data, conducted the analysis, interpreted the results, and wrote the manuscript.

Corresponding author

Correspondence to Kai Bergermann.

Ethics declarations

Ethics approval and consent to participate

Not applicable.

Consent for publication

The authors agree to the publication of the manuscript in Applied Network Science.

Competing interests

The authors declare that they have no competing interests.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit

Reprints and Permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Bergermann, K., Wolter, M. A Twitter network and discourse analysis of the Rana Plaza collapse. Appl Netw Sci 8, 74 (2023).

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI:


Mathematics Subject Classification