General methodology for inferring failure-spreading dynamics in networks

Guan, Xiangyang; Chen, Cynthia

doi:10.1073/pnas.1722313115

Research Article

Engineering

General methodology for inferring failure-spreading dynamics in networks

Xiangyang Guan https://orcid.org/0000-0002-4821-6551 and Cynthia Chen [email protected]Authors Info & Affiliations

Edited by Yueyue Fan, University of California, Davis, CA and accepted by Editorial Board Member Susan Hanson July 18, 2018 (received for review December 26, 2017)

August 15, 2018

115 (35) E8125-E8134

https://doi.org/10.1073/pnas.1722313115

PDF/EPUB

Significance

Failure spreading widely exists in many systems, but methodologies devised to understand its dynamics so far are domain-constrained and demonstrate limited applicability across different systems. This paper tackles this issue from a reverse perspective of failure-spreading processes: It takes the spreading outcomes as inputs and seeks to infer the spreading process that gives rise to the outcomes, instead of the other way around as the prevalent approaches do. Because failure-spreading outcomes are commonly observed for different systems, we envision that this approach is generally applicable and provides a promising avenue to potentially unify research on spreading dynamics across disciplines. This research will facilitate understanding system dynamics and developing control techniques for them at different systems, scales, and dimensions.

Abstract

A generic modeling framework to infer the failure-spreading process based on failure times of individual nodes is proposed and tested in four simulation studies: one for cascading failures in interdependent power and transportation networks, one for influenza epidemics, one benchmark test case for congestion cascade in a transportation network, and one benchmark test case for cascading power outages. Four general failure-spreading mechanisms—external, temporal, spatial, and functional—are quantified to capture what drives the spreading of failures. With the failure time of each node given, the proposed methodology demonstrates remarkable capability of inferring the underlying general failure-spreading mechanisms and accurately reconstructing the failure-spreading process in all four simulation studies. The analysis of the two benchmark test cases also reveals the robustness of the proposed methodology: It is shown that a failure-spreading process embedded by specific failure-spreading mechanisms such as flow redistribution can be captured with low uncertainty by our model. The proposed methodology thereby presents a promising channel for providing a generally applicable framework for modeling, understanding, and controlling failure spreading in a variety of systems.

Many natural, social, or technical phenomena, such as epidemics (1, 2), viral marketing (3, 4), metabolic reaction knockouts (5, 6), traffic congestions (7, 8), and power blackouts (9–12), are known to be underpinned by a spreading process, in which failures (e.g., disease infections) initially hitting a small set of nodes propagate to a much larger set in a network. Given the ubiquity of failure-spreading processes, understanding and controlling their dynamics from network perspectives has been an active research field since the early 2000s (13–15).

In this paper, we propose and test a general methodology for inferring the dynamics of failure-spreading processes. The time of failure for each node and a minimum amount of knowledge in the network topology are taken as inputs. The methodology seeks to infer the mechanisms underlying a spreading process and how failures propagate from node to node. Four general spreading mechanisms—external, temporal, spatial, and functional (16–20)—are conceived as abstractions of a variety of domain-specific spreading mechanisms. A stochastic cascade model based on maximum likelihood estimation (MLE) is devised to estimate parameters associated with these four general spreading mechanisms and to reconstruct the node-to-node failure-propagation process. In this stochastic cascade model, failure propagation from one node to another is viewed as a probabilistic event driven by the four general spreading mechanisms, resulting in node failures also being probabilistic. MLE is used to maximize the likelihood of nodes failing at the given failure times, with the following three assumptions: (i) failed nodes may recover but will not fail again during the study period, (ii) failures of two nodes are conditionally independent given the four general spreading mechanisms, and (iii) among the four general spreading mechanisms there exists no interaction between external factors and failure-propagation mechanisms (i.e., temporal, spatial, and functional).

A number of existing methods for modeling spreading processes are known in the literature of various disciplines. Examples of these methods include percolation theory (21–23), game theory (24, 25), the sandpile model (26, 27), the branching process (4, 28), agent-based models (29), and other domain-specific models, such as OPA for power outages (the name OPA comes from the three institutions that collaborated to develop this model: Oak Ridge National Laboratory, Power Systems Engineering Research Center at the University of Wisconsin, and the University of Alaska) (30). These lines of research, which we call collectively the “forward approach” in the rest of this paper, typically presume a network topology and a specific spreading mechanism and seek to understand how they (nonlinearly) affect outcome quantities such as the final failure size (31). The proposed methodology, which we call the “backward approach,” differs from the forward approach by reversing the traditional modeling procedure. Instead of taking spreading mechanisms and network topologies for inputs and outputting the outcome of the spreading process, the proposed backward approach takes the outcome of the spreading process (i.e., time of failure for each node) for input and seeks to infer detailed network topologies and spreading mechanisms. It thus advances the state of the art by addressing three limitations of the forward approach. First, the forward approach relies on knowledge of real-world network topologies (32, 33), which is not always available. For example, the detailed physical layouts of power networks are often kept secret for commercial or security reasons (16, 34), and the structural connectivities in human brains are not yet fully understood at the subcortical level (20, 35). Second, learning the process that leads to the spreading outcomes (e.g., failure sizes) is still intricate, if not impossible, through the forward approach (36). This is because, with the exception of few state-of-the-art models (37–39), solutions to the network dynamic equations can only be obtained at equilibrium conditions when the spreading process has been concluded and the networks are stable. The transient states of the networks, while the spreading process is proceeding, are difficult to analyze (40, 41). This limitation is also a result of the fact that most models in the forward approach focus on processes and quantities at the network level, such as network fragmentation and the size of the giant component (31), while the local node-to-node propagation process is simplified or overlooked. Third, the underlying spreading mechanisms may vary across different systems. Therefore, a model tailored for one specific system has limited generalizability to other systems. Modeling approaches with specific spreading mechanisms inhibit identifying generic patterns of spreading processes across different systems (42). The proposed backward approach addresses these three limitations by removing the requirement for network topologies as an input, formulating the node-to-node failure-propagation process probabilistically, and adopting the four general spreading mechanisms, respectively.

We tested and validated our proposed model by comparing inference results to simulation data in four simulation studies: (a) cascading failures in interdependent power and transportation networks in New York City (NYC) during Hurricane Sandy, (b) a hypothetical influenza epidemic in NYC, (c) a congestion cascade scenario in the Sioux Falls benchmark network (43), and (d) cascading power outages in the Wood–Wollenberg 6-bus benchmark system (44). The Sioux Falls benchmark network and the Wood–Wollenberg 6-bus benchmark system are widely used cases for testing newly developed models in transportation network analysis and power system analysis, respectively. Results for (a) and (b) verify that the proposed methodology is mathematically sound: The backward approach is capable of precisely inferring the parameters associated with the spreading mechanisms as well as recovering the simulated failure-spreading processes. The results for (c) and (d) cross-validate the robustness of the backward approach: The backward approach demonstrates solid performance when the failure data are independently generated using well-established domain-specific spreading mechanisms, such as user rerouting in a roadway network and load rebalancing in a power grid. The simulation study (d) also validates the backward approach by comparing the inferred failure-spreading process with data generated by a fully validated simulation procedure that has been tested in its ability to replicate a real-world process of cascading power outages (32, 45–47). All comparisons in the simulation studies between the inferred failure-spreading processes and simulation data return consistent patterns, suggesting the accuracy, robustness, and generalizability of the proposed methodology.

Model

The stochastic cascade model formulates the likelihood of a set of node failures given the four general spreading mechanisms, as shown in the upper line of Eq. 1:

\begin{matrix} L (β^{E}, β^{T}, β^{S}, β^{F}) & = & Pr (T = t | E, P_{j - i}; \forall i \in N, j \in N, i \neq j) \\ = & \prod_{i \in N} Pr (T_{i} = t_{i} | E, P_{j - i}; \forall j \in N \ i), \end{matrix}

[1]

where

T = [T_{1}, \dots, T_{n}]

is a vector of random variables representing the failure times for the n nodes and

t = [t_{1}, \dots, t_{n}]

is a realization of

T

(e.g., observed node failure times). In this paper, we consider discrete time steps. E represents external factors which could vary with contexts. In general, external factors determine the onset of a spreading process by giving rise to initial failures (also known as seeds) (18, 48).

P_{j - i}

denotes the failure-propagation probability from node j to node i, which accounts for the three failure-propagation mechanisms: temporal dependence, spatial dependence, and functional dependence. These three failure-propagation mechanisms capture how the lapse of time, the separation in geographical space, and the connections between node-level processes (e.g., individual activities in an epidemic network) change the dynamics of failure propagations between nodes. The parameters

β^{E}

,

β^{T}

,

β^{S}

, and

β^{F}

are associated with external factors, temporal dependence, spatial dependence, and functional dependence, respectively. N is the set of nodes in the network(s). Under the conditional independence assumption, the likelihood of the set of node failures can be decomposed into a sequence of (conditionally) independent failures of individual nodes (49), as presented in the second line of Eq. 1.

The node failure probability

Pr (T_{i} = t_{i} | E, P_{j - i}; \forall j \in N \ i)

in Eq. 1 can be reformulated using the hazard function

λ_{i} (t_{i})

as in Eq. 2. A hazard function, when defined in a discrete-time setting, is the instantaneous failure probability at a given time step

t_{i}

, which can be written as a conditional probability

Pr (T_{i} = t_{i} | T_{i} \geq t_{i}, E, P_{j - i}; \forall j \in N \ i)

. For readability, we leave out the conditional terms associated with external factors and failure propagations in Eq. 2:

Pr (T_{i} = t_{i}) = \{\begin{aligned} \prod_{τ = 1}^{t_{i} - 1} [1 - λ_{i} (τ)] λ_{i} (t_{i}), if t_{O S} < t_{i} \leq t_{O E} \\ \prod_{τ = 1}^{t_{O E}} [1 - λ_{i} (τ)], if t_{i} > t_{O E} \\ 1 - \prod_{τ = 1}^{t_{O S}} [1 - λ_{i} (τ)], if t_{i} \leq t_{O S} \end{aligned},

[2]

where

t_{O S}

and

t_{O E}

are the times when observation starts and ends, respectively. The three lines in Eq. 2 concern cases of observed node failures, right-censored nodes, and left-censored nodes, respectively. Proof of Eq. 2 is provided in SI Appendix. Eq. 2 shows that node failure probabilities in all three cases reduce to functional forms in terms of hazard functions.

For node i, its failure hazard at a given time step

t_{i}

is conceived as the combined hazard of failure due to external factors (if they exist) and failure due to propagations from other failed nodes. To formulate the hazard function, we assume no interaction between these two hazards, and thus failures due to external factors and propagations are viewed as independent events at this given time step

t_{i}

, which leads to Eq. 3:

λ_{i} (t_{i}) = 1 - [1 - λ_{i} (t_{i} | E)] \prod_{{j : j \in N \ i}} (1 - P_{j - i}) .

[3]

When considering failure propagation from node j to node i in a stochastic setting, there exist uncertainties regarding when node j fails. Conceptually, as long as j fails before

t_{i}

, failure propagation from j to i happening at time step

t_{i}

would be feasible. Accounting for all possible cases, the failure-propagation probability from node j to node i,

P_{j - i}

, can be specified as in Eq. 4:

P_{j - i} = \sum_{t_{j} : t_{j} < t_{i}} Pr (T_{j} = t_{j}) Pr (t_{i} - t_{j}),

[4]

where

Pr (t_{i} - t_{j})

is the probability that it takes time

t_{i} - t_{j}

for failure to propagate from node j to node i. Proof of Eq. 4 is provided in SI Appendix. Eqs. 3 and 4 decompose the failure hazard of node i at time

t_{i}

into failures attributed to external factors and propagations, denoted by

λ_{i} (t_{i} | E)

and

Pr (t_{i} - t_{j})

, respectively. The four general spreading mechanisms can thus be quantified and integrated into the model by specifying the functional forms for

λ_{i} (t_{i} | E)

and

Pr (t_{i} - t_{j})

. In the four simulation studies implemented,

λ_{i} (t_{i} | E)

and

Pr (t_{i} - t_{j})

are approximated by assuming a probability distribution for

t_{i}

and

t_{i} - t_{j}

, respectively. This allows parameterizing the model and statistically evaluating its performance in estimating the parameters associated with the spreading mechanisms. More details are provided in Materials and Methods.

Inference in the above stochastic cascade model is thus formulated as an MLE problem with the likelihood function defined by Eq. 1. MLE has been applied to modeling the spreading dynamics of disease (50–52) and information (53–55). These two lines of research, however, only account for temporal dependence (i.e., the failure time interval) between nodes and require multiple instances of cascades for the model estimation. The present model differs from the existing ones by synthesizing the four general spreading mechanisms, thus enabling the estimation with a single cascade instance.

Results

The above inference model is tested by comparing the inferred failure-spreading processes with the simulated failure-spreading processes in the four simulation studies. For the study of interdependent infrastructure cascading failures (a) and influenza epidemic (b), we randomly generate the parameter values and simulate failure-spreading processes based on the generated parameter values. The failure time of each node is produced from the simulations and is fed into the above inference model to obtain the inferred parameter values and inferred failure-spreading processes. For the study of congestion cascade (c) and cascading power outages (d), we employ two simulators that replicate real-world failure-spreading processes to generate node failure times in the transportation network and power network, respectively. These two simulators assume spreading mechanisms specific to transportation systems and power systems and are independent of the inference model. The failure times are then used as inputs to the inference model. The detailed formulations of the inference model, which involve conceiving the functional forms for

λ_{i} (t_{i} | E)

and

Pr (t_{i} - t_{j})

, are the same for (a), (c), and (d) with the adoption of the four general spreading mechanisms. Spreading mechanisms specific to influenza epidemics are integrated into the model formulation of (b) to test the adaptability of the proposed methodology to domain-specific spreading processes when their underlying mechanisms are well understood (56). More details on inference model formulations are provided in Materials and Methods.

(a) Interdependent Infrastructure Cascading Failures.

The four general spreading mechanisms are adopted as abstractions of the complex and difficult-to-model (specific) spreading mechanisms behind cascading failures of interdependent infrastructures (34). The external factors include the impacts of high winds and storm surges (57), evaluated by parameters

β_{1}^{E} \in [0,1]

and

β_{2}^{E} \in [0,1]

, respectively. Spatial dependence is associated with parameters

β_{1}^{S} \in [0,1]

and

β_{2}^{S} \in [0, \infty)

, which capture the location-based failure-propagation likelihood (58) between two nodes when they are colocated and when they are separated by a distance, respectively. Functional dependence is drawn from null models (SI Appendix) and is characterized by

β^{F}

, which denotes the likelihood of failure propagation from one node to another if their operations are related (e.g., one providing resources to the other). The parameter estimates, together with their preset values and t values from the MLE, are shown in Table 1. A t value is the standardized deviation of a parameter estimate from its preset value, evaluated as

t = (\hat{β} - β) / S E_{β}

, where

\hat{β}

is the parameter estimate,

β

is the parameter’s preset value, and

S E_{β}

is the estimated standard error for this parameter.

Table 1.

Parameter estimates for the model of infrastructure cascading failures

Parameter	Preset value	Estimate	t value
$β_{1}^{E}$	0.035	0.037	1.13
$β_{2}^{E}$	0.113	0.113	0.02
$β_{1}^{S}$	0.032	0.032	0.09
$β_{2}^{S}$	1.691	1.691	0.01
$β^{F}$	0.065	0.047	−1.81

All parameter estimates in Table 1 have insignificant deviations from their preset values, indicated by t values within the range [

-

1.96, 1.96]. This demonstrates the capability of our model to accurately quantify how the general spreading mechanisms contribute to the emergence of cascading failures.

The numbers of failed nodes due to external factors and failure propagations at each time step are shown in Fig. 1. Their temporal patterns show remarkable correspondence between the simulated curves (Fig. 1A) and the inferred ones (Fig. 1B). In terms of trends, the total number of failed nodes and number of failures due to propagations keep increasing until time step 5, before they drop to close to zero around time step 14. Overall this trend is consistent with the empirical results on disaster impacts during Hurricane Sandy (57). The number of failed nodes due to external factors decreases straight to zero at time step 6. The highest numbers of failed nodes per time step for the simulation and inference result are also similar with a value of 1,050 and 1,037, respectively, both at time step 5. The almost identical temporal patterns of simulation and inference result provide strong evidence that the model can capture the underlying cascading failure process given cascading failure outcomes (node failure times). This is further demonstrated by the comparison of failure hazards in Fig. 2.

Fig. 1.

Comparison between the simulated cascading process (A) and the inferred one (B). Different curves correspond to different spreading mechanisms. Failure due to external factors represents damages caused by storm surges and high winds. Propagation means failures of other nodes lead to the failure of a given node. The total number is the sum of the numbers of failures due to the above two mechanisms.

Fig. 2.

Evolution of failure hazards in the simulated cascading failure process (A) and the inferred one (B). Each figure shows the hazards of failures due to external factors or propagations separately.

The inferred hazard functions (Fig. 2B) present patterns similar to the simulated ones (Fig. 2A). The hazard of failure due to external factors trends downward from the beginning of the simulation, while the hazard of failure due to propagation reaches a peak around time step 6 and decreases thereafter. The result suggests that the model is able to recover the dynamics of general spreading mechanisms in a scenario where they are constantly evolving. In addition, although the failure hazards due to external factors and due to propagation assume similar functional forms (Materials and Methods), the differences in their temporal trends suggest that a single functional form in the detailed model formulation could capture different dynamics for the general spreading mechanisms. This property suggests our model’s versatility in handling diverse types of failure-spreading processes.

The spatial patterns of infrastructure cascading failures in the simulation and inference result are presented in Fig. 3. The inference outcomes (Fig. 3 B and D) accurately capture the simulated failure time of power service and transportation service (Fig. 3 A and C) in the majority of the census tracts in NYC. The inferred failures and simulated failures show comparable spatial patterns in terms of how they spread geographically: Early-stage failures (at time steps 1–3) are scattered across the region, surrounded by failures that occurred later in time. This implies that certain census tracts act as centers that propagate failures of infrastructure service to their surrounding census tracts.

Fig. 3.

Comparison of the spatial patterns of infrastructure cascading failures between the simulated patterns (A and C) and inferred patterns (B and D). A and B show simulated and inferred failure time of transportation service in each census tract. C and D show corresponding results for the power service.

In addition to the temporal and spatial patterns of failure spreading, we also look into the node-to-node failure-propagation patterns, which are compared between the simulated patterns (Fig. 4A) and the inferred patterns (Fig. 4B) by first visualizing the propagation networks (as shown in the graphs) and then quantifying the degree distributions of the propagation networks. In a propagation network, a (directed) link from node A to node B signifies that A is the most likely source to have propagated failure to B. In other words, a propagation network depicts the most likely paths through which failures propagate from the initial seeds to the rest of the network.

Fig. 4.

Comparison between the simulated failure-propagation network (A) and the inferred one (B). The histograms and the curves in the figures show the degree distributions of the propagation networks and power-law functions fitted to the degree distributions, respectively. The graph inside each figure illustrates the propagation network, with nodes colored orange and links colored blue.

In Fig. 4, the visualized propagation networks show comparable patterns between simulation (Fig. 4A) and inference (Fig. 4B): Nodes form tiny clusters among which few links are present. This suggests that in both the simulated data and the inference result the node-to-node propagation patterns are “local,” where any single node only influences a few other nodes rather than having a widespread influence over the entire infrastructure networks. This is also implied by the degree distributions shown in Fig. 4, on which power-law distributions can be fitted. The simulated propagation network and the inferred propagation network have notably matched degree distributions, with both similar densities for each degree category and close values of the fitted power-law coefficients.

(b) Influenza Epidemic.

We apply the inference model to a hypothetical influenza-spreading scenario in NYC. Its setting is different from the infrastructure cascading failures in case (a) in five aspects. First, each node in the influenza-spreading network represents an individual person, which means the size of the epidemic network is much larger than the interdependent infrastructure networks where a node represents a given infrastructure service in a zone. The total number of nodes (individuals) in the influenza-spreading network is 4,414,610 out of a population of 8,175,133 in NYC, considering the effect of vaccination (59, 60). Second, a recovery process is modeled in the formation of influenza epidemics. The susceptible–infected–recovered model is used to simulate the influenza epidemic, in which each infected (or equivalently, “failed”) individual recovers and is no longer infective after 6 d in infection (61). Third, external factors are not considered in the epidemic study, as inflow of influenza infection cases from outside the study region is assumed to be controllable through travel restrictions (62). Fourth, a temporal dependence parameter

β^{T} \in [0, \infty)

is introduced to account for how the separation of two node failures in time affects the likelihood of failure propagation between these two nodes. Finally, a specific failure-spreading mechanism (i.e., close human contact) is modeled in the epidemic process, compared with the general failure-spreading mechanisms used in the infrastructure cascading failures. This setting is attributed to the insights in spreading mechanisms of infectious disease in existing research (e.g., refs. 56, 62, and 63). The close-contact rate between two individuals is modeled to be proportional to the travel rate (number of trips per day) with a coefficient

β_{1}^{F} \in [0,1]

if these two individuals are located in different zones and proportional to the population density (population per square mile) with a coefficient

β_{2}^{F} \in [0,1]

if the two individuals live in the same zone (63). In addition, spatial dependence in the epidemic model depends on zone proximity: Any two individuals living in neighboring zones (which means the two zones share some boundaries) have additional likelihood

β^{S} \in [0,1]

of transmitting influenza between them in addition to the transmission likelihood attributed to close contact. This treatment for spatial dependence is intended to take into account other but less-common influenza-spreading mechanisms such as exposure to contaminated fomites (56). The parameter estimates, along with their preset values and t values, are presented in Table 2.

Table 2.

Parameter estimates for the model of influenza epidemic

Parameter	Preset value	Estimate	t value
$β_{1}^{F}$	0.0015	0.0014	−0.07
$β^{S}$	0.017	0.017	−0.0001
$β_{2}^{F}$	0.038	0.038	−7×10⁻⁷
$β^{T}$	3.6	3.6	0.07

All four parameters in Table 2 are accurately estimated, with estimates close to their preset values and deviations being insignificant (

| t | < 1.96

). This demonstrates the capability of the inference model to recover the dynamics of a failure-spreading process when underlying (specific) spreading mechanisms are known and can be integrated into the inference model. The inferred epidemic process, in comparison with the simulated one, is illustrated in Fig. 5.

Fig. 5.

Comparison between the simulated epidemic process and the inferred one. A illustrates the number of influenza cases per time step throughout the study period, and B shows the temporal evolution of reproduction numbers. The red line marks the threshold value of 1 for reproduction numbers.

The simulated epidemic process and the inferred epidemic process in Fig. 5 show almost identical patterns. In terms of the number of influenza cases per time step, the curves for the simulated cases and inferred cases almost overlap: The inferred number of cases is within 1% deviation from the simulated number of cases for any time step. Two waves of infection outbreaks, one around time step 13 and the other around time step 25, are noticeable in both the simulated and inferred epidemic process. The reproduction numbers in both the simulation and inference result show decreasing trends and cross the threshold of 1 (0 at log-scale) at the same time step of 6. These findings demonstrate our model’s ability to recover the temporal patterns of an influenza spreading process. In terms of the spatial patterns, Fig. 6 illustrates the proportion of infected population in each census tract at time steps 10, 20, and 30 for the inferred epidemic process (Fig. 6 A–C) and the simulation (Fig. 6 D–F), respectively. It shows highly consistent spatial patterns between the two: The disease first emerges and spreads locally in southern Brooklyn, gradually covers the entire borough, and propagates to Manhattan, Queens, and Staten Island by the end of the study period.

Fig. 6.

Spatial-temporal evolution of infections. A–C illustrate the inferred proportion of infected population in each census tract at time steps 10, 20, and 30, respectively. D–F are the corresponding proportions in the simulation data.

The node-to-node failure-propagation patterns are analyzed at both zone level and individual level. The zone-level propagation networks, which indicate whether influenza spreads from one zone (census tract) to another, are illustrated in Fig. 7. The simulated propagation network (Fig. 7A) and the inferred propagation network (Fig. 7B) share a noticeable topological pattern: The network organization evolves from a lattice-like structure observed at early stages to a tree-like structure later in the epidemic process. The former pattern is recognizable in the upper part of the networks in Fig. 7, which is characterized by small, overlapping cycles, and the latter is apparent in the lower part of the networks, showing “spikes” that are long branches in the tree structures. This finding adds to the evidence that the proposed model is capable of capturing the failure-propagation patterns at the network level as well as how the patterns evolve over time.

Fig. 7.

Comparison of the failure-propagation (i.e., influenza-spreading) networks at the zone level between simulation (A) and inference result (B). Every node in the networks represents a census tract, and a link from node A to node B indicates there exist (at least one) cases in which an individual from census tract A infects an individual(s) in census tract B.

The degree distributions of the propagation networks at both zone level and individual level are evaluated in Fig. 8. At the zone level (Fig. 8 A and B), the node degrees follow Poisson distributions, which implies that zone-to-zone propagations take place independently of each other and can be approximated by Bernoulli processes. The degrees of individuals in the propagation networks are fitted into power-law distributions, suggesting that most people infect only a few others while a small number may have transmitted influenza to a large group. The proposed inference model successfully reconstructs the simulated degree distributions at the two scales, with similar shapes of the distributions as well as close Poisson/power-law coefficients between the simulated propagation networks and inferred propagation networks.

Fig. 8.

Comparison of degree distributions of failure-propagation networks. A and B show the simulated and inferred degree distributions of the network at the zone level. The distributions are fitted to Poisson distributions with similar parameters. C and D show the corresponding results at the individual level. In an individual-level influenza-spreading network, nodes represent individual people, and a link from one to the other indicates the former infects the latter. Power-law distributions are fitted for the degree distributions in this case.

(c) Congestion Cascade.

In a roadway network, when a link (i.e., a road segment) gets congested, travelers using this link will choose a new route to bypass the congested link(s) (64). This behavior causes a redistribution of traffic flow from the congested link(s) to other links and could overload other links, resulting in a congestion cascade.

We apply the inference model to a congestion cascade by modeling it as a failure-spreading process where failure of a link is interpreted as being congested. The Sioux Falls network, shown in Fig. 9A, serves as the benchmark network (43) to which the inference results are compared. The network is initially operating at user-equilibrium state (65) in which every user chooses the fastest route in travel time among all possible routes. At time step 1, links 6

\to

8 (meaning the road segment from node 6 to node 8 in Fig. 9A) and 8

\to

6 would fail as they are already overloaded under user equilibrium and become the seeds for subsequent failure propagations. At every time step, overloaded links fail and their flow is redistributed to the surviving links at the next time step. The time step when each link fails is recorded and fed as input into the inference model. The simulated link failure times are illustrated in Fig. 9B. The number of failed links per time step is shown in Fig. 9 C and D for the simulation data and inference result, respectively.

Fig. 9.

Sioux Falls network and the congestion cascade process. The topology of the Sioux Falls roadway network is illustrated in A. Each node represents a roadway intersection and each link represents a road segment. The network after the congestion cascade is shown in B. The red dashed links are congested, with the attached numbers denoting failure times. C and D show the simulated number of failed (congested) links per time step and the inferred number of failed links per time step, respectively. RC indicates the right-censoring period (i.e., when the time step is larger than 4).

The congestion cascade process is concluded in four time steps when the network reaches a new equilibrium where no new flow redistributions or congestions/failures happen. The numbers of congested/failed links at every time step in the simulation data and inference result demonstrate remarkable similarities, with deviations within

\pm

2. Both numbers increase for three time steps before dropping at the fourth time step. As no more failure happens after time step 4, the surviving links at the end of time step 4 become right-censored, whose numbers are also the same for the simulation data and inference result.

The simulated and inferred failure-propagation networks are shown in Fig. 10 A and B, respectively. Every node in Fig. 10 corresponds to a link (i.e., a road segment) in the Sioux Falls network (Fig. 9A). Both propagation networks show a separation of two clusters, one initialized by the failure of link 6

\to

8 (representing the road segment from 6 to 8 in Fig. 9A but denoted as node 6

\to

8 in Fig. 10) and the other by the failure of link 8

\to

6. More interestingly, the topologies of the clusters initialized by link 6

\to

8 in the simulation data and inference result are exactly the same. For the cluster extending from link 8

\to

6, the model is successful in recovering the key hubs for propagating failures. In Fig. 10A, the links 8

\to

6, 8

\to

9, and 9

\to

5 are identified as hubs, as each of them propagates failures to more than one link. They thus play a more important role in facilitating the failure propagations than other links and could serve as potential candidates for strengthening to inhibit the progress of failure spreading. These three links emerge as hubs in the inferred failure-propagation network (Fig. 10B) as well. A noticeable difference between the simulated propagation network and the inferred one is the overestimation of the number of hubs. The inference model misidentifies links 11

\to

4 and 21

\to

24 as hubs, which does not agree with the simulated propagation network. This difference likely results from the power-law degree distribution assumed for the functional dependence (SI Appendix) in the inference model. Due to the fat tail in a power-law distribution, it tends to generate a larger number of high-degree links (road segments) that have functional dependence with multiple other links (road segments), compared with a binomial distribution which suggests more random occurrence of functional dependences. This finding casts doubts on the common belief that power-law distributions are prevalent in transportation systems (66).

Fig. 10.

Failure-propagation patterns for simulated data (A) and inference result (B). Each node in this figure represents a link in the Sioux Falls network (Fig. 9A) that fails during the congestion cascade process. For example, the node labeled 8 $\to$ 6 corresponds to the link (road segment) from 8 to 6 in Fig. 9A. A link in this figure represents the direction of failure propagation from one link (road segment) to another link (road segment) in Fig. 9A.

(d) Cascading Power Outages.

We use a validated simulation model called DCSIMSEP (45, 46) and replicate a detailed simulation process on the 6-bus benchmark system (44) as implemented by Hines et al. (47). DCSIMSEP simulates multiple complex mechanisms in power outage cascades, including grid separation, load rebalancing, load shedding, flow redispatch, and overloading failures (SI Appendix). This simulator was validated by comparing its simulation results with prior research results in 41 power networks (32). Hines et al. (47) used this simulator to generate 1,000 instances of outage cascade in the 6-bus benchmark system and calculated the probability of outage propagating from one power line to another, which they called an “influence graph” (SI Appendix).

Instead of using 1,000 cascade instances, we randomly generate one power outage cascade with simulation settings identical to those in ref. 47 and apply the proposed inference model to this one cascade to obtain the propagation network. The 6-bus benchmark system, the simulated outage cascade (power line failure times) upon it, and the inferred power line failure times are illustrated in Fig. 11.

Fig. 11.

A simplified diagram of the 6-bus benchmark power system, with simulated failure times (A) and inferred failure times (B) marked. Black-colored nodes are power generators that produce electricity, and orange-colored nodes are users who consume electricity. Links in the networks symbolize power lines. The number attached to each link is the power line’s failure time. RC indicates this link is right-censored and does not fail in the simulation or inference model. The two dashed lines are initially removed and thus regarded as nonexistent in both the simulation and inference model (*SI Appendix*).

The simulated failure times and the inferred ones are exactly the same except for two power lines, namely line 1–4 (i.e., the power line connecting node 1 and 4 in Fig. 11) and line 2–3. The influence graph obtained by Hines et al. (47) is shown in Fig. 12A, in comparison with the inferred propagation network in Fig. 12B.

Fig. 12.

Comparison between the influence graph (47) and propagation network, shown in A and B, respectively. Every node in the two graphs corresponds to a power line (i.e., a link in Fig. 11), and every link represents a nonzero probability of failure propagating from one power line to another. A darker and bolder link indicates a higher probability of failure propagation. The number associated with each link in B is the failure-propagation probability. A is reprinted with permission from ref. 47 (Copyright 2017, IEEE).

The influence graph and the inferred propagation network are consistent in two key aspects. First, for every power line that fails in the simulation its most likely source of failure in the propagation network coincides with the one in the influence graph. This consistency is evident for the propagations from line 2–4 to lines 1–4, 2–5, and 4–5, which present high probabilities (marked by dark bold arrows) in both the influence graph and propagation network. For line 2–3, the propagation network also correctly identifies its most likely source of failure as line 2–5. Second, failure-propagation probabilities from line 3–5 to lines 1–4 and 2–3 and from line 2–5 to 4–5 are all insignificant (

<

0.05) in the propagation network, which corresponds to the small probabilities of these propagation paths in the influence graph. Two propagation paths, one from line 3–5 to 4–5, and the other from line 2–4 to 2–3, show differences between the influence graph and propagation network. The failure-propagation probability from 3–5 to 4–5 is significantly larger in the influence graph than in the propagation network. This could be attributed to the propagation network’s being based on only one cascade instance, which has insufficiently characterized the propagation likelihood from 3–5 to 4–5. The use of only one cascade in the propagation network is also responsible for another difference between the influence graph and propagation network: The lines 1–2, 3–6, and 5–6 are isolated in the propagation network, as no failure happens to them in this single-cascade simulation.

Computational Complexity.

We evaluated the computational complexity of the proposed MLE and find a theoretical range from O(n) to

O (T_{o b s}^{2} n^{2})

(where

T_{o b s}

is the length of the observation period), depending on the underlying distribution of input failure times (SI Appendix). When node failures are distributed evenly in time across the study period (i.e., uniformly distributed), the computation time is close to the upper bound; when node failure times have a more concentrated distribution around a certain value, the computation time moves toward the lower bound. Considering that spreading events are typically fast processes characterized by large numbers of failures in a short span of time (67, 68), it is expected that when applied to real-world data the proposed model should demonstrate a performance close to O(n). This result is verified by a numerical analysis of the relationship between computation time and network size (i.e., the number of individuals) in the influenza epidemic simulations, as depicted in Fig. 13. In this analysis, the population in every census tract is decreased by a common scaling factor, and the computation time is evaluated as the scaling factor changes. Fig. 13 supports that the inference model has a computational complexity close to O(n) by demonstrating a close-to-linear relationship between the computation time and the scaling factor.

Fig. 13.

Numerical analysis of computational complexity. The figure shows how computation time for one iteration in solving the MLE changes with varying network sizes. For each network size (corresponding to a population scaling factor in the figure), we implement 100 instances of simulation and inference with randomized population selections. Each point in the figure represents one instance. The line connects the mean computation time over the 100 instances for every value of the network size.

Conclusion and Discussion

Modeling failure-spreading processes is an active and promising research field across multiple disciplines, considering their ubiquity and many unaddressed challenges in the current state of the art. The ultimate goals along this line of research are to understand and control the spreading of failures. A backward-approach modeling framework aimed at these goals is proposed and tested here. Four general spreading mechanisms, which serve as abstractions of a variety of domain-specific failure-spreading mechanisms, are assumed to underlie all failure-spreading processes. They are formulated into a survival-analysis-based model formulation that describes how the four general spreading mechanisms give rise to a cascade of node failures. MLE is used to estimate the parameters associated with the general spreading mechanisms. With the estimated parameters, the spreading process and node-to-node failure-propagation patterns are reconstructed.

The proposed modeling framework demonstrates good performance both in quantification of the four general spreading mechanisms and in reconstructing the failure-spreading process temporally and spatially as well as the local node-to-node failure-propagation patterns. In the simulation study of interdependent infrastructure cascading failures (a) and influenza epidemic (b) where the parameter values are preset in the simulations, the proposed model accurately infers the parameters in both cases. The temporal-spatial patterns of the failure-spreading processes, as well as the node-to-node propagation patterns are compared between the simulation data and inference results in all four simulation studies. The simulated and inferred temporal-spatial patterns show remarkable correspondence consistently throughout the four studies. The proposed model is also capable of recovering properties of the simulated failure-propagation patterns, such as the degree distributions of the propagation networks, the clustering of failures, the emergence of hubs in the propagation process, and the most likely failure-propagation paths to individual nodes, as demonstrated by all four simulation studies. Particularly in the case of cascading power outages (d), the inferred node-to-node failure-propagation probabilities using one cascade instance are consistent with the result from a validated simulation approach using the same power network but 1,000 cascade instances. This further demonstrates the capacity of the proposed approach in inferring failure-spreading processes under non-data-intensive circumstances, considering that certain types of failure-spreading data such as data on large-scale infrastructure failures are often sparse.

One challenge in modeling failure spreading is that (specific) spreading mechanisms vary from case to case and in different systems. Consequently, the specific spreading mechanisms involved in a real-world failure-spreading instance are usually not immediately known following the failures, and only knowable after a period of investigations and study (69, 70). The proposed approach demonstrates potential capacity to address this challenge by using a high-level abstraction of specific spreading mechanisms, which are represented by the four general spreading mechanisms. Two properties of the proposed approach could be summarized from the four simulation studies. First, the model demonstrates the flexibility of integrating knowledge in specific spreading mechanisms. In the influenza epidemic simulation study (b), where the specific spreading mechanism (i.e., close human contact) is known, this specific mechanism can be formulated into the model. In other words, the proposed approach can be operational both at an abstract level with the four general spreading mechanisms and at a more detailed level with domain-specific spreading mechanisms. This provides the proposed approach with the capacity to model both a well-understood spreading process (e.g., the spreading of some infectious diseases) as well as an emergent spreading process where there is little knowledge. Second, the ability of the proposed approach in inferring the dynamics of failure-spreading processes can be independent of the specific spreading mechanisms underlying the spreading processes in specific systems. This is shown through the simulation studies of congestion cascade (c) and cascading power outages (d), where the simulation data are generated by distinct (specific) spreading mechanisms. We show that our inference model recovers notably similar spreading processes and node-to-node failure-propagation patterns with their simulated counterparts that are independently generated using domain-specific simulators. This finding has two suggestions: (i) the proposed model has general applicability to a variety of systems and spreading processes; and (ii) the model demonstrates robustness against misspecifications of its detailed formulations that quantify the spreading mechanisms.

One limitation of the proposed approach is the approximations made in formulating the likelihood of node failure due to external factors and the likelihood of node-to-node failure propagation, by assuming a probability distribution of node failure time (due to external factors) and a distribution of failure time interval, respectively. The approximations can cause two potential issues. First, it creates an interaction effect among the three failure-propagation mechanisms. In particular, temporal dependence is made interactive with spatial dependence and functional dependence. This is because the parameterization of the probability distributions in terms of spatial dependence and functional dependence affects the shape of the distributions which capture temporal dependence. The interaction among the propagation mechanisms is a potential problem when the time dimension has an effect on the node-to-node failure propagations (e.g., periodicity in propagation occurrences) independent from spatial and functional factors. Interpreting the modeling results to capture the dynamics of each failure-propagation mechanism would then be difficult in this case, as their dynamics would confound each other. Second, a predefined and fixed probability distribution may be insufficient to characterize the dynamics of the failure-spreading mechanisms under certain circumstances. This is especially the case for a spreading process with a long life span, such as a global influenza epidemic which could last for months or even years. In this case, the spreading mechanisms themselves could be evolving over time, resulting in shifts in the parameter values or even the type of probability distributions. Addressing this issue would require an inference model formulation that is capable of updating itself, which is a potentially interesting topic for future research.

Materials and Methods

The detailed model formulations are described in this section. The general functional forms for

λ_{i} (t_{i} | E)

and

Pr (t_{i} - t_{j})

can be described by Eqs. 5 and 6, respectively:

λ_{i} (t_{i} | E) = f_{e} (t_{i}, E; β^{E}, β^{T}),

[5]

Pr (t_{i} - t_{j}) = f_{p} (τ_{j i}; β^{S}, β^{T}, β^{F}),

[6]

where

τ_{j i}

is the time interval between the failures of node i and node j. Two specific functional forms for

f_{e}

and

f_{p}

are used in the four simulation studies. In interdependent infrastructure cascading failures (a), geometric functions are adopted for both

f_{e}

and

f_{p}

as in Eqs. 7 and 8. In the influenza epidemic (b), congestion cascade (c), and cascading power outages (d),

f_{p}

is characterized by Weibull functions as in Eq. 9. With no external factors considered,

f_{e}

is not used in the latter three simulation studies:

f_{e, 1} (t_{i}; β^{E}) = {[1 - p_{i, 1} (β^{E})]}^{t_{i} - 1} p_{i, 1} (β^{E}),

[7]

f_{p, 1} (τ_{j i}; β^{S}, β^{F}) = {[1 - p_{j i, 1} (β^{S}, β^{F})]}^{τ_{j i} - 1} p_{j i, 1} (β^{S}, β^{F}),

[8]

f_{p, 2} (τ_{j i}; β^{S}, β^{T}, β^{F}) = \exp {- {[γ_{j i, 2} (β^{S}, β^{F}) τ_{j i}]}^{β^{T}}} - \exp {- {[γ_{j i, 2} (β^{S}, β^{F}) (τ_{j i} + 1)]}^{β^{T}}},

[9]

where

p_{i, 1}

and

p_{j i, 1}

are the likelihoods of failure due to external factors for node i and failure propagation from node j to node i at a single time step, respectively, and

γ_{j i, 2}

is the rate of failure propagations from node j to node i. More specifically in the infrastructure cascading failures (a), the instantaneous failure likelihood due to external factors

p_{i, 1} (β^{E})

is formulated as a function of the strength of high winds and storm surges, as in Eq. 10:

p_{i, 1} (β^{E}) = 1 - 1 / e^{(β_{1}^{E} E_{i}^{W S} + β_{2}^{E} E_{i}^{F D})},

[10]

where

E_{i}^{W S}

and

E_{i}^{F D}

are measures of the strength of high winds and storm surges at node i. The instantaneous failure-propagation likelihood,

p_{j i, 1} (β^{S}, β^{F})

in the infrastructure cascading failures (a), however, has the same formulation as the failure-propagation rate

γ_{j i, 2}^{(c, d)} (β^{S}, β^{F})

in congestion cascade (c) and cascading power outages (d), which is presented in Eq. 11:

p_{j i, 1} (β^{S}, β^{F}) = γ_{j i, 2}^{(c, d)} (β^{S}, β^{F})

= β_{1}^{S} e^{(- β_{2}^{S} d_{j i})} + β^{F} A_{j i}^{n u l l} - β_{1}^{S} β^{F} A_{j i}^{n u l l} e^{(- β_{2}^{S} d_{j i})},

[11]

where

d_{j i}

is the distance between node i and node j and

A_{j i}^{n u l l}

is the functional dependence structure denoting whether node i is functionally dependent on node j.

The influenza epidemic study (b) has a unique formulation of failure-propagation rate

γ_{j i, 2}^{(b)} (β^{S}, β^{F})

, which assumes a linear relation with trip rate, zone proximity or population density, as in Eq. 12:

γ_{j i, 2}^{(b)} (β^{S}, β^{F}) = \{\begin{aligned} β_{1}^{F} v_{j i} + β^{S} \cdot P r o x (i, j), & i f z_{i} \neq z_{j} \\ β_{2}^{F} ρ_{i} + β^{S}, & i f z_{i} = z_{j} \end{aligned},

[12]

where

z_{i}

and

z_{j}

are the census tracts individual i and j are located in,

v_{j i}

is the trip rate (number of trips per day) from

z_{j}

to

z_{i}

,

ρ_{i}

is the population density in

z_{i}

, and Prox(i,j) is an indicator of whether

z_{i}

and

z_{j}

are neighboring census tracts.

Acknowledgments

We thank M. Elizabeth Halloran (University of Washington) for her invaluable help in revising and preparing this paper. This work was supported by National Science Foundation Civil, Mechanical and Manufacturing Innovation Grant 1536340 and National Institutes of Health Grant 1R01GM108731-01A1.

Supporting Information

Appendix (PDF)

Download
940.60 KB

References

1

NM Ferguson, CA Donnelly, RM Anderson, The foot-and-mouth epidemic in Great Britain: Pattern of spread and impact of interventions. Science 292, 1155–1160 (2001).

Crossref

PubMed

Google Scholar

2

P Teunis, et al., Infectious disease transmission as a forensic problem: Who infected whom? J R Soc Interf 10, 20120955 (2013).

Crossref

PubMed

Google Scholar

3

D Kempe, J Kleinberg, É Tardos, Maximizing the spread of influence through a social network. Theor Comput 11, 105–147 (2015).

Crossref

Google Scholar

4

R Lans, G Bruggen, J Eliashberg, B Wierenga, A viral branching model for predicting the spread of electronic word of mouth. Marketing Sci 29, 348–365 (2010).

Crossref

Google Scholar

5

K Takemoto, T Tamura, T Akutsu, Theoretical estimation of metabolic network robustness against multiple reaction knockouts using branching process approximation. Phys A Stat Mech Appl 392, 5525–5535 (2013).

Crossref

Google Scholar

6

AG Smart, LAN Amaral, JM Ottino, Cascading failure and robustness in metabolic networks. Proc Natl Acad Sci USA 105, 13223–13228 (2008).

Crossref

PubMed

Google Scholar

7

R Ma, X Ban, J-S Pang, Continuous-time dynamic system optimum for single-destination traffic networks with queue spillbacks. Transp Res B Methodological 68, 98–122 (2014).

Crossref

Google Scholar

8

JJ Wu, HJ Sun, ZY Gao, Cascading failures on weighted urban traffic equilibrium networks. Phys A Stat Mech Appl 386, 407–413 (2007).

Crossref

Google Scholar

9

P Hines, K Balasubramaniam, EC Sanchez, Cascading failures in power grids. IEEE Potentials 28, 24–30 (2009).

Crossref

Google Scholar

10

R Albert, I Albert, GL Nakarado, Structural vulnerability of the North American power grid. Phys Rev E 69, 025103 (2004).

Crossref

PubMed

Google Scholar

11

CD Brummitt, RM D’Souza, EA Leicht, Suppressing cascades of load in interdependent networks. Proc Natl Acad Sci USA 109, E680–E689 (2012).

Crossref

PubMed

Google Scholar

12

AE Motter, Y-C Lai, Cascade-based attacks on complex networks. Phys Rev E 66, 065102 (2002).

Crossref

Google Scholar

13

MEJ Newman, Spread of epidemic disease on networks. Phys Rev E 66, 016128 (2002).

Crossref

PubMed

Google Scholar

14

C Moore, MEJ Newman, Epidemics and percolation in small-world networks. Phys Rev E 61, 5678–5682 (2000).

Crossref

Google Scholar

15

R Pastor-Satorras, A Vespignani, Epidemic spreading in scale-free networks. Phys Rev Lett 86, 3200–3203 (2001).

Crossref

PubMed

Google Scholar

16

SM Rinaldi, JP Peerenboom, TK Kelly, Identifying, understanding, and analyzing critical infrastructure interdependencies. IEEE Control Syst 21, 11–25 (2001).

Crossref

Google Scholar

17

DS Bassett, et al., Dynamic reconfiguration of human brain networks during learning. Proc Natl Acad Sci USA 108, 7641–7646 (2011).

Crossref

PubMed

Google Scholar

18

Y Yang, T Nishikawa, AE Motter, Small vulnerable sets determine large network cascades in power grids. Science 358, eaan3184 (2017).

Crossref

PubMed

Google Scholar

19

Y Yang, T Nishikawa, AE Motter, Vulnerability and cosusceptibility determine the size of network cascades. Phys Rev Lett 118, 048301 (2017).

Crossref

PubMed

Google Scholar

20

JD Power, et al., Functional network organization of the human brain. Neuron 72, 665–678 (2011).

Crossref

PubMed

Google Scholar

21

LM Sander, CP Warren, IM Sokolov, C Simon, J Koopman, Percolation on heterogeneous networks as a model for epidemics. Math Biosci 180, 293–305 (2002).

Crossref

PubMed

Google Scholar

22

DJ Watts, A simple model of global cascades on random networks. Proc Natl Acad Sci USA 99, 5766–5771 (2002).

Crossref

PubMed

Google Scholar

23

D Li, et al., Percolation transition in dynamical traffic network with evolving critical bottlenecks. Proc Natl Acad Sci USA 112, 669–672 (2015).

Crossref

PubMed

Google Scholar

24

W-X Wang, Y-C Lai, D Armbruster, Cascading failures and the emergence of cooperation in evolutionary-game based models of social and economical networks. Chaos Interdiscip J Nonlinear Sci 21, 033112 (2011).

Crossref

PubMed

Google Scholar

25

P Matjaž, Evolution of cooperation on scale-free networks subject to error and attack. New J Phys 11, 033027 (2009).

Crossref

Google Scholar

26

L Huang, L Yang, K Yang, Geographical effects on cascading breakdowns of scale-free networks. Phys Rev E 73, 036102 (2006).

Crossref

Google Scholar

27

KI Goh, DS Lee, B Kahng, D Kim, Sandpile on scale-free networks. Phys Rev Lett 91, 148701 (2003).

Crossref

PubMed

Google Scholar

28

I Dobson, BA Carreras, VE Lynch, DE Newman, Complex systems analysis of series of blackouts: Cascading failure, critical points, and self-organization. Chaos Interdiscip J Nonlinear Sci 17, 026103 (2007).

Crossref

PubMed

Google Scholar

29

S Eubank, et al., Modelling disease outbreaks in realistic urban social networks. Nature 429, 180–184 (2004).

Crossref

PubMed

Google Scholar

30

I Dobson, BA Carreras, VE Lynch, DE Newman, An initial model for complex dynamics in electric power system blackouts. Proceedings of the 34th Annual Hawaii International Conference on System Sciences (IEEE Computer Society, Washington, DC), pp. 710–718 (2001).

Crossref

Google Scholar

31

AL Barabási Network Science (Cambridge Univ Press, Cambridge, UK, 2016).

Google Scholar

32

P Hines, E Cotilla-Sanchez, S Blumsack, Do topological models provide good information about electricity infrastructure vulnerability? Chaos Interdiscip J Nonlinear Sci 20, 033122 (2010).

Crossref

PubMed

Google Scholar

33

S Boccaletti, V Latora, Y Moreno, M Chavez, DU Hwang, Complex networks: Structure and dynamics. Phys Rep 424, 175–308 (2006).

Crossref

Google Scholar

34

M Ouyang, Review on modeling and simulation of interdependent critical infrastructure systems. Reliability Eng Syst Saf 121, 43–60 (2014).

Crossref

Google Scholar

35

BU Forstmann, et al., Multi-modal ultra-high resolution structural 7-tesla mri data repository. Sci Data 1, 140050 (2014).

Crossref

PubMed

Google Scholar

36

R Baldick, et al., Initial review of methods for cascading failure analysis in electric power transmission systems - IEEE PES CAMS task force on understanding, prediction, mitigation and restoration of cascading failures. 2008 IEEE Power and Energy Society General Meeting - Conversion and Delivery of Electrical Energy in the 21st Century (IEEE, Piscataway, NJ), pp. 1–8

Google Scholar

37

SN Dorogovtsev, AV Goltsev, JFF Mendes, Critical phenomena in complex networks. Rev Mod Phys 80, 1275–1335 (2008).

Google Scholar

38

A Barrat, M Barthelemy, A Vespignani Dynamical Processes on Complex Networks (Cambridge Univ Press, Cambridge, UK, 2008).

Crossref

Google Scholar

39

Y Yang, AE Motter, Cascading failures as continuous phase-space transitions. Phys Rev Lett 119, 248302 (2017).

Crossref

PubMed

Google Scholar

40

D Kübler, G Weizsäcker, Limited depth of reasoning and failure of cascade formation in the laboratory. Rev Econ Stud 71, 425–441 (2004).

Crossref

Google Scholar

41

T Gross, CJD D’Lima, B Blasius, Epidemic dynamics on an adaptive network. Phys Rev Lett 96, 208701 (2006).

Crossref

PubMed

Google Scholar

42

J Lorenz, S Battiston, F Schweitzer, Systemic risk in a unifying framework for cascading processes on networks. Eur Phys J B 71, 441 (2009).

Crossref

Google Scholar

43

LJ LeBlanc, EK Morlok, WP Pierskalla, An efficient approach to solving the road network equilibrium traffic assignment problem. Transp Res 9, 309–318 (1975).

Crossref

Google Scholar

44

AJ Wood, BF Wollenberg Power Generation, Operation, and Control (Wiley, New York, 2012).

Google Scholar

45

MJ Eppstein, PDH Hines, A r̈andom chemistry algorithm for identifying collections of multiple contingencies that initiate cascading failure. IEEE Trans Power Syst 27, 1698–1705 (2012).

Crossref

Google Scholar

46

P Rezaei, PDH Hines, MJ Eppstein, Estimating cascading failure risk with random chemistry. IEEE Trans Power Syst 30, 2726–2735 (2015).

Crossref

Google Scholar

47

PDH Hines, I Dobson, P Rezaei, Cascading power outages propagate locally in an influence graph that is not the actual grid topology. IEEE Trans Power Syst 32, 958–967 (2017).

Google Scholar

48

KML Charland, et al., Effect of environmental factors on the spatio-temporal patterns of influenza spread. Epidemiol Infect 137, 1377–1387 (2009).

Crossref

PubMed

Google Scholar

49

T Koski, J Noble Bayesian Networks: An Introduction (Wiley, Chichester, UK) Vol 924 (2011).

Google Scholar

50

K Hampson, et al., Transmission dynamics and prospects for the elimination of canine rabies. PLoS Biol 7, e1000053 (2009).

Crossref

Google Scholar

51

L Danon, et al., Networks and the epidemiology of infectious disease. Interdiscip Perspect Infect Dis 2011, 28 (2011).

Crossref

Google Scholar

52

P Netrapalli, S Sanghavi, Learning the graph of epidemic cascades. Proceedings of the 12th ACM SIGMETRICS/PERFORMANCE Joint International Conference on Measurement and Modeling of Computer Systems (ACM, New York), pp. 211–222 (2012).

Crossref

Google Scholar

53

M Gomez Rodriguez, J Leskovec, B Schölkopf, Modeling information propagation with survival theory. Proceedings of the 30th International Conference on Machine Learning (ACM, New York) Vol 28, 666–674 (2013).

Google Scholar

54

M Gomez-Rodriguez, J Leskovec, D Balduzzi, B Schölkopf, Uncovering the structure and temporal dynamics of information propagation. Netw Sci 2, 26–65 (2014).

Crossref

Google Scholar

55

N Du, L Song, H Woo, H Zha, Uncover topic-sensitive information diffusion networks. Proceedings of the Sixteenth International Conference on Artificial Intelligence and Statistics (IJCAI), pp. 229–237 (2013).

Google Scholar

56

JH Beigel, et al., Avian influenza a (h5n1) infection in humans. New Engl J Med 353, 1374–1385 (2005).

Crossref

PubMed

Google Scholar

57

X Guan, C Chen, Using social media data to understand and assess disasters. Nat Hazards 74, 837–850 (2014).

Crossref

Google Scholar

58

Li Daqing, J Yinan, K Rui, S Havlin, Spatial correlation analysis of cascading failures: Congestions and blackouts. Sci Rep 4, 5381 (2014).

Crossref

PubMed

Google Scholar

59

F Gany, R Rau-Murthy, I Mujawar, Increasing influenza vaccination in New York city taxi drivers: A community driven approach. Vaccine 33, 2521–2523 (2015).

Crossref

PubMed

Google Scholar

60

; Centers for Disease Control and Prevention, Seasonal influenza vaccine effectiveness, 2005-2017. Available at https://www.cdc.gov/flu/professionals/vaccination/effectiveness-studies.htm. Accessed December 20, 2017. (2017).

Google Scholar

61

DJD Earn, J Dushoff, SA Levin, Ecology and evolution of the flu. Trends Ecol Evol 17, 334–340 (2002).

Crossref

Google Scholar

62

TC Germann, K Kadau, IM Longini, CA Macken, Mitigation strategies for pandemic influenza in the United States. Proc Natl Acad Sci USA 103, 5935–5940 (2006).

Crossref

PubMed

Google Scholar

63

V Belik, T Geisel, D Brockmann, Natural human mobility patterns and spatial spread of infectious diseases. Phys Rev X 1, 011001 (2011).

Google Scholar

64

DM Scott, DC Novak, L Aultman-Hall, F Guo, Network robustness index: A new method for identifying critical links and evaluating the performance of transportation networks. J Trans Geogr 14, 215–227 (2006).

Crossref

Google Scholar

65

JG Wardrop, Some theoretical aspects of road traffic research. Proc Inst Civil Eng 1, 325–362 (1952).

Crossref

Google Scholar

66

M Akbarzadeh, S Memarmontazerin, S Soleimani, Where to look for power laws in urban road networks? Appl Netw Sci 3, 4 (2018).

Crossref

PubMed

Google Scholar

67

DE Newman, et al., Risk assessment in complex interacting infrastructure systems. Proceedings of the 38th Annual Hawaii International Conference on System Sciences (IEEE, Piscataway, NJ), pp. 63c (2005).

Google Scholar

68

L Zonghua, H Bambi, Epidemic spreading in community networks. Europhys Lett 72, 315–321 (2005).

Crossref

Google Scholar

69

I Dobson, J Kim, KR Wierzbicki, Testing branching process estimators of cascading failure with data from a simulation of transmission line outages. Risk Anal 30, 650–662 (2010).

Crossref

PubMed

Google Scholar

70

; U.S.-Canada Power System OutageTask Force, Final report on the August 14th blackout in the United States and Canada: Causes and recommendations (US Dept of Energy, Washington, DC and National Resources Canada, Ottawa, ON, Canada), Report. (2004).

Google Scholar

Information & Authors

Information

Published in

Proceedings of the National Academy of Sciences

Vol. 115 | No. 35
August 28, 2018

PubMed: 30111540

Classifications

Copyright

Submission history

Published online: August 15, 2018

Published in issue: August 28, 2018

Keywords

Acknowledgments

We thank M. Elizabeth Halloran (University of Washington) for her invaluable help in revising and preparing this paper. This work was supported by National Science Foundation Civil, Mechanical and Manufacturing Innovation Grant 1536340 and National Institutes of Health Grant 1R01GM108731-01A1.

Notes

This article is a PNAS Direct Submission. Y.F. is a guest editor invited by the Editorial Board.

Authors

Affiliations

Xiangyang Guan https://orcid.org/0000-0002-4821-6551

Department of Civil and Environmental Engineering, University of Washington, Seattle, WA 98195

View all articles by this author

Cynthia Chen¹ [email protected]

Department of Civil and Environmental Engineering, University of Washington, Seattle, WA 98195

View all articles by this author

Notes

1

To whom correspondence should be addressed. Email: [email protected].

Author contributions: X.G. and C.C. designed research, performed research, contributed new reagents/analytic tools, analyzed data, and wrote the paper.

Competing Interests

The authors declare no conflict of interest.

Metrics & Citations

Metrics

Note: The article usage is presented with a three- to four-day delay and will update daily once available. Due to ths delay, usage data will not appear immediately following publication. Citation information is sourced from Crossref Cited-by service.

Citation statements

Altmetrics

Citations

If you have the appropriate software installed, you can download article citation data to the citation manager of your choice. Simply select your manager software from the list below and click Download.

Cited by

View Options

View options

PDF format

Download this article as a PDF file

DOWNLOAD PDF

Get Access

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Personal login Institutional Login

Recommend to a librarian

Recommend PNAS to a Librarian

Save for later

Purchase options

Purchase this article to get full access to it.

Single Article Purchase

General methodology for inferring failure-spreading dynamics in networks

Proceedings of the National Academy of Sciences

Vol. 115
No. 35
pp. 8645-E8326

Restore content access

Restore content access for purchases made as a guest

Featured Topics

Articles By Topic

Featured Topics

Articles By Topic

Featured Topic

Articles By Topic

Significance

Abstract

Sign up for PNAS alerts.

Model

Results

(a) Interdependent Infrastructure Cascading Failures.

(b) Influenza Epidemic.

(c) Congestion Cascade.

(d) Cascading Power Outages.

Computational Complexity.

Conclusion and Discussion

Materials and Methods

Acknowledgments

Supporting Information

References

Information

Published in

Classifications

Copyright

Submission history

Keywords

Acknowledgments

Notes

Authors

Affiliations

Notes

Competing Interests

Metrics

Citation statements

Altmetrics

Citations

Cited by

View options

PDF format

Get Access

Login options

Recommend to a librarian

Purchase options

Restore content access

Figures

Tables

Other

Share

Share article link

Share on social media

Further reading in this issue

Income inequality not gender inequality positively covaries with female sexualization on social media

Oceanographic boundaries constrain microbial diversity gradients in the South Pacific Ocean

Recurrent computations for visual pattern completion

Widespread seawater intrusions beneath the grounded ice of Thwaites Glacier, West Antarctica

Bodily maps of emotions

Solving the 250-year-old mystery of the origin and global spread of the German cockroach, Blattella germanica

Sign up for thePNAS Highlights newsletter