Abstract

The systemic stability of a stock market is one of the core issues in the financial field. The market can be regarded as a complex network whose nodes are stocks connected by edges that signify their correlation strength. Since the market is a strongly nonlinear system, it is difficult to measure the macroscopic stability and depict market fluctuations in time. In this article, we use a geometric measure derived from discrete Ricci curvature to capture the higher-order nonlinear architecture of financial networks. In order to confirm the effectiveness of our method, we use it to analyze the CSI 300 constituents of China’s stock market from 2005 to 2020 and the systemic stability of the market is quantified through the network’s Ricci-type curvatures. Furthermore, we use a hybrid model to analyze the curvature time series and predict the future trends of the market accurately. As far as we know, this is the first article to apply Ricci curvature to forecast the systemic stability of China’s stock market, and our results show that Ricci curvature has good explanatory power for the market stability and can be a good indicator to judge the future risk and volatility of China’s stock market.

1. Introduction

Through more than thirty years of development, China’s capital market has grown continuously. With improvements of the trading mechanism, the market stability has been gradually enhanced and the market plays a more and more important role in optimizing the social financing structure and promoting the allocation of resources. On the other hand, China’s financial market is in its infancy, and abnormal market fluctuations still occur occasionally. For example, from 2007 to 2008, the Shanghai Composite Index fell from 6124, the highest point, to 1664, a drop of 70%. During the market crash in 2015, the market experienced significant abnormal fluctuations which lasted for half a year. As the key factors of derivative pricing and financial risk management, it is of great significance to study how to measure and forecast the market stability reasonably and accurately. This kind of ability to analyze and predict the market is conducive to the objective and quantifiable evaluation of China’s financial market, the analysis of market stability factors, and the formulation of targeted policies so as to realize the early warning and prevention of financial risks and the maintenance of financial stability.

The stock market is a nonlinear and nonstationary system with strong volatility, tight coupling, and asymmetry. Individual stocks in the market interact with each other, and the abnormal fluctuations of individuals may quickly enlarge to the whole market. To better understand the highly correlated market, as well as to achieve monitoring and adjustment of it, economists advocate the use of many new tools and interdisciplinary approaches, such as trigger points, feedback, contagion, and complexity theory [16]. In particular, to describe the stability macroscopically, we should not consider each individual separately but should regard the market as a whole system, which coincides with the nature of complex networks [7, 8]. Empirical cross-correlation among stock prices has been extensively studied and explored for more than two decades [914]. The correlation between stock returns allows us to construct a variety of correlation-based networks, such as minimum spanning trees (MST) [10, 1517] or threshold networks [18], where nodes represent stocks and edges represent correlation strength (or converted to a distance metric). In recent years, correlation-based networks become one of the common tools for modeling and analyzing complex financial systems [14, 15, 1921].

Since there are interactions that occur among groups of more nodes besides pairwise interactions, to reveal the higher-order nonlinear relationship in a network [2225], curvature, which is a key concept in geometry proposed by Jost [26], can be an appropriate and powerful tool, and it has been increasingly used as network metrics in recent years [24, 25, 27, 28]. In 2015, Sandhu et al. [29] applied the graph curvature to cancer networks for the first time. Sandhu et al. [28] also studied the evolution of Ollivier–Ricci curvature in the financial threshold network and showed that Ollivier–Ricci curvature could be used to determine the stability of USA S & P-500 over the period 1998–2013. A recent study by Samal et al. [30] confirmed that discrete Ricci curvature could be an excellent indicator of stability and volatility for financial markets of USA and Japan. For the financial market in China, relevant studies have confirmed that it has significant small-world effect and scale-free feature [3133], which provide us a theoretical basis for the combination of network geometry and China’s financial market. In summary, the description of the stability of China’s stock market through geometric measurement is the first motivation of the research work in this article.

In addition to measure the stability, prediction of trends of the market is also an exciting research area and this is another main purpose of this article. We will use a hybrid machine learning model which combines deep neural network and wavelet decomposition to achieve this goal. We remark that, because the financial curvature time series are complex, nonstationary, and very noisy, the classic time series models, such as ARIMA, GARCH, et al., are not suitable for this task.

Since deep learning models can successfully extract features of real-world data, combining deep learning with financial market forecasting is regarded as a charming strategy [34]. Among them, recurrent neural network (RNN) [35, 36] is a kind of recursive neural network which is using sequence data as input, recursive in the direction of the evolution of sequence, and chained by all nodes. To overcome gradient disappearance and gradient explosion of RNN, a specific kind of RNN named long short-term memory (LSTM) [37, 38], which takes into account the long-term dependence of time series, is gradually used in time series forecasting. Kumar and Ningombam [39] evaluated the effectiveness of LSTM for making predictions about stock prices of APPL (Apple Inc./NASDAQ). Liu [40] applied LSTM to the large interval volatility forecasting of S&P 500 and AAPL and finally concluded that LSTM could achieve a better forecasting result than GARCH (1, 1). Huang, et al. [41] decomposed financial data into long-term and short-term trends by variational mode decomposition and then utilized LSTM to predict the future trends of the sequences.

Wavelet decomposition (WD) is an approach that describes the relationship between the time series in time and frequency domains simultaneously. Through wavelet decomposition, the noise feature of time series can be fixed. Therefore, it is natural to combine wavelet decomposition and forecasting models to improve the prediction accuracy of time series. In the research of impact of COVID-19 on the global economy, Štifanić, et al. [42] integrated the stationary wavelet transform and bidirectional long short-term memory neural network to forecast crude oil and stock prices and achieved satisfactory results. Peng, et al. [43] applied a LSTM-based model into energy consumption forecasting, which also combined wavelet decomposition and LSTM, and achieved a prediction accuracy better than the basic LSTM model.

In the present article, according to the previous works and our two main purposes, we first construct a threshold network based on the daily returns of the constituents of the CSI 300 index over 16 years. The main objective of this study is to confirm that discrete Ricci curvature can be applied to networks of China’s stock market and can accurately describe its systemic stability. We find that Ricci curvature provides a good response to the systemic characteristic of the financial market in China and we can use this tool to identify important events (good or bad) in the market. As another main contribution, we develop a hybrid forecasting model which provides a good response to the future trends of the market.

2. Preliminaries

2.1. Graph and Minimum Spanning Tree

In mathematics, a network is usually called graph, which is composed of a finite set of nodes and a set of edges between nodes, denoted as , where is the graph, is the set of nodes in , and is the set of edges in . Table 1 lists some of the concepts related to graph.

For brevity, we only discuss undirected graphs. Two nodes of a graph are said to be connected if there is a path between them. If any two nodes in the graph are connected, the graph is called a connected graph. The spanning tree of a connected graph with n vertices is a connected subgraph that contains all n vertices but has only n − 1 edges. If an edge is added to a spanning tree, it necessarily forms a ring, and if an edge is reduced, it is no longer a connected graph.

2.1.1. Minimum Spanning Tree (MST)

In a given undirected graph , represents the edge connecting nodes u and , and represents the weight of this edge. If there exists T which is a spanning tree of and is minimal, T is called a minimal spanning tree of . We usually use Prim’s algorithm [44] to implement the construction of a minimum spanning tree of a graph.

2.2. Ricci-Type Curvatures for Network Analysis

As an important geometric quantity, the classical Ricci curvature quantifies the deviation for the tangent direction and requires a smooth manifold as well as a tensor and higher-order derivatives [26]. This requirement is not applicable to discrete graphs or networks, so it is necessary to discretize it to apply in networks. In this work, we apply four different types of discrete Ricci curvatures to the threshold network of China’s stock market. Their definitions and applications can be found in many relevant literature studies. For completeness, we briefly describe their definitions here.

2.2.1. Ollivier–Ricci Curvature

This is a widely used discretization [25, 27, 28] of the classical Ricci curvature raised by Ollivier [45, 46]. In recent years, it has also been applied to financial networks [29, 30]. In a space with positive curvature, the average distance between balls is less than the center distance, while in a negative curved space, the opposite conclusion is reached. Ollivier–Ricci (OR) curvature extends the previous observations from balls (volumes) to measures (probabilities), and the OR curvature of the edge e connecting nodes u and is defined as in (1), and represent measures concentrated at nodes u and , is the distance between u and , and is the Wasserstein distance [47] between the discrete probability measures and . The Wasserstein distance is given by where is the set of probability measures that satisfy

In addition, the probability distribution for must be specified, which is chosen to be uniform over the neighboring nodes of u [48].

2.2.2. Forman–Ricci Curvature

Forman–Ricci (FR) curvature is based on the relationship between the Riemannian Laplace operator and the Ricci curvature [49]. It has been shown that FR curvature and edge betweenness centrality are highly correlated [25, 50]. In the undirected network, the FR curvature of edge e connecting nodes u and is defined as [24]. where , , and denote the weights of the edge e, the nodes u and , respectively. In addition, and denote the set of edges connecting u and , respectively, but excluding the edge e.

2.2.3. Menger–Ricci Curvature

Menger’s approach [51] is based on viewing the graph as a metric space, and the path length between two nodes is treated as the distance between two points in the metric space. Suppose T is a triangle in the metric space with sides a, b, and c, then Menger curvature of T is given by where and is the radius of the circumscribed circle of the triangle T. Then, Menger–Ricci (MR) curvature of an edge e in a network can be defined as [52] where denotes the set of triangles formed by side e.

2.2.4. Haantjes–Ricci Curvature

Haantjes [53] defined the curvature of a curve in a metric space as the ratio of the arc length to the chord length of the curve. For a discrete network, suppose that is a simple path between nodes and , is the length of the path, and is the shortest distance between nodes and . Haantjes–Ricci (HR) curvature of the simple path π is

Then, HR curvature of an edge e can be defined as where πe denotes the paths that connects the nodes anchoring the edge e.

The previous four discretizations focus on capturing different geometric properties portrayed by the classical Ricci curvature. OR curvature can well capture the aspect of volume growth of classical Ricci curvature. We use OR curvature in networks to compare the average distance between two nodes. FR curvature depicts the geodesic diffusivity of the classical Ricci curvature, and we use FR curvature in networks to show the information spread at the ends of edges. Both MR and HR curvatures can capture the geodesics dispersal rate of the classical Ricci curvature. In this work, we ignore the weights of the edges in the network and calculate the average of edges for these four discrete Ricci curvatures according to equations (1)–(8), respectively, and considering the computational complexity, we only use the path between nodes whose length is less than or equal to 4 in the calculations of MR and HR curvatures.

We remark that since we have already considered the correlation coefficients of the stock’s return time series during network construction, even if the weights of edges are set to be equal to 1, the threshold network can still reflect the mutual influence between stocks and describe the main transmission mechanism of information in the market. At the same time, taking uniform weights equal to 1 can obviously save computing resources and improve computing efficiency.

2.3. Discrete Wavelet

Wavelet analysis is a time-frequency analysis method and can achieve high resolution in both time and frequency domains. Through decomposing the curvature time series of our financial networks into several components based on various frequencies, wavelet analysis is able to filter out the chaotic components so as to remove the influence of noises and improve the prediction performance effectively.

The wavelet transform is roughly divided into continuous transform and discrete transform and both are based on two specific functions: mother wavelet function and daughter wavelet function. For the continuous case, assuming and are the Fourier transform of , is called mother wavelet function, if meets and the definition of daughter wavelet function is as follows: where a and b are, respectively, called the expansion factor and translation factor.

Due to the fact that our curvature data are based on the daily returns of stocks, we utilize the discrete wavelet transform to decompose the time series. Assigning and to a and b in equation (10), discrete daughter wavelet function is as follows: where . For brevity, we use instead of from now on. The discrete wavelet transform corresponding to is as follows: where and are the conjugates of .

Our denoising process of wavelet decomposition is divided into the following three steps:Step 1: we determine a wavelet function and the number of decomposition layers and then decompose the original time seriesStep 2: we select an appropriate threshold to eliminate the fluctuations exceeding the threshold and retain the specific signalsStep 3: we reconstruct the retained signals to form a new signal

3. Data and Methods

3.1. Data Description

The data of this article are collected from East money (https://www.eastmoney.com), including daily closing prices for N = 111 stocks, T = 3889 trading days, from January 4, 2005 to December 31, 2020. All the N = 111 stocks are constituents of the CSI 300 index. Due to some unavoidable factors such as stock suspensions, some stocks are missing their prices on certain trading days. Considering that the stock prices do not change too much in a short period of time, we fill the gaps with the data of previous trading time.

First, for each stock, we construct a daily return time series according to the formula as follows: where , and are the adjusted closing price of the kth stock at time t. Then, the equal-time Pearson cross-correlation coefficients of the daily return time series of stock i and stock j are defined as where is the covariance of and in a time interval of length τ, and t indicates the end date of the interval of τ trading days. In our empirical research, we use the following two schemes to divide time series in order to better illustrate the reliability of our conclusion by comparison of these two approaches.(i)A nonoverlapping time interval of τ = 22 trading days (one trading month)(ii)An overlapping time interval of τ = 22 days, with a rolling shift of Δ = 5 trading days (one trading week)

Corresponding to correlation coefficients, we construct the distance measures dij which are widely used for the construction of financial networks [10, 15, 54].

3.2. Threshold Network Construction

First, for a given time interval of τ trading days ending on trading day t, we get a distance matrix whose elements are . This distance matrix can be considered as an edge-weighted complete graph , whose nodes are stocks and the weight of an edge between stocks i and j is given by . Next, with the help of Prim’s algorithm [44], we create MST based on the complete graph , which selects the most relevant connections of the stocks. Finally, to capture more significant information in the market, we add edges in to connect nodes i and j in if for some threshold θ. The complete graph constructed by MST and the threshold θ is called a threshold network and is denoted as .

In this article, we set the threshold θ = 0.75 and use for calculating different kinds of Ricci curvatures.

3.3. The Hybrid Forecasting Model

Due to the fact that the curvature time series is composed of nonlinear features, various temporal information pieces, and noises, it is challenging to achieve an accurate forecasting result. Wavelet decomposition can analyze the series from different scales, which can not only reflect the overall trend but also extract the effective information of the series in details. On the other hand, as a deep learning model, LSTM is able to learn long-term correlations and mine complicated nonlinear relationships within the curvature series effectively. Based on the previous facts, we propose a hybrid WD-LSTM model, combining the strengths of wavelet decomposition and long short-term memory network, to forecast the future trends of the market. The WD-LSTM model involves three phrases: decomposition, forecasting, and integration. In the decomposition phrase, we decompose the original curvature series data into four high frequency sequences (detail) and one low frequency sequence (approximation). Next, in the forecasting phrase, LSTM is utilized to forecast each decomposed sequence, respectively. Finally, the prediction results of all subsequences are aggregated in the integration phrase. The architecture of the WD-LSTM model is shown in Figure 1.

LSTM used in the forecasting phrase is a specially designed RNN and suitable for processing and forecasting important events with very long intervals and delays in the time series. The architecture of LSTM at time t is composed of four units: forget gate, input gate, output gate, and cell state, which is shown in Figure 2. To clarify the details of LSTM, we use , U, and b with different subscripts to denote the linear coefficients and biases of these units.

The output ft of forget gate at time t represents the probability of forgetting the hidden cell state of the previous layer, which can be calculated by where σ is the sigmoid activation function, ht−1 denotes the state of the hidden layer at time t − 1, and xt denotes the input vector at time t.

The input gate is responsible for processing the current input signal and composed of two parts depending on sigmoid and tanh activation functions, respectively. This gate can be formulated as

The cell state is updated according to forget gate and input gate which is formulated as where denotes the Hadamard product.

The output gate is formulated as with the output state Ot and hidden cell state Ct at time t, the hidden state of the cell is updated as

Finally, we set a forecast unit which is a fully connected neural network with outputs as the forecasting values Yt of the time series at time t + 1 according to the hidden state ht:

To complete the building of the whole LSTM model, we set four layers including the input layer, LSTM layer, fully connected layer, and regression layer, as shown in Figure 1, where the regression layer is used to give the mean square error of the outputs.

4. Empirical Results

4.1. Market Stability

Exploring the explanatory power of Ricci curvature for the stability of China’s stock market is one of the main purposes of this article. We analyze the logarithmic returns of constituents of the CSI 300 index over a 16-year period (2005–2020) by means of building the undirected network with the threshold θ = 0.75. The MST and threshold network constructed based on the data are shown in Figure 3, and Table 2 lists some of the ticker symbols corresponding to numbers of nodes in the figures.

Figure 4 depicts four curvature time series of the threshold network building with nonoverlapping time intervals (τ = 22 trading days) and Figure 5 with a rolling shift of Δ = 5. Obviously, the fluctuation trends of the curvature time series which are obtained by using two different data processing methods are essentially consistent, which confirm the generalization performance of our methods and the reliability of our conclusions.

We list some of the major events in China’s financial market between 2005 and 2020 in Table 3. As key events in the market, during these events, the rule, structure, participants, or external environment of the market have changed significantly and the stability should be poorer than the normal periods. To verify the effectiveness of the geometric quantities of networks, we compare these events and the curvature time series and find out that the fluctuations of the curvature time series can capture these key information pieces of the market well. Some of the events are marked with dotted lines in Figures 4 and 5.

Combining the results in Figures 4 and 5 and the events in Table 3, we find that the four discrete Ricci curvatures can depict the market stability. During the periods of those key events, the curvature time series fluctuates to different degrees. In particular, when the news is significantly good or bad, the time series shows large fluctuations. We therefore believe that the discrete Ricci curvatures can serve as good indicators of the stability for China’s stock market.

4.2. Forecasting of the Systemic Stability

To accomplish another main purpose, we apply the WD-LSTM model to analyze the curvature time series and forecast the future trend of China’s stock market. The WD-LSTM model contains three phrases: decomposition, forecast, and integration. The empirical results through the previous three phrases are presented in detail in the following section.

4.2.1. Decomposition of Curvature Series

According to (10) and (12), we first decompose the original curvature series into four high frequency sequences (detail) and one low frequency sequence (approximation). For brevity, we choose FR curvature series (Δ = 5) as an example and present its decomposition results in Figure 6.

4.2.2. Forecast of Decomposed Sequences

The second step of the WD-LSTM model is to forecast each component decomposed by the WD module by using the LSTM module. In our experiment, each decomposed sequence is divided into the training set and testing set according to the proportion of 80% and 20%. Since τ = 22 and Δ = 5, the training time series is from February 2, 2015 to November 2, 2017. The number of LSTM layers is set to be 200. While in the process of training the LSTM model, the max iteration and the initial learning rate is set to be 250 and 0.005. Besides, the optimizer of LSTM is chosen to be Adam and the gradient threshold is set to be 1. After training, by using the back-propagation algorithm, we use the hidden state ht−1 to forecast the value at time t, where t is from November 9, 2017 to December 31, 2020.

Figure 7 presents the forecasting result of decomposed sequences of FR curvature series.

4.2.3. Integration of Forecasting Results

The final step of the WD-LSTM model is to integrate the forecasting results of decomposed sequences. After the integration phrase, we can get the final forecasting results of the curvature series. We show the final forecasting results of the four Ricci-type curvature series in Figure 8. We also list the evaluation metrics of the final forecasting results, including mean absolute error (MAE), mean square error (MSE), and R2, in Table 4.

4.3. Model Comparison and Empirical Summary

To verify the superiority of the WD-LSTM model, in this subsection, we carry out a comparative experiment where a basic LSTM model is utilized to forecast the four Ricci-type curvature series directly. Table 5 presents the evaluation metrics of the single LSTM model’s final forecasting results.

Comparing Tables 4 and 5, it is obvious that for each evaluation metric, the forecasting performance of the WD-LSTM model is significantly better than that of the basic LSTM model for all the four Ricci-type curvature series. It implies that the wavelet decomposition plays a remarkable role and the hybrid model can handle the strong nonlinearity, complex time characteristics, and noise interference of the curvature series better than the single LSTM model.

Furthermore, there must be performance differences between the four Ricci curvatures. Samal et al. [30] have shown that FR curvature is more sensitive and can detect both crashes and bubbles in USA S & P-500 and Japanese Nikkei-225 markets more efficiently. For China’s stock market, comparison of R2 metrics for the four kinds of curvatures in Tables 4 and 5 obviously implies that the performance of the hybrid model is better than a single LSTM model. In the hybrid model, all the four curvatures have excellent explanatory power for depicting and forecasting the stability of China’s stock market. In particular, the R2 metric of OR curvature series is closer to 1 than those of the other three. We can infer that the OR curvature series is more suitable for China’s market, which is different from the conclusion about the foreign markets. This may reflect the different characteristics of China’s and foreign markets. According to the definitions of these two curvatures, FR curvature is mainly aimed at capturing the diffusion characteristics of the geodesic, which is more sensitive to events than other curvatures, and can better capture the details of the market while OR curvature measures the relative distance between two respective neighborhoods of two vertices that form an edge. Therefore, it is more suitable for China’s market where macrocontrol measures are implemented more effectively and the comovement effect of the stock sectors is more obvious.

5. Conclusion

In this article, we apply different types of discrete Ricci curvatures of networks to characterize the systemic stability of China’s stock market. We verify the reliability of our methods by monitoring the fluctuations of the constituents of the CSI 300 index from 2005 to 2020 in conjunction with Table 3. We find that network curvatures can be used as good indicators for the systemic stability of China’s stock market.

Based on the above, we also make a more in-depth application of the geometric measure. A hybrid WD-LSTM model, combing wavelet decomposition with long short-term memory network, is applied to forecast the future trends of the systemic stability for China’s stock market by means of modeling and predicting the curvature series data. Comparing to the single LSTM model, the WD-LSTM model performs significantly better. Moreover, the empirical result shows that OR curvature is most suitable for China’s market and the proposed hybrid model has excellent forecasting performance.

In summary, we use discrete Ricci curvature as a measure of the stability for China’s financial market and apply an effective hybrid model to forecast the future trends. Our methods and models are very helpful to develop new financial regulatory tools to better identify, forecast, and prevent market risks and contribute to financial stability.

Data Availability

The data used to support the findings of this study are available from the corresponding author upon request.

Disclosure

A preprint of the research has previously been published on arXiv [55].

Conflicts of Interest

The authors declare that they have no conflicts of interest.

Acknowledgments

This research was supported by the National Natural Science Foundation of China (no. 12271039), New Liberal Arts Research and Reform Practice Project of the Ministry of Education (no. 2021060011), and the Emerging Interdisciplinary Project of CUFE.