The dependency network approach provides a system level analysis of the activity and
topology
In mathematics, topology (from the Greek words , and ) is concerned with the properties of a geometric object that are preserved under continuous deformations, such as stretching, twisting, crumpling, and bending; that is, without closing ho ...
of directed
networks
Network, networking and networked may refer to:
Science and technology
* Network theory, the study of graphs as a representation of relations between discrete objects
* Network science, an academic field that studies complex networks
Mathematics
...
. The approach extracts causal topological relations between the network's nodes (when the network structure is analyzed), and provides an important step towards inference of causal activity relations between the
network node
In telecommunications networks, a node (, ‘knot’) is either a redistribution point or a communication endpoint. The definition of a node depends on the network and protocol layer referred to. A physical network node is an electronic devic ...
s (when analyzing the network activity). This methodology has originally been introduced for the study of financial data,
[Dror Y. Kenett, Yoash Shapira, Gitit Gur-Gershgoren, and Eshel Ben-Jacob (submitted), Index Cohesive Force analysis of the U.S. stock market, Proceedings of the 2011 International Conference on Econophysics, Kavala, Greece] it has been extended and applied to other systems, such as the
immune system
The immune system is a network of biological processes that protects an organism from diseases. It detects and responds to a wide variety of pathogens, from viruses to parasitic worms, as well as Tumor immunology, cancer cells and objects such ...
,
[Asaf Madi, Dror Y. Kenett, Sharron Bransburg-Zabary, Yifat Merbl, Francisco J. Quintana, Stefano Boccaletti, Alfred I. Tauber, Irun R. Cohen, and Eshel Ben-Jacob (2011), Analyses of antigen dependency networks unveil immune system reorganization between birth and adulthood]
Chaos 21, 016109
and
semantic networks
A semantic network, or frame network is a knowledge base that represents semantic relations between concepts in a network. This is often used as a form of knowledge representation. It is a directed or undirected graph consisting of vertices, ...
.
In the case of network activity, the analysis is based on
partial correlation
In probability theory and statistics, partial correlation measures the degree of association between two random variables, with the effect of a set of controlling random variables removed. When determining the numerical relationship between two ...
s.
[Kunihiro Baba, Ritel Shibata, Masaaki Sibuya (2004), Partial correlation and conditional correlation as measures of conditional independence]
Aust New Zealand J Stat 46(4): 657–774
/ref>[Yoash Shapira, Dror Y. Kenett, and Eshel Ben-Jacob (2009), The Index Cohesive Effect on Stock Market Correlations]
/ref>[Dror Y. Kenett, Matthias Raddant, Thomas Lux, and Eshel Ben-Jacob (submitted), Evolvement of uniformity and volatility in the stressed global market, PNAS][Eran Stark, Rotem Drori and ]Moshe Abeles
Moshe Abeles ( he, משה אבלס; born 1936 in Tel Aviv) is an Israeli brain researcher and neurophysiologist. He is emeritus professor at the Hebrew University in Jerusalem and at the Life Science Faculty of Bar Ilan University in Ramat Gan ...
(2006), Partial Cross-Correlation Analysis Resolves Ambiguity in the Encoding of Multiple Movement Features
J Neurophysiol 95: 1966–1975
/ref> In simple words, the partial (or residual) correlation
In statistics, correlation or dependence is any statistical relationship, whether causal or not, between two random variables or bivariate data. Although in the broadest sense, "correlation" may indicate any type of association, in statisti ...
is a measure of the effect (or contribution) of a given node, say ''j'', on the correlations between another pair of nodes, say ''i'' and ''k''. Using this concept, the dependency of one node on another node is calculated for the entire network. This results in a directed weighted adjacency matrix
In graph theory and computer science, an adjacency matrix is a square matrix used to represent a finite graph. The elements of the matrix indicate whether pairs of vertices are adjacent or not in the graph.
In the special case of a finite simple ...
of a fully connected network. Once the adjacency matrix has been constructed, different algorithms can be used to construct the network, such as a threshold network, Minimal Spanning Tree (MST), Planar Maximally Filtered Graph (PMFG), and others.
Importance
The partial correlation based dependency network is a class of correlation network, capable of uncovering hidden relationships between its nodes.
This original methodology was first presented at the end of 2010, published in PLoS ONE. The authors quantitatively uncovered hidden information about the underlying structure of the U.S. stock market, information that was not present in the standard correlation
In statistics, correlation or dependence is any statistical relationship, whether causal or not, between two random variables or bivariate data. Although in the broadest sense, "correlation" may indicate any type of association, in statisti ...
networks. One of the main results of this work is that for the investigated time period (2001–2003), the structure of the network was dominated by companies belonging to the financial sector
Financial services are the economic services provided by the finance industry, which encompasses a broad range of businesses that manage money, including credit unions, banks, credit-card companies, insurance companies, accountancy companies ...
, which are the hubs in the dependency network. Thus, they were able for the first time to quantitatively show the dependency relationships between the different economic sectors
One classical breakdown of economic activity distinguishes three sectors:
* Primary: involves the retrieval and production of raw-material commodities, such as corn, coal, wood or iron. Miners, farmers and fishermen are all workers in the pr ...
. Following this work, the dependency network methodology has been applied to the study of the immune system
The immune system is a network of biological processes that protects an organism from diseases. It detects and responds to a wide variety of pathogens, from viruses to parasitic worms, as well as Tumor immunology, cancer cells and objects such ...
, and semantic networks
A semantic network, or frame network is a knowledge base that represents semantic relations between concepts in a network. This is often used as a form of knowledge representation. It is a directed or undirected graph consisting of vertices, ...
.
Overview
To be more specific, the partial correlation of the pair ''(i, k)'' given ''j'', is the correlation between them after proper subtraction of the correlations between ''i'' and ''j'' and between ''k'' and ''j''. Defined this way, the difference between the correlations and the partial correlations provides a measure of the influence of node ''j'' on the correlation
In statistics, correlation or dependence is any statistical relationship, whether causal or not, between two random variables or bivariate data. Although in the broadest sense, "correlation" may indicate any type of association, in statisti ...
. Therefore, we define the influence of node ''j'' on node ''i'', or the dependency of node ''i'' on node ''j'' − ''D''(''i'',''j''), to be the sum of the influence of node ''j'' on the correlations of node ''i'' with all other nodes.
In the case of network topology, the analysis is based on the effect of node deletion on the shortest paths between the network nodes. More specifically, we define the influence of node ''j'' on each pair of nodes ''(i,k)'' to be the inverse of the topological distance between these nodes in the presence of ''j'' minus the inverse distance between them in the absence of node ''j''. Then we define the influence of node ''j'' on node ''i'', or the dependency of node ''i'' on node ''j'' − ''D''(''i'',''j''), to be the sum of the influence of node ''j'' on the distances between node ''i'' with all other nodes ''k''.
The activity dependency networks
The node-node correlations
The node-node correlations can be calculated by Pearson’s formula:
:
Where and are the activity of nodes ''i'' and ''j'' of subject n, ''μ'' stands for average, and sigma the STD of the dynamics profiles of nodes'' i'' and ''j''. Note that the node-node correlations (or for simplicity the node correlations) for all pairs of nodes define a symmetric correlation matrix whose element is the correlation between nodes ''i'' and ''j''.
Partial correlations
Next we use the resulting node correlations to compute the partial correlations. The first order partial correlation coefficient is a statistical measure indicating how a third variable affects the correlation between two other variables. The partial correlation between nodes ''i'' and ''k'' with respect to a third node is defined as:
:
where and are the node correlations defined above.
The correlation influence and correlation dependency
The relative effect of the correlations and of node ''j'' on the correlation ''C''(''i'',''k'') is given by:
:
This avoids the trivial case were node ''j'' appears to strongly affect the correlation , mainly because and have small values. We note that this quantity can be viewed either as the correlation dependency of ''C''(''i'',''k'') on node ''j'' (the term used here) or as the correlation influence of node ''j'' on the correlation ''C''(''i'',''k'').
Node activity dependencies
Next, we define the total influence of node ''j'' on node ''i'', or the dependency ''D''(''i'',''j'') of node ''i'' on node ''j'' to be:
:
As defined,''D''(''i'',''j'') is a measure of the average influence of node ''j'' on the correlations ''C(i,k)'' over all nodes ''k'' not equal to ''j''. The node activity dependencies define a dependency matrix ''D'' whose (''i'',''j'') element is the dependency of node ''i'' on node ''j''. It is important to note that while the correlation matrix ''C'' is a symmetric matrix, the dependency matrix D is nonsymmetrical – since the influence of node ''j'' on node ''i'' is not equal to the influence of node ''i'' on node ''j''. For this reason, some of the methods used in the analyses of the correlation matrix (e.g. the PCA) have to be replaced or are less efficient. Yet there are other methods, as the ones used here, that can properly account for the non-symmetric nature of the dependency matrix.
The structure dependency networks
The path influence and distance dependency: The relative effect of node ''j'' on the directed path – the shortest topological path with each segment corresponds to a distance 1, between nodes ''i'' and ''k'' is given:
:
where and are the shortest directed topological path from node ''i'' to node ''k'' in the presence and the absence of node ''j'' respectively.
Node structural dependencies
Next, we define the total influence of node ''j'' on node ''i'', or the dependency ''D''(''i'',''j'') of node ''i'' on node ''j'' to be:
As defined, ''D''(''i'',''j'') is a measure of the average influence of node ''j'' on the directed paths from node ''i'' to all other nodes ''k''. The node structural dependencies define a dependency matrix ''D'' whose (''i'',''j'') element is the dependency of node ''i'' on node ''j'', or the influence of node ''j'' on node ''i''. It is important to note that the dependency matrix D is nonsymmetrical – since the influence of node ''j'' on node ''i'' is not equal to the influence of node ''i'' on node ''j''.
Visualization of the dependency network
The dependency matrix is the weighted adjacency matrix, representing the fully connected network. Different algorithms can be applied to filter the fully connected network to obtain the most meaningful information, such as using a threshold approach, or different pruning algorithms. A widely used method to construct informative sub-graph of a complete network is the Minimum Spanning Tree (MST).[Rosario N. Mantegna, Hierarchical structure in Financial markets]
Eur. Phys. J. B 11 (1), 193–197 (1999)
Another informative sub-graph, which retains more information (in comparison to the MST) is the Planar Maximally Filtered Graph (PMFG)[Michele Tumminello, Tomaso Aste, Tiziana Di Matteo and Rosario N. Mantegna, A tool for filtering information in complex systems, PNAS 102 (30), 10421–10426 (2005)] which is used here. Both methods are based on hierarchical clustering
In data mining and statistics, hierarchical clustering (also called hierarchical cluster analysis or HCA) is a method of cluster analysis that seeks to build a hierarchy of clusters. Strategies for hierarchical clustering generally fall into tw ...
and the resulting sub-graphs include all the ''N'' nodes in the network whose edges represent the most relevant association correlations. The MST sub-graph contains edges with no loops while the PMFG sub-graph contains edges.
See also
*Semantic lexicon
A semantic lexicon is a digital dictionary of words labeled with semantic classes so associations can be drawn between words that have not previously been encountered. Semantic lexicons are built upon semantic networks, which represent the semanti ...
*Dependency network (graphical model)
Dependency networks (DNs) are graphical models, similar to Markov networks, wherein each vertex (node) corresponds to a random variable and each edge captures dependencies among variables.
Unlike Bayesian networks, DNs may contain cycles.
Ea ...
References
{{Reflist
Network analysis