Computational Statistics and Data Analysis 55 (2011) 1357–1366
Contents lists available at ScienceDirect
Computational Statistics and Data Analysis
journal homepage: www.elsevier.com/locate/csda
Partial correlation with copula modeling
Jong-Min Kim
a,∗
, Yoon-Sung Jung
b
, Taeryon Choi
c
, Engin A. Sungur
a
a
Statistics Discipline, Division of Science and Mathematics, University of Minnesota at Morris, Morris, MN, 56267, USA
b
Office of Research, Alcorn State University, Alcorn State, MS, 39096, USA
c
Department of Statistics, Korea University, Seoul, 136-701, South Korea
article info
Article history:
Received 26 January 2010
Received in revised form 21 September
2010
Accepted 22 September 2010
Available online 16 October 2010
Keywords:
Partial correlation
Gaussian copula
Gene network
abstract
We propose a new partial correlation approach using Gaussian copula. Our empirical
study found that the Gaussian copula partial correlation has the same value as that which
is obtained by performing a Pearson’s partial correlation. With the proposed method,
based on canonical vine and d-vine, we captured direct interactions among eight histone
genes.
© 2010 Elsevier B.V. All rights reserved.
1. Introduction
The current Pearson partial correlation approach is popular because of the simple computation advantage it confers. But
the current approach has many drawbacks: for example, it does not exist if the first or second moments do not exist. Possible
values depend on the marginal distributions which are not invariant under non-linear strictly increasing transformations
(Kurowicka and Cooke, 2006). This was our motivation to propose a new approach to partial correlation using copula,
specifically a Gaussian copula. Since Sklar (1959) proposed the theorem of the copula, numerous copula functions have been
introduced in the last five decades. Recently, Nelson (2006) summarized the theories of numerous copula functions and Yan
(2007) developed the R package of multivariate dependence with copulas. But most copulas have a limitation which fails
to satisfy the copula properties when extended from bivariate to multivariate cases. To overcome this limitation, Aasa et al.
(2009) proposed pair-copula constructions of multiple dependence, based on the work of Bedford and Cooke (2002). Since
model construction is hierarchical, it is not simple to incorporate more variables in the conditioning sets with pair-copula
which uses the inverse of the conditional bivariate distribution function, h-function inverse. But pair-copula constructions
by Aasa et al. (2009) are promising way to derive a partial correlation, so we adopted a Gaussian bivariate copula by using the
conditional distributions to find a partial correlation. To find a partial correlation, we derive a conditional standard normal
distribution by using multivariate normal distribution properties and estimate the partial correlation coefficient by the
Gaussian copula. In the general theory of partial correlation, the partial correlation coefficient is a measure of the strength
of the linear relationship between two variables after we control for the effects of other variables. If the two variables of
interest are Y and X , and the control variables are Z
1
, Z
2
,..., Z
n
, then we denote the corresponding partial correlation
coefficient by ρ
YX |Z
1
,Z
2
,...,Z
n
.
∗
Corresponding author.
E-mail address: jongmink@morris.umn.edu (J.-M. Kim).
0167-9473/$ – see front matter © 2010 Elsevier B.V. All rights reserved.
doi:10.1016/j.csda.2010.09.025