| 17.1.12.2 Algorithm (Correlation Coefficient)CorrCoef-Algorithm There are a number of coefficients which are appropriate to use under different circumstances. Among them, the most frequently-used one is Pearson's product moment correlation coefficient.
 Correlation CoefficientsPearson's product moment correlation coefficientPearson's  product moment correlation coefficient measures the linear relations between two variables.
 Let /math-ea2fb250ea955e9548d7d78941d80ec0.png?v=0) .and /math-5b7011069d54d7494a581c5a39afc3a0.png?v=0) be the standard deviations of two random variables X and Y respectively. Then the Pearson's  product moment correlation coefficient between the variables is /math-28e6ed2438f8ba9a554b7c2c5bfdcbbb.png?v=0) 
 where E(.) denotes the expected value of the variable, and cov(.) means covariance.
 To use this method, one should make sure that the interval data comes from paired observations, and that the variables are normally distributed. The data should not contain any extreme values, because they are apt to affect the result. Pearson's  product moment correlation coefficient could sometimes be misleadingly small when the variables have a non-linear relationship.
 Spearman Rank Correlation CoefficientSpearman Rank correlation coefficient is a non-parametric measure; therefore, it is suitable for data that is not normally distributed. It works better in detecting a non-linear relationship between two variables. It can be defined as
 /math-30e2b8b2169b7c14da1f976316e7c59c.png?v=0) 
 where d is the difference in statistical rank of corresponding variables.
 Because statistical rank is just the ordinal number of a value in a list, Spearman Rank correlation coefficient can be computed even when actual values of the variables are unknown.
 Kendall correlation coefficientKendall correlation coefficient, or Kendall tau, is equivalent to Spearman R in terms of their assumptions and statistical power. However, Kendal correlation coefficient has a more intuitive interpretation. And its algebraic structure is simpler. Furthermore, it does not require ordering of the data before the computation.
 Kendall correlation coefficient can be computed by
 /math-e77b5d7ffbaeb1048e2202d9f7dcec6c.png?v=0) 
 where C is the number of concordant pairs (pairs of observations that have the same signs), D is the number of discordant pairs (pairs of observations that have opposite signs), and q is defined in Significance Level of r.
 Significance of RPearson and Spearman typesFor Pearson and Spearman correlation types, let
 /math-24747472afeca9759e13e01893745921.png?v=0)
 where r is the correlation of two variables and N is number of observations.
 Then t follows a t-distribution with N-2 degrees of freedom. The two-tailed significance level can be calculated as:
 /math-ca08a21786c0a0775c6a839bc73e1a0b.png?v=0)
 Kendall typeFor Kendall correlation type, let
 /math-c939a27ffcf3bc1776bfe8fb5a0eac2c.png?v=0)
 where
 /math-548bc5538f0ef9eacf0ae3c5be40f69c.png?v=0)/math-5caaeb544cb4ccce475311401745a3c6.png?v=0)/math-e6c7791706cf93f4232f2c8534736995.png?v=0)/math-eb2c6e9601d25f02e8838ef730b5bd67.png?v=0)/math-c9f02f1e1911eedbf05988df3ce43feb.png?v=0)/math-a407e02e1a2d60aaea82067fef66b721.png?v=0)/math-ca7363f711e3049973f2371b26b477d6.png?v=0)/math-046af118cd5a4fed647d9ee0185f654a.png?v=0)
 Then z is approximated by a standard normal distribution. And the two-tailed significance level is:
 /math-2edf21ee0d07e7b466bf4ebb5bfec769.png?v=0)
 |