Correlation


Description


Examines the (increasing or decreasing) relationship between two or more continuous or ordinal variables.

Dialog


There are two lists (the 'Variable' and 'With' lists) which can be used to determine which pairs of variables will have correlations calculated. If there are variables present in the 'With' list, all 'With' variables will be correlated with all 'Variable' variables. Otherwise, correlations will be calculated between every possible pair.

Options

The alternative hypothesis can be specified, as well as the confidence level of the intervals. A number of printing options are also available:

  1. Print Matrix - Should any output be printed
  2. N - Sample Size
  3. CI - Confidence interval
  4. Stat - Test statistic
  5. p-value - The p-value of the correlation test

Plots

Three types of plots are available for correlations. The third, a full correlation matrix, is only available if no w'With' variables have been specified.

  1. Scatter Plots - an array of scatter plots
    1. Lines - type of regression line to show linear, loess (smoothed), or none
    2. Alpha - Transparency of the points
    3. Common Axis - row and colummn-wise common axes
  2. Circles - An array of circles whose size and colour indicate correlation strength and direction
    1. Max Radius - Size of the biggest possible circle
  3. Correlation Matrix - The lower triangle of the matrix has scatter plots, and the upper has the correlation values.
    1. Lines - type of regression line to show linear, loess (smoothed), or none
    2. Max Size - Maximum size of the correlation text
    3. Alpha - Transparency of the points

Correlations

Correlations measure the strength of the relationship between two variables. A correlation can range from -1 to 1, with a correlation of 0 indicating no relationship. If the correlation is negative, there is a negative relationship between the two variables (i.e. one goes up as the other goes down). Likewise, if the correlation is positive, the relationship is positive (i.e. when one goes up, so does the other). The three most common correlation statistics are supported by this dialog.

Pearson's

This is the "Usual" correlation. It estimates the strength of the linear relationship between two variables. It assumes that the sample size is large enough, and that the relationship is actually linear. It is also sensitive to outliers.

Spearman's

This rank correlation is obtained by rank transforming both variables, and then calculating the pearson's correlation on the transformed data. Doing this makes it insensitive to outliers, and relaxes the linearity requirement (it only requires that the relationship be monotonic.

Kendall's

Like Spearmans, Kendall's correlation is not sensitive to outliers, and will detect any monotonic relationship.

Examples


Do people lie about their height and weight?