A higher number denotes higher dependency. The proposed measure uses a number in the closed interval [0, 1] to indicate the degree of nonlinear correlation in the multivariable data set concerned, with 0 and 1 denoting the weakest and the strongest relationship, respectively. Correlation is a statistical method used to measure the strength and direction of association between a pair of variables. N = number of items ranked. In statistics, correlation is a measure that describes mathematically the kind of linear relationship two variables have. Example: Let's say we have a dataset of the height and weight of ten males. Correlation Coefficient. If there isn't a direct measure, how can we achieve this? What measures do we have to analyse the relevance of a categorical feature to the target value? A correlation test is used to evaluate the association between two or more variables. A Pearson correlation is a number between -1 and 1 that indicates how strongly two variables are linearly related. The chi-squared test is well known, but I can't find any implementation of it for categorical values. The most familiar measure of dependence between two quantities is the Pearson product-moment correlation coefficient (PPMCC), or "Pearson's correlation coefficient", commonly called simply "the correlation coefficient". Measures of Correlation.
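To make the two measures mentioned above concrete, here is a minimal sketch in plain Python, not a definitive implementation. It computes Pearson's correlation coefficient for two numeric variables and the chi-squared statistic for a categorical feature against a categorical target. The function names (`pearson_r`, `chi_squared`, `cramers_v`) and the height and weight values are hypothetical, chosen only for illustration. Cramér's V is not discussed in the text above; it is included here as one common way to rescale the chi-squared statistic into [0, 1], where 0 is the weakest and 1 the strongest association.

```python
import math
from collections import Counter


def pearson_r(xs, ys):
    """Pearson correlation coefficient of two equal-length numeric sequences."""
    n = len(xs)
    if n != len(ys) or n < 2:
        raise ValueError("need two sequences of equal length >= 2")
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Sums of cross-products and squared deviations from the means.
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    if sxx == 0 or syy == 0:
        raise ValueError("correlation is undefined for a constant sequence")
    return sxy / math.sqrt(sxx * syy)


def chi_squared(feature, target):
    """Chi-squared statistic of the contingency table of two categorical sequences."""
    n = len(feature)
    if n != len(target) or n == 0:
        raise ValueError("need two non-empty sequences of equal length")
    observed = Counter(zip(feature, target))  # joint counts
    row_totals = Counter(feature)             # marginal counts of the feature
    col_totals = Counter(target)              # marginal counts of the target
    chi2 = 0.0
    for a, row_count in row_totals.items():
        for b, col_count in col_totals.items():
            # Expected count if feature and target were independent.
            expected = row_count * col_count / n
            chi2 += (observed.get((a, b), 0) - expected) ** 2 / expected
    return chi2


def cramers_v(feature, target):
    """Cramér's V: chi-squared rescaled to the interval [0, 1]."""
    n = len(feature)
    k = min(len(set(feature)), len(set(target))) - 1
    if k == 0:
        return 0.0  # a variable with a single category carries no association
    return math.sqrt(chi_squared(feature, target) / (n * k))


# Made-up heights (cm) and weights (kg) of ten males, for illustration only.
heights = [165, 170, 172, 175, 178, 180, 182, 185, 188, 190]
weights = [60, 65, 68, 72, 74, 79, 80, 86, 88, 93]
print("Pearson r:", pearson_r(heights, weights))

# Made-up categorical feature and target, for illustration only.
colour = ["red", "red", "blue", "blue", "green", "green", "red", "blue"]
bought = ["yes", "yes", "no", "no", "yes", "no", "yes", "no"]
print("chi-squared:", chi_squared(colour, bought))
print("Cramér's V:", cramers_v(colour, bought))
```

The chi-squared statistic alone grows with sample size, which is why the sketch also reports the rescaled value. To turn the statistic into a p-value you would compare it against a chi-squared distribution with (rows - 1) × (columns - 1) degrees of freedom, which this sketch does not do.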