Class CorrelationCoefficients
- java.lang.Object
-
- org.openjump.core.attributeoperations.statistics.CorrelationCoefficients
-
public class CorrelationCoefficients extends Object
Class that calculates various correlation coefficients.- Version:
- $Rev: 2509 $ modified: [sstein]: 16.Feb.2009 changed logger-entries to comments
- Author:
- Ole Rahn
FH Osnabrück - University of Applied Sciences Osnabrück,
Project: PIROL (2005),
Subproject: Daten- und Wissensmanagement
-
-
Nested Class Summary
Nested Classes Modifier and Type Class Description class
CorrelationCoefficients.CorrelationInformation
class
CorrelationCoefficients.RankCorrelationInformation
protected class
CorrelationCoefficients.SpearmanRankNumberPair
-
Constructor Summary
Constructors Constructor Description CorrelationCoefficients(Feature[] features, String attr1, String attr2)
-
Method Summary
All Methods Static Methods Instance Methods Concrete Methods Modifier and Type Method Description protected double
aritmeticMiddle(Feature[] features, int attr)
static double
getDeviation(Feature[] features, String attr, double mean)
Returns the deviation of the values of the given attribute.CorrelationCoefficients.RankCorrelationInformation
getKendalsTauRankCoefficient()
"Spearman Rank Order Correlations (or "rho") and Kendall's Tau-b (or "tau") Correlations are used when the variables are measured as ranks (from highest-to-lowest or lowest-to-highest)"
http://www.themeasurementgroup.com/datamining/definitions/correlation.htmdouble
getMean(int nr)
Get the aritmetic middle for the nr-th attribut givenCorrelationCoefficients.CorrelationInformation
getPearsonCoefficient()
get Pearson's correlation coefficient (good, dimension-less measure, if there is a linear relation between the attributes)
see: http://www.netzwelt.de/lexikon/Korrelationskoeffizient.htmlprotected HashMap<Integer,Double>
getRank2SpearmanRankMap(Object[] sortedValues, HashMap<Object,Integer> value2NumAppearances)
CorrelationCoefficients.RankCorrelationInformation
getSpearmansRhoCoefficient()
get Pearson's correlation coefficient (good, dimension-less measure, if there is a linear relation between the attributes)
see: http://www.netzwelt.de/lexikon/Korrelationskoeffizient.htmlprotected double
getVariance(String attr)
protected CorrelationDataPair[]
initializeDataStorage(Feature[] features)
-
-
-
Method Detail
-
initializeDataStorage
protected CorrelationDataPair[] initializeDataStorage(Feature[] features)
-
getDeviation
public static double getDeviation(Feature[] features, String attr, double mean)
Returns the deviation of the values of the given attribute. Uses a given mean to avoid multiple calculation of the mean. To get the mean take a look at the FeatureCollectionTools class. This class is also used by aritmeticMiddle().- Parameters:
features
- array containing the features we want the deviation forattr
- name of the attribute to calculate the deviation formean
- the mean for the given features- Returns:
- the deviation
- Throws:
IllegalArgumentException
- if the attribute is not of a numerical type- See Also:
FeatureCollectionTools
-
getVariance
protected double getVariance(String attr)
-
aritmeticMiddle
protected double aritmeticMiddle(Feature[] features, int attr)
-
getMean
public double getMean(int nr)
Get the aritmetic middle for the nr-th attribut given- Parameters:
nr
- index number of attribut to calculate the mean for- Returns:
- the mean for the attribute or Double.NaN, if errors occured
-
getPearsonCoefficient
public CorrelationCoefficients.CorrelationInformation getPearsonCoefficient()
get Pearson's correlation coefficient (good, dimension-less measure, if there is a linear relation between the attributes)
see: http://www.netzwelt.de/lexikon/Korrelationskoeffizient.html- Returns:
- Pearson's correlation coefficient
-
getRank2SpearmanRankMap
protected HashMap<Integer,Double> getRank2SpearmanRankMap(Object[] sortedValues, HashMap<Object,Integer> value2NumAppearances)
-
getSpearmansRhoCoefficient
public CorrelationCoefficients.RankCorrelationInformation getSpearmansRhoCoefficient()
get Pearson's correlation coefficient (good, dimension-less measure, if there is a linear relation between the attributes)
see: http://www.netzwelt.de/lexikon/Korrelationskoeffizient.html- Returns:
- Pearson's correlation coefficient
-
getKendalsTauRankCoefficient
public CorrelationCoefficients.RankCorrelationInformation getKendalsTauRankCoefficient()
"Spearman Rank Order Correlations (or "rho") and Kendall's Tau-b (or "tau") Correlations are used when the variables are measured as ranks (from highest-to-lowest or lowest-to-highest)"
http://www.themeasurementgroup.com/datamining/definitions/correlation.htm- Returns:
- RankCorrelationInformation
-
-