Thursday, May 26, 2016

Statistical data type

Data TypePossible valuesExample usageLevel of measurementDistributionScale of relative differencesPermissible statisticsRegression analysis
binary0, 1 (arbitrary labels)binary outcome ("yes/no", "true/false", "success/failure", etc.)nominal scaleBernoulliincomparablemodeChi-squaredlogistic,probit
categorical1, 2, ..., K (arbitrary labels)categorical outcome (specific blood type,political party, word, etc.)categoricalmultinomial logit,multinomial probit
ordinalinteger orreal number(arbitrary scale)relative score, significant only for creating a rankingordinal scalecategorical??relative comparisonordinal regression(ordered logit,ordered probit)
binomial0, 1, ..., Nnumber of successes (e.g. yes votes) out of Npossibleinterval scale??binomial,beta-binomial, etc.additive??mean,median,mode,standard deviation,correlationbinomial regression(logistic,probit)
countnonnegativeintegers (0, 1, ...)number of items (telephone calls, people, molecules, births, deaths, etc.) in given interval/area/volumeratio scalePoisson,negative binomial, etc.multiplicativeAll statistics permitted for interval scales plus the following:geometric mean,harmonic mean,coefficient of variationPoisson, negative binomial regression
real-valuedadditivereal numbertemperature, relative distance,location parameter, etc. (or approximately, anything not varying over a large scale)interval scalenormal, etc. (usually symmetric about themean)additivemean,median,mode,standard deviation,correlationstandardlinear regression
real-valuedmultiplicativepositive real numberprice, income, size,scale parameter, etc. (especially when varying over a large scale)ratio scalelog-normal,gamma,exponential, etc. (usually a skeweddistribution)multiplicativeAll statistics permitted for interval scales plus the following:geometric mean,harmonic mean,coefficient of variationgeneralized linear modelwithlogarithmiclink

No comments:

Post a Comment