Benefits of using the mean to summarise central tendency of a 5 point rating
As @gung mentioned, I think there are often very good reasons for taking the mean of a five-point item as an index of central tendency. I have already outlined these reasons here.
To paraphrase:
- The mean is easy to calculate
- The mean is intuitive and well understood
- The mean is a single number
- Other indices often yield a similar rank ordering of objects
Why the mean is good for Amazon
Think about the goals of Amazon in reporting the mean. They might be aiming to:
- provide an intuitive and understandable rating for an item
- ensure user acceptance of the rating system
- ensure that people understand what the rating means so they can use it appropriately to inform purchasing decisions
Amazon provides a rounded mean, frequency counts for each rating option, and the sample size (i.e., the number of ratings). This information is presumably enough for most people to appreciate both the general sentiment regarding the item and the confidence to place in that rating (e.g., a 4.5 with 20 ratings is more likely to be accurate than a 4.5 with 2 ratings; an item with ten 5-star ratings and one 1-star rating with no comments might still be a good item).
You could even see the mean as a democratic option. Many elections are decided by which candidate gets the higher mean on a two-point scale. Similarly, if you take the view that each person who submits a review gets a vote, then the mean is a rating that weights each person's vote equally.
Are differences in scale use really a problem?
A wide range of rating biases is documented in the psychological literature (for a review, see Saal et al., 1980), such as central tendency bias, leniency bias, and strictness bias. Also, some raters will be more arbitrary and some more reliable. Some may even systematically lie, giving fake positive or fake negative reviews. These biases will create various forms of error when trying to estimate the true mean rating for an item.
However, if you were to take a random sample of the population, such biases would tend to cancel out, and with a sufficient sample size of raters you would still recover the true mean.
Of course, you don't get a random sample on Amazon, and there is the risk that the particular set of raters an item attracts is systematically biased to be more lenient, more strict, and so on. That said, I think users of Amazon appreciate that user-submitted ratings come from an imperfect sample. I also think it quite likely that, with a reasonable sample size, the majority of response-bias differences would wash out in many cases.
Possible advances beyond the mean
In terms of improving the accuracy of the rating, I wouldn't challenge the general concept of the mean, but rather I think there are other ways of estimating the true population mean rating for an item (i.e., the mean rating that would be obtained were a large representative sample asked to rate the item).
- Weight raters based on their trustworthiness
- Use a Bayesian rating system that estimates the mean rating as a weighted sum of the average rating for all items and the mean from the specific item, and increase the weighting for the specific item as the number of ratings increases
- Adjust a rater's ratings for any general rating tendency they show across items (e.g., a 5 from someone who typically gives 3s would be worth more than a 5 from someone who typically gives 4s)
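The Bayesian suggestion above can be sketched as a simple shrinkage estimator. This is a minimal illustration with made-up numbers; the function name and the choice of prior weight are my own, and whether Amazon uses anything like this is unknown:

```python
def bayesian_rating(item_ratings, global_mean, prior_weight):
    """Estimate an item's rating by shrinking its mean toward the global mean.

    `prior_weight` acts like a pseudo-count of ratings at the global mean,
    so the item's own mean dominates only as its number of ratings grows.
    (Illustrative formula, not Amazon's actual algorithm.)
    """
    n = len(item_ratings)
    if n == 0:
        return global_mean
    item_mean = sum(item_ratings) / n
    return (prior_weight * global_mean + n * item_mean) / (prior_weight + n)

# Two 5-star ratings barely move the estimate away from the global mean...
few = bayesian_rating([5, 5], global_mean=3.5, prior_weight=10)    # 3.75
# ...while twenty of them largely override the prior.
many = bayesian_rating([5] * 20, global_mean=3.5, prior_weight=10)  # 4.5
```

With only two ratings the estimate stays close to 3.5, which matches the intuition above that a 4.5 from 2 ratings deserves less confidence than a 4.5 from 20.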
Thus, if accuracy in rating were Amazon's primary goal, I think it should endeavour to increase the number of ratings per item and adopt some of the above strategies. Such approaches might be particularly relevant when creating "best-of" rankings. However, for the humble rating on the product page, it may well be that the sample mean better meets the goals of simplicity and transparency.
References
- Saal, F. E., Downey, R. G., & Lahey, M. A. (1980). Rating the ratings: Assessing the psychometric quality of rating data. Psychological Bulletin, 88, 413.
If you can assume independence and approximate normality of the observed values around their own population curve, you could construct an asymptotic chi-squared statistic to test for any form of deviation from the WHO charts by summing the squared $Z$ values. Since the parameters are all determined outside the sample, the chi-squared statistic should have $n$ degrees of freedom. [However, rejection could as easily imply an underestimate of the variation about the model as an issue with the location of the curve.]
If you want a test for a directional shift (an overall tendency to be larger or an overall tendency to be smaller), you could instead sum the $Z$ values and (under the same assumptions) compare the sum with a $N(0,n)$ distribution.
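As a concrete sketch, both tests can be computed directly from the z-scores. The helper name and inputs are hypothetical; `chart_mean` and `chart_sd` stand in for the age-specific parameters read off the published WHO tables:

```python
from scipy import stats

def who_deviation_tests(observed, chart_mean, chart_sd):
    """Test a set of measurements against externally fixed chart parameters.

    Assumes the z-scores are independent and approximately standard
    normal under the null. Returns (omnibus p-value, directional p-value).
    """
    z = [(x - m) / s for x, m, s in zip(observed, chart_mean, chart_sd)]
    n = len(z)
    # Omnibus test: sum of squared z-values ~ chi-squared with n df
    # (n df, not n-1, because no parameters are estimated from the sample)
    chi2_stat = sum(v * v for v in z)
    p_omnibus = stats.chi2.sf(chi2_stat, df=n)
    # Directional test: sum of z-values ~ N(0, n) under the null
    z_sum = sum(z)
    p_shift = 2 * stats.norm.sf(abs(z_sum) / n ** 0.5)
    return p_omnibus, p_shift
```

For example, four measurements each one chart-SD above the curve give a clearly significant directional test while the omnibus test is less decisive, since the directional test pools the consistent signs.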
[It would also be possible to test for general deviations by goodness-of-fit testing of the set of $Z$ values against a standard normal, but that would - naturally - tend to be much more directly sensitive to the assumption of normality. I wouldn't advise this approach.]
If you instead assume symmetry and independence, you could test directional shift with a sign test and consistency more broadly with a runs test.
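A stdlib-only sketch of both of those tests, computed on the signs of the z-scores (the function names are my own; the runs test uses the usual Wald-Wolfowitz normal approximation):

```python
from math import comb

def sign_test(z_values):
    """Exact two-sided sign test: under symmetry, P(above curve) = 1/2.

    Counts observations above the reference curve (z > 0); zeros dropped.
    """
    signs = [v > 0 for v in z_values if v != 0]
    n, k = len(signs), sum(signs)
    tail = min(k, n - k)
    # Two-sided binomial p-value for p = 1/2 (clamped at 1 when k = n/2)
    p = sum(comb(n, i) for i in range(tail + 1)) / 2 ** (n - 1)
    return min(p, 1.0)

def runs_test(z_values):
    """Wald-Wolfowitz runs test on the signs of the z-values.

    Too few runs suggests systematic stretches above/below the curve.
    Returns the normal-approximation z statistic (large |z| = reject).
    """
    signs = [v > 0 for v in z_values if v != 0]
    n1 = sum(signs)
    n2 = len(signs) - n1
    if n1 == 0 or n2 == 0:
        raise ValueError("need observations on both sides of the curve")
    runs = 1 + sum(a != b for a, b in zip(signs, signs[1:]))
    mu = 2 * n1 * n2 / (n1 + n2) + 1
    var = (2 * n1 * n2 * (2 * n1 * n2 - n1 - n2)) / (
        (n1 + n2) ** 2 * (n1 + n2 - 1))
    return (runs - mu) / var ** 0.5
```

A long stretch of same-sign deviations gives a large negative runs-test statistic (too few runs), flagging the kind of sustained departure the paragraph above describes.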
Note that if your curves are obtained by following individuals through time, the assumption of independence might not be tenable; more suitable models would be needed for that.
Combining males and females even though they would have different growth curves
-- this should be possible for the chi-square test and the sum-of-$Z$ test I mentioned, and it should also work with the sign test.
The runs test would not, however -- you couldn't just jam the two series together; you'd need to combine the two test statistics. If the series are large enough to use the normal approximation, you could add the two runs counts: the hypothesized mean of the total number of runs is the sum of the component means, and the variance is the sum of the variances.
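That combination rule can be sketched as follows (hypothetical helper; the component means and variances come from the per-series runs-test formulas, and independence of the two series is assumed):

```python
def combined_runs_z(runs_a, mu_a, var_a, runs_b, mu_b, var_b):
    """Combine two independent runs counts (e.g., males and females).

    Under the normal approximation the total number of runs has mean
    mu_a + mu_b and variance var_a + var_b, because the two series
    are independent, so a single z statistic can be formed.
    """
    total_runs = runs_a + runs_b
    total_mu = mu_a + mu_b
    total_var = var_a + var_b
    return (total_runs - total_mu) / total_var ** 0.5
```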
Best Answer
It is widespread practice, in a school of thought called Classical Test Theory (CTT, as opposed to Item Response Theory, IRT), to treat so-called Likert scales as quasi-metric.
Usually you are expected to word the response options so that they feel equidistant, and you are supposed to show that the items you add up are somehow of the same kind and not apples and pears (think internal consistency; think factor analysis).
If your scale has internal consistency, is composed of a number of items, and is otherwise "reasonable", then it is common practice to compute t-tests and do all sorts of other things reserved for metric variables.
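As an illustration of that practice (the responses below are made up), one might sum each respondent's items into a scale score and compare two groups with an ordinary t-test:

```python
from scipy import stats

# Hypothetical data: each row is one respondent's answers to four
# 5-point Likert items; the scale score is the simple item sum.
group_a = [[4, 5, 4, 4], [3, 4, 4, 3], [5, 5, 4, 5], [4, 4, 3, 4]]
group_b = [[2, 3, 2, 3], [3, 3, 2, 2], [2, 2, 3, 3], [3, 4, 3, 2]]

scores_a = [sum(row) for row in group_a]  # [17, 14, 19, 15]
scores_b = [sum(row) for row in group_b]  # [10, 10, 10, 12]

# Treat the summed scores as (quasi-)metric and compare the groups
t, p = stats.ttest_ind(scores_a, scores_b)
```

Whether this is strictly legitimate is exactly the CTT-versus-IRT debate; the point is only that, once the items are summed, the machinery for metric variables is routinely applied.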
It may not be mathematically correct, but it has stood the test of time, and the results are often surprisingly similar to those gained from the more complicated and less used IRT, even if the latter may be the future in a computerized age.
To read more, search for "Likert-scale" and "Classical Test Theory".