|
Next: Bibliography Up: A Paradoxical Property of Previous: An Urn Model
Methodological Consequences
It may be questioned whether a situation of the kind demonstrated
in Section 2.1 will ever occur in actual research. Doesn't the
median paradox belong to that species of highly artificial counterexamples
without practical relevance constructed by meticulous mathematicians
to prove the foolishness of everyday research practice? Even if
one shares the critical view behind this question, the example
is sufficient to show that the process of 'deriving' an aggregate
hypothesis from a hypothesis referring to individuals needs an
argument of plausibility, if the hypothesis is explicated in terms
of an order of medians: Although a situation with
The difference between both kinds of 'derivation' has further
consequences for the methodology of testing causal hypotheses in psychology.
We are now sufficiently prepared to discuss the fairness of the
method of strict hypothesis testing outlined in Section 1.1. If
treatment effects are explicated as individual or average causal
effects upon the expectation of the dependent variable, then a
causal hypothesis claiming a positive individual causal effect
(i.e.,
Whereas the difference
Should this fact be coined into an argument against median hypotheses or against the method of strict hypothesis testing based on deliberate sampling bias? Under a reconstructive (i.e., non-normative) view of methodology, it is not the task of this discipline to establish norms, but to point out possible consequences of methods, including their advantages as well as risks of erroneous interpretations of data. In this understanding, the methodological relevance of the median paradox can be summarized in the conclusion that it hinders a way of redeeming hypothesis testing in psychological research from justified complaints about its lack of daring predictions, if effects of conditions are explicated in terms of medians.
On the other side, shifting to the test of differences in expectations
may be considered problematic for dependent variables with ordinal
scale level. So it should be mentioned that there is another way
of explicating a 'positive' effect of a condition b (relative
to a) on a dependent variable: A hypothesis can claim that
the relation
In summary, an adequate answer to the median paradox and to the
requirement of strict and fair hypothesis testing would consist
in the transition from median hypotheses to strict stochastic
order as an explication of a 'positive' effect of a treatment
b (vs. a) upon an ordinally scaled dependent variable.
But of course it must be left to the individual researcher to
judge whether his or her theory really implies a concept of an
effect, where shifts of probability mass have to be regarded as
positive or negative effects, if and only if the median is changed.
This would e.g. mean to speak of a positive effect of treatment
b (vs. a) in a situation, where the pth quantile
(i.e., the scale value with a cumulative probability p)
is greater under condition b only for values of pin the immediate neighbourhood of p=0.5. 23
If a psychological
theory is based on a concept of a positive effect with these implications,
then it would be unfair to replace a median hypothesis by the
hypothesis of strict stochastic order: Results with crossing ogives
would be regarded as instances of refutation, although the non-occurence
of such results doesn't follow from the psychological theory.
However, due to the median paradox a test of the aggregate hypothesis
A more general methodological consequence to be drawn from the median paradox is a warning against a common practice of 'intuitive' derivation of statistical hypotheses, where a property, which is intended to refer to individuals, is translated into a formally identical aggregate hypothesis. Perhaps the disregard for such warnings is partly due to the fact that they have frequently been based on the lacking aggregation stability of highly formalized mathematical models like exponential learning models (Sidman, 1952 [21]) or the Thurstonean law of comparative judgment (Bakan, 1967/1970 [4]). But the difference in the behaviour under aggregation of expectations and medians should make us cautious towards assertions (e.g., Hager, 1992, p. 21 [9]) that the choice of particular parameters for a statistical hypothesis doesn't touch the validity of a hypothesis deduction from a psychological theory, which isn't itself formalized mathematically, and that this choice can be gouverned entirely by the scale level of the dependent variable and by needs of controlling the power of statistical tests. In order to derive an aggregate hypothesis leading to a valid and fair tests of a psychological hypothesis referring to individuals, the aggregation stability of the hypothesized property has to be paid due attention. Next: Bibliography Up: A Paradoxical Property of Previous: An Urn Model Methods of Psychological Research 1997 Vol.1 No.4 © 1997 Pabst Science Publishers |
|
|