Next: An Urn Model
Up: Demonstration and Interpretation of
Previous: An Example
Some methodological conclusions could be drawn immediately, but
their relevance can be judged more concretely, if we understand
some principles behind the paradox. This understanding may be
supported by a closer analysis of the above example. Of course,
the ogives for the RSO-process result from an application of Eq.
(1) to the present situation with
,
leading to
 |
(3) |
The medians for unit i under conditions a and bare
resp.
,
whereas
they are
resp.
for
unit j, and
resp.
for the RSO-process. Finally, let xi and
xj be the abscissas of the intersection points
of the ogives characterizing unit i resp. j. Then
,
and
.
It can be left to the reader to choose between a graphical and an
algebraic check of these assertions, the latter one being based
on the detailed algebraic description of the underlying functions
fu(c,x) in the appendix.
A first approach to principles behind the median paradox is based
on a geometrical interpretation of Eq. (3). If we consider only
the three thin plots in Fig. 1 (i.e., the ogives for condition
a) and draw a vertical line for an arbitrary abscissa x,
then the intersection of this line with the ogive representing
is the midpoint
between the intersections of the same line with the ogives for
fi(a,x) and fj(a,x). Now fi(a,x)is the first curve crossing the median line; but after this intersection
it slows down considerably, and according to the above midpoint
principle the same happens with
,
since fj(a,x) is almost 0 for x<15.
In other words, the ogive
cannot cross the median line, before fj(a,x)becomes greater than 0 to the same amount as fi(a,x)is less than 1, and this happens only at x=16.5,
since
.
But beyond this point, fj(a,x) increases rapidly
and crosses the median line at an earlier point than fj(b,x).
The latter curve has contributed to a relatively early crossing
of the ogive for
with the median line at x=13.5. Certainly, this property
is mainly due to the fast increase of fi(b,x),
but since the midpoint principle holds also for
,
and fi(b,x) is decelerated as the cumulative
probability approaches 1, the intersection of
with the median line would occur at a later point, unless a function
value considerably greater than zero (namely 0.094) would
be taken by fj(b,x) already at x=13.5.
Beyond this point, fj(b,x) increases only slowly
and is overhauled by fj(a,x) such that it crosses
the median line at a later point.
Obviously, the intersections of the ogives characterizing unit
i and j at xi resp. xjare crucial for the principle behind the median paradox. This
fact can be used to understand this paradox in another approach,
which is similar to the introductory examples of changing the
median of a pooled group by manipulations not affecting the subgroup
medians or viceversa. Translating these examples into a probabilistic
framework, we can state the following property of the median:
If probability mass is moved upward or downward such that the
distribution has a well defined median16
before and after the move,
then the median is changed, if and only if a part of the moved
mass crosses the old median. This property may be used to perform
moves affecting only one of the medians, although every move of
probability mass in a distribution characterizing a unit under
a condition c implies (due to Eq. (1)) a similar -
although smaller - move in the distribution characterizing
the RSO-process under the same condition. For instance, the
crossing of the ogives characterizing unit j at
can be considered to be the result of two moves of probability
mass in opposite directions, these moves making up the
individual effect17
for unit j of treatment b (vs. a)
upon the distribution of the dependent variable: One part of the
probability mass below xj of the distribution
for condition a has been shifted to the left such that
the ogive for condition b is left of the one for condition
a below xj. Since the individual median
isn't crossed by this shift, it is left unchanged; but a non-zero
part of the same mass crosses the median for the RSO-process,
contributing to the low median for this process under condition
b. Conversely, a part of the probability mass above xjis moved upwards. It doesn't cross the medians for the RSO-process,
but for the individual distribution, leading to
.
Capitalizing systematically on this property of the median, we
can reconstruct something like a cookbook recipe, which has served
for the construction of the example in Fig. 1. Suppose that we
have started with a situation with ogives given by
,
and
,
the respective medians being denoted as
resp.
.
In other words, under both conditions the cumulative distribution
functions are identical with the steep ogive for the respective
unit. Then it is easily verified that the medians of the ogives
characterizing the RSO-process would be 15.0. Now
the transition from this situation to the one represented in Fig.
1 can be considered as a result of the following steps (where
and
are
the abscissas of intended intersection points of ogives characterizing
the same unit):
- -
-
In a first step, we change
by shifting a part of the probability mass from the interval
]xi, 15.0[to the interval
]15.0, 30.0[. Obviously, this shift results
in an increased value of
,
whereas
remains unaffected.
- -
-
A second step is again concerned with
:
We shift a part of the probability mass from
to
.
Then the median
will fall below its hithertoo value of
(leading to
),
whereas
keeps its value resulting from the first step.
- -
-
In two further steps, we perform corresponding changes
with
:
Shifting probability mass from
]15.0, xj[to
]0.0,15.0[ leads to
and finally a shift from
to
gives
.
The above steps have resulted in two units with intersecting ogives,
and this is, indeed, the most effective way of obtaining a median
paradox. Generalizing the results to RSO-processes with more
than two units, we can conclude that the occurrence of a median
paradox is favoured by a pattern consisting in an intersection
of ogives at a point with cumulative probability greater than
0.5 for units with low medians (our unit i), and
a cumulative probability less than 0.5 at the intersection
points of units with high medians (our unit j). However,
it should be noted that it isn't necessary for the occurrence
of a median paradox to have intersections of both types. We could
e.g. take the curves fi(a,x), fi(b,x)and fj(a,x) as in the example and combine them
with a curve
resulting from a shift of the entire curve fj(a,x)to the right, leading to
Then we will have again
,
but
.
If the said shift of fj(a,x) leading to
isn't too big, we nevertheless get
,
and again the median paradox is complete. Similarly, a median
paradox with an intersection of ogives only for unit jcould result from keeping the curves fi(b,x),
fj(a,x) and fj(b,x) of the
example and obtaining
by a suitable shift of fi(b,x) to the left.
Understanding principles behind a paradox may be important to
avoid erroneous conclusions from data. For instance, understanding
that the Simpson paradox
(see, e.g., Agresti, 1990, p. 155 [1]) is
just a special case of confounding leads to the control of potential
confounders as a method of avoiding misled conclusions. At first
glance, the median paradox looks similar: The direction of a statistical
association can change under aggregation. However, the median
paradox is not based on confounding, but on the properties of
the median which have beeen outlined above. Since the source of
the paradox is inherent to the median by its definition, we will
have to discuss in Section 3 whether avoiding the median as a
parameter reflecting treatment effects isn't the adequate answer
to the median paradox.
Next: An Urn Model
Up: Demonstration and Interpretation of
Previous: An Example
Methods of Psychological Research 1997 Vol.1 No.4
© 1997 Pabst Science Publishers
|