|
Next: Quantitative TrendsTrend Tests, Up: MPR-online 1996Vol.1, No.4 Previous: Contents
IntroductionUsually, it depends on one of two prerequisites whether tests for quantitative trends are applied or not. First, the independent variable is quantitative, and second, the independent variable is quantitative and a particular quantitative trend hypothesis is to be tested (see Keppel, 1973, p. 114) [33]. In the first case, the experimenter does not proceed from certain expectations, he or she just looks for the best functional description of his or her data. In the second case, however, the data are examined as to their compatibility with predictions derived from a certain theory or substantive (i.e., psychological) hypothesis. Under these circumstances, it is always possible to specify the exact relations between the independent variable and the values of the dependent variable in advance. Although this article deals exclusively with the case of testing theories and (psychological) hypotheses by means of predictions derived from them, the considerations presented here will prove their importance for other cases, too.
The distinction between substantive or psychological hypotheses
and statistical hypotheses is often either blurred or not taken
into account in empirical psychological literature as well as
in some textbooks. Psychological hypotheses refer to psychological
constructs such as 'aggression,' 'self-esteem,'
or 'imagery' and they 'treat the phenomena of nature
and man' (Clark, 1963, p. 457 [10]). In contrast, 'statistical
hypotheses concern the behavior of observable random variables'
(Clark, 1963, pp. 456-457 [10])
such as 'population variances,'
'population means,' 'population correlations,'
and 'distribution functions.' Most often, psychological
hypotheses are examined using statistical hypotheses which are
only loosely related to them. This is the case, for instance,
when the psychological hypothesis enables the prediction of a
certain rank order of parameters across several experimental
conditions and the well known F test is applied, testing
against the hypothesis that not all parameters (population means
Some authors call for a closer connection between the psychological hypothesis and the statistical hypothesis or hypotheses. They specifically demand that statistical hypotheses should be derived from the psychological one, 'even in a rather loose sense of derive' (cf. Hager, 1987 [21], 1992 [22]; Meehl, 1967 [47]; Wampold, Davis & Good, 1992 [63]; Westermann & Hager, 1986 [66]). Hager (1992, pp. 54-68) [22] has argued that this derivation should preserve the psychological hypothesis' empirical content as it is understood by Popper (1981 [54], 1992). To this aim, he has proposed two additional criteria of derivation, namely appropriateness and exhaustiveness. 'Appropriateness' means that the derived statistical hypothesis has to conform with the direction of the relation claimed in the psychological hypothesis, and 'exhaustiveness' means that a prediction has to encompass any relation or aspect of the psychological hypothesis which can be expressed by statistical concepts (see Hager, 1987 [21], 1992 [22], and Hager & Hasselhorn, 1995 [25], for further details). If a statistical hypothesis is connected to a psychological hypothesis by a derivation and if it meets with the two criteria just mentioned, it is called a statistical prediction (SP for short). This linkage between two kinds of hypotheses by a derivation together with two criteria seems necessary and sufficient to ensure an unambiguous separation of those results which are in complete accordance with the psychological hypothesis from those that contradict it. Such a partition of possible results conforms to demands formulated by Fisher (e.g., 1966 [17]) as well as by Popper (1980) [53]. It is, however, very often the case in empirical psychological literature that this basic principle, advocated independently by a statistician and by a philosopher, is violated, as the analyses by Hager (1992) [22], by Hager and Westermann (1983) [26] and by Westermann and Hager (1986) [66] show.
A statistical prediction is a special statistical hypothesis which
is not necessarily equivalent to the null or the alternative hypothesis
of a (wide-spread and/or single) statistical test. A null hypothesis
(
Choosing the first option means that either one or both of the
principles of appropriateness and exhaustiveness with respect
to the particular statistical prediction is violated by the statistical
hypotheses actually tested, and/or that the decisions made
are mainly data-based. Data-based decisions rely on statistical
tests and on subsequent differential interpretations of
data patterns. If - for example - the significance of an overall
F test is taken as the basis for interpreting the rank
order of sample means as being the same as of the population
means, this is a data-based decision not covered by the test performed.
If the F test is performed on a comparison with more than
one degree of freedom, it does not refer to distances
If, on the other hand, the statistical hypotheses actually tested turn out to be only loosely linked to the psychological hypothesis of interest or to the statistical prediction derived from it, the probability of false decisions concerning the psychological hypothesis can be enhanced substantially, or in more general terms: the probability of false 'truths' can be enhanced greatly. I will cite no examples from current empirical literature to demonstrate this, but rather deal with some textbook presentations; empirical researchers should not be expected to act in a more sophisticated manner than textbook authors. To lower the probability of false 'truths' it is important to apply the criteria of adequateness and exhaustiveness when deriving testable statistical hypotheses from the statistical prediction or when decomposing it into testable partial hypotheses. Several of the subsequent considerations will focus on this demand. If psychologists describe relations among variables by means of mathematical functions they aim for a greater degree of exactness or precision than is possible when using less precise methods of description. This goal of exactness, however, may be rendered unattainable by choosing tests which are not exact enough: One is working with a very precise and seemingly exact scientific terminology and hypotheses, but because of inappropriate statistical procedures the hypotheses actually tested do not reflect the quantification or the functionality to a sufficient degree. This lack of correspondence between the quantitative hypothesis to be tested and the one actually tested will be examined subsequently. It will also be argued that certain tests of qualitative trend hypotheses can result in analoguous problems.
By calling a trend hypothesis a statistical prediction, it is
meant that hypotheses of this kind can occur in empirical research,
that is, may serve as the target hypothesis to be tested. I shall
not deal with psychological hypotheses leading to a particular
statistical trend prediction, but intend to discuss some trend
hypotheses and their relation to some commonly administered tests.
Thus, the main question I seek to answer is: Given a particular
(quantitative or qualitative) trend hypothesis, which of some
well-known statistical tests is best suited to test it, whereby
'best suited' does not refer to statistical assumptions,
but to features of trends. This question will be discussed from
the perspective of the method of planned or focussed contrasts
(among expectations of normally distributed random variables or
population means
Next: Quantitative TrendsTrend Tests, Up: MPR-online 1996Vol.1, No.4 Previous: Contents Methods of Psychological Research 1996, Vol.1, No.4 © 1997 Pabst Science Publishers |
|
|