next up previous
Next: Empirical comparison between Trasid Up: Trasid-Rotation Previous: Rotation for orientation of

Maximising the hyperplane count (with and without division by the square root of the communality)

In this step, the numerical values of the elements of the transformation-matrix tex2html_wrap_inline712 of the intermediate solution will be changed continuously, until specific criteria are reached. However, not all numerical values of a given precision will be entered as elements of the transformation-matrix (this would need too much time, even if only values of three digits are entered). The iterative change or adjustment of the elements of the transformation-matrix is restricted to the strictly neccessary. Positive and negative increments will be added alternately to all non-diagonal elements of the transformation-matrix. Therefore, the procedure is called Transformation-matrix-Search and Identification (Trasid). After every iterative change of the transformation-matrix, the solution will be checked with regard to the maximisation of the hyperplane count. The orientation on the main loadings will not be lost, because the iterative change will be performed only within a certain range around the elements of the transformation-matrix.

The maximised parameters:

Two ways of computing the hyperplane count are performed: the count for absolute loadings tex2html_wrap_inline679 0.10 (or another value, if intended), which are divided by the square root of the communality (h) and the count for absolute loadings tex2html_wrap_inline679 0.10 (without division by h). The priority of maximisation is given to the hyperplane count based on the absolute loadings divided by h. A priority for the absolute loadings without division by h may produce solutions, which depend strongly on unreliable variables, because low reliability may lead to low communality. Low communality means that the loadings of the variables will be low, so that they will more likely fall in the hyperplane and thereby may influence the rotation. In addition to the argument that unreliable variables should not have a strong influence on the rotation, it is not clear why a variable with low communality, i.e. which is not really represented by one factor of a solution, may influence the rotation of the factors. So, the simple way of computing the hyperplane count, as proposed by Cattell (1952), must be criticised: when a solution has many variables with low communalities, the simple sum of the absolute loadings tex2html_wrap_inline679 0.10 is not very informative. With very similar arguments Bargmann (1955) used the absolute loadings divided by h for his test of significance of simple structure.

However, it is possible to maximise the hyperplane count without division by h within the range of solutions, which is determined by the hyperplane count with division by h. In this case, the hyperplane count with division by h has the priority of maximisation. When a maximisation of the hyperplane count without division by h is not in conflict with the maximisation of hyperplane count with division by h, there is no reason to avoid this maximisation, because in this case the solution which is only based on the variables with high communalities is not very different from the solution which includes also variables with low communalities in the maximisation.

The combined criterion, which realises the abovementioned priority, and which will be maximised in the Trasid procedure, can be formalised as follows: For n variables and m factors a matrix B is defined, with the elements tex2html_wrap_inline748, according to the first condition for the hyperplane count (tex2html_wrap_inline750 are the factor loadings):


equation85

Depending on sample size, other values for the hyperplane count can be chosen (see above), but in order to be compatible with most of the previous studies a value of 0.10 will be chosen for the cut-off here. A second matrix C is defined, with the elements tex2html_wrap_inline754, according to the second, simple condition for the hyperplane count:


equation97

Then, the following criterion will be maximised:


equation107

Dividing the simple hyperplane count by the total number of loadings in the matrix (tex2html_wrap_inline756) ensures that the increase in the latter can never be greater than the increase of one loading falling in the hyperplane according to the first hyperplane count. So, when one loading falls into the hyperplane according to the first criterion, no increase in the second, simple hyperplane count can be greater or equal (remembering that never all absolute loadings of a solution will be tex2html_wrap_inline679 0.10 or even tex2html_wrap_inline679 0.20). Thus, the present combined criterion realises the priority of the hyperplane count with division of the absolute loadings by h over the simple hyperplane count.

The criterion in formula 6 is maximised according to the iterative change of elements of the intermediate transformation-matrix tex2html_wrap_inline712. Because the matrix is changed only in a range or width (W, see below) around the values of tex2html_wrap_inline712, this criterion is not maximised over the whole matrix.

Besides the fact, that in a first step, a rotation for the orientation on the main loadings is performed, the present combined criterion (formula 6), which takes into account division by h, is the main difference between Trasid and other methods which maximise the hyperplane count as Maxplane-, PPFP-, and Hyball-rotation. The above-mentioned problems, which may occur, when only the hyperplane count for the absolute loadings without division by h is maximised, are demonstrated with an example in a further section (see below).

Because the hyperplane count is a discrete criterion, and because there is no gradient for maximising it, micro-local optima may easily occur (Katz & Rohlf, 1974). In PPFP-rotation Katz and Rohlf (1974, 1975) circumvent the problem using an exponential function of the loadings, which is continuous. The disadvantage of this method is that it is not clear how far the hyperplane count is still directly maximised.

That is why, as in Cattell and Muerle (1966), a trial-and-error maximisation of the hyperplane count will be performed with Trasid, however with the difference that in the cases, in which the iterative procedure cannot maximise the hyperplane count (i.e. the combined criterion), a maximisation of a continuous parameter will be performed. In this case, the parameter is called continuous, because with every change in the transformation-matrix and the factor pattern a change of the parameter will occur. The hyperplane count will only change, when absolute loadings fall into the cut-off value, whereas the continuous parameter will be sensitive to all absolute loadings above the cut-off value.

The parameter used here is the mean difference between the absolute loadings (divided by h) and the cut-off value for the hyperplanes (here 0.10). When the mean difference becomes smaller, the loadings are ``closer'' to the hyperplanes and the probability increases for the variables to fall into the hyperplanes in the iteration process. The difference between the absolute loadings and the cut-off value makes sense only when the absolute loadings are greater than the cut-off value, because a zero loading can and should stay at zero and should not be maximised in order to come closer to the cut-off value. So, the mean difference is only computed for the absolute loadings greater then the cut-off value.

Instead of minimising the mean difference between the absolute loadings divided by h and the cut-off value, the mean of the reciprocal value of the differences is maximised. The hyperbolic function yields extreme values for very low differences, so that the mean value is influenced mostly by variables with low differences. Because the parameter corresponds to the mean proximity of the loadings to the cut-off value, it will be called P. The parameter will be computed over the whole factor matrix as for the hyperplane count. We define a matrix D of differences with the following elements tex2html_wrap_inline782, according to the following conditions:


equation118

If the absolute loadings divided by h are greater than the cut-off value, the matrix contains tex2html_wrap_inline786, otherwise it contains zero-elements. Then the sum of tex2html_wrap_inline782 is divided by the number of non-zero elements, i.e. the number n of loadings in the matrix (tex2html_wrap_inline792) minus the number of zero elements, i.e. the sum of tex2html_wrap_inline748 (see formula 4).


equation137

Maximising P will tend to increase the hyperplane count, also if this is not guarenteed. Of course, other functions which maximise the hyperplane count are possible.

Maximising of the proximity P has the lowest priority, i.e. when the combined hyperplane count can only be maximised at the cost of minimising P, P will be minimised and the combined hyperplane count maximised. Maximising of P supports the maximisation of the hyperplane count, because the lower loadings are pushed in the direction of the hyperplanes (e.g. a loading of .20 will become .15 when P is maximised, thus it will be nearer to the cut-off value), so that the trial-and-error algorithm will have a greater chance to ``catch'' this loading into the hyperplane. The maximisation of P thus increases the propability for maximising the combined hyperplane count by means of the trial-and-error-algorithm.

The iterative change of the transformation-matrix

The elements of the transformation-matrix will be changed (i.e. increments will be added or subtracted to all non-diagonal elements of the matrix, see below), until the above-mentioned parameters are maximal. The main problem is to avoid local optima. Of course, local optima can be considered local, only if an even better solution has been found. Because local optima cannot be excluded theoretically or mathematically, a complex algorithm for the iterative change of the transformation-matrix is proposed. Several different algorithms for the maximisation would be possible. The present algorithm is a first attempt to reach a very high hyperplane count (see below) within a reasonable amount of time. Different algorithms and different parameters within the present algorithm have been checked, and did not produce higher hyperplane counts.

Two parameters of the algorithm are varied systematically: (1) the amount of change C of the increments (see formula 9, below), which are successively added to the elements of the transformation-matrix and (2) the maximal width W of the increments, i.e. the maximal value, which is added to the elements. The increments are added alternately to the transformation-matrix tex2html_wrap_inline712 of the intermediate solution, which is the result of the rotation for orientation of the factors on the main loadings and to the optimised transformation-matrix tex2html_wrap_inline816, which is the transformation-matrix of the until then best solution. The fact that the increments are added to tex2html_wrap_inline712 and To ensures that the algorithm starts from two different, but sensible matrices (for more on the influence of starting positions on the rotation, see Rozeboom, 1992).

The increments, which are added to the elements of the transformation matrices, are computed according to formula 9. For C, values of 0.002, 0.003, 0.004, 0.005 and 0.006 are entered successively in formula 9. Even smaller values could be entered for C, but the algorithm would run much more slowly. In the empirical tests of the algorithm smaller values for C did not improve the simple structure of the solutions. The widths W of the increments are 0.005, 0.010, 0.015, 0.020...0.150, i.e. the iterative change of the transformation-matrix is performed for all widths between 0.005 and 0.015 in steps of 0.005. Of course, even greater values can be chosen for the maximal width, e.g. 0.20 may sometimes be necessary in order to get the optimal solution. But the values should not be too big (e.g. > 0.30) in order to avoid collapsing of the factors and in order to maintain the previously established orientation on the main loadings. For an empirical illustration of the maximisation of the hyperplane count depending on W, see below. The increments tex2html_wrap_inline832 are computed on the basis of the previous increments i. The starting value is an increment of 0.0001.


equation144

Figure 2 contains the recursive sequence of i, which is computed according to formula 9 with a change C of 0.002 and 0.006 and a width of W of 0.10. Note that the increments are positive and negative, so that positive and negative increments will be added to the elements of the intermediate transformation-matrix tex2html_wrap_inline712 and the until then optimised matrix tex2html_wrap_inline816.

 figure149
Figure 2: Sequence of the increment i according to formula 9

When the absolute value of the increment i becomes equal to or greater than W, the iteration process stops (as for the 17th iteration with C = 0.006 in Figure 2). For a given W, there are many more iterations when C is small. Because the transformation-matrix is normalised, the exact values of the increment will be slightly smaller.

An example for the meaning of W: when W is 0.02, (positive and negative) increments will be added to every non-diagonal element of the transformation-matrices tex2html_wrap_inline712 and tex2html_wrap_inline816, until the absolute size of the increments becomes equal or greater than W. The change C of the increments will be 0.002, until the increment i becomes equal to or greater than the W of 0.02. Then, the same iteration process starts with a value of 0.003 for C until the increment reaches W. This process continues until C becomes 0.006.

The iterative change of the transformation-matrix can be described as follows: the first change value C will be 0.002. According to formula 9 the first increment will be -0.0021 for the first non-diagonal element in the first column and second row (tex2html_wrap_inline710) of the transformation-matrix of the intermediate solution (tex2html_wrap_inline712). The new matrix will be called tex2html_wrap_inline884. i will be a bit smaller, because tex2html_wrap_inline884 will be normalised. The width W will be 0.005 in this first step.

tex2html_wrap_inline884 is the matrix to compute the factor structure from the orthogonal factor matrix. But the simple structure should be maximised in the factor pattern. So, the factor pattern will be computed according to formula 10.


equation156

tex2html_wrap_inline894 is the factor pattern, A is the orthogonal starting solution and tex2html_wrap_inline898 is the inverse of the transposed tex2html_wrap_inline884 (Formula 7 for computation of the factor pattern is according to Harman, 1976, p. 268, formula 12.21). When the factor pattern is computed, it will be checked, if the criterion for the hyperplane count (formula 6) has been maximised. If the criterion has been maximised, the new transformation-matrix tex2html_wrap_inline884 will be saved as optimised transformation-matrix tex2html_wrap_inline816, if not, two cases can be distinguished: (1) if the hyperplane count according to formula 6 has been reduced, tex2html_wrap_inline884 will not be saved. (2) If the hyperplane count according to formula 6 has not changed, it will be checked, if the proximity parameter P (formula 8) has increased. If P has increased, again tex2html_wrap_inline884 will be saved as optimised matrix tex2html_wrap_inline816, if it has not increased, tex2html_wrap_inline884 will not be saved.

After tex2html_wrap_inline816 has been computed for the first time, all subsequent iterations will be performed alternately starting from tex2html_wrap_inline816 and from the intermediate solution tex2html_wrap_inline712. The next change of the element tex2html_wrap_inline710 will be performed with an increment of 0.0041. The next increment will be -0.0061, so that the absolute value of i is more than the W of 0.005. Only this increment will be added to the element tex2html_wrap_inline710 of tex2html_wrap_inline712 and tex2html_wrap_inline816. Then, the iterative procedure will be performed with a C of 0.002 for the next non-diagonal elements (t31) of tex2html_wrap_inline712 and tex2html_wrap_inline816. When the procedure has been performed for all non-diagonal elements of tex2html_wrap_inline712 and tex2html_wrap_inline816 a new cycle begins with a change C of 0.003. After the last cycle with C=0.006, a new iteration of the whole procedure starts with a width W=0.01. The next starts with W=0.015, up to the maximal value of W (here 0.15).

The optimised factor pattern (according to the above-mentioned criteria) and transformation-matrix tex2html_wrap_inline816 will be saved.


next up previous
Next: Empirical comparison between Trasid Up: Trasid-Rotation Previous: Rotation for orientation of

Methods of Psychological Research 1997 Vol.2 No.2
© 1998 Pabst Science Publishers