It should, however, be noted that if each individual had been scored by the average of his work on eight or ten different days instead of by his work in just one test, the variability would have been somewhat less than appears in Figs. 61, 62, and 63.

Fig. 62.—The scores of 100 11-year-old pupils in a test of computation. Estimated from the data given by Burt ['17, p. 68] for 10-, 11-, and 12-year-olds. The score equals the number of correct figures.

It is also the case that if each individual had been scored, not in problem-solving alone or division alone, but in an elaborate examination on the whole field of arithmetic, the variability would have been somewhat less than appears in Figs. 61, 62, and 63. On the other hand, if the officers and the soldiers rejected for feeblemindedness had been included in Fig. 61, if the 11-year-olds in special classes for the very dull had been included in Fig. 62, and if all children who had been to school six years had been included in Fig. 63, no matter what grade they had reached, the effect would have been to increase the variability.

Fig. 63.—The scores of pupils in grade 6 in city schools in the Woody Division Test A. The score is the number of correct answers obtained in 20 minutes. From Woody ['16, p. 61].

In spite of the effort by school officers to collect in any one school grade those somewhat equal in ability or in achievement or in a mixture of the two, the population of the same grades in the same school system shows a very wide range in any arithmetical ability. This is partly because promotion is on a more general basis than arithmetical ability so that some very able arithmeticians are deliberately held back on account of other deficiencies, and some very incompetent arithmeticians are advanced on account of other excellencies. It is partly because of general inaccuracy in classifying and promoting pupils.

In a composite score made up of the sum of the scores in Woody tests,—Add. A, Subt. A, Mult. A, and Div. A, and two tests in problem-solving (ten and six graded problems, with maximum attainable credits of 30 and 18), Kruse ['18] found facts from which I compute those of Table 13, and Figs. 64 to 66, for pupils all having the training of the same city system, one which sought to grade its pupils very carefully.

Figs. 64, 65, and 66.—The scores of pupils in grade 6 (Fig. 64), grade 7 (Fig. 65), and grade 8 (Fig. 66) in a composite of tests in computation and problem-solving. The time was about 120 minutes. The maximum score attainable was 196.