TABLE II

PRELIMINARY TESTS

Reasoning—Unlimited Time

100 Different Pupils Tested Each Time

Number of Problems% Reasoned Correctly as Printed% Reasoned Correctly as ReversedAverage % Reasoned CorrectlyWeight According to Average %
Correct
Weight Used as Probably the Best
19592.693.811
28682.284.11.11
3948991.511
4808381.51.51
58886871.11
66957.463.21.51.4
77080751.251.2
8294436.52.61.6
91915.517.25.452
102427.425.73.62
1117 7.512.37.62
12 716.411.782

Precautions observed to make the Scoring Accurate

The simplicity of the tests made the scoring comparatively easy; and with the observance of the following precautions it is believed that a high degree of accuracy was attained. (1) In so far as practicable, all the papers were scored by a single judge—only two persons being employed on any phase of the work for the entire twenty-six systems; (2) each problem was scored through one hundred or more papers, then the next followed through, etc.; (3) the score for each part of each problem, the errors, etc., were entered on a blank provided with a separate column for each item; (4) where there was doubt as to how the score should be made, the scorer made a written memorandum of how the case was finally decided and this memorandum served as the guide for all future similar cases.

What the Scores Measure

As used in this study the words achievements, products, abilities, except where otherwise qualified, must necessarily refer to the results of the particular tests employed in this investigation. That some systems may achieve other and possibly quite as worth-while results from their arithmetic work is not denied; but what is denied is that any system can safely fail to attain good results in the work covered by these particular tests. Whatever else the arithmetic work may produce, it seems safe to say that by the end of the sixth school year, it should result in at least good ability in the four fundamental operations and the simple, everyday kind of reasoning called for in these problems. It does not then seem unreasonable, in view of the precautions previously enumerated, to claim that the scores made by the respective systems afford a reliable measure of the products of their respective procedures in arithmetic.