The measurement of intelligence - Lewis M. Terman

It is too much, however, to suppose that the instructions can be made “fool-proof.” With whatever definiteness they may be set forth, situations are sure to arise which the examiner cannot be formally prepared for. There is no limit to the multitude of misunderstandings possible. After testing hundreds of children one still finds new examples of misapprehension. In a few such cases the instruction may be repeated, if there is reason to think the child’s hearing was at fault or if some extraordinary distraction has occurred. But unless otherwise stated in the directions, the repetition of a question is ordinarily to be avoided. Supplementary explanations are hardly ever permissible.

In short, numberless situations may arise in the use of a test which may injure the validity of the response, events which cannot always be dealt with by preconceived rule. Accordingly, although we must urge unceasingly the importance of following the standard procedure, it is not to be supposed that formulas are an adequate substitute either for scientific judgment or for common sense.

Scoring.

The exact method of scoring the individual tests is set forth in the following chapters. Reference to the record booklet for use in testing will show that the records are to be kept in detail. Each subdivision of a test should be scored separately, in order that the clinical picture may be as complete as possible. This helps in the final evaluation of the results. It makes much difference, for example, whether success in repeating six digits is earned by repeating all three correctly or only one; or whether the child’s lack of success with the absurdities is due to failure on two, three, four, or all of them. Time should be recorded whenever called for in the record blanks.

Recording responses.

Plus and minus signs alone are not usually sufficient. Whenever possible the entire response should be recorded. If the test results are to be used by any other person than the examiner, this is absolutely essential. Any other standard of completeness opens the door to carelessness and inaccuracy. In nearly all the tests, except that of naming sixty words, the examiner will find it possible by the liberal use of abbreviations to record practically the entire response verbatim. In doing so, however, one must be careful to avoid keeping the child waiting. Occasionally it is necessary to leave off recording altogether because of the embarrassment sometimes aroused in the child by seeing his answer written down. The writer has met the latter difficulty several times. When for any reason it is not feasible to record anything more than score marks, success may be indicated by the sign +, failure by −, and half credit by ½. An exceptionally good response may be indicated by ++ and an exceptionally poor response by − −. If there is a slight doubt about a success or failure the sign? may be added to the + or −. In general, however, score the response either + or −, avoiding half credit as far as it is possible to do so.

If the entire response is not recorded it is necessary to record at least the score mark for each test when the test is given. It must be borne in mind that the scoring is not a purely mechanical affair. Instead, the judgment of the examiner must come into play with every record made. If the scoring is delayed, there is not only the danger of forgetting a response, but the judgment is likely to be influenced by the subject’s responses to succeeding questions. Our special record booklet contains wide margins, so that extended notes and observations regarding the child’s responses and behavior can be recorded as the test proceeds.

Scattering of successes.

It is sometimes a source of concern to the untrained examiner that the successes and failures should be scattered over quite an extensive range of years. Why, it may be asked, should not a child who has 10-year intelligence answer correctly all the tests up to and including group X, and fail on all the tests beyond? There are two reasons why such is almost never the case. In the first place, the intelligence of an individual is ordinarily not even. There are many different kinds of intelligence, and in some of these the subject is better endowed than in others. A second reason lies in the fact that no test can be purely and simply a test of native intelligence. Given a certain degree of intelligence, accidents of experience and training bring it about that this intelligence will work more successfully with some kinds of material than with others. For both of these reasons there results a scattering of successes and failures over three or four years. The subject fails first in one or two tests of a group, then in two or three tests of the following group, the number of failures increasing until there are no successes at all. Success “tapers off” from 100 per cent to 0. Once in a great while a child fails on several of the tests of a given year and succeeds with a majority of those in the next higher year. This is only an extreme instance of uneven intelligence or of specialized experience, and does not necessarily reflect upon the reliability of the tests for children in general. The method of calculation given above strikes a kind of average and gives the general level of intelligence, which is essentially the thing we want to know.

Scoring.

Recording responses.

Scattering of successes.

Supplementary considerations.