Even now we may go so far as to say that the majority of us are secretly wishing they could say “Who did you see?” It would be a weight off their unconscious minds if some divine authority, overruling the lifted finger of the pedagogue, gave them carte blanche. But we cannot too frankly anticipate the drift and maintain caste. We must affect ignorance of whither we are going and rest content with our mental conflict—uncomfortable conscious acceptance of the “whom,” unconscious desire for the “who.”[133] Meanwhile we indulge our sneaking desire for the forbidden locution by the use of the “who” in certain twilight cases in which we can cover up our fault by a bit of unconscious special pleading. Imagine that some one drops the remark when you are not listening attentively, “John Smith is coming to-night.” You have not caught the name and ask, not “Whom did you say?” but “Who did you say?” There is likely to be a little hesitation in the choice of the form, but the precedent of usages like “Whom did you see?” will probably not seem quite strong enough to induce a “Whom did you say?” Not quite relevant enough, the grammarian may remark, for a sentence like “Who did you say?” is not strictly analogous to “Whom did you see?” or “Whom did you mean?” It is rather an abbreviated form of some such sentence as “Who, did you say, is coming to-night?” This is the special pleading that I have referred to, and it has a certain logic on its side. Yet the case is more hollow than the grammarian thinks it to be, for in reply to such a query as “You’re a good hand at bridge, John, aren’t you?” John, a little taken aback, might mutter “Did you say me?” hardly “Did you say I?” Yet the logic for the latter (“Did you say I was a good hand at bridge?”) is evident. The real point is that there is not enough vitality in the “whom” to carry it over such little difficulties as a “me” can compass without a thought. The proportion “I : me = he : him = who : whom” is logically and historically sound, but psychologically shaky. “Whom did you see?” is correct, but there is something false about its correctness.
It is worth looking into the reason for our curious reluctance to use locutions involving the word “whom” particularly in its interrogative sense. The only distinctively objective forms which we still possess in English are me, him, her (a little blurred because of its identity with the possessive her), us, them, and whom. In all other cases the objective has come to be identical with the subjective—that is, in outer form, for we are not now taking account of position in the sentence. We observe immediately in looking through the list of objective forms that whom is psychologically isolated. Me, him, her, us, and them form a solid, well-integrated group of objective personal pronouns parallel to the subjective series I, he, she, we, they. The forms who and whom are technically “pronouns” but they are not felt to be in the same box as the personal pronouns. Whom has clearly a weak position, an exposed flank, for words of a feather tend to flock together, and if one strays behind, it is likely to incur danger of life. Now the other interrogative and relative pronouns (which, what, that), with which whom should properly flock, do not distinguish the subjective and objective forms. It is psychologically unsound to draw the line of form cleavage between whom and the personal pronouns on the one side, the remaining interrogative and relative pronouns on the other. The form groups should be symmetrically related to, if not identical with, the function groups. Had which, what, and that objective forms parallel to whom, the position of this last would be more secure. As it is, there is something unesthetic about the word. It suggests a form pattern which is not filled out by its fellows. The only way to remedy the irregularity of form distribution is to abandon the whom altogether for we have lost the power to create new objective forms and cannot remodel our which-what-that group so as to make it parallel with the smaller group who-whom. Once this is done, who joins its flock and our unconscious desire for form symmetry is satisfied. We do not secretly chafe at “Whom did you see?” without reason.[134]
But the drift away from whom has still other determinants. The words who and whom in their interrogative sense are psychologically related not merely to the pronouns which and what, but to a group of interrogative adverbs—where, when, how—all of which are invariable and generally emphatic. I believe it is safe to infer that there is a rather strong feeling in English that the interrogative pronoun or adverb, typically an emphatic element in the sentence, should be invariable. The inflective -m of whom is felt as a drag upon the rhetorical effectiveness of the word. It needs to be eliminated if the interrogative pronoun is to receive all its latent power. There is still a third, and a very powerful, reason for the avoidance of whom. The contrast between the subjective and objective series of personal pronouns (I, he, she, we, they: me, him, her, us, them) is in English associated with a difference of position. We say I see the man but the man sees me; he told him, never him he told or him told he. Such usages as the last two are distinctly poetic and archaic; they are opposed to the present drift of the language. Even in the interrogative one does not say Him did you see? It is only in sentences of the type Whom did you see? that an inflected objective before the verb is now used at all. On the other hand, the order in Whom did you see? is imperative because of its interrogative form; the interrogative pronoun or adverb normally comes first in the sentence (What are you doing? When did he go? Where are you from?). In the “whom” of Whom did you see? there is concealed, therefore, a conflict between the order proper to a sentence containing an inflected objective and the order natural to a sentence with an interrogative pronoun or adverb. The solution Did you see whom? or You saw whom?[135] is too contrary to the idiomatic drift of our language to receive acceptance. The more radical solution Who did you see? is the one the language is gradually making for.
These three conflicts—on the score of form grouping, of rhetorical emphasis, and of order—are supplemented by a fourth difficulty. The emphatic whom, with its heavy build (half-long vowel followed by labial consonant), should contrast with a lightly tripping syllable immediately following. In whom did, however, we have an involuntary retardation that makes the locution sound “clumsy.” This clumsiness is a phonetic verdict, quite apart from the dissatisfaction due to the grammatical factors which we have analyzed. The same prosodic objection does not apply to such parallel locutions as what did and when did. The vowels of what and when are shorter and their final consonants melt easily into the following d, which is pronounced in the same tongue position as t and n. Our instinct for appropriate rhythms makes it as difficult for us to feel content with whom did as for a poet to use words like dreamed and hummed in a rapid line. Neither common feeling nor the poet’s choice need be at all conscious. It may be that not all are equally sensitive to the rhythmic flow of speech, but it is probable that rhythm is an unconscious linguistic determinant even with those who set little store by its artistic use. In any event the poet’s rhythms can only be a more sensitive and stylicized application of rhythmic tendencies that are characteristic of the daily speech of his people.
We have discovered no less than four factors which enter into our subtle disinclination to say “Whom did you see?” The uneducated folk that says “Who did you see?” with no twinge of conscience has a more acute flair for the genuine drift of the language than its students. Naturally the four restraining factors do not operate independently. Their separate energies, if we may make bold to use a mechanical concept, are “canalized” into a single force. This force or minute embodiment of the general drift of the language is psychologically registered as a slight hesitation in using the word whom. The hesitation is likely to be quite unconscious, though it may be readily acknowledged when attention is called to it. The analysis is certain to be unconscious, or rather unknown, to the normal speaker.[136] How, then, can we be certain in such an analysis as we have undertaken that all of the assigned determinants are really operative and not merely some one of them? Certainly they are not equally powerful in all cases. Their values are variable, rising and falling according to the individual and the locution.[137] But that they really exist, each in its own right, may sometimes be tested by the method of elimination. If one or other of the factors is missing and we observe a slight diminution in the corresponding psychological reaction (“hesitation” in our case), we may conclude that the factor is in other uses genuinely positive. The second of our four factors applies only to the interrogative use of whom, the fourth factor applies with more force to the interrogative than to the relative. We can therefore understand why a sentence like Is he the man whom you referred to? though not as idiomatic as Is he the man (that) you referred to? (remember that it sins against counts one and three), is still not as difficult to reconcile with our innate feeling for English expression as Whom did you see? If we eliminate the fourth factor from the interrogative usage,[138] say in Whom are you looking at? where the vowel following whom relieves this word of its phonetic weight, we can observe, if I am not mistaken, a lesser reluctance to use the whom. Who are you looking at? might even sound slightly offensive to ears that welcome Who did you see?
We may set up a scale of “hesitation values” somewhat after this fashion:
- Value 1: factors 1, 3. “The man whom I referred to.”
- Value 2: factors 1, 3, 4. “The man whom they referred to.”
- Value 3: factors 1, 2, 3. “Whom are you looking at?”
- Value 4: factors 1, 2, 3, 4. “Whom did you see?”
We may venture to surmise that while whom will ultimately disappear from English speech, locutions of the type Whom did you see? will be obsolete when phrases like The man whom I referred to are still in lingering use. It is impossible to be certain, however, for we can never tell if we have isolated all the determinants of a drift. In our particular case we have ignored what may well prove to be a controlling factor in the history of who and whom in the relative sense. This is the unconscious desire to leave these words to their interrogative function and to concentrate on that or mere word order as expressions of the relative (e.g., The man that I referred to or The man I referred to). This drift, which does not directly concern the use of whom as such (merely of whom as a form of who), may have made the relative who obsolete before the other factors affecting relative whom have run their course. A consideration like this is instructive because it indicates that knowledge of the general drift of a language is insufficient to enable us to see clearly what the drift is heading for. We need to know something of the relative potencies and speeds of the components of the drift.
It is hardly necessary to say that the particular drifts involved in the use of whom are of interest to us not for their own sake but as symptoms of larger tendencies at work in the language. At least three drifts of major importance are discernible. Each of these has operated for centuries, each is at work in other parts of our linguistic mechanism, each is almost certain to continue for centuries, possibly millennia. The first is the familiar tendency to level the distinction between the subjective and the objective, itself but a late chapter in the steady reduction of the old Indo-European system of syntactic cases. This system, which is at present best preserved in Lithuanian,[139] was already considerably reduced in the old Germanic language of which English, Dutch, German, Danish, and Swedish are modern dialectic forms. The seven Indo-European cases (nominative genitive, dative, accusative, ablative, locative, instrumental) had been already reduced to four (nominative genitive, dative, accusative). We know this from a careful comparison of and reconstruction based on the oldest Germanic dialects of which we still have records (Gothic, Old Icelandic, Old High German, Anglo-Saxon). In the group of West Germanic dialects, for the study of which Old High German, Anglo-Saxon, Old Frisian, and Old Saxon are our oldest and most valuable sources, we still have these four cases, but the phonetic form of the case syllables is already greatly reduced and in certain paradigms particular cases have coalesced. The case system is practically intact but it is evidently moving towards further disintegration. Within the Anglo-Saxon and early Middle English period there took place further changes in the same direction. The phonetic form of the case syllables became still further reduced and the distinction between the accusative and the dative finally disappeared. The new “objective” is really an amalgam of old accusative and dative forms; thus, him, the old dative (we still say I give him the book, not “abbreviated” from I give to him; compare Gothic imma, modern German ihm), took over the functions of the old accusative (Anglo-Saxon hine; compare Gothic ina, Modern German ihn) and dative. The distinction between the nominative and accusative was nibbled away by phonetic processes and morphological levelings until only certain pronouns retained distinctive subjective and objective forms.
In later medieval and in modern times there have been comparatively few apparent changes in our case system apart from the gradual replacement of thou—thee (singular) and subjective ye—objective you (plural) by a single undifferentiated form you. All the while, however, the case system, such as it is (subjective-objective, really absolutive, and possessive in nouns; subjective, objective, and possessive in certain pronouns) has been steadily weakening in psychological respects. At present it is more seriously undermined than most of us realize. The possessive has little vitality except in the pronoun and in animate nouns. Theoretically we can still say the moon’s phases or a newspaper’s vogue; practically we limit ourselves pretty much to analytic locutions like the phases of the moon and the vogue of a newspaper. The drift is clearly toward the limitation, of possessive forms to animate nouns. All the possessive pronominal forms except its and, in part, their and theirs, are also animate. It is significant that theirs is hardly ever used in reference to inanimate nouns, that there is some reluctance to so use their, and that its also is beginning to give way to of it. The appearance of it or the looks of it is more in the current of the language than its appearance. It is curiously significant that its young (referring to an animal’s cubs) is idiomatically preferable to the young of it. The form is only ostensibly neuter, in feeling it is animate; psychologically it belongs with his children, not with the pieces of it. Can it be that so common a word as its is actually beginning to be difficult? Is it too doomed to disappear? It would be rash to say that it shows signs of approaching obsolescence, but that it is steadily weakening is fairly clear.[140] In any event, it is not too much to say that there is a strong drift towards the restriction of the inflected possessive forms to animate nouns and pronouns.