III

In the last of these papers we examined the nature of inductive or synthetic reasoning. We found it to be a process of sampling. A number of specimens of a class are taken, not by selection within that class, but at random. These specimens will agree in a great number of respects. If, now, it were likely that a second lot would agree with the first in the majority of these respects, we might base on this consideration an inference in regard to any one of these characters. But such an inference would neither be of the nature of induction, nor would it (except in special cases) be valid, because the vast majority of points of agreement in the first sample drawn would generally be entirely accidental, as well as insignificant. To illustrate this, I take the ages at death of the first five poets given in Wheeler’s Biographical Dictionary. They are:

Aagard, 48.

Abeille, 70.

Abulola, 84.

Abunowas, 48.

Accords, 45.

These five ages have the following characters in common:

1. The difference of the two digits composing the number, divided by three, leaves a remainder of one.

2. The first digit raised to the power indicated by the second, and divided by three, leaves a remainder of one.

3. The sum of the prime factors of each age, including one, is divisible by three.

It is easy to see that the number of accidental agreements of this sort would be quite endless. But suppose that, instead of considering a character because of its prevalence in the sample, we designate a character before taking the sample, selecting it for its importance, obviousness, or other point of interest. Then two considerable samples drawn at random are extremely likely to agree approximately in regard to the proportion of occurrences of a character so chosen. The inference that a previously designated character has nearly the same frequency of occurrence in the whole of a class that it has in a sample drawn at random out of that class is induction. If the character be not previously designated, then a sample in which it is found to be prevalent can only serve to suggest that it may be prevalent in the whole class. We may consider this surmise as an inference if we please—an inference of possibility; but a second sample must be drawn to test the question of whether the character actually is prevalent. Instead of designating beforehand a single character in reference to which we will examine a sample, we may designate two, and use the same sample to determine the relative frequencies of both. This will be making two inductive inferences at once; and, of course, we are less certain that both will yield correct conclusions than we should be that either separately would do so. What is true of two characters is true of any limited number. Now, the number of characters which have any considerable interest for us in reference to any class of objects is more moderate than might be supposed. As we shall be sure to examine any sample with reference to these characters, they may be regarded not exactly as predesignated, but as predetermined (which amounts to the same thing); and we may infer that the sample represents the class in all these respects if we please, remembering only that this is not so secure an inference as if the particular quality to be looked for had been fixed upon beforehand.

The demonstration of this theory of induction rests upon principles and follows methods which are accepted by all those who display in other matters the particular knowledge and force of mind which qualify them to judge of this. The theory itself, however, quite unaccountably seems never to have occurred to any of the writers who have undertaken to explain synthetic reasoning. The most widely-spread opinion in the matter is one which was much promoted by Mr. John Stuart Mill—namely, that induction depends for its validity upon the uniformity of Nature—that is, on the principle that what happens once will, under a sufficient degree of similarity of circumstances, happen again as often as the same circumstances recur. The application is this: The fact that different things belong to the same class constitutes the similarity of circumstances, and the induction is good, provided this similarity is “sufficient.” What happens once is, that a number of these things are found to have a certain character; what may be expected, then, to happen again as often as the circumstances recur consists in this, that all things belonging to the same class should have the same character.

This analysis of induction has, I venture to think, various imperfections, to some of which it may be useful to call attention. In the first place, when I put my hand in a bag and draw out a handful of beans, and, finding three-quarters of them black, infer that about three-quarters of all in the bag are black, my inference is obviously of the same kind as if I had found any larger proportion, or the whole, of the sample black, and had assumed that it represented in that respect the rest of the contents of the bag. But the analysis in question hardly seems adapted to the explanation of this proportionate induction, where the conclusion, instead of being that a certain event uniformly happens under certain circumstances, is precisely that it does not uniformly occur, but only happens in a certain proportion of cases. It is true that the whole sample may be regarded as a single object, and the inference may be brought under the formula proposed by considering the conclusion to be that any similar sample will show a similar proportion among its constituents. But this is to treat the induction as if it rested on a single instance, which gives a very false idea of its probability.

In the second place, if the uniformity of Nature were the sole warrant of induction, we should have no right to draw one in regard to a character whose constancy we knew nothing about. Accordingly, Mr. Mill says that, though none but white swans were known to Europeans for thousands of years, yet the inference that all swans were white was “not a good induction,” because it was not known that color was a usual generic character (it, in fact, not being so by any means). But it is mathematically demonstrable that an inductive inference may have as high a degree of probability as you please independent of any antecedent knowledge of the constancy of the character inferred. Before it was known that color is not usually a character of genera, there was certainly a considerable probability that all swans were white. But the further study of the genera of animals led to the induction of their non-uniformity in regard to color. A deductive application of this general proposition would have gone far to overcome the probability of the universal whiteness of swans before the black species was discovered. When we do know anything in regard to the general constancy or inconstancy of a character, the application of that general knowledge to the particular class to which any induction relates, though it serves to increase or diminish the force of the induction, is, like every application of general knowledge to particular cases, deductive in its nature and not inductive.

In the third place, to say that inductions are true because similar events happen in similar circumstances—or, what is the same thing, because objects similar in some respects are likely to be similar in others—is to overlook those conditions which really are essential to the validity of inductions. When we take all the characters into account, any pair of objects resemble one another in just as many particulars as any other pair. If we limit ourselves to such characters as have for us any importance, interest, or obviousness, then a synthetic conclusion may be drawn, but only on condition that the specimens by which we judge have been taken at random from the class in regard to which we are to form a judgment, and not selected as belonging to any sub-class. The induction only has its full force when the character concerned has been designated before examining the sample. These are the essentials of induction, and they are not recognized in attributing the validity of induction to the uniformity of Nature. The explanation of induction by the doctrine of probabilities, given in the last of these papers, is not a mere metaphysical formula, but is one from which all the rules of synthetic reasoning can be deduced systematically and with mathematical cogency. But the account of the matter by a principle of Nature, even if it were in other respects satisfactory, presents the fatal disadvantage of leaving us quite as much afloat as before in regard to the proper method of induction. It does not surprise me, therefore, that those who adopt this theory have given erroneous rules for the conduct of reasoning, nor that the greater number of examples put forward by Mr. Mill in his first edition, as models of what inductions should be, proved in the light of further scientific progress so particularly unfortunate that they had to be replaced by others in later editions. One would have supposed that Mr. Mill might have based an induction on this circumstance, especially as it is his avowed principle that, if the conclusion of an induction turns out false, it cannot have been a good induction. Nevertheless, neither he nor any of his scholars seem to have been led to suspect, in the least, the perfect solidity of the framework which he devised for securely supporting the mind in its passage from the known to the unknown, although at its first trial it did not answer quite so well as had been expected.