§ 10. The position then which we have now reached is this. Taking it for granted that the Law of Error will fall into the symbolic form expressed by the equation y = (h/√π) e^(−h²x²), we have rules at hand by which h may be determined. We therefore, for the purposes in question, know all about the curve of frequency: we can trace it out on paper: given one value,—say the central one,—we can determine any other value at any distance from this. That is, knowing how many men in a million, say, are 69 inches high, we can determine without direct observation how many will be 67, 68, 70, 71, and so on.
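The claim that one value fixes all the others may be illustrated by evaluating the law directly; the sketch below is a modern gloss on the formula, and the particular value chosen for h is merely illustrative, not drawn from the text.

```python
import math

def law_of_error(x, h):
    """Frequency ordinate y = (h/√π)·exp(−h²x²) at distance x from the mean."""
    return (h / math.sqrt(math.pi)) * math.exp(-(h * x) ** 2)

h = 0.5  # illustrative modulus; once h is fixed, every ordinate follows
y_central = law_of_error(0.0, h)   # the central (maximum) ordinate, h/√π
y_distant = law_of_error(2.0, h)   # an ordinate two units from the mean
assert abs(y_central - h / math.sqrt(math.pi)) < 1e-12
assert y_distant < y_central       # the curve falls away from the centre
```

The central ordinate is h/√π, so observing the frequency at the mean alone suffices to determine h, and with it the whole curve.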
We can now adequately discuss the principal question of logical interest before us; viz. why do we take averages or means? What is the exact nature and amount of the advantage gained by so doing? The advanced student would of course prefer to work out the answers to these questions by appealing at once to the Law of Error in its ultimate or exponential form. But I feel convinced that the best method for those who wish to gain a clear conception of the logical nature of the process involved, is to begin by treating it as a question of combinations such as we are familiar with in elementary algebra; in other words to take a finite number of errors and to see what comes of averaging these. We can then proceed to work out arithmetically the results of combining two or more of the errors together so as to get a new series, not contenting ourselves with the general character merely of the new law of error, but actually calculating what it is in the given case. For the sake of simplicity we will not take a series with a very large number of terms in it, but it will be well to have enough of them to secure that our law of error shall roughly approximate in its form to the standard or exponential law.
For this purpose the law of error or divergence given by supposing our effort to be affected by ten causes, each of which produces an equal error, but which error is equally likely to be positive and negative (or, as it might perhaps be expressed, ‘ten equal and indifferently additive and subtractive causes’) will suffice. This is the lowest number formed according to the Binomial law, which will furnish to the eye a fair indication of the limiting or Exponential law.[5] The whole number of possible cases here is 210 or 1024; that is, this is the number required to exhibit not only all the cases which can occur (for there are but eleven really distinct cases), but also the relative frequency with which each of these cases occurs in the long run. Of this total, 252 will be situated at the mean, representing the ‘true’ result, or that given when five of the causes of disturbance just neutralize the other five. Again, 210 will be at what we will call one unit's distance from the mean, or that given by six causes combining against four; and so on; until at the extreme distance of five places from the mean we get but one result, since in only one case out of the 1024 will all the causes combine together in the same direction. The set of 1024 efforts is therefore a fair representation of the distribution of an infinite number of such efforts. A graphical representation of the arrangement is given here.
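The numbers just enumerated—1024 cases in all, 252 at the mean, 210 at one unit's distance, one at the extreme—are simply the coefficients of the Binomial law for ten causes, and may be verified at once (the code is of course a modern illustration of the arithmetic, not part of the argument):

```python
from math import comb

# Ten equal causes, each as likely to add as to subtract: the number of
# cases at k positive causes is the binomial coefficient C(10, k).
counts = [comb(10, k) for k in range(11)]

assert counts == [1, 10, 45, 120, 210, 252, 210, 120, 45, 10, 1]
assert sum(counts) == 2**10   # 1024 possible cases in the complete set
assert counts[5] == 252       # at the mean: five causes neutralize five
assert counts[6] == 210       # one unit's distance: six against four
assert counts[10] == 1        # the extreme: all ten combine one way
```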
§ 11. This representing a complete set of single observations or efforts, what will be the number and arrangement in the corresponding set of combined or reduced observations, say of two together? With regard to the number we must bear in mind that this is not a case of the combinations of things which cannot be repeated; for any given error, say the extreme one at F, can obviously be repeated twice running. Such a repetition would be a piece of very bad luck no doubt, but being possible it must have its place in the set. Now the possible number of ways of combining 1024 things two together, where the same thing may be repeated twice running, is 1024 × 1024 or 1048576. This then is the number in a complete cycle of the results taken two and two together.
§ 12. So much for their number; now for their arrangement or distribution. What we have to ascertain is, firstly, how many times each possible pair of observations will present itself; and, secondly, where the new results, obtained from the combination of each pair, are to be placed. With regard to the first of these enquiries;—it will be readily seen that on one occasion we shall have F repeated twice; on 20 occasions we shall have F combined with E (for F coming first we may have it followed by any one of the 10 at E, or any one of these may be followed by F); E can be repeated in 10 × 10, or 100 ways, and so on.
Now for the position of each of these reduced observations, the relative frequency of whose component elements has thus been pointed out. This is easy to determine, for when we take two errors there is (as was seen) scarcely any other mode of treatment than that of selecting the mid-point between them; this mid-point of course becoming identical with each of them when the two happen to coincide. It will be seen therefore that F will recur once on the new arrangement, viz. by its being repeated twice on the old one. G, midway between E and F, will be given 20 times. E, on our new arrangement, can be got at in two ways, viz. by its being repeated twice (which will happen 100 times), and by its being obtained as the mid-point between D and F (which will happen 90 times). Hence E will occur 190 times altogether.
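The whole arrangement of the reduced observations may be computed at a stroke: every ordered pair of single observations is placed at its mid-point, which amounts to counting pairs at half-unit steps. The sketch below (again a modern restatement of the arithmetic, with positions numbered 0 to 10 from the extreme F) recovers the figures given in the text:

```python
from math import comb

# Single observations: binomial counts at positions 0..10, position 0
# being the extreme F and position 5 the mean.
counts = [comb(10, k) for k in range(11)]

# Combine every ordered pair (repetition allowed, as the text insists);
# the mid-point of positions i and j lies at half-unit step i + j.
paired = [0] * 21
for i, ci in enumerate(counts):
    for j, cj in enumerate(counts):
        paired[i + j] += ci * cj

assert sum(paired) == 1024 * 1024   # 1048576: the complete cycle of pairs
assert paired[0] == 1               # F repeated twice running
assert paired[1] == 20              # G, midway between F and E
assert paired[2] == 190             # E: 100 (E twice) + 90 (D with F)
```

The resulting series of 21 half-unit positions is the new law of error for the averages of two, clustered more closely about the mean than the original.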