I think the experiment involves factors quite comparable to those that determine the verdict of a jury. The cards with their spots are the evidence pro and con which each juryman has before him to interpret. Each person’s decision on the number is his interpretation of the situation. The arguments, too, seem quite comparable to the arguments of the jury. Both consist in pointing out factors of the situation that have been overlooked, and in showing how different interpretations may be possible.

Another man wrote:

In the experiment it seemed that one man judged by one criterion and another by another, such as distribution, size of spots, vacant spaces, or counting along one edge. Discussion often brought a man’s attention to other criterions than those he used in his first judgment, and these often outweighed the original. Similarly, different jurymen would base their opinion on different aspects of the case, and discussion would tend to draw their attention to other aspects. The experiment also illustrated the relative weight given to the opinion of different fellow-jurymen. I found that the statements of a few of the older men who have had more extensive psychological experience weighed more with me than those of the others. Suggestion did not seem to be much of a factor. A man is rather on his mettle, and ready to defend his original impression until he finds that it is hopeless.

Again, another wrote:

To me the experiment seemed fairly comparable to the real situation. As in an actual trial, the full truth was not available, but, certain evidence was presented to all for interpretation. As to the nature of the discussion itself, I think there was the same mingling of suggestion and real argument that is to be found in a jury discussion.

Another said:

The discussion influenced me by suggesting other methods of analysis. For instance, comparison of the amount of open space in two cards, comparison of the number of dots along the edges, estimation in diagonal lines, were methods mentioned in the discussion which I used in forming my own judgments. It does not seem to me that in my own case direct suggestion had any appreciable effect. I was aware of a tendency toward contrasuggestibility. There was a half-submerged feeling that it would be good sport to stick it out for the losing side. The lack of any unusual amount of suggestion and the presence of the influences of analysis and detailed comparison seem to me to show that the tests were in fact fairly comparable to situations in a jury-room.

To be sure, there were a few who were strongly impressed by the evident differences between the rich material of an actual trial and the meager content of our tests, there the actions of living men, here the space relations of little spots. But they evidently did not sufficiently realize that the forming of such number judgments was not at all a question of mere perception; that, on the contrary, many considerations were involved. Most men felt the similarity from the start.

What were the results of this first group of experiments? Our interest must evidently be centered on the question of how many judgments were correct at the first vote before any discussion and any show of hands were influencing the minds of the men, and how many were correct at the last vote, after the two periods of discussion and after taking cognizance of the two preceding votes. If I sum up all the results, the outcome is that fifty-two per cent. of the first votes were correct, and seventy-eight per cent. of the final votes were correct. The discussion of the successive votes had therefore led to an improvement of twenty-six per cent. of all votes. Or, as the correct votes were at first fifty-two per cent., their number is increased by one half. May we not say that this demonstration in exact figures proves that the confidence in the jury system is justified? And may it not be added that, in view of the wide-spread prejudices, the result is almost surprising? Here we had men of high intelligence who were completely able to take account of every possible aspect of the situation. They had time to do so, they had training to do so, and every foregoing experiment ought to have stimulated them to do so in the following ones. Yet their judgment was right in only fifty-two per cent. of the cases until they heard the opinions of the others and saw how they voted. The mere seeing of the vote, however, cannot have been decisive, because forty-eight per cent.,—that is, virtually half of the votes,—were at first incorrect. The wrong votes might have had as much suggestive influence on those who voted rightly as the right votes on those on the wrong side. Nevertheless, if the change was so strongly in the right direction, the result must clearly have come from the discussion.

But I am not at the end of my story. I also made exactly the same experiments with a class of advanced female university students. When I started, my aim was not to examine the differences of men and women, but only to have ampler material, and I confined my work to students in psychological classes because I was anxious to get the best possible scientific analysis of the inner experiences. I had no prejudice in favor of or against women as members of the jury, any more than my experiments were guided by a desire to defend or to attack the jury system. I was anxious only to clear up the facts. The women students had exactly the same opportunities for seeing the cards and the votes and for exchanging opinions. The discussions, while carried on for the same length of time, were on the whole less animated. There was less desire to convince and more restraint; but the record which was taken in shorthand showed nearly the same variety of arguments that the men had brought forward. Everything agreed exactly with the experiments of the men, and the only difference was in the results. The first vote of all experiments with the women showed a slightly smaller number of right judgments. The women had forty-five per cent. correct judgments, as against the fifty-two per cent. of the men. I should not put any emphasis on this difference. It may be said that the men had more training in scientific observations, and the task was therefore slightly easier for them than for most of the women. I should say that, all taken together, men and women showed an equal ability in immediate judgment, as with both groups about half of the first judgments were correct. The fact that with the men two per cent. more, with the women five per cent. less, than half were right would not mean much. But the situation is entirely different with the second figure. We saw that for the men the discussion secured an increase from fifty-two per cent. to seventy-eight per cent.; with the women the increase is not a single per cent. The first votes were forty-five per cent. right, and the last votes were forty-five per cent. right. In other words, they had not learned anything from discussion.