2.2. Condorcet to the rescue?
К оглавлению1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 1617 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33
34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50
51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67
68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84
85 86 87 88 89 90 91 92 93 94
Condorcet’s jury theorem, in its simplest form, says that if a jury is facing
a binary choice and each jury member makes her decision independently
and has a better-than-even chance of making the right decision, a simple
majority of the jurors is likely to make the right decision, and this will
tend toward certainty as the number of jurors tends toward infinity. We
can think of the successful linear models we have introduced as a jury: The
jury must make a binary decision about a target, and each jury member
makes her decision on the basis of a single piece of evidence. Each piece of
evidence correlates positively with the target; so each juror’s decision is
going to be right more often than not. And the linear model adds together
each juror’s judgment to come to a final decision about the target. The
only difference between the different types of models is that some weigh
certain lines of evidence more than others. Putting this in terms of our
jury analogy, some models have more jurors focusing on certain lines of
evidence than others. So given Condorcet’s jury theorem, we should expect
linear models to predict reasonably well. (Thanks to Michael Strevens
and Mark Wunderlich for suggesting this explanation.)
The Condorcet explanation leaves open at least two questions. First,
many successful linear models consist of a small number of cues (sometimes
as few as two). But Condorcet’s jury theorem suggests that high
reliability usually requires many jurors. So the success of linear models still
seems a bit mysterious. Second, why are linear models, particularly those
with a very small number of cues, more reliable than human experts? After
all, if human experts are able to use a larger number of reliable cues than
simple linear models, why doesn’t the Condorcet explanation imply that
they will typically be more reliable than the models? We will address these
questions in section 3. But for now, let’s turn to a different explanation for
the success of linear models.
Condorcet’s jury theorem, in its simplest form, says that if a jury is facing
a binary choice and each jury member makes her decision independently
and has a better-than-even chance of making the right decision, a simple
majority of the jurors is likely to make the right decision, and this will
tend toward certainty as the number of jurors tends toward infinity. We
can think of the successful linear models we have introduced as a jury: The
jury must make a binary decision about a target, and each jury member
makes her decision on the basis of a single piece of evidence. Each piece of
evidence correlates positively with the target; so each juror’s decision is
going to be right more often than not. And the linear model adds together
each juror’s judgment to come to a final decision about the target. The
only difference between the different types of models is that some weigh
certain lines of evidence more than others. Putting this in terms of our
jury analogy, some models have more jurors focusing on certain lines of
evidence than others. So given Condorcet’s jury theorem, we should expect
linear models to predict reasonably well. (Thanks to Michael Strevens
and Mark Wunderlich for suggesting this explanation.)
The Condorcet explanation leaves open at least two questions. First,
many successful linear models consist of a small number of cues (sometimes
as few as two). But Condorcet’s jury theorem suggests that high
reliability usually requires many jurors. So the success of linear models still
seems a bit mysterious. Second, why are linear models, particularly those
with a very small number of cues, more reliable than human experts? After
all, if human experts are able to use a larger number of reliable cues than
simple linear models, why doesn’t the Condorcet explanation imply that
they will typically be more reliable than the models? We will address these
questions in section 3. But for now, let’s turn to a different explanation for
the success of linear models.