Voter Satisfaction Index

Utility measurements: Group A: 5 candidates, 20 voters, random utilities; Each entry averages the results from 4,000,000 simulated elections. Group B: 5 candidates, 50 voters, utilities based on 2 issues, each entry averages the results from 2,222,222 simulated elections.

Voting system	VSI A	VSI B
Magically elect optimum winner	100.00%	100.00%
Range (honest voters)	96.71%	94.66%
Borda (honest voters)	91.31%	89.97%
Approval (honest voters)	86.30%	83.53%
Condorcet-LR (honest voters)	85.19%	85.43%
Range & Approval (strategic exaggerating voters)	78.99%	77.01%
IRV (honest voters)	78.49%	76.32%
Plurality (honest voters)	67.63%	62.29%
Borda (strategic exaggerating voters)	53.26%	51.78%
Condorcet-LR (strategic exaggerating voters)	42.56%	41.31%
IRV (strategic exaggerating voters)	39.07%	39.21%
Plurality (strategic voters)	39.07%	39.21%
Elect random winner	0.00%	0.00%

These results are from 2 of 720 different models, with millions of elections simulated for each model. Note that range voting is approximately as great an improvement over plurality voting, as plurality is over random selection; range voting effectively doubles the benefit brought about by the invention of democracy. These experimental results also strongly suggest that range voting is the least susceptible to strategic voting, of these common methods.

Testing Utility

In order to get an idea of how utilitarian various voting systems are, there are two fundamental methods we can employ. If we could somehow precisely read minds, to get honest utility values, and if we had the money to scientifically study and interview millions of voters over millions of real elections, using all different kinds of election methods, we would certainly prefer to do that. But we can't read minds, nor can we hold millions of large-scale elections. We can't even use results from real elections, because you cannot infer utilities based on votes. For example, in a plurality election for the fruit in the above scenario, boy 1 and boy 2 would have voted for banana, and boy 3 would have voted for orange. But we would have no way to determine their honest utility values based on that information. Hence the second method.

While we can't infer utilities from votes, we can infer votes from utilities, making certain assumptions about how honest/strategic, or informed/ignorant the voters are. Thus, when Princeton Ph.D. Warren D. Smith set out to calculate the utility produced by various voting methods, he used five parameters, or "knobs", to specify these things, and then used a computer program he wrote, to perform millions of simulated elections for each of 720 different parametrizations, or knob settings. The number of candidates were varied from two, to several. Some elections used 100% honest voters, while others used 100% strategic voters. The effect of voter ignorance was simulated, such that uninformed voters might behave as though they liked a candidate more or less than they really would have, had they researched him further. The goal was to produce enough simulations with enough different knob settings, that at least some of them would very closely model reality. This page explains in greater detail why we use computer simulations to measure utility.

Say we were to use the values from the scenario above, in which three brothers buy a piece of fruit at the market. We would create millions of scenarios like this, using different fruits, and different buyers, and of course, different voting methods. But what system should we use to describe the results? Warren D. Smith prefers a method called "Bayesian regret", which actually measures how much utility was wasted because a voting method chose the wrong winner. In our fruit scenario, the regret is simply the utility produced by the chosen fruit, subtracted from the utility produced by the ideal fruit, orange. So picking the orange would produce a regret of 0, whereas picking the apple would produce a regret of 6. Bayesian regret is the average of this value, over as many hypothetical scenarios as we choose to execute. An ideal voting system that always picked the ideal winner would have a Bayesian regret of 0, by definition.

While doctor Smith cites some academic reasons for preferring to express utility in the form of Bayesian regret, some find this system problematic. One of the chief criticisms of this method is that a lower number is actually better, and this can confuse people who are new to the concept. Another problem that some cite is that the utility units have an arbitrary magnitude, making it difficult to compare Bayesian regret figures from two different simulations.

A proposed solution to these problems is to use voter satisfaction indexes. VSI is calculated as

Voting method	Average voter utility (in "happiness units") = U	Voter satisfaction index (U - R) ÷ (O - R)	Bayesian regret (O - U)
Magically elect optimum winner	1.31348 = O	100%	0
Range voting	1.26407	96.71%	0.04941
Borda count	1.18293	91.31%	0.13055
Approval voting	1.10773	86.30%	0.20575
Condorcet (LR method)	1.09101	85.19%	0.22247
Instant runoff voting	0.99034	78.49%	0.32314
Plurality voting	0.8272	67.63%	0.48628
Elect random winner	-0.1887 = R	0%	1.50218

Range voting produces the highest voter satisfaction index

Testing Utility

Expressing Utility

Conclusions

Improve These Simulations