Best Rank-Order Voting Systems versus Range Voting II: N≥4 candidates under the Random Normal Elections Model (RNEM)

By Warren D. Smith. Email comments to: warren.wds AT gmail.com.

Draft#1: 20 Oct 2008. Draft#2: 6 Nov 2008. Declaring it final 27 January 2009.

Abstract

Regard this as "Part II" of the preceding Best Rank-Order Voting Systems versus Range Voting. We shall re-use notions, definitions, and understanding from Part I.

Part I gave methodology for provably identifying "best" voting systems (measured via "Bayesian Regret") and used it to

Identify the best rank-order-ballot voting system for honest voters under the RNEM ("random normal elections model");
Show the superiority of "range voting" versus every rank-order-ballot voting system in 3-candidate RNEM elections, for any mixture of honest and strategic voters.

We now go further by identifying the pseudo-best rank-order-ballot voting system for strategic voters in N-candidate elections. (This is under the RNEM, and for a certain kind of "strategic voting" we call the "moving-average strategy"; here "pseudo-best" means, essentially, that the voting system is best for strategic voters who all have independent random notions of the presumed winning chances of the candidates. "Best" and "pseudo-best" coincide if N≤3. Severe difficulties arise if one tries to handle "best" for N≥4. Readers unhappy with the "pseudo-best" concept can console themselves with our theorem that "pseudo-best rank-order system" and "best weighted-positional system" mean the same thing under the RNEM.) At least if the issue of "best" versus "pseudo-best" is ignored, we show the superiority of Range Voting over every rank-order system for any number N of candidates with 3≤N≤31 for V→∞ honest voters, and also for strategic voters, and finally if 3≤N≤9 for any mixture of honest and strategic voters. We also tabulate the RNEM Bayesian Regrets and "wrong winner probabilities" of these and other voting systems, for many values of N, to about 4 decimals. Finally, we examine the N→∞ limit, showing that all sufficiently large N, honest range voting has regret which in the limit is zero relative to that of "random winner," whereas various other voting methods like Approval and Borda have asymptotically nonzero relative regret; and strategic range=approval voting has asymptotic regret which is a constant factor times smaller than that of random winner, although certain other voting methods (e.g. plurality) with strategic voting have asymptotically the same regret as random winner.

All our results can be or are proven in fully rigorous mathematical style except

The results with N≤31 depend on certain numerical integrations;
We show that the regret of range voting is less than the regret of the best rank-order system by plotting two regret-versus-electorate-composition curves, and seeing one curve always is above the other.

While this level of rigor seems good enough for the purposes of economists and political scientists, it falls short of the standards of ultrapure mathematicians. Part I had already outlined mechanical methods for converting proofs of this sort to full rigor. We show how those techniques can be extended to also work here, but we have not, and doing so would be even more painful here than in Part I, especially for larger N.

1. Introduction

Part I had explained the "correlation based procedure" for computing Bayesian Regrets in the V→∞ limit of V-voter RNEM elections for a large variety of voting systems. Two kinds of multidimensional integration problems arose inside that procedure:

Computing "correlations" and "variances";
Computing "regrets" and/or "wrong-winner probabilities."

In Part I, we evaluated the correlations and variances for many voting systems (including range voting and the best-rank-order-ballot and best-ratings-ballot systems) in 3-candidate elections as closed formulas involving mathematical constants such as π. We then used those formulas to evaluate the regrets and wrong-winner probabilities by either numerical or symbolic integration (indeed, we showed how all these could also be done symbolically in closed form, but in some cases the formulas are large).

But it is possible instead to do both kinds of integration numerically. If we employ the simplest kind of numerical integration procedure – Monte Carlo integration – we then get a quite simple computer program adequate to reach 4-decimal accuracy for all numbers N of candidates with 3≤N≤31.

There is one problem, though. If correlation matrices are computed using inexact numerical-integration methods rather than exact formulas, they can sometimes fail to be positive definite. (But of course, genuine correlation matrices are always positive definite symmetric, and with 1s in each diagonal entry.) Most of the time, this is not a problem, which is because, as a consequence of "Gershgorin's circle theorem" from linear algebra, any sufficiently small-norm symmetric perturbation of a positive definite symmetric matrix, still is positive definite. However, as we saw in Part I, many voting methods yield "degenerate" aka "reduced-rank" correlation matrices, which are merely positive semidefinite. For them, an arbitrarily small perturbation always exists which moves us outside of the domain of positive definite matrices! With indefinite correlation matrices, we're dead – the whole "correlation based procedure" from section 5 of Part I grinds to a halt.

This obstacle can be dodged in those cases (such as "weighted positional" voting systems with 100% honest voters) where exact formulas are available giving every entry of the correlation matrices (see section 5.1 of Part I). It occasionally struck in other cases, but I was able, by either perturbing the matrix entries by a fixed perturbation by at most a few parts in 10⁹, or by doing several re-runs with different random seeds (or both), to avoid this problem in every case.

An engineer, confronted with this problem, might apply a numerical method (e.g. Higham 2002 & 2008) for replacing a putative correlation matrix by the nearest matrix which genuinely is positive definite symmetric with unit diagonal, thus "repairing" the problem. However, in this paper I am trying to be a mathematician, not an engineer, and my goal is to prove statements about optimal Bayesian Regrets of voting methods – or at least, provide an incomplete proof which we show could be mechanically completed, by carrying out certain computations with "interval arithmetic" starting from rigorous error bounds – this is a lesser standard of "proof" which I regard as still adequate for many purposes. Some kinds of computations – e.g. carried out by "bounded depth arithmetic circuits" – are clearly going to work well with interval arithmetic. But it is not immediately clear that would be the case for Higham's algorithm, which involves a fairly complicated matrix process iterated to convergence. I want everything to be maximally simple. I therefore have abstained from these kinds of repair-attempt here.

The output of the program is given in table 1. If the random number generators are trusted, then "Chernoff bounds" show (for all the integration problems here) "exponential tails," which means we can be highly confident the numbers the computerized Monte-Carlo outputs, are not far from the truth. Based on various re-run checks, we believe the standard errors in all numbers are at most a few units in the last decimal place.

**Table 1.** **Wrong winner percentages** and **V^-1/2Regrets** for voting methods under N-candidate V-voter RNEM with V→∞. Values were computed either from exact formulas and/or by correlation-based procedure using separate Monte Carlo integrations to compute *both* correlations and regrets. (For N≤3, see Part I.) Honest Range Voting is superior (i.e. has less regret) to the best rank-order voting system for each N with 3≤N≤31. If we instead have strategic voters, then the same thing happens (at least, if the distinction between "pseudo-best" and "best" is ignored) indicated by comparing "strategic approval=range" versus the strategic rank-order systems. When N=4, strategic range voting is also superior to the best "semi-strategic" rank-order system, which is "top-2" voting. See also notes below.
line #	#Canddts→ ↓Voting Method	N=4	N=5	N=6	N=7	N=8	N=9	N=31
1	Magic Best	0%, 0	0%, 0	0%, 0	0%, 0	0%, 0	0%, 0	0%, 0
2	Honest Range	22.50%, 0.08299	22.28%, 0.0731	21.97%, 0.0657	21.67%, 0.0602	21.38%, 0.0559	21.10%, 0.0523	18.13%, 0.0279
3	Honest Best Ranked	27.83%, 0.1289	28.64%, 0.1236	28.93%, 0.1174	28.96%, 0.1113	28.86%, 0.1057	28.69%, 0.1005	23.90%, 0.0504
4	Honest Borda	27.91%, 0.1297	28.85%, 0.1255	29.32%, 0.1207	29.55%, 0.1161	29.67%, 0.1120	29.71%, 0.1083	29.31%, 0.0786
5	Honest Strat-Best-Ranked	30.44%, 0.1553	33.92%, 0.1767	36.05%, 0.1879	37.35%, 0.1925	39.03%, 0.2033	40.73%, 0.2159	52.88%, 0.3052
6	Mean-Thresh Approval	31.75%, 0.1695	35.96%, 0.2000	39.15%, 0.2245	41.65%, 0.2444	43.70%, 0.2613	45.41%, 0.2756	59.28%, 0.4060
7	Strategic Approval=Range	37.25%, 0.2357	41.59%, 0.2728	44.79%, 0.3016	44.26%, 0.3244	49.22%, 0.3429	50.81%, 0.3578	62.41%, 0.4674
8	Honest Top-2	37.60%, 0.2411	42.30%, 0.2829	46.73%, 0.3303	50.67%, 0.3780	54.10%, 0.4238	57.12%, 0.4675	82.67%, 1.0440
9	Honest Plurality	43.24%, 0.3230	50.05%, 0.4069	55.30%, 0.4804	59.51%, 0.5460	62.96%, 0.6046	65.85%, 0.6580	87.43%, 1.2717
10	Best Semi-Strategic (Top-2)	46.44%, 0.3928
11	Magic Best-Of-2	50.00%, 0.4652	60.00%, 0.5988	66.67%, 0.7030	71.43%, 0.7880	75.00%, 0.8594	77.78%, 0.9208	93.55%, 1.4923
12	Strategic PsuBest Ranked	53.10%, 0.5096	56.81%, 0.5481	59.2%, 0.569	60.85%, 0.5806	61.94%, 0.5861	62.74%, 0.5878	68.27%, 0.5836
13	Strategic Top-2	53.12%, 0.5273
14	Strategic Borda	54.33%, 0.5201	58.69%, 0.5716	61.44%, 0.6005	63.42%, 0.6201	64.95%, 0.6350	66.22%, 0.6677	76.41%, 0.7809
15	Strategic Hon-Best-Rnkd	54.70%, 0.5248	59.42%, 0.5835	62.49%, 0.6199	64.73%, 0.6465	66.52%, 0.6686	68.02%, 0.6880	80.30%, 0.9089
16	Strategic Plurality	58.25%, 0.5792	66.11%, 0.7128	71.43%, 0.8170	75.29%, 0.9020	78.21%, 0.9734	80.51%, 1.0349	94.03%, 1.6062
17	Random Winner	75.00%, 1.0294	80.00%, 1.1623	83.33%, 1.2672	85.71%, 1.3522	87.50%, 1.4236	88.89%, 1.4850	96.77%, 2.0565
18	Magic Worst	100.0%, 2.0588	100.0%, 2.3259	100.0%, 2.5344	100.0%, 2.7043	100.0%, 2.8472	100.0%, 2.9700	100.0%, 4.1129

NOTES ABOUT THE TABLE:

How to make the table-entries rigorous via a mechanical computation: It would be possible in principle – we have not done it, but a sufficiently determined reader could, at least for small Ns – to employ rigorous numerical integration techniques to replace every number in table 1 with arbitrarily tight rigorous upper and lower bounds. That is because all of our integrals are integrating standard Gaussians times linear functions of the coordinates, over certain polytopal-cone regions. Those integrands are well behaved because they have both rapidly decreasing tails and also easily-computed |derivative| bounds. Therefore numerical integration via summation over lattice points inside a big ball, will produce an answer whose error is boundable using (a) the derivative bounds, (b) tail mass-bound for integral outside the ball, (c) by using "interval arithmetic" to cover errors caused by inexact arithmetic. The second integration to find regrets and wrong-winner probabilities employs the results of the first integration (to find correlations, etc.) inside its integrand. Therefore errors in the earlier results will contaminate the later ones, but since these can be incorporated inside the interval arithmetic they will not ruin anything.

By semi-strategic voting in a 4-candidate election, we mean this. Before the election begins, two candidates are (randomly) named by God as the two "frontrunners." The voters now vote the maximum-allowed vote for whichever of these two they prefer and the minimum-allowed vote for the other. (This is intended to maximize the vote's "impact" on the frontrunner-vs-frontrunner race, the only pairoff that, they believe, probably matters.) Then, they vote honestly about the two remaining candidates (i.e. rank them honestly relative to each other, and subject to the constraint that the two frontrunners are artificially ranked top and bottom). With semi-strategic voters, we shall prove in theorem 3 that the best rank-order-ballot voting system for 4-candidate RNEM (V→∞)-voter elections, is "top-2" voting, i.e. the weighted positional system with weights (+1, +1, -1, -1).

The notion of strategic voting we shall examine in this paper is the moving-average strategy: Before the election begins, "God" randomly chooses, and publicizes, an ordering of the candidates from presumably-most-likely-to-win to presumably-least. (The two most-likely are the two "frontrunners.") Each voter now votes the maximum-allowed vote for whichever of the frontrunners she prefers and the minimum-allowed for the other. She then proceeds down the rest of God's ordering in decreasing-presumed-likelihood order, giving each candidate either the maximum- or minimum-allowed vote (subject to the constraint that the vote still be legal according to the rules of the voting system, given all her preceding decisions) depending on whether its utility (for that voter) exceeds or is less than the average-utility among the preceding candidates.

Notation: Many rows in table 1 consider the effect of mis-using our optimal voting systems. For example, if we find the best rank-order voting system for strategic voters, we could consider what happens if non-strategic, i.e. honest, rank-order voters employ that method instead – see line 5 of the table. Or conversely, what happens if strategic voters try to employ the rank-order voting system (which had been stated and derived in section 6 of Part I) that is best for honest voters? – see line 15.

Semi-strategic voting, since it is partially honest, might intuitively be expected to yield regret lower than fully-strategic voting. Line 10 and 12 of table 1 confirm this conjecture in 4-candidate RNEM (V→∞)-voter elections – but note from line 7 that even the best semi-strategic rank-order voting system still has worse regret than fully-strategic 4-candidate range voting (which in turn has worse regret than honest range voting).

FIRST MAIN RESULT – THE SUPERIORITY OF RANGE VOTING:
Table 1 (and our other, unprinted, calculations of the same ilk when 10≤N≤30) indicate that for each number N of candidates with 3≤N≤31 range voting is superior (i.e. has less regret, in the V→∞ voter RNEM) to the best rank-order system both for fully-honest and for fully-strategic voters – at least, if the distinction between "best" and "pseudo-best" is ignored, otherwise we still may conclude range's superiority versus the best weighted-positional system.

Honest-strategic voter mixtures: Based on that one would guess that when 3≤N≤31 range is superior to the best rank-order voting system for any mixture of H-fraction honest and (1-H)-fraction strategic voters(every H with 0≤H≤100%). Part I had confirmed that guess for N=3 candidates. This paper shall confirm it also for every N with 4≤N≤9, but things get a little trickier. The difficulty is that, with N=3 Part I had shown Borda voting to be a best rank-order voting system, regardless of whether we have honest voters, strategic voters, or any mixture. But when 4≤N the (pseudo)best rank-order voting system depends on H.

Further plans: Section 6 explicitly identifies the pseudo-best rank-order voting system for an honest+strategic voter H-mixture. When 4≤N≤9, it never is Borda for any value of H; but Borda does nearly as well as these best systems especially for small N and high H. Substantially greater regret for Borda versus the pseudo-best rank-order system is only seen (for honest voters) at fairly large N, e.g. N=31.

We then plot regret-versus-H for both range voting and the (pseudo)best rank-order voting system to verify range's superiority at a glance [since the range-regret curve lies below the (pseudo)best-rank-order-curve for every H]. The same numerical methods used to compute table 1 are re-used in constructing these plots. Again, by use of interval arithmetic and rigorous numerical integration, one could in principle mechanically convert all these curve-plots to fully-rigorous proofs, but we have not done so.

Finally, section 7 then considers the limit N→∞ of a large number of candidates, and proves theorems about the asymptotic behavior of the regrets of various voting methods.

2. Pseudo-best vs Best

If one tries to find the "best" rank-order voting system for moving-average-type strategic voters, one encounters difficulties. The problem is that the voting system can know (or deduce from the strategic ballots with very high probability when V→∞ of making a correct deduction) "God's order." Once it has acquired this knowledge, the voting system can then attempt to compensate for the strategic ballots by acting in an "unfair" way, i.e. by treating different candidates unequally. For example, God's two "frontrunner" candidates, even when ranked first by a voter, presumably would not be given the maximum number of points by the best weighted-positional voting system – and candidates ranked differently by God would be treated differently by the voting system handling any particular ballot. This kind of voting system seems unacceptable in practice because it assumes voters are dishonest/strategic. Well, of course, that was the whole point– but what if some voter actually is honest? In that case the voting system often would artificially re-rank her top-ranked candidate non-top, "refusing to accept" and "altering" her honest vote because "it knows she is lying"! This kind of behavior simply does not seem politically acceptable.

For this reason, it seems better to ask for "pseudo-best" rank-order voting systems. These minimize Bayesian Regret under the assumption that the voters (1) use moving-average strategy, (2) each voter does that under an independent and random notion of what "God's order" is. Pseudo-best voting systems treat all candidates equally.

However, when we actually evaluate BR for a pseudo-best voting system, we shall employ the usual assumption that there is only one "God's order" and all the voters use it as the basis for their strategy. Note that the assumptions used when optimizing the voting system design and the assumptions used when evaluating that design differ. This is the whole point of "pseudobest" versus "best." I admit it is somewhat artificial but I did not see any better way to proceed.

The best versus pseudo-best issue did not arise in Part I because the best and pseudo-best rank-order voting systems in 3-candidate elections are the same (both Borda). However when we have N≥4 candidates as in the present paper, they differ. Section 3 explicitly identifies the pseudo-best rank-order voting system for moving-average-type strategic voters in N-candidate RNEM elections.

More detailed look: The truly best voting system for strategic voters would have a matrix M of weights such that a candidate would receive score M_ij if a voter ranked him ith while God's ordering ranked him jth. It would be possible in principle (but we shall not do it) to compute the optimum matrix – in which M_ij was the expected utility of the candidate ranked ith by our (assumed strategic) voter and jth by God – and then to compute the Bayesian Regret of the resulting voting system using the correlation-based technology from Part I.

I consider such matrix-voting systems morally unacceptable: the winner ought to depend entirely on the votes and not at all on a priori perceptions from "God" of likelihoods of winning. But even if we were to accept this kind of voting system, we then would encounter a different problem as soon as we tried to use it: how would we decide what "God's order" was to be? The different candidates doubtlessly would argue and sue each other and the election authority endlessly to try to manipulate their position in it.

If we demand that the voting system employ absolutely no information about "God's order" we presumably should have it act as though all God-orderings are exactly equally likely. Unfortunately the voting system, in its quest to optimize societal utility, could try to "outwit" such restrictions by "deducing" God's ordering from the votes themselves, then using the full matrix M. If we forbid that and demand that it really act as though all God-orderings are exactly equally likely, then the optimal (as-restricted) voting system for strategic voters – the pseudo-best voting system – must be a weighted positional system with weights W_i∝∑_jM_ij.

Equivalently, we could simply demand a weighted positional voting system, in which case the best weights for strategic voters (maximizing societal utility) will yield the (genuinely) best weighted positional voting system. So, to summarize the outcome of the preceding argumentation as a "theorem" (albeit that may be a misnomer since it is somewhat in the nature of a "definition"):

THEOREM 1 (PseudoBest Rank Order=Best Weighted Positional): The best weighted positional voting system (i.e. minimizing Bayesian Regret for strategic voters under some model of strategic voter behavior) is the same thing as the pseudo-best rank-order-ballot voting system.

Before the reader dances off into the sunset feeling that the "pseudobest" definition has evaded all problems... there can still be one more problem: With some voter strategic behaviors, the "best" weighted positional system could actually have greater weights for the second-ranked candidate than the top-ranked one! In that case, the voters of course would no longer use that strategic behavior in reality – they'd e.g. switch to ranking their true-favorite "second." The problem here is not that the "best weighted positional" = "pseudobest" definition is wrong; it is that whatever underlying voter strategic behavior model was used, was unrealistic. We shall encounter this exact difficulty in section 3 and propose a cure involving averaging weights in increasing-weight blocks.

3. Best rank-order voting system for strategic voters (especially using "moving-average" strategy)

We begin by considering a very wide class of voter "strategies" – not just the ones Part I had been considering based on the assumption by voters that everybody besides two known "frontrunner" candidates had negligible winning chances. Instead allow voters to use any function F to determine their votes from candidate-utilities and pre-published information about the candidates and their estimated election chances. Call a voting system fair if it treats all candidates and voters equally, i.e. is symmetric under reordering of either. (Previous authors have called fair voting methods "anonymous and neutral.") We shall only consider fair voting systems in the rest of this paper.

UNDERLYING PRINCIPLE (optimality of weighted positional under very wide class of voter strategies):
It should be obvious to any reader who has understood the logical flows in Part I (see especially its sections 1 and 6) that the optimum (regret minimizing) fair rank-order voting method in the RNEM with V→∞ voters is a weighted positional system in which the weight for the kth-ranked candidate is the expected utility of that candidate for a random voter under the RNEM, conditioned on F producing a rank-ordering ranking him kth. These weights are just a certain set of constants which can be computed once and for all given the definition F of voter strategy.

Remark: In consequence, I would expect Approval voting to still provably be superior to every rank-order voting system, for most reasonable strategy notions, for any mixture of honest and strategic voters, in 3-candidate RNEM elections. (However, we are intentionally leaving that somewhat vague; the reader will have to check it for any particular strategy notion.)

We now drop total generality to focus on the moving-average strategy.

Incidental Observation (last-ranked's utility almost irrelevant): In any rank-order-ballot voting system, voters employing the moving-average strategy will produce a rank for God's last-numbered candidate which is, under the RNEM, uncorrelated with and independent of the utility of that candidate (since this vote is determined purely by the utilities of the other N-1 candidates)! More difficulties with correlation matrices will be discussed next section.

A useful sequence Q_k: To identify the best fair rank-order voting system for this kind of strategic voter, a useful sequence of quantities to know are Q_k, the expected utility of a candidate, conditioned on a voter regarding him as having greater utility than the average-utility of k-1 given other candidates. I.e, Q_k is the expected value of a standard normal deviate conditioned on the fact it is greater than the average of k-1 other independent such deviates. If 2≤k this is, as a k-dimensional integral over a halfspace,

Q_k = 2∫∫…∫_{(k-1)x_k≥(x₁+…+x_k-1)} x_kP(x₁)P(x₂)…P(x_k) dx₁dx₂…dx_k.

where P(x)=(2π)^-1/2exp(-x²/2) denotes the standard normal density function. We can evaluate the integral immediately by rotating the coordinate system so that one of the axes becomes perpendicular to the hyperplane

(k-1)x_k=x₁+…+x_k-1.

In that case the integrals over all the other k-1 directions can be replaced by just the value at the origin, so the k-dimensional integral becomes 1-dimensional, with value

Q_k = 2[(k-1)/k]^1/2∫_0≤x<∞xP(x)dx = [2(k-1)/(πk)]^1/2.

THEOREM 2 (Best rank-order systems for strategic voters): For V voters employing the moving-average strategy, the best (i.e. least expected regret) fair rank-order voting system under the RNEM in the V→∞ limit is, for 2≤N<∞, the weighted-positional system whose weights are in decreasing order S_J^(N) for 1≤J≤N. We have S₁^(N)=Q₂ while for 2≤J≤⌊N/2⌋ they are given by this formula

S_J^(N) = [T_J^(N) - T_N+1-J^(N)] / [B_J^(N) + B_N+1-J^(N)] where T_J^(N) = ∑_J≤k<N Q_k+12^-kbinomial(k-2, J-2) , B_J^(N) = ∑_J≤k<N 2^-kbinomial(k-2, J-2)

and for J≥⌈N/2⌉ we can get the weights from the negation symmetry S_J^(N)+S_N+1-J^(N)=0. Here binomial(b,a)=b!/[a!·(b-a)!] where 0!=1.

CAVEAT: However, when N≥6 these weights S_J^(N) need to be "adjusted" to cure the "disease" that some of them increase with J, see text below.

REMARK: The sum-based formula also gives the right answer for S₁^(N) provided binomial(a,b) with negative a or b are interpreted in the correct limiting sense.

Proof (of Theorem as supplemented by Caveat): From linearity of expectations we have that ∑_1≤J≤NS_J^(N)=0. From negation symmetry we have S_J^(N)+S_N+1-J^(N)=0 for 1≤J≤N. From the first step in the definition of the moving-average strategy, we have S₁^(N)=Q₂. Now the remaining weights not yet determined by these relations are S_J^(N) for 2≤J≤⌊N/2⌋. For these we argue as follows. Why would our strategic voter rank a candidate Jth? If she ranked all except i (i≥0) of the J-2+i candidates numbered m=3,...,J+i (where m numbers them in "God's order") maximum, then also ranked God's (J+i+1)th-numbered candidate maximum, then that (J+i+1)th-numbered candidate would be ranked Jth in the voter's strategic rank-ordering and would have expected utility Q_J+i+1, and this event would occur with probability proportional to 2^2-J-ibinomial(J-2+i, i). The powers of 2 come from the fact that each question of the form "does candidate X's utility lie above or below the average of the preceding candidates?" is an unbiased coin flip. The binomials come from the fact that the probability that N coin flips yield exactly Z heads is 2^-Nbinomial(N,Z). Now use binomial(b,b-a)=binomial(b,a) and let k=J+i to get the claimed formula for T_J^(N).

It is also possible for Jth rankings to arise via voters who rank that candidate minimum, in which case the same formula arises "thinking negated" where the Jth-ranking from top corresponds to the N+1-Jth ranking from bottom. Between these we have accounted for every possibility. (The denominator just comes from normalizing the probability distribution; and since we may multiply both the numerator and denominator by the same arbitrary constant, we use that to simplify the exponents in the power of 2.)

This causes the best voting system for strategic voters in 2-candidate RNEM elections to be weighted positional with weights (Q₂, -Q₂), which when rescaled by √π is (+1, -1), i.e. simple plurality voting. Similarly the best voting system for strategic voters in 3-candidate RNEM elections has weights (Q₂, 0, -Q₂), which when rescaled by √π is (+1, 0, -1), i.e. Borda voting.

We already knew both these facts from Part I, but for 4-candidate RNEM elections we move into new territory with the "root3 voting system." Its weights [rescaled by (3π)^1/2] are (√3, 1, -1, -√3).

Incidentally, note that root3 voting is the first example (N=4) where the best weighted-positional voting systems for honest and for strategic voters differ.

One can now continue on trying to derive the best rank-order voting systems for strategic RNEM voters for N-candidate elections for N=5,6,7,… We always get a weighted positional system, whose weights are certain rational linear combinations of the Q_j. When N=5 they are [rescaled by 8(3π)^1/2]

(8√3, 8+3√2, 0, -8-3√2, -8√3)

However, a funny thing happens when N=6 (and presumably for all N≥6): we find that S₂^(N)>S₁^(N)! That would mean that a voter would be better off ranking her favorite frontrunner second rather than first... contradicting the validity and whole underlying rationale of the "best rank-order voting system for strategic voters." Presumably the right cure for this is to replace any increasing block of weights by their averages – albeit when N gets larger that just leads to further diseases, see table 2, which require more curing. With the cures suggested there – namely that each block of B non-decreasing weights needs to be replaced by B repetitions of its mean, and this adjustment needs to be re-done until the weights form a non-increasing sequence – the resulting voting system will have the property under the RNEM that a candidate who receives score X from a strategic voter, will, in fact, then have expected utility X. (This iterative adjustment process must terminate/converge since each adjustment decreases the sum of the squares of the weights, hence the process cannot "loop forever.") That property justifies our claim that the voting system is "optimal" for this kind of voter – it elects the candidate with the greatest expected (conditioned on the votes) utility. QED.

Must "best voting methods for strategic voters" exist? We have seen that they exist in the RNEM for moving-average strategy with N≥2 candidates and V→∞ voters. But it is not clear to me that they must exist in general. The problem is this. Our "underlying principle" makes it clear that for any F defining the strategic behavior, a unique best voting system will exist under the V→∞ RNEM, but it might happen that that voting system is such that F no longer constitutes "sensible strategy." One could, of course, then alter F to make it "more sensible" and then see what voting system results, and then alter F again, and so on. The problem is: this iterative process might enter an infinite loop and never terminate. So my current answer to this question is "sometimes, but perhaps not always."

More general (albeit perhaps mysterious) remark: it is well known that there is no such thing as "best strategy" if one attempts to generalize Von Neumann "game theory" to games with more than 2 players. So the whole question does not really make sense. "Nash equilibrium" is an attempt to salvage game theory which often is useful, but for the purposes of voting theorists in large elections, is extremely useless. [For example, an election in which every voter votes for the agreed-by-all-to-be-worst candidate "Hitler," who wins... is a "Nash equilibrium."]

This explains why we are always focusing on some specific voter behavior F as a strategy (e.g. the moving-average strategy) instead of enquiring what is "the best strategy" – there is no such thing.

**Table 2:** Weights defining best rank-order voting system for strategic voters in N-candidate RNEM elections, computed from theorem 2. Scaled so that the max and min weights are ±π^-1/2 so that the weights give expected utility of a candidate conditioned on strategic voter ranking him kth. (Should be accurate to 0.51 units in the last decimal place.) Certain blocks of weights, shown underlined, unfortunately increase. The correct cure for this disease is to replace the entire block by repetitions of its mean, but then additional diseases are encountered when N=31 involving the italicized weights too. The right cure then is to replace both the underlined *and* italicized weights by repetitions of their common mean (last column). Call the resulting *adjusted* weights "S_k^(N)." Then observe from them that the best rank-order voting systems for 31, 100, and 316 candidates with strategic voters *greatly resemble approval voting.*
N	S_k^(N) for k=1,2,…N (Blocks underlined/italicized)	Block-mean
2	0.564189583547756, -0.564189583547756 (Plain plurality)
3	0.564189583547756, 0, -0.564189583547756 (Borda)
4	0.564189583547756, 0.325735007935280, -0.325735007935280, -0.564189583547756 (Root3 voting)
5	0.564189583547756, 0.498482082670948, 0, sym
6	0.564189583547756, 0.587688288478586, 0.261953280543306, sym	0.575938936013171
7	0.564189583547756, 0.633211139753256, 0.442205188900285, 0, sym	0.598700361650506
8	0.564189583547756, 0.656295438618553, 0.556980936770844, 0.226420506087571, sym	0.610242511083154
9	0.564189583547756, 0.667957199011994, 0.626712277209905, 0.399505055925336, 0, sym	0.616073391279875
31	0.564189583547756, 0.679773410167051, 0.721194190473972, 0.741678440464367, 0.753697372447156, 0.761484854452681, 0.766651755249552, 0.769337398026234, 0.767902422857322, 0.757825643105279, 0.730540639763292, 0.673615376561153, 0.573822412896530, 0.423242673970826, 0.225831515802532, 0, sym	0.728570519141333
100	S_k⁽¹⁰⁰⁾ are identical for k=1..39, then decrease for k=40..50.	0.772901887476515
316	S_k⁽³¹⁶⁾ are identical for k=1..135, then decrease for k=136..158.	0.788931905605522

NUMERICAL CONFIRMATION: Each line in table 2, for 2≤N≤9, was computed by instead performing 462 million Monte Carlo experiments. (Each experiment was to create N independent standard-normal deviates, compute the moving-average-strategy rank-order vote based on those N utilities, and keep track of the mean utility of the candidate ranked Jth in these votes for each J with 1≤J≤N. The thus-obtained Monte-Carlo results, rounded to 4 decimal places, were found to agree with the exact S_J^(N) formula in every case to within an error of ±1 in the final decimal place.

4. Some matrices, and problems with them

Part I had already given closed formulas for the correlation matrices for, e.g, every honest weighted-positional rank-order voting system. We now give a few more (without proof; in most cases the proofs are easy – or at least easy consequences of integrals tabulated in appendix B of Part I – but in other cases the computation is more difficult)

For 4-candidate semi-strategic "top 2" voting, the "C-matrix" (explained in Part I) can be computed in closed form. It is "doubly rank-deficient"

    [ 0.5642 -0.5642  0.0000  0.0000  0.6028  0.0000 ]
C ≈ [-0.5642  0.5642  0.0000  0.0000 -0.6028  0.0000 ]
    [ 0.0000  0.0000  0.5642 -0.5642  0.0000  0.6028 ]
    [ 0.0000  0.0000 -0.5642  0.5642  0.0000 -0.6028 ]

where the exact formulas for the numbers given as decimals are 0.5642≈π^-1/2 and 0.6028≈(1-2/π)^1/2. We have

      [ 1 -1  0  0 ]
CC^T = [-1  1  0  0 ]
      [ 0  0  1 -1 ]
      [ 0  0 -1  1 ]

and all vote-variances are 1.

THEOREM 3 (Best Semi-Strategic Rank-Order=Top-2): The best (regret-minimizing) semi-strategic voting system, under the RNEM with V→∞ voters, in 4-candidate elections, is "top-2" voting, i.e. the weighted positional system with weights proportional to (+1, +1, -1, -1).

Proof: By our underlying principle from section 2, it has to be weighted positional with weights proportional to expected utilities (in the eyes of any given voter) conditioned on that voter's vote. Because of our postulate/definition that "God" chooses the two frontrunners (and two non-frontrunners) randomly in a way unrelated to all voters' utilities, and because of our definition of "semi-strategic" voting, the two frontrunners and two non-frontrunners are exactly symmetric, so their weights must be the same, and by translation and scaling we can make them be (+1,-1). QED.

For 4-candidate Strategic Approval=Range we have rank-deficiency 1 and

    [ 0.5642 -0.5642  0.0000  0.0000  0.6028  0.0000  0.0000 ]
C ≈ [-0.5642  0.5642  0.0000  0.0000 -0.6028  0.0000  0.0000 ]
    [-0.3257 -0.3257  0.6515  0.0000  0.0000  0.6028  0.0000 ]
    [-0.2303 -0.2303 -0.2303  0.6910  0.0000  0.0000  0.6028 ]

      [ 1 -1  0  0 ]
CC^T = [-1  1  0  0 ]
      [ 0  0  1  0 ]
      [ 0  0  0  1 ]

All vote-variances are 1, and the additional exact formulas are 0.6515≈2(3π)^-1/2, 0.6910≈(2π/3)^-1/2, 0.3257≈(3π)^-1/2, 0.2303≈(6π)^-1/2. For 5-candidate Strategic Approval=Range we have rank-deficiency 1 and

    [ 0.5642 -0.5642  0.0000  0.0000  0.0000  0.6028  0.0000  0.0000  0.0000 ]
C ≈ [-0.5642  0.5642  0.0000  0.0000  0.0000 -0.6028  0.0000  0.0000  0.0000 ]
    [-0.3257 -0.3257  0.6515  0.0000  0.0000  0.0000  0.6028  0.0000  0.0000 ]
    [-0.2303 -0.2303 -0.2303  0.6910  0.0000  0.0000  0.0000  0.6028  0.0000 ]
    [-0.1785 -0.1784 -0.1784 -0.1784  0.7136  0.0000 -0.0001  0.0000  0.6028 ]

      [ 1 -1  0  0  0 ]
      [-1  1  0  0  0 ]
CC^T = [ 0  0  1  0  0 ]
      [ 0  0  0  1  0 ]
      [ 0  0  0  0  1 ]

All vote-variances are 1, and the additional exact formulas are 0.1784≈(10π)^-1/2 and 0.7136≈(5π/8)^-1/2.

REMARK: Note that in all the cases above, det(CC^T)=0 so that the vote-vote correlation matrix CC^T is not positive-definite; it is merely semidefinite. A zero-eigenvalue eigenvector is (1,1,0,...,0), but this is not bothersome since strategic voters always vote (x,-x) in the first two coordinates and hence never "activate" this eigenvector.

A much more annoying indefiniteness problem occurs with 5-candidate psubest-strategic-rank-order voting,

      [ 1.0000 -1.0000  0.0000  0.0000  0.0000 ]
      [-1.0000  1.0000  0.0000  0.0000  0.0000 ]
CC^T = [ 0.0000  0.0000  1.0000 -0.7071 -0.7071 ]
      [ 0.0000  0.0000 -0.7071  1.0000  0.0000 ]
      [ 0.0000  0.0000 -0.7071  0.0000  1.0000 ]

where the exact formula for 0.7071 is 2^-1/2. This also has det(CC^T)=0, arising from the unique eigenvector with zero eigenvalue, which is e=(0, 0, √2, 1, 1). The reason this is much more annoying is that voters always vote in ways non-orthogonal to this eigenvector, and hence with vote-utility correlation matrices generated by approximate Monte-Carlo methods, there is a decent probability that the "correlation based procedure" from part I will try to Cholesky-factor a matrix with a negative eigenvalue – which, of course, is impossible.

Experiments with the Monte-Carlo code showed, however, that

Mixing in even a small percentage of honest voters always seems to cure the indefiniteness problem.
With 100% strategic voters, we (usually, depending on the random seed) can "adjust" the vote-utility correlation matrix by a tiny perturbation of order 10^-9 in each entry (i.e. well below the levels of random noise from Monte Carlo) to make it become positive definite.

This same problem arose in other cases too, but the same fix has always worked in our experience so far.

5. Some remarks on some other voting systems

Alan Gibbard, in his proof of his version of the Gibbard-Satterthwaite theorem, focused attention on the following two randomized voting methods, which, he showed, were the unique two "strategy-proof" rank-order voting systems (and probabilistic mixtures of these two):

Random ballot: A ballot is selected at random, and the winner is the candidate it ranks top.
Random pair: A pair of candidates (call them A and B) are selected at random. All other candidates besides A & B are eliminated. The winner is whoever wins the A versus B simple-majority vote based on the ballots.

In other words, strategic and honest voting, in these two lottery-voting systems, are the same thing.

THEOREM 4: Under the RNEM with V→∞ voters, "random ballot" has the same regret as "random winner." Also, "random pair" has the same regret as "strategic plurality voting."

Proof: is immediate from the definition of the RNEM and our model of strategic plurality voting. QED.

In the real world, our model of the two "frontrunners" being "selected randomly by God" is not completely correct. I suspect, indeed, that the two frontrunners have better quality on average than random candidates. (But this certainly is not always the case. For example, two US presidents often selected by historians as the "worst" are George W. Bush and Warren G. Harding. It is hard to believe that their respective third-party opponents Ralph Nader and Eugene V. Debs would have made worse presidents than them. Debs was actually a political prisoner at the time of the election but Harding, once elected, pardoned him.) If so, then random pair in the real world presumably would have worse regret than strategic plurality voting.

6. (Pseudo)Best rank-order voting system for strategic+honest voter mixture

THEOREM 5 (Best rank-order system for honest+strat voter mix): The pseudo-best rank-order voting system in N-candidate elections in the V→∞ voter RNEM with a mixture of H-fraction honest and (1-H)-fraction strategic, voters, is weighted positional with weights

H·G_k^(N)+(1-H)S_k^(N) where G_k^(N) from Part I's appendix A are the optimum weights for honest voters, and S_k^(N) from our theorem 2 are the optimum weights for strategic voters.

The proof is immediate from the underlying principle from section 3.

With the optimum voting system known, we then can employ the known methodology from section 8 of Part I to evaluate its Bayesian regret for an H:1-H honest+strategic voter mixture. When I did so using our usual numerical-integration-based "correlation-based procedure," the result was the following plots. The other curve plotted is the regret for range voting using the same H:1-H voter mixture – the strategic range voters are using moving-average strategy and thus are effectively using approval voting.

Interesting aside: The best rank-order system is only a little better than Borda when N∈{4,5}.

As you can see immediately from the plots (and the case N=3 is covered in Part I)

CLAIM: Range voting is superior to every rank-order voting system (even the best) in the V→∞ voter RNEM, for N-candidate elections with 3≤N≤9, at any voter-honesty fraction H with 0≤H≤100%, provided we ignore the distinction between best and pseudobest. (That is, more precisely: the regret of range voting is less than the regret of the pseudo-best rank-order system at every H, where the "pseudo-best" system minimizes regret in a model where the strategic voters each have independent uniform-random beliefs about "God's likelihood order," but then actual regret is evaluated in a setting where there is only one, publicly known, such order.)

Figure 1. Regrets versus electorate honesty-percentage for several voting methods with V→∞ voters under the RNEM. Accuracy should be substantially better than the widths of the curves and dots. The curve labeled "best weighted-positional" voting system, also (by theorem 1) is the "pseudo-best rank-order voting system." The best and pseudo-best rank-order systems coincide in the 3-candidate case but not for more than 3 candidates as here.

7. Large numbers N of candidates

What about 7, 8, 9,..., candidate elections? It would be nice to have a theorem proclaiming range voting's superiority versus the best rank-order system for every number N≥3 of candidates. That is a good question. It is often possible to prove ∫A(x)dx>∫B(x)dx despite the absence of closed formulas for either integral. We know what the best rank-order system is for honest voters (see Part I's section 6) and we know the pseudo-best system for strategic voters (here) so that we know what "A(x)" and "B(x)" are; the difficulty is that our integrals are multidimensional. An interesting task for future authors, then, would be to develop suitable "comparison theorems" enabling such proofs.

Since I currently lack such tools, it is useful to consider the limit of N→∞ candidates. The idea is that if we can show something for both N≤31 and for N→∞, that makes it plausible that it works for all N.

THEOREM 6 (Range voting's regret is small for large #candidates): For honest range voting in the RNEM with V voters, V→∞, as the number of candidates N becomes large, the regret is O(1/lnN)^1/2 times the regret of "random winner."

Proof sketch: Use the results about the variances associated with the G_k^(N) in Part I's appendix A to see that all the "normalization factors" for range voters will be equal up to relative error O(1/lnN) (except for an arbitrarily small fraction of voters, which hence will have an arbitrarily small effect on total utilities). We can now use, e.g, Chebyshev's bound on probability densities with bounded variance and more strongly, Hall's work cited in that appendix showing exponential tail bounds; and also we can use the fact that all voters are independent and have independent utility-versus-range-vote errors, combined with the Chernoff bound, to see that, with probability→1, all the range-vote totals will be the same as the corresponding true-utility-totals, up to additive errors whose root mean square has order ±V^1/2(lnN)^-1/2. Meanwhile the maximum among the N true utility totals will almost surely be of order V^1/2(lnN)^+1/2. Thus (now again using facts about the G_k^(N) from Part I's appendix A) even the maximum among the errors in approximation of utilities by vote totals will still be far smaller – by a factor of order (lnN)^1/2 at least – than the maximum utility, QED.

In contrast,

THEOREM 7 (Other regrets are large for large #candidates): For each of these weighted positional voting systems: plurality, antiPlurality, Borda, and median-as-threshold approval voting (or, indeed, any weighted positional system in which at least a positive constant fraction of the system-defining weights differ by at least a positive constant fraction of full scale from the optimum weights given in Part I's section 6) in the RNEM with V honest voters, V→∞, as the number of candidates N becomes large, the regret tends to a positive constant times the regret of "random winner."

Proof sketch: Each vote will, as an approximation to utility, have additive |error| (on average for each voter) a constant fraction of full scale. All these errors are independent, leading via the central limit theorem when V→∞ to the vote totals having an expected additive |errors| (as an approximation to summed candidate-utilities) of order V^1/2, which (note) is the same typical order as the |utilities| themselves and all these errors are independent in the V→∞ limit. Therefore, if we elect the candidate with maximum vote rather than maximum utility, we will lose utility (i.e. experience regret) of the same order as the maximum utility. QED.

REMARK: I conjecture that the pseudo-best rank-order system (for V→∞ strategic voters under the RNEM) has the same order regret as strategic approval voting and random winner, but with a better constant factor than random winner. This would follow immediately if we could prove that at least a positive constant fraction of the S_J^(N) are equal, as empirically appears to be the case, for large N.

REMARK: I conjecture that the best rank-order system (for V→∞ honest voters under the RNEM) has the same order regret as range voting but with a greater constant factor due to its effective artificial uniformization of the intercandidate utility spacings. It easily may be shown that Borda, median-as-threshhold Approval, and Plurality have successively worse constant factors in their regrets in the large-N limit. I also conjecture that strategic approval voting also obeys theorem 7 and with a strictly better constant factor than random winner.

THEOREM 8 (Strategic and Mean-based "honest" Approval asymptotically same): Strategic (moving-average strategy) and honest (mean-based threshhold) have with probability→1 asymptotically the same regrets in the RNEM with V honest voters, V→∞, as the number of candidates N becomes large.

Proof sketch: The idea is that the moving average is essentially the same as the full average (both converge to 0 under the RNEM) for almost all of the N candidates, when N→∞. Hence the approximations of utility-totals by vote-totals will have asymptotically the same distributions for both voting methods. QED.

THEOREM 9 (Strategic Plurality for large #candidates): For strategic plurality voting, in the RNEM with V honest voters, V→∞, as the number of candidates N becomes large, the regret is asymptotic to the regret of random winner. (This is also true for the "random ballot" election method.)

Proof: Consequence of our exact regret formulas and Part I's appendix A – in its notation the expected regret of strategic plurality is G₁^(N)-G₁⁽²⁾ which when N→∞ is asymptotic to the expected regret G₁^(N)-0 of random winner or random ballot. QED.

Another question is: "what about other statistical/political models besides RNEM (such as D-dimensional politics models, etc)?" Can future authors prove analogous results to ours in those models? With the help of some fortuitous miracles we are able to accomplish that in some cases. That will be explained in a new paper "part III."

References:
G.Alefeld & J.Herzberger: Introduction to Interval Computations, Academic Press, 1983.
Alan Gibbard: Manipulation of Schemes that Mix Voting with Chance, Econometrica 41 (1977) 587-600.
R. Hammer, M. Hocks, U. Kulisch, D. Ratz: Numerical Toolbox for Verified Computing I: With Algorithms and Pascal-XSC Programs, Springer-Verlag, Berlin, 1994.
Eldon Hansen & G. William Walster: Global Optimization using Interval Analysis (2nd ed.) Marcel Dekker, New York, 2004.
Nicholas J. Higham: Computing the nearest correlation matrix – a problem from finance, IMA Journal of Numerical Analysis 22,3 (2002) 329-343; also "A Preconditioned Newton Algorithm for the Nearest Correlation Matrix," 2008 preprint available here http://eprints.ma.man.ac.uk/1086/01/covered/MIMS_ep2008_50.pdf.
Ramon E. Moore: Interval Analysis, Prentice Hall, Englewood Cliffs, N.J., 1966.
Ramon E. Moore: Methods and Applications of Interval Analysis, SIAM, Philadelphia PA 1979.
Miodrag S. & Ljiljana D. Petkovic: Complex Interval Arithmetic and Its Applications. Berlin: Wiley, 1998.

Return to main page

Page summarizing all 3 papers in this series