RangeVoting.org - answer to puzzle ?? - Pairwise defeat probabilities

Puzzle #??: Realizability of Alleged Matrix of Pairwise-Defeat Probabilities

Puzzle:
Suppose there are C candidates (numbered 1,2,...,C) in a rank-order-ballot election, and it is claimed, for some C×C matrix Z, that a random voter ranks A above B with probability Z_AB. (Of course, these obey Z_AB+Z_BA=1 and Z_AB≥0.)

Really? Perhaps there simply is no probability distribution on the C! possible rank-order ballots, which yields the alleged pairwise probabilities. Show that some Z are realizable and some Z are not, by giving simple examples of both.
For which matrices Z do probability distributions that realize Z exist, and which Z are unrealizable? Give an algorithm with, given any matrix Z, will decide its realizability – and will find a realization if one exists.
Open: What is the fastest algorithm you can think of for that decision-task?
Open: Devise the most efficient algorithm you can for generating a random C-permutation, such that its output obeys the pairwise probabilities specified by some Z-matrix. (Actually, there are two speeds to consider: the time to produce the first such random permutation, and then the time to produce each additional one from the same probability distribution on demand.)
Show that there exists a positive real constant k such that if |Z_AB-1/2|<k·logC/C for all A,B then Z is realizable. [Easier: show that if C is even and |Z_AB-1/2|<1/(2C-2) for all A,B then Z is realizable and there is a simple random-permutation-generation algorithm showing that.]
In the other direction, show that there exists a positive constant j such that Z_AB=1/2±j·C^-0.499 will, when C→∞ and the ± signs are chosen randomly by (C-1)C/2 independent fair coin tosses, fail to be realizable with probability→1.

The lesson of parts (e) and (f) of this puzzle seems (vaguely speaking) to be that, with random voting, Condorcet cycles become exceedingly likely to exist in C-candidate elections when C→∞, but only weak cycles are likely. But it remains an open problem to close the substantial gap between the bounds in (e) and (f).

Answers

By Warren D. Smith Dec 2009

a. If Z_AB=1/2 for all A,B (the "pure coin toss" case) then plainly this Z is realizable; just use random permutations (all equally likely) as votes.

Also, any matrix Z such that Z_AB=1 (or 0) if A lies "above" B (or not) in some ordering of the C candidates, is realizable; just make all votes be identical to that ordering.

It will follow from the linear program in (b) – or more simply just by taking a "probabilistic mixture" – that any Z-matrix which is a convex linear combination of Z-matrices each known to be realizable, is realizable.

But any matrix Z containing a "3-cycle of strength>2/3," i.e. if min(Z₁₂,Z₂₃,Z₃₁)>2/3, is unrealizable.

Actually, more generally, you cannot have a K-cycle of strength>(K-1)/K for any K with 3≤K≤C.

One may show that in the case of a 3-cycle a⊃b⊃c⊃a amongst 4 candidates {a,b,c,d} by explicitly writing down the linear program we shall describe below, and solving it. The linear program is (using 4-letter variable names) to maximize s subject to abcd+abdc+acbd+acdb+adbc+adcb+ cabd+cadb+cdab+dabc+dacb+dcab ≥ s+ cdba+cbda+cbad+dcba+dbca+dbac+ bacd+badc+bcda+bcad+bdac+bdca,
bacd+badc+bcda+bcad+bdac+bdca+ abcd+abdc+adbc+dabc+dbca+dbac ≥ s+ cabd+cadb+cbda+cbad+cdab+cdba+ acbd+acdb+adcb+dacb+dcab+dcba,
cabd+cadb+cbda+cbad+cdab+cdba+ bcda+bcad+bdca+dcab+dcba+dbca ≥ s+ abcd+abdc+acbd+acdb+adbc+adcb+ bacd+badc+bdac+dabc+dacb+dbac,
abcd+abdc+acbd+acdb+adbc+adcb+ bacd+badc+bcda+bcad+bdac+bdca+ cabd+cadb+cbda+cbad+cdab+cdba+ dabc+dacb+dbca+dbac+dcab+dcba=1
and all 25 variables nonnegative; an optimum solution is s=dbca=cabd=abcd=1/3, and all other variables 0. More generally the maximum min-strength K-cycle arises by making all permutations that order the K cycling candidates in a cyclic permutation of their original order, have equal probabilities, and all other permutations have zero probability. It is not hard to see this is a local-optimum to all tiny perturbations, and since for linear programs local optima are global optima, the proof is complete.

We remark that the maximum min-strength of any K-cycle, for all K combined, can be found in "polynomial time" by performing K steps of "breadth first search" in the directed graph with only arcs of strength>s included, and doing outer binary searches on s. We also remark that the existence of majority-preference cycles does not necessarily have anything directly to do with unrealizability, because you can add P times the ordering 1234...C to the mix and (1-P) times C...4321, and as a result you get a pairwise matrix which (if P>>1-P) is highly biased to favor a linear ordering of the candidates – this will "eliminate" all low-strength cycles.

b. Let the probability distribution on C-permutations π be P_π. (There are C! quantities P_π in all.) Matrix Z is realizable if and only if the linear program

P_π ≥ 0     [C! positivity constraints]
∑_πP_π = 1     [one equality constraint]
∑_{π ordering A above B}P_π = Z_AB     [(C-1)C/2 equality constraints]

(where the Z_AB are regarded as known and the P_π's as unknown) has a solution, and any such solution constitutes a realization. Any algorithm for solving linear programs, such as the "simplex method," will do, and indeed will enable finding the "best" realization, namely the one maximizing ∑_πP_πQ_π for any desired provided C!-vector Q of permutation "qualities."

The only problem is that since C! grows very fast with C, this algorithm is slow. It also is a memory-hog.

The above linear program is already in "standard form." It has C! variables and (C-1)C+2 inequality constraints aside from the C! positivities. (An equality x=y is written as two inequalities x≤y and -x≤-y.)

The dual form of this linear program may be found by employing G.B.Dantzig's "duality theorem" for linear programming.

G.B.Dantzig: Linear Programming and Extensions, Princeton University Press 1963.

It has (C-1)C/2+1 variables, call them W and Y_AB with 1≤A<B≤C. It is

W + ∑_1≤A<B≤C (±1 if π orders A above/below B)Y_AB ≥ Q_π [C! inequality constraints]
Find the solution minimizing W + ∑_1≤A<B≤C Y_ABZ_AB.

The dual problem is "bounded" (i.e. the quantity being minimized cannot be made arbitrarily hugely negative) if and only if there is a solution to the primal problem, i.e. if and only if Z is realizable.

c. It is a well known NP-complete problem ("optimum linear ordering," proved NP-complete by R.M.Karp in one of the first NP-completeness papers) to find the "best ordering" of the C coordinates such that ∑_1≤A<B≤CM_AB is maximized, where M is a given C×C matrix. From that it follows that, if all the Q_π are equal, that it is NP-complete even to decide, given a set of dual variables (W and the Y_AB) whether they violate a constraint! This suggests the conjecture that our realizability decision problem is NP-hard, and there is no polynomial time decision-algorithm. On the positive side, it also proves (using either the "ellipsoid algorithm" or "simplex algorithm" with a violated-constraint-finder constructed from an exhaustive searcher) that this problem lies in PSPACE.

Probably, however, there are many interesting theorems (please find them!) showing that various easily-identifiable classes of Z-matrices are or or not realizable, and yielding permutation random-generation algorithms in the former case. We shall find some ourselves in parts (e) and (f).

d. An inefficient (but it works) random generation algorithm is to solve the primal linear program, then generate each permutation π with probability P_π.

e. If the Z-matrix is close enough to the all-coin-toss case then I can prove it is realizable and construct an efficient random generation algorithm. To keep the description simple let us assume C is even. As is well known to those who supervise round robin chess tournaments, there are known ways, for any even C, to split the (C-1)C/2 games among C players into C-1 "rounds" each consisting of C/2 games among disjoint pairs of players. Let one of those rounds be candidate#1 vs candidate#2, 3 vs 4, 5 vs 6, etc. If we simply generate a random permutation which is of the form [12][34][56]... with probability 1/2 and ...[56][34][12] with probability 1/2, where by [AB] is meant "either AB or BA, with probability (Z_AB-1/2)(C-1)+1/2 for the former" then this permutation will have pairwise probabilities (Z_AB-1/2)(C-1)+1/2 for every pair AB playing a "game" in that tournament "round" and exactly 50-50 probability for every other pair. By using a probability mixture of all these C-1 "round" random-generators, each with probability 1/(C-1), we get the correct probability Z_AB for every pair AB (since the entire tournament is the disjoint union of the games in each single round). This works if |Z_AB-1/2|≤1/(2C-2) for all A,B with A≠B. (It will not work otherwise, because some of the "game win" probabilities would have to be negative or greater than 1.)

That solution arose essentially from transforming Stearns' solution to puzzle 28 to make it apply to the present problem. A more powerful solution arises from doing the same transformation of Erdös and Moser's asymptotically-better solution to puzzle 28. This shows every Z-matrix with the property that each entry is within ±O(logC/C) of 1/2, is realizable. It also shows a polynomial-time random-generator exists that demonstrates that Z's realizability – although it remains an open problem to find that generator efficiently (just because something exists, does not mean it is easy to find starting from Z; Erdös and Moser showed their graph-decomposition existed but did not provide an efficient algorithm to find it; does one exist?).

Actually, we can argue for any realizable Z there always is a polytime random generator (which may not be easy to find) because you can just use the tight constraints in the linear program, to get a poly-size set of simple generators such that the desired Z is a probability-mixture of them.

f. One might conjecture, though, that all these results are very weak and Z_AB being within ±0.01 of 1/2 suffices, no matter how large C is, for realizability. Surprisingly, that is not so, and the O(logC/C) bound from part (e) cannot be improved to O(C^-0.5). We shall now prove that.

Let ε be any arbitrarily small positive constant. The proof idea is to consider Z-matrices that arise from using random ± signs in Z_AB=1/2±jC^2ε-0.5 for some sufficiently-large constant j, when 1≤A<B≤C.

We then consider the statistics of random numbers and the LP-dual formulation from part (b) [using all Q_π=1]. Let Y be (-k) times a matrix of the same random signs as were used to construct Z. This will cause the quantity being minimized in the LP-dual formulation, to become unboundedly hugely negative, as we take k→∞. Specifically, the sum in that expression almost surely (meaning, with probability→1 when C→∞) drops proportionally to -kjC^1.5+2ε. One can argue from the statistics of random bits (even considering the "worst" C-permutation π among the C! possibilities) – specifically "Hoeffding's inequality" – that the probability is high (i.e. goes to 1 when C→∞) that none of the C! inequality constraints will be violated, provided we employ W=kC^1.5+ε.

W.D.Smith: Tail bounds for sums of independent random variables, March 2005, available online as paper #85 at /WarrenSmithPages/homepage/works.html.

This value of W rises slowly enough when C→∞ that the quantity being minimized still goes to -∞. Hence with probability→1 the dual LP is unbounded and hence the Z-matrix unrealizable.

Q.E.D.

(Incidentally the same argument works if instead of random ± "coin tosses," we employ random normal "noise.")

Further Remarks: "dice-realizability."

Another interesting related problem is to decide whether the pairwise defeat probabilities Z_AB are realizable by some set of C different K-faced "dice." A "die" means a (perhaps nonuniform) probability distribution on any fixed set of K distinct real values; die A will "defeat" (i.e. roll a higher value than) die B with probability Z_AB. (See puzzle #6 and #67 about "intransitive dice.")

Such dice-realizability happens if and only if the following quadratic matrix equation:

P Q P^T = Z,

supplemented by the following linear equalities and inequalities:

[1,1,1,...,1] P^T = [1,1,1,...,1] and P_XY ≥ 0

has a real-matrix solution P. Here P is a C×K matrix (the "dice probabilities"), P^T is its transpose (which is K×C); Q is the constant K×K matrix such that Q_YX=1 if Y>X, =1/2 if X=Y, and =0 if X>Y; and the two all-1's vectors are K-long and C-long respectively.

This is not the same question because, e.g, it is known that no 3-cycle of strength>(√5-1)/2≈0.61803 is realizable in this dice-sense, whereas a 3-cycle of strength 2/3≈0.66667 is realizable in the original sense. In other words, dice-realizability⇒realizability but the reverse implication does not hold.

Dice-realizability is an algorithmically-decidable question ("existential theory of the reals," A.Tarski & successors). Indeed, it lies in PSPACE (since ETR does). But again, it is a totally open question how difficult this decision problem is. It's pretty unusual to have this relationship between a linear program and a quadratic matrix problem like this.

There seems to be a quite-simple iterative algorithm, however, for solving this provided one starts from a good enough initial guess. The idea is that a small additive perturbation of P causes a linear change in the left hand side of every equation and inequality (since we can neglect the quadratic term). We solve this linear program for the right perturbation, apply it, and continue on. The linear system is actually of special form – a "Sylvester matrix equation" – which means it can be solved especially fast. One could simply try this starting from some heuristically generated initial guesses; if we succeed in converging to a solution, then great, we have a random generator: simply "roll the C dice" defined by our solution to get one value for each candidate; then sort the candidates (breaking ties randomly) by their dice-values.

Return to puzzles

Return to main page