## Answer to Puzzle #??: Arithmetic with normal random deviates

Puzzle:
Let X and Y be two independent "normal" random deviates with means and standard deviations μX and σX, and μY and σY, respectively.

1. If you add the two (X+Y) the result is also an exactly-normal deviate with mean μX+μY and standard deviation [(σX)² + (σY)²]^(1/2).
2. If you subtract the two (X-Y) the result is also an exactly-normal deviate with mean μX-μY and standard deviation [(σX)² + (σY)²]^(1/2).
3. If you multiply the two (X·Y) then you do not get an exactly-normal deviate unless σXσY=0. However, if σX<<μX and σY<<μY then you do get an approximately normal deviate. Show its mean is exactly μXμY while its standard deviation is exactly
[(σXμY)² + (σYμX)² + (σXσY)²]^(1/2).
4. If you divide the two (X/Y) then you do not get an exactly-normal deviate unless σY=0 and μY≠0. However, if σX<<μX and σY<<μY then you do get an approximately normal deviate. Show its mean is approximately [1+(σY/μY)²]·μX/μY and indeed it is given asymptotically by any truncation of the infinite series
[1 + δ² + 3·δ⁴ + 3·5·δ⁶ + 3·5·7·δ⁸ + 3·5·7·9·δ¹⁰ + ...]·μX/μY,    δ = σY/μY → 0
(although this series diverges if δ≠0). Meanwhile its standard deviation is approximately
(1/μY)·[(σX)² + (σYμX/μY)² + (σXσY/μY)²]^(1/2)
and indeed its square (i.e. the variance) is given asymptotically by any truncation of the also-divergent series
(σX/μY)² + (μY)⁻²·[(σX)² + (μX)²]·[δ² + 9δ⁴ + 75δ⁶ + 735δ⁸ + 8505δ¹⁰ + ...]
5. How many terms of these asymptotic series should be used, and approximately what |error| will you then get?
6. If f(z) is some generic analytic function, then argue f(X) will (in the limit when σ=σX is made small while μ=μX is held fixed) be approximately normal with mean
μf = f(μ) + [2⁻¹/1!] f''(μ) σ² + [2⁻²/2!] f⁽⁴⁾(μ) σ⁴ + [2⁻³/3!] f⁽⁶⁾(μ) σ⁶ + ...
This series can converge, for nice-enough f(z), in which case it yields an exact result. But it also can diverge for all σ≠0, in which case it is only an asymptotic series. The mean square is
μ(f²) = f(μ)² + [2⁻¹/1!] (f²)''(μ) σ² + [2⁻²/2!] (f²)⁽⁴⁾(μ) σ⁴ + [2⁻³/3!] (f²)⁽⁶⁾(μ) σ⁶ + ...
and then the variance of f(X) is μ(f²) − (μf)².
7. What is (10±1)/(5±1)?

These formulas are extremely useful for working with statistics (meaning "numbers equipped with 1σ error bars"). Surprisingly, the multiplication, division, and f(X) formulas do not seem to be given in statistics books.

By Warren D. Smith Sept 2009

Parts 1 and 2 are well known. The formulas for the means arise from linearity of expectation. The formulas for the variances then also arise from linearity of expectation combined with independence. The underlying reason X+Y is exactly normally distributed (ditto X-Y) is that (i) the Fourier transform of a Gaussian is a Gaussian and (ii) the product of two Gaussians is a Gaussian.
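As a quick numerical sanity check (not part of the original argument), here is a short Monte Carlo sketch in Python; the particular parameter values are arbitrary illustrative choices:

```python
# Monte Carlo check that X+Y has mean muX+muY and std sqrt(sigX²+sigY²).
# Parameter values are arbitrary illustrative choices.
import math
import random

random.seed(1)
muX, sigX = 3.0, 0.7
muY, sigY = -1.0, 0.4
N = 200_000

samples = [random.gauss(muX, sigX) + random.gauss(muY, sigY) for _ in range(N)]
mean = sum(samples) / N
std = math.sqrt(sum((v - mean) ** 2 for v in samples) / N)
# mean ≈ muX + muY = 2.0, std ≈ sqrt(0.49 + 0.16) ≈ 0.806
```

Changing the "+" to "-" checks part 2 the same way (the predicted standard deviation is unchanged).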

For part 3, the formula for the mean arises from

μXY = ∫∫ F((z-μX)/σX)/σX · F((q-μY)/σY)/σY · z·q dz dq = μXμY

[with integration over the whole of the zq plane, and with F(z)=(2π)^(-1/2)·exp(-z²/2)] whereupon the formula for the standard deviation may be verified by computing

XY)2 = ∫∫ F((z-μX)/σX)/σX F((q-μY)/σY)/σY (zq - μAμB)2 dz dq = (σXμY)2 + (σYμX)2 + (σXσY)2.

For part 4, regard μY/Y as approximately normal (with mean ≈ 1 and standard deviation ≈ δ = σY/μY), write X/Y = (1/μY)·X·(μY/Y), and then apply part 3's multiplication formula.

The fact that 1/(1+δ) = 1 - δ + δ² - δ³ + ... is used to deal with small relative deviations δ off the mean (and we pretend the "lower tail of Y is cut off," i.e. that the chance of a deviation of the same order as Y's mean, or larger, is negligible – otherwise the standard deviation of X/Y would, in fact, be infinite).

This technique, plus the fact that ∫-∞<s<∞ F(s) s²ⁿ ds = 1·3·5···(2n-1) if n≥1, allows us to see that the mean of 1/Z, where Z is a normal deviate with mean=1 and standard deviation δ<<1, is given asymptotically when δ→0+ by any truncation of

1 + δ² + 3·δ⁴ + 3·5·δ⁶ + 3·5·7·δ⁸ + 3·5·7·9·δ¹⁰ + ...
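This can be verified numerically: with the far tail cut off as described, a brute-force quadrature of E[1/Z] matches a truncation of the series. A sketch, with δ=0.05 chosen small so that a few terms suffice:

```python
# Compare a midpoint-rule quadrature of E[1/Z], Z ~ Normal(1, delta), with the
# tail cut at |s| <= 10, against the series truncation 1 + δ² + 3δ⁴ + 15δ⁶.
import math

delta = 0.05
a, h = 10.0, 1e-3                  # integration half-width and step size
n = int(2 * a / h)
total = 0.0
for i in range(n):
    s = -a + (i + 0.5) * h         # standard-normal variable; Z = 1 + delta*s
    total += math.exp(-s * s / 2) / math.sqrt(2 * math.pi) / (1 + delta * s) * h

series = 1 + delta ** 2 + 3 * delta ** 4 + 15 * delta ** 6
# total and series agree to roughly 1e-6
```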

Meanwhile, using the fact that

[δ + δ² + δ³ + δ⁴ + ...]² = [δ/(1-δ)]² = δ² + 2δ³ + 3δ⁴ + 4δ⁵ + ...

we find that the variance of 1/Z is given asymptotically by any truncation of

δ² + 9δ⁴ + 75δ⁶ + 735δ⁸ + 8505δ¹⁰ + 114345δ¹² + ... + 3·5·7···(2n-3)·(2n-1)²·δ²ⁿ + ...
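The general-term formula can be checked mechanically (a small helper sketch, not from the original):

```python
# Regenerate the variance-series coefficients from the closed form
# 3·5···(2n-3)·(2n-1)².
def coeff(n):
    prod = 1
    for k in range(1, n - 1):      # odd factors 3, 5, ..., 2n-3 (empty for n <= 2)
        prod *= 2 * k + 1
    return prod * (2 * n - 1) ** 2

print([coeff(n) for n in range(1, 7)])  # → [1, 9, 75, 735, 8505, 114345]
```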

5. I recommend (following a common recommendation) summing these series up to the term of least absolute value, then stopping (or stopping earlier, if the term is small enough that you're satisfied). The first neglected |term| will then presumably be about the same as the additive error. For each of the δ-power series above we'd keep about δ⁻²/2 terms. This (as a result of "Stirling's formula," essentially) would yield least |term| roughly (δ/r)^(c+1/δ²) where r=√e=1.6487... and c=O(1) is a constant you can choose to optimize performance of the estimate. (Different c's for different series.)
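The rule above is easy to mechanize; here is a sketch applying it to the mean-of-1/Z series with δ=0.2 (the helper names are my own):

```python
# Sum an asymptotic series up to its term of least |value|, per the rule above.
import math

def sum_to_least_term(terms):
    total, prev = 0.0, math.inf
    for t in terms:
        if abs(t) >= prev:         # terms have started growing: stop
            return total, abs(t)   # first neglected |term| ~ additive error
        total += t
        prev = abs(t)
    return total, prev

def mean_series_terms(delta, kmax=60):
    """Terms 1, δ², 3δ⁴, 15δ⁶, ... of the mean-of-1/Z series."""
    t = 1.0
    for k in range(kmax):
        yield t
        t *= (2 * k + 1) * delta ** 2

partial, neglected = sum_to_least_term(mean_series_terms(0.2))
# For δ=0.2 about 13 terms get kept; 2*partial ≈ 2.0925 (compare part 7),
# with |error| ~ neglected ≈ 5e-6.
```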

6. Same technique as in part 4 but employing the Taylor series of f(z) about z=μ. The odd-power terms in the series integrate to zero due to odd symmetry. When you multiply an even term f⁽²ⁿ⁾(μ)/(2n)! times the normal-moment factor 1·3·5···(2n-1)·σ²ⁿ you get 2⁻ⁿ/n! · f⁽²ⁿ⁾(μ) σ²ⁿ. That explains the first series. The second series is the same series but based on f(z)² not f(z). If f(z)=exp(z) or cos(z) then both series converge for all σ; but as we saw in part 4, if f(z)=1/z then we get divergence for all σ≠0. [And the fact that mean² + variance = mean-square for a probability distribution is just Pythagoras's theorem, integrated.]
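For f(z)=exp(z) both series can be summed in closed form (each collapses to an exponential series), which gives a concrete convergent check of the part-6 formulas against the exact moments E[e^X]=e^(μ+σ²/2) and E[e^(2X)]=e^(2μ+2σ²). A sketch, with arbitrary μ, σ:

```python
# Check the part-6 series for f(z) = exp(z): here f^(2n)(mu) = exp(mu) and
# (f²)^(2n)(mu) = 4^n·exp(2·mu), so the terms 2^(-n)/n!·f^(2n)(mu)·σ^(2n)
# collapse to ordinary exponential series.
import math

mu, sigma = 0.3, 0.25
mean_f = math.exp(mu) * sum((sigma ** 2 / 2) ** n / math.factorial(n)
                            for n in range(30))
mean_f2 = math.exp(2 * mu) * sum((2 * sigma ** 2) ** n / math.factorial(n)
                                 for n in range(30))
var_f = mean_f2 - mean_f ** 2
# mean_f matches exp(mu + sigma²/2); var_f matches exp(2mu+sigma²)·(exp(sigma²)-1)
```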

7. 2.0925±0.540.    [Using δ=0.2 and δ²=0.04 and truncating as in part 5.]
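The arithmetic behind this answer can be reproduced as follows (a sketch: the mean series is summed to its least term per part 5, and the variance bracket is truncated after the 114345δ¹² term, which lands on ±0.540 — the third decimal depends on exactly where that slowly-shrinking series is cut):

```python
# Reproduce part 7: (10±1)/(5±1) via the part-4 division formulas.
import math

muX, sigX = 10.0, 1.0
muY, sigY = 5.0, 1.0
d2 = (sigY / muY) ** 2             # δ² = 0.04

# Mean: [1 + δ² + 3δ⁴ + ...]·muX/muY, summed up to its least term.
term, total, k = 1.0, 0.0, 0
while True:
    total += term
    k += 1
    nxt = term * (2 * k - 1) * d2  # next odd-double-factorial term
    if nxt >= term:                # terms stopped shrinking: truncate here
        break
    term = nxt
mean = total * muX / muY           # ≈ 2.0925

# Variance: (σX/μY)² + (μY)⁻²·[(σX)² + (μX)²]·[δ² + 9δ⁴ + ... + 114345δ¹²].
bracket = (d2 + 9 * d2 ** 2 + 75 * d2 ** 3 + 735 * d2 ** 4
           + 8505 * d2 ** 5 + 114345 * d2 ** 6)
var = (sigX ** 2 + (sigX ** 2 + muX ** 2) * bracket) / muY ** 2
std = math.sqrt(var)               # ≈ 0.5405
```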