6.2 Latin rectangles and decomposition of doubly stochastic matrices

A r×n Latin rectangle is a r×n matrix such that each of the numbers 1,2,,n occurs once in each row and at most once in each column. If r=n, we call it a Latin square. It was conjectured and proved that a partially completed Latin square with less than n entries can be completed; see [Jukna 2011, Section 5.1] or and [Van Lint and Wilson, Chapter 17]. But we will show that a Latin rectangle can always be extended to a Latin square or equivalently, a partially filled Latin square with an empty row can always be completed.

Theorem 6.11 (Ryser 1951 ).

If r<n, then any given r×n Latin rectangle can be extended to a (r+1)×n Latin rectangle.

Proof.

Let R be an r×n Latin rectangle. Let Sj,j=1,,n be the set of those integers in [n] that do not occur in the jth column. If Sj’s have a SDR sj,j=1,,n then adding s1,,sn as the (r+1)th column we can get a (r+1)×n Latin rectangle.

Sj’s are (nr)-element subsets of [n] and each element belongs to exactly nr sets. Thus, the result follows from Exercise 6.10. ∎

Our next application is the Birkhoff-von Neumann theorem which is of much use in various areas of mathematics such as convex geometry, optimal transport, probability theorey et al.

An n×n non-negative matrix A={aij} is called a doubly stochastic matrix if its row sum and column sums are both 1 i.e., jaij=iaij=1 for all i,j[n]. A permutation matrix is a doubly stochastic matrix with entries being 0,1. Equivalently, every row and column have exactly one 1. A matrix A is a convex combination of A1,,Am if there exist non-negative numbers λi,i=1,,m with iλi=1 and A=iλiAi.

Theorem 6.12 (Birkhoff -Von Neumann theorem).

Every doubly stochastic matrix is a convex combination of permutation matrices.

Proof.

We will prove by induction on the number of (strictly) positive entries. Since every row sum and column sum is 1, there exist at least at n non-negative entries.

Suppose that there are exactly n positive entries then A is a permutation matrix.

Now let A have more than n positive entries and result holds for any smaller number. Define the row-wise non-zero entries as

Si={j:aij>0},i=1,,n.

If |iISi|<|I| (and say |I|=k) then all the non-zero entries in iISi are in at most k1 rows and hence the k=iIjaij(k1), a contradiction. So Si’s satisfy Hall’s condition for SDR. Let jiSi be the SDR.

Define permutation matrix P1={pij} by pij=1 iff j=ji. Define λ1=miniaiji. By definition of Si’s, λ1>0. Consider A1=Aλ1P1. Again by construction of λ1 and P1, A has non-negative entries and in particular, it has at least one less positive entry than A. Furthermore, is row sum and column sums are equal to 1λ1. Thus B=(1λ1)1A1 is a doubly stochastic matrix with strictly less positive entries than A and by induction, the theorem applies to B. So B is a convex combination of permutation matrices P2,,Pm with weights γ2,,γm respectively. Thus A is a convex combination of permutation matrices with weights λ1,(1λ1)γ2,,(1λ1)γm. ∎