Communications In Mathematical Physics - Volume 262


Author: M. Aizenman (Chief Editor)


Commun. Math. Phys. 262, 1–16 (2006) Digital Object Identifier (DOI) 10.1007/s00220-005-1472-9

Communications in

Mathematical Physics

Double Products and Hypersymplectic Structures on $\mathbb{R}^{4n}$

Adrián Andrada, Isabel G. Dotti

CIEM, FaMAF, Universidad Nacional de Córdoba, Ciudad Universitaria, (5000) Córdoba, Argentina. E-mail: [email protected]; [email protected]

Received: 16 January 2004 / Accepted: 26 July 2005
Published online: 9 December 2005. © Springer-Verlag 2005

Abstract: In this paper we give a procedure to construct hypersymplectic structures on $\mathbb{R}^{4n}$ beginning with affine-symplectic data on $\mathbb{R}^{2n}$. These structures are shown to be invariant by a 3-step nilpotent double Lie group and the resulting metrics are complete and not necessarily flat. Explicit examples of this construction are exhibited.

Both authors were partially supported by CONICET, ANPCyT, SECyT-UNC and ACC (Argentina).

1. Introduction

A hypersymplectic structure on a $4n$-dimensional manifold $M$ is given by $(J, E, g)$, where $J$, $E$ are endomorphisms of the tangent bundle of $M$ such that

\[ J^2 = -1, \qquad E^2 = 1, \qquad JE = -EJ, \]

$g$ is a neutral metric (that is, of signature $(2n, 2n)$) satisfying $g(X, Y) = g(JX, JY) = -g(EX, EY)$ for all vector fields $X, Y$ on $M$, and the associated 2-forms

\[ \omega_1(X, Y) = g(JX, Y), \qquad \omega_2(X, Y) = g(EX, Y), \qquad \omega_3(X, Y) = g(JEX, Y) \]

are closed.

Manifolds carrying a hypersymplectic structure have a rich geometry: the neutral metric is Kähler and Ricci flat and its holonomy group is contained in $Sp(2n, \mathbb{R})$ ([8]). Moreover, the Levi-Civita connection is flat when restricted to the leaves of the canonical foliations associated to the product structure given by $E$ (see [2]). Metrics associated to a hypersymplectic structure are also called neutral hyperkähler (see [10]).

Hypersymplectic structures have significance in string theory. In [14], $N = 2$ superstring theory is considered, showing that the critical dimension of such a string is 4 and that the bosonic part of the $N = 2$ theory corresponds to self-dual metrics of signature $(2, 2)$ (see also [5] and [9]).

The quotient construction proved to be a powerful method to construct symplectic and hyperkähler structures on manifolds. According to [8] this method cannot always be applied in the setting of hypersymplectic structures. Compact complex surfaces with neutral hyperkähler metrics are biholomorphic to either complex tori or primary Kodaira surfaces and both carry non-flat neutral hyperkähler metrics, by results of Kamada (see [10]). In higher dimensions, hypersymplectic structures on a class of compact quotients of 2-step nilpotent Lie groups were exhibited in [6] in the search for neutral Calabi-Yau metrics.

The purpose of this paper is to give a procedure to construct hypersymplectic structures on $\mathbb{R}^{4n}$ with complete and not necessarily flat associated neutral metrics. The idea behind the construction is to consider the canonical flat hypersymplectic structure on $\mathbb{R}^{4n}$ and then translate it by using an appropriate group acting simply and transitively on $\mathbb{R}^{4n}$. This group will be a double Lie group $(\mathbb{R}^{4n}, \mathbb{R}^{2n} \times \{0\}, \{0\} \times \mathbb{R}^{2n})$ constructed from affine data on $\mathbb{R}^{2n}$. The most important feature achieved by this procedure is that the associated neutral metrics obtained will be complete and invariant by a 3-step nilpotent group of isometries (we note that homogeneity does not necessarily imply completeness in the pseudo-Riemannian setting). The degree of nilpotency is related to the flatness of the metric, since we will show that the neutral metric is flat if and only if the group is at most 2-step nilpotent. Moreover, we provide explicit examples of 3-step nilpotent Lie groups admitting compact quotients and carrying invariant complete and non-flat hypersymplectic structures. The induced metric on the associated nilmanifold will be neutral Kähler, complete, non-flat and Ricci flat.

The outline of this paper is as follows.
In §2 we endow $\mathbb{R}^{4n}$ with the structure of a nilpotent Lie group. Starting with a fixed symplectic structure $\omega$ on $\mathbb{R}^{2n}$ which is parallel with respect to a pair of affine structures, we form the associated double Lie group $(\mathbb{R}^{4n}, \mathbb{R}^{2n} \times \{0\}, \{0\} \times \mathbb{R}^{2n})$ and show that it is at most 3-step nilpotent. In §3 we consider canonical symplectic structures on $\mathbb{R}^{4n}$, constructed from the given $\omega$ on $\mathbb{R}^{2n}$, and show they are invariant by the group constructed in §2. We analyze in §4 the geometry of the homogeneous metric obtained by using the double Lie group structure given to $\mathbb{R}^{4n}$ to translate the standard inner product of signature $(2n, 2n)$ on $\mathbb{R}^{2n} \oplus \mathbb{R}^{2n}$. The resulting metric is hypersymplectic (hence Ricci flat), complete and not necessarily flat. Finally, in §5, we exhibit explicitly flat and non-flat complete neutral metrics on $\mathbb{R}^{4n}$ which are also Kähler and Ricci flat. Complete flat hypersymplectic metrics are constructed on 2-step nilpotent Lie groups of dimension $8n$ (§5.1) carrying also a closed special form in the sense of ([6], Sect. 2) and thus, the procedure developed in [6] may be applied to produce non-flat neutral Calabi-Yau metrics on the associated Kodaira manifolds. In §5.2, complete non-flat hypersymplectic metrics are exhibited on $\mathbb{R}^8$, where a particular example is given by

\[
\begin{aligned}
g ={}& \Big(-\tfrac{3}{2}(x'_1 + x'_3)^2 + x'_2 - x'_4\Big)(dx_1 + dx_3)^2 + 2(x'_1 + x'_3)(dx_1 + dx_3)(dx_2 - dx_4) \\
& - x'_1\, dx_1\, dx'_1 - dx_1\, dx'_2 + dx_2\, dx'_1 + x'_3\,(dx_1\, dx'_3 + dx_3\, dx'_1) \\
& + (x'_1 + 2x'_3)\, dx_3\, dx'_3 - dx_3\, dx'_4 + dx_4\, dx'_3
\end{aligned}
\]

with respect to coordinates $x_1, \dots, x_4, x'_1, \dots, x'_4$. Furthermore, metrics with similar properties can be obtained in higher dimensions.


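2. Group Structure on $\mathbb{R}^{4n}$

The main goal of this section is to attach a 3-step nilpotent Lie group to data $(\nabla, \nabla', \omega)$, where $\nabla, \nabla'$ are affine structures on $\mathbb{R}^{2n}$ compatible with a symplectic structure $\omega$. We begin by recalling some definitions which will be used throughout this article.

An affine structure (or a left symmetric algebra structure) on $\mathbb{R}^n$ is given by a connection $\nabla$, that is, a bilinear map $\nabla : \mathbb{R}^n \times \mathbb{R}^n \to \mathbb{R}^n$ satisfying the following conditions:

\[ \nabla_x y = \nabla_y x, \tag{1} \]
\[ \nabla_x \nabla_y = \nabla_y \nabla_x \tag{2} \]

for all $x, y \in \mathbb{R}^n$. If $\omega$ is a non-degenerate skew-symmetric bilinear form on $\mathbb{R}^n$, the affine structure $\nabla$ is compatible with $\omega$ if

\[ \omega(\nabla_x y, z) = \omega(\nabla_x z, y), \qquad x, y, z \in \mathbb{R}^n. \tag{3} \]

We notice that affine structures $\nabla$ on $\mathbb{R}^{2n}$ compatible with $\omega$ satisfy a condition stronger than (2), namely,

\[ \nabla_x \nabla_y = 0, \qquad x, y \in \mathbb{R}^{2n}. \tag{4} \]

The last equation follows from

\[ \omega(\nabla_x \nabla_y z, w) = \omega(\nabla_w x, \nabla_z y) = -\omega(\nabla_z \nabla_w x, y) = -\omega(\nabla_w \nabla_z x, y) = -\omega(\nabla_y w, \nabla_x z) = -\omega(\nabla_x \nabla_y z, w). \]

Let $\nabla$ and $\nabla'$ be two affine connections on $\mathbb{R}^{2n}$ compatible with $\omega$ and assume furthermore that $\nabla$ and $\nabla'$ satisfy the following compatibility condition:

\[ \nabla_x \nabla'_y = \nabla_y \nabla'_x \tag{5} \]

for all $x, y \in \mathbb{R}^{2n}$. From (5) and the compatibility of the connections with $\omega$, we obtain the following:

\[ \nabla'_x \nabla_y = \nabla'_y \nabla_x \tag{6} \]

and

\[ \nabla_x \nabla'_y = -\nabla'_y \nabla_x \tag{7} \]

for all $x, y \in \mathbb{R}^{2n}$. Indeed, (6) follows from

\[ \omega(\nabla'_x \nabla_y z, w) = -\omega(\nabla_y z, \nabla'_x w) = -\omega(\nabla_y \nabla'_x w, z) = -\omega(\nabla_x \nabla'_y w, z) = -\omega(\nabla_x z, \nabla'_y w) = \omega(\nabla'_y \nabla_x z, w), \]

and (7) follows from

\[ \omega(\nabla_x \nabla'_y z, w) = \omega(\nabla_y \nabla'_x z, w) = \omega(\nabla_w y, \nabla'_z x) = -\omega(\nabla'_z \nabla_w y, x) = -\omega(\nabla'_w \nabla_z y, x) = -\omega(\nabla'_x w, \nabla_y z) = -\omega(\nabla'_x \nabla_y z, w) = -\omega(\nabla'_y \nabla_x z, w). \]

We shall show in the next theorem that two affine structures $\nabla$ and $\nabla'$ on $\mathbb{R}^{2n}$ satisfying (5) and (6) give rise to a Lie group structure on the manifold $\mathbb{R}^{4n}$ such that $(\mathbb{R}^{4n}, \mathbb{R}^{2n} \times \{0\}, \{0\} \times \mathbb{R}^{2n})$ is a double Lie group. We recall that a double Lie group is given by a triple $(G, G_+, G_-)$ of Lie groups such that $G_+, G_-$ are Lie subgroups of $G$ and the product $G_+ \times G_- \to G$, $(g_+, g_-) \mapsto g_+ g_-$ is a diffeomorphism (see [12]). The next result shows that the additional condition (3) of $\nabla$ and $\nabla'$ with a fixed $\omega$ imposes restrictions on the Lie group obtained.

Theorem 2.1. Let $\nabla$ and $\nabla'$ be two affine structures on $\mathbb{R}^{2n}$ compatible with a symplectic form $\omega$ and satisfying also (5). Then $\mathbb{R}^{2n} \times \mathbb{R}^{2n}$ with the product given by

\[ (x, x') \cdot (y, y') = (x + \alpha(x', y),\ \beta(x', y) + y'), \tag{8} \]

where $x, x', y, y' \in \mathbb{R}^{2n}$ and

\[ \alpha(x', y) = y + \nabla'_y x' - \tfrac{1}{2} \nabla'_y \nabla_y x', \qquad \beta(x', y) = x' - \nabla_{x'} y - \tfrac{1}{2} \nabla_{x'} \nabla'_{x'} y, \tag{9} \]

is a 3-step nilpotent double Lie group. Furthermore, the associated Lie bracket on its Lie algebra $\mathbb{R}^{2n} \oplus \mathbb{R}^{2n}$ is

\[ [(x, x'), (y, y')] = (\nabla'_y x' - \nabla'_x y',\ \nabla_x y' - \nabla_y x'). \tag{10} \]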

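Proof. Let us set $\mathbb{R}^{2n}_+ := \mathbb{R}^{2n} \times \{0\}$ and $\mathbb{R}^{2n}_- := \{0\} \times \mathbb{R}^{2n}$. The maps $\alpha : \mathbb{R}^{2n}_- \times \mathbb{R}^{2n}_+ \to \mathbb{R}^{2n}_+$ and $\beta : \mathbb{R}^{2n}_- \times \mathbb{R}^{2n}_+ \to \mathbb{R}^{2n}_-$ satisfy the conditions

\[ \alpha_0 = 1, \qquad \alpha_{x'}(0) = 0, \qquad \alpha_{x'+y'} = \alpha_{x'} \circ \alpha_{y'}, \tag{11} \]
\[ \beta_0 = 1, \qquad \beta_y(0) = 0, \qquad \beta_{x+y} = \beta_y \circ \beta_x \tag{12} \]

for all $x', y' \in \mathbb{R}^{2n}_-$, $x, y \in \mathbb{R}^{2n}_+$, where we denote $\alpha_{x'}(y) := \alpha(x', y)$ and $\beta_y(x') := \beta(x', y)$ for $x' \in \mathbb{R}^{2n}_-$, $y \in \mathbb{R}^{2n}_+$. The above relations show that $\alpha$ is a left action of $\mathbb{R}^{2n}_-$ on $\mathbb{R}^{2n}_+$ and $\beta$ is a right action of $\mathbb{R}^{2n}_+$ on $\mathbb{R}^{2n}_-$. These maps also satisfy the following compatibility conditions:

\[ \alpha_{x'}(x + y) = \alpha_{x'}(x) + \alpha_{\beta_x(x')}(y), \qquad \beta_x(x' + y') = \beta_{\alpha_{y'}(x)}(x') + \beta_x(y'). \]

According to [12], the product given in (8) defines a Lie group structure on $\mathbb{R}^{4n}$ such that $(\mathbb{R}^{4n}, \mathbb{R}^{2n}_+, \mathbb{R}^{2n}_-)$ is a double Lie group. Note that the neutral element of this group structure is $(0, 0)$ and the inverse of $(x, x') \in \mathbb{R}^{4n}$ is $(\alpha(-x', -x), \beta(-x', -x))$.

Let us determine now the associated Lie algebra. Linearizing the above actions, we obtain representations $\mu : \mathbb{R}^{2n}_- \to \operatorname{End}(\mathbb{R}^{2n}_+)$ and $\rho : \mathbb{R}^{2n}_+ \to \operatorname{End}(\mathbb{R}^{2n}_-)$ given by

\[ \mu_{x'}(y) = \frac{d}{dt}\Big|_{t=0} (d\alpha_{tx'})_0(y), \qquad \rho_y(x') = \frac{d}{dt}\Big|_{t=0} (d\beta_{ty})_0(x'). \]

For $\alpha$ and $\beta$ given in (9), we obtain that

\[ (d\alpha_{x'})_0 = 1 + \nabla'_{x'}, \qquad (d\beta_y)_0 = 1 - \nabla_y. \]

Hence,

\[ \mu_{x'}(y) = \nabla'_{x'} y, \qquad \rho_y(x') = -\nabla_y x', \]

showing that the bracket of its Lie algebra is the one given in (10).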
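The linearization step can be written out in two lines. The display below is only a sketch: it assumes (9) is read as $\alpha(x', y) = y + \nabla'_y x' - \tfrac12 \nabla'_y \nabla_y x'$ and $\beta(x', y) = x' - \nabla_{x'} y - \tfrac12 \nabla_{x'} \nabla'_{x'} y$, a placement of primes that is an assumption chosen to agree with the explicit components listed in §5.2, and it uses only the symmetry (1).

```latex
\begin{align*}
\alpha_{tx'}(y) &= y + t\,\nabla'_{x'} y - \tfrac{t}{2}\,\nabla'_y \nabla_y x'
  && \text{(by (1), } \nabla'_y x' = \nabla'_{x'} y\text{)} \\
(d\alpha_{tx'})_0 &= 1 + t\,\nabla'_{x'}
  && \text{(the last term is quadratic in } y\text{, so its differential at } y = 0 \text{ vanishes)} \\
\mu_{x'}(y) &= \frac{d}{dt}\Big|_{t=0} (d\alpha_{tx'})_0(y) = \nabla'_{x'} y \\[4pt]
\beta_{ty}(x') &= x' - t\,\nabla_{y} x' - \tfrac{t}{2}\,\nabla_{x'} \nabla'_{x'} y
  && \text{(by (1), } \nabla_{x'} y = \nabla_y x'\text{)} \\
(d\beta_{ty})_0 &= 1 - t\,\nabla_{y}
  && \text{(the last term is quadratic in } x'\text{)} \\
\rho_{y}(x') &= \frac{d}{dt}\Big|_{t=0} (d\beta_{ty})_0(x') = -\nabla_y x'
\end{align*}
```

In particular the quadratic corrections in (9) do not affect the Lie bracket; they only enter the group law.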


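If $\operatorname{ad}_{(x,x')}$, $x, x' \in \mathbb{R}^{2n}$, stands for the transformation given by (10), then, using that $\nabla$ and $\nabla'$ are torsion-free (see (1)), one has

\[ \operatorname{ad}_{(x,x')} = \begin{pmatrix} \nabla'_{x'} & -\nabla'_x \\ -\nabla_{x'} & \nabla_x \end{pmatrix}. \]

From (4) applied to both $\nabla$ and $\nabla'$, we obtain

\[ \operatorname{ad}^2_{(x,x')} = \begin{pmatrix} \nabla'_x \nabla_{x'} & -\nabla'_x \nabla_x \\ -\nabla_{x'} \nabla'_{x'} & \nabla_{x'} \nabla'_x \end{pmatrix}. \]

Finally, using (7), $\operatorname{ad}^3_{(x,x')} = 0$. Hence $\mathbb{R}^{2n} \oplus \mathbb{R}^{2n}$ is a 3-step nilpotent Lie algebra, and therefore $\mathbb{R}^{2n} \times \mathbb{R}^{2n}$ is a 3-step nilpotent Lie group, as claimed.

We set the notation to be used in what follows. Since the construction of the Lie group structure on $\mathbb{R}^{4n}$ in Theorem 2.1 depends on the affine structures $\nabla$ and $\nabla'$, we will denote this Lie group by $N_{\nabla,\nabla'}$. The corresponding Lie algebra will be denoted $\mathfrak{n}_{\nabla,\nabla'}$ and the abelian Lie subalgebras $\mathbb{R}^{2n} \oplus \{0\}$, $\{0\} \oplus \mathbb{R}^{2n}$ will be denoted $\mathfrak{n}_+$, $\mathfrak{n}_-$, respectively. We note that $(\mathfrak{n}_{\nabla,\nabla'}, \mathfrak{n}_+, \mathfrak{n}_-)$ is a double Lie algebra, that is, $\mathfrak{n}_+$ and $\mathfrak{n}_-$ are Lie subalgebras of $\mathfrak{n}_{\nabla,\nabla'}$ and $\mathfrak{n}_{\nabla,\nabla'} = \mathfrak{n}_+ \oplus \mathfrak{n}_-$ as vector spaces.

Remarks. (i) The construction of the Lie algebra in Theorem 2.1 can be made without requiring the affine structures to be compatible with a symplectic form. Indeed, if $\nabla$ and $\nabla'$ are affine structures on $\mathbb{R}^m$ satisfying (5) and (6), then the bracket (10) defines a Lie algebra structure on $\mathbb{R}^m \oplus \mathbb{R}^m$ such that $\mathbb{R}^m \oplus \{0\}$ and $\{0\} \oplus \mathbb{R}^m$ are both abelian Lie subalgebras; hence $\mathbb{R}^m \oplus \mathbb{R}^m$ is 2-step solvable. Moreover, the centre of $\mathbb{R}^m \oplus \mathbb{R}^m$ is given by

\[ \{(x, x') \in \mathbb{R}^m \oplus \mathbb{R}^m : \nabla_x = \nabla'_x = 0,\ \nabla_{x'} = \nabla'_{x'} = 0\}. \tag{13} \]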

(ii) If $\nabla$ is any affine structure on $\mathbb{R}^m$ we denote by $A$ the associative (and commutative) algebra obtained from $\mathbb{R}^m$ together with the product $x \cdot y = \nabla_x y$, $x, y \in \mathbb{R}^m$. In [3, 4], $\mathfrak{aff}(A)$ denoted the Lie algebra $A \oplus A$ with Lie bracket $[(a, b), (c, d)] = (0, ad - bc)$ with $a, b, c, d \in A$. Note that if, in (i), we take $\nabla' = 0$ then (5) and (6) trivially hold, obtaining in this case a semidirect product which coincides with $\mathfrak{aff}(A)$. This family of algebras and various geometric properties were considered in [3, 4].

3. Invariant Symplectic Structures on $\mathbb{R}^{4n}$

In this section we show that $\mathfrak{n}_{\nabla,\nabla'}$ carries three symplectic structures, obtained from the symplectic form $\omega$ in $\mathbb{R}^{2n}$ compatible with $\nabla$ and $\nabla'$. These forms, defined at the Lie algebra level, give rise to left-invariant symplectic forms on the corresponding Lie group $N_{\nabla,\nabla'}$. Hence, $\mathbb{R}^{4n}$ inherits symplectic structures which are invariant by this nilpotent group.

First, we recall that a symplectic structure on a Lie algebra $\mathfrak{g}$ is a non-degenerate skew-symmetric bilinear form $\omega$ satisfying $d\omega = 0$, where

\[ d\omega(x, y, z) = \omega(x, [y, z]) + \omega(y, [z, x]) + \omega(z, [x, y]) \tag{14} \]

for $x, y, z \in \mathfrak{g}$. A given symplectic form $\omega$ on $\mathbb{R}^{2n}$ allows us to define the following non-degenerate skew-symmetric bilinear forms on $\mathbb{R}^{2n} \oplus \mathbb{R}^{2n}$:

\[
\begin{cases}
\omega_1((x, x'), (y, y')) := \omega(x, y) + \omega(x', y'), \\
\omega_2((x, x'), (y, y')) := -\omega(x, y') + \omega(y, x'), \\
\omega_3((x, x'), (y, y')) := \omega(x, y) - \omega(x', y').
\end{cases}
\tag{15}
\]

We show below that the above forms are closed with respect to the Lie bracket given in Theorem 2.1. Therefore, they define symplectic structures on $\mathfrak{n}_{\nabla,\nabla'}$.

Proposition 3.1. The 2-forms $\omega_1$, $\omega_2$ and $\omega_3$ are closed on $\mathfrak{n}_{\nabla,\nabla'}$.

Proof. Since $\mathbb{R}^{2n} \oplus \{0\}$ and $\{0\} \oplus \mathbb{R}^{2n}$ are abelian subalgebras of $\mathfrak{n}_{\nabla,\nabla'}$, it follows that the forms $\omega_i$, $i = 1, 2, 3$, are closed if and only if

\[ (d\omega_i)((x, 0), (y, 0), (0, z')) = (d\omega_i)((0, x'), (0, y'), (z, 0)) = 0 \]

for all $x, y, z, x', y', z' \in \mathbb{R}^{2n}$. But

\[
\begin{aligned}
(d\omega_i)((x, 0), (y, 0), (0, z')) &= \omega_i([(y, 0), (0, z')], (x, 0)) + \omega_i([(0, z'), (x, 0)], (y, 0)) \\
&= \omega_i((-\nabla'_y z', \nabla_y z'), (x, 0)) + \omega_i((\nabla'_x z', -\nabla_x z'), (y, 0))
\end{aligned}
\]

and

\[
\begin{aligned}
(d\omega_i)((0, x'), (0, y'), (z, 0)) &= \omega_i([(0, y'), (z, 0)], (0, x')) + \omega_i([(z, 0), (0, x')], (0, y')) \\
&= \omega_i((\nabla'_z y', -\nabla_z y'), (0, x')) + \omega_i((-\nabla'_z x', \nabla_z x'), (0, y')).
\end{aligned}
\]

Using the expressions of $\omega_i$, $i = 1, 2, 3$, given in (15), we compute

\[
\begin{aligned}
(d\omega_1)((x, 0), (y, 0), (0, z')) &= (d\omega_3)((x, 0), (y, 0), (0, z')) = -\omega(\nabla'_y z', x) + \omega(\nabla'_x z', y), \\
(d\omega_1)((0, x'), (0, y'), (z, 0)) &= -(d\omega_3)((0, x'), (0, y'), (z, 0)) = -\omega(\nabla_z y', x') + \omega(\nabla_z x', y')
\end{aligned}
\]

and

\[
\begin{aligned}
(d\omega_2)((x, 0), (y, 0), (0, z')) &= \omega(x, \nabla_y z') - \omega(y, \nabla_x z'), \\
(d\omega_2)((0, x'), (0, y'), (z, 0)) &= -\omega(\nabla'_z y', x') + \omega(\nabla'_z x', y').
\end{aligned}
\]

Since $\nabla$ and $\nabla'$ satisfy (1) and (3), we obtain that $d\omega_i = 0$, $i = 1, 2, 3$.

It follows from the definitions of the forms $\omega_i$, $i = 1, 2, 3$, that:


(i) the restrictions of $\omega_1$ and $\omega_3$ to $\mathfrak{n}_+$ and $\mathfrak{n}_-$ are symplectic forms on these subalgebras;
(ii) the Lie subalgebras $\mathfrak{n}_+$ and $\mathfrak{n}_-$ are Lagrangian subspaces of $\mathfrak{n}_{\nabla,\nabla'}$ with respect to the symplectic form $\omega_2$.

Let the form $\omega$ on $\mathbb{R}^{2n}$ be given by $\omega = e^1 \wedge e^2 + e^3 \wedge e^4 + \cdots + e^{2n-1} \wedge e^{2n}$, where $\{e_1, \dots, e_{2n}\}$ is a fixed basis of $\mathbb{R}^{2n}$ and $\{e^1, \dots, e^{2n}\}$ denotes the dual basis. Let us set $e_j := (e_j, 0)$ and $f_j := (0, e_j)$, $j = 1, \dots, 2n$. Hence $\{e_1, \dots, e_{2n}, f_1, \dots, f_{2n}\}$ is a basis of $\mathbb{R}^{2n} \oplus \mathbb{R}^{2n}$ and the forms $\omega_i$, $i = 1, 2, 3$, can be written as

\[
\begin{aligned}
\omega_1 &= e^1 \wedge e^2 + \cdots + e^{2n-1} \wedge e^{2n} + f^1 \wedge f^2 + \cdots + f^{2n-1} \wedge f^{2n}, \\
\omega_2 &= -e^1 \wedge f^2 - \cdots - e^{2n-1} \wedge f^{2n} + e^2 \wedge f^1 + \cdots + e^{2n} \wedge f^{2n-1}, \\
\omega_3 &= e^1 \wedge e^2 + \cdots + e^{2n-1} \wedge e^{2n} - f^1 \wedge f^2 - \cdots - f^{2n-1} \wedge f^{2n}.
\end{aligned}
\]

3.1. Since $\mathfrak{n}_{\nabla,\nabla'}$ is a double Lie algebra, the endomorphism $E$ given by $E(x, y) = (x, -y)$ for $x, y \in \mathbb{R}^{2n}$ is a product structure on $\mathfrak{n}_{\nabla,\nabla'}$, that is, $E^2 = 1$ and $E$ is integrable, in the sense that it satisfies the condition

\[ E[(x, x'), (y, y')] = [E(x, x'), (y, y')] + [(x, x'), E(y, y')] - E[E(x, x'), E(y, y')] \tag{16} \]

for all $x, x', y, y' \in \mathbb{R}^{2n}$. We note that the integrability of $E$ is equivalent to $\mathfrak{n}_+$ and $\mathfrak{n}_-$, the eigenspaces corresponding to the eigenvalues $\pm 1$ of $E$, being Lie subalgebras of $\mathfrak{n}_{\nabla,\nabla'}$. Moreover, since $\mathfrak{n}_+$ and $\mathfrak{n}_-$ have equal dimension, $E$ is a paracomplex structure on $\mathfrak{n}_{\nabla,\nabla'}$. The symplectic form $\omega_2$ satisfies

\[ \omega_2(E(x, x'), E(y, y')) = -\omega_2((x, x'), (y, y')) \]

for all $x, x', y, y' \in \mathbb{R}^{2n}$. Therefore, $\{\mathfrak{n}_{\nabla,\nabla'}, E, \omega_2\}$ is an example of a parakähler Lie algebra in the sense of Kaneyuki (see [11]).

Another endomorphism on $\mathfrak{n}_{\nabla,\nabla'}$ related to its decomposition as a double Lie algebra is given by $J(x, y) = (-y, x)$ for $x, y \in \mathbb{R}^{2n}$. The endomorphism $J$ is a complex structure on $\mathfrak{n}_{\nabla,\nabla'}$, that is, $J^2 = -1$ and $J$ is integrable, i.e., it satisfies

\[ J[(x, x'), (y, y')] = [J(x, x'), (y, y')] + [(x, x'), J(y, y')] + J[J(x, x'), J(y, y')] \tag{17} \]

for all $x, x', y, y' \in \mathbb{R}^{2n}$.

We note that $JE = -EJ$, and therefore $\{J, E\}$ is a complex product structure on $\mathfrak{n}_{\nabla,\nabla'}$ (see [3]). The symplectic form $\omega_1$ satisfies

\[ \omega_1(J(x, x'), J(y, y')) = \omega_1((x, x'), (y, y')) \]

for all $x, x', y, y' \in \mathbb{R}^{2n}$. Hence, $\omega_1$ is a Kähler form on $\mathfrak{n}_{\nabla,\nabla'}$.
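Closedness of the three forms can also be checked numerically on a concrete algebra. The following is a minimal sketch, not part of the paper: it takes the connections on $\mathbb{R}^4$ that appear in §5.2 (matrices transcribed from that section), picks arbitrary sample values for the parameters $a, b, c$, reads the bracket (10) as $[(x,x'),(y,y')] = (\nabla'_y x' - \nabla'_x y', \nabla_x y' - \nabla_y x')$, and evaluates $d\omega_i$ from (14) on all triples of basis vectors. The helper names `conn`, `bracket` and `d` are hypothetical.

```python
from itertools import product

a, b, c = 2, 5, 3  # sample parameter values (an arbitrary choice for this sketch)

# Matrices of nabla_{e_i} and nabla'_{e_i} on R^4 (column j is the image of e_j).
D1 = [[1, 0, 1, 0], [0, -1, 0, 1], [-1, 0, -1, 0], [0, -1, 0, 1]]
D2 = [[0, 0, 0, 0], [-1, 0, -1, 0], [0, 0, 0, 0], [-1, 0, -1, 0]]
D4 = [[0, 0, 0, 0], [1, 0, 1, 0], [0, 0, 0, 0], [1, 0, 1, 0]]
P1 = [[a, 0, a, 0], [b, -a, c, a], [-a, 0, -a, 0], [c, -a, -b + 2 * c, a]]
P2 = [[0, 0, 0, 0], [-a, 0, -a, 0], [0, 0, 0, 0], [-a, 0, -a, 0]]
P3 = [[a, 0, a, 0], [c, -a, -b + 2 * c, a], [-a, 0, -a, 0],
      [-b + 2 * c, -a, -2 * b + 3 * c, a]]
P4 = [[0, 0, 0, 0], [a, 0, a, 0], [0, 0, 0, 0], [a, 0, a, 0]]
NABLA, NABLA_P = [D1, D2, D1, D4], [P1, P2, P3, P4]  # nabla_{e_3} = nabla_{e_1}


def conn(mats, x, y):
    """Return nabla_x y for the connection whose matrices nabla_{e_i} are `mats`."""
    return [sum(x[i] * mats[i][k][j] * y[j] for i in range(4) for j in range(4))
            for k in range(4)]


def sub(u, v):
    return [p - q for p, q in zip(u, v)]


def bracket(u, v):
    """Bracket (10) on R^4 + R^4; elements are stored as 8-vectors (x, x')."""
    x, xp, y, yp = u[:4], u[4:], v[:4], v[4:]
    return (sub(conn(NABLA_P, y, xp), conn(NABLA_P, x, yp))
            + sub(conn(NABLA, x, yp), conn(NABLA, y, xp)))


def omega(x, y):
    """omega = e^1 ^ e^2 + e^3 ^ e^4 on R^4."""
    return x[0] * y[1] - x[1] * y[0] + x[2] * y[3] - x[3] * y[2]


def w1(u, v):
    return omega(u[:4], v[:4]) + omega(u[4:], v[4:])


def w2(u, v):
    return -omega(u[:4], v[4:]) + omega(v[:4], u[4:])


def w3(u, v):
    return omega(u[:4], v[:4]) - omega(u[4:], v[4:])


def d(form, x, y, z):
    """Exterior derivative (14) of a 2-form on the Lie algebra."""
    return form(x, bracket(y, z)) + form(y, bracket(z, x)) + form(z, bracket(x, y))


BASIS = [[int(i == j) for j in range(8)] for i in range(8)]
closed = {name: all(d(f, x, y, z) == 0 for x, y, z in product(BASIS, repeat=3))
          for name, f in (("w1", w1), ("w2", w2), ("w3", w3))}
print(closed)
```

Because $d\omega_i$ is trilinear, checking basis triples is enough; integer parameters keep the arithmetic exact.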


4. Induced Geometry on $\mathbb{R}^{4n}$

In this section we analyze the properties of the metric on the manifold $\mathbb{R}^{4n}$ obtained by left-translating, by the Lie group $N_{\nabla,\nabla'}$, the standard inner product of signature $(2n, 2n)$ on $\mathbb{R}^{2n} \oplus \mathbb{R}^{2n}$. We show that this metric is always complete and that it is flat if and only if the Lie group $N_{\nabla,\nabla'}$ is 2-step nilpotent (see Theorem 4.2 and Theorem 4.3). Furthermore, this metric on $\mathbb{R}^{4n}$ is hypersymplectic with respect to the structures $J$ and $E$ defined previously; in particular, it is neutral Kähler and Ricci-flat. Explicit examples will be given in subsequent sections.

Let us define a bilinear form $g$ on $\mathfrak{n}_{\nabla,\nabla'}$ by

\[ g((x, x'), (y, y')) = -\omega(x, y') + \omega(x', y) \tag{18} \]

for all $(x, x'), (y, y') \in \mathfrak{n}_{\nabla,\nabla'}$. It is clearly symmetric and non-degenerate. With respect to the basis $\{e_1, \dots, e_{2n}, f_1, \dots, f_{2n}\}$, $g$ can be written as

\[ g = 2\left(-e^1 \cdot f^2 - \cdots - e^{2n-1} \cdot f^{2n} + e^2 \cdot f^1 + \cdots + e^{2n} \cdot f^{2n-1}\right), \]

where $\cdot$ denotes the symmetric product of 1-forms. Moreover, $g$ satisfies the two following conditions:

\[ g(J(x, x'), J(y, y')) = g((x, x'), (y, y')), \tag{19} \]
\[ g(E(x, x'), E(y, y')) = -g((x, x'), (y, y')) \tag{20} \]

for $x, x', y, y' \in \mathbb{R}^{2n}$. Indeed,

\[ g(J(x, x'), J(y, y')) = g((-x', x), (-y', y)) = -\omega(-x', y) + \omega(x, -y') = g((x, x'), (y, y')) \]

and

\[ g(E(x, x'), E(y, y')) = g((x, -x'), (y, -y')) = \omega(x, y') - \omega(x', y) = -g((x, x'), (y, y')). \]

Thus, $g$ is a Hermitian metric on $\mathfrak{n}_{\nabla,\nabla'}$ with respect to both structures $J$ and $E$. We note that the subalgebras $\mathfrak{n}_+$ and $\mathfrak{n}_-$ are both isotropic subspaces of $\mathfrak{n}_{\nabla,\nabla'}$ with respect to $g$ and this metric has signature $(2n, 2n)$. Moreover, it is easy to verify that the 2-forms $\omega_1$, $\omega_2$ and $\omega_3$ can be recovered from $g$ and the endomorphisms $J$ and $E$. Indeed we have

\[
\begin{cases}
\omega_1((x, x'), (y, y')) = g(J(x, x'), (y, y')), \\
\omega_2((x, x'), (y, y')) = g(E(x, x'), (y, y')), \\
\omega_3((x, x'), (y, y')) = g(JE(x, x'), (y, y')).
\end{cases}
\tag{21}
\]

The endomorphisms $J$ and $E$ of $\mathfrak{n}_{\nabla,\nabla'}$, as well as the 2-forms $\omega_1$, $\omega_2$ and $\omega_3$ and the metric $g$, can be extended to the group $N_{\nabla,\nabla'}$ by left translations. Hence, $N_{\nabla,\nabla'}$ is equipped with:

(1) a complex structure $J$ and a product structure $E$ such that $JE = -EJ$;
(2) a (pseudo) Riemannian metric $g$ such that $g(J(x, x'), J(y, y')) = g((x, x'), (y, y'))$ and $g(E(x, x'), E(y, y')) = -g((x, x'), (y, y'))$ for all vector fields $(x, x'), (y, y')$ on $N_{\nabla,\nabla'}$;
(3) three symplectic forms $\omega_1$, $\omega_2$ and $\omega_3$ which satisfy (21).

To summarize, we have obtained

Theorem 4.1. The nilpotent Lie group $N_{\nabla,\nabla'}$ carries a left-invariant hypersymplectic structure given by the 3-tuple $\{J, E, g\}$. In particular, $(N_{\nabla,\nabla'}, J, g)$ is a (neutral) Kähler manifold and $g$ is a Ricci-flat metric.

Note also that $E$ is a product structure on $N_{\nabla,\nabla'}$ and hence, there is a decomposition of $T(N_{\nabla,\nabla'})$ into the Whitney sum of two involutive distributions of the same rank which are interchanged by $J$.

Remark. The leaves of both foliations given by $E$ are Lagrangian submanifolds of the symplectic manifold $(N_{\nabla,\nabla'}, \omega_2)$. Hence, $N_{\nabla,\nabla'}$ is an example of a homogeneous parakähler manifold (see [11]).

4.1. Curvature and completeness of $g$. Since $g$ is left-invariant, the Levi-Civita connection $\nabla^g$ can be computed on left-invariant vector fields, i.e., on the Lie algebra $\mathfrak{n}_{\nabla,\nabla'}$. After a computation one finds that

\[ \nabla^g_{(x,x')} = \begin{pmatrix} \nabla_x + \nabla'_{x'} & 0 \\ 0 & \nabla_x + \nabla'_{x'} \end{pmatrix}. \tag{22} \]

One can verify, using the above expression of $\nabla^g$, that $J$ and $E$ are parallel with respect to the Levi-Civita connection. We will show next that this connection need not be flat. If $R$ denotes the curvature of $\nabla^g$, it is easily seen (using (4)) that $R((x, 0), (y, 0)) = R((0, x'), (0, y')) = 0$. Moreover,

\[ R((x, 0), (0, y')) = \nabla^g_{(x,0)} \nabla^g_{(0,y')} - \nabla^g_{(0,y')} \nabla^g_{(x,0)} - \nabla^g_{(-\nabla'_x y',\ \nabla_x y')} \]

and using (22) together with (1), (5) and (7) one obtains

\[ R((x, 0), (0, y')) = 4 \begin{pmatrix} \nabla_x \nabla'_{y'} & 0 \\ 0 & \nabla_x \nabla'_{y'} \end{pmatrix} = -4 \operatorname{ad}_{[(x,0),(0,y')]}. \]

Since $R$ and the Lie bracket are skew-symmetric, one finally obtains

\[ R((x, x'), (y, y')) = -4 \operatorname{ad}_{[(x,x'),(y,y')]}, \]

thus showing that $\nabla^g$ will be flat if and only if $N_{\nabla,\nabla'}$ is 2-step nilpotent. Note also that

\[ R = 0 \quad \text{if and only if} \quad \nabla_x \nabla'_y = 0 \tag{23} \]

for all $x, y \in \mathbb{R}^{2n}$. Summing up, we have shown

Theorem 4.2. The following conditions are equivalent:
(i) The Lie algebra $\mathfrak{n}_{\nabla,\nabla'}$ is 2-step nilpotent;
(ii) $\nabla_x \nabla'_y = 0$ for all $x, y \in \mathbb{R}^{2n}$;
(iii) The hypersymplectic metric is flat.

We end this section studying the completeness of $\nabla^g$. It follows from [7] that $\nabla^g$ will be complete if and only if the differential equation on $\mathfrak{n}_{\nabla,\nabla'}$

\[ \dot{x}(t) = \operatorname{ad}^g_{x(t)} x(t) \tag{24} \]

admits solutions $x(t) \in \mathfrak{n}_{\nabla,\nabla'}$ defined for all $t \in \mathbb{R}$. Here $\operatorname{ad}^g_x$ means the adjoint of the transformation $\operatorname{ad}_x$ with respect to the metric $g$. It is easy to verify that the right-hand side of (24) is given by $\operatorname{ad}^g_{x(t)} x(t) = -\nabla^g_{x(t)} x(t)$ for all $t$ in the domain of $x$ and thus we have to solve the equation

\[ \dot{x}(t) = -\nabla^g_{x(t)} x(t). \tag{25} \]

The curve $x(t)$ on $\mathfrak{n}_{\nabla,\nabla'}$ can be written as $x(t) = (a(t), b(t))$, where $a(t), b(t)$ are smooth curves on $\mathbb{R}^{2n}$. Hence, using (22), Eq. (25) translates into the system

\[
\begin{cases}
\dot{a} = -\nabla_a a - \nabla'_b a, \\
\dot{b} = -\nabla_a b - \nabla'_b b.
\end{cases}
\tag{26}
\]

Let us differentiate the first equation of the system above. We have

\[
\begin{aligned}
\ddot{a} &= -2\nabla_a \dot{a} - \nabla'_a \dot{b} - \nabla'_b \dot{a} \\
&= 2\nabla_a \nabla_a a + 2\nabla_a \nabla'_a b + \nabla'_a \nabla_a b + \nabla'_a \nabla'_b b + \nabla'_b \nabla_a a + \nabla'_b \nabla'_a b = 0,
\end{aligned}
\]

using (4), (5), (6) and (7). In the same fashion, we differentiate the second equation of (26) and obtain

\[
\begin{aligned}
\ddot{b} &= -\nabla_a \dot{b} - \nabla_b \dot{a} - 2\nabla'_b \dot{b} \\
&= \nabla_a \nabla_a b + \nabla_a \nabla'_b b + \nabla_b \nabla_a a + \nabla_b \nabla'_a b + 2\nabla'_b \nabla_a b + 2\nabla'_b \nabla'_b b = 0,
\end{aligned}
\]

using again (4), (5), (6) and (7). Thus, there exist constant vectors $A, B, C, D \in \mathbb{R}^{2n}$ such that

\[ a(t) = At + B, \qquad b(t) = Ct + D. \]

The explicit solution of the system (26) with initial condition $x(0) = (a_0, b_0)$ is given by

\[ a(t) = (-\nabla_{a_0} a_0 - \nabla'_{a_0} b_0)\,t + a_0, \qquad b(t) = (-\nabla_{a_0} b_0 - \nabla'_{b_0} b_0)\,t + b_0. \]

Therefore, $x(t)$ is defined for all $t \in \mathbb{R}$ and, in consequence, $\nabla^g$ is complete. Thus, we have obtained


Theorem 4.3. Hypersymplectic metrics on $\mathfrak{n}_{\nabla,\nabla'}$ are always complete.

Remark. The completeness of $\nabla^g$ could have been dealt with in the following manner when $\nabla^g$ is flat. In this case, from results in [15] we obtain that the completeness follows if the transformation $(x, x') \mapsto \nabla^g_{(x,x')}(y, y')$ is nilpotent for every $(y, y')$. But, using (22) one finds

\[ \nabla^g (y, y') = \begin{pmatrix} \nabla_y & \nabla'_y \\ \nabla_{y'} & \nabla'_{y'} \end{pmatrix}. \tag{27} \]

Using (4) and (23) one shows that $(\nabla^g (y, y'))^2 = 0$ and hence the completeness follows.

5. Explicit Examples

In this section we consider particular cases of the constructions given previously. In the first one we give explicit affine structures $\nabla$ and $\nabla'$ such that the resulting group is $8n$-dimensional, 2-step nilpotent and with a $4n$-dimensional centre invariant by the complex structure. It also admits compact quotients, hence the associated nilmanifolds are Kodaira manifolds (see [6]). In this case the hypersymplectic metric obtained on the group is complete and flat, hence it is isometric to the standard one. On the other hand this Lie group carries a closed special form in the sense of ([6], Sect. 2) and thus, the procedure developed in [6] may be applied to produce non-flat Kähler Ricci-flat metrics on this family of Kodaira manifolds.

In the second one we give an affine structure $\nabla$ and a 3-parameter family $\nabla'^{\,a,b,c}$ on $\mathbb{R}^4$ satisfying all the requirements to obtain a 3-step nilpotent group structure on $\mathbb{R}^8$. In this case the hypersymplectic metric obtained on $\mathbb{R}^8$ will be complete and non-flat. Moreover, the bracket relations will show that compact quotients can be obtained. This example can be generalized to higher dimensions, thus obtaining complete non-flat hypersymplectic metrics on $\mathbb{R}^{4n}$, $n \geq 2$, invariant by a 3-step nilpotent Lie group.

5.1. Neutral Kähler Einstein metrics on Kodaira manifolds. Let us consider on $\mathbb{R}^{4n} = \operatorname{span}\{e_1, \dots, e_{4n}\}$ the flat torsion-free connections $\nabla$ and $\nabla'$ given by

\[
\begin{aligned}
\nabla_{e_i} e_i &= e_{i+1}, && i \text{ odd},\ 1 \leq i \leq 2n, \\
\nabla_{e_j} e_k &= 0, && \text{otherwise}
\end{aligned}
\]

and

\[
\begin{aligned}
\nabla'_{e_i} e_i &= e_{i+1}, && i \text{ odd},\ 2n + 1 \leq i \leq 4n, \\
\nabla'_{e_j} e_k &= 0, && \text{otherwise}.
\end{aligned}
\]

Clearly, $\nabla \nabla' = 0 = \nabla' \nabla$. Also, it is easy to see that both connections are compatible with the standard symplectic form $\omega$ on $\mathbb{R}^{4n}$ given by $\omega = e^1 \wedge e^2 + e^3 \wedge e^4 + \cdots + e^{4n-1} \wedge e^{4n}$.
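For $n = 1$ the two connections just defined live on $\mathbb{R}^4$ and can be checked by brute force. The sketch below is illustrative only: it assumes the bracket (10) is read as $[(x,x'),(y,y')] = (\nabla'_y x' - \nabla'_x y', \nabla_x y' - \nabla_y x')$, verifies condition (3) for both connections, and then computes the brackets $[e_i, f_i]$, the central basis vectors and the nilpotency step of the resulting 8-dimensional algebra. The names `nabla`, `nabla_p`, `bracket` and `central` are hypothetical.

```python
from itertools import product

N = 4  # dimension 4n of the base space for n = 1


def nabla(x, y):
    """nabla_x y, with nabla_{e_1} e_1 = e_2 and all other products zero."""
    return [0, x[0] * y[0], 0, 0]


def nabla_p(x, y):
    """nabla'_x y, with nabla'_{e_3} e_3 = e_4 and all other products zero."""
    return [0, 0, 0, x[2] * y[2]]


def omega(x, y):
    """Standard symplectic form e^1 ^ e^2 + e^3 ^ e^4."""
    return x[0] * y[1] - x[1] * y[0] + x[2] * y[3] - x[3] * y[2]


def sub(u, v):
    return [p - q for p, q in zip(u, v)]


def bracket(u, v):
    """Bracket (10) on R^4 + R^4; elements are stored as 8-vectors (x, x')."""
    x, xp, y, yp = u[:N], u[N:], v[:N], v[N:]
    return sub(nabla_p(y, xp), nabla_p(x, yp)) + sub(nabla(x, yp), nabla(y, xp))


E4 = [[int(i == j) for j in range(N)] for i in range(N)]          # basis of R^4
B = [[int(i == j) for j in range(2 * N)] for i in range(2 * N)]   # basis of R^8
e, f = B[:N], B[N:]                                               # e_i = (e_i, 0), f_i = (0, e_i)

# Condition (3): omega(conn_x y, z) = omega(conn_x z, y) for both connections.
compatible = all(omega(conn(x, y), z) == omega(conn(x, z), y)
                 for conn in (nabla, nabla_p)
                 for x, y, z in product(E4, repeat=3))

# 2-step nilpotency: every double bracket of basis vectors vanishes.
two_step = all(not any(bracket(bracket(u, v), w))
               for u, v, w in product(B, repeat=3))

# Indices (0-based, order e_1..e_4, f_1..f_4) of basis vectors that are central.
central = [i for i, u in enumerate(B) if all(not any(bracket(u, v)) for v in B)]

print(compatible, two_step, central)
print(bracket(e[0], f[0]), bracket(e[2], f[2]))
```

With this reading of (10), the only non-zero brackets among basis vectors are $[e_1, f_1] = f_2$ and $[e_3, f_3] = -e_4$ (and their negatives), and the central basis vectors are exactly those with even index.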


We can then form the $8n$-dimensional Lie algebra $\mathfrak{n}_{\nabla,\nabla'}$ as in previous sections. It has a basis $\{e_i, f_i : i = 1, \dots, 4n\}$, where $e_i = (e_i, 0)$ and $f_i = (0, e_i)$, and the only non-zero Lie brackets are

\[
[e_i, f_i] =
\begin{cases}
f_{i+1}, & i \text{ odd},\ 1 \leq i \leq 2n, \\
-e_{i+1}, & i \text{ odd},\ 2n + 1 \leq i \leq 4n.
\end{cases}
\]

Let $N_{\nabla,\nabla'}$ denote the simply connected Lie group associated to the Lie algebra $\mathfrak{n}_{\nabla,\nabla'}$. Since the structure constants are 0, 1 or $-1$, by Malcev's theorem [13], there exists a discrete subgroup $\Gamma$ of $N_{\nabla,\nabla'}$ such that $M := \Gamma \backslash N_{\nabla,\nabla'}$ is compact. Using (13), we obtain that the centre $\mathfrak{z}$ of $\mathfrak{n}_{\nabla,\nabla'}$ is given by $\mathfrak{z} = \operatorname{span}\{e_i, f_i : i \text{ is even}\}$, showing in particular that this Lie group is 2-step nilpotent and $\mathfrak{z}$ is $4n$-dimensional and stable under the action of $J$. Therefore, the nilmanifold $M$ is an $8n$-dimensional Kodaira manifold (see [6]).

According to the results in §4, $N_{\nabla,\nabla'}$ carries a left-invariant hypersymplectic structure, which is flat since $N_{\nabla,\nabla'}$ is 2-step nilpotent (see Theorem 4.2). Besides, since the centre $\mathfrak{z}$ is, with respect to $\omega_1$, a Lagrangian subspace, the symplectic form $\omega_1$ on the Lie group induces a closed special 2-form on $M$. The method described in [6] can be applied in this case to produce Ricci-flat neutral Kähler metrics on $M$.

We note that $\mathfrak{z}$ is a special Lagrangian subspace of $\mathfrak{n}_{\nabla,\nabla'}$ with respect to the $J$-holomorphic form $(\omega_2 + i\omega_3)^{2n}$. This gives rise to special Lagrangian submanifolds on the symplectic manifold $M$.

The Levi-Civita connection $\nabla^g$ of the hypersymplectic metric $g$ on the group $N_{\nabla,\nabla'}$ is given by

\[
\begin{cases}
\nabla^g_{e_i} e_i = e_{i+1}, & i \text{ odd},\ 1 \leq i \leq 2n, \\
\nabla^g_{e_i} f_i = f_{i+1}, & i \text{ odd},\ 1 \leq i \leq 2n, \\
\nabla^g_{f_i} e_i = e_{i+1}, & i \text{ odd},\ 2n + 1 \leq i \leq 4n, \\
\nabla^g_{f_i} f_i = f_{i+1}, & i \text{ odd},\ 2n + 1 \leq i \leq 4n,
\end{cases}
\]

and 0 in all the other possibilities. Note that we have the relations $\mathfrak{z} = \operatorname{span}\{\nabla^g_x y : x, y \in \mathfrak{n}_{\nabla,\nabla'}\}$ and $\nabla^g x = 0$ for $x \in \mathfrak{z}$. Hence, Proposition 4.1 in [6] can be applied, obtaining neutral Calabi-Yau metrics on compact quotients of the cotangent bundle of $N_{\nabla,\nabla'}$.

5.2. Complete, non-flat, neutral Kähler Einstein metrics on $\mathbb{R}^8$. We will consider next $\mathbb{R}^4 = \operatorname{span}\{e_1, \dots, e_4\}$ equipped with two affine structures $\nabla$ and $\nabla'$ given by

\[
\nabla_{e_1} = \nabla_{e_3} = \begin{pmatrix} 1 & 0 & 1 & 0 \\ 0 & -1 & 0 & 1 \\ -1 & 0 & -1 & 0 \\ 0 & -1 & 0 & 1 \end{pmatrix}, \qquad
\nabla_{e_2} = \begin{pmatrix} 0 & 0 & 0 & 0 \\ -1 & 0 & -1 & 0 \\ 0 & 0 & 0 & 0 \\ -1 & 0 & -1 & 0 \end{pmatrix}, \qquad
\nabla_{e_4} = \begin{pmatrix} 0 & 0 & 0 & 0 \\ 1 & 0 & 1 & 0 \\ 0 & 0 & 0 & 0 \\ 1 & 0 & 1 & 0 \end{pmatrix},
\]

and

\[
\nabla'_{e_1} = \begin{pmatrix} a & 0 & a & 0 \\ b & -a & c & a \\ -a & 0 & -a & 0 \\ c & -a & -b + 2c & a \end{pmatrix}, \qquad
\nabla'_{e_2} = \begin{pmatrix} 0 & 0 & 0 & 0 \\ -a & 0 & -a & 0 \\ 0 & 0 & 0 & 0 \\ -a & 0 & -a & 0 \end{pmatrix},
\]

\[
\nabla'_{e_3} = \begin{pmatrix} a & 0 & a & 0 \\ c & -a & -b + 2c & a \\ -a & 0 & -a & 0 \\ -b + 2c & -a & -2b + 3c & a \end{pmatrix}, \qquad
\nabla'_{e_4} = \begin{pmatrix} 0 & 0 & 0 & 0 \\ a & 0 & a & 0 \\ 0 & 0 & 0 & 0 \\ a & 0 & a & 0 \end{pmatrix}
\]

for $a, b, c \in \mathbb{R}$. One can verify easily that $\nabla$ and $\nabla'$ satisfy Eq. (3) with respect to the symplectic form $\omega = e^1 \wedge e^2 + e^3 \wedge e^4$ on $\mathbb{R}^4$. In order to see that the compatibility condition (5) holds, we observe that

\[
\nabla_{e_j} \nabla'_{e_k} = \begin{pmatrix} 0 & 0 & 0 & 0 \\ -b + c & 0 & -b + c & 0 \\ 0 & 0 & 0 & 0 \\ -b + c & 0 & -b + c & 0 \end{pmatrix}
\tag{28}
\]

for $(j, k) = (1, 1), (1, 3), (3, 1), (3, 3)$ and $\nabla_{e_j} \nabla'_{e_k} = 0$ otherwise.

From results in §2, we may construct the 8-dimensional nilpotent Lie group $N_{\nabla,\nabla'}$ (see Theorem 2.1), whose underlying manifold is $\mathbb{R}^4 \times \mathbb{R}^4$. The group structure is as follows:

\[ (x, x') \cdot (y, y') = (x + \alpha(x', y),\ \beta(x', y) + y'), \]

where $\alpha$ and $\beta$ are given, in terms of their components, by

\[
\begin{aligned}
\alpha_1(x', y) ={}& y_1 + a(y_1 + y_3)(x'_1 + x'_3), \\
\alpha_2(x', y) ={}& y_2 + (by_1 - ay_2 + cy_3 + ay_4)\,x'_1 - a(y_1 + y_3)\,x'_2 + (cy_1 - ay_2 + (-b + 2c)y_3 + ay_4)\,x'_3 \\
& + a(y_1 + y_3)\,x'_4 + \tfrac{1}{2}(-b + c)(y_1 + y_3)^2 (x'_1 + x'_3), \\
\alpha_3(x', y) ={}& y_3 - a(y_1 + y_3)(x'_1 + x'_3), \\
\alpha_4(x', y) ={}& y_4 + (cy_1 - ay_2 + (-b + 2c)y_3 + ay_4)\,x'_1 - a(y_1 + y_3)\,x'_2 \\
& + ((-b + 2c)y_1 - ay_2 + (-2b + 3c)y_3 + ay_4)\,x'_3 + a(y_1 + y_3)\,x'_4 + \tfrac{1}{2}(-b + c)(y_1 + y_3)^2 (x'_1 + x'_3), \\
\beta_1(x', y) ={}& x'_1 - (x'_1 + x'_3)(y_1 + y_3), \\
\beta_2(x', y) ={}& x'_2 + (x'_2 - x'_4)(y_1 + y_3) + (x'_1 + x'_3)(y_2 - y_4) - \tfrac{1}{2}(-b + c)(x'_1 + x'_3)^2 (y_1 + y_3), \\
\beta_3(x', y) ={}& x'_3 + (x'_1 + x'_3)(y_1 + y_3), \\
\beta_4(x', y) ={}& x'_4 + (x'_2 - x'_4)(y_1 + y_3) + (x'_1 + x'_3)(y_2 - y_4) - \tfrac{1}{2}(-b + c)(x'_1 + x'_3)^2 (y_1 + y_3).
\end{aligned}
\]

We know from §4 that $N_{\nabla,\nabla'}$ carries an invariant hypersymplectic structure whose associated neutral metric is complete (Theorem 4.3). Moreover, using (23) and (28), we may conclude that if $b = c$, then $\mathfrak{n}_{\nabla,\nabla'}$ is 2-step nilpotent and the hypersymplectic metric is flat, whereas if $b \neq c$, then $\mathfrak{n}_{\nabla,\nabla'}$ is 3-step nilpotent and the hypersymplectic metric is not flat. Note that taking $a, b, c \in \mathbb{Q}$, the structure constants of $\mathfrak{n}_{\nabla,\nabla'}$ with respect to the canonical basis $\{e_1, \dots, e_4, f_1, \dots, f_4\}$ of $\mathfrak{n}_{\nabla,\nabla'}$ are rational, and thus there exists a discrete co-compact subgroup of $N_{\nabla,\nabla'}$ [13]. The complete non-flat hypersymplectic metric on the Lie group induces a metric with the same properties on the associated compact quotient.
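The algebraic requirements on this pair of connections can be verified mechanically. The sketch below is not taken from the paper: it transcribes the eight matrices of this section, fixes arbitrary integer sample values for $a, b, c$, and tests conditions (1), (3), (4), (5), (7) and the product formula (28). Condition (3) is tested in the equivalent form "the matrix $\Omega\,\nabla_{e_i}$ is symmetric", where $\Omega$ is the Gram matrix of $\omega$; the reading of (5) as $\nabla_x \nabla'_y = \nabla_y \nabla'_x$ and of (7) as $\nabla_x \nabla'_y = -\nabla'_y \nabla_x$ is an assumption about where the primes sit.

```python
a, b, c = 2, 5, 3  # sample parameter values (an arbitrary choice for this sketch)

# Matrices of nabla_{e_i} and nabla'_{e_i} on R^4 (column j is the image of e_j).
D1 = [[1, 0, 1, 0], [0, -1, 0, 1], [-1, 0, -1, 0], [0, -1, 0, 1]]
D2 = [[0, 0, 0, 0], [-1, 0, -1, 0], [0, 0, 0, 0], [-1, 0, -1, 0]]
D4 = [[0, 0, 0, 0], [1, 0, 1, 0], [0, 0, 0, 0], [1, 0, 1, 0]]
P1 = [[a, 0, a, 0], [b, -a, c, a], [-a, 0, -a, 0], [c, -a, -b + 2 * c, a]]
P2 = [[0, 0, 0, 0], [-a, 0, -a, 0], [0, 0, 0, 0], [-a, 0, -a, 0]]
P3 = [[a, 0, a, 0], [c, -a, -b + 2 * c, a], [-a, 0, -a, 0],
      [-b + 2 * c, -a, -2 * b + 3 * c, a]]
P4 = [[0, 0, 0, 0], [a, 0, a, 0], [0, 0, 0, 0], [a, 0, a, 0]]
NABLA, NABLA_P = [D1, D2, D1, D4], [P1, P2, P3, P4]  # nabla_{e_3} = nabla_{e_1}

IDX = range(4)
ZERO = [[0] * 4 for _ in IDX]
OMEGA = [[0, 1, 0, 0], [-1, 0, 0, 0], [0, 0, 0, 1], [0, 0, -1, 0]]  # omega(e_i, e_j)
# Right-hand side of (28).
N28 = [[0, 0, 0, 0], [c - b, 0, c - b, 0], [0, 0, 0, 0], [c - b, 0, c - b, 0]]


def mul(A, B):
    """Product of two 4x4 matrices given as nested lists."""
    return [[sum(A[i][k] * B[k][j] for k in IDX) for j in IDX] for i in IDX]


def neg(A):
    return [[-v for v in row] for row in A]


def is_symmetric_connection(C):
    """Condition (1): nabla_{e_i} e_j = nabla_{e_j} e_i."""
    return all(C[i][k][j] == C[j][k][i] for i in IDX for j in IDX for k in IDX)


def is_compatible(C):
    """Condition (3), in the form: OMEGA * nabla_{e_i} is a symmetric matrix."""
    return all(mul(OMEGA, M)[i][j] == mul(OMEGA, M)[j][i]
               for M in C for i in IDX for j in IDX)


checks = {
    "(1)": is_symmetric_connection(NABLA) and is_symmetric_connection(NABLA_P),
    "(3)": is_compatible(NABLA) and is_compatible(NABLA_P),
    "(4)": all(mul(C[i], C[j]) == ZERO
               for C in (NABLA, NABLA_P) for i in IDX for j in IDX),
    "(5)": all(mul(NABLA[i], NABLA_P[j]) == mul(NABLA[j], NABLA_P[i])
               for i in IDX for j in IDX),
    "(7)": all(mul(NABLA[i], NABLA_P[j]) == neg(mul(NABLA_P[j], NABLA[i]))
               for i in IDX for j in IDX),
    "(28)": all(mul(NABLA[j], NABLA_P[k]) == (N28 if j in (0, 2) and k in (0, 2) else ZERO)
                for j in IDX for k in IDX),
}
print(checks)
```

By (23) and (28), the products $\nabla_{e_j} \nabla'_{e_k}$ all vanish exactly when $b = c$, so changing the sample values to satisfy $b = c$ gives the flat case.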


The Lie group $N_{\nabla,\nabla'}$ is diffeomorphic to $\mathbb{R}^8$, hence there exists a global system of coordinates $x_1, \dots, x_4, x'_1, \dots, x'_4$ such that the left-invariant 1-forms dual to the basis $\{e_1, \dots, e_4, f_1, \dots, f_4\}$ are given as follows:

\[
\begin{aligned}
e^1 ={}& \big(1 - a(x'_1 + x'_3)\big)\, dx_1 - a(x'_1 + x'_3)\, dx_3, \\
e^2 ={}& (-bx'_1 + ax'_2 - cx'_3 - ax'_4)\, dx_1 + \big(1 + a(x'_1 + x'_3)\big)\, dx_2 \\
& + \big(-cx'_1 + ax'_2 - (-b + 2c)x'_3 - ax'_4\big)\, dx_3 - a(x'_1 + x'_3)\, dx_4, \\
e^3 ={}& a(x'_1 + x'_3)\, dx_1 + \big(1 + a(x'_1 + x'_3)\big)\, dx_3, \\
e^4 ={}& \big(-cx'_1 + ax'_2 - (-b + 2c)x'_3 - ax'_4\big)\, dx_1 + a(x'_1 + x'_3)\, dx_2 \\
& + \big(-(-b + 2c)x'_1 + ax'_2 - (-2b + 3c)x'_3 - ax'_4\big)\, dx_3 + \big(1 - a(x'_1 + x'_3)\big)\, dx_4, \\
f^1 ={}& (x'_1 + x'_3)\, dx_1 + (x'_1 + x'_3)\, dx_3 + dx'_1, \\
f^2 ={}& \big(-\tfrac{1}{2}(-b + c)(x'_1 + x'_3)^2 - x'_2 + x'_4\big)\, dx_1 - (x'_1 + x'_3)\, dx_2 \\
& + \big(-\tfrac{1}{2}(-b + c)(x'_1 + x'_3)^2 - x'_2 + x'_4\big)\, dx_3 + (x'_1 + x'_3)\, dx_4 + dx'_2, \\
f^3 ={}& -(x'_1 + x'_3)\, dx_1 - (x'_1 + x'_3)\, dx_3 + dx'_3, \\
f^4 ={}& \big(-\tfrac{1}{2}(-b + c)(x'_1 + x'_3)^2 - x'_2 + x'_4\big)\, dx_1 - (x'_1 + x'_3)\, dx_2 \\
& + \big(-\tfrac{1}{2}(-b + c)(x'_1 + x'_3)^2 - x'_2 + x'_4\big)\, dx_3 + (x'_1 + x'_3)\, dx_4 + dx'_4.
\end{aligned}
\]

The Kähler form $\omega_1$ on $\mathfrak{n}_{\nabla,\nabla'}$ is $\omega_1 = e^1 \wedge e^2 + e^3 \wedge e^4 + f^1 \wedge f^2 + f^3 \wedge f^4$ and the hypersymplectic metric $g$ is given by

\[ g = -e^1 \otimes f^2 + e^2 \otimes f^1 - e^3 \otimes f^4 + e^4 \otimes f^3 + f^1 \otimes e^2 - f^2 \otimes e^1 + f^3 \otimes e^4 - f^4 \otimes e^3. \]

In the particular case $a = 0$, $b = 1$, $c = 0$, we obtain that

\[
\begin{aligned}
g ={}& \Big(-\tfrac{3}{2}(x'_1 + x'_3)^2 + x'_2 - x'_4\Big)(dx_1 + dx_3)^2 + 2(x'_1 + x'_3)(dx_1 + dx_3)(dx_2 - dx_4) \\
& - x'_1\, dx_1\, dx'_1 - dx_1\, dx'_2 + dx_2\, dx'_1 + x'_3\,(dx_1\, dx'_3 + dx_3\, dx'_1) \\
& + (x'_1 + 2x'_3)\, dx_3\, dx'_3 - dx_3\, dx'_4 + dx_4\, dx'_3
\end{aligned}
\]

is a complete non-flat Kähler Ricci-flat neutral metric on $\mathbb{R}^8$.

6. Final Comments and Questions

We note that the complex structure $J$ defined in $\mathfrak{n}_{\nabla,\nabla'}$ satisfies the condition $[Jx, Jy] = [x, y]$ for all $x, y \in \mathfrak{n}_{\nabla,\nabla'}$, which implies the integrability of $J$. An almost complex structure $J$ on a Lie algebra $\mathfrak{g}$ satisfying $[Jx, Jy] = [x, y]$ for all $x, y \in \mathfrak{g}$ is called abelian. We also observe that $E$ satisfies a similar condition, $[Ex, Ey] = -[x, y]$ for all $x, y \in \mathfrak{n}_{\nabla,\nabla'}$, which implies the integrability of $E$.
An almost product structure $E$ on a Lie algebra $\mathfrak{g}$ satisfying $[Ex, Ey] = -[x, y]$ for all $x, y \in \mathfrak{g}$ will be called abelian. Related to these notions we have the following characterization:


Proposition 6.1. Let {J, E} be a complex product structure on the Lie algebra g and let (g, g+ , g− ) be the associated double Lie algebra, i.e., g+ and g− are the Lie subalgebras of g such that E|g+ = 1, E|g− = −1. Then the following assertions are equivalent: (i) J is an abelian complex structure. (ii) The Lie subalgebras g+ and g− are abelian. (iii) If A+ and A− denote the annihilators of g− and g+ , respectively, in g∗ , then 2 ∗ dA+ ⊂ A+ ⊗ A− , dA− ⊂ A+ ⊗ A− , where d : g∗ −→ g is given by (df )(x ∧ y) = −f ([x, y]). (iv) E is an abelian product structure. Proof. (i) ⇐⇒ (ii) Assume first that J is abelian. If x, y ∈ g+ then [x, y] ∈ g+ and [J x, Jy] ∈ g− since g+ and g− are subalgebras. But then [x, y] = [J x, J y] ∈ g+ ∩ g− = {0}, and thus [x, y] = [J x, J y] = 0 for all x, y ∈ g+ . Thus, g+ and g− are abelian. Conversely, suppose that g+ and g− are abelian. For u, v ∈ g+ or u, v ∈ g− , from the integrability of J we obtain [J u, v] + [u, J v] = J [u, v] − J [J u, J v] = 0. If x = x1 + x2 , y = y1 + y2 with x1 , y1 ∈ g+ , x2 , y2 ∈ g− , then J ([J x, Jy] − [x, y]) = [J x1 + J x2 , y1 + y2 ] + [x1 + x2 , J y1 + J y2 ] = [J x1 , y1 ] + [J x2 , y2 ] + [x1 , J y1 ] + [x2 , J y2 ] = ([J x1 , y1 ] + [x1 , J y1 ]) + ([J x2 , y2 ] + [x2 , J y2 ]) = 0, and thus J is abelian. (ii) ⇐⇒ (iii) Suppose first that g+ and g− are abelian. Take f ∈ A+ , which is the 2 A+ ⊕ A+ ⊗ A− (see [3]), so we only have annihilator of g− . It is known that df ∈ 2 A+ is zero. For x, y ∈ g+ , we have to see that the component of (df ) in (df )(x ∧ y) = −f ([x, y]) = 0, showing that df ∈ A+ ⊗ A− . The corresponding assertion for f ∈ A− follows in a similar manner. Conversely, if (iii) is valid, take f = f1 + f2 ∈ g∗ with f1 ∈ A+ , f2 ∈ A− , and x, y ∈ g+ or U, V ∈ g− . Then f ([x, y]) = −(df )(x ∧ y) = −(df1 )(x ∧ y) − (df2 )(x ∧ y) = 0, since df1 , df2 ∈ A+ ⊗ A− . Then [x, y] = 0 and both g+ and g− are abelian. (ii) ⇐⇒ (iv) This follows by a straightforward computation. 
If one of the conditions in the proposition above holds, we will say that the complex product structure {J, E} is abelian. We will say that a hypersymplectic structure is abelian when the underlying complex product structure is abelian. Using the previous proposition and results in [1] one can show that any Lie algebra carrying an abelian hypersymplectic structure is of the form given in Theorem 2.1, that is, a double product of two abelian Lie algebras endowed with compatible affine structures and symplectic forms. Furthermore, we showed in previous sections that in this case the Lie algebra is nilpotent and also the neutral metric is always complete and not necessarily flat. It would be of interest to know geometric properties of neutral


metrics compatible with {J, E}, J and E abelian, without imposing the condition on the associated forms being closed. We also believe it is of interest to proceed as we did in this paper, carrying out the construction of double Lie groups from affine-symplectic data on another class of Lie groups (not necessarily abelian) and then understand the properties of the resulting hypersymplectic manifold.

References
1. Andrada, A.: Hypersymplectic Lie algebras. To appear in J. Geom. Phys.
2. Andrada, A.: Estructuras producto complejas y métricas hipersimplécticas asociadas. PhD thesis, FaMAF, Universidad Nacional de Córdoba, December 2003
3. Andrada, A., Salamon, S.: Complex product structures on Lie algebras. Forum Math. 17, 261–295 (2005)
4. Barberis, M. L., Dotti, I.: Abelian complex structures on solvable Lie algebras. J. Lie Theory 14(1), 25–34 (2004)
5. Barret, J., Gibbons, G. W., Perry, M. J., Pope, C. N., Ruback, P.: Kleinian geometry and the N = 2 superstring. Int. J. Mod. Phys. A9, 1457–1494 (1994)
6. Fino, A., Pedersen, H., Poon, Y. S., Sørensen, M.: Neutral Calabi-Yau structures on Kodaira manifolds. Commun. Math. Phys. 248, 255–268 (2004)
7. Guediri, M.: Sur la complétude des pseudo-métriques invariantes à gauche sur les groupes de Lie nilpotentes. Rend. Sem. Mat. Univ. Pol. Torino 52, 371–376 (1994)
8. Hitchin, N.: Hypersymplectic quotients. Atti Accad. Sci. Torino Cl. Sci. Fis. Mat. Natur. 124 Suppl., 169–180 (1990)
9. Hull, C. M.: Actions for (2, 1) sigma-models and strings. Nucl. Phys. B 509(2), 252–272 (1998)
10. Kamada, H.: Self-dual Kähler metrics of neutral signature on complex surfaces. Tohoku Mathematical Publications, Number 24 (2002)
11. Kaneyuki, S.: Homogeneous symplectic manifolds and dipolarizations in Lie algebras. Tokyo J. Math. 15, 313–325 (1992)
12. Lu, J.-H., Weinstein, A.: Poisson Lie groups, dressing transformations and Bruhat decompositions. J. Diff. Geom. 31, 501–526 (1990)
13. Malcev, A. I.: On a class of homogeneous spaces. Reprinted in Amer. Math. Soc. Translations, Series 1, 9, 276–307 (1962)
14. Ooguri, H., Vafa, C.: Geometry of N = 2 strings. Nucl. Physics B 361, 469–518 (1991)
15. Segal, D.: The structure of complete left-symmetric algebras. Math. Ann. 293, 569–578 (1992)

Communicated by G.W. Gibbons

Commun. Math. Phys. 262, 17–32 (2006) Digital Object Identifier (DOI) 10.1007/s00220-005-1473-8

Communications in

Mathematical Physics

Semi-Focusing Billiards: Hyperbolicity

Leonid A. Bunimovich¹, Gianluigi Del Magno²

¹ Southeast Applied Analysis Center, School of Mathematics, Georgia Institute of Technology, Atlanta, GA 30332, U.S.A. E-mail: [email protected]
² Centro di Ricerca Matematica “Ennio De Giorgi”, Scuola Normale Superiore, Piazza dei Cavalieri 3, 56100 Pisa, Italy. E-mail: [email protected]

Received: 9 August 2004 / Accepted: 16 June 2005
Published online: 24 November 2005 – © Springer-Verlag 2005

Abstract: In this paper we answer affirmatively the question concerning the existence of hyperbolic billiards in convex domains of R^3. We also prove that a related class of semi-focusing billiards has mixed dynamics, i.e., their phase space is a union of two invariant sets of positive measure such that the dynamics is integrable on one set and hyperbolic on the other. These billiards are the first rigorous examples of billiards in domains of R^3 with divided phase space.

1. Introduction

It is well known that the dynamics of a gas of elastically colliding particles (the hard balls or Boltzmann gas) can be reduced to a billiard in a domain with boundary formed by a union of (not necessarily disjoint) cylinders [S2]. The corresponding billiards are called semi-dispersing. An elegant mechanical model of nuclei has been recently introduced [P1, P2], where N point particles interact via an attracting potential which keeps the distances between any two particles less than a constant L (“diameter of a nucleus”). The particles move freely by inertia until the distance between a pair of particles equals L. At this moment, the two particles “collide” elastically. The potential of interaction is therefore a hard core potential as in the hard spheres gas, but the particles are located inside rather than outside the “core”. This model can be reduced to a billiard inside a domain where some smooth components of the boundary are pieces of convex outward cylinders, as the interaction potential is attractive. In the hard sphere gas, instead, the repulsive potential generates boundary components which are convex inward cylinders. Because of this duality such billiards can be naturally called semi-focusing billiards.

The first author was partially supported by the NSF grant #0140165 and the Humboldt Foundation. The second author was partially supported by the FCT (Portugal) through the Program POCTI/FEDER.


It is well known that semi-dispersing cylindrical billiards in R^3 are non-uniformly hyperbolic if the cylinders are orthogonal and their bases span R^3 [Sz] (for more general results on the hyperbolic and ergodic properties of semi-dispersing cylindrical billiards, see the review [Si]). In this paper, we prove an analogous result (Theorem 1) for semi-focusing billiards in R^3. Nowhere dispersing ergodic billiards in R^n with n ≥ 3 were constructed in [B-R1, B-R2, B-R3], but the corresponding billiard domains were nonconvex. Numerical studies suggested [P1, P2, P3] that convex hyperbolic billiards exist. Here we prove hyperbolicity for a class of three-dimensional billiard domains containing the one studied in [P3]. The ergodicity will be addressed in a future paper. These billiards, as well as the ones in [B-R1, B-R2, B-R3], can be viewed as higher-dimensional generalizations of two-dimensional stadia. More precisely, we consider billiards in domains with boundaries formed by flat faces and pieces of cylinders whose sections are absolutely focusing curves [B3, D]. Pieces of spheres instead of cylinders were used in [B-R1, B-R2, B-R3]. We also present the first rigorous examples (Theorem 2) of three-dimensional billiards whose phase space is a union of two sets of positive measure such that the dynamics is integrable on one set and hyperbolic on the other (divided phase space).

2. Generalities

2.1. Billiards in R^n. Let Ω be an open and connected subset of R^n, n ≥ 3, such that ∂Ω consists of a finite number of hypersurfaces of class C^3 intersecting at most at their boundaries. Let T_1Ω be the unit tangent bundle of Ω, which can be identified with Ω × S^{n−1}, where S^{n−1} is the unit sphere in R^n. We will denote by {φ_t}_{t∈R} the billiard flow inside Ω acting on the space obtained from Ω × S^{n−1} by identifying the elements of ∂Ω × S^{n−1} according to the standard law of reflection: the angle of incidence equals the angle of reflection.
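The law of reflection used to glue the boundary points can be written as v′ = v − 2⟨v, n⟩n for a unit normal n. The following Python sketch illustrates this standard formula; the function name `reflect` is an illustrative assumption and does not come from the paper.

```python
import numpy as np

def reflect(v, n):
    """Specular reflection of the velocity v at a boundary point.

    n is the unit normal of the boundary at that point; the tangential
    component of v is kept and the normal component changes sign, so the
    angle of incidence equals the angle of reflection.
    """
    v = np.asarray(v, dtype=float)
    n = np.asarray(n, dtype=float)
    return v - 2.0 * np.dot(v, n) * n

# A unit velocity hitting the plane with normal e2.
v = np.array([1.0, -2.0, 0.5])
v /= np.linalg.norm(v)
n = np.array([0.0, 1.0, 0.0])
print(reflect(v, n))
```

The reflected vector has the same length as v, and applying `reflect` twice with the same normal returns the original velocity.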
For a precise definition of the billiard flow and the billiard map, see [C-F-S]. Denote by ℳ the collection of all unit vectors in R^n attached to the boundary of Ω and pointing inward. The billiard map T : ℳ → ℳ is the first return map induced by the billiard flow on ℳ. If t : ℳ → R+ is the first return time to ℳ, then the billiard map is given by Tp = φ_{t(p)} p for any p = (q, v) ∈ ℳ. This map preserves the probability measure dµ = c⟨v, n(q)⟩ dq dω, where dq is the Lebesgue measure on ∂Ω, dω is the Lebesgue measure on S^{n−1}, n(q) is the unit normal of ∂Ω at q pointing inside Ω, and c is a normalizing constant.

One of the characteristic features of general billiard maps is that they are not defined and smooth everywhere on ℳ. Let S^+ be the subset of ℳ where T is not defined or fails to be C^1. The set S^− is defined similarly with T replaced by T^{−1}. S^+ (S^−) is called the singular set of T (T^{−1}). For any k > 0, the sets S_k^+ = ∪_{i=0}^{k−1} T^{−i} S^+ and S_k^− = ∪_{i=0}^{k−1} T^{i} S^− are the singular sets of T^k and T^{−k}. Finally, let S_∞^+ = ∪_{k≥0} S_k^+ and S_∞^− = ∪_{k≥0} S_k^−. All these sets have zero measure because µ(S^+) = µ(S^−) = 0 [C-F-S].

2.2. The differential of T. We compute the differential of T with respect to an appropriate system of coordinates of the tangent spaces of ℳ. We follow [W4, W3]. Let π : ℳ → ∂Ω be the canonical projection given by π(q, v) = q for any (q, v) ∈ ℳ. For any p ∈ ℳ, denote by L_p and V_p, respectively, the tangent plane of ∂Ω at π(p) and the plane orthogonal to the vector p. The tangent space T_pℳ can be naturally identified with L_p × V_p. Let P : V_p → L_p be the projection along p, and let I be the


identity operator on V_p. The operator P × I : V_p × V_p → L_p × V_p identifies L_p × V_p with V_p × V_p. Furthermore we identify V_p and V_{Tp} by transporting V_p parallel to itself up to π(Tp) and then by using the transformation U_p : V_p → V_{Tp} which reflects R^n about the tangent plane L_{Tp}. After these identifications D_pT becomes a linear operator on V_p × V_p. In fact, D_pT is the composition of two linear maps on V_p × V_p: the first describes the free motion of the point particle from π(p) up to the point of reflection π(Tp), and the second describes the reflection of the point particle at π(Tp). Both maps can be represented as 2 × 2 matrices of linear operators on V_p. The first map has the following block form:

\[ \begin{pmatrix} I & lI \\ 0 & I \end{pmatrix}, \tag{1} \]

where l is the distance between two consecutive reflections, and I is the identity operator on V_p. Let Tp = (q′, v′). The second map has the block form

\[ \begin{pmatrix} I & 0 \\ R & I \end{pmatrix}, \tag{2} \]

where R = 2⟨v′, n(q′)⟩ P_1^* K P_1 is a self-adjoint operator on V_p, P_1 : V_p → L_{Tp} is the projection onto L_{Tp} along p, and K is the second fundamental form of ∂Ω evaluated at q′. Note that P_1^* : L_{Tp} → V_p is the projection onto V_p along n(q′). Therefore, as a linear operator on V_p × V_p, the map D_pT has the block form

\[ \begin{pmatrix} I & lI \\ R & I + lR \end{pmatrix}. \tag{3} \]

Let ⟨·, ·⟩ be the Euclidean scalar product on V_p. For any u = (ξ, η) ∈ V_p × V_p and v = (ξ′, η′) ∈ V_p × V_p, let ω(u, v) = ⟨ξ, η′⟩ − ⟨ξ′, η⟩ be the standard symplectic form on V_p × V_p. Note that the maps (1)-(3) are symplectic with respect to ω.

3. Semi-Focusing Billiards

From now on, we will only consider billiards in domains of R^3. We start this section by introducing a special class of curves called absolutely focusing that were used to construct hyperbolic planar billiards [B3, D]. Next we construct a family of cylindrical surfaces having absolutely focusing curves as sections. These surfaces together with flat faces (connected and bounded subsets of planes) form the boundary of our billiards.

3.1. Absolutely focusing curves.
Consider a C^3 planar, strictly convex, simple and compact curve γ of R^2. Let M_γ = {(q, v) ∈ γ × R^2 : ‖v‖ = 1 and ⟨v, n(q)⟩ ≥ 0}, where n(q) is the normal vector of γ at q pointing inside the convex hull of γ. The set M_γ is the billiard phase space over γ. Fix an orientation on γ, and call O the first endpoint of γ with respect to this orientation. For any z = (q, v) ∈ M_γ, let s = s(z) be the length of the subarc of γ with endpoints O and q, and let θ = θ(z) ∈ [0, π] be the angle between v and the oriented tangent of γ at q. The pair (s, θ) forms a system of coordinates for M_γ. If γ is parametrized by s, then r(s) > 0 and κ(s) > 0 denote the radius of curvature and the curvature of γ at s, respectively. We assume that ∫_0^{|γ|} κ(s) ds ≤ π, where |γ| is


the length of γ. The last condition, together with the fact that the third derivative of γ is bounded (see [H]), implies that no trajectory can have infinitely many consecutive collisions with γ with the exception, perhaps, of the periodic trajectory connecting the endpoints of γ. An incoming ray to γ is said to be focused by γ if the infinitesimal family of rays parallel to the incoming ray focuses in linear approximation after leaving γ, i.e., after a complete series of consecutive collisions with γ.

Definition 1. A curve γ as above is called absolutely focusing if all the incoming rays are focused by γ.

Absolute focusing plays a crucial role in constructing hyperbolic billiards with at least one focusing component of the boundary. To ensure that such billiards are hyperbolic, one needs to avoid having arbitrarily long focusing times for narrow beams of rays with a series of reflections along the boundary. Indeed, the focusing produces convergence (rather than the divergence that is necessary for hyperbolicity) of nearby orbits. It is the mechanism of defocusing that produces hyperbolicity of billiards with focusing components. Defocusing means that an initially convergent beam of rays has enough time before the next collision with the boundary to become divergent. To make this happen, one needs to ensure first that parallel beams of rays are never formed after reflections from the focusing boundary. Clearly, it is in the neighborhood of such “parabolic” orbits that focusing times are not bounded and defocusing does not occur. If such parabolic orbits do not exist, then the focusing times are bounded and one only needs the free passes between reflections from any focusing component and any other component of the boundary to be sufficiently long to ensure defocusing. These two conditions, the absence of parallel beams after a series of reflections along focusing components and sufficiently long free passes after leaving focusing components, produce hyperbolicity.

3.2. Cone fields for absolutely focusing curves. We refer the reader to the papers [W1, W2, D] for an introduction to invariant cone fields and their application to billiards. Consider an absolutely focusing curve γ. Let z = (q, v) ∈ M_γ. A vector u ∈ T_zM_γ corresponds to a smooth variation of z, i.e., a smooth family of directions containing z. We say that u focuses if the projection of the corresponding variation onto v^⊥ vanishes in linear approximation at some point q̄ ∈ R^2. The distance between q and q̄, taken with a positive or negative sign depending on whether or not q̄ follows q along its trajectory, is called the focusing time of u and is denoted by τ^+(z, u). If (u_s, u_θ) are the components of u ∈ T_zM_γ, z = (s, θ) ∈ M_γ, with respect to the coordinates (s, θ), then [W1, D]

\[ \tau^{+}(z, u) = \frac{\sin\theta}{\kappa(s) + u_\theta / u_s}. \]

Any absolutely focusing curve admits an invariant cone field [D]. Such a cone field 𝒞 = {𝒞(z)}_{z∈M_γ} for a curve γ is of the form 𝒞(z) = {u ∈ T_zM_γ : 0 ≤ τ^+(z, u) ≤ τ_𝒞^+(z)} for a suitable positive function τ_𝒞^+ : M_γ → R. We associate to 𝒞 another function τ_𝒞^− : M_γ → R defined through the so-called Mirror Formula

\[ \frac{1}{\tau_{\mathcal{C}}^{+}(z)} + \frac{1}{\tau_{\mathcal{C}}^{-}(z)} = \frac{2\kappa(s)}{\sin\theta}, \qquad z = (s, \theta) \in M_\gamma. \tag{4} \]
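As a numerical illustration of the focusing-time formula and of the Mirror Formula (4), the following Python sketch solves (4) for the second focusing time. The function names and the sample values (a circular arc of radius 2) are assumptions made for the example only.

```python
import math

def focusing_time(theta, kappa, slope):
    """tau^+(z, u) = sin(theta) / (kappa + u_theta / u_s).

    slope stands for the ratio u_theta / u_s of the components of u.
    """
    return math.sin(theta) / (kappa + slope)

def mirror_tau_minus(tau_plus, theta, kappa):
    """Solve 1/tau_plus + 1/tau_minus = 2*kappa/sin(theta) for tau_minus."""
    inverse = 2.0 * kappa / math.sin(theta) - 1.0 / tau_plus
    if abs(inverse) < 1e-15:
        return math.inf  # the reversed beam is parallel
    return 1.0 / inverse

# Sample values (assumed): an arc of a circle of radius r = 2.
r, theta = 2.0, math.pi / 3
kappa = 1.0 / r
tau_plus = r * math.sin(theta)  # half of the chord length 2 r sin(theta)
print(mirror_tau_minus(tau_plus, theta, kappa))
```

With these values the two focusing times coincide: a variation that focuses at the midpoint of the chord is sent by (4) to a variation focusing at the same distance. Letting τ^− tend to infinity in (4) gives τ^+ = sin θ / (2κ(s)), the focusing time of a parallel beam.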


If u ∈ 𝒞(z) is such that τ^+(z, u) = τ_𝒞^+(z), then τ_𝒞^−(z) has the following geometrical meaning. Consider the variation associated to u. We obtain a new variation if we reverse the velocities of the variation associated to u and reflect them off γ. Then τ_𝒞^−(z) is the focusing time of this new variation. It follows that τ_𝒞^+(z) + τ_𝒞^−(Tz) ≤ t(z) for any z ∈ M_γ such that Tz ∈ M_γ. Denote by τ_{γ,𝒞}^+ the supremum of τ_𝒞^+(z) over all z ∈ M_γ leaving γ and by τ_{γ,𝒞}^− the supremum of τ_𝒞^−(z) over all z ∈ M_γ entering γ. Let τ_{γ,𝒞} = max{τ_{γ,𝒞}^+, τ_{γ,𝒞}^−}. In this paper, we will only consider absolutely focusing curves γ with a cone field 𝒞 for which τ_{γ,𝒞} is finite. We will implicitly make this assumption every time that we deal with focusing curves, and we will call such a 𝒞 the cone field of γ. Some examples of such curves are described in the remaining part of this section.

Consider a curve γ as before. For any z = (q, v) ∈ M_γ, let us denote by L(z) the ray emerging from q and parallel to v and by β(z) the length of the segment of L(z) contained in the osculating circle of γ at q. If z = (q, v) and z′ = (q′, v′) are two elements of M_γ corresponding to consecutive collisions of a trajectory with γ, then t(z) is the length of the segment connecting q and q′. Among the examples of absolutely focusing curves considered in this paper, there are the curves γ that verify the relation

(W): β(z) + β(z′) ≤ 2t(z)

for any two consecutive collisions z and z′ with γ [W2]. For C^4 curves, (W) is equivalent to d²r/ds² ≤ 0. Other examples of curves which are absolutely focusing are those that verify the relation

(M): β(z)(t(z) + t(z′)) ≤ 2t(z)t(z′)

for any two consecutive collisions z and z′ with γ. These curves were introduced in [M] and proved to be absolutely focusing in [C-M]. If a C^4 curve γ satisfies d²r^{1/3}/ds² ≥ 0, then any sufficiently small subarc of γ verifies (M) [M].
Examples of curves that satisfy (W) or (M) are arcs of circles, cardioids, logarithmic spirals, the arcs of the ellipse given by x²/a² + y²/b² = 1, |x| ≤ a/√2 for 0 < b < a, and sufficiently small arcs of x²/a² + y²/b² = 1, |x| ≥ b²/√(a² + b²) for 0 < b < a containing one of the points x = ±a, y = 0. An example of an absolutely focusing arc which does not verify (W) and (M) is the half-ellipse x²/a² + y²/b² = 1, x ≥ 0 with a/b < √2 [D, B3].
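The differential condition d²r/ds² ≤ 0 for (W) can be checked explicitly on the ellipse. The following Python sketch assumes the standard parametrization x = a cos t, y = b sin t (not stated in the text), for which r = g^{3/2}/(ab) and ds/dt = g^{1/2} with g(t) = a² sin² t + b² cos² t, so that d²r/ds² = 3(a² − b²) cos 2t / (ab g^{1/2}). The function names are illustrative.

```python
import math

def d2r_ds2_ellipse(a, b, t):
    """d^2 r / ds^2 on the ellipse x = a cos t, y = b sin t.

    Uses r = g**1.5 / (a*b) and ds/dt = sqrt(g) with
    g = a^2 sin^2 t + b^2 cos^2 t, which gives
    d^2 r / ds^2 = 3 (a^2 - b^2) cos(2t) / (a b sqrt(g)).
    """
    g = (a * math.sin(t)) ** 2 + (b * math.cos(t)) ** 2
    return 3.0 * (a * a - b * b) * math.cos(2.0 * t) / (a * b * math.sqrt(g))

def satisfies_w_at(a, b, x, tol=1e-12):
    """True if d^2 r / ds^2 <= 0 at the points of the ellipse with abscissa x."""
    t = math.acos(x / a)
    return d2r_ds2_ellipse(a, b, t) <= tol

a, b = 2.0, 1.0
for x in (0.0, a / math.sqrt(2.0), a):
    print(x, satisfies_w_at(a, b, x))
```

For 0 < b < a the sign of d²r/ds² is the sign of cos 2t, so the inequality holds exactly when |cos t| ≤ 1/√2, that is |x| ≤ a/√2, in agreement with the arcs listed above.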

3.3. A class of semi-focusing billiards. We describe now the class of billiard tables considered in this paper. Let {e1, e2, e3} be the canonical basis of R^3. For any q ∈ R^3, let (q1, q2, q3) be the components of q with respect to {e1, e2, e3}.

Definition 2. Let a_i > 0 for i = 1, 2, 3. A box B with edges parallel to the coordinate axes of R^3 and having lengths 2a1, 2a2, 2a3 is the parallelepiped B = {q ∈ R^3 : |q_i| ≤ a_i, 1 ≤ i ≤ 3}. Let 1 ≤ i ≤ 3. The sets B_i^+ = {q ∈ B : q_i = a_i} and B_i^− = {q ∈ B : q_i = −a_i} are the faces of B perpendicular to e_i. Let B_i^∞ = {q ∈ R^3 : |q_j| ≤ a_j, j ≠ i} be the infinite box obtained by stretching B to ∞ in the direction of e_i.

Definition 3. Let 1 ≤ i ≠ j ≤ 3. An absolutely focusing curve γ lying on span(e_i, e_j) and attached to the face B_i^+ (B_i^−) of a box B is an absolutely focusing curve t ∈ [0, 1] → γ1(t)e_j + γ2(t)e_i such that γ1(0) = −γ1(1) = a_j, γ2(0) = γ2(1) = a_i (−a_i) and γ2(t) ≥ a_i (≤ −a_i) for any 0 ≤ t ≤ 1.


Fig. 1. A semi-focusing billiard table

Definition 4. Let 1 ≤ i ≠ j ≤ 3. A cylinder C attached to the face B_i^+ (B_i^−) of a box B is a set of the form C = B_i^∞ ∩ P^{−1}γ, where γ is an absolutely focusing curve lying on span(e_i, e_j) and attached to B_i^+ (B_i^−), and P : R^3 → span(e_i, e_j) is the orthogonal projection of R^3 onto span(e_i, e_j). The curve γ and the space N = span(e_i, e_j)^⊥ are called section and axis of C, respectively.

Let C1 and C2 be two cylinders attached to opposite faces B_i^+ and B_i^− of a box B such that their axes are orthogonal. Let Ω be the union of B and the convex hulls of C1 and C2. An example of a billiard in a domain Ω is depicted in Fig. 1. Let ∂Ω+ and ∂Ω0 be the union of the two cylinders and the union of the faces of ∂Ω, respectively. Let ℳ+ = π^{−1}(∂Ω+) and ℳ0 = π^{−1}(∂Ω0).

3.4. Spectrum and eigenvectors of R. We compute the eigenvalues and eigenvectors of the operator R when Tp = (q′, v′) is attached to a cylinder C ⊂ ∂Ω. Choose a system of Cartesian coordinates in R^3 such that the origin coincides with the point q′, the xz-plane coincides with the tangent plane L_{Tp}, the z-axis coincides with the axis of C and n(q′) = e2. Let p = (q, v). Using polar coordinates 0 ≤ ρ, 0 ≤ θ1 ≤ π, 0 ≤ θ2 ≤ 2π, we write v′ = (sin θ1 cos θ2, sin θ1 sin θ2, cos θ1) and v = (sin θ1 cos θ2, − sin θ1 sin θ2, cos θ1). The matrix of K(q′) with respect to the basis {e1, e3} is given by

\[ \begin{pmatrix} -\kappa(q') & 0 \\ 0 & 0 \end{pmatrix}, \]

where κ(q′) is the curvature of the section γ of C at q′. The operator R = 2⟨v′, n(q′)⟩ P_1^* K(q′) P_1 is self-adjoint. Let k1 and k2 be its eigenvalues, and let w1, w2 ∈ V_p be the corresponding normalized eigenvectors. One eigenvalue of R is equal to zero so that we may assume that k2 = 0 and w2 = P_1^{−1}e3, where P_1^{−1} is the orthogonal projection of L_{Tp} onto V_p. We now compute w1 and k1. It is easy to check that ⟨w1, e3⟩ = 0. Thus w1 = λ(−a2, a1, 0) ∈ R^3 for some λ ≠ 0, where a_i = ⟨p, e_i⟩, i = 1, 2. A straightforward computation gives

\[ R w_1 = -2\langle v', n(q')\rangle\, \kappa(q')\, \frac{a_1^2 + a_2^2}{a_2^2}\, w_1 . \]

Since ⟨v′, n(q′)⟩ = sin θ1 sin θ2, a1² + a2² = sin² θ1 and a2² = sin² θ1 sin² θ2, it follows that

\[ k_1 = -2\kappa(q')\, \frac{\sin\theta_1}{\sin\theta_2}. \]
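The value of k1 can be confirmed numerically by assembling R from its definition R = 2⟨v′, n(q′)⟩ P_1^* K P_1 in the coordinates of this section. The following Python sketch is only an illustration: it assumes that the outgoing velocity is v′ = (sin θ1 cos θ2, sin θ1 sin θ2, cos θ1) with sin θ2 > 0, represents all operators as 3 × 3 matrices acting on R^3, and uses hypothetical function names.

```python
import numpy as np

def reflection_operator(kappa, theta1, theta2):
    """Return R as a 3x3 matrix together with the incoming velocity v.

    Coordinates as in Sect. 3.4: n = e2, tangent plane y = 0, cylinder axis
    along e3, curvature of the section equal to kappa.
    """
    n = np.array([0.0, 1.0, 0.0])
    v_out = np.array([np.sin(theta1) * np.cos(theta2),
                      np.sin(theta1) * np.sin(theta2),
                      np.cos(theta1)])
    v_in = v_out - 2.0 * (v_out @ n) * n  # velocity before the reflection
    # P1: projection onto the tangent plane along the incoming velocity.
    P1 = np.eye(3) - np.outer(v_in, n) / (v_in @ n)
    # K in the basis {e1, e3}, extended by zero in the normal direction.
    K = np.diag([-kappa, 0.0, 0.0])
    # The adjoint P1* (projection onto V_p along n) is the transpose of P1.
    R = 2.0 * (v_out @ n) * P1.T @ K @ P1
    return R, v_in

kappa, theta1, theta2 = 0.7, 1.1, 0.6  # sample values (assumed)
R, v_in = reflection_operator(kappa, theta1, theta2)
k1_formula = -2.0 * kappa * np.sin(theta1) / np.sin(theta2)
print(np.trace(R), k1_formula)
```

Since R annihilates both v and w2, its trace equals the remaining eigenvalue k1, which is why the two printed numbers can be compared directly.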

3.5. Condition on the distance between cylinders. Consider a domain Ω. In this section, we formulate a condition on the distance between the cylinders of ∂Ω which guarantees the hyperbolicity of the billiard in Ω. We start with some definitions. The notation here is as in the previous sections. Let C be a cylinder of ∂Ω with section γ. We define

\[ d(Tp) = \frac{\sin\theta_2}{\kappa(q')\,\sin\theta_1}, \tag{5} \]

where, we recall, Tp = (q′, v′) and κ(q′) is the curvature of γ at q′. Note that the non-zero eigenvalue k1 of the operator R is equal to −2/d(Tp) and that d(Tp) is the length of the segment of the trajectory of Tp contained in the “half-osculating” cylinder of C at q′, i.e., the cylinder tangent to C at q′ with circular section of radius (2κ(q′))^{−1}. Let 𝒞 be the cone field of γ. If z = (q′, θ2), then we define

\[ d^{\pm}(Tp) = \frac{\tau_{\mathcal{C}}^{\pm}(z)}{\sin\theta_1}. \tag{6} \]

It follows immediately from the Mirror Formula (4) that

\[ d^{-}(Tp) + d^{+}(Tp) = 2\,\frac{d^{-}(Tp)\, d^{+}(Tp)}{d(Tp)}. \tag{7} \]
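Relation (7) is the Mirror Formula (4) divided by sin θ1. The following Python sketch checks it for sample values; the function names and the numbers are assumptions chosen for illustration.

```python
import math

def d_half_osculating(kappa, theta1, theta2):
    """d(Tp) = sin(theta2) / (kappa * sin(theta1)), as in (5)."""
    return math.sin(theta2) / (kappa * math.sin(theta1))

def d_plus_minus(tau_plus, kappa, theta1, theta2):
    """Return (d^+, d^-) of (6), with tau^- obtained from the Mirror Formula (4)."""
    tau_minus = 1.0 / (2.0 * kappa / math.sin(theta2) - 1.0 / tau_plus)
    return tau_plus / math.sin(theta1), tau_minus / math.sin(theta1)

kappa, theta1, theta2, tau_plus = 0.5, 1.0, 0.8, 1.0  # sample values (assumed)
d = d_half_osculating(kappa, theta1, theta2)
d_plus, d_minus = d_plus_minus(tau_plus, kappa, theta1, theta2)
# Left-hand and right-hand sides of (7).
print(d_minus + d_plus, 2.0 * d_minus * d_plus / d)
```

Equivalently, (7) says that d(Tp) is the harmonic mean of d^+(Tp) and d^−(Tp).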

Let C1 and C2 be the cylinders of ∂Ω. If their sections are γ1 and γ2, and 𝒞1 and 𝒞2 are the cone fields of γ1 and γ2, then we impose the following condition on Ω:

\[ \tau_{\gamma_1,\mathcal{C}_1} + \tau_{\gamma_2,\mathcal{C}_2} < \operatorname{dist}(C_1, C_2). \tag{8} \]

Remark 1. When the sections of the cylinders are circles, this condition can be replaced by the condition that the distance between the cylinders is larger than the sum of the radii of the circles.

Let [p1, p2] denote the finite trajectory starting and ending at cylinders of ∂Ω, where p1, p2 ∈ ℳ+ are its initial and final velocity vectors. If moreover l(p1, p2) is the length of [p1, p2], then it is not difficult to see that Condition (8) implies

\[ d^{+}(p_1) + d^{-}(p_2) < l(p_1, p_2) \tag{9} \]

for any trajectory [p1, p2] for which p1 and p2 belong to distinct cylinders. This property is essential in producing the hyperbolic behavior of billiards in domains Ω.


4. Hyperbolicity of Semi-Focusing Billiards

In this section, we prove that the billiard map T in a domain Ω satisfying (8) is hyperbolic, i.e., T has non-zero Lyapunov exponents µ-a.e. on ℳ. This can be done by constructing an eventually strictly invariant cone field for T [W1, L-W].

4.1. Cone field. We define a cone field on the set ℳ+, and then we extend it to the set ℳ = ℳ+ ∪ ℳ0 by transporting the cones up to the flat faces of ∂Ω via DT. According to [L-W], to define a cone field on ℳ+, we need to specify a pair of transversal Lagrangian subspaces of V_p × V_p for all p ∈ ℳ+. Let

W1(p) = span((w1(p), 0), (w2(p), 0)),
W2(p) = span((w1(p), −w1(p)/d^+(p)), (w2(p), −w2(p)/d^+(p))),

where w1(p) and w2(p) are the eigenvectors of R(p) and 0 denotes the zero vector in V_p. Also d^±(p) are the quantities defined in (6), where τ_𝒞^± refers to the cone field of the section of the cylinder containing π(p). It is immediate to check that W1 and W2 are transversal and Lagrangian. We now define a quadratic form Q on ℳ+ associated to W1 and W2. Since V_p × V_p = W1(p) ⊕ W2(p), for every u = (ξ, η) ∈ V_p × V_p, we can write u = u1 + u2, where u1 = (ξ + d^+(p)η, 0) ∈ W1, and u2 = (−d^+(p)η, η) ∈ W2. The quadratic form Q at p ∈ ℳ+ is given by

Q_p(u) := ω(u1, u2) = ⟨ξ, η⟩ + d^+(p)‖η‖²

for u = (ξ, η) ∈ V_p × V_p. The cone field 𝒞 on ℳ+ is defined as follows. For any p ∈ ℳ+, let 𝒞(p) = {u ∈ V_p × V_p : Q_p(u) ≥ 0}. Note that 𝒞 is piecewise continuous on ℳ+, because the function d^+ is piecewise continuous. To finish the construction of 𝒞, we extend it to the whole phase space ℳ by transporting the cones from ℳ+ to ℳ0 via D_pT.

We recall some definitions from [L-W]. Let int 𝒞(p) = {u ∈ V_p × V_p : Q_p(u) > 0}. The map D_pT is monotone (strictly monotone) if D_pT 𝒞(p) ⊂ 𝒞(Tp) (int 𝒞(Tp) ∪ {0}) or, equivalently, if Q_{Tp}(D_pT u) ≥ (>) Q_p(u) for every 0 ≠ u ∈ V_p × V_p.

4.2. Change of coordinates in V_p × V_p. To simplify the computations, we introduce a new system of coordinates in the spaces V_p × V_p, p ∈ ℳ+. For any p ∈ ℳ+, let us consider the set (ξ′, η′) of coordinates of V_p × V_p defined by ξ′ = ξ + d^+(p)η, η′ = η. In coordinates (ξ′, η′), the map D_pT takes the form

\[ \begin{pmatrix} I & d^{+}(Tp) I \\ 0 & I \end{pmatrix} \begin{pmatrix} I & 0 \\ R & I \end{pmatrix} \begin{pmatrix} I & (l(p, Tp) - d^{+}(p)) I \\ 0 & I \end{pmatrix}, \tag{10} \]


where l(p, Tp) is the distance between π(p) and π(Tp), and the quadratic form Q takes the expression Q_p(u) = ⟨ξ′, η′⟩ for u = (ξ′, η′) ∈ V_p × V_p. The next lemma is a direct consequence of the definition of (ξ′, η′).

Lemma 1. Let (ξ′, η′) ∈ V_p × V_p, and (ξ″, η″) = D_pT(ξ′, η′). The operator D_pT is monotone (strictly monotone) if and only if ⟨ξ″, η″⟩ ≥ (>) ⟨ξ′, η′⟩.

In the rest of the paper, we will use coordinates (ξ′, η′) unless otherwise specified.

4.3. Hyperbolicity. The cone field of two-dimensional stadium-like billiards is strictly invariant every time that the point particle leaves one boundary curve and bounces off another one. This property produces the hyperbolicity of the billiard. As we show in this section, the situation is more complicated for cylindrical billiards: two consecutive reflections at different cylinders are required in order to obtain strict invariance of the cone field.

Denote by T1 the first return map on ℳ+ induced by T. Let 𝒮^+ be the singular set of T1, i.e., the set of elements of ℳ+ where T1 is not defined or fails to be C^1. For every k > 0, let 𝒮_k^+ = ∪_{i=0}^{k−1} T_1^{−i} 𝒮^+. Since 𝒮^+ ⊂ S_∞^+, we have µ(𝒮_k^+) = 0 for any k > 0.

Theorem 1. The map T is hyperbolic.

Proof. We show that the cone field 𝒞 is eventually strictly invariant. Note that if p ∈ ℳ and Tp ∈ ℳ0, then D_pT 𝒞(p) = 𝒞(Tp) by construction of 𝒞 so that, in this case, the invariance of 𝒞 is automatically satisfied. Furthermore the subset of ℳ consisting of elements whose trajectory never hits one of the two cylinders coincides with {(q, v) : q ∈ ∂Ω0 and v ∈ e_i^⊥} (recall that B_i^− and B_i^+ are the faces where the cylinders are attached), and therefore it has zero µ-measure. Thus in order to show that 𝒞 is eventually strictly invariant, it is enough to check that the following properties are satisfied:

P1. D_{p0}T1 𝒞(p0) ⊆ 𝒞(T1p0) for all p0 ∈ ℳ+ \ 𝒮^+.
P2. Given p−1 ∈ ℳ+ \ 𝒮_2^+, let p0 = T1p−1 and p1 = T1²p−1. For any p−1 ∈ ℳ+ such that p0 and p1 are attached to different cylinders, we have D_{p−1}T1² 𝒞(p−1) ⊂ int 𝒞(p1) ∪ {0}.

These properties can be equivalently reformulated in terms of the quadratic form Q. P1 translates into Q_{T1p0}(D_{p0}T1 u) ≥ Q_{p0}(u) for all u ∈ T_{p0}ℳ+ and p0 ∈ ℳ+ \ 𝒮^+, and P2 translates into Q_{T1²p−1}(D_{p−1}T1² u) > Q_{p−1}(u) for all 0 ≠ u ∈ T_{p−1}ℳ+ and p−1 ∈ ℳ+ \ 𝒮_2^+ such that p0 and p1 are attached to different cylinders.

In order to prove P1 and P2, we need some lemmas concerning the matrix form of DT1. Let p0 ∈ ℳ+ and p1 = T1p0. By definition of T1, there exists a positive integer m = m(p0) such that T^m p0 = p1 and if m > 1, then T^k p0 ∈ ℳ0 for 1 ≤ k ≤ m − 1. Let p_{0,k} = T^k p0 for 0 ≤ k ≤ m. Note that p_{0,0} = p0 and p_{0,m} = p1. The map D_{p0}T1 : T_{p0}ℳ+ → T_{p1}ℳ+ is equal to D_{p0,m−1}T ◦ D_{p0,m−2}T ◦ · · · ◦ D_{p0,0}T, where D_{p0,k}T : T_{p0,k}ℳ → T_{p0,k+1}ℳ for every 0 ≤ k ≤ m − 1. As explained in Sect. 2.2, for


every 0 ≤ k ≤ m − 1, the tangent space T_{p0,k}ℳ can be identified with V_{p0,k} × V_{p0,k} and, by using the maps U_{p0,0}, . . . , U_{p0,k} (U_{p0,k} reflects R^3 about the plane L_{p0,k+1}), we can finally identify V_{p0,k} × V_{p0,k} with V_{p0,0} × V_{p0,0} = V_{p0} × V_{p0}. Doing so, all D_{p0,k}T and D_{p0}T1 become linear maps on V_{p0} × V_{p0}. For every 0 ≤ k ≤ m, let p_{0,k} = (q_k, v_k). For every 0 ≤ k ≤ m − 1, let l_{k+1} be the length of the segment [p_{0,k}, p_{0,k+1}] and R_{k+1} = 2⟨v_{k+1}, n(q_{k+1})⟩ P_1^* K P_1, where P_1 and K are evaluated at p_{0,k+1}. As an operator on V_{p0,k} × V_{p0,k}, the block form matrix of D_{p0,k}T is given by

\[ \begin{pmatrix} I & l_{k+1} I \\ R_{k+1} & I + l_{k+1} R_{k+1} \end{pmatrix}. \]

Let R̃_1 = R_1 and, for 1 ≤ k ≤ m − 1, let R̃_{k+1} = Φ_k^{−1} R_{k+1} Φ_k, where Φ_k = U_{p0,k−1} U_{p0,k−2} . . . U_{p0,0}. As a linear operator on V_{p0,0} × V_{p0,0}, D_{p0,k}T has the following block form:

\[ \begin{pmatrix} I & l_{k+1} I \\ \tilde{R}_{k+1} & I + l_{k+1} \tilde{R}_{k+1} \end{pmatrix}. \]

From now on, we will think of D_{p0}T1 as an operator on V_{p0} × V_{p0}. It is not difficult to see that the block form of D_{p0}T1 is given by

\[ \begin{pmatrix} I & l(p_0, p_1) I \\ \tilde{R}_m & I + l(p_0, p_1) \tilde{R}_m \end{pmatrix}, \tag{11} \]

where l(p0, p1) = l_1 + · · · + l_m is the length of the trajectory from p0 to p1. In the computation of (11), we used the fact that R_{k+1} is the zero-matrix as p_{0,k+1} ∈ ℳ0 for 0 ≤ k ≤ m − 2.

Lemma 2.
1. D_{p0}T1 admits the factorization

\[ \begin{pmatrix} J & 0 \\ 0 & J^{-1} \end{pmatrix} \begin{pmatrix} I & 0 \\ E & I \end{pmatrix} \begin{pmatrix} I & F \\ 0 & I \end{pmatrix}, \]

where J, E, F are self-adjoint operators on V_{p0}.
2. With respect to the basis {w1(p0), w2(p0)} of V_{p0}, J, E, F take the form

\[ J = \begin{pmatrix} -\dfrac{d^{+}(p_1)}{d^{-}(p_1)} & 0 \\ 0 & 1 \end{pmatrix}, \qquad E = \begin{pmatrix} \dfrac{2\, d^{+}(p_1)}{d(p_1)\, d^{-}(p_1)} & 0 \\ 0 & 0 \end{pmatrix} \]

and

\[ F = \begin{pmatrix} l(p_0, p_1) - d^{+}(p_0) - d^{-}(p_1) & 0 \\ 0 & l(p_0, p_1) - d^{+}(p_0) + d^{+}(p_1) \end{pmatrix}. \]

Proof. By a straightforward computation, we obtain J = I + d^+(p1)R̃_m, E = J R̃_m and F = (l(p0, p1) − d^+(p0)) I + d^+(p1)J^{−1}. The matrices J, E, F are self-adjoint. To finish the proof, we compute the entries of R̃_m with respect to the basis {w1(p0), w2(p0)}, and we use Formula (7).
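The factorization of Lemma 2 can be checked numerically in the basis {w1(p0), w2(p0)}. The following Python sketch assumes that D_{p0}T1 is written in the coordinates (ξ′, η′) as the product (10) with R replaced by R̃_m = diag(−2/d(p1), 0) and l(p, Tp) replaced by l(p0, p1); the function names and the sample values are illustrative assumptions.

```python
import numpy as np

def blocks(a, b, c, d):
    """Assemble a 4x4 matrix from four 2x2 blocks."""
    return np.block([[a, b], [c, d]])

def lemma2_operators(l, d_plus_p0, d_plus_p1, d_minus_p1):
    """Return (R, J, E, F) in the basis {w1, w2}, following the proof of Lemma 2."""
    # d(p1) is recovered from identity (7).
    d = 2.0 * d_minus_p1 * d_plus_p1 / (d_minus_p1 + d_plus_p1)
    I = np.eye(2)
    R = np.diag([-2.0 / d, 0.0])
    J = I + d_plus_p1 * R
    E = J @ R
    F = (l - d_plus_p0) * I + d_plus_p1 * np.linalg.inv(J)
    return R, J, E, F

# Sample values (assumed): l(p0, p1), d+(p0), d+(p1), d-(p1).
l, dp0, dp1, dm1 = 5.0, 1.0, 1.5, 2.0
R, J, E, F = lemma2_operators(l, dp0, dp1, dm1)
I, Z = np.eye(2), np.zeros((2, 2))

# Product of the form (10) for the first return map T1.
product_10 = blocks(I, dp1 * I, Z, I) @ blocks(I, Z, R, I) @ blocks(I, (l - dp0) * I, Z, I)
# Factorization stated in Lemma 2.
factorized = blocks(J, Z, Z, np.linalg.inv(J)) @ blocks(I, Z, E, I) @ blocks(I, F, Z, I)
print(np.allclose(product_10, factorized))
```

With these sample values J = diag(−0.75, 1), which agrees with the entry −d^+(p1)/d^−(p1) of the lemma.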

The next corollary is an immediate consequence of the previous lemma and Relation (9) (consequence of Condition (8)).


Corollary 1. E, F are positive semi-definite. Furthermore F is positive definite if and only if π(p0) and π(p1) belong to different cylinders.

Lemma 3.
1. Property P1 is satisfied.
2. Let p0 ∈ ℳ+ \ 𝒮^+. Suppose that for some 0 ≠ u ∈ T_{p0}ℳ+, Q_{T1p0}(D_{p0}T1 u) = Q_{p0}(u). Then there exist a, b ∈ R such that u = (aw2(p0), bw1(p0)) if π(p0) and π(T1p0) belong to the same cylinder, and u = (aw2(p0), 0) if π(p0) and π(T1p0) belong to distinct cylinders.

Proof. Given (ξ′, η′) ∈ V_{p0} × V_{p0}, let

\[ \begin{pmatrix} \xi'' \\ \eta'' \end{pmatrix} = D_{p_0} T_1 \begin{pmatrix} \xi' \\ \eta' \end{pmatrix} \in V_{p_0} \times V_{p_0}. \]

By Lemma 2, we have

\[ \begin{pmatrix} \xi'' \\ \eta'' \end{pmatrix} = \begin{pmatrix} J(\xi' + F\eta') \\ J^{-1}\bigl(E(\xi' + F\eta') + \eta'\bigr) \end{pmatrix}. \]

Since J is symmetric, we obtain

\[ \begin{aligned} Q_{p_1}(\xi'', \eta'') &= \langle \xi' + F\eta',\, E(\xi' + F\eta') + \eta' \rangle \\ &= Q_{p_0}(\xi', \eta') + \langle F\eta', \eta' \rangle + \langle \xi' + F\eta',\, E(\xi' + F\eta') \rangle \\ &\geq Q_{p_0}(\xi', \eta') \end{aligned} \tag{12} \]

because E, F are positive semi-definite by Corollary 1. This proves the first part of the lemma. Let ξ′_k, η′_k, 1 ≤ k ≤ 2, be the components of ξ′ and η′ with respect to the basis {w1(p0), w2(p0)}. Using the matrix form of E, F with respect to this basis (Lemma 2), one can easily check that the equality in (12) holds if and only if ξ′_1 = η′_2 = 0 when π(p0) and π(T1p0) belong to the same cylinder, and if and only if ξ′_1 = η′_1 = η′_2 = 0 when π(p0) and π(T1p0) belong to different cylinders. In this last case, in fact, F is positive definite. This concludes the proof of the second part of the lemma.

Let Proj_p be the orthogonal projection onto V_p. Note that U_p p = Tp.

Lemma 4. U_p Proj_p = Proj_{Tp} U_p.

Proof. For every w ∈ R^3, we have

U_p Proj_p w = U_p(w − ⟨w, p⟩p) = U_p w − ⟨w, p⟩Tp = U_p w − ⟨U_p w, Tp⟩Tp = Proj_{Tp} U_p w.
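Lemma 4 only uses the facts that U_p is an isometry and that it maps the direction p to the direction Tp. The following Python sketch checks the identity for a randomly chosen reflection and direction; the function names are illustrative, and the vectors n and p are arbitrary test data rather than quantities from a billiard trajectory.

```python
import numpy as np

def reflection(n):
    """Matrix of the reflection of R^3 about the plane orthogonal to n."""
    n = n / np.linalg.norm(n)
    return np.eye(3) - 2.0 * np.outer(n, n)

def proj_perp(p):
    """Matrix of the orthogonal projection onto the plane orthogonal to p."""
    p = p / np.linalg.norm(p)
    return np.eye(3) - np.outer(p, p)

rng = np.random.default_rng(0)
n = rng.normal(size=3)   # normal of the tangent plane (arbitrary test data)
p = rng.normal(size=3)   # incoming direction (arbitrary test data)
U = reflection(n)
Tp = U @ p               # reflected direction, playing the role of Tp

lhs = U @ proj_perp(p)
rhs = proj_perp(Tp) @ U
print(np.allclose(lhs, rhs))
```

The same check works for any orthogonal matrix U, since ⟨w, p⟩ = ⟨Uw, Up⟩ is the only property used in the proof.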


Lemma 5. P2 is satisfied.

Proof. Let p−1 ∈ + \ S2+ , p0 = T1 p−1 and p1 = T1² p−1. Let qi = π(pi), i = −1, 0, 1. We study only the case q−1, q0 ∈ C1 and q1 ∈ C2, where C1 and C2 are the cylinders of ∂. The other cases can be studied similarly. We argue by contradiction. Suppose that there exists a vector 0 ≠ u ∈ Vp−1 × Vp−1 such that

$$Q_{p_{-1}}(u) = Q_{p_0}(D_{p_{-1}}T_1 u) = Q_{p_1}(D_{p_{-1}}T_1^2 u). \tag{13}$$

The second part of Lemma 3 applied to the first equality of (13) (from the left) implies that there exist a, b ∈ R such that u = (a w2(p−1), b w1(p−1)). By Lemma 2, we obtain

$$D_{p_{-1}}T_1 u = \begin{pmatrix} a\,w_2(p_{-1}) \\ \tilde b\, b\,w_1(p_{-1}) \end{pmatrix},$$

where $\tilde b = -d^-(p_0)/d^+(p_0) \ne 0$. On the other hand, the second equality of (13) and Lemma 2 imply that there exists c ∈ R such that $U_{p_{-1}} D_{p_{-1}}T_1 u = (c\,w_2(p_0), 0) \in V_{p_0} \times V_{p_0}$. Thus

$$a\,U_{p_{-1}} w_2(p_{-1}) = c\,w_2(p_0), \qquad \tilde b\, b\,U_{p_{-1}} w_1(p_{-1}) = 0. \tag{14}$$

Since $\tilde b \ne 0$, we have b = 0. Moreover, as u ≠ 0, we have ac ≠ 0. Since w2(p−1) and w2(p0) are unit vectors and Up−1 is an isometry, we have a = c. Let N1 and N2 be the unit vectors parallel to the axes of C1 and C2, respectively. Of course, span(N1, N2) = ei⊥ for some 1 ≤ i ≤ 3, where Bi− and Bi+ are the faces of the box to which C1 and C2 are attached. According to the results of Sect. 3.4, w2(p−1) = Proj_{p−1} N1 and w2(p0) = Proj_{p0} N2. Using Lemma 4 and the fact that Up−1 N1 = N1, we obtain

$$U_{p_{-1}} w_2(p_{-1}) = U_{p_{-1}}\mathrm{Proj}_{p_{-1}} N_1 = \mathrm{Proj}_{p_0} U_{p_{-1}} N_1 = \mathrm{Proj}_{p_0} N_1.$$

Thus the first equation of (14) becomes Proj_{p0} N1 = Proj_{p0} N2. Since N1 and N2 are orthogonal, this equation is satisfied only if p0 ∈ span(N1, N2). But span(N1, N2) = ei⊥, so that ⟨p0, ei⟩ = 0. This implies that π(p1) ∉ C2, contradicting our assumption. The proof of Theorem 1 is complete.
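Lemma 4, on which the proof above relies, can also be sanity-checked numerically. The following sketch is an illustration under assumptions that are not taken from the paper: the isometry U is modelled by a Householder reflection of R³, p is a random unit vector, T p is taken to be U p, and Proj_p w = w − ⟨w, p⟩p. The function names are hypothetical.

```python
import math
import random


def dot(a, b):
    return sum(x * y for x, y in zip(a, b))


def random_unit(rng):
    # Random unit vector in R^3; resample if the draw is too close to zero.
    while True:
        v = [rng.uniform(-1.0, 1.0) for _ in range(3)]
        norm = math.sqrt(dot(v, v))
        if norm > 1e-3:
            return [c / norm for c in v]


def proj(p, w):
    # Orthogonal projection of w onto the plane orthogonal to the unit vector p.
    s = dot(w, p)
    return [wi - s * pi for wi, pi in zip(w, p)]


def reflect(n, w):
    # Householder reflection across the plane orthogonal to the unit vector n.
    s = 2.0 * dot(w, n)
    return [wi - s * ni for wi, ni in zip(w, n)]


def lemma4_defect(trials=200, seed=1):
    # Largest entry of U Proj_p w - Proj_{Up} U w over random samples.
    rng = random.Random(seed)
    worst = 0.0
    for _ in range(trials):
        n, p = random_unit(rng), random_unit(rng)
        w = [rng.uniform(-1.0, 1.0) for _ in range(3)]
        lhs = reflect(n, proj(p, w))
        rhs = proj(reflect(n, p), reflect(n, w))
        worst = max(worst, max(abs(a - b) for a, b in zip(lhs, rhs)))
    return worst
```

The defect vanishes up to rounding because a reflection preserves inner products, which is exactly the step ⟨w, p⟩ = ⟨U w, U p⟩ used in the proof of Lemma 4.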

Remark 2. This proof fails if at least one pair of opposite faces of the box of are skew parallelograms instead of rectangles, because, in this case, (9) is not verified even if (8) is. More precisely, as a consequence of the fact that a pair of faces of the box of is a skew parallelogram, one of the cylinders of ∂ has the axis which is not parallel to a coordinate axis. Suppose that C1 is such a cylinder. Then it is not hard to see that (essentially because in a parallelogram with non-orthogonal adjacent sides, the number of reflections of a trajectory starting from a vertical side and ending at the other is bounded above, and d ± is arbitrarily large for vectors having direction close to the axis ˆ ⊂ π −1 (C1 ) of positive measure such that i) every of a cylinder) there exists a set


ˆ leaves C1 and after a finite number of collisions with some flat faces of hits p1 ∈ C2 , and ii) if p2 ∈ π −1 (C2 ) is the vector corresponding to the collision with C2 , then ˆ l(p1 , p2 ) < d + (p1 ) for every p1 ∈ . Remark 3. By Theorems 4 and 4.4. of [D], small C 6 perturbations of C 6 absolutely focusing curves are still absolutely focusing, and the focusing times τγ±,C vary continuously with γ . The same property is valid for C 4 curves verifying d 2 r/ds 2 < 0 [W2]. Therefore if satisfies (8) and the sections of the cylinders are C 6 or C 4 verifying d 2 r/ds 2 < 0, then we see that small perturbations (in the proper class of smoothness) of the sections of the cylinders of ∂ produce new domains for which (8) remains valid. By Theorem 1, the billiards in these domains are hyperbolic. 5. Billiards with Divided Phase Space We say that a system has divided phase space if its phase space is the union of two disjoint sets of positive measure such that one has non-zero Lyapunov exponents almost everywhere and the other has zero-Lyapunov exponents almost everywhere. Two-dimensional semi-focusing billiards with divided phase space were constructed in [B4]. Their phase space consists of integrable regions surrounded by hyperbolic and ergodic regions. Using essentially the same ideas as in [B4], we construct, in this section, a class of semi-focusing billiards in R3 with divided phase space. Consider a domain as in Fig. 1 where the sections of the cylinders C1 and C2 are semi-circles and their axes are orthogonal. We also choose so that Condition (8) is satisfied. Pick one of the two cylinders, say C1 , and denote by H its convex hull. Now contract \ H uniformly along the direction of the axis of C2 in such a way to obtain a mushroom-like domain M as in Fig. 2. The cylinder C1 is the “hat” of the mushroom. We assume, although it is not necessary, that M is symmetric with respect to the plane which is perpendicular to the axis of C2 and contains the axis of C1 . 
Remark 4. More complex mushroom-like domains can be designed following [B4], like, for instance, domains consisting of cylinders with semi-ellipses as sections connected by rectangular boxes. The billiards in these domains have several hyperbolic and integrable regions of positive measure. Assembling together in a proper way countably many cylinders and boxes, even three dimensional billiards with countably many ergodic components can be constructed. We stick to the simple domain described above to avoid technical complications, and because the mechanism producing the divided phase space is the same in all these billiards. Theorem 2. The phase space of the billiard in a domain M is an union of two disjoint invariant subsets 1 and 2 of positive µ-measure such that ˜ such that µ() ˜ = µ(1 ), and the restriction of T to 1. 1 contains an invariant set ˜ is integrable (and hence T has zero Lyapunov exponents µ-a.e. on 1 ), 2. the restriction of T to 2 is hyperbolic. Proof. Let H = π −1 (∂H ∩∂). Let N be the unit vector parallel to the axis of C1 , and let q0 be a point lying on the axis of C1 . For every (q, v) ∈ H , let I1 (q, v) = v, N , and let I2 (q, v) = (q − q0 ) ∧ v, N /(1 − I12 (q, v))1/2 which gives the minimum distance of the line passing through q and parallel to v from the axis of C1 . It is not difficult


Fig. 2. A 3-dim mushroom-like billiard table

to check that I1 and I2 are in involution (with respect to the symplectic form ω; see Subsect. 2.2) and are independent on an open and dense set of H . We show how this can be done on π −1 (C1 ) ⊂ H . The proof for H \ π −1 (C1 ) is omitted because it is similar. Given (q, v) ∈ π −1 (C1 ), let 0 ≤ θ1 , θ2 ≤ π be polar coordinates for v, where θ1 is the angle formed by v with N and π/2 − θ2 is the angle formed by the projection of v onto N ⊥ with n(q). Let r1 be the radius of the section of C1 . A simple computation shows that I1 (q, v) = cos θ1 and I2 (q, v) = −r1 sin θ1 cos θ2 for every (q, v) ∈ π −1 (C1 ). The map (s1 , s2 ) → (x = r1 cos(s2 /r1 ), y = r1 sin(s2 /r1 ), z = s1 ) is a smooth parametrization of C1 where the origin of the system of Cartesian coordinates (x, y, z) lies on the axis of C1 . In coordinates (s1 , s2 , θ1 , θ2 ), the symplectic form ω is given by sin θ1 (ds1 ∧ dθ1 + ds2 ∧ dθ2 ). It follows immediately that I1 and I2 are in involution and independent on int π −1 (C1 ). Let 1 = {p ∈ H : T k z ∈ H

∀k ∈ Z}

be the set of the vectors whose trajectory is “trapped” inside H . It follows from the symmetry of H that |I1 | and |I2 | are first integrals of T |1 . Let 2a be the length of the edge of B parallel to the x-axis. For any 0 ≤ α1 < 1 and a < α2 ≤ r1 , let (α1 , α2 ) = {p ∈ 1 : |I1 (p)| = α1 and |I2 (p)| = α2 }. Each of these sets is a T -invariant, smooth and compact submanifold of codimension two. Let ˜ = (α1 , α2 ). 0≤α1 <1 a<α≤r1
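The conservation of I1 and I2 can be observed numerically in a simplified setting. The sketch below is an assumption-laden illustration, not the construction of the paper: it follows a billiard trajectory inside a full circular cylinder of infinite length (no flat faces, no box), with axis N = (0, 0, 1), q0 at the origin and an illustrative radius R. The names `integrals`, `bounce` and `max_drift` are hypothetical.

```python
import math

R = 1.0  # radius of the circular section (illustrative value)


def integrals(q, v):
    # I1 = <v, N> and I2 = <(q - q0) ^ v, N> / sqrt(1 - I1^2), with N = e_z, q0 = 0.
    i1 = v[2]
    i2 = (q[0] * v[1] - q[1] * v[0]) / math.sqrt(1.0 - i1 * i1)
    return i1, i2


def bounce(q, v):
    # Free flight from a boundary point q to the next hit on x^2 + y^2 = R^2,
    # followed by a specular reflection with respect to the radial normal.
    t = -2.0 * (q[0] * v[0] + q[1] * v[1]) / (v[0] ** 2 + v[1] ** 2)
    q = [q[k] + t * v[k] for k in range(3)]
    n = [q[0] / R, q[1] / R, 0.0]
    s = 2.0 * (v[0] * n[0] + v[1] * n[1])
    return q, [v[k] - s * n[k] for k in range(3)]


def max_drift(steps=50):
    # Largest deviation of I1 and I2 from their initial values along an orbit.
    q = [R, 0.0, 0.0]
    v = [-0.6, 0.3, 0.5]  # points into the cylinder at q
    norm = math.sqrt(sum(c * c for c in v))
    v = [c / norm for c in v]
    i1_0, i2_0 = integrals(q, v)
    d1 = d2 = 0.0
    for _ in range(steps):
        q, v = bounce(q, v)
        i1, i2 = integrals(q, v)
        d1 = max(d1, abs(i1 - i1_0))
        d2 = max(d2, abs(i2 - i2_0))
    return d1, d2
```

In this simplified model both quantities stay constant up to rounding: the reflection does not change the component of v along the axis, and the normal at the collision point is radial, so the component of (q − q0) ∧ v along N is unchanged as well.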

˜ ⊂ 1 , µ() ˜ > 0 and T | ˜ is integrable [A, T]. Thus all It follows immediately that ˜ are equal to zero. To finish the proof of the first part of Lyapunov exponents of T on ˜ = 0. Note that if z ∈ 1 \ , ˜ then the the theorem, we have to show that µ(1 \ )


projection of the trajectory of z onto the xy-plane is periodic. It is not difficult to see that this implies that z ∈ I2−1 (α2 ), where α2 = r1 cos(m/(2r1 )) for some m ∈ Q. Thus m ˜ ⊂ 1 \ , I2−1 r1 cos 2r1 m∈Q

˜ = 0 follows immediately from the fact that I −1 (α2 ) is a smooth hyperand µ(1 \ ) surface of . Let 2 = \ 1 . This set consists of vectors p ∈ such that T k p ∈ \ H for some k = k(p) ∈ Z. By the geometry of , we see that for µ-a.e. p ∈ \ H , there exists a n = n(p) > 0 such that T n(p) p ∈ π −1 (C1 ). On the other hand, since µ(2 ) > 0, µ-a.e. p ∈ 2 returns infinitely many times to 2 by the Poincar´e Recurrence Theorem. Thus the trajectory of µ-a.e. p ∈ 2 bounces back and forth between the cylinders C1 and C2 infinitely many times. Theorem 1 applied to T |2 implies that T |2 is hyperbolic. Acknowledgements. This paper was written while the second author was visiting the Instituto Superior T´ecnico in Lisbon and the Centro di Ricerca Matematica “Ennio de Giorgi” in Pisa whose hospitality G. D. M. acknowledges gratefully. The visit at the last institute was sponsored by the city of Lizzanello (Italy) which G. D. M. thanks warmly. G. D. M. would also like to thank P. Balint for helpful discussions. Finally the authors would like to thank an anonymous referee for valuable and constructive remarks.

References

[A] Arnold, V.: Mathematical methods of classical mechanics. Graduate Texts in Mathematics 60, Berlin-Heidelberg-New York: Springer-Verlag, 1989
[B1] Bunimovich, L.: On the ergodic properties of nowhere dispersing billiards. Commun. Math. Phys. 65, 295–312 (1979)
[B2] Bunimovich, L.: Many-dimensional nowhere dispersing billiards with chaotic behavior. Physica D 33, 58–64 (1988)
[B3] Bunimovich, L.: On absolutely focusing mirrors. In: Ergodic theory and related topics, III (Güstrow, 1990), Lect. Notes Math. 1514, Berlin-Heidelberg-New York: Springer-Verlag, 1992, pp. 62–82
[B4] Bunimovich, L.: Mushrooms and other billiards with divided phase space. Chaos 11(4), 1–7 (2001)
[B-R1] Bunimovich, L., Rehacek, J.: Nowhere dispersing 3D billiards with non-vanishing Lyapunov exponents. Commun. Math. Phys. 189, 729–757 (1997)
[B-R2] Bunimovich, L., Rehacek, J.: On the ergodicity of many-dimensional focusing billiards, Classical and quantum chaos. Ann. Inst. H. Poincaré Phys. Théor. 68(4), 421–448 (1998)
[B-R3] Bunimovich, L., Rehacek, J.: How high-dimensional stadia look like. Commun. Math. Phys. 197(2), 277–301 (1998)
[C-M] Chernov, N., Markarian, R.: Entropy of non-uniformly hyperbolic plane billiards. Bol. Soc. Bras. Mat. 23, 121–135 (1992)
[C-F-S] Cornfeld, I., Fomin, S., Sinai, Ya.: Ergodic theory. New York: Springer-Verlag, 1982
[D] Donnay, V.: Using integrability to produce chaos: billiards with positive entropy. Commun. Math. Phys. 141, 225–257 (1991)
[H] Halpern, B.: Strange billiard tables. Trans. Amer. Math. Soc. 232, 297–305 (1977)
[L-W] Liverani, C., Wojtkowski, M.: Ergodicity in Hamiltonian systems. Dynamics Reported 4, Berlin-Heidelberg-New York: Springer-Verlag, 1995
[M] Markarian, R.: Billiards with Pesin region of measure one. Commun. Math. Phys. 118, 87–97 (1988)
[P1] Papenbrock, T.: Collective and chaotic motion in self-bound many-body systems. Phys. Rev. C 61, 034602 (2000)
[P2] Papenbrock, T.: Lyapunov exponents and Kolmogorov-Sinai entropy for a high-dimensional convex billiard. Phys. Rev. E 61, 1337–1341 (2000)
[P3] Papenbrock, T.: Numerical study of a three-dimensional generalized stadium billiard. Phys. Rev. E 61, 4626–4628 (2000)
[S1] Sinai, Ya.: Dynamical systems with elastic reflections. Russ. Math. Surv. 25, 137–189 (1970)
[S2] Sinai, Ya.: Development of Krylov’s ideas. Princeton Series in Physics. Princeton, NJ: Princeton University Press, 1979
[Si] Simanyi, N.: Hard Ball Systems and Semi-Dispersive Billiards: Hyperbolicity and ergodicity. In: Hard Ball Systems and the Lorentz Gas, D. Szász (ed.), Berlin: Springer, 2000, pp. 51–88
[Sz] Szász, D.: The K-property of “orthogonal” cylindric billiards. Commun. Math. Phys. 160, 581–597 (1994)
[T] Tabachnikov, S.: Billiards. Panor. Synth. 1, 1995
[W1] Wojtkowski, M.: Invariant families of cones and Lyapunov exponents. Erg. Th. Dynam. Syst. 5, 145–161 (1985)
[W2] Wojtkowski, M.: Principles for the design of billiards with nonvanishing Lyapunov exponents. Commun. Math. Phys. 105, 391–414 (1986)
[W3] Wojtkowski, M.: Measure theoretic entropy of the system of hard spheres. Erg. Th. Dynam. Syst. 8, 133–153 (1988)
[W4] Wojtkowski, M.: Linearly stable orbits in 3-dimensional billiards. Commun. Math. Phys. 129(2), 319–327 (1990)

Communicated by G. Gallavotti

Commun. Math. Phys. 262, 33–50 (2006) Digital Object Identifier (DOI) 10.1007/s00220-005-1474-7


Uniqueness of the SRB Measure for Piecewise Expanding Weakly Coupled Map Lattices in Any Dimension

Gerhard Keller¹, Carlangelo Liverani²

¹ Mathematisches Institut, Universität Erlangen-Nürnberg, Bismarckstr. 1 1/2, 91054 Erlangen, Germany. E-mail: [email protected]
² Dipartimento di Matematica, II Università di Roma (Tor Vergata), Via della Ricerca Scientifica, 00133 Roma, Italy. E-mail: [email protected]

Received: 4 November 2004 / Accepted: 30 June 2005
Published online: 9 December 2005 – © Springer-Verlag 2005

Abstract: We prove the existence of a unique SRB measure for a wide range of multidimensional weakly coupled map lattices. These include piecewise expanding maps with diffusive coupling.

1. Introduction

The field of expanding coupled map lattices has witnessed an impressive series of results since the late 1980s. Starting with [7], numerous authors contributed to the exploration of ergodic and statistical properties of invariant measures for such systems, see e.g. [1–5,8–20,29,31,32,34,35]. In all these publications the single site maps are hyperbolic or expanding (local) diffeomorphisms of a smooth manifold, and the coupling is modeled by a “diffeomorphism” of the infinite-dimensional state space. Only a few publications used a different approach, which also allows one to treat piecewise expanding maps and such common couplings as the diffusive nearest neighbour coupling [21–28, 33]. Yet, the state of the field is still far from satisfactory. One of the outstanding open problems is to substantiate rigorously the numerical picture of a phase transition given in [30]. The model considered in the aforementioned paper is a $\mathbb{Z}^2$ lattice of expanding Lasota-Yorke like maps, coupled by a diffusive nearest neighbour interaction. As the coupling parameter increases from zero, the authors notice the transition from a situation in which only one invariant measure describes the statistical properties of the system to one in which two relevant invariant measures appear (a phase transition, indeed). After more than ten years no aspect of such a picture has been rigorously proven.

In the present paper we prove the first (easier) part of the picture: the existence of only one “relevant” (that is SRB) measure for small coupling. The proof is surprisingly elementary. It combines the following key ideas:

(i) The starting point is a Lasota-Yorke type inequality for coupled systems (cf. [24, 28]).
(ii) The transfer operator of the uncoupled system is interpreted as a tensor product operator of the single site transfer operators (cf. [33]). This allows one to make optimal use of the strong mixing properties of the single site systems.
(iii) A “site-by-site” decoupling procedure allows one to reduce the dynamics of the coupled system “locally” to dynamics of tensor-product type at the cost of only small errors (cf. [25, 26]).
(iv) The aforementioned small errors are not controlled in the original system but in a huge extension of that system. This is the essential new idea of this paper. We believe that it has applications far beyond the present model; indeed, we expect it to be useful for all kinds of weakly coupled systems where the local dynamics can be described by linear operators with an isolated simple leading eigenvalue. Typical examples are high temperature stochastic Ising models or weakly coupled uniformly contractive iterated function systems.

The plan of the paper is as follows: Sect. 2 details the model and describes the basic result obtained in the paper. Sect. 3 describes the already mentioned extension of the system and how to use it to get the main estimate of the paper. Sect. 4 contains the proof of the main theorem based on the results of Sect. 3. Finally, Sect. 5 contains the proof for the case of more general coupling, but with an extra simplifying assumption on the single site map.

The essential part of this research was done during an ESF explorative workshop at the Max-Planck-Institute for Mathematics, Bonn. We thank both institutions for their support.

2. The Model and the Result

Given a compact interval $I \subset \mathbb{R}$ we will consider the phase space $\Omega := I^\Lambda$, where either $\Lambda = \mathbb{Z}^d$ or $\Lambda$ is a box in $\mathbb{Z}^d$.¹ In the following we always assume $I = [0, 1]$ and $0 = (0, \dots, 0) \in \Lambda$, as this can be done without loss of generality.² We will have a single site dynamics given by the map $\tau : I \to I$. We assume $\tau$ to be a piecewise $C^2$ map from $I$ to $I$ with singularities at $\zeta_1, \dots, \zeta_{N-1} \in (0, 1)$ in the sense that $\tau$ is monotone and $C^2$ on each component of $I \setminus \{\zeta_0 = 0, \zeta_1, \dots, \zeta_{N-1}, \zeta_N = 1\}$. We assume that $\tau''/(\tau')^2$ is bounded and that $\inf |\tau'| > 2$.³ Next, we define the unperturbed dynamics $T_0 : \Omega \to \Omega$ by $[T_0(x)]_p := \tau(x_p)$. To define the perturbed dynamics we introduce couplings $\Phi_\varepsilon : \Omega \to \Omega$ of the form $\Phi_\varepsilon(x) := x + A_\varepsilon(x)$. We call $\Phi_\varepsilon$ an $(a_1, a_2)$-coupling if there are operators $A', A'' : \ell^1(\Lambda) \to \ell^1(\Lambda)$ with $a_1 = \|A'\|_1$, $a_2 = \|A''\|_1$ (maximal column sum norm) such that for all $k, p, q \in \Lambda$,

$$|(A_\varepsilon)_p| \le 2|\varepsilon|, \qquad |(DA_\varepsilon)_{qp}| \le 2|\varepsilon|\,A'_{qp}, \qquad |\partial_k (DA_\varepsilon)_{qp}| \le 2|\varepsilon|\,A''_{qp}. \tag{2.1}$$

Here $\partial_k$ denotes the partial derivative with respect to $x_k$. In addition, we say that $\Phi_\varepsilon$ has finite coupling range $w > 0$ if $\partial_p \Phi_{\varepsilon,q} = 0$ whenever $|p - q| > w$. So $A'_{qp} = A''_{qp} = 0$ when $|p - q| > w$. We say that a coupling has short range if it is not of finite range and there exist constants $L > 0$ and $\gamma \in (0, 1)$ such that $A'_{qp} + A''_{qp} \le L\gamma^{|p-q|}$. Similarly, we say that a coupling has long range if it is neither finite range nor short range and there exists $c > 0$ such that $A'_{qp} + A''_{qp} \le L|p - q|^{-c}$.

¹ By box here, and in the following, we mean a hypercube. Of course much more general shapes can be considered by the same arguments, yet for shapes with too large a boundary problems may arise. To avoid all the related technicalities we confine ourselves to the above mentioned case.
² The reader should be aware that there is nothing special about $\mathbb{Z}^d$: any other lattice (or graph) can be treated similarly, provided the number of different sites that can be reached from a given site along a path of length $n$ grows at most subexponentially in $n$.
³ Under mild additional assumptions on $\tau$, maps with $1 < \inf |\tau'| \le 2$ can also be treated. The complications, which arise in the proof of a Lasota-Yorke type inequality, were overcome in [28], see also the discussions of this point in [22] and in [26, Footnote 14].

The diffusive nearest neighbor coupling used in [30], and in much of the numerical literature, is defined by

$$[\Phi_\varepsilon(x)]_p = x_p + \frac{\varepsilon}{2d} \sum_{|p-q|=1} (x_q - x_p) \qquad (p \in \Lambda), \tag{2.2}$$

and it is an example of a $(1, 0)$-coupling with range $w = 1$.⁴ The dynamics $T_\varepsilon : \Omega \to \Omega$ that we wish to investigate is then defined as $T_\varepsilon := \Phi_\varepsilon \circ T_0$, and, more precisely, we wish to investigate its invariant measures in some appropriate class. Let $M(\Omega)$ be the set of signed Borel measures on $\Omega$.⁵ To state the main result of the paper we need to introduce the concept of measures of bounded variation. Let $\mathcal{I}$ be the set of all boxes $\Lambda_1 \subset \Lambda$. For each $\Lambda_1 \in \mathcal{I}$ we define⁶

$$\mathrm{Var}\,\mu := \sup_{p \in \Lambda}\ \sup_{|\varphi|_{C^0(\Omega)} \le 1} \mu(\partial_p \varphi), \qquad \mathrm{Var}_{\Lambda_1}\mu := \sup_{p \in \Lambda_1}\ \sup_{|\varphi|_{C^0(I^{\Lambda_1})} \le 1} \mu(\partial_p \varphi). \tag{2.3}$$

It is easy to prove that the set $B(\Omega) := \{\mu \in M(\Omega) : \mathrm{Var}\,\mu < \infty\}$ consists of measures whose finite dimensional marginals are absolutely continuous with respect to Lebesgue and whose densities are functions of bounded variation [25]. In addition, such measures have finite entropy density with respect to Lebesgue [26, Corollary 5]. In fact, “Var” is a norm and, with this norm, $B(\Omega)$ is a Banach space.⁷ It is also useful to introduce the usual total variation norm on signed measures:

$$|\mu| := \sup_{|\varphi|_{C^0(\Omega)} \le 1} \mu(\varphi). \tag{2.4}$$

Just like in [26, Sect. 3.3] one checks easily that

$$|\mu| \le \tfrac{1}{2}\,\mathrm{Var}\,\mu. \tag{2.5}$$

As we are interested in studying observable invariant measures, we must restrict to a subclass of the class of all measures in order to make relevant statements. Clearly, $M(\Omega)$ is too large for our purposes, but on the other hand, in the case in which $\Lambda$ is infinite, $B(\Omega)$ is quite small. As usual in thermodynamics, it makes sense to require some condition on the growth of the relevant quantity with respect to the volume. Let $M_v(\Omega)$ be the closure of the set

$$\Bigl\{\mu \in M(\Omega) : \forall \eta > 0,\ \sup_{\Lambda_1 \in \mathcal{I}} e^{-\eta|\Lambda_1|^{1/d}}\,\mathrm{Var}_{\Lambda_1}\mu < \infty\Bigr\}$$

with respect to the norm $|\cdot|$. Clearly, $M_v(\Omega)$ consists of measures that can be uniformly approximated by measures with absolutely continuous finite dimensional marginals whose densities are functions of bounded variation with the variation growing less than exponentially in the size of the boxes.

⁴ Of course, if $\Lambda \ne \mathbb{Z}^d$, then the sum in (2.2) can involve sites not in $\Lambda$. To properly define the dynamics it is then necessary to supply some boundary conditions, that is, to specify some fixed value for $x_q$, $q \notin \Lambda$.
⁵ The topology that we use on $\Omega$ is the product one.
⁶ Here, and in the following, we will consider $C^0(I^{\Lambda_1})$ as a subspace of $C^0(\Omega)$ by the obvious inclusion. Also the sup is restricted to functions differentiable with respect to $x_p$.
⁷ See [26] for a careful discussion of bounded variation in the present context and the relevant associated properties.

The results of this paper can be summarized, a bit loosely, as follows.

Theorem 2.1. For each $(a_1, a_2)$-coupling $\Phi_\varepsilon$ of finite range $w$, there exists $\varepsilon_0 > 0$ such that, for each $|\varepsilon| < \varepsilon_0$, the dynamical system $(\Omega, T_\varepsilon)$ has a unique invariant measure $\mu_\varepsilon$ in $M_v(\Omega)$.⁸ In addition, $\mu_\varepsilon$ belongs to $B(\Omega)$, is exponentially mixing both in time and in space, and it is the SRB measure of the system.

The proof, which also makes the statement precise, can be found in Sect. 4. To obtain such a result we consider the dynamics acting directly on the measures via the linear operator $T_\varepsilon^*\mu(A) := \mu(T_\varepsilon^{-1}A)$ (for each measurable set $A$). The basic facts concerning the operator $T_\varepsilon^*$ are detailed in the following lemma.

Lemma 2.2 (Lasota-Yorke inequality). For each $(a_1, a_2)$-coupling $\Phi_\varepsilon$, there exist $\varepsilon_1 > 0$, $\lambda > 1$, and $a, b > 0$ such that, for each $|\varepsilon| < \varepsilon_1$, the operator $T_\varepsilon^*$ is well defined as an operator on $B(\Omega)$. In addition, for each $\mu \in B(\Omega)$,

$$|T_\varepsilon^*\mu| \le |\mu|, \qquad \mathrm{Var}(T_\varepsilon^{*n}\mu) \le a\lambda^{-n}\,\mathrm{Var}\,\mu + b|\mu|.$$

This is the special case $\theta = 1$ of Proposition 4 in [26] (see below for the meaning of $\theta$). Observe that the proof given there for $\Lambda = \mathbb{Z}$ applies (only if $\theta = 1$!) without changes to $\Lambda \subseteq \mathbb{Z}^d$. From preceding experience it is also useful to consider larger Banach spaces: first define, for each $\theta \in (0, 1]$, a norm

$$\|\mu\|_\theta := \sup_{\Lambda_1 \in \mathcal{I}} \theta^{|\Lambda_1|}\,\mathrm{Var}_{\Lambda_1}\mu \tag{2.6}$$

on $B(\Omega)$. Then we let $B(\Omega, \theta)$ be the completion of $B(\Omega)$ with respect to this norm.⁹ Observe that $\|\mu\|_{\theta=1} = \mathrm{Var}\,\mu$. The key estimate on which Theorem 2.1 relies is given in the following lemma, whose proof is the content of the next section. Let $B^0(\Omega) := \{\mu \in B(\Omega) : \mu(1) = 0\}$.

Lemma 2.3. Recall that $\Omega = I^\Lambda$. For each $(a_1, a_2)$-coupling $\Phi_\varepsilon$ with finite range $w$, there exist $\sigma \in (0, 1)$ and $C, \varepsilon_2 > 0$ such that, for all $|\varepsilon| < \varepsilon_2$, $\mu \in B^0(\Omega)$, $\theta \in (0, 1)$, and $n \in \mathbb{N}$,

$$\|T_\varepsilon^{*n}\mu\|_\theta \le C\sigma^n \min\{|\Lambda|, |e \ln\theta|^{-1}\}\,\mathrm{Var}\,\mu.$$

⁸ If the coupling is defined only for nonnegative $\varepsilon$ (as is the case for the diffusive nearest neighbour coupling on $\Omega$), this has to be understood as “for each $\varepsilon \in [0, \varepsilon_0)$ . . . ” here and in the sequel.
⁹ Note that if $|\Lambda| = \infty$ and $0 < \theta < 1$, then $B(\Omega, \theta)$ contains objects that are not signed measures, see [25, 26] for details.

Uniqueness for the SRB for CML

37

Finally, we wish to emphasize the power of the approach by showing the possibility of extending it to more general settings. Short range interactions can be treated in a spirit similar to the one used for the finite range. Nevertheless, the technical construction becomes inevitably more involved. For the long range case the situation still looks similar, but one cannot expect an exponential convergence to the invariant measure, so one cannot simply rely on an estimate of the spectral radius of the covering dynamics and the story is bound to acquire an extra layer of complexity. To keep the technicalities to a minimum, here we content ourselves with the following result (proved in Sect. 5) concerning the short range case with an additional assumption on the single site map.

Theorem 2.4. If the map $\tau$ is Lipschitz, then for each $(a_1, a_2)$-coupling $\Phi_\varepsilon$ of short range, there exists $\varepsilon_0 > 0$ such that, for each $|\varepsilon| < \varepsilon_0$, the dynamical system $(\Omega, T_\varepsilon)$ has a unique invariant measure $\mu_\varepsilon$ in $M_v(\Omega)$. In addition, $\mu_\varepsilon$ belongs to $B(\Omega)$, is exponentially mixing both in time and in space, and it is the SRB measure of the system.
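To make the model of Sect. 2 concrete, the following sketch implements the diffusive nearest neighbour coupling (2.2) and the coupled dynamics, defined as the coupling applied after the uncoupled map, on a small lattice. Several choices here are assumptions made only for illustration and are not taken from the paper: the lattice is a one-dimensional ring (d = 1) with periodic boundary conditions, the single site map is x ↦ 3x mod 1 (piecewise linear with slope 3 > 2), and the function names are hypothetical.

```python
import math


def tau(x):
    # Illustrative single site map on [0, 1]: 3x mod 1, with |tau'| = 3 > 2.
    y = 3.0 * x
    return y - math.floor(y)


def t0(x):
    # Uncoupled dynamics: apply tau at every site.
    return [tau(xp) for xp in x]


def phi(x, eps):
    # Diffusive nearest neighbour coupling (2.2) on a ring with d = 1:
    # x_p + (eps / 2d) * sum over the two neighbours of (x_q - x_p).
    n = len(x)
    return [x[p] + 0.5 * eps * ((x[(p - 1) % n] - x[p]) + (x[(p + 1) % n] - x[p]))
            for p in range(n)]


def t_eps(x, eps):
    # Coupled dynamics: coupling applied after the uncoupled map.
    return phi(t0(x), eps)


def orbit(x, eps, steps):
    # Iterate the coupled map and return the final configuration.
    for _ in range(steps):
        x = t_eps(x, eps)
    return x
```

For 0 ≤ ε ≤ 1 each new coordinate is a convex combination of a site and its two neighbours, so configurations stay in the unit cube. With periodic boundary conditions the coupling also leaves constant configurations fixed and preserves the sum of the coordinates.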

3. Lifting the System, Proof of Lemma 2.3 From now on we will suppress the dependence on in notations like B(). The basic idea of the present work is to define an extension of the linear system (T∗ , B) and to study its spectral properties instead of the ones of T∗ . To do so define Bp := {µ ∈ B : ∂p ϕ = 0 ⇒ µ(ϕ) = 0} . 0

Remark that Bp ⊂ B 0 . We can then define B := Xp∈ Bp and B := (B 0 ) , these are Banach spaces with the norm µ ¯ := supp∈ Var µp . 0

0

0

As T∗ (B 0 ) ⊆ B 0 , the (coupled) dynamics is easily lifted to B , namely T : B → B can be defined as (T µ) ¯ p := T∗ µp . However, only in the uncoupled case = 0 the 0

operator T 0 leaves the subspace B of B invariant. Since the invariance of this subspace - also under a suitable lift of T∗ when = 0 - is crucial for our approach, we need to proceed more carefully in choosing a suitable lift. To start with, let us consider some total ordering σ : N → Zd of Zd with the property:10 1

1

c−1 i d ≤ |σ (i)| ≤ ci d .

(3.1)

For each p = p + σ (0) and q = p + σ (i) in Zd one can then define the (partial) telescoping operators p,q acting on test functions,11 p,q ϕ(x) :=

ϕ(x) dxp · · · dxp+σ (i−1) −

ϕ(x) dxp · · · dxp+σ (i) .

Essentially p ∈ specifies the point from which one starts to telescope and q how far one is in the telescoping procedure. Note that p,q ϕ = 0 if ∂q ϕ = 0, and that p,q ϕ 10

For example, on a square lattice one can spiral out from zero on larger and larger squares. d Here ϕ ∈ C 0 (I Z ). This definition suffices in view of the identification already mentioned in Footnote 6. 11


does not depend on the variables contained inside a box of size c−2 |q − p| centered at p.12 We then define the lift : B 0 → B by (µ)q := ∗0,q µ, and the projection map P : B → B(θ ) by P (µ) ¯ :=

µp

p∈

which is well defined for θ ∈ (0, 1) even if is infinite.13 In fact, P B →B(θ) ≤ min{||, |e ln θ|−1 } ,

(3.2)

because Var 1 µp = 0 if p ∈ \ 1 . Observe also that, for each function ϕ depending only on finitely many variables and for each µ ∈ B 0 , P ((µ))(ϕ) = µ(ϕ) . In addition, it is easy to verify that P T = T∗ P .

(3.3)

As remarked before, since T B ⊂ B , we need some way to go back to the space B . 0 This is achieved via the (partially defined!) telescoping operator H¯ : B → B , (H¯ µ)q = ∗p,q µp . (3.4) p∈

Indeed, the infinite sum is not always well defined, but as we only consider finite range couplings, the operators m T ,m := H¯ T (m ≥ 1)

(3.5)

are always well defined on B , as the next lemma shows. Lemma 3.1. Let m ≥ 1. The linear operator T ,m : B → B is well defined, and T ,m µ ¯ ≤ C(wm)d sup Var(T∗m µp ) .

(3.6)

p∈

Proof. Let ϕ ∈ C 1 (). As noted above, ∂q (p,q ϕ)m = 0 for all q ∈ such2 that −2 |q − p| < c |q − p|. Consequently, ∂p (p,q ϕ) ◦ T = 0 provided |q − p| > c mw. Therefore, for µp ∈ Bp , ∗p,q T∗m µp (ϕ) = µp (p,q ϕ) ◦ Tm = 0 if |q − p| > c2 mw.

Indeed, if q ∈ , |q − p| < c−2 |q − p|, then, by (3.1), σ −1 (q − p) ≤ cd |q − p|d < c−d |q − p|d ≤ Hence q has already been integrated out in p,q . 13 Note that, on each local test function, the sum reduces to a finite sum. 12

σ −1 (q − p).


It follows that for µ¯ ∈ B , µ¯ = (µp )p∈ ,

¯ q = (H¯ T µ) ¯ q= (T ,m µ) m

∗p,q T∗m µp

(3.7)

|q−p|≤c2 mw

is well defined, and T ,m µ ¯ = sup Var(H¯ T µ) ¯ q ≤ 2(2c2 mw)d sup Var(T∗m µp ) . m

q∈

p∈

Recall from (3.3) that P T = T∗ P . As P H¯ = P whenever these operators are well defined, it follows P T ,m = T∗m P .

(3.8)

So we can use T ,m : B → B as a covering dynamics for T∗m : B 0 → B 0 . In particular, we have the commuting diagrams m

H¯

T

B  

−−→ B 0 −−→ −−−−−−−−−−−−→ B H¯ ◦Tm =:T ,m   P

B 0 −−−−−−−− −−−−−→ B(θ ) ∗m T

and, more generally,

(3.9)

n T ,m

B −−−−→ B     P

for all n ≥ 1.

B 0 −−− −→ B(θ ) ∗mn T

which makes sense since, by Lemma 3.1, T ,m is a bounded operator on B . To conclude we need to have a closer look at the operator T ,m . Recall that the sum in (3.7) is a finite sum as the interaction has finite range. Next, writing m = m1 + m2 , for each µ¯ ∈ B and each p ∈ , holds Var(T∗m µp ) ≤ a λ−m1 Var µp + b|T∗m2 µp |

(3.10)

with a = a(a + 21 b), see Lemma 2.2 and Eq. (2.5). In order to profit from the strong mixing properties of the single site operator we use a decoupling trick originally introp duced in a similar context in [33]: approximate by , where site p is decoupled from all other sites. To this end we introduce the following notation: let ι¯p : I → I p be the map (¯ιp (x))q = xq if q = p and (¯ιp (x))p = 0. Then define : I → I ,

xp if q = p p ( (x))q = (3.11) ( (¯ιp (x)))q if q = p.


Note that (Dp )qp = δqp .

(3.12)

This implies (p )∗ (Bp ) ⊆ Bp .

(3.13) p

It is then natural to define the decoupled dynamics T,p := ◦ T0 . Observe that m x) = τ m (x ). (T,p p p Here is a basic estimate for comparing different couplings. It is a variant of [26, Proposition 5], and we give its proof in the appendix. Lemma 3.2. The lemma consists of two parts: a) Let F, F˜ : → be two Lipschitz maps14 with Lipschitz constant L > 0 that are close in the following sense: There are constants K0 , K1 , K2 > 0 such that (i) q∈ supx |F˜q (x) − Fq (x)| ≤ K0 , (ii) q∈ supp∈ supx =p I |∂p F˜q (x =p , ξ ) − ∂p Fq (x =p , ξ )|dξ ≤ K1 , and (iii) sup{Var(Ft∗ ν) : 0 ≤ t ≤ 1, ν ∈ B(), Var ν ≤ 1} ≤ K2 ; Ft = t F˜ + (1 − t)F . Then, for each ν ∈ B, |F˜ ∗ ν − F ∗ ν| ≤ K2 (K0 + K1 ) Var ν .

(3.14)

b) For use in Sect. 5 we provide a variant of the above estimate: if assumptions (i) and (ii) are replaced by 1 (iv) q∈ supx |F˜q (x) − Fq (x)| 2 ≤ K3 for some K3 > 0, then, for each ν ∈ B, 1

|F˜ ∗ ν − F ∗ ν| ≤ 2K22 K3 Var ν .

(3.15)

As in [26, Lemma 8] one shows that the assumptions of part a) of this lemma are p satisfied for F = and F˜ = . Hence one can compute |∗ µ − (p )∗ µ| ≤ ||(8a1 + 2a2 + 4) Var µ

(3.16)

provided || < min{ 6a11 , 9a22 }. Using Lemma 2.2 it is then straightforward to show (compare the proof of [26, Theorem 6]) ∗m2 |T∗m2 µ − T,p µ| ≤ Cm2 || Var µ.

(3.17)

We are finally at the punch-line: let h be the invariant probability density of the single 1 m2 x) =p , τ m2 ξ ) dξ does not depend on xp , we have site map τ . As ψ(x) := 0 h(ξ )ϕ((T,p µp (ψ) = 0, so that

1 d ∗m2 m2 m2 m2 T,p µp (ϕ) = µp (ϕ ◦ T,p ) = µp (χ[0,xp ] (ξ )−xp h(ξ ))ϕ((T,p x) =p , τ ξ )dξ dxp 0

1 d m2 m2 L (χ[0,xp ] − xp h)(ξ )ϕ((T,p x) =p , ξ )dξ , = µp dxp 0 14 F : → is a “Lipschitz map”, if all F (x) are Lipschitz with respect to each coordinate x with q p uniformly bounded Lipschitz constants. This means in particular that all partial derivatives of all Fq exist s Lebesgue-a.e., are uniformly bounded and that Fq (x + sep ) − Fq (x) = 0 ∂p Fq (x + ξ ep )dξ .


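where L is the transfer operator of the single site map. This means that, calling $\sigma_0$ the mixing rate for the single site map,

$$|T_{\varepsilon,p}^{*m_2}\mu_p| \le C\sigma_0^{m_2}\,\mathrm{Var}\,\mu_p.$$

Combining this equation with (3.10) and (3.17) yields

$$\mathrm{Var}(T_\varepsilon^{*m}\mu_p) \le a'\lambda^{-m_1}\,\mathrm{Var}\,\mu_p + bCm_2|\varepsilon|\,\mathrm{Var}\,\mu_p + bC\sigma_0^{m_2}\,\mathrm{Var}\,\mu_p,$$

where $a' = a(a + \tfrac{1}{2}b)$ is the constant from (3.10). Setting $\sigma_1 := \max\{\lambda^{-1}, \sigma_0\}^{1/4} < 1$ and $m_1 = m_2$, there is, for $m$ large enough, $\varepsilon(m) > 0$ such that for $|\varepsilon| < \varepsilon(m)$,

$$\mathrm{Var}(T_\varepsilon^{*m}\mu_p) \le \sigma_1^m\,\mathrm{Var}\,\mu_p. \tag{3.18}$$

In view of Lemma 3.1 we conclude that, for each $\bar\mu \in \overline{B}$,

$$\|\overline{T}_{\varepsilon,m}\bar\mu\| \le C(mw)^d\,\sigma_1^m\,\|\bar\mu\|.$$

At this point we can choose $m$ large enough so that $[C(mw)^d]^{1/m}\sigma_1 =: \sigma < 1$, thereby obtaining

$$\|\overline{T}_{\varepsilon,m}\bar\mu\| \le \sigma^m\,\|\bar\mu\|. \tag{3.19}$$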

We now conclude the argument by using Eq. (3.2) and (3.9), T∗pm µθ = P T ,m (µ)θ ≤ 2 min{||, |e ln θ |−1 }σ pm Var µ. p

By the usual trick of writing n = pm + q, q < m and Lemma 2.2 we finally have T∗n µθ ≤ C min{||, |e ln θ |−1 }σ n Var µ,

(3.20)

for each µ ∈ B 0 and θ ∈ (0, 1). This finishes the proof of Lemma 2.3. 4. Proof of Theorem 2.1 Having obtained the exponential estimate (3.20), the assertions of Theorem 2.1 can be proved along well known lines. For our convenience we follow once more [26]. The existence of a T -invariant probability measure µ ∈ B follows from a weak compactness argument as in [26, Theorem 4]. The uniqueness of such a measure µ in B is an immediate consequence of (3.20). In fact, it is also possible to obtain an explicit formula for the invariant measure. To do so let µ0 be a reference probability measure (for example the invariant measure of the uncoupled system). Then each measure µ can be represented as µ = µ(1)µ0 + (H (µ − µ(1)µ0 ))q . q∈

It is then natural to consider the Banach space $\mathbb{C}\times\mathcal{B}$. In such a space a measure is represented by $(\mu(1), H(\mu - \mu(1)\mu_0))$. It is easily seen that one can define a covering dynamics by

$$S_m(a,\bar\mu) := \big(a,\ \mathcal{T}_{\varepsilon,m}\bar\mu + aH(T_\varepsilon^{*m}\mu_0 - \mu_0)\big).$$


G. Keller, C. Liverani

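Setting $\bar\nu := H(T_\varepsilon^{*m}\mu_0 - \mu_0)$, the equation $S_m(1,\bar\mu) = (1,\bar\mu)$ has the unique solution $\bar\mu = (\mathrm{Id} - \mathcal{T}_{\varepsilon,m})^{-1}\bar\nu$. This gives the following explicit representations of the invariant measure:15

$$\mu_\varepsilon = \mu_0 + \sum_{q\in\Lambda}\sum_{n=0}^{\infty}\big(\mathcal{T}_{\varepsilon,m}^{\,n}\bar\nu\big)_q = \mu_0 + \sum_{n=0}^{\infty}T_\varepsilon^{*nm}\big(T_\varepsilon^{*m}\mu_0 - \mu_0\big) = \lim_{n\to\infty}T_\varepsilon^{*n}\mu_0 . \tag{4.1}$$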
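The middle equality in (4.1) is a telescoping sum: the partial sum up to $N-1$ of $T^{*nm}(T^{*m}\mu_0 - \mu_0)$ equals $T^{*Nm}\mu_0 - \mu_0$. A toy analogue can make this concrete. In the sketch below a two-state Markov chain stands in for the transfer operator; the matrix `P`, the block length `m` and the starting vector are hypothetical choices and have nothing to do with the coupled map lattice itself.

```python
def push(mu, P):
    """One step of the dual dynamics: mu -> mu P (row vector times matrix)."""
    return [sum(mu[i] * P[i][j] for i in range(len(mu))) for j in range(len(P[0]))]


def push_n(mu, P, n):
    """Apply push n times."""
    for _ in range(n):
        mu = push(mu, P)
    return mu


def telescoped(mu0, P, m, N):
    """mu0 + sum_{n=0}^{N-1} T^{nm}(T^m mu0 - mu0), computed term by term."""
    term = [x - y for x, y in zip(push_n(mu0, P, m), mu0)]
    total = list(mu0)
    for _ in range(N):
        total = [s + t for s, t in zip(total, term)]
        term = push_n(term, P, m)
    return total


if __name__ == "__main__":
    P = [[0.9, 0.1], [0.5, 0.5]]   # toy stochastic matrix (assumption)
    mu0 = [1.0, 0.0]
    print(telescoped(mu0, P, m=2, N=50))
    print(push_n(mu0, P, 100))
```

Both printed vectors agree and are close to the stationary vector $(5/6, 1/6)$ of this toy chain, mirroring how the series in (4.1) reproduces $\lim_{n\to\infty}T_\varepsilon^{*n}\mu_0$.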

Remark 4.1. In particular, the above means that the linear operator Sm : C × B → C × B has a simple leading isolated eigenvalue 1 and a spectral gap. Uniqueness in Mv () follows by a standard approximation argument. Assume there exists an invariant measure µ˜ ∈ Mv (). By definition, given δ, η > 0, µ˜ can be approx

1

imated by a measure µδ,η such that |µ˜ − µδ,η | ≤ δ and Var µδ ≤ Cδ,η eη| | d for any ∈ I. Let ϕ be a function depending only on the variables belonging to a box 0 ∈ I, so that ϕ ◦ Tn depends only on the variables in the nw-neighborhood n of 0 . Then µ(ϕ) ˜ = T∗n (µ˜ − µδ,η )(ϕ) + T∗n (µδ,η − µ )(ϕ) + µ (ϕ).

(4.2)

We must prove that µ(ϕ) ˜ = µ (ϕ). As |T∗n (µ˜ − µδ,η )| ≤ |µ˜ − µδ,η | ≤ δ with an arbitrary δ > 0, it just remains to show that the second term can be made as small as we like by choosing η and n appropriate. Given a measure µ let µn be its marginal with respect to the box n . Given a measure µ on I n it is convenient to extend it to a measure µ on all I by simply tensoring it with the Lebesgue measure on the complement of n ; note that such an extension does not increase the bounded variation of the measure. With these conventions, T∗n (µδ,η − µ )(ϕ) = T∗n ([µδ,η,n ] − [µ,n ] )(ϕ).

(4.3)

Hence, by (3.20), 1 d

|T∗n (µδ,η − µ )(ϕ)| ≤ Cθ θ −|0 | σ n (1 + Cδ,η eη|n | ).

(4.4)

1

As |n | ≤ |0 | + (|0 | d + 2nw)d it follows that, for 2wη < | ln σ |, one can make this term arbitrarily small by choosing n large. Hence µ˜ = µ . Since in the case || < ∞ one has Mv () = M(), it follows that finite systems have only one measure absolutely continuous with respect to Lebesgue. If || = ∞ and ∈ I, let , (x∈ ) := (x∈ , 0 ∈ ). Then , is still an (a1 , a2 )-coupling with finite range w for the configuration space I and T, := , ◦ T0 gives a dynamics to which all our results apply. Calling µ its unique invariant measure absolutely continuous to Lebesgue one can prove, with essentially the same argument as before that, for each ϕ ∈ C 0 (),

lim |µ (ϕ) − [µ ] (ϕ)| = 0.

→

(4.5)

The exponential mixing in space and time can be obtained exactly as in [26].

Finally, our use of the term SRB for the measure $\mu_\varepsilon$ is justified by the fact that $\mu_\varepsilon$ enjoys the law of large numbers with respect to a vast class of initial measures related to Lebesgue and, in addition, is stable under smooth random perturbations, that is, the random perturbations have a unique invariant measure that converges to $\mu_\varepsilon$, see [26]. Normally, for finite systems, the criteria for defining the SRB measure are three (e.g., see [36]): the absolute continuity of the measure along the unstable manifolds, the law of large numbers with respect to Lebesgue for smooth observables, and the stability with respect to random perturbations. In the infinite case the situation is a bit more subtle since measures tend not to be absolutely continuous with respect to one another and, in our case, [15] shows that a lot of invariant measures can have marginals absolutely continuous with respect to Lebesgue. Yet, we have shown that, if some moderate regularity is required, then only one invariant measure with absolutely continuous marginals exists; moreover, this is the limit obtained by truncating the system to a finite size, as (4.5) shows. This, together with the fulfillment of the other two requirements, is, in our opinion, sufficient to attribute to $\mu_\varepsilon$ the qualification of SRB.

15 Note that all the limits below make sense when the measures are applied to local functions.

5. Short Range

Since now the range is infinite it is necessary to decompose the interaction according to space scales, the point being that the interaction on larger scales is smaller and smaller in the weak norm but no control is available on its variation, hence it is necessary to wait longer and longer times for the dynamics to act effectively on it. This forces a more complex bookkeeping mechanism which is reflected in the necessity of a larger covering Banach space that, with a slight abuse of notation, we will still call $\mathcal{B}$. Let $S$ be a positive integer to be fixed later. We define the Banach space

$$\mathcal{B} := \{\bar\mu := (\mu_{q,t,l}) : q\in\Lambda;\ t\in\mathbb{N}_S := \mathbb{N}\setminus\{1,\dots,S\};\ l\in\{0,\dots,t\},\ \mu_{q,t,l}\in B_q\}$$

together with the norm

$$\|\bar\mu\| := \sup_{q\in\Lambda}\ \sup_{t\in\mathbb{N}_S}\ \sup_{l\in\{0,\dots,t\}}\Big(\rho^t\alpha^l\operatorname{Var}\mu_{q,t,l} + \rho^{-t}\alpha^{l-t}|\mu_{q,t,l}|\Big), \tag{5.1}$$

for some constants α, ρ ∈ (0, 1) to be fixed later. Pictorially, one can imagine the above space as a collection of towers at each site q ∈ , where the tower t has height t and the index l denotes the l th floor in this tower. We can now define the lifted linear dynamics

if l > 0 µq,t,l−1 ¯ q,t,l := (T µ) (5.2) ∗m(s+1) ∗ µp,s,s if l = 0, {(p,s):τ (q−p,s)=t} p,q T where

τ (q − p, s) :=

0 if |q − p| ≤ s 2 + S |q − p| if |q − p| > s 2 + S .

Roughly speaking, within each tower s at site p the operator T pushes each measure one floor up, except for the measure at the top level, which is first transformed according to the dynamics of the whole tower and then distributed (by means of the telescoping operators ∗p,q ) to the ground levels of towers at sites q in the following way: If q is close to p (in the sense |q − p| ≤ s 2 + S) the corresponding measure is mapped to the


tower of height t = 0, whereas if q is farther away from p, it is mapped to the tower of height |q − p|. To relate the dynamics of the linear system (T , B ) with that of the operator T∗m (for an integer m to be fixed later) we introduce the space Bw (θ ) as the completion of B with respect to the weak norm |µ|θ := sup

θ |1 | |µ(ϕ)| .

sup

1 ∈I |ϕ|C 0 (I 1 ) ≤1

Then we define

: B → B , 0

(µ)q,t,l =

∗0,q µ 0

if t = l = 0 otherwise ,

and P : B → Bw (θ ),

P µ¯ =

T∗ml µq,t,l .

q,t,l n

n

It is easy to check that P T = T∗mn P , and hence P T = T∗mn , for each n ∈ N, so that the linear system (T , B ) is indeed an extension of T∗m : B 0 → Bw (θ ).16 The following lemma is the main result of this section. Lemma 5.1. If τ is Lipschitz and is a short range coupling (see Sect. 2 for this terminology), then there exist σ ∈ (0, 1) and C, 2 > 0 such that, for all || < 2 , µ ∈ B 0 (), θ ∈ (0, 1), and n ∈ N holds true, |T∗n µ|θ ≤ Cσ n min{||, |e ln θ |−1 } Var µ.

(5.3)

Proof. As in the case of a finite coupling range, estimate (5.3) follows from the fact that $\mathcal{T}$ is a strict contraction on $\mathcal{B}$. Namely, we will show that there are $m\in\mathbb{N}$, $\sigma\in(0,1)$, and $\varepsilon_2>0$ such that for all $\bar\mu\in\mathcal{B}$

$$\|\mathcal{T}\bar\mu\| = \sup_{q,t,l}|(\mathcal{T}\bar\mu)_{q,t,l}|_{t,l} \le \sigma^m\|\bar\mu\| , \tag{5.4}$$

where $|\mu|_{t,l} := \rho^t\alpha^l\operatorname{Var}\mu + \rho^{-t}\alpha^{l-t}|\mu|$, see (5.1). The case $l > 0$ is easy: for all $q$ and $t$ we have

$$|(\mathcal{T}\bar\mu)_{q,t,l}|_{t,l} = |\mu_{q,t,l-1}|_{t,l} = \alpha|\mu_{q,t,l-1}|_{t,l-1} \le \alpha\|\bar\mu\| \le \sigma^m\|\bar\mu\| , \tag{5.5}$$

where $m > 0$ and $\sigma\in(\alpha^{1/m},1)$ will be determined in the course of the proof.

So we assume from now on that $l = 0$ and start with the case $t = 0$. $(\mathcal{T}\bar\mu)_{q,0,0}$ is given by the sum in (5.2) that ranges over indices $p$ and $s$. We begin with the contributions for $s = 0$. Without loss of generality we may assume that $m$ is even. By $C$ we denote any constant that may depend on the “ingredients” of the system (like $a$, $b$, $\lambda$, etc.) but which is independent of any constant that is to be fixed during the proof (i.e. $S$, $\alpha$, $\rho$, $m$, $\sigma$, $\varepsilon_1$). A crucial choice will be $m = \delta S$ for some $\delta > 0$ to be fixed later.

16 Observe that, since $\sum_{s\in\mathbb{N}_S,\,0\le l\le s}\rho^{-s}\alpha^{-l}$ does not converge, it is not true that $P\bar\mu\in B(\theta)$ for each $\bar\mu\in\mathcal{B}$, while $\|P\|_{\mathcal{B}\to B_w(\theta)} \le (1-\rho)^{-1}(1-\alpha)^{-1}|e\ln\theta|^{-1}$.

Then |∗p,q T∗m µp,0,0 |0,0

= Var(∗p,q T∗m µp,0,0 ) + |∗p,q T∗m µp,0,0 | ≤ 2 Var(T∗m µp,0,0 ) + 2|T∗m µp,0,0 | ∗m 2

≤ 2aλ− 2 Var(T m

∗m 2

µp,0,0 ) + 2(b + 1)|T

µp,0,0 | m

−m 2

≤ Cλ Var(µp,0,0 ) + C||m Var(µp,0,0 ) + Cσ02 Var(µp,0,0 ) m ≤ σ1 Var(µp,0,0 ) 1

(σ1 := max{λ−1 , σ0 } 4 )

≤ σ1m µ ¯

for sufficiently large m and || < (m), where we used essentially the same arguments that lead already to (3.18). Summing this over all p for which τ (q − p, 0) = t = 0 means that one has to sum over all p for which |q − p| ≤ S:

{p:τ (q−p,0)=0}

∗p,q T∗m µp,0,0

0,0

≤ CS d σ1m µ ¯ ≤

σm µ ¯ 2

(5.6)

for a suitable σ ∈ (σ1 , 1), provided m = δS is sufficiently large. Next we estimate the contributions for s = 0 to the sum in (5.2) when l = 0 and t = 0. In this case |∗p,q T∗m(s+1) µp,s,s |0,0 = Var(∗p,q T ∗m(s+1) µp,s,s ) + |∗p,q T ∗m(s+1) µp,s,s | ≤ 2 Var(T ∗m(s+1) µp,s,s ) + 2|T ∗m(s+1) µp,s,s | ≤ 2aλ−m(s+1) Var(µp,s,s ) + 2(1 + b)|µp,s,s | ≤ [2aλ−m(s+1) ρ −s α −s + 2(1 + b)ρ s ]|µp,s,s |s,s . As s > S and |q − p| ≤ s 2 + S in the case under consideration, we conclude ∗p,q T∗m(s+1) µp,s,s {(p,s):s∈NS \{0},τ (q−p,s)=0}

≤C

∞

0,0

(s 2 + S)d [2aλ−m(s+1) ρ −s α −s + 2(1 + b)ρ s ]µ ¯

s=S+1

¯ ≤ ≤ Cσ2S µ

σm µ ¯ 2

(5.7)

for suitable σ2 ∈ (ρ, 1) and σ ∈ (σ2 , 1), provided λ−m < αρ 2 and m = δS is sufficiently large. We finally turn to the case l = 0 and t = 0. For this we will need the following estimate: There are β ∈ (0, 1) and δ > 0 such that δ/2

|∗p,q T∗m(s+1) µp,s,s | ≤ Cm β |q−p| Var(µp,s,s ), provided m(s + 1) < δ|q − p|. The proof will be given below.

(5.8)


Now, since t = 0, the condition τ (q − p, s) = t in the summation in (5.2) means that t = |q − p| > s 2 + S. In particular, as m = δS, we have m(s + 1) = δS(s + 1) < δ(s 2 + S) < δ|q − p| so that (5.8) is applicable. Therefore |∗p,q T∗m µp,s,s |t,0 = ρ t Var(∗p,q T∗m(s+1) µp,s,s ) + ρ −t α −t |∗p,q T∗m(s+1) µp,s,s | ≤ 2aρ t λ−m(s+1) Var(µp,s,s ) + 2bρ t |µp,s,s | + ρ −t α −t Cm β |q−p| Var(µp,s,s ) ≤ 2aρ t (ραλm )−s + 2bρ s ρ t + (ρα)−s−t Cm β |q−p| |µp,s,s |s,s . √ Hence, observing that t = |q − p| > S and s < t, ∗p,q T∗m µp,s,s ≤

{(p,s):τ (q−p,s)=t} √ t t d

Ct

t,0

ρ (ραλm )−s + ρ s ρ t + (ρα)−s−t Cm β t µ ¯

(5.9)

s=0

≤ Cσ3S µ ¯ ≤ σ m µ, ¯ 1

for suitable σ3 ∈ (ρ, 1) and σ ∈ (σ32δ , 1), provided αρ 2 > β and S = δ −1 m is sufficiently large. Putting together (5.5), (5.6), (5.7), and (5.9) yields T µ ¯ ≤ σ m µ ¯ for some m > 0 and σ ∈ (0, 1). This concludes the proof of Lemma 5.1.

m(s+1) by a map Proof of estimate (5.8). The basic idea of the proof is to approximate T T˜s = T˜,p,q,m(s+1) with the property that ∂p ((T˜s )q ) = 0 if |q − p| ≥ |q − p|. To this end recall the map ι¯p : I → I from Sect. 3, (¯ιp (x))q = xq if q = p and (¯ιp (x))q = 0 if q = p. Then define

m(s+1) (T (x))q ˜ (Ts (x))q = m(s+1) (T (¯ιp (x)))q

if |q − p| < c−2 |q − p| if |q − p| ≥ c−2 |q − p| .

Note first that (p,q ϕ)(T˜s (x)) is constant as a function of xp because c ≥ 1. It follows that ∗p,q T˜s∗ µ = 0 if µ ∈ Bp . Hence, recalling that µp,s,s ∈ Bp , we see that |∗p,q T∗m(s+1) µp,s,s | = |∗p,q (T∗m(s+1) − T˜s∗ )µp,s,s | ≤ 2|(T∗m(s+1) − T˜s∗ )µp,s,s | . The latter quantity can be bounded using Lemma 3.2b. To this end let us check the hypotheses of that lemma.

0 if |q − p| < c−2 |q − p| |(T˜s (x) − Tm(s+1) (x))q | ≤ m(s+1) |(DT )q p |∞ if |q − p| ≥ c−2 |q − p|.

To estimate the derivative notice that $0\le|(D\Phi_\varepsilon)_{q'p}|\le\delta_{q'p}+2L|\varepsilon|\gamma^{|q'-p|}$ so that $0\le|(D\Phi_\varepsilon)_{q'p}|^{1/2}\le\delta_{q'p}+\sqrt{2L|\varepsilon|}\,\gamma^{\frac12|q'-p|} =: (\mathrm{Id}+\sqrt{2L|\varepsilon|}\,B)_{q'p}$. Hence, setting $\lambda_+ := |\tau'|_\infty$, by the triangular inequality

$$|(DT_\varepsilon^n)_{q'p}| \le \lambda_+^n\Big\{\big([\mathrm{Id}+\sqrt{2L|\varepsilon|}\,B]^n\big)_{q'p}\Big\}^2 .$$

Using a Cramér type estimate as in [27] this leads to the following bound for $K_3$: let $n = m(s+1)$ and $r = |q-p|$ so that $n < \delta r$. Then, for any $t > 0$,

$$\begin{aligned}
\sum_{q'}\big|(\tilde T_s(x) - T_\varepsilon^{m(s+1)}(x))_{q'}\big|^{1/2}
&\le \sum_{|q'-p|\ge c^{-2}r}\lambda_+^{n/2}\big([\mathrm{Id}+\sqrt{2L|\varepsilon|}\,B]^n\big)_{q'p}\\
&\le \sum_{|q'-p|\ge c^{-2}r}\lambda_+^{\delta r/2}\big([\mathrm{Id}+\sqrt{2L|\varepsilon|}\,B]^{\delta r}\big)_{q'p}\,e^{t|q'-p|-tc^{-2}r}\\
&\le \big(\lambda_+^{\delta/2}e^{-tc^{-2}}\big)^r\sum_{q'\in\mathbb{Z}^d}\big([\mathrm{Id}+\sqrt{2L|\varepsilon|}\,B]^{\delta r}\big)_{q'0}\,e^{t|q'|}\\
&\le \big(\lambda_+^{\delta/2}\psi(t)^\delta e^{-tc^{-2}}\big)^r =: \beta^r = \beta^{|q-p|},
\end{aligned} \tag{5.10}$$

where $\psi(t) := \sum_{q'\in\mathbb{Z}^d}\big(\delta_{q'0}+\sqrt{2L|\varepsilon|}\,\gamma^{\frac12|q'|}\big)e^{t|q'|}$. Clearly, $|\psi(t)| < \infty$ for $t\in(0,\tfrac12|\ln\gamma|)$. Hence, if we fix such a $t$ and choose $\delta > 0$ sufficiently small, then $\beta\in(0,1)$. (These choices are uniform for $\varepsilon$ in a neighbourhood of 0.) The proof that $K_2$ can be taken to be some fixed constant (depending on $m$ but not on $p$ and $q$) is completely standard and is left to the reader. Accordingly, Lemma 3.2 bounds the left-hand side of (5.8) by $C_m\beta^{|p-q|}\operatorname{Var}(\mu_{p,s,s})$, and that is (5.8).

Lemma 5.1 and (5.3) are the equivalent of Lemma 2.3 and (3.19), which were the basic ingredients to prove Theorem 2.1 in the finite range case. These results can now be used in a similar way to obtain the corresponding result in the short range case.

Proof of Theorem 2.4. The proof follows that of Theorem 2.1; let us outline the main points. The uniqueness of the invariant measure in $B$ follows trivially from Lemma 5.1. On the other hand, the approximation argument is now more subtle since one can no longer use the finite range property in (4.3). Nevertheless, using large deviation type estimates like in the proof of Eq. (5.8) one can show that (4.3) continues to hold modulo another small error term. The same remarks apply to obtaining the spatial decay of correlations out of the temporal ones: again one has to treat explicitly very long range effects by showing that they produce a very small contribution. Finally, the reasons to call the above invariant measure SRB remain unchanged from the finite range case.
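The claim in the proof of estimate (5.8) that $\beta = \lambda_+^{\delta/2}\psi(t)^\delta e^{-tc^{-2}}$ lies in $(0,1)$ once $t$ is fixed and $\delta$ is small can be checked numerically. The sketch below is restricted to lattice dimension $d = 1$, where the sum defining $\psi(t)$ is a geometric series; all parameter values ($L$, $\varepsilon$, $\gamma$, $\lambda_+$, $c$) are hypothetical.

```python
import math


def psi(t, L, eps, gamma):
    """psi(t) for the lattice Z (d = 1), using the closed form of the series.

    psi(t) = sum_q (delta_{q0} + sqrt(2 L |eps|) gamma^{|q|/2}) e^{t|q|}
    converges only if x := sqrt(gamma) * e^t < 1, i.e. t < |ln gamma| / 2.
    """
    x = math.sqrt(gamma) * math.exp(t)
    if x >= 1.0:
        raise ValueError("need t < |ln gamma| / 2 for convergence")
    lattice_sum = 1.0 + 2.0 * x / (1.0 - x)   # sum over q in Z of x^{|q|}
    return 1.0 + math.sqrt(2.0 * L * abs(eps)) * lattice_sum


def beta(t, delta, lam_plus, c, L, eps, gamma):
    """beta = lam_plus^(delta/2) * psi(t)^delta * exp(-t / c^2)."""
    return lam_plus ** (delta / 2.0) * psi(t, L, eps, gamma) ** delta * math.exp(-t / c ** 2)


if __name__ == "__main__":
    params = dict(lam_plus=2.0, c=1.0, L=1.0, eps=0.01, gamma=0.25)
    for delta in (1.0, 0.5, 0.1, 0.01):
        print(delta, beta(0.3, delta, **params))
```

As $\delta \to 0$ the first two factors tend to 1, so $\beta$ approaches $e^{-tc^{-2}} < 1$; for large $\delta$ the bound is useless, which is why the proof requires $m(s+1) < \delta|q-p|$ with $\delta$ small.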

48

G. Keller, C. Liverani

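6. Appendix

Proof of Lemma 3.2. It suffices to estimate $(\tilde F^*\nu - F^*\nu)(\varphi)$ for a test function $\varphi$ with $|\varphi|_{C^0(\Omega)}\le 1$:

$$\begin{aligned}
(\tilde F^*\nu - F^*\nu)(\varphi) &= \int\big[\varphi(\tilde Fx)-\varphi(Fx)\big]\,d\nu(x) = \int\int_0^1\frac{\partial}{\partial t}\big(\varphi(F_tx)\big)\,dt\,d\nu(x)\\
&= \int_0^1\sum_{q\in\Lambda}\int\partial_q\varphi(F_tx)\,\frac{\partial}{\partial t}F_{t,q}(x)\,d\nu(x)\,dt\\
&= \sum_{q\in\Lambda}\int_0^1\Big(F_t^*\big((\tilde F_q-F_q)\cdot\nu\big)\Big)(\partial_q\varphi)\,dt
\end{aligned}$$

so that

$$\begin{aligned}
|\tilde F^*\nu - F^*\nu| &\le K_2\sum_{q\in\Lambda}\operatorname{Var}\big((\tilde F_q-F_q)\cdot\nu\big)\\
&\le K_2\sum_{q\in\Lambda}\Big(|\tilde F_q-F_q|_\infty + \sup_{p\in\Lambda}\sup_{x_{\ne p}}\int_I\big|\partial_p\tilde F_q(x_{\ne p},\xi)-\partial_pF_q(x_{\ne p},\xi)\big|\,d\xi\Big)\operatorname{Var}\nu\\
&\le K_2(K_0+K_1)\operatorname{Var}\nu .
\end{aligned}$$

This proves part a) of the lemma.

The above estimate is, in some sense, too good for our needs in Sect. 5, where it may be hard to verify that $K_1 < \infty$. It is then convenient to have a rougher estimate. To this end let us define the function $R\in C^0(\mathbb{R}^\Lambda,\Omega)$ by

$$R(x)_q = \begin{cases} 0 & \text{if } x_q < 0\\ x_q & \text{if } x_q\in[0,1]\\ 1 & \text{if } x_q > 1, \end{cases}$$

and let $\bar\varphi := \varphi\circ R$. Next, define $\kappa,\kappa_\eta : \mathbb{R}\to[0,\infty)$, $\kappa(y) := \max\{1-|y|,0\}$ and $\kappa_\eta(y) := \eta^{-1}\kappa(\eta^{-1}y)$. For each $\Lambda_1\in\mathcal{I}$, we introduce, for each $\bar\eta = (\eta_q)_{q\in\Lambda}$, the convolution operators $Q_{\bar\eta,\Lambda_1}$ on $C^0(\Omega)$:

$$(Q_{\bar\eta,\Lambda_1}\varphi)(x) := \int_{\mathbb{R}^{\Lambda_1}}\prod_{p\in\Lambda_1}\kappa_{\eta_p}(x_p-y_p)\,\bar\varphi(x_{\notin\Lambda_1},y)\,dy .$$

Not surprisingly, the following estimate holds:

$$|Q^*_{\bar\eta,\Lambda_1}\nu - \nu| \le \frac13\sum_{q\in\Lambda_1}\eta_q\operatorname{Var}\nu . \tag{6.1}$$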


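In fact,

$$\begin{aligned}
\nu(Q_{\bar\eta,\Lambda_1}\varphi-\varphi) &= \int\int_{\mathbb{R}^{\Lambda_1}}\prod_{p\in\Lambda_1}\kappa_{\eta_p}(x_p-y_p)\big[\bar\varphi(x_{\notin\Lambda_1},y)-\bar\varphi(x)\big]\,dy\,\nu(dx)\\
&= \int_0^1dt\int\int_{\mathbb{R}^{\Lambda_1}}\prod_{p\in\Lambda_1}\kappa_{\eta_p}(x_p-y_p)\,\frac{d}{dt}\bar\varphi\big(x_{\notin\Lambda_1},x_{\in\Lambda_1}+t(y-x_{\in\Lambda_1})\big)\,dy\,\nu(dx)\\
&= \int_0^1dt\int\int_{\mathbb{R}^{\Lambda_1}}\prod_{p\in\Lambda_1}\kappa_{\eta_p}(z_p)\sum_{q\in\Lambda_1}\partial_q\bar\varphi\big(x_{\notin\Lambda_1},x_{\in\Lambda_1}+tz\big)\,z_q\,dz\,\nu(dx)\\
&= \int_0^1dt\int_{\mathbb{R}^{\Lambda_1}}dz\sum_{q\in\Lambda_1}z_q\prod_{p\in\Lambda_1}\kappa_{\eta_p}(z_p)\,\nu(\partial_q\bar\varphi_{t,z}),
\end{aligned}$$

where $\bar\varphi_{t,z}(x) := \bar\varphi(x_{\notin\Lambda_1},x_{\in\Lambda_1}+tz)$. From the above formula estimate (6.1) follows, because $\int_{\mathbb{R}}|z|\kappa_\eta(z)\,dz = \frac{\eta}{3}$. Accordingly, if $\sum_{q\in\Lambda}\eta_q < \infty$ we can define

$$Q_{\bar\eta}\varphi := \lim_{\Lambda_1\to\Lambda}Q_{\bar\eta,\Lambda_1}\varphi .$$

Then

$$\begin{aligned}
|F^*\nu-\tilde F^*\nu| &\le \frac23K_2\sum_{q\in\Lambda}\eta_q\operatorname{Var}\nu + \sup_{|\varphi|\le1}\nu\big((Q_{\bar\eta}\varphi)\circ F-(Q_{\bar\eta}\varphi)\circ\tilde F\big)\\
&\le \frac23K_2\sum_q\eta_q\operatorname{Var}\nu + |\nu|\sum_q\eta_q^{-1}|F_q-\tilde F_q|_\infty .
\end{aligned}$$

Now, for all the $q$ for which $|F_q-\tilde F_q|_\infty \ne 0$, choose $\eta_q = K_2^{-1/2}|F_q-\tilde F_q|_\infty^{1/2}$ to get

$$|F^*\nu-\tilde F^*\nu| \le 2K_2^{1/2}\operatorname{Var}\nu\sum_q|F_q-\tilde F_q|_\infty^{1/2} \le 2K_2^{1/2}K_3\operatorname{Var}\nu .$$

This finishes the proof of part b) of the lemma.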
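Two elementary facts used in the proof of part b) can be verified numerically: the first moment $\int_{\mathbb{R}}|z|\kappa_\eta(z)\,dz = \eta/3$ of the tent kernel, and the effect of the choice $\eta_q = K_2^{-1/2}|F_q-\tilde F_q|_\infty^{1/2}$ on a single term of the bound. The following is a minimal sketch; the names `first_moment` and `single_site_bound`, the midpoint quadrature, and the sample values of $K_2$ and of the distance `D` (standing for $|F_q-\tilde F_q|_\infty$) are assumptions made for illustration.

```python
import math


def kappa_eta(y, eta):
    """Rescaled tent kernel: eta^{-1} * max(1 - |y|/eta, 0)."""
    return max(1.0 - abs(y) / eta, 0.0) / eta


def first_moment(eta, n=20000):
    """Midpoint-rule approximation of the integral of |z| * kappa_eta(z) over [-eta, eta]."""
    h = 2.0 * eta / n
    total = 0.0
    for k in range(n):
        z = -eta + (k + 0.5) * h
        total += abs(z) * kappa_eta(z, eta)
    return total * h


def single_site_bound(K2, D):
    """(2/3) K2 eta + D / eta evaluated at eta = sqrt(D / K2)."""
    eta = math.sqrt(D / K2)
    return (2.0 / 3.0) * K2 * eta + D / eta


if __name__ == "__main__":
    print(first_moment(0.7), 0.7 / 3.0)
    print(single_site_bound(4.0, 0.09), 2.0 * math.sqrt(4.0 * 0.09))
```

With this choice of $\eta$ the two competing terms are of the same order, $\tfrac23\sqrt{K_2D}$ and $\sqrt{K_2D}$, and their sum $\tfrac53\sqrt{K_2D}$ stays below the constant $2\sqrt{K_2D}$ appearing in the final estimate (using $|\nu|\le\operatorname{Var}\nu$ as in the displayed bound).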

References

1. Baladi, V., Degli Esposti, M., Isola, S., Järvenpää, E., Kupiainen, A.: The spectrum of weakly coupled map lattices. J. Math. Pures Appl. 77, 539–584 (1998)
2. Baladi, V., Rugh, H.H.: Floquet spectrum of weakly coupled map lattices. Commun. Math. Phys. 220, 561–582 (2001)
3. Bardet, J.-B.: Limit theorems for coupled analytic maps. Probab. Th. Rel. Fields 124, 151–177 (2002)
4. Bricmont, J., Kupiainen, A.: Coupled analytic maps. Nonlinearity 8(3), 379–396 (1995)
5. Bricmont, J., Kupiainen, A.: High temperature expansions and dynamical systems. Commun. Math. Phys. 178(3), 703–732 (1996)
6. Bunimovich, L.A.: Coupled map lattices: one step forward and two steps back. In: Chaos, order and patterns: aspects of nonlinearity - the “gran finale” (Como, 1993). Phys. D 86(1-2), 248–255 (1995)
7. Bunimovich, L.A., Sinai, Ya.G.: Space-time chaos in coupled map lattices. Nonlinearity 1, 491–516 (1988)
8. Fischer, T., Rugh, H.H.: Transfer operators for coupled analytic maps. Ergod. Th. & Dynam. Sys. 20, 109–143 (2000)
9. Gielis, G., MacKay, R.S.: Coupled map lattices with phase transition. Nonlinearity 13, 867–888 (2000)
10. Gundlach, V.M., Rand, D.A.: Spatio-temporal chaos: 1. Hyperbolicity, structural stability, spatiotemporal shadowing and symbolic dynamics. Nonlinearity 6, 165–200 (1991)
11. Gundlach, V.M., Rand, D.A.: Spatio-temporal chaos: 2. Unique Gibbs states for higher-dimensional symbolic systems. Nonlinearity 6, 201–214 (1993)
12. Gundlach, V.M., Rand, D.A.: Spatio-temporal chaos: 3. Natural spatio-temporal measures for coupled circle map lattices. Nonlinearity 6, 215–230 (1993)
13. Gundlach, V.M., Rand, D.A.: Spatio-temporal chaos (Corrigendum). Nonlinearity 9, 605–606 (1996)
14. Järvenpää, E.: A note on weakly coupled expanding maps on compact manifolds. Annales Academiæ Scientiarum Fennicæ Mathematica 24, 511–517 (1999)
15. Järvenpää, E., Järvenpää, M.: On the definition of SRB-measures for coupled map lattices. Commun. Math. Phys. 220(1), 1–12 (2001)
16. Jiang, M.: Equilibrium states for lattice models of hyperbolic type. Nonlinearity 8(5), 631–659 (1995)
17. Jiang, M.: Equilibrium measures for coupled map lattices: existence, uniqueness and finite-dimensional approximations. Commun. Math. Phys. 193, 675–712 (1998)
18. Jiang, M.: Sinai-Ruelle-Bowen measures for lattice dynamical systems. J. Stat. Phys. 111(3-4), 863–902 (2003)
19. Jiang, M., Mazel, A.E.: Uniqueness and exponential decay of correlations for some two-dimensional spin lattice systems. J. Stat. Phys. 82(3-4), 797–821 (1996)
20. Jiang, M., Pesin, Ya.: Equilibrium measures for coupled map lattices: existence, uniqueness and finite-dimensional approximations. Commun. Math. Phys. 193(3), 675–711 (1998)
21. Keller, G.: Coupled map lattice via transfer operators on functions of bounded variation. In: Stochastic and spatial structures of dynamical systems (Amsterdam, 1995), Konink. Nederl. Akad. Wetensch. Verh. Afd. Natuurk. Eerste Reeks 45, Amsterdam: North-Holland, 1996, pp. 71–80
22. Keller, G.: Mixing for finite systems of coupled tent maps. Tr. Mat. Inst. Steklova 216, pp. 320–326 (1997), Din. Sist. i Smezhnye Vopr.; translation in Proc. Steklov Inst. Math. 1997, no. 1, 315–332 (1997)
23. Keller, G.: An ergodic theoretic approach to mean field coupled maps. In: Fractal geometry and stochastics, II (Greifswald/Koserow, 1998), Progr. Probab., 46, Basel: Birkhäuser, 2000, pp. 183–208
24. Keller, G., Künzle, M.: Transfer operators for coupled map lattices. Ergodic Theory Dynam. Systems 12(2), 297–318 (1992)
25. Keller, G., Liverani, C.: Coupled map lattices without cluster expansion. Discrete and Continuous Dynamical Systems 11, n.2,3, 325–335 (2004)
26. Keller, G., Liverani, C.: A spectral gap for a one-dimensional lattice of coupled piecewise expanding interval maps. Lecture Notes in Physics 671, 115–151 (2005)
27. Keller, G., Zweimüller, R.: Unidirectionally coupled interval maps: between dynamics and statistical mechanics. Nonlinearity 15(1), 1–24 (2002)
28. Künzle, M.: Invariante Maße für gekoppelte Abbildungsgitter. Dissertation, Universität Erlangen, 1993
29. Maes, Ch., van Moffaert, A.: Stochastic stability of weakly coupled map lattices. Nonlinearity 10, 715–730 (1997)
30. Miller, J., Huse, D.A.: Macroscopic equilibrium from microscopic irreversibility in a chaotic coupled-map lattice. Phys. Rev. E 48, 2528–2535 (1993)
31. Pesin, Ya.B., Sinai, Ya.G.: Space-time chaos in chains of weakly interacting hyperbolic mappings. Adv. Sov. Math. 3, 165–198 (1991)
32. Rugh, H.H.: Coupled maps and analytic function spaces. Ann. Sci. École Norm. Sup. (4) 35(4), 489–535 (2002)
33. Schmitt, M.: BV-spectral theory for coupled map lattices. Dissertation, Universität Erlangen (2003). See also: Nonlinearity 17, 671–690 (2004)
34. Volevich, D.L.: Kinetics of coupled map lattices. Nonlinearity 4, 37–45 (1991)
35. Volevich, D.L.: Construction of an analogue of Bowen-Sinai measure for a multidimensional lattice of interacting hyperbolic mappings. Russ. Acad. Math. Sbornik 79, 347–363 (1994)
36. Young, L.-S.: What are SRB measures, and which dynamical systems have them. Dedicated to David Ruelle and Yasha Sinai on the occasion of their 65th birthdays. J. Stat. Phys. 108(5-6), 733–754 (2002)

Communicated by A. Kupiainen

Commun. Math. Phys. 262, 51–89 (2006) Digital Object Identifier (DOI) 10.1007/s00220-005-1425-3


Toric Geometry, Sasaki–Einstein Manifolds and a New Infinite Class of AdS/CFT Duals

Dario Martelli1, James Sparks2

1 Department of Physics, CERN Theory Division, 1211 Geneva 23, Switzerland. E-mail: [email protected]
2 Department of Mathematics and Jefferson Physical Laboratory, Harvard University, Cambridge, MA 02318, U.S.A. E-mail: [email protected]

Received: 15 December 2004 / Accepted: 15 March 2005 Published online: 24 November 2005 – © Springer-Verlag 2005

Abstract: Recently an infinite family of explicit Sasaki–Einstein metrics $Y^{p,q}$ on $S^2\times S^3$ has been discovered, where $p$ and $q$ are two coprime positive integers, with $q < p$. These give rise to a corresponding family of Calabi–Yau cones, which moreover are toric. Aided by several recent results in toric geometry, we show that these are Kähler quotients $\mathbb{C}^4//U(1)$, namely the vacua of gauged linear sigma models with charges $(p, p, -p+q, -p-q)$, thereby generalising the conifold, which is $p = 1$, $q = 0$. We present the corresponding toric diagrams and show that these may be embedded in the toric diagram for the orbifold $\mathbb{C}^3/\mathbb{Z}_{p+1}\times\mathbb{Z}_{p+1}$ for all $q < p$ with fixed $p$. We hence find that the $Y^{p,q}$ manifolds are AdS/CFT dual to an infinite class of $N = 1$ superconformal field theories arising as IR fixed points of toric quiver gauge theories with gauge group $SU(N)^{2p}$. As a non-trivial example, we show that $Y^{2,1}$ is an explicit irregular Sasaki–Einstein metric on the horizon of the complex cone over the first del Pezzo surface. The dual quiver gauge theory has already been constructed for this case and hence we can predict the exact central charge of this theory at its IR fixed point using the AdS/CFT correspondence. The value we obtain is a quadratic irrational number and, remarkably, agrees with a recent purely field theoretic calculation using a-maximisation.

1. Introduction and Summary

The AdS/CFT correspondence [1] predicts that type IIB string theory on $AdS_5\times Y_5$, with appropriately chosen self-dual five-form flux, is dual to an $N = 1$ four-dimensional superconformal field theory whenever $Y_5$ is Sasaki–Einstein [2–5]. This latter condition may be defined as saying that the metric cone over $Y_5$

$$ds^2(C(Y_5)) = dr^2 + r^2\,ds^2(Y_5) \tag{1.1}$$

is Ricci-flat Kähler, i.e. Calabi–Yau. The superconformal field theory may be thought of as arising from a stack of D3-branes sitting at the tip of the Calabi–Yau cone. Notice


that unless $Y_5$ is the round metric on $S^5$, appropriately normalised, the tip of the cone at $r = 0$ will be singular. It is a striking fact that, until very recently, the only Sasaki–Einstein five-manifolds that were known explicitly in the literature1 were precisely the round metric on $S^5$ and the homogeneous metric $T^{1,1}$ on $S^2\times S^3$, or quotients thereof. For the five-sphere the Calabi–Yau cone is simply $\mathbb{C}^3$ and the dual superconformal field theory is the maximally supersymmetric $N = 4$ $SU(N)$ theory. For $T^{1,1}$ the Calabi–Yau cone is the conifold and the dual $N = 1$ superconformal field theory was given in [3, 5]. Due to the rather limited number of examples in the literature, detailed tests of the AdS/CFT conjecture for more interesting geometries have been lacking2. Indeed, one is restricted to quotients (orbifolds) of $S^5$ and $T^{1,1}$. These have been extensively studied using orbifold techniques which by now are completely standard. For example, Klebanov and Witten argued that the field theory for $T^{1,1}$ may be obtained via a relevant deformation of the $N = 2$ orbifold $S^5/\mathbb{Z}_2$.

However, this has changed drastically with the recent discovery [6] of a countably infinite class of explicit Sasaki–Einstein metrics on $Y^{p,q}\cong S^2\times S^3$. These were initially found by reduction and T-duality of a class of supersymmetric M-theory solutions discovered in [7]. The family is characterised by two relatively prime positive integers $p$, $q$, with $q < p$. A particularly interesting feature of these Sasaki–Einstein manifolds is that there are countably infinite classes which are both quasi-regular and irregular. These terms are not to be confused with regularity of the metric: the metrics are all smooth metrics on $S^2\times S^3$. Rather, they refer to properties of the orbits of a certain Killing vector field. Indeed, on any Sasaki–Einstein manifold $Y$ there exists a canonically defined Killing vector field $K$, called the Reeb vector in the mathematics literature. The orbits of this Killing vector field may or may not close. If they close then there is a (locally free) $U(1)$ action on $Y$ and such Sasaki–Einstein manifolds are called quasi-regular. The geometries $Y^{p,q}$ with $4p^2-3q^2$ a square are examples of such manifolds. If the orbits of the Reeb vector field do not close, the Killing vector generates an action of $\mathbb{R}$ on $Y$, with the orbits densely filling the orbits of a torus, and the Sasaki–Einstein manifold is said to be irregular. The geometries $Y^{p,q}$ with $4p^2-3q^2$ not a square are the first examples of such geometries in the literature3. Another interesting feature of these metrics is that the volumes are always given by a quadratic irrational number times the volume of the round metric on $S^5$ – recall a quadratic irrational is of the form $a + b\sqrt{c}$, where $a, b\in\mathbb{Q}$, $c\in\mathbb{N}$. Moreover, the volumes are rationally related to that of $S^5$ if and only if the Sasaki–Einstein manifold is quasi-regular.

Recall that all four-dimensional $N = 1$ superconformal field theories possess an R-symmetry, commonly referred to as the $U(1)$ R-symmetry. However, crucially this symmetry is not always a $U(1)$ symmetry – this is true only if the R-charges of all the fields are rational. In general, this is not true, as exemplified by the recent work of [9]. In the latter reference it is shown that the exact R-symmetry of a superconformal field theory maximises a certain combination of 't Hooft anomalies $a_{\mathrm{trial}}(R) = (9\,\mathrm{Tr}R^3 - 3\,\mathrm{Tr}R)/32$.

1 E. Calabi has constructed an explicit Kähler–Einstein metric on del Pezzo 6 – recall that this is the blow-up of $\mathbb{CP}^2$ at 6 points – with a certain symmetric configuration of the 6 blown-up points. The corresponding Sasaki–Einstein metric on $\#6(S^2\times S^3)$ is thus also explicit. This metric has apparently never been published. We thank S.–T. Yau for pointing this out to us.
2 Although one can still deduce some geometric information for the regular Sasaki–Einstein manifolds $\#l(S^2\times S^3)$, which are $U(1)$ bundles over del Pezzo surfaces with $l$ points blown up, $l = 3,\dots,8$, even though the general metrics are not known explicitly.
3 Thus disproving a conjecture of Cheeger and Tian [8] that such examples do not exist. We thank the referee for drawing our attention to this reference.


The maximal value is then precisely the exact $a$ central charge of the superconformal field theory. Since one is maximising a cubic with rational coefficients, the resulting R-charges are always algebraic numbers. Recall that in AdS/CFT the R-symmetry is precisely dual to the canonical Killing vector field $K$ discussed above. Moreover, the central charge $a_Y$ for the field theory dual to $Y$ is inversely proportional to its volume. In particular, we have [10]

$$\frac{a_Y}{a_{S^5}} = \frac{\mathrm{vol}(S^5)}{\mathrm{vol}(Y)} . \tag{1.2}$$
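The quasi-regular versus irregular distinction described above depends only on whether $4p^2 - 3q^2$ is a perfect square, so it can be tabulated directly. The sketch below is an illustration only; the function name `classify` is hypothetical, and the test relies solely on the criterion quoted in the text.

```python
from math import gcd, isqrt


def classify(p, q):
    """Classify Y^{p,q} by the criterion quoted in the text.

    Requires coprime positive integers with q < p. Returns "quasi-regular"
    if 4p^2 - 3q^2 is a perfect square and "irregular" otherwise.
    """
    if not (0 < q < p and gcd(p, q) == 1):
        raise ValueError("need coprime positive integers with q < p")
    n = 4 * p * p - 3 * q * q
    r = isqrt(n)
    return "quasi-regular" if r * r == n else "irregular"


if __name__ == "__main__":
    for p in range(2, 15):
        for q in range(1, p):
            if gcd(p, q) == 1 and classify(p, q) == "quasi-regular":
                print(p, q, 4 * p * p - 3 * q * q)
```

For instance, $Y^{2,1}$ gives $4\cdot4 - 3 = 13$, which is not a square, in agreement with the statement that this metric is irregular, whereas $(p,q) = (7,3)$ gives $169 = 13^2$.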

It is thus clearly of interest to identify the dual superconformal field theories for the Sasaki–Einstein manifolds Y p,q , so as to compare the exact results on both sides of the duality. In this paper we take the first substantial steps in this program by analysing in considerable detail the geometry of the manifolds Y p,q , and the associated Calabi–Yau cones. The results allow us to show that the metrics Y p,q are dual to a class of N = 1 superconformal field theories arising as IR fixed points of certain toric quiver gauge theories, with gauge group SU (N )2p . The case p = 2, q = 1 is somewhat special. This corresponds to the geometry with largest volume, and is an irregular metric. The dual field theory therefore has the smallest central charge within the family, and moreover is expected to be quadratic irrational. Rather surprisingly, we find that the metric Y 2,1 turns out to be an explicit metric on the horizon of the complex cone over the first del Pezzo surface. For this, the corresponding SU (N )4 quiver gauge theory and superpotential have already been identified [11]. We can then compute the central charge (1.2) and also the R-charges of the baryons for this theory using AdS/CFT, where the baryons correspond to D3-branes wrapped over 3-cycles whose metric cones are supersymmetric cycles (complex divisors) in the cone over Y 2,1 . The values we find are all quadratic irrational numbers. At first sight these results present a puzzle, as the central charge computed in [9, 12, 13] was found to be a rational number. However, a closer inspection of the quiver theory shows that the a-maximisation calculation is somewhat more subtle in this case4 . Indeed, using a-maximisation [9] applied to the quiver theory, the authors of [14] find a central charge, as well as R-charges, which agree perfectly with the values obtained using the geometrical results of this paper. 
This constitutes an extremely beautiful test of the AdS/CFT correspondence, as well as the general a-maximisation procedure advocated in [9]. Given the results presented here, in principle the duals to the remaining geometries, with general $p$ and $q$, $q < p$, can be constructed using the “toric algorithm” of [11]. These will provide an infinite series of $N = 1$ superconformal field theories, whose central charges are generically quadratic irrational. It will be interesting to obtain these explicitly, and to compare the results of a-maximisation for these theories with the various geometrical results presented in this paper. However, we leave these calculations for future work.

As a final point, we note that in [15] a generalisation of the metrics $Y^{p,q}$ to all dimensions was presented (see also references [16] and [17] for a generalisation of this generalisation). In particular there are countably infinite classes of supersymmetric solutions $AdS_4\times Y_7$ to M-theory, which will have three-dimensional CFT duals, where the metric $Y_7$ is built using any positive curvature Kähler–Einstein metric in real dimension four [15]. These have been classified [18, 19]. For the case when the Kähler–Einstein manifold is toric, one has only three cases: $\mathbb{CP}^2$, $\mathbb{CP}^1\times\mathbb{CP}^1$, and $dP_3$, where the latter is the third del Pezzo surface. Using the techniques developed in this paper, one can show that for the first two cases the metric cones over $Y_7$ are given by Kähler quotients $\mathbb{C}^5//U(1)$ and $\mathbb{C}^6//U(1)^2$, respectively, where the various $U(1)$ charges are, with appropriate definitions5 of the Chern numbers $p$ and $k$, $Q = (p, p, p, -3p+k, -k)$ and $Q_1 = (p, p, 0, 0, -2p+k, -k)$, $Q_2 = (0, 0, p, p, -2p+k, -k)$, respectively.

4 We are very grateful to M. Bertolini, F. Bigazzi, A. Hanany, K. Intriligator, and B. Wecht for discussions on this issue.

Outline. The first point to note about the manifolds $Y^{p,q}$, and their associated Calabi–Yau cones, is that they are all toric. This essentially means that there is an effective action of a torus $T^3\cong U(1)^3$ on $C(Y^{p,q})$ which preserves the symplectic form of the cone and commutes with the homothetic $\mathbb{R}_+$ action. Indeed, this torus action is an isometry, and so also preserves the metric. The torus action and symplectic form then allow us to define a moment map, $\mu : C(Y^{p,q})\to\mathbb{R}^3$. The image in $\mathbb{R}^3$ is always a good convex rational polyhedral cone in $\mathbb{R}^3$ [20]. These terms will be explained more carefully later. However, roughly this is a convex cone formed by intersecting some number of planes through the origin. The moment map exhibits $C(Y^{p,q})$ as a $T^3$ fibration over this moment cone, with the fibres collapsing over the faces, or facets, of the cone in a way determined by the normal vectors to the facets. We shall find explicitly that the moment cone for $Y^{p,q}$ is a four-faceted good strictly convex rational polyhedral cone. Having computed the moment cone for $C(Y^{p,q})$ we may then apply a Delzant theorem [21] for symplectic toric cones worked out recently in [20]. In physics terms, this takes the combinatorial data defining the moment cone and uses it to produce a gauged linear sigma model [22]. By construction the classical vacuum of the linear sigma model is precisely the Calabi–Yau cone one started with.
More mathematically, this would be called a symplectic (or, more precisely, Kähler) quotient of $\mathbb{C}^d$ by a compact abelian group. The final result is:

• The metric cones over $Y^{p,q}$ are explicit Calabi–Yau metrics for the $U(1)$ gauged linear sigma model on $\mathbb{C}^4$ with charges $(p, p, -p + q, -p - q)$, and zero Fayet–Iliopoulos parameter.

If we denote the vacuum of a linear sigma model by $X = \mathbb{C}^4 // U(1)$, then it is easy to see that, rather generally, $c_1(X) = 0$ is equivalent to the charges of the $U(1)$ gauge group summing to zero. Clearly this is true for the gauged linear sigma model above, and hence $X$ is indeed topologically Calabi–Yau. In this process we lose precise information about the metric: in particular, the induced metric from $\mathbb{C}^4$ is not Ricci-flat. However, we have now gained an explicit description of the Calabi–Yau singularity. Indeed, by constructing invariant monomials one obtains an algebraic description of the singularity. One easily sees that this is the hypersurface

$$u^{p+q} v^{p-q} = x^{p+q} y^{p-q} \quad \text{in } \mathbb{C}^4, \tag{1.3}$$

where the monomials are given by

$$u = z_1^{p-q} z_3^{p}, \qquad v = z_2^{p+q} z_4^{p}, \qquad x = z_2^{p-q} z_3^{p}, \qquad y = z_1^{p+q} z_4^{p}. \tag{1.4}$$
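The gauge invariance of the monomials (1.4) and the relation (1.3) reduce to integer arithmetic on exponents, which can be checked mechanically. The following is a minimal sketch, not part of the original construction; the function names are hypothetical, and each monomial is represented by its exponent vector $(a_1, a_2, a_3, a_4)$ in $z_1^{a_1} z_2^{a_2} z_3^{a_3} z_4^{a_4}$.

```python
def glsm_charges(p, q):
    """U(1) charges of the four fields, as quoted in the text."""
    return (p, p, -p + q, -p - q)


def invariant_monomials(p, q):
    """Exponent vectors of the monomials u, v, x, y in (1.4)."""
    return {
        "u": (p - q, 0, p, 0),
        "v": (0, p + q, 0, p),
        "x": (0, p - q, p, 0),
        "y": (p + q, 0, 0, p),
    }


def u1_weight(charges, exponents):
    """Total U(1) charge of a monomial with the given exponent vector."""
    return sum(c * e for c, e in zip(charges, exponents))


def relation_holds(p, q):
    """Compare exponent vectors of u^(p+q) v^(p-q) and x^(p+q) y^(p-q)."""
    m = invariant_monomials(p, q)
    lhs = tuple((p + q) * a + (p - q) * b for a, b in zip(m["u"], m["v"]))
    rhs = tuple((p + q) * a + (p - q) * b for a, b in zip(m["x"], m["y"]))
    return lhs == rhs


if __name__ == "__main__":
    p, q = 2, 1  # illustrative choice
    charges = glsm_charges(p, q)
    print("charges sum to", sum(charges))
    for name, exps in invariant_monomials(p, q).items():
        print(name, "has U(1) weight", u1_weight(charges, exps))
    print("relation (1.3) holds:", relation_holds(p, q))
```

The vanishing of the charge sum is the topological Calabi–Yau condition mentioned above; the vanishing weights confirm that $u$, $v$, $x$, $y$ descend to functions on the quotient.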

$^{5}$ In particular, the definitions here are different from those in [15].

We may then give the toric diagram for the Calabi–Yau singularity. This may be realised as an integral polytope in $\mathbb{R}^2$. Roughly, the four outward pointing primitive normal vectors that define the moment cone lie in a plane as a result of the Calabi–Yau

Toric Geometry, Sasaki–Einstein Manifolds and a New Infinite Class of AdS/CFT Duals

condition. Projecting these vectors onto this plane yields the vertices of the toric diagram for a minimal presentation of the singularity. We show that the resulting toric diagrams may all be embedded inside that of the orbifold $\mathbb{C}^3/\mathbb{Z}_{p+1} \times \mathbb{Z}_{p+1}$, where the two factors are generated by $(\omega_{p+1}, \omega_{p+1}^{-1}, 1)$ and $(\omega_{p+1}, 1, \omega_{p+1}^{-1}) \in SU(3)$, respectively, where $\omega_{p+1}$ is a $(p+1)$-th root of unity. The vertices of the polytope are then $(0, 0)$, $(0, p + 1)$ and $(p + 1, 0)$ (the position of the origin is irrelevant) and we show that the toric diagram for $C(Y^{p,q})$ lives inside this polytope for all $q < p$ and fixed $p$. Geometrically, this means that the Calabi–Yau cone $C(Y^{p,q})$ may be obtained by (partial) toric crepant resolution of the orbifold [5, 23].

Also, as part of our general analysis, we find a class of supersymmetric submanifolds in the geometries $C(Y^{p,q})$. Specifically, we show that the cones over the special orbits of the cohomogeneity one action on $Y^{p,q}$ are calibrated submanifolds (in fact complex divisors) of the Calabi–Yau. Recall that D3-branes wrapped over the horizon 3-cycles are dual to baryons in the AdS/CFT correspondence [24, 25]. We compute the volumes of these submanifolds, and hence give a prediction for the R-charges of the corresponding baryons.

Given the toric diagram for $C(Y^{p,q})$ there are methods to construct a superconformal field theory, whose Higgs branch is the toric variety $X \cong C(Y^{p,q})$, purely from the combinatorial data that defines $X$ [11]. Indeed, the point is that the field theory for the orbifold $\mathbb{C}^3/\mathbb{Z}_{p+1} \times \mathbb{Z}_{p+1}$, in which the geometries are “embedded”, is known from standard orbifold techniques. The Calabi–Yau cones $C(Y^{p,q})$ are obtained by partial resolution, which amounts to turning on specific combinations of Fayet–Iliopoulos parameters in the gauged linear sigma model. The field theories in question are then rather conventional quiver gauge theories with polynomial superpotentials.
The number of nodes of the quiver is simply twice the area of the toric diagram, which is $2p$ for all $q$ with fixed $p$.

Rather surprisingly, we find that the toric diagram for $Y^{2,1}$ is precisely the same as that for the complex cone over the first del Pezzo surface. Recall that the latter is the blow-up of $\mathbb{CP}^2$ at one point, and that the complex cone over this is indeed a real cone over $S^2 \times S^3$. It follows that $Y^{2,1}$, which is irregular, is an explicit Sasaki–Einstein metric on the horizon, or boundary, of this cone. This is interesting, since the higher del Pezzo surfaces, which are $\mathbb{CP}^2$ with $3 \le r \le 8$ generic points blown up, admit Kähler–Einstein metrics [18, 19]. The complex cones then carry regular Sasaki–Einstein metrics. The case of one or two points blown up has always been something of a puzzle, since these del Pezzos do not admit Kähler–Einstein metrics and thus the Sasaki–Einstein metrics associated to the complex cones could not possibly be regular. We have thus resolved this puzzle, at least in the case of one blow-up.

The quiver gauge theory dual to the complex cone over the first del Pezzo surface has been presented in the literature [11]. The AdS/CFT correspondence then predicts the exact central charge of this theory in the IR. Using the explicit metric $Y^{2,1}$, the result we obtain is

$$\frac{a_{S^5}}{a_{Y^{2,1}}} = \frac{\mathrm{vol}(Y^{2,1})}{\mathrm{vol}(S^5)} = \frac{13\sqrt{13} + 46}{12 \cdot 27} \sim \frac{7.74}{27}. \tag{1.5}$$

Remarkably, this value coincides precisely with a recent application of a-maximisation [9] to the quiver gauge theory [14]. Moreover, we also find perfect agreement for the charges of ($SU(2)_F$ singlet) baryons in the gauge theory.

The plan of the rest of the paper is as follows. In Sect. 2, after recalling some basic facts about Sasaki–Einstein geometry, we give a summary of the construction of the metrics $Y^{p,q}$, and recall several of their features. Section 3 contains a review of symplectic toric geometry, in particular toric contact geometry, which we use extensively in the remainder of the paper. In Sect. 4 we compute the image of the moment map associated to the toric Calabi–Yau cones $C(Y^{p,q})$. In Sect. 5 we apply a Delzant construction to obtain a gauged linear sigma model (GLSM) description of the Calabi–Yau spaces. Moreover we analyse directly the structure of the moduli space of vacua of the GLSM in Sect. 5.3. In Sect. 6 the associated toric Gorenstein singularities are described. In Sect. 7 we demonstrate that $Y^{2,1}$ is an irregular metric on the horizon of the complex cone over the first del Pezzo surface, and exhibit an explicit (non-Kähler and non-Einstein) metric on the latter. Section 8 concludes with a comparison of the geometrical results obtained here with the results of a-maximisation applied to the quiver gauge theory corresponding to the complex cone over the first del Pezzo surface [14]. In Appendix A the techniques used in the paper, which perhaps are unfamiliar to many physicists, are applied to the familiar example of the conifold.

2. Sasaki–Einstein Metrics on $S^2 \times S^3$

In this section we review the geometry of the recently discovered Sasaki–Einstein metrics on $S^2 \times S^3$ [6]. There is an infinite family of such metrics, labelled by two coprime integers $p > 1$, $q < p$; we refer to these as $Y^{p,q}$. Geometrically they are all $U(1)$ principal bundles$^{6}$ over an axially squashed $S^2$ bundle over a round $S^2$. The integers label the twisting, or Chern numbers, of the $U(1)$ bundle over the two two-cycles, with the constraint $q < p$ arising as a regularity condition on the metric. The manifolds are all cohomogeneity one. The fact that they are all topologically $S^2 \times S^3$ follows from a theorem of Smale [26] on the classification of five-manifold topology.

In the following we first recall basic material about Sasaki–Einstein geometry and then turn to the metrics $Y^{p,q}$.

2.1. Sasakian–Einstein geometry. A Sasaki–Einstein manifold may be defined as a complete positive curvature Einstein manifold$^{7}$ whose metric cone is Ricci-flat Kähler, i.e. a Calabi–Yau cone. The structure of a Sasaki–Einstein manifold may thus be thought of as “descending” from the Calabi–Yau structure of its metric cone (1.1). In particular, contracting the Euler vector $r\partial/\partial r$, which generates the homothetic $\mathbb{R}_+$ action on the cone, into the Kähler form gives rise to a one-form on the base of the cone, $Y$. The dual of this is a constant norm Killing vector field, called the Reeb vector in the mathematical literature, which via the AdS/CFT correspondence is isomorphic to the R-symmetry of the dual field theory. The Killing vector defines a foliation of the Sasaki–Einstein manifold, and one finds that the transverse leaves have a Kähler–Einstein structure. More precisely, one can write the local form of the metric as follows:

$$ds^2(Y) = ds_4^2 + \left(\tfrac{1}{3}\, d\psi' + \sigma\right)^2, \tag{2.1}$$

$^{6}$ This $U(1)$ is not to be confused with the isometry generated by the Reeb vector. The latter is embedded non-trivially inside the torus defined by this $U(1)$ and the $U(1)$ that rotates the axially squashed $S^2$ fibre.

$^{7}$ We also require simply-connectedness. This is not strictly necessary. However, given this condition we can use a theorem which relates contact structures to the existence of globally defined Killing spinors. The latter is the physical property that we wish our manifolds to possess.

where $ds_4^2$ is a local Kähler–Einstein metric. In particular we have that

$$d\sigma = 2J_4, \qquad d\Omega_4 = 3i\,\sigma \wedge \Omega_4, \tag{2.2}$$

where $J_4$ and $\Omega_4$ are the local Kähler and holomorphic $(2, 0)$ forms for $ds_4^2$, respectively. The Reeb Killing vector is given by

$$K \equiv 3\,\frac{\partial}{\partial \psi'}. \tag{2.3}$$

Sasaki–Einstein manifolds may then be classified into three families, according to the global properties of the orbits of this Killing vector field:

• If the orbits close, and moreover the associated $U(1)$ action is free, the Sasaki–Einstein manifold is said to be regular. The lengths of the orbits are then all equal. One thus has a principal $U(1)$ bundle over a four-dimensional base Kähler–Einstein manifold.

• Suppose instead that the orbits close but the isotropy group $\Gamma_x$ of at least one point $x$ is non-trivial. Notice that $\Gamma_x$ is necessarily isomorphic to $\mathbb{Z}_m$, for some integer $m$, since these are precisely the proper subgroups of $U(1)$. The $U(1)$ action is then locally free, meaning that the isotropy groups are all finite; note that the Killing vector cannot vanish anywhere since it has constant norm. The Sasaki–Einstein manifold is then said to be quasi-regular. In this case notice that the length of the orbit through $x$ is $1/m$ times the length of the generic orbit. The quotient of any manifold by a locally free compact Lie group action is canonically an orbifold. One thus has a principal orbifold $U(1)$ bundle, or orbibundle, over a Kähler–Einstein base orbifold. Moreover, the point $x$ will descend to a $\mathbb{Z}_m$ orbifold point in this base space.

• If the orbits do not close, the Sasaki–Einstein manifold is said to be irregular. In this case one does not have a well-defined quotient space. Note that such a Sasaki–Einstein manifold necessarily has at least a $U(1)^d$ isometry group, $d \ge 2$, with the orbits of the Killing vector filling out a dense subset of the orbits of the torus action. Indeed, the isometry group of a compact Riemannian manifold is always a compact Lie group. Hence the orbits of a Killing vector field define a one-parameter subgroup, the closure of which will always be an abelian subgroup and thus a torus. The dimension of the closure of the orbits is called the rank. Thus irregular Sasaki–Einstein manifolds have rank greater than 1.
The five-dimensional regular Sasaki–Einstein manifolds are classified completely [27]. This follows since the smooth four-dimensional Kähler–Einstein metrics with positive curvature on the base have been classified by Tian and Yau [18, 19]. These include the special cases $\mathbb{CP}^2$ and $S^2 \times S^2$, with corresponding Sasaki–Einstein manifolds being the homogeneous manifolds $S^5$ (or $S^5/\mathbb{Z}_3$) and $T^{1,1}$ (or $T^{1,1}/\mathbb{Z}_2$), respectively. For the remaining metrics, the base is a del Pezzo surface obtained by blowing up $\mathbb{CP}^2$ at $k$ generic points with $3 \le k \le 8$ and, although proven to exist, the generic metrics are not known explicitly.

We emphasise the lack of existence of Kähler–Einstein metrics on the del Pezzo surfaces with one or two points blown up, as this will play an important role later. This fact is actually rather simple to understand. It is a fairly straightforward calculation [28] to show that the Lie algebra $H$ generated by holomorphic vector fields on a Kähler–Einstein manifold is a complexification of the Lie algebra generated by Killing vector fields, i.e. isometries. The latter is always a reductive algebra (meaning it is the sum of its centre together with a semi-simple algebra) but for the first and second del Pezzo surfaces the algebra $H$ is not reductive. Clearly then $H$ being reductive is a necessary condition for the existence of a Kähler–Einstein metric. This is Matsushima’s Theorem [28]. One also requires that the anti-canonical bundle be ample, that is $c_1 > 0$, otherwise the putative Kähler–Einstein metric would be indefinite. In complex dimension two, these necessary conditions are in fact sufficient for existence of a Kähler–Einstein metric [18, 19], and this leads to the list stated above.

It was only recently [29–32] that quasi-regular Sasaki–Einstein metrics were shown to exist on $\#l(S^2 \times S^3)$ with $l = 1, \ldots, 9$. In particular, there are 14 known inhomogeneous Sasaki–Einstein metrics on $S^2 \times S^3$. We stress that the proof of this is via existence arguments, rather than giving explicit metrics. Specifically, one uses a modification of Yau’s argument to prove existence of Kähler–Einstein metrics on certain complex orbifolds, and then builds the appropriate $U(1)$ orbibundle over these to obtain Sasaki–Einstein manifolds. One can also obtain quasi-regular geometries rather trivially by taking quotients of the explicit regular geometries discussed above by appropriate freely-acting finite groups. For example, one can take a freely-acting finite subgroup of $SU(3)$ and quotient $S^5 \subset \mathbb{C}^3$ by the induced action.

2.2. The metrics $Y^{p,q}$. We will now review, as well as work out some new, properties of the Sasaki–Einstein metrics $Y^{p,q}$ on $S^2 \times S^3$. These were presented in [6] in the following local form:

$$ds^2 = \frac{1 - cy}{6}\,(d\theta^2 + \sin^2\theta\, d\phi^2) + \frac{1}{w(y)q(y)}\, dy^2 + \frac{q(y)}{9}\,(d\psi - \cos\theta\, d\phi)^2 + w(y)\left[d\alpha + f(y)(d\psi - \cos\theta\, d\phi)\right]^2 \equiv ds^2(B) + w(y)\left[d\alpha + A\right]^2, \tag{2.4}$$

where

$$w(y) = \frac{2(a - y^2)}{1 - cy}, \qquad q(y) = \frac{a - 3y^2 + 2cy^3}{a - y^2}, \qquad f(y) = \frac{ac - 2y + y^2 c}{6(a - y^2)}. \tag{2.5}$$

For $c = 0$ the metric takes the local form of the standard homogeneous metric on $T^{1,1}$. Otherwise, $c$ can be scaled to 1 by a diffeomorphism. Henceforth we assume this is the case.

The base $B$. The analysis of [6] first showed that the four-dimensional space $B$ can be made into a smooth complete compact manifold with appropriate choices for the ranges of the coordinates. In particular, for$^{8}$

$$0 < a < 1 \tag{2.6}$$

one can take the ranges of the coordinates $(\theta, \phi, y, \psi)$ to be $0 \le \theta \le \pi$, $0 \le \phi \le 2\pi$, $y_1 \le y \le y_2$, $0 \le \psi \le 2\pi$ so that the “base space” $B$ is an axially squashed $S^2$ bundle over a round $S^2$. The latter is parametrised by $\theta$, $\phi$, with $\psi$ being an azimuthal coordinate on the axially squashed $S^2$ fibre. This bundle is geometrically twisted, and may be thought of as the $S^2$ bundle over $S^2$ formed by taking the tangent bundle of the round two-sphere and adding a point at infinity to each fibre. Now, the inclusion map $U(1) \to SO(3)$ induces a map $\mathbb{Z} \cong \pi_1(U(1)) \to \pi_1(SO(3)) \cong \mathbb{Z}_2$ which is reduction modulo 2. Here we are thinking of $U(1)$ as the group in which the transition functions of $TS^2$ take their values, and $SO(3)$ as the structure group of the associated oriented $S^2$ bundle over $S^2$. Since $TS^2$ has Chern number $2 \equiv 0 \bmod 2$, it follows that the $S^2$ bundle is trivial and thus the manifold $B$ is topologically a product space, $B \cong S^2 \times S^2$.

$^{8}$ In the limit $a \to 1$ the two positive roots become equal and $y = 1$ is a double root. In the case $a = 1$ the metric is locally that of the round metric on $S^5$.

The range of $y$ is fixed so that $1 - y > 0$, $a - y^2 > 0$, $w(y) > 0$, $q(y) \ge 0$. Specifically, the $y_i$ are two zeroes of $q(y)$, i.e. are two roots of the cubic

$$Q(y) \equiv a - 3y^2 + 2y^3 = 0. \tag{2.7}$$
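The root structure of the cubic (2.7) is easy to examine numerically. The sketch below is illustrative only (the helper names are hypothetical): since $Q(-1/2) = Q(1) = a - 1 < 0$ and $Q(0) = Q(3/2) = a > 0$ for $0 < a < 1$, each of the intervals $(-1/2, 0)$, $(0, 1)$ and $(1, 3/2)$ contains exactly one root, and plain bisection suffices.

```python
def Q(y, a):
    """The cubic of (2.7), with c scaled to 1."""
    return a - 3 * y**2 + 2 * y**3


def bisect(f, lo, hi, iters=100):
    """Bisection for a root of f in [lo, hi]; assumes f changes sign there."""
    f_lo = f(lo)
    for _ in range(iters):
        mid = 0.5 * (lo + hi)
        f_mid = f(mid)
        if f_lo * f_mid <= 0:
            hi = mid
        else:
            lo, f_lo = mid, f_mid
    return 0.5 * (lo + hi)


def cubic_roots(a):
    """Return (y1, y2, y3) with y1 < 0 < y2 < 1 < y3, for 0 < a < 1."""
    if not 0 < a < 1:
        raise ValueError("three distinct real roots require 0 < a < 1")
    f = lambda y: Q(y, a)
    return bisect(f, -0.5, 0.0), bisect(f, 0.0, 1.0), bisect(f, 1.0, 1.5)


if __name__ == "__main__":
    y1, y2, y3 = cubic_roots(0.5)  # a = 0.5 is an arbitrary sample value
    print(y1, y2, y3)
```

The two roots used in the construction are the negative root $y_1$ and the smaller positive root $y_2$.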

If $0 < a < 1$ there are three real roots, one negative ($y_1$) and two positive, the smallest being $y_2$. The values $y = y_1, y_2$ then correspond to the south and north poles of the axially squashed $S^2$ fibre. One may check explicitly that the metric is smooth here with the above identifications of coordinates.

The circle fibration. It was shown in [6] that for a countably infinite number of values of $a$, with $0 < a < 1$, one can now choose the period of $\alpha$ so as to describe a principal $S^1$ bundle over $B$. This is true if and only if the periods of $dA$ are rationally related. Thus one requires

$$P_1 = \ell p, \qquad P_2 = \ell q \tag{2.8}$$

with the periods $P_i$, $i = 1, 2$, given by

$$P_i = \frac{1}{2\pi} \int_{C_i} dA, \tag{2.9}$$

where $C_1$ and $C_2$ give the standard basis for the homology group of two-cycles on $B \cong S^2 \times S^2$. In this case, one may take

$$0 \le \alpha \le 2\pi\ell, \tag{2.10}$$

and the five-dimensional space is then the total space of an $S^1$ fibration over $B \cong S^2 \times S^2$, with Chern numbers $p$ and $q$ over the two two-cycles. An explicit calculation shows that

$$\frac{P_1}{P_2} = \frac{3}{2(y_2 - y_1)}. \tag{2.11}$$

Moreover, the function $y_2(a) - y_1(a)$ is a monotonic increasing function of $a$, taking the range $0 < y_2(a) - y_1(a) < 3/2$, thus implying a countably infinite number of solutions with $0 < q/p < 1$. Furthermore, for any $p$ and $q$ coprime, the space $Y^{p,q}$ is topologically $S^2 \times S^3$; see [6]. This follows from a result of Smale on the classification of five-manifold topology.
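Because $y_2(a) - y_1(a)$ is monotonic, the value of $a$ corresponding to a given pair $(p, q)$ can be found by a one-dimensional search: by (2.11), the condition $P_1/P_2 = p/q$ is equivalent to $y_2 - y_1 = 3q/(2p)$. The following is a rough numerical sketch under that reading; the function names are hypothetical, and nested bisection is used purely for simplicity.

```python
def root_between(f, lo, hi, iters=100):
    """Bisection for a root of f in [lo, hi]; assumes a sign change."""
    f_lo = f(lo)
    for _ in range(iters):
        mid = 0.5 * (lo + hi)
        f_mid = f(mid)
        if f_lo * f_mid <= 0:
            hi = mid
        else:
            lo, f_lo = mid, f_mid
    return 0.5 * (lo + hi)


def roots_y1_y2(a):
    """Negative root y1 and smallest positive root y2 of Q(y) = a - 3y^2 + 2y^3."""
    Q = lambda y: a - 3 * y * y + 2 * y**3
    return root_between(Q, -0.5, 0.0), root_between(Q, 0.0, 1.0)


def solve_a(p, q):
    """Find a in (0, 1) with y2 - y1 = 3q/(2p), i.e. P1/P2 = p/q by (2.11)."""
    target = 3 * q / (2 * p)

    def gap(a):
        y1, y2 = roots_y1_y2(a)
        return (y2 - y1) - target

    a = root_between(gap, 1e-12, 1 - 1e-12)
    y1, y2 = roots_y1_y2(a)
    return a, y1, y2


if __name__ == "__main__":
    a, y1, y2 = solve_a(2, 1)
    print("a =", a, " y1 =", y1, " y2 =", y2)
    print("P1/P2 =", 3 / (2 * (y2 - y1)))
```

For $p = 2$, $q = 1$ the roots found this way can be compared against the closed-form expressions for $y_1$ and $y_2$ in terms of $p$ and $q$ given in the paper.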

The volumes. One finds that

$$\ell = \frac{q}{3q^2 - 2p^2 + p(4p^2 - 3q^2)^{1/2}} \tag{2.12}$$

and the volume of $Y^{p,q}$ is given by

$$\mathrm{vol}(Y^{p,q}) = \frac{q^2\left[2p + (4p^2 - 3q^2)^{1/2}\right]}{3p^2\left[3q^2 - 2p^2 + p(4p^2 - 3q^2)^{1/2}\right]}\, \pi^3 \tag{2.13}$$

which is a quadratic irrational number times the volume $\pi^3$ of a unit round $S^5$. We note that at fixed $p$ the volume is a monotonic function of $q$, and is bounded by the following values:

$$\mathrm{vol}(T^{1,1}/\mathbb{Z}_p) > \mathrm{vol}(Y^{p,q}) > \mathrm{vol}(S^5/\mathbb{Z}_2 \times \mathbb{Z}_p). \tag{2.14}$$
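The volume formula (2.13) and the bounds (2.14) can be checked numerically. The sketch below is illustrative only: the function names are hypothetical, and the bounds use the standard values $\mathrm{vol}(T^{1,1}) = 16\pi^3/27$ (not quoted in the text above) and $\mathrm{vol}(S^5) = \pi^3$.

```python
import math


def vol_Y(p, q):
    """Volume of Y^{p,q} from the closed form (2.13)."""
    s = math.sqrt(4 * p * p - 3 * q * q)
    return q * q * (2 * p + s) / (3 * p * p * (3 * q * q - 2 * p * p + p * s)) * math.pi**3


def volume_bounds(p):
    """(upper, lower) bounds of (2.14): vol(T^{1,1}/Z_p) and vol(S^5/(Z_2 x Z_p)).

    Assumes the standard values vol(T^{1,1}) = 16 pi^3 / 27 and vol(S^5) = pi^3.
    """
    return 16 * math.pi**3 / (27 * p), math.pi**3 / (2 * p)


if __name__ == "__main__":
    ratio = vol_Y(2, 1) / math.pi**3
    print("vol(Y^{2,1}) / vol(S^5) =", ratio)
    print("27 * ratio =", 27 * ratio)
    upper, lower = volume_bounds(2)
    print(upper > vol_Y(2, 1) > lower)
```

For $p = 2$, $q = 1$ the ratio reproduces the quadratic irrational $(13\sqrt{13} + 46)/(12 \cdot 27)$ quoted in the Introduction.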

The rational case, which is easily seen to correspond to quasi-regular manifolds, is described by $p, q \in \mathbb{N}$, $\mathrm{hcf}(p, q) = 1$, $q < p$, which are solutions to the quadratic Diophantine equation

$$4p^2 - 3q^2 = n^2 \tag{2.15}$$

for some $n \in \mathbb{Z}$. The solutions to this were given in closed form in [6].

The isometry group. The isometry group of the metrics (2.4) is clearly locally $SU(2) \times U(1) \times U(1)$, and in particular there are three commuting Killing vectors $\partial/\partial\phi$, $\partial/\partial\psi$, and $\partial/\partial\gamma$. Here we have defined

$$\alpha \equiv \ell\gamma \tag{2.16}$$

so that the three generators have canonical period $2\pi$. For us it will be important to note that the global form of the effectively acting isometry group depends on $p$ and $q$. In particular, for both $p$ and $q$ odd it is $SO(3) \times U(1)^2$, otherwise it is $U(2) \times U(1)$. This will be explained later in Sect. 4. Note that this is precisely analogous to the case of the Einstein manifolds known in the physics literature as $T^{p,q}$. For these the effectively acting isometry group is shown [33] to be $SO(3) \times SU(2)$ when one integer is even, and $SO(4) \cong (SU(2) \times SU(2))/\mathbb{Z}_2$ when both are odd. The latter of course includes the case of $T^{1,1}$ [3].

The local Kähler–Einstein structure. Employing the change of coordinates $\alpha = -\beta/6 - \psi'/6$, $\psi = \psi'$ one can [6] bring the metric (2.4) into the local Sasaki–Einstein form (2.1). In particular

$$ds^2 = \frac{1 - y}{6}\,(d\theta^2 + \sin^2\theta\, d\phi^2) + \frac{dy^2}{w(y)q(y)} + \frac{1}{36}\, w(y)q(y)\,(d\beta + \cos\theta\, d\phi)^2 + \frac{1}{9}\left[d\psi' - \cos\theta\, d\phi + y(d\beta + \cos\theta\, d\phi)\right]^2. \tag{2.17}$$

The corresponding $J_4$ and $\Omega_4$, satisfying (2.2), can be taken as

$$J_4 = \frac{1 - y}{6}\, \sin\theta\, d\theta \wedge d\phi + \frac{1}{6}\, dy \wedge (d\beta + \cos\theta\, d\phi), \tag{2.18}$$

$$\Omega_4 = \sqrt{\frac{1 - y}{6\, w(y)q(y)}}\; (d\theta + i \sin\theta\, d\phi) \wedge \left[dy + i\, \frac{w(y)q(y)}{6}\,(d\beta + \cos\theta\, d\phi)\right], \tag{2.19}$$

while the Reeb Killing vector is given by

$$K = 3\,\frac{\partial}{\partial\psi} - \frac{1}{2\ell}\,\frac{\partial}{\partial\gamma}. \tag{2.20}$$
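The coefficient $1/(2\ell)$ in (2.20) is rational precisely when $4p^2 - 3q^2$ is a perfect square, which is the Diophantine condition (2.15). As a small illustration (a brute-force search, not the closed-form solution of [6]; the function names are hypothetical), one can enumerate the coprime pairs $q < p$ satisfying it and compute the corresponding rational $\ell$ from (2.12):

```python
from fractions import Fraction
from math import gcd, isqrt


def quasi_regular_pairs(p_max):
    """All (p, q, n) with 2 <= p <= p_max, 1 <= q < p, gcd(p, q) = 1, 4p^2 - 3q^2 = n^2."""
    found = []
    for p in range(2, p_max + 1):
        for q in range(1, p):
            if gcd(p, q) != 1:
                continue
            d = 4 * p * p - 3 * q * q
            n = isqrt(d)
            if n * n == d:
                found.append((p, q, n))
    return found


def ell_rational(p, q, n):
    """ell of (2.12) as an exact fraction, given n with n^2 = 4p^2 - 3q^2."""
    return Fraction(q, 3 * q * q - 2 * p * p + p * n)


if __name__ == "__main__":
    for p, q, n in quasi_regular_pairs(13):
        print(p, q, n, ell_rational(p, q, n))
```

In particular $(p, q) = (2, 1)$ does not appear in the list, since $4p^2 - 3q^2 = 13$ is not a perfect square.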

Note that $K$ has compact orbits when $\ell$ is a rational number, which corresponds to the quasi-regular class, by definition. This is true if and only if (2.15) holds. If $\ell$ is irrational the generic orbits do not close, but instead densely fill the orbits of the torus generated by $[\partial/\partial\psi, \partial/\partial\gamma]$ and we thus fall into the irregular class. The rank of these metrics is thus equal to 2. Note that the orbits close only over the submanifolds given by $y = y_1, y_2$. These are precisely the special$^{9}$ orbits of the cohomogeneity one action.

The Killing spinors. To show that these manifolds admit globally defined Killing spinors one appeals to the following theorem [34]: every simply-connected spin Sasaki–Einstein manifold, where the latter is defined in terms of the existence of a certain contact structure, admits a solution to the Killing spinor equation. In particular we note that the dual one-form to $K$ is given by

$$\eta = -2y(d\alpha + A) + \frac{1}{3}\, q(y)(d\psi - \cos\theta\, d\phi) \tag{2.21}$$

which is globally defined (the factor of $q(y)$ is essential here). The contact structure is then easy to exhibit in terms of $\eta$ for the manifolds $Y^{p,q}$ [6]. This theorem is the reason why one a priori requires $\mathrm{hcf}(p, q) = 1$; however, see below.

The Calabi–Yau cones. It will be important for us to exploit the symplectic structure of the associated Calabi–Yau cones. Rather generally, the Calabi–Yau structure on the metric cone is specified by a Kähler (hence also symplectic) form $J$ and a holomorphic $(3, 0)$ form $\Omega$, which in terms of the four-dimensional Kähler–Einstein data read as follows:

$$J = r^2 J_4 + r\, dr \wedge \left(\tfrac{1}{3}\, d\psi' + \sigma\right), \tag{2.22}$$

$$\Omega = e^{i\psi'} r^2\, \Omega_4 \wedge \left[dr + i r \left(\tfrac{1}{3}\, d\psi' + \sigma\right)\right]. \tag{2.23}$$

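In the specific case of $C(Y^{p,q})$, we have

$$J = r^2\, \frac{1 - y}{6}\, \sin\theta\, d\theta \wedge d\phi + \frac{1}{3}\, r\, dr \wedge (d\psi - \cos\theta\, d\phi) - d(y r^2) \wedge \left[d\alpha + \frac{1}{6}(d\psi - \cos\theta\, d\phi)\right] \tag{2.24}$$

and

$$\Omega = e^{i\psi} r^2 \sqrt{\frac{1 - y}{6\, w(y)q(y)}}\; (d\theta + i \sin\theta\, d\phi) \wedge \left[dy - i\, w(y)q(y)\left(d\alpha + \tfrac{1}{6}(d\psi - \cos\theta\, d\phi)\right)\right] \wedge \left[dr - 2 i r \left(y\, d\alpha + (y - 1)\,\tfrac{1}{6}(d\psi - \cos\theta\, d\phi)\right)\right], \tag{2.25}$$

$^{9}$ The manifolds $Y^{p,q}$ are cohomogeneity one, meaning that the generic orbit under the action of the isometry group is codimension one. There are then always precisely two special orbits of higher codimension.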

where we used (2.18), (2.19) and have then rewritten the expressions in terms of the original coordinates. Note that this calculation shows that $\Omega$ is invariant under $\partial/\partial\alpha$, namely

$$\mathcal{L}_{\partial/\partial\alpha}\, \Omega = i_{\partial/\partial\alpha}\, d\Omega + d\left(i_{\partial/\partial\alpha}\, \Omega\right) = 0, \tag{2.26}$$

implying that the Killing spinors are also invariant. This explicitly checks that upon performing a T-duality along the $\alpha$ direction to Type IIA string theory, the number of preserved supersymmetries is unchanged. In fact, this is obvious given the original construction [7] of these metrics. Since we are guaranteed existence of Killing spinors by the theorem of [34], and since we have now shown that the spinors are independent of $\alpha$, it follows that one may in fact take $\mathrm{hcf}(p, q) = h > 1$ by taking a smooth quotient by $\mathbb{Z}_h$ of the simply-connected Sasaki–Einstein manifold $Y^{p/h, q/h}$. Since this is rather trivial, we take this as understood in the remainder of the paper.

Complex coordinates. It is easy to introduce a (local) set of complex coordinates. To do so we seek three closed complex one-forms $\eta^i$ such that $\Omega \wedge \eta^i = 0$. First, consider the following local one-forms obeying the latter property:

$$\eta^1 = \frac{1}{\sin\theta}\, d\theta + i\, d\phi, \qquad \tilde{\eta}^2 = \frac{1}{w(y)q(y)}\, dy - i\left(d\alpha + \frac{1}{6}(d\psi - \cos\theta\, d\phi)\right), \qquad \tilde{\eta}^3 = \frac{dr}{2r} - i\left[d\alpha + (y - 1)\left(d\alpha + \frac{1}{6}(d\psi - \cos\theta\, d\phi)\right)\right], \tag{2.27}$$

where now

$$\Omega = 2 e^{i\psi} r^3 \sqrt{\frac{Q(y)}{3}}\, \sin\theta\; \eta^1 \wedge \tilde{\eta}^2 \wedge \tilde{\eta}^3. \tag{2.28}$$

Taking $z_1 = \tan\frac{\theta}{2}\, e^{i\phi}$ we immediately find

$$\eta^1 = \frac{dz_1}{z_1}. \tag{2.29}$$

To obtain two more integrable one-forms one is free to consider linear combinations of the one-forms (2.27). Take

$$\eta^2 = -\frac{1}{6} \cos\theta\, \eta^1 + \tilde{\eta}^2, \qquad \eta^3 = \frac{1}{6} \cos\theta\, \eta^1 - y\, \tilde{\eta}^2 + \tilde{\eta}^3. \tag{2.30}$$

Notice that one can now simply drop the tildes in (2.28). Moreover the $\eta^2$, $\eta^3$ are now closed and hence locally exact. In particular

$$\eta^i = \frac{dz_i}{6 z_i}, \tag{2.31}$$

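$i = 2, 3$, with

$$z_2 = \frac{1}{\sin\theta} \sqrt{(y - y_1)^{-1/y_1}\, (y_2 - y)^{-1/y_2}\, (y_3 - y)^{-1/y_3}}\; e^{-6i\alpha - i\psi}, \qquad z_3 = r^3 \sin\theta\, \sqrt{Q(y)}\; e^{i\psi}. \tag{2.32}$$

In terms of the $\{z_i\}$, the three-form $\Omega$ assumes a very simple form:

$$\Omega = \frac{1}{18\sqrt{3}}\, \frac{dz_1 \wedge dz_2 \wedge dz_3}{z_1 z_2}. \tag{2.33}$$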

Supersymmetric cycles. In this subsection we will show that the cones over the submanifolds $y = y_1, y_2$, which recall are the special orbits of the cohomogeneity one action, are in fact divisors in the Calabi–Yau cone. This amounts to showing that they are calibrated with respect to the four-form $\frac{1}{2} J \wedge J$. We denote the three-submanifolds as $\Sigma_i$, $i = 1, 2$, respectively. Thus, we compute the pull-back of $\frac{1}{2} J \wedge J$ to the four-cycles in the Calabi–Yau cone $C(Y^{p,q})$ specified by $y = y_i$. The latter are in fact cones over the Lens spaces $\Sigma_1 \cong S^3/\mathbb{Z}_{p+q}$, $\Sigma_2 \cong S^3/\mathbb{Z}_{p-q}$. We shall show in detail that this is indeed the topology in Sect. 4. However, this fact can also be seen by computing the pull-back of the Kähler form to the four-submanifolds. Defining $k = p + q$, $l = p - q$, these are$^{10}$

$$J|_{y=y_1} = \ell y_1 \left[-\frac{k}{2}\, r^2 \sin\theta\, d\theta \wedge d\phi - r\, dr \wedge (2\, d\gamma - k \cos\theta\, d\phi)\right], \tag{2.34}$$

$$J|_{y=y_2} = \ell y_2 \left[\frac{l}{2}\, r^2 \sin\theta\, d\theta \wedge d\phi - r\, dr \wedge (2\, d\gamma + l \cos\theta\, d\phi)\right], \tag{2.35}$$

and are precisely the Kähler forms associated to cones over round Lens spaces $S^3/\mathbb{Z}_k$ and $S^3/\mathbb{Z}_l$, respectively. Indeed, since $\gamma$ has period $2\pi$, the one-forms multiplying $dr$ are precisely global angular forms (global connections) on the total spaces of circle bundles over $S^2$ with Chern numbers $k$ and $-l$, respectively. The total spaces of such bundles are precisely $S^3/\mathbb{Z}_k$ and $S^3/\mathbb{Z}_l$, respectively. From these expressions, one calculates

$$\frac{1}{2}\, J \wedge J|_{y=y_i} = \frac{\ell\, r^3 y_i (1 - y_i)}{3}\, \sin\theta\, d\theta \wedge d\phi \wedge d\gamma \wedge dr. \tag{2.36}$$

Let us compare this with the volume form induced on $C(\Sigma_i)$ from the metric (2.4). This is given by

$$\mathrm{vol} = \frac{\ell\, r^3 \sqrt{w(y_i)}\,(1 - y_i)}{6}\, \sin\theta\, d\theta \wedge d\phi \wedge d\gamma \wedge dr. \tag{2.37}$$

Remarkably, since $w(y_i) = 4y_i^2$ at any root of the cubic (2.7) we see that this precisely agrees with (2.36). Thus we see that both $C(\Sigma_1) = \{y = y_1\}$ and $C(\Sigma_2) = \{y = y_2\}$ are divisors of $C(Y^{p,q})$, or in other words they are supersymmetric submanifolds.

$^{10}$ Recall that $y_1 < 0$ and $y_2 > 0$.

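We may now write down the volumes of the $\Sigma_i$. Here one needs to use the explicit formulae for the roots of the cubic $y_1$ and $y_2$ in terms of $p$ and $q$:

$$y_1 = \frac{1}{4p}\left(2p - 3q - \sqrt{4p^2 - 3q^2}\right), \qquad y_2 = \frac{1}{4p}\left(2p + 3q - \sqrt{4p^2 - 3q^2}\right). \tag{2.38}$$

One then easily calculates

$$\mathrm{vol}(\Sigma_1) = \frac{q^2 (p + q)\left(-2p + 3q + \sqrt{4p^2 - 3q^2}\right)^2}{2p^2\left(3q^2 - 2p^2 + p\sqrt{4p^2 - 3q^2}\right)^2}\, \pi^2, \qquad \mathrm{vol}(\Sigma_2) = \frac{q^2 (p - q)\left(2p + 3q - \sqrt{4p^2 - 3q^2}\right)^2}{2p^2\left(3q^2 - 2p^2 + p\sqrt{4p^2 - 3q^2}\right)^2}\, \pi^2. \tag{2.39}$$

In particular, let us write down the volumes of $\Sigma_i$ in the case of $p = 2$, $q = 1$:

$$\mathrm{vol}(\Sigma_1) = \frac{\pi^2}{108}\left(31 + 7\sqrt{13}\right), \qquad \mathrm{vol}(\Sigma_2) = \frac{\pi^2}{36}\left(7 + \sqrt{13}\right). \tag{2.40}$$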
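The values (2.40) follow from (2.39) by direct substitution, and both can be cross-checked against the volume form (2.37). The sketch below is illustrative only; the function names are hypothetical, and the second function assumes the coordinate ranges $0 \le \theta \le \pi$, $0 \le \phi \le 2\pi$, $0 \le \gamma \le 2\pi$, so that integrating (2.37) at $r = 1$ with $\sqrt{w(y_i)} = 2|y_i|$ gives $\frac{8\pi^2}{3}\, \ell\, |y_i| (1 - y_i)$.

```python
import math


def sigma_volumes(p, q):
    """(vol(Sigma_1), vol(Sigma_2)) from the closed forms (2.39)."""
    s = math.sqrt(4 * p * p - 3 * q * q)
    denom = 2 * p * p * (3 * q * q - 2 * p * p + p * s) ** 2
    v1 = q * q * (p + q) * (-2 * p + 3 * q + s) ** 2 / denom * math.pi**2
    v2 = q * q * (p - q) * (2 * p + 3 * q - s) ** 2 / denom * math.pi**2
    return v1, v2


def sigma_volumes_from_form(p, q):
    """Same volumes obtained by integrating (2.37) at r = 1 (assumed ranges, see lead-in)."""
    s = math.sqrt(4 * p * p - 3 * q * q)
    ell = q / (3 * q * q - 2 * p * p + p * s)   # (2.12)
    y1 = (2 * p - 3 * q - s) / (4 * p)          # (2.38)
    y2 = (2 * p + 3 * q - s) / (4 * p)
    factor = 8 * math.pi**2 / 3
    return factor * ell * abs(y1) * (1 - y1), factor * ell * abs(y2) * (1 - y2)


if __name__ == "__main__":
    print(sigma_volumes(2, 1))
    print(sigma_volumes_from_form(2, 1))
```

Agreement of the two routes for several $(p, q)$ is a useful consistency check on the factors of $\ell$ in (2.36) and (2.37).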

3. Moment Maps and Convex Rational Polyhedral Cones

In the remainder of this paper it will be crucial for us that the Sasaki–Einstein manifolds $Y^{p,q}$ admit an effectively acting three-torus $\mathbb{T}^3 = U(1)^3$ of isometries, which moreover is Hamiltonian. The latter means that the action preserves the symplectic form of the cone $C(Y^{p,q})$ and that one can use this to introduce a moment map. The torus is just the maximal torus in the isometry group, and the fact that the torus is half the dimension of the cone means that, by definition, the cones are toric. The image of the cone under the corresponding moment map generally belongs to a special class of convex rational polyhedral cones in $\mathbb{R}^3$ [35, 20]; these are simply convex cones formed by intersecting some number of planes through the origin. The normal vectors to these planes, or facets, are necessarily rational and describe which $U(1)$ subgroup of $\mathbb{T}^3$ is vanishing over the corresponding codimension two submanifold of $C(Y^{p,q})$. This generalises the well-known result in symplectic geometry that the image of the moment map for a compact toric symplectic manifold is always a particular type of convex rational polytope called a Delzant polytope.

In this section we give a general review of symplectic toric geometry. This is mainly rather standard material from the point of view of a symplectic geometer; the reader who is familiar with this subject may therefore wish to skip this section. On the other hand, we hope that this will be a useful self-contained presentation of the material.

3.1. Moment maps for torus actions. In this subsection we give a general summary of moment maps, Hamiltonian torus actions, and symplectic toric manifolds, orbifolds and cones, together with the properties of their images under the moment maps, which are always particular types of rational polytopes (or polyhedral cones) in $\mathbb{R}^n$. The case of compact manifolds [36, 37, 21] is rather standard in symplectic geometry, but the generalisation for orbifolds [38], and especially cones [35, 20], is quite recent.

We begin by giving a general definition. Suppose that the torus $\mathbb{T}^n$ acts effectively (meaning that every non-trivial element moves at least one point) on a symplectic manifold $M$ with symplectic form $\omega$. We identify the Lie algebra of this torus, as well as its dual, with Euclidean $n$-space, so $\mathfrak{t}_n \equiv \mathrm{Lie}(\mathbb{T}^n) \cong \mathbb{R}^n$, $\mathfrak{t}_n^* \cong \mathbb{R}^n$. Then a moment map for the torus action is simply a $\mathbb{T}^n$-invariant map,

$$\mu : M \to \mathfrak{t}_n^* \cong \mathbb{R}^n, \tag{3.1}$$

satisfying the condition

$$d\mu_i = V_i \,\lrcorner\, \omega. \tag{3.2}$$

Here $V_i$ denotes the vector field on $M$ corresponding to the basis vector $e_i$ in $\mathfrak{t}_n \cong \mathbb{R}^n$, and $\mu_i$ denotes the component of the map $\mu$ in the direction $e_i$, i.e. $\mu = (\mu_1, \ldots, \mu_n)$. Clearly this moment map is unique only up to an additive integration constant.

To see where this map comes from, suppose for simplicity that one has a $U(1)$ action on a symplectic manifold $M$, generated by some vector field $V$, which moreover preserves the symplectic form. One then says that the $U(1)$ action is symplectic. The latter means that

$$\mathcal{L}_V\, \omega = 0, \tag{3.3}$$

where $\mathcal{L}$ is the Lie derivative. Since $\omega$ is closed, this condition is just

$$d(V \,\lrcorner\, \omega) = 0. \tag{3.4}$$
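As a concrete instance of (3.2), one may take the simplest example, which is not discussed in the text: $M = \mathbb{R}^2$ with $\omega = dx \wedge dy$ and the rotation generator $V = -y\,\partial_x + x\,\partial_y$. Then $V \,\lrcorner\, \omega = -x\, dx - y\, dy$, so that with the sign convention of (3.2) a moment map is $\mu = -\frac{1}{2}(x^2 + y^2)$, up to a constant. The sketch below checks this numerically with central differences; all names are hypothetical.

```python
def rotation_field(x, y):
    """Generator of the U(1) rotation of the plane: V = -y d/dx + x d/dy."""
    return (-y, x)


def contract_with_omega(x, y):
    """Components (along dx, dy) of V contracted into omega = dx ^ dy.

    For V = (Vx, Vy): V _| (dx ^ dy) = Vx dy - Vy dx, i.e. components (-Vy, Vx).
    """
    vx, vy = rotation_field(x, y)
    return (-vy, vx)


def mu(x, y):
    """Candidate moment map, with the sign convention d(mu) = V _| omega."""
    return -0.5 * (x * x + y * y)


def gradient(f, x, y, h=1e-6):
    """Central-difference approximation to (df/dx, df/dy)."""
    return ((f(x + h, y) - f(x - h, y)) / (2 * h),
            (f(x, y + h) - f(x, y - h)) / (2 * h))


if __name__ == "__main__":
    for point in [(1.0, 0.0), (0.3, -0.7), (-2.0, 1.5)]:
        print(point, gradient(mu, *point), contract_with_omega(*point))
```

The image of $\mu$ is the half-line $(-\infty, 0]$, with the fibre collapsing to a point over the boundary, which is the prototype of the facet behaviour described in the text.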

As long as the closed one-form V ω is trivial as a cohomology class, [V ω] = 0 ∈ H 1 (M; R), then one can “integrate” this equation to a function µ, which is precisely the moment map for the U (1) action. The action is then said to be Hamiltonian. For example, the U (1) which rotates one of the circles in T2 , with obvious symplectic form, is not Hamiltonian. Clearly, if H 1 (M; R) is trivial then all symplectic actions are in fact Hamiltonian. A symplectic toric manifold is then by definition a symplectic manifold of dimension 2n with an effective Hamiltonian torus action by Tn . It is by now a classic fact in symplectic geometry that, for a compact symplectic toric manifold M, the image of M under µ is a certain kind of convex rational polytope in Rn called a Delzant polytope [21]. Recall that a polytope is just the convex hull of some finite number of points in Rn . The codimension one hyperplanes that bound the polytope are called its facets. The symplectic toric manifold is then a torus fibration over this polytope, with the fibres collapsing in a certain way over the facets. More precisely, over an interior point of the polytope the fibre of the moment map (the inverse image of the point) is the whole torus Tn , but over the boundary facets this fibre collapses to Tn−1 ∼ = Tn /U (1). Such a U (1) subgroup is specified by a vector in the weight lattice n v ∈ Z of Tn , and this vector is in fact just the normal vector to the facet. Moreover the U (1) fixes a corresponding codimension two submanifold of M. To see this, consider the case where v = e1 = (1, 0, . . . , 0). Denote the corresponding vector field as V . Then over a codimension two fixed point set F ⊂ M we have that V = 0, and moreover F is itself symplectic toric with respect to the torus Tn−1 ∼ = Tn /U (1). In particular, the moment map µ restricted to F is constant in the direction corresponding to V , i.e. µ1 = c = constant. Then µF ≡ µ |F is a moment map for F with < µF , e1 >= c. 
This defines the hyperplane at x1 = c, where {xi }, i = 1, . . . , n are coordinates on

66

D. Martelli, J. Sparks

Rn . The general case follows similarly. The normal vectors to the facets are thus all rational vectors. If two facets intersect over a codimension two face in Rn , then both the corresponding U (1)’s vanish, and the fibre over this face is a Tn−2 . Continuing in this way, the vertices of the polytope are precisely the points in M which are fixed under the entire torus action. The fact that the polytope is always convex follows from an argument using Morse theory [36, 37]. Delzant polytopes satisfy some additional conditions, as well as being rational: • simplicity – n edges meet at each vertex. • smoothness – for each vertex, the corresponding n edge vectors ui , i = 1, . . . , n form a Z–basis11 of Zn . The polytope data is sufficient to recover the original symplectic toric manifold. Moreover, the correspondence between Delzant polytopes and compact symplectic toric manifolds is one-to-one. Thus, to any Delzant polytope one can associate a corresponding symplectic toric manifold whose image under the moment map is precisely . The proof of this is by construction. This will be extremely important for us in Sect. 5 – in physics terms, the construction realises the manifold as the vacuum of a gauged linear sigma model [22]. We now briefly explain the above conditions. Assuming the first condition holds, the second condition avoids orbifold singularities. Indeed if the smoothness condition fails then Tn / < ui >∼ = is a non-trivial finite abelian group, where ui denotes the span of the ui over Z. In this case the corresponding point in M is an orbifold point with structure group . Indeed, there is a corresponding classification of symplectic toric orbifolds where the smoothness condition is dropped, and moreover one attaches to each facet a positive integer label [38]. This latter necessity can be seen by considering the weighted projective space CP1[k,l] . 
This is topologically a sphere, with neighbourhoods of the north and south poles replaced by orbifold singularities C/Zk and C/Zl , respectively. The quotient by the U (1) action which rotates around the equator is clearly just a line segment. Thus the orbifold information is completely lost when one takes the image under the moment map. To remedy this [38], quite generally, one associates to each facet a positive integer label m, such that the pre-image of any point in that facet has local orbifold structure group Zm . In the case at hand, the endpoints of the interval are assigned labels k and l, respectively. The first condition - simplicity - avoids even worse singularities than orbifold singularities. As we shall see, for symplectic toric cones this condition is not satisfied at the vertex corresponding to the apex of the cone, unless of course the cone is in fact an orbifold singularity.
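The smoothness condition can be tested mechanically. The following is a minimal sketch, assuming the standard fact that $n$ integer vectors form a $\mathbb{Z}$–basis of $\mathbb{Z}^n$ exactly when their determinant is $\pm 1$, and that otherwise the absolute value of the (non-zero) determinant is the order of the finite quotient group. The function names and the two sample vertices are illustrative choices, not taken from the paper.

```python
def det(rows):
    """Integer determinant by Laplace expansion along the first row."""
    if len(rows) == 1:
        return rows[0][0]
    total = 0
    for j, entry in enumerate(rows[0]):
        minor = [r[:j] + r[j + 1:] for r in rows[1:]]
        total += (-1) ** j * entry * det(minor)
    return total


def vertex_index(edge_vectors):
    """Order of Z^n / <u_i> for n linearly independent integer edge vectors.

    A value of 1 means the edges form a Z-basis (smooth vertex);
    a larger value signals an orbifold point with a group of that order.
    """
    return abs(det([list(u) for u in edge_vectors]))


# Hypothetical two-dimensional vertices, chosen only for illustration.
smooth = vertex_index([(1, 0), (0, 1)])    # standard corner
orbifold = vertex_index([(1, 0), (1, 2)])  # edges span an index-2 sublattice
```

In this toy example the second vertex fails the smoothness condition with a quotient group of order two.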

3.2. Toric Calabi–Yau cones. This brings us to the generalisation of this theorem [35, 20] for symplectic toric cones, which is the case of interest for us. These may be regarded as non-compact symplectic toric manifolds with a homothetic action of $\mathbb{R}_+$ which commutes with the torus action and acts by rescaling the symplectic form. In fact, every symplectic toric cone is a cone over a toric contact manifold $Y$, and vice versa. In this case the moment map for the symplectic toric cone $C(Y) = \mathbb{R}_+ \times Y$ may still be defined, away from the apex of the cone, and takes a special form. Define the one-form

$$\eta_C = r\frac{\partial}{\partial r} \,\lrcorner\, \omega, \qquad (3.5)$$

11 This means that the set $\{\sum_i n_i u_i \mid n_i \in \mathbb{Z},\ i = 1, \ldots, n\}$ is precisely $\mathbb{Z}^n$.


where $r\,\partial/\partial r$ is the Euler vector, which generates the $\mathbb{R}_+$ action on the cone, and $\omega$ is the symplectic form. Identifying the base of the cone $Y = C(Y)|_{r=1}$ we may define the one-form$^{12}$ $\eta = \eta_C|_{r=1}$. One then easily sees that

$$\omega = r\,dr \wedge \eta + \frac{1}{2} r^2\, d\eta. \qquad (3.6)$$

A straightforward calculation then shows that the moment map $\mu$ on the cone is given by

$$\langle \mu, e_i \rangle = \eta_C(V^i) \qquad (3.7)$$

for any basis vector $e_i$ of $\mathfrak{t}_n$ and corresponding vector field $V^i$. Here $\eta_C(V^i)$ just denotes the dual pairing between one-forms and vectors. The choice of integration constant makes this moment map transform homogeneously under the $\mathbb{R}_+$ homothetic action. It also ensures that the apex of the cone, at $r = 0$, is mapped to the origin of $\mathbb{R}^n$. Let us now also assume$^{13}$ that the symplectic toric cones are of Reeb type. This means that there is some element $\zeta \in \mathfrak{t}_n \cong \mathbb{R}^n$ such that $\langle \mu, \zeta \rangle$ is a strictly positive function on $C(Y)$. The image of the moment map is then a strictly convex rational polyhedral cone in $\mathbb{R}^n$ [35], which, moreover, is good in the sense of reference [20]. Recall that a rational polyhedral cone may be defined as a set of points in $\mathbb{R}^n$ of the form

$$\mathcal{C} = \{x \in \mathbb{R}^n \mid \langle x, v_i \rangle \le 0,\ i = 1, \ldots, d\}, \qquad (3.8)$$

where the rational vectors $v_i$ are the outward pointing normal vectors to the facets of the cone $\mathcal{C}$. Here we may assume that the set $\{v_i\}$ is minimal, meaning that one cannot drop any vector $v_i$ from the definition without changing the cone, and also primitive – recall that a vector with integral entries is said to be primitive if it cannot be written as $nv$, where $1 \neq n \in \mathbb{Z}$ and $v$ is also a vector with integral entries. The requirement that this polyhedral cone is strictly convex means that it is a cone over a polytope. The "conelike" nature of the subspace (3.8) of course descends from the "conelike" nature of the cone we began with: the property that $C(Y)$ is invariant under a group $\mathbb{R}_+$ of homotheties is inherited by the image under the moment map, since by definition the moment map commutes with the $\mathbb{R}_+$ action. Clearly the simplicial condition will fail at the apex (the origin of $\mathbb{R}^n$) unless $d = n$. Moreover, even in this case the smoothness condition will fail unless the edges span $\mathbb{Z}^n$. In this case, by an $SL(n; \mathbb{Z})$ transformation of the torus, one can take this to be the standard basis, whence it is easy to see that the cone one started with is just $\mathbb{R}^{2n}$ with its usual symplectic structure.

This latter point brings up an issue worth stressing: one is of course free to make an $SL(n; \mathbb{Z})$ transformation of the torus $\mathbb{T}^n$, resulting in a change of the basis $e_i$. This will generate a corresponding $SL(n; \mathbb{Z})$ transformation on the image under the moment map. Thus the polytopes and polyhedral cones are only unique up to such transformations.

As shown in [20], the image of a symplectic toric cone under its moment map is also a good polyhedral cone. This means the following. Let $F$ be a proper face of the cone $\mathcal{C}$. Over this face there will be a corresponding torus $T_F \subset \mathbb{T}^n$ which is collapsing to zero. For example, in the case that $F$ is a facet, $T_F \cong U(1)$. For a face $F$ of codimension $m$ the torus has dimension $m$: $\dim T_F = m$.
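The primitivity requirement on the normals is easy to check in practice. The sketch below takes "primitive" in the usual sense that the entries of a non-zero integer vector have greatest common divisor 1; the helper names are illustrative and not part of the paper.

```python
from functools import reduce
from math import gcd


def content(v):
    """Greatest common divisor of the entries of a non-zero integer vector."""
    return reduce(gcd, (abs(x) for x in v))


def is_primitive(v):
    """True when the entries of v share no common factor greater than 1."""
    return content(v) == 1


def primitive_part(v):
    """Rescale a non-zero integer vector to the primitive vector on the same ray."""
    c = content(v)
    return tuple(x // c for x in v)
```

For example, a rational facet normal such as `(2, -4, 6)` would be replaced by `primitive_part((2, -4, 6))` before being used in (3.8).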
12 More precisely we embed $Y$ in $C(Y)$ at $r = 1$ and then pull back $\eta_C$ to $Y$ to give $\eta$.
13 The symplectic toric cones that are not of Reeb type are rather uninteresting: they are either cones over $S^2 \times S^1$, cones over principal $\mathbb{T}^3$ bundles over $S^2$, or cones over products $\mathbb{T}^m \times S^{m+2j-1}$, $m > 1$, $j \ge 0$ [20].

Now, the torus $T_F \subset \mathbb{T}^n$ determines a lattice


$\mathbb{Z}_{T_F} = \ker(\exp : \mathfrak{t}_F \to T_F) \subset \mathbb{Z}^n$. We then require that the corresponding collection of normal vectors form an integral basis for this lattice, i.e. the collection of normal vectors span the lattice $\mathbb{Z}_{T_F}$ over $\mathbb{Z}$. This condition may be regarded as a generalisation of Delzant's conditions for symplectic toric manifolds to symplectic toric cones.

In the particular case where the symplectic cone came from a Calabi–Yau cone, one has additional information. In particular, the Sasaki–Einstein metric on $Y$ may be used to define the dual vector field $K$ with $\eta(K) = 1$. This is called the Reeb vector in the language of contact geometry. Physically this is dual to the R-symmetry of the field theory. Then there is a corresponding Lie algebra element $\zeta \in \mathfrak{t}_n$, and we have

$$\langle \mu_Y, \zeta \rangle = \eta(K) = 1. \qquad (3.9)$$

It follows that the image $\mu_Y(Y)$ lies in the above hyperplane, which is called the characteristic hyperplane [39]. In particular, note that the polytope one obtains by intersecting the polyhedral cone with the characteristic hyperplane will be rational if and only if $\zeta$ is rational. The latter condition is required precisely for quasi-regularity of the Sasaki–Einstein metric. Correspondingly, this is also the condition for the characteristic polytope to be an orbifold polytope, and thus for the quotient of $Y$ by the $U(1)$ action generated by $K$ to give an orbifold. Notice that one may then apply the modified Delzant construction of [38] to obtain a gauged linear sigma model describing this orbifold. In principle one could do this for our quasi-regular Sasaki–Einstein manifolds, although we will not pursue this here.

4. The Moment Map and Its Image

In this section we explicitly construct the polyhedral cone corresponding to the image of $C(Y^{p,q})$ under its moment map. The Calabi–Yau cones on $Y^{p,q}$ are symplectic toric cones. In particular, the $\mathbb{T}^3$ action, which is the maximal torus of the isometry group, is Hamiltonian, and one can explicitly integrate the symplectic form (2.24) to obtain a moment map. Note in fact that (2.24) can be written as

$$J = d\phi \wedge d\left(r^2\,\frac{1-y}{6}\cos\theta\right) + d\psi \wedge d\left(-r^2\,\frac{1-y}{6}\right) + d\gamma \wedge d\left(\ell\, r^2 y\right). \qquad (4.1)$$

The torus $\mathbb{T}^3$ is essentially generated by the Killing vectors $\partial/\partial\phi$, $\partial/\partial\psi$, $\partial/\partial\gamma$. However, one must be careful to ensure that the Killing vectors one takes really do form a basis for an effectively acting $\mathbb{T}^3$. Since this is a slightly subtle point, we first explain a simpler example.

A brief detour on Lens spaces. Let us consider the Lens spaces $L(1, m) = S^3/\mathbb{Z}_m$, where we regard $S^3$ as a (squashed) Hopf $S^1$ fibration over a round two-sphere. The isometry groups of the latter may be analysed as follows. Embed the round sphere $S^3$ in $\mathbb{R}^4$, and regard $\mathbb{R}^4 \cong \mathbb{H}$ as the space of quaternions.
The isometry group of $S^3$, preserving its orientation, is $SO(4) \cong (SU(2)_L \times SU(2)_R)/\mathbb{Z}_2$, where $SU(2)_{L,R}$ denote left and right actions by the unit quaternions $Sp(1) \cong SU(2)$. Thus $\mathbb{H} \ni q \mapsto a q b^{-1}$, where $(a, b) \in SU(2) \times SU(2) \cong \mathrm{Spin}(4)$. Notice that $(-1, -1)$ acts trivially, i.e. the two $SU(2)$ factors intersect precisely over the antipodal map. Thus, for a squashed three-sphere, meaning that one squashes the Hopf $S^1$ fibre relative to the base round $S^2$, we see that the isometry group is $U(2) \cong (SU(2) \times U(1))/\mathbb{Z}_2$.


However, suppose we now take a quotient of $\mathbb{R}^4 \cong \mathbb{H}$ on the right by $\mathbb{Z}_m \subset U(1)$. One still has a left $SU(2)$ action and a right $U(1)$ action, where the latter now factors through a cyclic group of order $m$. For example, take $m = 2$, thus giving $S^3/\mathbb{Z}_2 \cong \mathbb{RP}^3$. In complex coordinates, $\mathbb{H} \cong \mathbb{C} \oplus \mathbb{C}$, this means $(z_1, z_2) \sim (-z_1, -z_2)$, which identifies antipodal points on the three-sphere. It follows that the centre of $SU(2)_L$ acts trivially and hence the effectively acting isometry group is $SO(3) \times U(1)$, where $U(1)$ rotates the $S^1$ fibre with weight one, half the weight of $U(1) \subset SU(2)_R$. It now follows that the isometry group for $S^3/\mathbb{Z}_m$ for all odd $m$ is $U(2)$, whereas for even $m$ it is $SO(3) \times U(1)$; it is precisely the even cases where $\mathbb{Z}_m$ contains the antipodal map above.

Clearly these Lens spaces have an isometric $\mathbb{T}^2$ action. Take $m = 2r$. From our discussion above, if $V_1$ denotes the Killing vector that rotates the $S^2$ about its equator with weight one, and $V_2$ denotes the Killing vector that rotates the $S^1$ fibre, also with weight one, then $V_1$, $V_2$ do indeed form a basis for an effectively acting $\mathbb{T}^2$. This is the obvious $\mathbb{T}^2$ in $SO(3) \times U(1)$. For $m = 2r + 1$ one needs to be more careful: the isometry group is $U(2)$. For example, for $r = 0$ one has the unit chiral spin bundle of $S^2$. As is well-known, a single rotation of $S^2$ will not result in the spinor coming back to itself: one needs to rotate twice. For an effective action one should thus take a basis $e_1 = V_1 + \frac{1}{2} V_2$, $e_2 = V_2$. Here $e_1$ is half the generator of the diagonal $U(1)$ in $SU(2) \times U(1)$, and $V_2$ generates the $U(1)$ factor. Of course, one can use the basis $e_1 = V_1 + \frac{m}{2} V_2$, $e_2 = V_2$ quite generally in all cases. Indeed, recall that the choice of basis is unique only up to an $SL(2; \mathbb{Z})$ transformation. For $m = 2r$ even, this basis is just the $SL(2; \mathbb{Z})$ transformation

$$\begin{pmatrix} 1 & r \\ 0 & 1 \end{pmatrix} \qquad (4.2)$$

of the basis $\{V_1, V_2\}$.

The moment cone. After this brief digression, we return to the case of interest. First let us note from the results above that the isometry group of the base $B$ is $SO(3) \times U(1)$. Indeed, for fixed $y$, $y_1 < y < y_2$, we have a copy of $S^3/\mathbb{Z}_2 \cong \mathbb{RP}^3$, and the group $SO(3) \times U(1)$ acts with cohomogeneity one on $B$ with fixed $y$ as generic orbit. Thus, in particular, we may take a basis $\partial/\partial\phi + \partial/\partial\psi$, $\partial/\partial\phi$ for an effectively acting two-torus. For $C(Y^{p,q})$, one must also add the direction $\partial/\partial\gamma$. However, here one must be careful to ensure the orbits of the vectors close, and that this torus then acts effectively, just as for the Lens spaces. One finds the following choice suffices:

$$e_1 = \frac{\partial}{\partial\phi} + \frac{\partial}{\partial\psi}, \qquad e_2 = \frac{\partial}{\partial\phi} - \frac{l}{2}\frac{\partial}{\partial\gamma}, \qquad e_3 = \frac{\partial}{\partial\gamma}. \qquad (4.3)$$

Recall that the submanifolds $y = y_1$, $y = y_2$ of $Y^{p,q}$ are Lens spaces $S^3/\mathbb{Z}_k$, $S^3/\mathbb{Z}_l$, respectively, where $k = p + q$, $l = p - q$; the shift in $e_2$ is then required precisely by the reasoning above. Note that one can replace $l$ in the formula for $e_2$ by anything congruent to $l$ modulo two (for example, $k$); this is just an $SL(3; \mathbb{Z})$ transformation of the torus. Also note that for $l$ even one can in fact take a basis $\partial/\partial\phi$, $\partial/\partial\psi$, $\partial/\partial\gamma$. The


effectively acting isometry group is thus $SO(3) \times U(1) \times U(1)$ in this case. For $l$ odd this becomes $U(2) \times U(1)$. Let us now consider the moment map for $C(Y^{p,q})$. In terms of the basis $e_1$, $e_2$, $e_3$ above one finds:

$$\mu = \left( \frac{r^2}{6}(1 - y)(\cos\theta - 1),\ \ \frac{r^2}{6}(1 - y)\cos\theta - \frac{r^2}{2}\, l\, \ell\, y,\ \ \ell\, r^2 y \right). \qquad (4.4)$$

Notice that this involves the generically irrational parameter $\ell$. We will now describe the image of $\mu$, and check that it is given by a good convex rational polyhedral cone in $\mathbb{R}^3$, as predicted by the results of [35, 20]. First, note that the edges of the cone can be identified by fixing any non-zero value of $r$, say $r = 1$, and then finding the submanifolds which are fixed under some $\mathbb{T}^2 \subset \mathbb{T}^3$ action. Indeed, the edges of the cone, which generate it, are precisely the images of submanifolds in $C(Y^{p,q})$ over which some two-torus collapses. There are four such submanifolds at $r = 1$, given by the north (N) and south (S) poles of the base and fibre two-spheres: these are all copies of a circle – specifically, the fibre over the corresponding point on the base $B$. We denote the subspaces as follows: $NN = \{y = y_2, \theta = 0\}$, $NS = \{y = y_2, \theta = \pi\}$, $SN = \{y = y_1, \theta = 0\}$, $SS = \{y = y_1, \theta = \pi\}$. Then, using the useful relations$^{14}$

$$1 - y_1 = -3\,\ell\, k\, y_1, \qquad 1 - y_2 = 3\,\ell\, l\, y_2, \qquad (4.5)$$

we find (at $r = 1$)

$$\begin{aligned} \mu(NN) &= \ell\, y_2\, (0, 0, 1), \\ \mu(NS) &= \ell\, y_2\, (-l, -l, 1), \\ \mu(SN) &= \ell\, y_1\, (0, -p, 1), \\ \mu(SS) &= \ell\, y_1\, (k, q, 1). \end{aligned} \qquad (4.6)$$
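The step from (4.4) to (4.6) is pure algebra once the relations (4.5) are granted, and it can be replayed exactly. The sketch below assumes the reading of (4.4) in which the three components are $\frac{r^2}{6}(1-y)(\cos\theta-1)$, $\frac{r^2}{6}(1-y)\cos\theta - \frac{r^2}{2} l \ell y$ and $\ell r^2 y$, with $\ell$ the irrational parameter. It treats $\ell$ as a free rational stand-in and solves (4.5) for $y_1$, $y_2$, so the numbers are illustrative and are not the true roots of the cubic (2.7).

```python
from fractions import Fraction


def moment_map(p, q, ell, y, cos_theta, r2=1):
    """Components of (4.4) in the basis e1, e2, e3, with r2 standing for r^2."""
    l = p - q
    sixth = Fraction(1, 6)
    return (
        r2 * sixth * (1 - y) * (cos_theta - 1),
        r2 * sixth * (1 - y) * cos_theta - Fraction(l, 2) * ell * r2 * y,
        ell * r2 * y,
    )


def check_fixed_circles(p, q, ell):
    """Check (4.6) at r = 1, with y1 and y2 defined through the relations (4.5)."""
    k, l = p + q, p - q
    y1 = 1 / (1 - 3 * ell * k)  # from 1 - y1 = -3 ell k y1
    y2 = 1 / (1 + 3 * ell * l)  # from 1 - y2 =  3 ell l y2
    cases = [
        (y2, 1, (0, 0, 1)),     # NN
        (y2, -1, (-l, -l, 1)),  # NS
        (y1, 1, (0, -p, 1)),    # SN
        (y1, -1, (k, q, 1)),    # SS
    ]
    return all(
        moment_map(p, q, ell, y, c) == tuple(ell * y * d for d in direction)
        for y, c, direction in cases
    )
```

With these conventions the prefactor $\ell y_i$ factors out of each image point, which is why only the integer directions matter for the cone.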

Note that the irrational parameter $\ell$ has factored out and the vectors in (4.6) represent four lines which are spanned as $r$ varies from 0 to infinity. Noting that $y_1 < 0$ and $y_2 > 0$ it is then easy to verify that these are the edges of a four-faceted polyhedral cone in $\mathbb{R}^3$ generated by:

$$u_1 = [0, p, -1], \quad u_2 = [-k, -q, -1], \quad u_3 = [0, 0, 1], \quad u_4 = [-l, -l, 1] \qquad (4.7)$$

with outward-pointing primitive normals:

$$v_1 = [1, 0, 0], \quad v_2 = [1, -2, -l], \quad v_3 = [1, -1, -p], \quad v_4 = [1, -1, 0]. \qquad (4.8)$$
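The compatibility of (4.7) and (4.8) with the definition (3.8) can be checked directly: every edge $u_i$ should pair non-positively with every normal $v_j$, and should pair to zero with exactly the two facets that meet along it. The following is a minimal sketch with illustrative function names.

```python
def dot(a, b):
    """Standard pairing between two integer vectors."""
    return sum(x * y for x, y in zip(a, b))


def cone_data(p, q):
    """Edges (4.7) and outward normals (4.8) of the moment cone."""
    k, l = p + q, p - q
    edges = [(0, p, -1), (-k, -q, -1), (0, 0, 1), (-l, -l, 1)]
    normals = [(1, 0, 0), (1, -2, -l), (1, -1, -p), (1, -1, 0)]
    return edges, normals


def facets_through_edges(p, q):
    """For each edge u_i, list the (1-based) facets v_j with <u_i, v_j> = 0."""
    edges, normals = cone_data(p, q)
    table = []
    for u in edges:
        pairings = [dot(u, v) for v in normals]
        if any(x > 0 for x in pairings):
            raise ValueError("edge lies outside the cone defined by the normals")
        table.append([j + 1 for j, x in enumerate(pairings) if x == 0])
    return table
```

For $p > q > 0$ this returns the incidence pattern $u_1 \in F_1 \cap F_3$, $u_2 \in F_2 \cap F_3$, $u_3 \in F_1 \cap F_4$, $u_4 \in F_2 \cap F_4$, i.e. a cone over a quadrilateral.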

14 One can derive these using the explicit form for the periods $P_i$ of Sect. 2, after using the cubic equation (2.7).

As described above, these normals characterise codimension two fixed point sets in $C(Y^{p,q})$ over which a circle of the three-torus shrinks to zero size. The corresponding linear combination of Killing vectors in $[\partial/\partial\phi, \partial/\partial\psi, \partial/\partial\gamma]$ should then have vanishing


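norm when restricted to the pre-image of the facet. Indeed, using the metric (2.4) it is straightforward to verify that the four Killing vectors

$$V_1 = \frac{\partial}{\partial\phi} + \frac{\partial}{\partial\psi}, \quad V_2 = -\frac{\partial}{\partial\phi} + \frac{\partial}{\partial\psi}, \quad V_3 = \frac{\partial}{\partial\psi} - \frac{k}{2}\frac{\partial}{\partial\gamma}, \quad V_4 = \frac{\partial}{\partial\psi} + \frac{l}{2}\frac{\partial}{\partial\gamma} \qquad (4.9)$$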

vanish on the submanifolds given by $\theta = 0$, $\theta = \pi$, $y = y_1$, and $y = y_2$ respectively. Note that the normals obtained with the moment map use only the symplectic structure of the manifolds, whereas the norms of the Killing vectors are computed using the metrics. Let us now make the following observations about the normal vectors $v_1, \ldots, v_4$:
• $\{v_1, \ldots, v_4\}$ span $\mathbb{Z}^3$ over $\mathbb{Z}$. Indeed it is trivial to see that $\langle v_1, v_4 \rangle = \langle E_1, E_2 \rangle$. The direction $E_3$ is then obtained as a linear combination of $v_1, v_2, v_3, v_4$. Indeed, since $\mathrm{hcf}(l, p) = 1$, by Euclid's algorithm there are integers $a, b \in \mathbb{Z}$ such that $al + bp = 1$.
• For each of the four edge vectors $u_i$, $i = 1, \ldots, 4$, the corresponding two normal vectors $v_{i_1}$, $v_{i_2}$, $i_1 \neq i_2 \in \{1, 2, 3, 4\}$, with $u_i \cdot v_{i_1} = u_i \cdot v_{i_2} = 0$ satisfy

$$\{a_1 v_{i_1} + a_2 v_{i_2} \mid a_1, a_2 \in \mathbb{R}\} \cap \mathbb{Z}^3 = \{a_1 v_{i_1} + a_2 v_{i_2} \mid a_1, a_2 \in \mathbb{Z}\}. \qquad (4.10)$$
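Condition (4.10) can be tested with integer arithmetic only. The sketch below assumes the standard lattice fact that two integer vectors in $\mathbb{Z}^3$ generate every lattice point of the plane they span exactly when the components of their cross product (the $2 \times 2$ minors) have greatest common divisor 1. The pairs of adjacent facets are the ones meeting along the edges $u_1, \ldots, u_4$; function names are illustrative.

```python
from functools import reduce
from math import gcd


def cross(a, b):
    """Cross product of two integer 3-vectors; its entries are the 2x2 minors."""
    return (
        a[1] * b[2] - a[2] * b[1],
        a[2] * b[0] - a[0] * b[2],
        a[0] * b[1] - a[1] * b[0],
    )


def pair_is_saturated(a, b):
    """True when the Z-span of a, b equals (R-span of a, b) intersected with Z^3."""
    return reduce(gcd, (abs(x) for x in cross(a, b))) == 1


def cone_is_good_along_edges(p, q):
    """Check (4.10) for the two normals meeting along each edge of the cone."""
    l = p - q
    v = {1: (1, 0, 0), 2: (1, -2, -l), 3: (1, -1, -p), 4: (1, -1, 0)}
    adjacent = [(1, 3), (2, 3), (1, 4), (2, 4)]  # facets through u1, u2, u3, u4
    return all(pair_is_saturated(v[a], v[b]) for a, b in adjacent)
```

Up to sign each cross product reproduces the corresponding edge vector in (4.7), which always has an entry equal to $\pm 1$, so the check passes for every $p$, $q$.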

The second condition is precisely the condition that the cone is good, in the sense of reference [20]. Indeed, this must be true since in [20] it is shown that the image of a symplectic toric cone under its moment map is always a good rational polyhedral cone. The first property does not generically hold, but is special to the geometries we are considering. As we will see later, it is related to the fact that the Sasaki–Einstein manifolds we began with are simply-connected.

It will be useful to know the topology of the codimension two submanifolds. Let us denote them as $F_i$, where $i = 1, \ldots, 4$, respectively. Explicitly we have $F_1 = \{\theta = 0\}$, $F_2 = \{\theta = \pi\}$, $F_3 = \{y = y_1\}$, $F_4 = \{y = y_2\}$. If we project out the $\gamma$ direction, these are all copies of $S^2$. The first two, $F_1/U(1)$, $F_2/U(1)$, are the two fibres of $B = S^2 \to S^2$ over the north and south poles of the base $S^2$, and so are representatives of the cycle $C_1$. The third and fourth, $F_3/U(1)$, $F_4/U(1)$, are the sections of the $S^2$ bundle at the south and north poles of the fibre $S^2$, respectively$^{15}$. Since the $\gamma$ direction describes a principal $U(1)$ bundle over each of these spheres, the total spaces $F_i$ will be Lens spaces $L(1, m) \cong S^3/\mathbb{Z}_m$ for various values of $m \in \mathbb{Z}$. To see which Lens spaces one has, one can simply integrate the curvature two-form $\ell^{-1} dA$ over $F_i/U(1)$ for each $i = 1, \ldots, 4$. One finds

$$F_1 \cong F_2 \cong S^3/\mathbb{Z}_p, \qquad F_3 \cong S^3/\mathbb{Z}_k, \qquad F_4 \cong S^3/\mathbb{Z}_l. \qquad (4.11)$$

Thus the facets of the polyhedral cone lift to cones over the above four Lens spaces. The latter two are calibrated submanifolds, as we saw in Sect. 2.

15 These were denoted $S_1$ and $S_2$ in [6].

5. Gauged Linear Sigma Models

In this section we begin by giving a brief review of gauged linear sigma models [22]. We then move on to describe Delzant's construction [21] which from a polytope $\Delta$ constructs a gauged linear sigma model whose vacuum manifold is precisely the symplectic toric manifold corresponding to $\Delta$. The construction also goes through for cones,


provided one starts with a good convex rational polyhedral cone [20]. We then use this method to construct the sigma model for the cone $C(Y^{p,q})$. Using this approach, turning on Fayet–Iliopoulos parameters in the linear sigma model one (partially) resolves the conical singularity. As a check on our result, we explicitly show how one can recover the topology and group action on $Y^{p,q}$ from the linear sigma model description, thus closing the loop of arguments. This is summarised below:

$$C(Y^{p,q}) \ \xrightarrow{\ \text{moment map}\ }\ \text{polyhedral cone} \subset \mathbb{R}^3 \ \xrightarrow{\ \text{Delzant}\ }\ \text{linear sigma model} \ \xrightarrow{\ \text{vacuum}\ }\ C(Y^{p,q}).$$

5.1. A brief review. Let $z_1, \ldots, z_d$ denote complex coordinates on $\mathbb{C}^d$. In physics terms these will be the lowest components of chiral superfields $\Phi_i$, $i = 1, \ldots, d$. We may specify an action of the group $\mathbb{T}^r \cong U(1)^r$ on $\mathbb{C}^d$ by giving the integral charge matrix $Q = \{Q_a^i \mid i = 1, \ldots, d;\ a = 1, \ldots, r\}$; here the $a$th copy of $U(1)$ acts on $\mathbb{C}^d$ as

$$(z_1, \ldots, z_d) \mapsto (\lambda^{Q_a^1} z_1, \ldots, \lambda^{Q_a^d} z_d), \qquad (5.1)$$

where $\lambda \in U(1)$. We may then perform the Kähler quotient $X = \mathbb{C}^d /\!/ U(1)^r$ by imposing the $r$ constraints

$$\sum_{i=1}^{d} Q_a^i |z_i|^2 = t_a, \qquad a = 1, \ldots, r, \qquad (5.2)$$

where $t_a$ are constants, and then quotienting out by $U(1)^r$. The resulting space has complex dimension $n = d - r$ and inherits a Kähler structure, and thus also a symplectic structure, from that of $\mathbb{C}^d$. In physics terms, the constraints (5.2) correspond to setting the D-terms of the gauged linear sigma model to zero to give the vacuum, where $t_a$ are Fayet–Iliopoulos parameters, one for each $U(1)$ factor. The quotient by $\mathbb{T}^r$ then removes the gauge degrees of freedom. Thus the Kähler quotient of the gauged linear sigma model precisely describes the classical vacuum of the theory. Note that the Kähler class of the quotient $X$ depends linearly on the FI parameters $t_a$, and moreover even the topology of the quotient will depend on these. Also observe that, setting all $t_a = 0$, the resulting quotient will be a cone. One sees this by noting that $z_i \to \nu z_i$, $i = 1, \ldots, d$, is a symmetry in this case, where $\nu \in \mathbb{R}_+$. The conical singularity is located at $z_i = 0$, $i = 1, \ldots, d$. It is also an important fact that $c_1(X) = 0$ is equivalent to the statement that the sum of the $U(1)$ charges is zero for each $U(1)$ factor. Thus

$$\sum_{i=1}^{d} Q_a^i = 0, \qquad a = 1, \ldots, r. \qquad (5.3)$$

This latter fact ensures also that the one-loop beta function is zero. The sigma model is then Calabi–Yau, although note that the metric induced by the K¨ahler quotient is not Ricci–flat.
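The ingredients of the review, namely the action (5.1), the D-term constraints (5.2) and the charge condition (5.3), can be illustrated numerically. The sketch below uses a single $U(1)$ with charges $(1, 1, -1, -1)$ and an arbitrary sample point; both are illustrative choices. It checks that the left-hand side of (5.2) is gauge invariant and scales with weight two under $z_i \to \nu z_i$, which is why the locus $t_a = 0$ is a cone.

```python
import cmath


def d_terms(Q, z):
    """Left-hand sides of (5.2): sum_i Q_a^i |z_i|^2 for each U(1) factor a."""
    return [sum(q * abs(zi) ** 2 for q, zi in zip(row, z)) for row in Q]


def gauge_transform(Q, z, angles):
    """Apply (5.1) with lambda_a = exp(i * angles[a]) for every U(1) factor."""
    out = []
    for i, zi in enumerate(z):
        phase = sum(angle * row[i] for angle, row in zip(angles, Q))
        out.append(zi * cmath.exp(1j * phase))
    return out


def charges_sum_to_zero(Q):
    """Condition (5.3) for every row of the charge matrix."""
    return all(sum(row) == 0 for row in Q)


Q = [[1, 1, -1, -1]]                     # illustrative charge matrix
z = [1 + 2j, 0.5j, -1 + 0j, 2 - 1j]      # arbitrary sample point in C^4
before = d_terms(Q, z)
after = d_terms(Q, gauge_transform(Q, z, [0.7]))
scaled = d_terms(Q, [3 * zi for zi in z])
```

Here `after` agrees with `before` up to rounding, and `scaled` is nine times `before`, so a point with vanishing D-term stays on the constraint surface under rescaling.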


5.2. Delzant's construction: from polytopes to gauged linear sigma-models. Let us first suppose we have a Delzant polytope $\Delta$ which is the image of some compact symplectic toric manifold $M$ under its associated moment map. One can reconstruct $M$ from $\Delta$ as follows. Let $v_i \in \mathbb{Z}^n$, $i = 1, \ldots, d$, denote the outward pointing primitive normal vectors to the facets of $\Delta$. For some $\lambda_i \in \mathbb{R}$ we may then write

$$\Delta = \{x \in \mathbb{R}^n \mid \langle x, v_i \rangle \le \lambda_i,\ i = 1, \ldots, d\}. \qquad (5.4)$$

Consider now the linear map $\pi : \mathbb{R}^d \to \mathbb{R}^n$ which maps the standard basis vectors $E_i$ of $\mathbb{R}^d$ to $v_i$. Thus $\pi(E_i) = v_i$ for each $i = 1, \ldots, d$. From the Delzant properties of $\Delta$ one easily sees that this map is surjective. The kernel has dimension $r = d - n$, and defines a corresponding torus $\mathbb{T}^r \subset \mathbb{T}^d$. Now take $\mathbb{C}^d$ with its usual action by $\mathbb{T}^d$, and consider the moment map where we take the Fayet–Iliopoulos parameters to be $t_i = \lambda_i$. From the induced action by $\mathbb{T}^r \subset \mathbb{T}^d$ above, we get an induced moment map for the $\mathbb{T}^r$ action. One may now take the symplectic reduction $\mathbb{C}^d /\!/ \mathbb{T}^r$, which is a symplectic manifold of complex dimension $d - r = d - (d - n) = n$. Moreover, this quotient also inherits an action of $\mathbb{T}^n = \mathbb{T}^d/\mathbb{T}^r$ from that of $\mathbb{C}^d$ and is thus toric. In fact, it is not difficult to see that the image of $\mathbb{C}^d /\!/ \mathbb{T}^r$ under its moment map, associated to $\mathbb{T}^n$, is just $\Delta$. This is Delzant's construction [21].

As a completely trivial example, consider the two-sphere $S^2$ with canonical $U(1)$ action which rotates about the equator. The image of the moment map is just a line segment, with length proportional to the volume of the two-sphere. The outward pointing normal one-vectors are $v_1 = 1$, $v_2 = -1$. The kernel of the map $\pi : E_i \to v_i$ is thus $(1, 1)$, whence we see that $S^2$ is the symplectic reduction of $\mathbb{C}^2$ by $U(1)$ with charges $(1, 1)$; the $U(1)$ quotient is just the Hopf map $S^3 \to S^2$.
There is a corresponding construction for compact symplectic toric orbifolds, which is a generalisation that takes into account that the normals may no longer form a $\mathbb{Z}$–basis for $\mathbb{Z}^n$. This introduces finite subgroups which become local orbifold groups in the symplectic quotient [38].

A Delzant construction for cones. Recently a Delzant theorem has been proven for symplectic toric cones [20]. The language used is largely that of contact geometry – recall that a metric cone over a contact manifold is precisely a symplectic cone, and vice versa. The essential point is that the convex rational polyhedral cone one starts with must be good. This ensures that the symplectic quotient is smooth. Since the moment cones $\mu(C(Y^{p,q}))$ are all good cones, we may apply the theorem of [20]: one simply applies Delzant's construction, as in the compact case, and sets all the Fayet–Iliopoulos parameters to zero. Thus recall that the outward pointing primitive normal vectors were found to be

$$v_1 = [1, 0, 0], \quad v_2 = [1, -2, -l], \quad v_3 = [1, -1, -p], \quad v_4 = [1, -1, 0]. \qquad (5.5)$$

By inspection the kernel is $(p, p, -l, -k)$. Thus the Delzant theorem for cones gives
• $U(1)$ gauged linear sigma–model on $\mathbb{C}^4$ with charge vector $Q = (p, p, -l, -k)$.
As a preliminary check that this is indeed correct, notice that the charges sum to zero: $p + p - l - k = 0$, since $k = p + q$, $l = p - q$. It follows that the vacuum manifold $X$ of this gauged linear sigma model is topologically Calabi–Yau, $c_1(X) = 0$, just as expected. Moreover, by turning on the Fayet–Iliopoulos parameter $t$ for the $U(1)$ gauge field we will obtain orbifold resolutions of the cone.
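The kernel quoted "by inspection" can also be computed. For a rank-three integer map $\pi : \mathbb{Z}^4 \to \mathbb{Z}^3$ the kernel is spanned by the vector of signed $3 \times 3$ minors (a four-dimensional analogue of the cross product). The sketch below uses that standard identity; the function names are illustrative and the code assumes the four normals do have rank three.

```python
from functools import reduce
from math import gcd


def det3(rows):
    """Determinant of a 3x3 integer matrix given as three rows."""
    (a, b, c), (d, e, f), (g, h, i) = rows
    return a * (e * i - f * h) - b * (d * i - f * g) + c * (d * h - e * g)


def charge_vector(normals):
    """Primitive generator of ker(pi), where pi(E_i) = v_i for four normals in Z^3.

    Entry i is (-1)^i times the determinant of the three remaining normals.
    """
    minors = [det3([v for j, v in enumerate(normals) if j != i]) for i in range(4)]
    w = [(-1) ** i * m for i, m in enumerate(minors)]
    g = reduce(gcd, (abs(x) for x in w))
    return tuple(x // g for x in w)


def normals_ypq(p, q):
    """The normals (5.5)."""
    l = p - q
    return [(1, 0, 0), (1, -2, -l), (1, -1, -p), (1, -1, 0)]
```

For coprime $p$, $q$ this returns $(p, p, -l, -k)$ on the nose, for instance `charge_vector(normals_ypq(3, 1))` gives `(3, 3, -2, -4)`, and the entries sum to zero as required by (5.3).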


As interesting degenerate cases, consider $p = 1$, $q = 0$. This is the (resolved) conifold, which recall is the gauged linear sigma model on $\mathbb{C}^4$ with charges $Q = (1, 1, -1, -1)$. Another important case is $p = q = 1$. This yields $\mathbb{C}$ times the linear sigma model on $\mathbb{C}^3$ with charges $(1, 1, -2)$. The latter is $\mathcal{O}_{\mathbb{CP}^1}(-2)$. Taking $t = 0$ shrinks the $\mathbb{CP}^1$ to zero size, yielding the orbifold $\mathbb{C}^2/\mathbb{Z}_2$, which is also the $A_1$ singularity. Thus the cone is $\mathbb{C} \times (\mathbb{C}^2/\mathbb{Z}_2)$. This has $\mathcal{N} = 2$ rather than $\mathcal{N} = 1$ supersymmetry. The horizons of these two spaces are thus $T^{1,1}$ and $S^5/\mathbb{Z}_2$. If one formally takes $p = q \neq 1$ and $q = 0$, one obtains $\mathbb{Z}_p$ quotients of the cases above. In particular these will correspond to orbifolds $(\mathbb{C}^2/\mathbb{Z}_2 \times \mathbb{C})/\mathbb{Z}_p$ and $(\text{conifold})/\mathbb{Z}_p$ respectively. It is interesting to note that these are consistent with the limiting volumes (2.14), although the metrics $Y^{p,q}$ are not valid in these limits.

We can now use the results of [40] to perform further non-trivial checks. According to Theorem 1.1 of Ref. [40] we have the following general topological facts about the base $Y$ of the symplectic toric cone $C(Y)$ we began with (provided it is of Reeb type):
• $\pi_1(Y) \cong \mathbb{Z}^n/\langle v_i \rangle$ is a finite abelian group. Recall that $n = \dim(\mathbb{T}^n)$ is the complex dimension of the cone $C(Y)$.
• $\pi_2(Y)$ is a free abelian group of rank $d - n$, where $d$ is the number of facets of the moment cone.
We may now verify that these are indeed true for our examples $Y^{p,q}$ and their moment cones. In particular, for our polyhedral cones recall that the $\{v_i\}$ spanned $\mathbb{Z}^3$ over $\mathbb{Z}$, and thus $\pi_1(Y^{p,q})$ is trivial, in agreement with the fact that $Y^{p,q} \cong S^2 \times S^3$ for all $p$, $q$. Moreover, we may now relax the condition that $\mathrm{hcf}(p, q) = 1$. From the Gysin sequence for the $U(1)$ fibration corresponding to $\partial/\partial\gamma$, as in the appendix of [6], one sees that $\pi_1(Y^{p,q}) \cong \mathbb{Z}_h$, where $h \equiv \mathrm{hcf}(p, q)$. Since now $\mathrm{hcf}(l, p) = \mathrm{hcf}(p, q) = h$, Lerman's theorem says that $\pi_1(Y^{p,q}) \cong \mathbb{Z}_h$, in agreement with the Gysin sequence calculation.
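The order of the group $\mathbb{Z}^3/\langle v_i \rangle$ in the first bullet can be computed from the normals alone. The sketch below assumes the standard fact that, for integer vectors spanning a full-rank sublattice of $\mathbb{Z}^3$, the index of that sublattice equals the greatest common divisor of all $3 \times 3$ minors. It only returns the order of the quotient, not its group structure; function names are illustrative.

```python
from functools import reduce
from itertools import combinations
from math import gcd


def det3(rows):
    """Determinant of a 3x3 integer matrix given as three rows."""
    (a, b, c), (d, e, f), (g, h, i) = rows
    return a * (e * i - f * h) - b * (d * i - f * g) + c * (d * h - e * g)


def lattice_index(vectors):
    """Order of Z^3 / <vectors>, as the gcd of all 3x3 minors (full rank assumed)."""
    minors = [abs(det3(list(triple))) for triple in combinations(vectors, 3)]
    return reduce(gcd, minors)


def pi1_order(p, q):
    """Order of Z^3 / <v_1, ..., v_4> for the normals of the Y^{p,q} moment cone."""
    l = p - q
    normals = [(1, 0, 0), (1, -2, -l), (1, -1, -p), (1, -1, 0)]
    return lattice_index(normals)
```

The four minors are $k$, $l$, $p$, $p$ up to sign, so the order is $\mathrm{hcf}(k, l, p) = \mathrm{hcf}(p, q)$, matching the value $h$ obtained from the Gysin sequence.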
For the second point in the theorem, since there are four normals, we also learn that $\pi_2(Y^{p,q}) \cong \mathbb{Z}$, again in perfect agreement with the topology we started with.

5.3. The topology of the vacuum. In this subsection we verify that one can recover the topology of, as well as the action of the isometry group on, $Y^{p,q}$ correctly as the boundary, or horizon, of the linear sigma model $(p, p, -l, -k)$. Of course, this is guaranteed by the Delzant theorem of [20]. Nevertheless, it is interesting to analyse the relation explicitly, since this sheds considerable light on the geometry and topology. Since this "hands on" approach is rather technical, the reader might well omit the remainder of this section. However, we will need the relation (5.9) between vectors on $\mathbb{C}^4$ and $Y^{p,q}$ in the next section. This section also constitutes a direct proof of the equivalence of the gauged linear sigma models with the Calabi–Yau cones, without using any theorems.

A direct analysis of the topology. The point of this subsection is to show that the Kähler quotient $\mathbb{C}^4 /\!/ U(1)$ is topologically the same as $C(Y^{p,q})$. This is far from obvious, but is nevertheless guaranteed by the general theorems we have used thus far. At $z_3 = z_4 = 0$ we have a finite sized $\mathbb{CP}^1$, of size $t/p$, where recall that $t$ is the FI parameter. We may thus introduce gauge invariant coordinates $z = z_1/z_2$, $z' = z_2/z_1$ which cover the open subsets $U_2, U_1 \subset \mathbb{CP}^1$, where $U_i = \{z_i \neq 0, z_3 = z_4 = 0\} \subset \mathbb{CP}^1$. On the overlap $U_2 \cap U_1$ we have $z = 1/z'$, thus making the Riemann sphere. However, for $p > 1$ this $\mathbb{CP}^1$ is a locus of $\mathbb{Z}_p$ orbifold singularities in the Kähler quotient.


Indeed, the subgroup $\mathbb{Z}_p \subset U(1)$ stabilises the subspace $(z_1, z_2, 0, 0)$ of $\mathbb{C}^4$. The fact that we have a non-trivial isotropy subgroup means that this will descend to a locus of $\mathbb{Z}_p$ orbifold singularities in the quotient space. To analyse this singularity, consider, for example, the subspace given by $z_1 = 0$. Using a gauge transformation we may set $z_2$ to be real and positive, which is thus the north pole of the base $\mathbb{CP}^1 = S^2$. The action of $\mathbb{Z}_p$ on $(z_3, z_4)$ is generated by $(z_3, z_4) \to (z_3 \omega_p^{-l}, z_4 \omega_p^{-k})$, where $\omega_p = e^{2\pi i/p}$ generates $\mathbb{Z}_p$. Note that this is equivalent to the anti-diagonal action $(z_3, z_4) \to (z_3 \omega_p^{q}, z_4 \omega_p^{-q})$. Thus if $U(1)_A \subset SU(2) \subset U(2)$ acts on $\mathbb{C}^2$ in the usual way, we have that the generator $\omega_p$ of $\mathbb{Z}_p$ embeds in $U(1)_A$ as $\omega_p^q$. Notice that $|z_2|^2 \ge t/p$ and that, for fixed $|z_2|^2 > t/p$, the D-term imposes that the coordinates $z_3$, $z_4$ define an ellipsoid, which topologically is $S^3$ modulo the $\mathbb{Z}_p$ action just discussed. Since $q$ is prime to $p$ this is the Lens space $L(1, p)$. At $|z_2|^2 = t/p$ we have $z_3 = z_4 = 0$ and the Lens space collapses. Thus the subspace $z_1 = 0$ is a copy of an $A_{p-1}$ singularity. Performing the quotient of the Lens space "at infinity" by $U(1)_A$ then gives a two-sphere, the map being the $p$th power of the anti-Hopf map$^{16}$. Clearly the same picture holds at all points in $\mathbb{CP}^1$, not just at $z_1 = 0$, as $SO(3)$ acts as a symmetry. It follows that we have an $A_{p-1}$ fibration over this $\mathbb{CP}^1$, which thus has a boundary which is a Lens space bundle over $\mathbb{CP}^1$. In fact such a bundle structure of the metrics $Y^{p,q}$ was already noted in reference [6].

We may then quotient the boundary by $U(1)_A$ to obtain a space $\hat{B}$ that will be an $S^2$ bundle over the base $\mathbb{CP}^1 = S^2$. To see what this bundle is we may introduce coordinates as follows. Suppose $z_2 \neq 0$, giving the patch $U_2$ on the base $\mathbb{CP}^1$ with coordinate $z = z_1/z_2$. In order to effectively go to the boundary of our space, we may set $l|z_3|^2 + k|z_4|^2 = \text{constant} > 0$.
In particular, we cannot have both $z_3$ and $z_4$ zero. Suppose then that $z_3 \neq 0$. We may now introduce the additional coordinate $x_2 = \bar{z}_4/(z_3 z_2^2)$ on the fibre. This is invariant under both the original $U(1)$ action – the key point being that $k + l = 2p$ – as well as $U(1)_A$, under which the fields have charges $(0, 0, 1, -1)$. Similarly over $U_1$ we have coordinate $x_1 = \bar{z}_4/(z_3 z_1^2)$. The union of these two subspaces thus describes the bundle $\mathcal{O}_{\mathbb{CP}^1}(2)$. However, note that, due to the presence of the $\bar{z}_4$s, the complex structure here is not inherited from the complex structure of $\mathbb{C}^4$ we started with. Since we are only interested in topology and group actions, this fact will not be important for the present discussion. Similarly, for $z_4 \neq 0$ one has coordinates $w_2 = z_3 z_2^2/\bar{z}_4$ and $w_1 = z_3 z_1^2/\bar{z}_4$. This describes $\mathcal{O}_{\mathbb{CP}^1}(-2)$. The intersection of these subspaces results in the gluing of the two $\mathbb{C}$ fibres together to create a Riemann sphere $S^2$ bundle over $\mathbb{CP}^1 = S^2$; for example, $x_2 = 1/w_2$ on the overlap with $z_2, z_3, z_4 \neq 0$. Thus we obtain precisely the same description as the manifold $B$ discussed earlier: $\hat{B} \cong B$.

Topologically, the manifold $\hat{B}$ just described is the same thing as $\mathbb{P}(\mathcal{O} \oplus \mathcal{O}(-2))$, which is the second Hirzebruch surface $\mathbb{F}_2$. However, due to the $\bar{z}_4$s, the complex structure is not that inherited from $\mathbb{C}^4$. Indeed, if one replaces $\bar{z}_4$ by $z_4$ in the above coordinates, one precisely gets $\mathbb{F}_2$, as one can see by analysing the linear sigma model for this manifold$^{17}$. The fibre $S^2$ is thus perhaps best described as $\overline{\mathbb{CP}}^1$. Moreover, as explained in [6], as a real manifold $\mathbb{F}_2$ is actually just a product space $S^2 \times S^2$, i.e. the bundle is trivial.

16 Note the distinction here with the diagonal subgroup $U(1)_D$ of $U(2)$. Quotienting by this is the Hopf map, and moreover since this is a normal subgroup the quotient is also the group $U(2)/U(1)_D \cong SO(3)$. This $SO(3)$ thus acts naturally on the projected space.
17 This is a $U(1)^2$ model on $\mathbb{C}^4$ with charges $Q_1 = (1, 1, 2, 0)$, $Q_2 = (0, 0, 1, 1)$.

It remains to compute the twisting of $U(1)_A$ over this base $B$, which as we have just seen is naturally described as an $S^2$ bundle over $S^2$ with twist 2. Over the fibre


$S^2$, sitting at some point on the base $S^2 = \mathbb{CP}^1$, the twisting of the $U(1)$ is $p$, as is clear from the above discussion since the fibre sphere descended from the Lens space $L(1, p) \cong S^3/\mathbb{Z}_p$. We now compute the $U(1)$ twisting over the copies of $S^2$ at the south and north poles of the fibre $S^2$; these are two sections of the $S^2$ bundle. Call them $S_1$ and $S_2$, respectively, as in [6]. These are given by $z_3 = 0$, $z_4 = 0$, respectively, which give linear sigma models on $\mathbb{C}^3$ with weights $(p, p, -k)$, $(p, p, -l)$, respectively. The boundaries of these two spaces are Lens spaces $L(1, k)$, $L(1, l)$. To see this, note that $S^1/\mathbb{Z}_p \cong S^1$. Thus the boundaries are $S^1$ bundles over $S^2$. The twisting in each case is easily seen to be $k$ and $l$, respectively. We may now relate this to our earlier discussion. Recall that the canonical generators $C_1$, $C_2$ of the second homology of $S^2 \times S^2$ are related to the copies $S_1$, $S_2$ of $S^2$ at the south and north poles of the fibre $S^2$ by

$$2C_1 = S_1 - S_2, \qquad (5.6)$$
$$2C_2 = S_1 + S_2. \qquad (5.7)$$

We have just seen that the twisting over $S_1$ and $-S_2$ is $k$ and $l$, respectively. This gives the Chern numbers over $C_1$ and $C_2$ to be $(k + l)/2 = p$ and $(k - l)/2 = q$, respectively. We thus precisely reproduce the topology of $Y^{p,q}$ described in Sec. 2. Moreover, the (not quite effectively acting) isometry group of the Sasaki–Einstein metrics is $SU(2) \times U(1)^2$. The Kähler quotient above also has this isometry group: it is just the subgroup of $U(4)$ that commutes with the original $U(1)$ action.

Relation between Killing vector fields. It is also now interesting to examine the codimension two fixed point sets of the linear sigma model $(p, p, -l, -k)$ directly, and compare with our polyhedral cone for $C(Y^{p,q})$. Thus we now set $t = 0$. The codimension two fixed point sets are easily found: they are at $z_i = 0$, for each $i = 1, \ldots, 4$. Indeed, from our above discussion of the topology of the vacuum, these are precisely cones over the Lens spaces $S^3/\mathbb{Z}_p$, $S^3/\mathbb{Z}_p$, $S^3/\mathbb{Z}_k$, $S^3/\mathbb{Z}_l$, respectively. In terms of $Y^{p,q}$, these are the submanifolds $F_i$, $i = 1, \ldots, 4$, respectively. In particular, note that $F_3/U(1) \cong S_1$, $F_4/U(1) \cong S_2$. Thus we see explicitly that the topology of each subspace $\{z_i = 0\}$ is the same as that of $C(F_i)$. The relation between the Killing vectors is also easy to make explicit. Let us denote by $\partial/\partial\theta_i$ the $U(1)$ that rotates the coordinate $z_i$. Thus $\partial/\partial\theta_i = 0$ defines the codimension two submanifold $z_i = 0$. We find

$$2\frac{\partial}{\partial\phi} = \frac{\partial}{\partial\theta_1} - \frac{\partial}{\partial\theta_2}, \qquad 2p\frac{\partial}{\partial\psi} = l\frac{\partial}{\partial\theta_3} + k\frac{\partial}{\partial\theta_4}, \qquad p\frac{\partial}{\partial\gamma} = -\frac{\partial}{\partial\theta_3} + \frac{\partial}{\partial\theta_4}. \tag{5.8}$$
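The relations (5.8) and the four Killing vectors quoted in (5.9) can be cross-checked with a few lines of exact arithmetic. The sketch below is only an illustration, not part of the original derivation: it writes each vector field as a row of weights on $(z_1, z_2, z_3, z_4)$ and checks that the combinations in (5.9) equal the elementary rotations $\partial/\partial\theta_i$ up to a multiple of the gauge direction $(p, p, -l, -k)$, which acts trivially on the vacuum. The function name `check_killing_relations` is hypothetical.

```python
from fractions import Fraction as Fr


def check_killing_relations(p, q):
    """Return four booleans, one per line of (5.9), for k = p + q, l = p - q."""
    k, l = p + q, p - q
    gauge = [p, p, -l, -k]  # acts trivially on the vacuum

    # Weight rows read off from (5.8):
    #   2 d/dphi   = (1, -1, 0, 0)
    #   2p d/dpsi  = (0, 0, l, k)
    #   p d/dgamma = (0, 0, -1, 1)
    d_phi = [Fr(1, 2), Fr(-1, 2), Fr(0), Fr(0)]
    d_psi = [Fr(0), Fr(0), Fr(l, 2 * p), Fr(k, 2 * p)]
    d_gamma = [Fr(0), Fr(0), Fr(-1, p), Fr(1, p)]

    def comb(a, b, c):
        # a d/dphi + b d/dpsi + c d/dgamma as a weight row
        return [a * x + b * y + c * z for x, y, z in zip(d_phi, d_psi, d_gamma)]

    # Left-hand sides of (5.9), in the order theta_1, ..., theta_4.
    lhs = [
        comb(1, 1, 0),
        comb(-1, 1, 0),
        comb(0, 1, Fr(-k, 2)),
        comb(0, 1, Fr(l, 2)),
    ]

    results = []
    for i, row in enumerate(lhs):
        diff = list(row)
        diff[i] -= 1  # subtract the elementary rotation d/dtheta_i
        ratio = diff[0] / gauge[0]
        # The difference must be an exact multiple of the gauge direction.
        results.append(all(d == ratio * g for d, g in zip(diff, gauge)))
    return results
```

For example, `check_killing_relations(2, 1)` tests the case $Y^{2,1}$ that is used in Section 7.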

These require some explanation. We denote the weights of the ∂/∂θi as a row vector for convenience. Thus consider (1, −1, 0, 0). For t > 0 this precisely rotates the subspace z3 = z4 = 0, which is a copy of CP1 of size t/p, with weight two. Hence we identify this U (1) with 2∂/∂φ. Also, by construction, the ∂/∂γ direction is proportional to (0, 0, 1, −1) which, recall, we denoted U (1)A . However, the orbits of the vector (0, 0, −1, 1) actually wind p times around the circle fibre: recall the projection of this U (1) was the pth power of the anti-Hopf map. Hence this is p∂/∂γ . Finally, note


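that $\partial/\partial\psi$ rotates the fibre $S^2$ with weight one and does not act on the base $S^2$. This determines the final vector, as one can see by analysing the action on the coordinates $x_1$, $x_2$, $w_1$, $w_2$ introduced above. To make contact with the normal vectors discussed earlier, one must note that the Killing vector given by $(p, p, -l, -k)$ acts trivially on the vacuum, by construction. Thus $(p, -p, 0, 0)$ is equivalent to both $(2p, 0, -l, -k)$ and $(0, -2p, l, k)$. Thus we compute

$$\begin{aligned}
\frac{\partial}{\partial\phi} + \frac{\partial}{\partial\psi} &= \frac{\partial}{\partial\theta_1}, \\
-\frac{\partial}{\partial\phi} + \frac{\partial}{\partial\psi} &= \frac{\partial}{\partial\theta_2}, \\
\frac{\partial}{\partial\psi} - \frac{k}{2}\frac{\partial}{\partial\gamma} &= \frac{\partial}{\partial\theta_3}, \\
\frac{\partial}{\partial\psi} + \frac{l}{2}\frac{\partial}{\partial\gamma} &= \frac{\partial}{\partial\theta_4},
\end{aligned} \tag{5.9}$$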

in perfect agreement with our earlier results: the vectors on the left hand side are precisely the Killing vectors (4.9) which fixed codimension two submanifolds of $Y^{p,q}$. In particular, this means that the polyhedral cones for $C(Y^{p,q})$ and the linear sigma model with weights $(p, p, -l, -k)$ are identical, and thus they are completely equivalent as symplectic toric cones, i.e. they are equivariantly symplectomorphic. We have shown this directly in this subsection, without appealing to any theorems.

6. Toric Gorenstein Canonical Singularities

In this section we make contact with reference [5] by explaining the relation of the Calabi–Yau gauged linear sigma model $(p, p, -l, -k)$ to so-called toric Gorenstein canonical singularities. The data required to define a toric Gorenstein canonical singularity of complex dimension $n$ is a convex polygon in $\mathbb{R}^{n-1}$, all of whose vertices have integer coordinates. Given any such polygon one can reconstruct the toric singularity, as well as all of its toric crepant resolutions, as follows. Let $\{V_i \mid i = 1, \ldots, d\}$ denote all vectors in $\mathbb{R}^{n-1}$ with integer coordinates and with the property that they lie within, or on the boundary of, the polygon. Marking these points gives the toric diagram $D$. Consider now the set of all linear relations among these vectors

$$\sum_{i=1}^{d} Q^a_i V_i = 0 \tag{6.1}$$

with integer coefficients $Q^a_i$ satisfying

$$\sum_{i=1}^{d} Q^a_i = 0 \tag{6.2}$$

for each $a = 1, \ldots, r$, where $a$ labels the set of such linear relations. Clearly $r = d - n$. One now uses the matrix $Q^a_i$ as the charges of a linear sigma model on $\mathbb{C}^d$ with gauge group $U(1)^r$. This is essentially a Delzant construction. The Kähler quotient


$X = \mathbb{C}^d /\!/ U(1)^r$ has complex dimension $n = d - r$. Setting all FI parameters to zero gives the toric singularity. Moreover, by turning on the FI parameters one obtains (partial) resolutions of the singularity; special values of the FI parameters will give rise to more singular spaces than the generic values. By including all the interior points $V_i$ of the polygon, we have ensured that the linear sigma model reproduces all the toric crepant resolutions of the singularity. The sizes of the blow-ups are controlled by the FI parameters. However, this is not usually a very economical way of constructing the singularity: the minimal presentation, meaning the smallest possible $d$ and thus least number of chiral superfields, arises by using only the vertices of the polygon$^{18}$.

The toric diagram for the Calabi–Yau cone on $Y^{p,q}$ can be obtained as follows. Recall that the image of the moment map for $C(Y^{p,q})$ is a four-faceted polyhedral cone with primitive outward pointing normals

$$v_1 = [1, 0, 0], \quad v_2 = [1, -2, -l], \quad v_3 = [1, -1, -p], \quad v_4 = [1, -1, 0]. \tag{6.3}$$

Notice that these vectors lie in the plane at $e_1 = 1$. Indeed, the normals belong to a plane in $\mathbb{R}^3$ precisely when the linear sigma model is Calabi–Yau. Thus we may project onto the $e_1 = 1$ plane to obtain vectors

$$[0, 0], \quad [-2, -l], \quad [-1, -p], \quad [-1, 0]. \tag{6.4}$$

We now shift the origin by $[1, 0]$ and then make the $SL(2; \mathbb{Z})$ transformation

$$\begin{pmatrix} l - 1 & -1 \\ l & -1 \end{pmatrix} \tag{6.5}$$

to obtain vectors

$$V_1 = [l - 1, l], \quad V_2 = [1, 0], \quad V_3 = [p, p], \quad V_4 = [0, 0] \tag{6.6}$$

respectively. This is a minimal presentation of the singularity. The pictures below display some examples with low values of p. It is interesting to note that the areas of these polygons are equal to p, independently of q. Indeed, for fixed p, varying q just slides the vertex V1 up and down the hypotenuse of the triangle that defines the orbifold C3 /Zp+1 × Zp+1 . Note that for (p, q) = (2, 1) the toric diagram is the same as that for the complex cone (canonical line bundle) over the first del Pezzo surface, as we discuss in detail in the following section.
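The passage from the normals (6.3) to the toric diagram (6.6), and the statement that the polygon has area $p$ for every $q$, can be reproduced with integer arithmetic. The following is a minimal sketch under stated assumptions: the helper names `toric_diagram` and `area_and_interior` are illustrative, the polygon is traversed in the cyclic order $V_4, V_2, V_3, V_1$, and the number of interior lattice points is obtained from Pick's theorem, which is a tool brought in here and is not used in the text.

```python
from fractions import Fraction
from math import gcd


def toric_diagram(p, q):
    """Return [V1, V2, V3, V4] of (6.6) starting from the projected normals (6.4)."""
    l = p - q
    projected = [(0, 0), (-2, -l), (-1, -p), (-1, 0)]  # (6.4)
    shifted = [(x + 1, y) for x, y in projected]       # shift origin by [1, 0]
    m = ((l - 1, -1), (l, -1))                         # SL(2; Z) matrix (6.5)
    return [
        (m[0][0] * x + m[0][1] * y, m[1][0] * x + m[1][1] * y)
        for x, y in shifted
    ]


def area_and_interior(cyclic_vertices):
    """Shoelace area and interior lattice-point count (Pick's theorem)."""
    n = len(cyclic_vertices)
    twice_area = 0
    boundary = 0
    for i in range(n):
        x0, y0 = cyclic_vertices[i]
        x1, y1 = cyclic_vertices[(i + 1) % n]
        twice_area += x0 * y1 - x1 * y0
        boundary += gcd(abs(x1 - x0), abs(y1 - y0))
    twice_area = abs(twice_area)
    # Pick: A = I + B/2 - 1, so I = (2A - B + 2) / 2
    interior = (twice_area - boundary + 2) // 2
    return Fraction(twice_area, 2), interior


def diagram_summary(p, q):
    v1, v2, v3, v4 = toric_diagram(p, q)
    return area_and_interior([v4, v2, v3, v1])
```

Calling `diagram_summary(p, q)` for a few coprime pairs with $q < p$ gives area $p$ and $p - 1$ interior points, in line with the counting of FI parameters discussed after the figures.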

Fig. 1. Toric diagram of Y 2,1 embedded in the orbifold C3 /Z3 × Z3

18 If these vectors do not span $\mathbb{Z}^{n-1}$ over $\mathbb{Z}$ one must in addition quotient the Kähler quotient by the finite group $\mathbb{Z}^{n-1}/\mathrm{span}_{\mathbb{Z}}\{V_i\}$ to correctly reproduce the singularity; this follows from our general discussion in Sect. 3.


Fig. 2. Toric diagrams of Y 3,2 and Y 3,1 embedded in the orbifold C3 /Z4 × Z4

Let us also remark that the number of points inside the polygon is precisely $p - 1$. Each point corresponds to a normal vector to a plane in $\mathbb{R}^3$. The total number of Fayet–Iliopoulos parameters (Kähler parameters) is $(4 - 3) + p - 1 = p$, and by varying these one moves the planes in their normal directions so that they no longer intersect the origin. By assigning generic values one completely resolves the conical singularity. Indeed, these parameters roughly control the size of $\mathbb{CP}^1$s. We thus learn that the Calabi–Yau cone, where all FI parameters are set to zero, has $p$ collapsed two-spheres. Turning on the FI parameter $t > 0$ in the linear sigma model $(p, p, -l, -k)$ partially resolves the singularity to an $A_{p-1}$ singularity fibred over $\mathbb{CP}^1$, as discussed in the last section. Indeed, an $A_{p-1}$ singularity can be completely resolved by blowing up $(p - 1)$ two-spheres; the metric is the $p$-centered Gibbons–Hawking metric. There are precisely $(p - 1)$ FI parameters, giving $1 + (p - 1) = p$ in total.

7. The Complex Cone over $F_1$

As noted above, the toric diagram we have found for $Y^{2,1}$ is the same as that for the complex cone over the first del Pezzo surface. We will refer to the latter as $F_1$ and its complex Calabi–Yau cone as $C_{\mathbb{C}}(F_1)$. Here we elaborate on this point. In particular, it follows that we will inherit a metric on $F_1$ from that on $Y^{2,1}$, and we will write this down explicitly. Of course this metric will not be Kähler–Einstein. First we will use the toric data we have to deduce the Killing vector field on $Y^{2,1}$ corresponding to the complex cone direction. Adapting the metric to this direction, we shall indeed find a smooth metric on $F_1$. We label the five vertices of the toric diagram, including the blow-up mode corresponding to the interior point, as

$$V_1 = [0, 1], \quad V_2 = [1, 0], \quad V_3 = [2, 2], \quad V_4 = [0, 0], \quad V_5 = [1, 1]. \tag{7.1}$$

The last vector $V_5$ is the additional blow-up vertex. A possible basis for the two charge vectors is given by

$$Q_1 = (1, 1, 0, -1, -1), \qquad Q_2 = (0, 0, 1, 1, -2). \tag{7.2}$$

We thus obtain a gauged linear sigma model on $\mathbb{C}^5$ with $U(1)^2$ gauge group. Let us for the moment drop the last entry in these vectors. This gives a gauged linear sigma model on $\mathbb{C}^4$ with weights

$$\hat{Q}_1 = (1, 1, 0, -1), \qquad \hat{Q}_2 = (0, 0, 1, 1). \tag{7.3}$$


Let us take each quotient in turn. The first quotient yields $\mathbb{C} \times [\mathcal{O}_{\mathbb{CP}^1}(-1)]$, since $(1, 1, -1)$ is precisely $\mathcal{O}_{\mathbb{CP}^1}(-1)$. The former may also be regarded as $\mathcal{O}_{\mathbb{CP}^1}(0) \oplus \mathcal{O}_{\mathbb{CP}^1}(-1)$. The second row then projectivises this $\mathbb{C}^2 = \mathbb{C} \oplus \mathbb{C}$ bundle. This means one quotients each $\mathbb{C}^2$ fibre by the Hopf map $\mathbb{C}^2 \setminus \{0\} \to \mathbb{CP}^1$. The resulting space is the first Hirzebruch surface

$$F_1 = \mathbb{P}(\mathcal{O}_{\mathbb{CP}^1}(0) \oplus \mathcal{O}_{\mathbb{CP}^1}(-1)). \tag{7.4}$$

This is also the same thing as $\mathbb{CP}^2$ blown up at a point$^{19}$. Indeed, $\mathbb{CP}^2$ may be obtained by taking $\mathcal{O}(1) \to \mathbb{CP}^1$ and gluing to its boundary, which is topologically $S^3$, a ball in $\mathbb{C}^2$. Blowing up the origin in $\mathbb{C}^2$ replaces it by a $\mathbb{CP}^1$, which has local geometry $\mathcal{O}_{\mathbb{CP}^1}(-1)$. Equivalently one can describe this blowing up process as taking a connected sum with $\mathbb{CP}^2$ with reversed orientation: $\mathbb{CP}^2 \# -\mathbb{CP}^2$. We now have two copies of $\mathbb{CP}^1$ in the resulting space. In fact it is easy to see that these are two sections of $F_1$; this is precisely analogous to the topological construction of $B$. Note however that $w_2(F_1) \neq 0$ and thus this is not a spin manifold.

Adding back the fifth entry to the charge vectors (7.3) to give (7.2) then describes the canonical bundle over $F_1$: the charges sum to zero, meaning that the vacuum $X$ (Kähler quotient) is topologically Calabi–Yau, $c_1(X) = 0$. This identifies the canonical bundle, or complex cone, over $F_1$. Consider now taking a different linear combination of charge vectors, corresponding to a change of basis for the $T^2$ action. In particular, using an $SL(2; \mathbb{Z})$ transformation we may take

$$Q_1' = (2, 2, -1, -3, 0), \qquad Q_2' = (1, 1, 0, -1, -1). \tag{7.5}$$
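As a quick sanity check on (7.1), (7.2) and (7.5), the sketch below verifies that both charge vectors are linear relations among the lattice points with vanishing total charge, as required by (6.1) and (6.2), and that the second basis is obtained from the first by an integer matrix of determinant one. The specific matrix `BASIS_CHANGE` is inferred here from the quoted vectors and is an assumption of this illustration; the text only states that some $SL(2;\mathbb{Z})$ transformation is used.

```python
# Lattice points of the toric diagram, eq. (7.1)
V = [(0, 1), (1, 0), (2, 2), (0, 0), (1, 1)]

# Charge vectors, eq. (7.2)
Q1 = (1, 1, 0, -1, -1)
Q2 = (0, 0, 1, 1, -2)


def is_charge_vector(charges, points):
    """True if sum_i Q_i V_i = 0 (6.1) and sum_i Q_i = 0 (6.2)."""
    weighted = tuple(
        sum(c * pt[j] for c, pt in zip(charges, points)) for j in range(2)
    )
    return weighted == (0, 0) and sum(charges) == 0


def change_basis(matrix, qa, qb):
    """Apply a 2x2 integer matrix to the pair of charge vectors (qa, qb)."""
    (a, b), (c, d) = matrix
    first = tuple(a * x + b * y for x, y in zip(qa, qb))
    second = tuple(c * x + d * y for x, y in zip(qa, qb))
    return first, second


def det2(matrix):
    (a, b), (c, d) = matrix
    return a * d - b * c


# Assumed matrix: (Q1', Q2') = (2 Q1 - Q2, Q1)
BASIS_CHANGE = ((2, -1), (1, 0))
Q1_NEW, Q2_NEW = change_basis(BASIS_CHANGE, Q1, Q2)
```

With this choice, `Q1_NEW` reproduces $(2, 2, -1, -3, 0)$, whose first four entries are the weights $(p, p, -l, -k)$ for $p = 2$, $q = 1$.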

The first set of weights of course gives the gauged linear sigma model on $\mathbb{C}^4$ given by $(2, 2, -1, -3) = (p, p, -l, -k)$, together with a factor of $\mathbb{C}$. We may now effectively gauge away the second $U(1)$. Indeed, this means

$$\frac{\partial}{\partial\theta_5} = -\frac{\partial}{\partial\theta_4} + \frac{\partial}{\partial\theta_1} + \frac{\partial}{\partial\theta_2} = \frac{\partial}{\partial\psi} - \frac{1}{2}\frac{\partial}{\partial\gamma}, \tag{7.6}$$

acting on the linear sigma model $(2, 2, -1, -3)$ on $\mathbb{C}^4$, and $Y^{2,1}$, respectively. Here we have used the relations (5.9). Note that $\partial/\partial\theta_5$ precisely rotates the complex line fibre over $F_1$. One can check explicitly that this Killing vector field on $Y^{2,1}$ is nowhere-vanishing. Indeed, its norm-squared is computed to be

$$\left\| \frac{\partial}{\partial\theta_5} \right\|^2 = F(y) \equiv \frac{q(y)}{9} + w(y)\left( f(y) - \frac{\ell}{2} \right)^2, \tag{7.7}$$

which is strictly positive. Here

$$f(y) = \frac{a - 2y + y^2}{6(a - y^2)} \tag{7.8}$$

19 In the toric language, there is a nice way to understand this. In fact, it’s straightforward to compute the Delzant polytope for CP2 : this is an isosceles rectangular triangle. A toric blow-up is obtained by simply chopping off a vertex to give a rectangular trapezoid.


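is the function appearing in the local one-form $A$. Of course in this particular case $a$ and $\ell$ take specific values. One finds

$$a = \frac{1}{2}\left(1 - \frac{\sqrt{13}}{16}\right), \quad \ell = \frac{1}{2\sqrt{13} - 5}, \quad y_1 = \frac{1}{8}\left(1 - \sqrt{13}\right), \quad y_2 = \frac{1}{8}\left(7 - \sqrt{13}\right). \tag{7.9}$$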

Let us summarise the situation. We have found that the metric Y 2,1 is an explicit irregular Sasaki–Einstein metric on the horizon of the complex cone CC (F1 ) over F1 , where the Killing vector field (7.6) rotates the complex cone direction. Crucially this is not the Reeb vector, whose generic orbits in fact don’t close. The quotient of the metric (2.4) by the U (1) action generated by (7.6) should be a metric on F1 . We will now explicitly compute this metric and verify that it is indeed a smooth metric on F1 . In order to perform the U (1) quotient of Y 2,1 , it is useful to first rewrite the metric adapted to the Killing vector field ∂/∂θ5 . Thus, let us change coordinates: ψ = θ5 ,

γ = −/2 − θ5 /2 .

(7.10)

It is then straightforward to compute the following expression for the metric : ds 2 =

1−y 1 w(y)q(y) 2 (dθ 2 + sin2 θ dφ 2 ) + dy 2 + (d + cos θ dφ)2 6 w(y)q(y) 36F (y) +F (y) [dθ5 − C]2 ,

(7.11)

where we have defined

1 q(y) C= w(y)(f (y) − 2 ) 2 d + + w(y)f (y)(f (y) − 2 ) cos θ dφ . F (y) 9 (7.12) The quotient by ∂/∂θ5 now simply gives the metric in the first line of (7.11), which again looks like a bundle over a base two-sphere. Let us now analyze regularity of this metric. First, notice that all the functions are positive semi-definite. So, as usual, one has to worry only about the smoothness conditions where the function q(y) vanishes, and then check that the resulting periodicities give a well-defined bundle-metric. Near such a zero yi , the “fibre metric”, i.e. the metric at fixed θ, φ, takes the form ds 2 (fibre) ≈

1 |yi ||y − yi | 2 2 dy 2 + d . 12|yi ||y − yi | 3F (yi )

(7.13)

Now, crucially, the following relations are true for any $(p, q)$:

$$F(y_1) = (k - 1)^2 \ell^2 y_1^2, \qquad F(y_2) = (l + 1)^2 \ell^2 y_2^2. \tag{7.14}$$
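For $(p, q) = (2, 1)$ the relations (7.14) can be checked numerically from the values in (7.9). The sketch below is hedged in two ways. First, the functions $w(y)$ and $q(y)$ are not written out in this section; the forms used here, $w = 2(a - y^2)/(1 - y)$ and $q = (a - 3y^2 + 2y^3)/(a - y^2)$, are the standard ones for the $Y^{p,q}$ metrics and are assumed to match the paper's metric (2.4) and cubic (2.7). Second, $F$ is taken to be $q/9 + w\,(f - \ell/2)^2$, the norm-squared of $\partial/\partial\psi - \tfrac{1}{2}\partial/\partial\gamma$.

```python
from math import isclose, sqrt

SQRT13 = sqrt(13.0)

# Values quoted in (7.9) for (p, q) = (2, 1), i.e. (k, l) = (3, 1)
A = 0.5 * (1.0 - SQRT13 / 16.0)
ELL = 1.0 / (2.0 * SQRT13 - 5.0)
Y1 = (1.0 - SQRT13) / 8.0
Y2 = (7.0 - SQRT13) / 8.0
K, L = 3, 1


def cubic(y):
    # Assumed form of the cubic (2.7); y1 and y2 should be roots.
    return A - 3.0 * y**2 + 2.0 * y**3


def w(y):
    # Assumed metric function w(y)
    return 2.0 * (A - y * y) / (1.0 - y)


def q(y):
    # Assumed metric function q(y)
    return cubic(y) / (A - y * y)


def f(y):
    # Eq. (7.8)
    return (A - 2.0 * y + y * y) / (6.0 * (A - y * y))


def F(y):
    # Norm-squared of the Killing vector d/dtheta_5, eq. (7.7)
    return q(y) / 9.0 + w(y) * (f(y) - ELL / 2.0) ** 2


def min_F_on_grid(samples=1000):
    """Smallest value of F on an evenly spaced grid over [y1, y2]."""
    step = (Y2 - Y1) / samples
    return min(F(Y1 + i * step) for i in range(samples + 1))
```

Under these assumptions `F(Y1)` and `F(Y2)` agree with the right-hand sides of (7.14), and `min_F_on_grid()` stays positive, consistent with the Killing vector being nowhere-vanishing.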

Introducing R = 2|y − yi |1/2 we find that for (k, l) = (3, 1)–and only for these values– the metric approaches

1 1 2 2 2 2 ds (fibre) ≈ (7.15) dR + R d 12|yi | 4 near the two zeros. We therefore obtain a smooth metric on R2 in this neighbourhood if and only if has period 4π. Indeed, one can see that this is the induced period for


from the metric on Y 2,1 by examining the coordinate transformation (7.10): since ψ, γ and θ5 all have period 2π one can calculate the period of from the Jabobian of the coordinate transformation (7.10), which is −1/2. This indeed means that ∼ + 4π and moreover with this period we have that for fixed y, y1 < y < y2 , the resulting space is a squashed S 3 . These are then the generic orbits under the action of the isometry group U (2) on this manifold. We thus obtain an S 2 bundle over S 2 with twist one, which is topologically F1 , just as expected. Let us now label the two sections of F1 at y = y1 , y = y2 as H , E respectively. These are the hyperplane class and exceptional divisor of del Pezzo one, respectively. It is a simple exercise to compute the Chern numbers of the U (1) principle bundle, with coordinate θ5 , over these: dC dC = 3, = 1, (7.16) H 2π E 2π where, as ever, we have to use the cubic (2.7) and, in this particular case, k = 3, l = 1. Equations (7.16) give precisely the Chern numbers required so that the complex cone (or complex line bundle) defined by the U (1) bundle associated to θ5 is indeed Calabi– Yau. To see this, notice that the normal bundles of the two CP1 s corresponding to H, E inside F1 are topologically OCP1 (1) and OCP1 (−1), respectively, as is clear from our discussion of F1 above. Thus c1 (F1 ) restricted to the two cycles gives 1 + 2 = 3 and −1 + 2 = 1, respectively, where 2 = c1 (T S 2 ). The Chern numbers above for −dC precisely cancel these in the total space of the associated complex line bundle, thus giving a Calabi–Yau manifold. As shown at the end of Sect. 2.2, the cones over the U (1) bundles over H and E (which are the submanifolds y = y1 , y = y2 ) are divisors in the Calabi–Yau cone. Equivalently, the complex cones over the submanifolds H and E are divisors. Indeed, we already noted above the normal bundles to these submanifolds inside F1 , which translate into self-intersection numbers H · H = 1, E · E = −1. 
One can check that the metric on F1 is not Einstein. Thus, in particular it is not diffeomorphic to the Page metric on F1 [41], although it is rather similar in form. 8. New Non-Trivial AdS/CFT Predictions In this final section we discuss features of the gauge theory duals of the Sasaki–Einstein manifolds Y p,q , focusing in particular on Y 2,1 since a candidate dual is already known. In particular we may compare our geometrical results to the a-maximisation calculation20 presented in [14]. We find complete agreement with this field theory calculation, both for the central charge and for the SU (2)F singlet baryons of the theory. Let us first remark that, given a toric Gorenstein canonical singularity, an algorithm for constructing21 a quiver gauge theory that has the singularity as its Higgs branch has been developed in [11, 42] and subsequent works by these authors. This relies on the fact that any such singularity may be obtained by partial resolution of the orbifold C3 /Zp+1 × Zp+1 , and the field theory for the latter is known. In practise the algorithm requires a computer, even for relatively small p. However, the simple analytic expressions found in this paper suggest that all theories can be treated simultaneously. Indeed 20 Note that in [14] the central charge of the dP quiver gauge theory is also calculated, and found to 2 be quadratic irrational. 21 Note that, in earlier work, extending that of [3], the quiver gauge theories associated to some toric singularities were worked out in [43–45] without using these algorithms.


Fig. 3. Quiver diagram associated to the complex cone over dP1

it is tempting to speculate that some members of the family could be related by deformations or connected via RG-flows. In particular, we can anticipate that, at fixed p, the parameter q will govern the matter content and superpotential of an SU (N )2p quiver. Recall also that at fixed p, the central charge a is a monotonic function of q which is bounded between the values corresponding to T 1,1 /Zp and (S 5 /Z2 )/Zp : a(T 1,1 /Zp ) < a(Y p,q ) < a(S 5 /Z2 × Zp ) ,

(8.1)

suggesting that the different $q$-theories might all be related to the same “parent” orbifold model. However, we will not pursue this direction any further in the present paper. Instead, we focus on $Y^{2,1}$, where the dual quiver theory is already known. This instance already captures many of the essentially new features of these AdS/CFT duals.

A quiver gauge theory for $dP_1 \cong F_1$ was obtained$^{22}$ in [11] and is presented in Fig. 3. Let us briefly recall the notation of these diagrams. The nodes of the diagram represent different gauge group factors $U(N)$. Thus the gauge group for the theory is $U(N)^4$. An arrow from node $i$ to node $j$ represents a bifundamental field in the representation $\bar{N} \otimes N$, where the first factor denotes the anti-fundamental representation of the $i$th gauge group, and the second factor denotes the fundamental representation of the $j$th gauge group. We denote these fields as $X_{ij}$. Thus the quiver diagram encodes the field content of the theory. One must also specify the superpotential. This is given by [42]:

$$W = \epsilon_{\alpha\beta}\, \mathrm{tr}\, X_{34}^{\alpha} X_{41}^{\beta} X_{13} - \epsilon_{\alpha\beta}\, \mathrm{tr}\, X_{34}^{\alpha} X_{23}^{\beta} X_{42} + \epsilon_{\alpha\beta}\, \mathrm{tr}\, X_{12} X_{34}^{3} X_{41}^{\alpha} X_{23}^{\beta}, \tag{8.2}$$

where $\epsilon_{\alpha\beta} \in \{\pm 1\}$ and $\alpha, \beta \in \{1, 2\}$ are indices of the non-abelian flavor symmetry group $SU(2)_F$. Note that each term comes from a closed loop in the quiver. This allows one to construct gauge-invariant monomials, which may then appear in the superpotential. One is then particularly interested in the Higgs branch of such a theory. This arises by considering $U(N) \to U(1)^N$ for each gauge group factor. One effectively considers the case $N = 1$ so that the gauge theory is an abelian theory; the case $N > 1$ will simply be given by $N$ copies of the $N = 1$ case. The fields $X_{ij}$ have various charges under the $U(1)^4$ gauge group. Setting the D-terms of the gauge theory to zero and dividing by the gauge group is, as we have discussed already in a different context in this paper, a Kähler quotient construction, and the result is a toric variety (an overall $U(1)$ decouples and is physically the centre of mass $U(1)$ of the D3-branes). However, to get the vacuum of the theory one must also set the F-terms to zero, which means extremising the superpotential: $dW = 0$. This gives a system of relations among the linear sigma model fields, which define hypersurfaces in the toric variety; the intersection of these defines the Higgs branch of the theory, which is part of the moduli space of vacua. One can also get to this result by computing all invariant monomials in the fields, and then finding all relations among them, including those relations given by $dW = 0$. The slightly non-trivial fact is that this is indeed the complex cone over the first del Pezzo surface. We will not review this here, but instead refer the reader to the literature for details (see e.g. [23]). If the quiver gauge theory above is interpreted as living on a D3-brane, then this moduli space should be the geometry seen by the brane. For $N > 1$ one has $N$ D3-branes in their Higgs phase, which is why one obtains $N$ copies of the above moduli space.

22 In this section we denote the first del Pezzo surface by $dP_1$.

Let us now recall the flavour and R-symmetries of the theory. The superpotential above is manifestly invariant under the non-abelian flavour group $SU(2)_F$, for which the $\alpha, \beta$ indices form a doublet. Crucially, there is also a non-anomalous $U(1) \times U(1)$ abelian flavour symmetry which is preserved in the IR. Taking this into account, the a-maximisation calculation applied to this theory [14] then gives the exact R-charges in the IR. For the sake of clarity, these are listed$^{23}$ in Table 1. Recall that, as proposed in [9], the R-symmetry mixes with the abelian flavour symmetries, maximising, among all such admissible R-symmetries, a certain combination of 't Hooft anomalies. The value of this combination of anomalies at the critical point is the exact central charge of the theory in the infra-red, and is given by the formula

$$a = \frac{3}{32}\left( 3\,\mathrm{Tr} R^3 - \mathrm{Tr} R \right). \tag{8.3}$$

Substituting the values for the R-charges from Table 1 into (8.3), and comparing with (1.2) one finds a corresponding volume

$$\frac{13\sqrt{13} + 46}{12 \cdot 27}\, \pi^3 \tag{8.4}$$

which precisely agrees with the volume of $Y^{2,1}$ (2.13) on setting $p = 2$, $q = 1$.

Table 1. Exact R-charges computed from a-maximisation [14]

$X_{ij}$               $R_{\rm exact}$
$X_{34}^{\alpha}$      $\frac{1}{3}(-1 + \sqrt{13})$
$X_{34}^{3}$           $-3 + \sqrt{13}$
$X_{41}^{\alpha}$      $\frac{4}{3}(4 - \sqrt{13})$
$X_{23}^{\alpha}$      $\frac{4}{3}(4 - \sqrt{13})$
$X_{12}$               $\frac{1}{3}(-17 + 5\sqrt{13})$
$X_{13}$               $-3 + \sqrt{13}$
$X_{42}$               $-3 + \sqrt{13}$
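The entries of Table 1 can be fed back into (8.3) as a consistency check. The sketch below works at leading order in $N$ and per unit of $N^2$: each of the four gauge groups contributes a gaugino of R-charge 1, and each bifundamental contributes a fermion of R-charge $R - 1$, with $SU(2)_F$ doublets counted twice. Two ingredients are assumptions of this illustration because they are not spelled out in this section: this trace convention, and the standard relation $a = \pi^3 N^2 / (4\,\mathrm{vol}(Y))$, which is presumed to be the content of (1.2).

```python
from math import isclose, pi, sqrt

S = sqrt(13.0)

# Exact R-charges from Table 1
R = {
    "X34_a": (-1.0 + S) / 3.0,
    "X34_3": -3.0 + S,
    "X41_a": 4.0 * (4.0 - S) / 3.0,
    "X23_a": 4.0 * (4.0 - S) / 3.0,
    "X12": (-17.0 + 5.0 * S) / 3.0,
    "X13": -3.0 + S,
    "X42": -3.0 + S,
}

# SU(2)_F doublets count twice, singlets once
MULTIPLICITY = {
    "X34_a": 2, "X34_3": 1, "X41_a": 2, "X23_a": 2,
    "X12": 1, "X13": 1, "X42": 1,
}

# Field content of the three terms of the superpotential (8.2)
W_TERMS = [
    ("X34_a", "X41_a", "X13"),
    ("X34_a", "X23_a", "X42"),
    ("X12", "X34_3", "X41_a", "X23_a"),
]

NUM_GAUGE_GROUPS = 4


def superpotential_charges():
    """Total R-charge of each superpotential term (should be 2)."""
    return [sum(R[name] for name in term) for term in W_TERMS]


def trace(power):
    """Tr R^power per N^2 at large N: gauginos plus bifundamental fermions."""
    matter = sum(MULTIPLICITY[n] * (R[n] - 1.0) ** power for n in R)
    return NUM_GAUGE_GROUPS + matter


def central_charge():
    # Eq. (8.3), in units of N^2
    return 3.0 / 32.0 * (3.0 * trace(3) - trace(1))


def volume_from_central_charge():
    # Assumed relation a = pi^3 N^2 / (4 vol(Y))
    return pi**3 / (4.0 * central_charge())


VOLUME_8_4 = (13.0 * S + 46.0) / (12.0 * 27.0) * pi**3
```

Under these conventions every superpotential term carries R-charge 2, $\mathrm{Tr} R$ vanishes, and `volume_from_central_charge()` matches `VOLUME_8_4`.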

Let us finally consider the baryons of the gauge theory. Recall that baryonic operators $B$ of the gauge theory are dual to D3-branes wrapping supersymmetric cycles in the geometry. Their R-charges are related to the volumes of these supersymmetric cycles according to the general formula$^{24}$ [25]

$$R[B] = \frac{2}{3} \cdot \frac{\pi}{2} \cdot \frac{\mathrm{vol}(\Sigma)}{\mathrm{vol}(Y)}. \tag{8.5}$$

Recall we have shown in Sect. 2 that for each manifold $Y^{p,q}$ there are two supersymmetric 3-cycles, which are topologically Lens spaces $\Sigma_1 = S^3/\mathbb{Z}_{p+q}$ and $\Sigma_2 = S^3/\mathbb{Z}_{p-q}$. We therefore expect that in each case there will be two types of baryonic operators $B_1$, $B_2$ associated to them. Substituting for the volume (2.13) we can write down the general formula for the R-charges of the corresponding baryons in the $Y^{p,q}$ theory. These are given by the unlikely formulae:

$$\begin{aligned}
R[B_1] &= \frac{1}{3q^2}\left[ -4p^2 + 2pq + 3q^2 + (2p - q)\sqrt{4p^2 - 3q^2} \right], \\
R[B_2] &= \frac{1}{3q^2}\left[ -4p^2 - 2pq + 3q^2 + (2p + q)\sqrt{4p^2 - 3q^2} \right].
\end{aligned} \tag{8.6}$$

Note that they are interchanged by changing the sign of $q$. Setting $p = 2$, $q = 1$ the formulae give

$$R[B_1] = -3 + \sqrt{13}, \qquad R[B_2] = \frac{1}{3}\left(-17 + 5\sqrt{13}\right). \tag{8.7}$$

These agree precisely with two of the four different R-charges listed in Table 1.

23 We thank the authors of [14] for communicating the results of their calculation prior to publication.

Acknowledgements. We would like to thank M. Bertolini, F. Bigazzi, A. Hanany, K. Intriligator, E. Lerman, C. Vafa, D. Waldram, B. Wecht, and S.–T. Yau for discussions and e-mail correspondence. In particular we would like to thank E. Lerman for comments on a draft version of this paper. We are also grateful to the authors of [14] for earlier collaboration on related material, and especially for communicating their a-maximisation calculation. DM would like to thank the 2004 Simons Workshop on Mathematics and Physics, for hospitality at initial stages of this work. Part of this work was carried out whilst both authors were postdoctoral fellows at Imperial College, London. In particular DM was funded by a Marie Curie Individual Fellowship under contract number HPMF-CT-2002-01539, while JFS was supported by an EPSRC mathematics fellowship. At present JFS is supported by NSF grants DMS–0244464, DMS–0074329 and DMS–9803347.

A. The Conifold

In this appendix we compute the moment cone, gauged linear sigma model and toric diagram for the conifold, $C(T^{1,1})$. Of course, many of these results are well-known in the physics literature; we include the discussion only as a simple illustration of the systematic techniques used in this paper, in the context of an example familiar to many physicists. The homogeneous Sasaki–Einstein metric on $S^2 \times S^3$ is usually referred to as $T^{1,1}$. The metric is particularly simple [47]:

$$ds^2 = \frac{1}{6}\left( d\theta_1^2 + \sin^2\theta_1\, d\phi_1^2 + d\theta_2^2 + \sin^2\theta_2\, d\phi_2^2 \right) + \frac{1}{9}\left( d\psi + \cos\theta_1\, d\phi_1 + \cos\theta_2\, d\phi_2 \right)^2. \tag{A.1}$$

24 We suppress the overall factors of $N$.


Here $\theta_i$, $\phi_i$, $i = 1, 2$, are usual polar and axial coordinates on two round two-spheres, and $\psi$ is a coordinate on a principal $U(1)$ bundle over $S^2 \times S^2$. Here $\psi$ has period $4\pi$ so that the Chern numbers over the two-spheres are both equal to one$^{25}$. In particular, $3\partial/\partial\psi$ is the Reeb vector so that this is a regular Sasaki–Einstein manifold; the base Kähler–Einstein manifold is just $\mathbb{CP}^1 \times \mathbb{CP}^1$. The symplectic form on the metric cone is

$$\omega = \frac{1}{6} r^2 \left( \sin\theta_1\, d\theta_1 \wedge d\phi_1 + \sin\theta_2\, d\theta_2 \wedge d\phi_2 \right) - \frac{1}{3}\, r\, dr \wedge \left( d\psi + \cos\theta_1\, d\phi_1 + \cos\theta_2\, d\phi_2 \right). \tag{A.2}$$

Clearly we have three commuting Hamiltonian $U(1)$s generated by $\partial/\partial\phi_i$, $i = 1, 2$, and $\partial/\partial\psi$. As in the main text, one must be careful to ensure that one picks a basis for an effectively acting $T^3$ action when computing the moment map. If one fixes $\theta_1$, $\phi_1$ on the first two-sphere, one obtains a copy of $S^3$, written as a principal $U(1)$ bundle over the second two-sphere. The effectively acting isometry group on this squashed $S^3$ is $U(2)$, as discussed in the main text. Defining $2\nu = \psi$, so that $\nu$ has canonical period $2\pi$, one can therefore take the following basis for the $T^3$ action:

$$e_1 = \frac{\partial}{\partial\phi_1} + \frac{1}{2}\frac{\partial}{\partial\nu}, \qquad e_2 = \frac{\partial}{\partial\phi_2} + \frac{1}{2}\frac{\partial}{\partial\nu}, \qquad e_3 = \frac{\partial}{\partial\nu}. \tag{A.3}$$

The corresponding moment map, homogeneous under rescaling of the cone, is now easily computed to be

$$\mu = \left( \frac{1}{6} r^2 (\cos\theta_1 + 1),\ \frac{1}{6} r^2 (\cos\theta_2 + 1),\ \frac{1}{3} r^2 \right). \tag{A.4}$$

The image of the moment map $\mu : C(T^{1,1}) \to \mathbb{R}^3$ is a convex rational polyhedral cone generated by the four edge vectors:

$$\mu(NN) = \tfrac{1}{3}(1, 1, 1), \quad \mu(NS) = \tfrac{1}{3}(1, 0, 1), \quad \mu(SN) = \tfrac{1}{3}(0, 1, 1), \quad \mu(SS) = \tfrac{1}{3}(0, 0, 1). \tag{A.5}$$

That is, the subspaces over which a $T^2$ collapses are precisely the four subspaces $NN = \{\theta_1 = 0, \theta_2 = 0\}$, $NS = \{\theta_1 = 0, \theta_2 = \pi\}$, $SN = \{\theta_1 = \pi, \theta_2 = 0\}$, $SS = \{\theta_1 = \pi, \theta_2 = \pi\}$; these are all copies of the fibre circle over the corresponding point on the base $S^2 \times S^2$. The outward pointing primitive normal vectors to the cone are computed to be

$$v_1 = [1, 0, -1], \quad v_2 = [0, 1, -1], \quad v_3 = [0, -1, 0], \quad v_4 = [-1, 0, 0]. \tag{A.6}$$

25 One may also set $\psi$ to have period $2\pi$, yielding $T^{1,1}/\mathbb{Z}_2$, which is also a Sasaki–Einstein manifold. In fact, this is the horizon manifold of the complex cone over $F_0 \cong \mathbb{CP}^1 \times \mathbb{CP}^1$. Note that one must be careful to ensure that the Killing spinors are well-defined on making such identifications.

Notice that these indeed form a good cone, as defined in the main text. Also notice that the vectors $\{v_i\}$ span $\mathbb{Z}^3$ over $\mathbb{Z}$. Lerman's theorem then states that the base of the metric cone is simply-connected, which is of course correct. Moreover, the fact that there are four facets means that $\pi_2(T^{1,1}) \cong \mathbb{Z}$, again correct. We may now apply the Delzant theorem. The kernel is trivially calculated to be $(1, -1, -1, 1)$. Thus the theorem gives a $U(1)$ gauged linear sigma model on $\mathbb{C}^4$ with charges $(1, -1, -1, 1)$; this is of course well-known to give the conifold. Turning on the FI parameter $t > 0$, $t < 0$ gives the two small resolutions of the conifold, related by the flop transition. We now apply the $SL(3; \mathbb{Z})$ transformation

$$\begin{pmatrix} 1 & 1 & 2 \\ 0 & -1 & -1 \\ 0 & 0 & -1 \end{pmatrix} \tag{A.7}$$

to the torus $T^3$ of symmetries. The normal vectors now read

$$v_1 = [-1, 1, 1], \quad v_2 = [-1, 0, 1], \quad v_3 = [-1, 1, 0], \quad v_4 = [-1, 0, 0]. \tag{A.8}$$

These all lie in the plane at $e_1 = -1$. Dropping this gives vectors in $\mathbb{R}^2$:

$$V_1 = [1, 1], \quad V_2 = [0, 1], \quad V_3 = [1, 0], \quad V_4 = [0, 0]. \tag{A.9}$$
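The chain of steps from the edge vectors (A.5) to the toric diagram (A.9) is short enough to verify by direct integer arithmetic. The sketch below is an illustration rather than part of the original computation: it checks that each vector in (A.6) is an outward normal vanishing on exactly two edges, that $(1, -1, -1, 1)$ is a linear relation among the normals, that the matrix (A.7) has unit determinant, and that it maps (A.6) to (A.8). The edge vectors are rescaled by a factor of 3 to keep everything integral, and all helper names are hypothetical.

```python
def dot(u, v):
    return sum(a * b for a, b in zip(u, v))


def det3(m):
    """Determinant of a 3x3 matrix by cofactor expansion along the first row."""
    (a, b, c), (d, e, f), (g, h, i) = m
    return a * (e * i - f * h) - b * (d * i - f * g) + c * (d * h - e * g)


def apply(m, v):
    return tuple(dot(row, v) for row in m)


# Edge vectors of the moment cone, eq. (A.5), rescaled by 3: NN, NS, SN, SS
EDGES = [(1, 1, 1), (1, 0, 1), (0, 1, 1), (0, 0, 1)]

# Outward primitive normals, eq. (A.6)
NORMALS = [(1, 0, -1), (0, 1, -1), (0, -1, 0), (-1, 0, 0)]

# Charges of the U(1) gauged linear sigma model
CHARGES = (1, -1, -1, 1)

# SL(3; Z) transformation, eq. (A.7)
M = ((1, 1, 2), (0, -1, -1), (0, 0, -1))


def is_outward_facet_normal(v):
    """v pairs non-positively with every edge and vanishes on exactly two."""
    pairings = [dot(v, e) for e in EDGES]
    return all(x <= 0 for x in pairings) and pairings.count(0) == 2


def kernel_relation():
    """sum_i Q_i v_i, which should be the zero vector."""
    return tuple(
        sum(c * v[j] for c, v in zip(CHARGES, NORMALS)) for j in range(3)
    )


TRANSFORMED = [apply(M, v) for v in NORMALS]  # should reproduce (A.8)
DIAGRAM = [v[1:] for v in TRANSFORMED]        # drop e_1, giving (A.9)
```

The resulting `DIAGRAM` is the unit square, which is the familiar toric diagram of the conifold.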

The toric diagram may thus be embedded in the orbifold C3 /Z2 × Z2 and is presented below. We may also analyse the topology of the K¨ahler quotient directly, as in the main text. The D-term constraint reads |z1 |2 + |z2 |2 − |z3 |2 − |z4 |2 = t .

(A.10)

Setting t = 0 one obtains a singular space–the conifold. Defining gauge invariant coordinates u = z1 z3 , x = z1 z4 , y = z2 z3 , v = z2 z4 we have precisely one relation uv = xy in C4 , which is thus an equivalent definition of the conifold. At z3 = z4 = 0 we have a copy of CP1 = S 2 , of size t. On a patch in which z2 = 0 we may introduce a gauge invariant complex coordinate z = z1 /z2 . This patch covers a neighbourhood of the south pole at z1 = 0. Similarly the coordinate z = z2 /z1 covers a neighbourhood of the north pole at z2 = 0. Over the intersection of the patches we have the relation z = 1/z, thus making the Riemann sphere. Let us now turn to the remaining coordinates. Consider the subspace in which z2 = 0 and introduce gauge

Fig. 4. Toric diagram of the conifold embedded in the orbifold C3 /Z2 × Z2

88

D. Martelli, J. Sparks

invariant coordinates $x_2 = z_3 z_2$, $y_2 = z_4 z_2$. Thus, over the open set $U_2 = \{z_2 \neq 0,\ z_3 = z_4 = 0\} \subset \mathbb{CP}^1$, our subspace looks like a trivial rank two bundle $\mathbb{C}^2 \times U_2$. Similarly, over $U_1 = \{z_1 \neq 0,\ z_3 = z_4 = 0\} \subset \mathbb{CP}^1$ we also have $\mathbb{C}^2 \times U_1$, where the fibre is coordinatised by $x_1 = z_3 z_1$, $y_1 = z_4 z_1$. On the overlap $U_1 \cap U_2$ we have the relation $x_1 = x_2 (z_1/z_2)$, $y_1 = y_2 (z_1/z_2)$. By definition, this gluing gives the bundle $\mathcal{O}_{\mathbb{CP}^1}(-1) \oplus \mathcal{O}_{\mathbb{CP}^1}(-1)$, which is the resolved conifold.

The boundary, or horizon, of this manifold is an $S^3$ bundle over $\mathbb{CP}^1 = S^2$, since $S^3$ is the boundary of $\mathbb{C}^2$. There are various ways of seeing the topology of the horizon. One way is to projectivise the original bundle. Recall that to projectivise a rank two complex vector bundle, with transition functions in $U(2)$, means that one replaces each $\mathbb{C}^2$ fibre with $\mathbb{CP}^1$, and glues the fibres together across overlaps using the induced transition functions, which lie in $U(2)/U(1)_D \cong SO(3)$. Here $SO(3)$ acts on the $\mathbb{CP}^1 = S^2$ fibre in the usual way. Since the transition functions of $\mathcal{O}_{\mathbb{CP}^1}(-1) \oplus \mathcal{O}_{\mathbb{CP}^1}(-1)$ are diagonal, the projectivisation is just the product $\mathbb{CP}^1 \times \mathbb{CP}^1$. The $U(1)$ factor we projected out has unit winding over the fibre $S^2$, since $S^3 \to S^2$ is the Hopf map which has Chern number 1. The winding is also 1 over the base since we began with the sum of two copies of $\mathcal{O}_{\mathbb{CP}^1}(-1)$. Thus we see explicitly the topology of $T^{1,1}$ as the horizon manifold.

Communicated by G.W. Gibbons

Commun. Math. Phys. 262, 91–115 (2006) Digital Object Identifier (DOI) 10.1007/s00220-005-1454-y


The Threshold Effects for the Two-Particle Hamiltonians on Lattices

S. Albeverio$^{1,2,3}$, S.N. Lakaev$^{4,6}$, K.A. Makarov$^{5}$, Z.I. Muminov$^{6}$

1 Institut für Angewandte Mathematik, Universität Bonn, Germany. E-mail: [email protected]
2 SFB 611, Bonn, BiBoS, Bielefeld - Bonn, Germany
3 CERFIM, Locarno and USI, Switzerland
4 Samarkand Division of Academy of Sciences of Uzbekistan, Uzbekistan. E-mail: [email protected]. uni-bonn.de
5 Department of Mathematics, University of Missouri, Columbia, MO, USA. E-mail: [email protected]
6 Samarkand State University, Samarkand, Uzbekistan. E-mail: [email protected]

Received: 30 December 2004/ Accepted: 15 June 2005 Published online: 24 November 2005 – © Springer-Verlag 2005

Abstract: For a wide class of two-body energy operators $h(k)$ on the $d$-dimensional lattice $\mathbb{Z}^d$, $d \ge 3$, $k$ being the two-particle quasi-momentum, we prove that if the following two assumptions (i) and (ii) are satisfied, then for all nontrivial values $k$, $k \neq 0$, the discrete spectrum of $h(k)$ below its threshold is non-empty. The assumptions are: (i) the two-particle Hamiltonian $h(0)$ corresponding to the zero value of the quasi-momentum has either an eigenvalue or a virtual level at the bottom of its essential spectrum and (ii) the one-particle free Hamiltonians in the coordinate representation generate positivity preserving semi-groups.

1. Introduction

The main goal of the present paper is to give a thorough mathematical treatment of the spectral properties for the two-particle lattice Hamiltonians in dimensions $d \ge 3$ with emphasis on new threshold phenomena that are not present in the continuous case (see, e.g., [4, 8, 13–15, 17] for relevant discussions and [9, 11, 16, 29] for the general study of the low-lying excitation spectrum for quantum systems on lattices).

The kinematics of quantum quasi-particles on lattices, even in the two-particle sector, is rather exotic. For instance, due to the fact that the discrete analogue of the Laplacian or its generalizations (see (2.1) and (4.1)) are not rotationally invariant, the Hamiltonian of a system does not separate into two parts, one relating to the center-of-mass motion and the other one to the internal degrees of freedom. In particular, such a handy characteristic of inertia as mass is not available. Moreover, such a natural local substitute as the effective mass-tensor (of a ground state) depends on the quasi-momentum of the system and, in addition, it is only semi-additive (with respect to the partial order on the set of positive definite matrices). This is the so-called excess mass phenomenon for lattice systems (see, e.g., [15, 17]): the effective mass of the bound state of an $N$-particle system is greater than (but, in general, not equal to) the sum of the effective masses of the constituent quasi-particles.


The two-particle problem on lattices, in contrast to the continuous case where the usual split-off of the center of mass can be performed, can be reduced to an effective one-particle problem by using the Gelfand transform instead: the underlying Hilbert space $\ell^2((\mathbb{Z}^d)^2)$ is decomposed as a direct von Neumann integral associated with the representation of the discrete group $\mathbb{Z}^d$ by shift operators on the lattice and then the total two-body Hamiltonian appears to be decomposable as well. In contrast to the continuous case, the corresponding fiber Hamiltonians $h(k)$ associated with the direct decomposition depend parametrically on the internal binding $k$, the quasi-momentum, which ranges over a cell of the dual lattice. As a consequence, due to the loss of the spherical symmetry of the problem, the spectra of the family $h(k)$ turn out to be rather sensitive to the variation of the quasi-momentum $k$.

We recall that in the case of continuous Schrödinger operators in $\mathbb{R}^3$ one observes the emission of negative bound states from the continuous spectrum at so-called critical potential strength (see, e.g., [1, 14, 19, 26]). This phenomenon is closely related to the existence of generalized eigenfunctions, which are solutions of the Schrödinger equation with zero energy decreasing at infinity, but are not square integrable. These solutions are usually called zero-energy resonance functions and, in this case, the Hamiltonian is called a critical one and the Schrödinger operator is said to have a zero-energy resonance (virtual level). The appearance of negative bound states for critical (non-negative) Schrödinger operators under infinitesimally small negative perturbations is especially remarkable: it is the presence of zero-energy resonances in at least two of the two-particle subsystems that leads to the existence of infinitely many bound states for the corresponding three-body system, the Efimov effect (see, e.g., [2, 13, 18, 23–25, 27]).

It turns out that in the two-body lattice case there exists an extra mechanism for the bound state(s) to emerge from the threshold of the critical Hamiltonians which has nothing to do with additional (effectively negative) perturbations of the potential term. The role of the latter is rather played by the adequate change of the kinetic term which is due to the nontrivial dependence of the fiber Hamiltonians $h(k)$ on the quasi-momentum $k$ and is related to the excess mass phenomenon for lattice systems mentioned above.

The main result of the paper is the (variational) proof of existence of the discrete spectrum below the bottom of the essential spectrum of the fiber Hamiltonians $h(k)$ for all non-zero values of the quasi-momentum $0 \neq k \in \mathbb{T}^d$, provided that the Hamiltonian $h(0)$ has either a virtual level (in dimensions three and four) or a threshold eigenvalue (in all dimensions $d \ge 3$) (see Theorem 2). Apart from some technical smoothness assumptions upon the dispersion relation of normal modes $\varepsilon_\alpha(p)$, characterizing the free particles $\alpha = 1, 2$, as well as smoothness assumptions (in the momentum representation) on the two-particle interactions (Hypothesis 1), the only additional assumption made (Hypothesis 2) is that the one-particle free Hamiltonians (in the coordinate representation) generate positivity preserving semi-groups $\exp(-t\hat h_{0\alpha})$, $t > 0$, $\alpha = 1, 2$. We remark that this property is automatically fulfilled for the standard Laplacian (discrete or continuous).

The paper is organized as follows. In Sect. 2 we formulate the main hypotheses on the one-particle lattice systems in all dimensions $d \ge 1$ and prove the basic inequality (see Lemma 1 below) for the dispersion relations that are conditionally negative definite. In Sect. 3 we introduce the concept of a virtual level for the lattice one-particle Hamiltonians in dimensions $d = 3, 4$, and develop the necessary background for our further considerations. In Sect.
4 we describe the two-particle Hamiltonians in both the coordinate and the momentum representation,


introduce the two-particle quasi-momentum, and decompose the energy operator into the von Neumann direct integral of the fiber Hamiltonians $h(k)$, thus providing the reduction to the effective one-particle case. In Sect. 5 we obtain efficient bounds on the location of the discrete spectrum for the two-particle fiber Hamiltonians in dimensions $d \ge 3$ and prove the main result of this paper, Theorem 2, in the case where $h(0)$ has either a threshold eigenvalue ($d \ge 3$) or virtual level ($d = 3, 4$) at the bottom of its essential spectrum. In Appendix A, for readers' convenience, we give a proof of Proposition 1 which is a “lattice” analogue of a result due to Yafaev [28] in the continuous case. In Appendix B we construct an explicit example of a one-particle discrete Schrödinger operator on the three-dimensional lattice $\mathbb{Z}^3$ that possesses both a virtual level and threshold eigenvalue at the bottom of its essential spectrum (cf., e.g., [1, 3, 12] for related discussions in the case of continuous Schrödinger operators).

2. The One-Particle Hamiltonian

2.1. Dispersion relations. The free Hamiltonian $\hat h_0$ of a quantum particle on the $d$-dimensional lattice $\mathbb{Z}^d$, $d \ge 1$, is usually associated with the following self-adjoint (bounded) multidimensional Toeplitz-type operator on the Hilbert space $\ell^2(\mathbb{Z}^d)$ (see, e.g., [15]):

$$(\hat h_0 \hat\psi)(x) = \sum_{s \in \mathbb{Z}^d} \hat\varepsilon(s)\,\hat\psi(x + s), \qquad \hat\psi \in \ell^2(\mathbb{Z}^d). \tag{2.1}$$

Here the series $\sum_{s \in \mathbb{Z}^d} \hat\varepsilon(s)$ is assumed to be absolutely convergent, that is,

$$\{\hat\varepsilon(s)\}_{s \in \mathbb{Z}^d} \in \ell^1(\mathbb{Z}^d).$$

We also assume that the “self-adjointness” property is fulfilled:

$$\hat\varepsilon(s) = \overline{\hat\varepsilon(-s)}, \qquad s \in \mathbb{Z}^d.$$

In the physical literature, the symbol of the Toeplitz operator $\hat h_0$ given by the Fourier series

$$\varepsilon(p) = \sum_{s \in \mathbb{Z}^d} \hat\varepsilon(s)\, e^{i(p,s)}, \qquad p \in \mathbb{T}^d, \tag{2.2}$$

being a real-valued function on $\mathbb{T}^d$, is called the dispersion relation of normal modes associated with the free particle in question (note that the Fourier coefficients of the function $\varepsilon(p)$ differ from the coefficients $\hat\varepsilon(s)$ in (2.2) by the factor $(2\pi)^{d/2}$). The one-particle free Hamiltonian is required to be of the form

$$\hat h_0 = \varepsilon(-i\nabla),$$

where $\nabla$ is the generator of the infinitesimal translations. Under the mild assumption that $\hat v \in \ell^\infty(\mathbb{Z}^d)$,


where $\hat v = \{\hat v(s)\}_{s \in \mathbb{Z}^d}$ is a sequence of reals, the one-particle Hamiltonian $\hat h$,

$$\hat h = \hat h_0 + \hat v,$$

describing the quantum particle moving in the potential field $\hat v$, is a bounded self-adjoint operator on the Hilbert space $\ell^2(\mathbb{Z}^d)$. The one-particle Hamiltonian $h$ in the momentum representation is introduced as

$$h = \mathcal{F}^{-1} \hat h\, \mathcal{F},$$

where $\mathcal{F}$ stands for the standard Fourier transform $\mathcal{F} : L^2(\mathbb{T}^d) \to \ell^2(\mathbb{Z}^d)$, and $\mathbb{T}^d$ denotes the $d$-dimensional torus, the cube $(-\pi, \pi]^d$ with appropriately identified sides. Throughout the paper the torus $\mathbb{T}^d$ will always be considered as an abelian group with respect to the addition and multiplication by real numbers regarded as operations on $\mathbb{R}^d$ modulo $(2\pi\mathbb{Z})^d$.

2.2. Hamiltonians generating the positivity preserving semi-groups. The following important subclass of the one-particle systems is of certain interest (see, e.g., [6]). It is introduced by the additional requirement that the dispersion relation $\varepsilon(p)$ is a real-valued continuous conditionally negative definite function and hence (i) $\varepsilon$ is an even function, (ii) $\varepsilon(p)$ has a minimum at $p = 0$. Recall (see, e.g., [21]) that a complex-valued bounded function $\varepsilon : \mathbb{T}^d \to \mathbb{C}$ is called conditionally negative definite if $\varepsilon(p) = \overline{\varepsilon(-p)}$ and

$$\sum_{i,j=1}^{n} \varepsilon(p_i - p_j)\, z_i \bar z_j \le 0 \tag{2.3}$$

for any $n \in \mathbb{N}$, for all $p_1, p_2, \dots, p_n \in \mathbb{T}^d$ and all $z = (z_1, z_2, \dots, z_n) \in \mathbb{C}^n$ satisfying $\sum_{i=1}^{n} z_i = 0$. It is known that in this case the dispersion relation $\varepsilon(p)$ admits the (Lévy-Khinchin) representation (see, e.g., [5])

$$\varepsilon(p) = \varepsilon(0) + \sum_{s \in \mathbb{Z}^d \setminus \{0\}} \bigl(e^{i(p,s)} - 1\bigr)\hat\varepsilon(s), \qquad p \in \mathbb{T}^d,$$

which is equivalent to the requirement that the Fourier coefficients $\hat\varepsilon(s)$ with $s \neq 0$ are non-positive, that is,

$$\hat\varepsilon(s) \le 0, \qquad s \neq 0,$$

and the series $\sum_{s \in \mathbb{Z}^d \setminus \{0\}} \hat\varepsilon(s)$ converges absolutely. In turn, this is also equivalent to the requirement that the lattice Hamiltonian $\hat h = \hat h_0 + \hat v$ generates the positivity preserving semi-group $e^{-t\hat h}$, $t > 0$, on $\ell^2(\mathbb{Z}^d)$ (see, e.g., [21] Ch. XIII).

Following [6] we call the free Hamiltonians $\hat h_0 = \varepsilon(-i\nabla)$ generating the positivity preserving semi-groups the generalized Laplacians. The following example shows that the standard discrete Laplacian is a generalized Laplacian in the sense mentioned above.


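Example 1. For the one-particle free Hamiltonian

$$(\hat h_0 \hat\psi)(x) = (-\Delta\hat\psi)(x) = \sum_{|s|=1} \bigl[\hat\psi(x) - \hat\psi(x + s)\bigr], \qquad x \in \mathbb{Z}^d, \quad \hat\psi \in \ell^2(\mathbb{Z}^d),$$

the (Fourier) coefficients $\hat\varepsilon(s)$, $s \in \mathbb{Z}^d$, from (2.1) are necessarily of the form

$$\hat\varepsilon(s) = \begin{cases} 2d, & s = 0, \\ -1, & |s| = 1, \\ 0, & \text{otherwise}. \end{cases}$$

Hence, the corresponding dispersion relation

$$\varepsilon(p) = 2 \sum_{i=1}^{d} (1 - \cos p_i), \qquad p = (p_1, p_2, \dots, p_d) \in \mathbb{T}^d, \tag{2.4}$$

is a conditionally negative definite function.

We need a simple inequality which will play a crucial role in the proof of the main results of the paper, Theorems 1 and 2.

Lemma 1. Assume that the dispersion relation $\varepsilon(p)$ is a real-valued continuous conditionally negative definite function on $\mathbb{T}^d$. Assume, in addition, that $\varepsilon(0)$ is the unique minimum of the function $\varepsilon(p)$. Then for all $q \in \mathbb{T}^d \setminus \{0\}$ the inequality

$$\varepsilon(p) + \varepsilon(q) > \frac{\varepsilon(p + q) + \varepsilon(p - q)}{2} + \varepsilon(0), \qquad \text{a.e. } p \in \mathbb{T}^d, \tag{2.5}$$

holds.

Proof. Fix a $q \in \mathbb{T}^d$, $q \neq 0$. Then there exists an $s_0 \in \mathbb{Z}^d \setminus \{0\}$ such that $\hat\varepsilon(s_0) < 0$ and $\cos(q, s_0) \neq 1$ (otherwise $\varepsilon(q) = \sum_{s \in \mathbb{Z}^d} \hat\varepsilon(s) = \varepsilon(0)$, which contradicts the hypothesis that $\varepsilon(0)$ is the unique minimum of the function $\varepsilon(\cdot)$ on $\mathbb{T}^d$). Since $\hat\varepsilon(s_0) < 0$ and $\cos(q, s_0) \neq 1$, the inequality

$$\begin{aligned}
F(p, q) &\equiv \varepsilon(p) + \varepsilon(q) - \frac{\varepsilon(p + q) + \varepsilon(p - q)}{2} - \varepsilon(0) \\
&= \sum_{s \in \mathbb{Z}^d} \hat\varepsilon(s) \Bigl[\cos(p, s) + \cos(q, s) - \frac{\cos(p + q, s) + \cos(p - q, s)}{2} - 1\Bigr] \\
&\ge 2\hat\varepsilon(s_0) \Bigl[\cos(p, s_0) + \cos(q, s_0) - \frac{\cos(p + q, s_0) + \cos(p - q, s_0)}{2} - 1\Bigr] \\
&= 2\hat\varepsilon(s_0)\,(\cos(p, s_0) - 1)(1 - \cos(q, s_0)) > 0, \qquad (p, s_0) \neq 2n\pi, \ n \in \mathbb{Z},
\end{aligned} \tag{2.6}$$

completes the proof.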
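As a sanity check of Lemma 1, the following minimal Python sketch evaluates the quantity $F(p, q)$ from (2.6) for the standard discrete Laplacian of Example 1 and compares it with the closed form $2\sum_i (1 - \cos p_i)(1 - \cos q_i)$, which follows from (2.6) with $\hat\varepsilon(\pm e_i) = -1$. The function names `eps`, `lemma1_gap` and `closed_form_gap` are illustrative choices, not notation from the paper, and the check covers only this one dispersion relation.

```python
import math


def eps(p):
    """Dispersion relation (2.4) of the standard discrete Laplacian."""
    return 2.0 * sum(1.0 - math.cos(x) for x in p)


def lemma1_gap(p, q):
    """F(p, q) = eps(p) + eps(q) - (eps(p+q) + eps(p-q))/2 - eps(0), cf. (2.6)."""
    plus = [a + b for a, b in zip(p, q)]
    minus = [a - b for a, b in zip(p, q)]
    zero = [0.0] * len(p)
    return eps(p) + eps(q) - 0.5 * (eps(plus) + eps(minus)) - eps(zero)


def closed_form_gap(p, q):
    """Closed form of F for (2.4): 2 * sum_i (1 - cos p_i) * (1 - cos q_i)."""
    return 2.0 * sum((1.0 - math.cos(a)) * (1.0 - math.cos(b))
                     for a, b in zip(p, q))
```

Each summand of the closed form is non-negative, and it vanishes only when $p_i$ or $q_i$ is a multiple of $2\pi$, which is why the strict inequality (2.5) holds for almost every $p$ once $q \neq 0$.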


3. A Virtual Level and Threshold Eigenvalues

In order to introduce the concept of a virtual level (threshold resonance) for the (lattice) energy operator $h$ in dimensions three and four ($d = 3, 4$) we assume the following technical hypotheses that guarantee some smoothness of the dispersion relation $\varepsilon(p)$ and the continuity of the Fourier transform

$$v(p) = (2\pi)^{-d/2} \sum_{s \in \mathbb{Z}^d} \hat v(s)\, e^{i(p,s)}, \qquad d \ge 3, \tag{3.1}$$

of the interaction $\hat v$.

Hypothesis 1. Assume that the dispersion relation $\varepsilon(p)$ is a continuous (periodic) real-valued function on $\mathbb{T}^d$ with a unique (non-degenerate) minimum at the origin such that

$$\liminf_{|p| \to 0} \frac{\varepsilon(p) - \varepsilon(0)}{|p|^2} > 0.$$

Assume, in addition, that $v(p)$ is a continuous function on $\mathbb{T}^d$ such that

$$v(p) = \overline{v(-p)}, \qquad p \in \mathbb{T}^d.$$

We remark that under Hypothesis 1 the sequence $\{\hat v(s)\}_{s \in \mathbb{Z}^d}$ of the Fourier coefficients of the function $v(p)$ is an element of $\ell^2(\mathbb{Z}^d)$ and then equality (3.1) should be understood as follows: the $L^2(\mathbb{T}^d)$-function $(2\pi)^{-d/2} \sum_{s \in \mathbb{Z}^d} \hat v(s)\, e^{i(p,s)}$ has the continuous representative $v(p)$.

For $\lambda \le \varepsilon(0)$ on the Banach space $C(\mathbb{T}^d)$, $d \ge 3$, of continuous (periodic) functions on $\mathbb{T}^d$ we shall consider the integral operator $G(\lambda)$ with the (Birman-Schwinger) kernel function

$$G(p, q; \lambda) = (2\pi)^{-d/2}\, v(p - q)\,(\varepsilon(q) - \lambda)^{-1}, \qquad p, q \in \mathbb{T}^d. \tag{3.2}$$

Lemma 2. Let $d \ge 3$. Assume Hypothesis 1. Then for $\lambda \le \varepsilon(0)$ the operator $G(\lambda)$ on $C(\mathbb{T}^d)$ given by (3.2) is compact.

Proof. Given $f \in L^1(\mathbb{T}^d)$, for the function $g$ introduced by

$$g(p) = (2\pi)^{-d/2} \int_{\mathbb{T}^d} v(p - q) f(q)\, dq,$$

one has the estimates

$$|g(p)| \le (2\pi)^{-d/2} \sup_{p, q \in \mathbb{T}^d} |v(p - q)|\, \|f\|_{L^1(\mathbb{T}^d)} \tag{3.3}$$

and

$$\begin{aligned}
|g(p + \ell) - g(p)| &= \Bigl| (2\pi)^{-d/2} \int_{\mathbb{T}^d} \bigl(v(p + \ell - q) - v(p - q)\bigr) f(q)\, dq \Bigr| \\
&\le (2\pi)^{-d/2} \sup_{t \in \mathbb{T}^d} |v(t + \ell) - v(t)|\, \|f\|_{L^1(\mathbb{T}^d)}.
\end{aligned} \tag{3.4}$$


Since for $\lambda \le \varepsilon(0)$ and $d \ge 3$ the function $(\varepsilon(\cdot) - \lambda)^{-1}$ is integrable, the multiplication operator by the function $(\varepsilon(\cdot) - \lambda)^{-1}$ from $C(\mathbb{T}^d)$ into $L^1(\mathbb{T}^d)$ is continuous. Therefore, from (3.3) and (3.4) it follows that the image of the unit ball under $G(\lambda)$ in $C(\mathbb{T}^d)$ consists of functions that are uniformly bounded and equicontinuous: $v$ is continuous and, therefore,

$$\lim_{|\ell| \to 0}\ \sup_{t \in \mathbb{T}^d} |v(t + \ell) - v(t)|\, \|f\|_{L^1(\mathbb{T}^d)} = 0.$$

An application of the Arzelà-Ascoli Theorem then completes the proof.
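The integrability of $(\varepsilon(\cdot) - \varepsilon(0))^{-1}$ for $d \ge 3$, on which the proof above relies, can be illustrated numerically. The sketch below is only an illustration under the assumption that $\varepsilon$ is the dispersion relation (2.4) of the standard discrete Laplacian (so $\varepsilon(0) = 0$); the helper name `mean_inverse_eps` is hypothetical. It applies a midpoint rule whose nodes never hit the singular point $q = 0$.

```python
import itertools
import math


def eps(p):
    """Dispersion relation (2.4) of the standard discrete Laplacian."""
    return 2.0 * sum(1.0 - math.cos(x) for x in p)


def mean_inverse_eps(d, n):
    """Midpoint-rule estimate of (2*pi)^(-d) * integral over T^d of 1/eps(q).

    n is the number of cells per coordinate. It must be even so that no
    midpoint coincides with the singular point q = 0.
    """
    if n % 2:
        raise ValueError("n must be even to avoid sampling q = 0")
    h = 2.0 * math.pi / n
    nodes = [-math.pi + (j + 0.5) * h for j in range(n)]
    total = sum(1.0 / eps(q) for q in itertools.product(nodes, repeat=d))
    # Each cell has volume h**d, and (h / (2*pi))**d equals 1 / n**d.
    return total / n ** d
```

For `d = 3` the estimates for increasing `n` should settle down, in line with the integrability used in Lemma 2, while for `d = 2` they keep growing roughly like $\log n$, which is the threshold singularity mentioned for low dimensions in Remark 2.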

Remark 1. Let $d \ge 3$. Clearly (cf. [28]), the operator $h$ has an eigenvalue $\lambda \le \varepsilon(0)$, that is, $\operatorname{Ker}(h - \lambda I) \neq 0$, if and only if the compact operator $G(\lambda)$ on $C(\mathbb{T}^d)$ has an eigenvalue $-1$ and there exists a function $\psi \in \operatorname{Ker}(G(\lambda) + I)$ such that the function $f$ given by

$$f(p) = \frac{\psi(p)}{\varepsilon(p) - \lambda} \qquad \text{a.e. } p \in \mathbb{T}^d, \tag{3.5}$$

belongs to $L^2(\mathbb{T}^d)$. In this case $f \in \operatorname{Ker}(h - \lambda I)$. Moreover, if $\lambda < \varepsilon(0)$, then

$$\dim \operatorname{Ker}(h - \lambda I) = \dim \operatorname{Ker}(G(\lambda) + I) \tag{3.6}$$

and

$$\operatorname{Ker}(h - \lambda I) = \Bigl\{ f \ \Big|\ f(\cdot) = \frac{\psi(\cdot)}{\varepsilon(\cdot) - \lambda},\ \psi \in \operatorname{Ker}(G(\lambda) + I) \Bigr\}.$$

In the case of a threshold eigenvalue $\lambda = \varepsilon(0)$ equality (3.6) may fail to hold only if $d = 3$ or $d = 4$ (in dimensions $d \ge 5$ the function $f(p)$ given by (3.5) always belongs to $L^2(\mathbb{T}^d)$, cf. Lemma 3 below). In dimensions $d = 3$ or $d = 4$ equality (3.6) should be replaced by the inequality

$$\dim \operatorname{Ker}(h - \varepsilon(0) I) \le \dim \operatorname{Ker}(G(\varepsilon(0)) + I).$$

In order to discuss the threshold phenomena, that is, the case $\lambda = \varepsilon(0)$, following [3, 7] (see also [12] for a related discussion), under Hypothesis 1 we distinguish five mutually disjoint cases:

Case I. $-1$ is not an eigenvalue of $G(\varepsilon(0))$, that is, $0 = \dim \operatorname{Ker}(h - \lambda I) = \dim \operatorname{Ker}(G(\lambda) + I)$.

Case II. $-1$ is a simple eigenvalue of $G(\varepsilon(0))$ and the associated eigenfunction $\psi$ satisfies the condition

$$\frac{\psi(\cdot)}{\varepsilon(\cdot) - \varepsilon(0)} \notin L^2(\mathbb{T}^d),$$

that is, $0 = \dim \operatorname{Ker}(h - \lambda I)$ and $\dim \operatorname{Ker}(G(\lambda) + I) = 1$.


Case III. $-1$ is an eigenvalue of $G(\varepsilon(0))$ and any of the associated eigenfunctions $\psi$ satisfies the condition

$$\frac{\psi(\cdot)}{\varepsilon(\cdot) - \varepsilon(0)} \in L^2(\mathbb{T}^d),$$

that is, $1 \le \dim \operatorname{Ker}(h - \lambda I) = \dim \operatorname{Ker}(G(\lambda) + I)$.

Case IV. $-1$ is a multiple eigenvalue of $G(\varepsilon(0))$ and at least one (up to a normalization) of the associated eigenfunctions $\psi$ satisfies the condition

$$\frac{\psi(\cdot)}{\varepsilon(\cdot) - \varepsilon(0)} \notin L^2(\mathbb{T}^d),$$

that is, $2 \le \dim \operatorname{Ker}(G(\lambda) + I) \ge \dim \operatorname{Ker}(h - \lambda I) + 1$.

Case V. $-1$ is a multiple eigenvalue of $G(\varepsilon(0))$ and $\dim \operatorname{Ker}(h - \lambda I) + 2 \le \dim \operatorname{Ker}(G(\lambda) + I)$.

Given the classification above, we arrive at the following definition of a virtual level in dimensions $d = 3$ or $d = 4$ (in dimensions $d \ge 5$ Cases II, IV and V do not occur (see Remark 1)).

Definition 1. Let $d = 3, 4$. In Cases II, IV and V the operator $h$ is said to have a virtual level (at the threshold).

Remark 2. Our definition of a virtual level is equivalent to the direct analogue of that in the continuous case (see, e.g., [1, 23, 25, 27, 28] and references therein). One can also introduce the concept of a virtual level in dimensions $d = 1$ and $d = 2$. However, due to additional threshold singularities of the Birman-Schwinger kernel in the momentum representation (cf. (3.2)) our approach is not directly applicable in low dimensions ($d = 1, 2$).

Lemma 3. Let $d \ge 3$. Assume Hypothesis 1 and suppose that $\psi \in \operatorname{Ker}(G(\varepsilon(0)) + I)$. Then the function

$$f(p) = \frac{\psi(p)}{\varepsilon(p) - \varepsilon(0)}, \qquad p \in \mathbb{T}^d,$$

belongs to the weak space $L_w^{d/2}(\mathbb{T}^d)$.

Proof. Recall that $f$ belongs to the weak $L^q$-space if

$$\sup_{t > 0}\ t^q \operatorname{mes}\{p \mid |f(p)| > t\} < \infty.$$

By Hypothesis 1 there exists a positive constant $C$ such that

$$\varepsilon(p) - \varepsilon(0) \ge C|p|^2, \qquad p \in \mathbb{T}^d,$$


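and then, since $\psi \in C(\mathbb{T}^d)$,

$$\begin{aligned}
\operatorname{mes}\{p \in \mathbb{T}^d \mid |f(p)| > t\} &= \operatorname{mes}\Bigl\{p \in \mathbb{T}^d \ \Big|\ \frac{|\psi(p)|}{\varepsilon(p) - \varepsilon(0)} > t\Bigr\} \\
&\le \operatorname{mes}\{p \in \mathbb{T}^d \mid \|\psi\|_{C(\mathbb{T}^d)} > C|p|^2 t\} = O(t^{-d/2}) \qquad \text{as } t \to \infty,
\end{aligned}$$

completing the proof.

Corollary 1. If the Hamiltonian $h$ has a virtual level and the corresponding function $\psi$, $\psi \in \operatorname{Ker}(G(\varepsilon(0)) + I)$, is such that $\frac{\psi(\cdot)}{\varepsilon(\cdot) - \varepsilon(0)} \notin L^2(\mathbb{T}^d)$, $d = 3, 4$, then (under Hypothesis 1) the function

$$f(p) = \frac{\psi(p)}{\varepsilon(p) - \varepsilon(0)}, \qquad p \in \mathbb{T}^d, \quad d = 3, 4, \tag{3.7}$$

belongs to $L_w^{r(d)}(\mathbb{T}^d)$, with

$$r(d) = \begin{cases} \tfrac{3}{2}, & d = 3, \\ 2, & d = 4. \end{cases}$$

In particular, the function $f$ given by (3.7) is the eigenfunction of the operator $h$ associated with the eigenvalue $\varepsilon(0)$ in the Banach space $L^1(\mathbb{T}^d)$, that is,

$$hf = \varepsilon(0) f,$$

and hence the following equation

$$\varepsilon(p) f(p) + (2\pi)^{-d/2} \int_{\mathbb{T}^d} v(p - q) f(q)\, dq = \varepsilon(0) f(p), \qquad \text{a.e. } p \in \mathbb{T}^d, \quad d = 3, 4,$$

holds.

Remark 3. A simple computation shows that the Fourier coefficients $\hat f(s)$, $s \in \mathbb{Z}^d$, $d = 3, 4$, of the (integrable) function $f$ solve the infinite system of homogeneous equations

$$\sum_{s \in \mathbb{Z}^d} \hat\varepsilon(s) \hat f(x + s) + (\hat v(x) - \varepsilon(0)) \hat f(x) = 0, \qquad x \in \mathbb{Z}^d,$$

and hence the equation (in the coordinate representation)

$$\hat h \hat f = \varepsilon(0) \hat f$$

has a solution $\hat f$, a threshold resonant state, that does not belong to $\ell^2(\mathbb{Z}^d)$ but vanishes at infinity,

$$\lim_{|s| \to \infty} \hat f(s) = 0$$

(by the Riemann-Lebesgue Theorem).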


Remark 4. If the dispersion relation $\varepsilon(p)$ is known to be an even function, $\varepsilon(p) = \varepsilon(-p)$, or, which is the same, the Fourier coefficients satisfy the condition $\hat\varepsilon(s) = \hat\varepsilon(-s) \in \mathbb{R}$, $s \in \mathbb{Z}^d$, the Birman-Schwinger kernel $G(p, q; \lambda)$ has the additional property that

$$G(p, q; \lambda) = \overline{G(-p, -q; \lambda)}.$$

Hence, if $\psi \in \operatorname{Ker}(G(\lambda) + I)$, $\lambda \le \varepsilon(0)$, so does the function $\varphi(p) = \overline{\psi(-p)}$. Therefore, at least one of the functions $\psi \pm \varphi$ is also an eigenfunction of $G(\lambda)$ associated with the eigenvalue $-1$, and hence, without loss of generality one may assume that the operator $G(\lambda)$ has an eigenfunction $\tilde\psi$ such that $|\tilde\psi(\cdot)|$ is an even function.

To get finer results in dimensions $d = 3, 4$ (cf. [28]) we need an auxiliary scale of the Banach spaces $B(\mu)$, $0 < \mu \le 1$, of Hölder continuous functions on $\mathbb{T}^d$ obtained by the closure of the space of smooth (periodic) functions $f$ on $\mathbb{T}^d$ with respect to the norm

$$\|f\|_\mu = \sup_{t, \ell \in \mathbb{T}^d} \Bigl[\, |f(t)| + |\ell|^{-\mu}\, |f(t + \ell) - f(t)| \,\Bigr].$$

Note that the spaces $B(\mu)$ are naturally embedded one into the other:

$$B(\nu) \subset B(\mu) \subset C(\mathbb{T}^d), \qquad 0 < \mu \le \nu \le 1.$$

If $v \in B(\kappa)$ with $\kappa > \frac{1}{2}$ for $d = 3$, and $\kappa > 0$ for $d = 4$, the following proposition, a variant of the Birman-Schwinger principle, is a convenient tool to decide whether the threshold $\varepsilon(0)$ of the essential spectrum of $h$ is an eigenvalue (resp. a virtual level) for the operator $h$.

Proposition 1 (cf. [28]). Let $d = 3, 4$. Assume Hypothesis 1. Assume, in addition, that $v \in B(\kappa)$ with

$$\kappa > \begin{cases} \tfrac{1}{2}, & \text{if } d = 3, \\ 0, & \text{if } d = 4. \end{cases} \tag{3.8}$$

Then the operator $h - \varepsilon(0) I$ has a non-trivial kernel if and only if $-1$ is an eigenvalue of $G(\varepsilon(0))$ and one of the associated eigenfunctions $\psi$ satisfies the condition $\psi(0) = 0$. In particular, the operator $h$ has a virtual level if and only if $-1$ is an eigenvalue of $G(\varepsilon(0))$ and one of the associated eigenfunctions $\psi$ satisfies the condition $\psi(0) \neq 0$.

Proof. See Appendix A.

Remark 5. As it follows from Proposition 1, the eigensubspace of functions $\psi$ associated with the eigenvalue $\lambda = -1$ of $G(\varepsilon(0))$ with the additional constraint $\psi(0) \neq 0$ is one-dimensional. This proves that Case V does not occur if (3.8) holds. In Case IV we have the coexistence of a (simple) virtual level and a (possibly multiple) threshold eigenvalue (see Appendix B for a concrete example of such a coexistence in Case IV in dimension three).


Remark 6. It is known that for the continuous Schrödinger operators $h = -\Delta + V(x)$ in dimension $d = 3$ with $V \in L^1(\mathbb{R}^3) \cap R$, $R$ the Rollnik class, Case V does not occur (see [1], Lemma 1.2.3). It is also worth mentioning that if, in addition, the Schrödinger operator $h$ is non-negative, then under the $L^{3/2}$-weak assumption on the potential $V$ Cases III and IV do not occur (there is no zero-energy eigenstate) (see, e.g., [10, 22, 24]).

4. The Two-Particle Hamiltonian. Reduction to the One-Particle Case

4.1. The coordinate representation. Throughout this section we assume that $d \ge 1$. The free Hamiltonian $\hat H_0$ of the system of two quantum particles $\alpha = 1, 2$, with the dispersion relations $\varepsilon_\alpha(p)$, $\alpha = 1, 2$, respectively, is introduced (as a bounded self-adjoint operator on the Hilbert space $\ell^2((\mathbb{Z}^d)^2) \cong \ell^2(\mathbb{Z}^d) \otimes \ell^2(\mathbb{Z}^d)$) by

$$\hat H_0 = \hat h_{01} \otimes I + I \otimes \hat h_{02}, \tag{4.1}$$

with

$$\hat h_{0\alpha} = \varepsilon_\alpha(-i\nabla), \qquad \alpha = 1, 2,$$

and $I$ being the identity operator on $\ell^2(\mathbb{Z}^d)$.

The total Hamiltonian $\hat H$ (in the coordinate representation) of the two-particle system with the real-valued pair interaction $\hat V$ is a self-adjoint bounded operator on the Hilbert space $\ell^2((\mathbb{Z}^d)^2)$ of the form

$$\hat H = \hat H_0 + \hat V,$$

where

$$(\hat V \hat\psi)(x_1, x_2) = \hat v(x_1 - x_2)\, \hat\psi(x_1, x_2), \qquad \hat\psi \in \ell^2((\mathbb{Z}^d)^2),$$

with $\{\hat v(s)\}_{s \in \mathbb{Z}^d}$ the Fourier coefficients of a continuous function $v(p)$ satisfying Hypothesis 1.

4.2. The momentum representation. The transition to the momentum representation is performed by the standard Fourier transform

$$\mathcal{F}_2 : L^2((\mathbb{T}^d)^2) \to \ell^2((\mathbb{Z}^d)^2),$$

where $(\mathbb{T}^d)^m$ denotes the Cartesian $m$th power of the $d$-dimensional cube $\mathbb{T}^d = (-\pi, \pi]^d$:

$$(\mathbb{T}^d)^m = \underbrace{\mathbb{T}^d \times \mathbb{T}^d \times \cdots \times \mathbb{T}^d}_{m \text{ times}}, \qquad m \in \mathbb{N}. \tag{4.2}$$

The two-particle Hamiltonian $H$ in the momentum representation is then given by

$$H = H_0 + V,$$

where

$$(H_0 f)(k_1, k_2) = (\varepsilon_1(k_1) + \varepsilon_2(k_2))\, f(k_1, k_2), \qquad f \in L^2((\mathbb{T}^d)^2),$$


and $V$ is the operator of partial integration given by

$$(Vf)(k_1, k_2) = (2\pi)^{-d/2} \int_{(\mathbb{T}^d)^2} v(k_1 - k_1')\, \delta(k_1 + k_2 - k_1' - k_2')\, f(k_1', k_2')\, dk_1'\, dk_2', \qquad f \in L^2((\mathbb{T}^d)^2).$$

Here the kernel function is given by the Fourier series

$$v(p) = (2\pi)^{-d/2} \sum_{s \in \mathbb{Z}^d} \hat v(s)\, e^{i(p,s)}, \qquad p \in \mathbb{T}^d,$$

and $\delta(p)$ denotes the Dirac delta-function.

4.3. Direct integral decompositions. The quasi-momentum. Denote by $\hat U_s^2$, $s \in \mathbb{Z}^d$, the unitary representation of the abelian group $\mathbb{Z}^d$ by the shift operators on the Hilbert space $\ell^2((\mathbb{Z}^d)^2)$:

$$(\hat U_s^2 \hat\psi)(n_1, n_2) = \hat\psi(n_1 + s, n_2 + s), \qquad \hat\psi \in \ell^2((\mathbb{Z}^d)^2), \quad n_1, n_2, s \in \mathbb{Z}^d.$$

Via the Fourier transform $\mathcal{F}_2$ the unitary representation

$$\hat U_{s+t}^2 = \hat U_s^2 \hat U_t^2, \qquad s, t \in \mathbb{Z}^d,$$

induces the representation of the group $\mathbb{Z}^d$ in the Hilbert space $L^2((\mathbb{T}^d)^2)$ by unitary (multiplication) operators $U_s^2 = \mathcal{F}_2^{-1} \hat U_s^2 \mathcal{F}_2$, $s \in \mathbb{Z}^d$,

$$(U_s^2 f)(k_1, k_2) = \exp\bigl(-i(s, k_1 + k_2)\bigr)\, f(k_1, k_2), \qquad k_1, k_2 \in \mathbb{T}^d, \quad f \in L^2((\mathbb{T}^d)^2). \tag{4.3}$$

Given $k \in \mathbb{T}^d$, we define $\mathbb{F}_k$ as follows:

$$\mathbb{F}_k = \{(k_1, k - k_1) \in (\mathbb{T}^d)^2 : k_1 \in \mathbb{T}^d,\ k - k_1 \in \mathbb{T}^d\}.$$

Introducing the mapping

$$\pi : (\mathbb{T}^d)^2 \to \mathbb{T}^d, \qquad \pi((k_1, k_2)) = k_1,$$

we denote by $\pi_k$, $k \in \mathbb{T}^d$, the restriction of $\pi$ to $\mathbb{F}_k \subset (\mathbb{T}^d)^2$, that is,

$$\pi_k = \pi|_{\mathbb{F}_k}. \tag{4.4}$$

We remark that $\mathbb{F}_k$, $k \in \mathbb{T}^d$, is a $d$-dimensional manifold homeomorphic to $\mathbb{T}^d$. The following lemma is evident.

Lemma 4. The mapping $\pi_k$, $k \in \mathbb{T}^d$, from $\mathbb{F}_k \subset (\mathbb{T}^d)^2$ onto $\mathbb{T}^d$ is bijective, with the inverse mapping given by

$$(\pi_k)^{-1}(q) = (q, k - q).$$


Decomposing the Hilbert space $L^2((\mathbb{T}^d)^2)$ into the direct integral

$$L^2((\mathbb{T}^d)^2) = \int_{k \in \mathbb{T}^d} \oplus L^2(\mathbb{F}_k)\, dk$$

yields the corresponding decomposition of the unitary representation $U_s^2$, $s \in \mathbb{Z}^d$, into the direct integral

$$U_s^2 = \int_{k \in \mathbb{T}^d} \oplus U_s(k)\, dk,$$

with

$$U_s(k) = e^{-i(s,k)} I_{L^2(\mathbb{F}_k)}$$

and $I_{L^2(\mathbb{F}_k)}$ being the identity operator on the Hilbert space $L^2(\mathbb{F}_k)$.

The Hamiltonian $\hat H$ (in the coordinate representation) obviously commutes with the group of translations $\hat U_s^2$, $s \in \mathbb{Z}^d$, that is,

$$\hat U_s^2 \hat H = \hat H \hat U_s^2, \qquad s \in \mathbb{Z}^d.$$

So does the Hamiltonian $H$ (in the momentum representation) with respect to the group $U_s^2$, $s \in \mathbb{Z}^d$, given by (4.3). Hence, the operator $H$ can be decomposed into the direct integral

$$H = \int_{k \in \mathbb{T}^d} \oplus \tilde h(k)\, dk \tag{4.5}$$

associated with the decomposition

$$L^2((\mathbb{T}^d)^2) = \int_{k \in \mathbb{T}^d} \oplus L^2(\mathbb{F}_k)\, dk.$$

In the physical literature the parameter $k$, $k \in \mathbb{T}^d$, is called the two-particle quasi-momentum and the corresponding operators $\tilde h(k)$, $k \in \mathbb{T}^d$, are called the fiber operators.

4.4. The two-particle dispersion relations. The fiber operators $\tilde h(k)$, $k \in \mathbb{T}^d$, from the decomposition (4.5) are unitarily equivalent to the operators $h(k)$, $k \in \mathbb{T}^d$, of the form

$$h(k) = h_0(k) + v,$$

where

$$(h_0(k) f)(p) = E_k(p) f(p), \qquad (vf)(p) = (2\pi)^{-d/2} \int_{\mathbb{T}^d} v(p - q) f(q)\, dq, \qquad f \in L^2(\mathbb{T}^d),$$

and the two-particle dispersion relations

$$E_k(p) = \varepsilon_1(p) + \varepsilon_2(k - p), \qquad p \in \mathbb{T}^d,$$

parametrically depend on the quasi-momentum $k$, $k \in \mathbb{T}^d$. The equivalence is given by the unitary operator $u_k : L^2(\mathbb{F}_k) \to L^2(\mathbb{T}^d)$, $k \in \mathbb{T}^d$,

$$u_k g = g \circ (\pi_k)^{-1},$$

with $\pi_k$ defined by (4.4).


5. Spectral Properties of the Fiber Operators h(k)

As we have learned from the previous section, the two-particle Hamiltonian $H$ (up to unitary equivalence) can be decomposed into the direct integral

\[ H \simeq \int_{k\in\mathbb{T}^d}\oplus\, h(k)\,dk, \]

where the fiber operators $h(k) = h_0(k) + v$ can be considered as the one-particle Hamiltonians with the two-particle dispersion relations

\[ E_k(p) = \varepsilon_1(p) + \varepsilon_2(k-p), \quad p\in\mathbb{T}^d, \tag{5.1} \]

with $\varepsilon_\alpha(p)$ the dispersion relations for the particles $\alpha = 1,2$.

Under Hypothesis 1 the perturbation $v$ of the operator $h_0(k)$, $k\in\mathbb{T}^d$, is a Hilbert-Schmidt operator and, therefore, in accordance with the Weyl Theorem the essential spectrum of the operator $h(k)$ fills in the following interval on the real axis:

\[ \sigma_{\mathrm{ess}}(h(k)) = [E_{\min}(k), E_{\max}(k)], \]

where

\[ E_{\min}(k) = \min_{q\in\mathbb{T}^d} E_k(q), \qquad E_{\max}(k) = \max_{q\in\mathbb{T}^d} E_k(q). \]

If the dispersion relations in the one-particle sector are conditionally negative definite, then so is the two-particle dispersion relation $E_0(p)$ corresponding to the zero value of the quasi-momentum $k$. Hence, under these assumptions, the Hamiltonian $h(0)$ in the coordinate representation generates the positivity preserving semi-group $e^{-th(0)}$, $t>0$ (which is not necessarily true for the fiber Hamiltonians $h(k)$ with $k\neq 0$: the function $E_k(p)$ may not be even, and hence, not conditionally negative definite).

Although the two-particle dispersion relations are not necessarily conditionally negative definite for nontrivial values of the quasi-momentum, they still satisfy a useful inequality (Lemma 5 below), analogous to that in Lemma 1 for the one-particle dispersion relations.

Hypothesis 2. Assume Hypothesis 1 for both $\varepsilon_1(p)$ and $\varepsilon_2(p)$. Suppose that the dispersion relations $\varepsilon_\alpha(p)$, $\alpha = 1,2$, in the one-particle sectors are conditionally negative definite.

We remark that the two-particle dispersion relation $E_0(p)$ satisfies Hypothesis 1 if $\varepsilon_1(p)$ and $\varepsilon_2(p)$ do.

Lemma 5. Assume Hypothesis 2. Then for any (fixed) $k,q\in\mathbb{T}^d$ such that either $k\neq q$ or $q\neq 0$,

\[ E_0(p) - E_0(0) + E_k(q) - \frac{E_k(p+q) + E_k(q-p)}{2} > 0, \quad \text{a.e. } p\in\mathbb{T}^d. \]

In particular, if $k\neq 0$, and $p(k)$ is a (any) point where the function $E_k(\cdot)$ attains its minimal value, that is, $E_{\min}(k) = E_k(p(k))$, the following inequality

\[ E_0(p) - E_{\min}(0) + E_{\min}(k) - \frac{E_k(p+p(k)) + E_k(p(k)-p)}{2} > 0, \quad \text{a.e. } p\in\mathbb{T}^d, \]

holds.

Proof. Since $|q|^2 + |k-q|^2 \neq 0$ the claim is an immediate consequence of Lemma 1 and definition (5.1) of the two-particle dispersion relations:

\[ \begin{aligned} E_0(p) + E_k(q) - \frac{E_k(p+q) + E_k(q-p)}{2} - E_0(0) &= \varepsilon_1(p) + \varepsilon_1(q) - \frac{\varepsilon_1(p+q) + \varepsilon_1(q-p)}{2} - \varepsilon_1(0) \\ &\quad + \varepsilon_2(p) + \varepsilon_2(k-q) - \frac{\varepsilon_2(k-q-p) + \varepsilon_2(k-q+p)}{2} - \varepsilon_2(0) > 0, \quad \text{a.e. } p\in\mathbb{T}^d. \end{aligned} \]
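The inequality of Lemma 5 can be probed numerically in a special case. The sketch below assumes, purely for illustration, that both particles have the nearest-neighbour dispersion relation $\varepsilon(p) = \sum_i (1-\cos p_i)$ in $d = 3$; the function names are hypothetical. For this choice the identity $\cos(p+q)+\cos(q-p) = 2\cos p\cos q$ turns the left-hand side into $\sum_i (1-\cos p_i)\bigl[(1-\cos q_i) + (1-\cos(k_i-q_i))\bigr]$, which is what `closed_form` computes.

```python
import numpy as np

def eps(p):
    """Assumed one-particle dispersion: sum_i (1 - cos p_i) over the last axis."""
    return np.sum(1.0 - np.cos(p), axis=-1)

def E(k, p):
    """Two-particle dispersion E_k(p) = eps_1(p) + eps_2(k - p) with eps_1 = eps_2 = eps."""
    return eps(p) + eps(k - p)

def lemma5_lhs(k, q, p):
    """Left-hand side of the inequality in Lemma 5."""
    zero = np.zeros_like(k)
    return (E(zero, p) - E(zero, zero) + E(k, q)
            - 0.5 * (E(k, p + q) + E(k, q - p)))

def closed_form(k, q, p):
    """Product form of the same expression for the assumed cosine dispersion."""
    w = 1.0 - np.cos(p)
    return np.sum(w * ((1.0 - np.cos(q)) + (1.0 - np.cos(k - q))), axis=-1)

rng = np.random.default_rng(0)
d = 3
k = rng.uniform(-np.pi, np.pi, size=d)
q = rng.uniform(-np.pi, np.pi, size=d)
p = rng.uniform(-np.pi, np.pi, size=(10_000, d))

lhs = lemma5_lhs(k, q, p)
# Excluded case k = q = 0: the expression vanishes identically.
degenerate = lemma5_lhs(np.zeros(d), np.zeros(d), p)
print("min over sampled p:", lhs.min())
print("max |lhs| in the excluded case:", np.abs(degenerate).max())
```

The excluded case $k = q = 0$ is included to show why the lemma requires $k\neq q$ or $q\neq 0$.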

From now on we assume that $d\geq 3$. Our first non-perturbative result shows that under Hypothesis 2 the discrete spectrum of the fiber operators $h(k)$ under the variation of the quasi-momentum cannot be absorbed by the threshold.

Theorem 1. Let $d\geq 3$. Assume Hypothesis 2. Assume, in addition, that the dispersion relations $\varepsilon_j(p)$, $j = 1,2$, are twice differentiable functions. Denote by $m(k)$, $k\in\mathbb{T}^d$, the lower bound of the operator $h(k)$,

\[ m(k) = \inf \mathrm{Spec}\,(h(k)), \quad k\in\mathbb{T}^d. \]

Assume, in addition, that the lower edge $m(0) = E_{\min}(0)$ of the spectrum of the operator $h(0)$ is an eigenvalue. Then

\[ E_{\min}(0) - m(0) < E_{\min}(k) - m(k), \quad k\in\mathbb{T}^d,\ k\neq 0, \quad d\geq 3. \tag{5.2} \]

Proof. Let $0\neq f\in \mathrm{Ker}\,(h(0) - m(0)I)$ and hence

\[ E_0(p)f(p) + (2\pi)^{-d/2}\int_{\mathbb{T}^d} v(p-q)f(q)\,dq = m(0)f(p), \quad \text{a.e. } p\in\mathbb{T}^d. \]

By hypothesis the one-particle dispersion relations are conditionally negative definite functions. Then, as can easily be seen from the definition of the two-body dispersion relation, the function $E_0(p)$ corresponding to the zero value of the quasi-momentum $k$ is also conditionally negative definite. In particular, $E_0(p)$ is an even function and, hence, by Remark 4, without loss of generality one may assume that the function $|f(\cdot)|$ is even. For $k\in\mathbb{T}^d$ we introduce the trial $L^2(\mathbb{T}^d)$-function

\[ f_k(p) = f(p - p(k)), \]


where $p(k)$ denotes the minimum point of the function $E_k(p)$, that is, $E_k(p(k)) = E_{\min}(k)$ (if the minimum value of $E_k(p)$ is attained at several points, choose $p(k)$ as any one of them). To prove (5.2) it is sufficient to establish the inequality

\[ \Lambda(k) = \bigl([h(k) - (E_{\min}(k) - E_{\min}(0) + m(0))]f_k, f_k\bigr) < 0, \quad k\neq 0. \tag{5.3} \]

One gets

\[ \begin{aligned} ([h(k) - (E_{\min}(k) - E_{\min}(0) + m(0))]f_k)(p) &= [E_k(p) - (E_{\min}(k) - E_{\min}(0) + m(0))]f(p-p(k)) + (2\pi)^{-d/2}\int_{\mathbb{T}^d} v(p-q)f(q-p(k))\,dq \\ &= [E_k(p) - (E_{\min}(k) - E_{\min}(0) + m(0))]f(p-p(k)) + (2\pi)^{-d/2}\int_{\mathbb{T}^d} v(p-p(k)-q)f(q)\,dq \\ &= [E_k(p) - E_{\min}(k) - E_0(p-p(k)) + E_{\min}(0)]f(p-p(k)). \end{aligned} \tag{5.4} \]

Using (5.4) one arrives at the representation

\[ \Lambda(k) = -\int_{\mathbb{T}^d} \bigl[E_0(p-p(k)) - E_{\min}(0) - E_k(p) + E_{\min}(k)\bigr]\,|f(p-p(k))|^2\,dp, \quad k\in\mathbb{T}^d. \tag{5.5} \]

To check the basic inequality (5.3) we proceed as follows. Making the change of variable $p\to -p + 2p(k)$ in (5.5) and using the fact that the functions $E_0(p)$ and $|f(p)|$ are even, one obtains the representation

\[ \Lambda(k) = -\int_{\mathbb{T}^d} \bigl[E_0(p-p(k)) - E_{\min}(0) - E_k(-p+2p(k)) + E_{\min}(k)\bigr]\,|f(p-p(k))|^2\,dp. \tag{5.6} \]

Making again the change of variable $p\to p - p(k)$ in (5.5) and (5.6) and adding the results obtained we get

\[ \Lambda(k) = -\int_{\mathbb{T}^d} F(k,p)\,|f(p)|^2\,dp, \]

where

\[ F(k,p) = E_0(p) - E_{\min}(0) + E_{\min}(k) - \frac{E_k(p+p(k)) + E_k(p(k)-p)}{2}. \]

By Lemma 5, for any (fixed) $k\neq 0$, one concludes that $F(k,p) > 0$ for almost every $p\in\mathbb{T}^d$, proving the basic inequality (5.3), and the claim follows.

Our second non-perturbative result, the main result of the paper, provides sufficient conditions for the discrete spectrum of the whole family of fiber Hamiltonians $h(k)$ with $k\neq 0$ to be non-empty.


Theorem 2. Let $d\geq 3$. Assume Hypothesis 2. Assume, in addition, that the dispersion relations $\varepsilon_j(p)$, $j = 1,2$, are twice differentiable functions. Suppose that the operator $h(0)$ has either a threshold eigenvalue or a virtual level. Then, for all $k\in\mathbb{T}^d\setminus\{0\}$ the discrete spectrum of the fiber Hamiltonian $h(k)$ below the bottom $E_{\min}(k)$ of its essential spectrum is a non-empty set.

Proof. The case where either $h(0)$ has discrete spectrum below the threshold or the bottom of its essential spectrum $m(0)$,

\[ m(0) = E_{\min}(0) = E_0(0), \tag{5.7} \]

is a (threshold) eigenvalue has already been treated in Theorem 1.

Assume that $d = 3$ or $d = 4$ and suppose that $h(0)$ has a virtual level at the bottom of its essential spectrum. Therefore, the equation

\[ G(E_0(0))\psi = -\psi, \quad \psi\in C(\mathbb{T}^d), \]

has a nontrivial solution $\psi\in C(\mathbb{T}^d)$. As in the proof of Theorem 1, without loss of generality one may assume that the function $|\psi(p)|$ is even. In particular, the equation

\[ E_0(p)f(p) + (2\pi)^{-d/2}\int_{\mathbb{T}^d} v(p-q)f(q)\,dq = m(0)f(p), \quad \text{a.e. } p\in\mathbb{T}^d, \tag{5.8} \]

has the $L^1(\mathbb{T}^d)$-solution (cf. Corollary 1)

\[ f(p) = \frac{\psi(p)}{E_0(p) - E_0(0)} \]

such that the function $|f(\cdot)|$ is even.

Given $k\in\mathbb{T}^d$, introduce the sequence $\{f_{n,k}\}_{n=1}^{\infty}$ of $L^2(\mathbb{T}^d)$-functions

\[ f_{n,k}(p) = \frac{\psi(p-p(k))}{E_0(p-p(k)) - E_0(0) + \frac{1}{n}}. \]

By the dominated convergence theorem the sequence $f_{n,k}$ converges in the space $L^1(\mathbb{T}^d)$ as $n\to\infty$ to the function $f_k$,

\[ f_k(p) = f(p-p(k)), \quad p\in\mathbb{T}^d, \]

with $f(\cdot)$ an integrable majorant. Under Hypothesis 2 this means that the sequence of functions $[h(k) - E_{\min}(k)]f_{n,k}$ converges in $L^\infty(\mathbb{T}^d)$-norm to the bounded function

\[ [E_k(p) - E_{\min}(k)]f_k(p) + (2\pi)^{-d/2}\int_{\mathbb{T}^d} v(p-q)f_k(q)\,dq = [E_k(p) - E_{\min}(k) - E_0(p-p(k)) + E_0(0)]\,f(p-p(k)), \]

where we used (5.7), (5.8) and the representations

\[ ([h(k) - E_{\min}(k)]f_{n,k})(p) = [E_k(p) - E_{\min}(k)]f_{n,k}(p) + (2\pi)^{-d/2}\int_{\mathbb{T}^d} v(p-q)f_{n,k}(q)\,dq. \]


In particular, one concludes that the limit

\[ \Lambda(k) = \lim_{n\to\infty} \bigl([h(k) - E_{\min}(k)]f_{n,k}, f_{n,k}\bigr), \quad k\in\mathbb{T}^d, \]

exists and is finite and, moreover,

\[ \Lambda(k) = -\int_{\mathbb{T}^d} \frac{E_0(p-p(k)) - E_0(0) - E_k(p) + E_{\min}(k)}{(E_0(p-p(k)) - E_0(0))^2}\,|\psi(p-p(k))|^2\,dp, \quad k\in\mathbb{T}^d. \]

Next, exactly as in the proof of Theorem 1, one checks the inequality

\[ \Lambda(k) < 0, \quad k\neq 0. \tag{5.9} \]

It follows from (5.9) that there exists an $n_0\in\mathbb{N}$ such that

\[ \bigl([h(k) - E_{\min}(k)]f_{n_0,k}, f_{n_0,k}\bigr) < 0, \quad k\neq 0, \]

proving the existence of the discrete spectrum of $h(k)$ below its essential spectrum for $k\neq 0$. The proof is complete.

Remark 7. Let $d = 3$. The width $w(k)$ of the essential spectrum band of the Hamiltonians $h(k)$,

\[ w(k) = E_{\max}(k) - E_{\min}(k), \]

may vanish for some values of the quasi-momentum $k\in\mathbb{T}^3$. Therefore, the fiber Hamiltonians $h(k)$ may have an infinite discrete spectrum for some values of the quasi-momentum $k$ even if the discrete spectrum of $h(0)$ is empty. For instance, consider two (identical) particles on the lattice $\mathbb{Z}^3$ with the one-particle dispersion relations of the form

\[ \varepsilon_1(p) = \varepsilon_2(p) = \sum_{i=1}^{3} (1 - \cos p_i), \quad p\in\mathbb{T}^3. \tag{5.10} \]
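For the dispersion relation (5.10) the band edges $E_{\min}(k)$ and $E_{\max}(k)$ are easy to estimate by sampling $E_k(p) = \varepsilon_1(p) + \varepsilon_2(k-p)$ on a grid. The following sketch is only an illustration (the function names and the grid size are assumptions); it compares the band at $k = 0$ with the band at $k = (\pi,\pi,\pi)$, where $\cos(\pi - p_i) = -\cos p_i$ makes $E_k(p)$ independent of $p$.

```python
import numpy as np

def eps(p):
    """Dispersion relation (5.10): sum_i (1 - cos p_i), coordinates on the last axis."""
    return np.sum(1.0 - np.cos(p), axis=-1)

def band_edges(k, n=64):
    """Approximate (E_min(k), E_max(k)) by sampling E_k(p) on an n^3 grid of T^3."""
    axis = np.linspace(-np.pi, np.pi, n, endpoint=False)
    p = np.stack(np.meshgrid(axis, axis, axis, indexing="ij"), axis=-1)
    values = eps(p) + eps(np.asarray(k) - p)
    return values.min(), values.max()

e_min0, e_max0 = band_edges([0.0, 0.0, 0.0])
e_min_pi, e_max_pi = band_edges([np.pi, np.pi, np.pi])
print("k = 0:          band edges", e_min0, e_max0)
print("k = (pi,pi,pi): band edges", e_min_pi, e_max_pi)
```

With an even `n` the grid contains both $p = 0$ and $p = (-\pi,-\pi,-\pi)$, so the extrema at $k = 0$ are sampled exactly up to rounding.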

Then if $k_0 = (\pi,\pi,\pi)\in\mathbb{T}^3$ we have the “strong degeneration” of the two-particle dispersion relation: $E_{k_0}(p) = \varepsilon_1(p) + \varepsilon_2(k_0-p) = 6$ holds for all $p = (p_1,p_2,p_3)\in\mathbb{T}^3$, which means that the essential spectrum of $h(k_0)$ is a one-point set, namely, $\mathrm{Spec}_{\mathrm{ess}}(h(k_0)) = \{6\}$. Therefore, in this case, the fiber Hamiltonian $\hat h(k_0) = E_{k_0}(-i\nabla) + \hat v$ has infinite discrete spectrum below the bottom of its essential spectrum, provided that

(i) $\hat v = \{\hat v(s)\}_{s\in\mathbb{Z}^3}\in\ell^1(\mathbb{Z}^3)$,
(ii) $\#\{s\in\mathbb{Z}^3 \mid \hat v(s) < 0\} = \infty$,

say. If, in addition,

(iii) the $C(\mathbb{T}^3)$-norm of the (continuous) function
\[ v(p) = (2\pi)^{-3/2}\sum_{s\in\mathbb{Z}^3} \hat v(s)e^{i(p,s)} \]
is small enough,

then the discrete spectrum of the fiber Hamiltonian $\hat h(0) = E_0(-i\nabla) + \hat v$ corresponding to the zero value of the quasi-momentum is empty (the Birman-Schwinger integral operator $G(\lambda)$ on $C(\mathbb{T}^3)$ with kernel function given by (cf. (3.2))

\[ G(p,q;\lambda) = (2\pi)^{-3/2}\, v(p-q)(E_0(q)-\lambda)^{-1}, \quad p,q\in\mathbb{T}^3, \quad \lambda\notin[E_{\min}(0),E_{\max}(0)], \]

is a contraction whenever $\|v\|_{C(\mathbb{T}^3)}$ is small enough).

It is also worth mentioning that even a partial degeneracy of the two-particle dispersion relation $E_k(\cdot)$ for some values of the quasi-momentum $k\neq 0$ may generate a “rich” infinite discrete spectrum of the Hamiltonian $h(k)$ outside the band $[E_{\min}(k),E_{\max}(k)]$.

Remark 8. The study of the discrete spectrum of the fiber Hamiltonians above the edge of the essential spectrum follows the guidelines described above with obvious modifications, provided that the dispersion relations $\varepsilon_\alpha(p)$, $\alpha = 1,2$, in the one-particle sector are conditionally positive definite functions.

A. Proof of Proposition 1

Assume without loss of generality that $\varepsilon(0) = 0$.

“Only If Part.” Let $f\in L^2(\mathbb{T}^d)$, $d = 3,4$, be an eigenfunction of the operator $h(0)$ associated with a zero eigenvalue, that is,

\[ -\varepsilon(p)f(p) = (2\pi)^{-d/2}\int_{\mathbb{T}^d} v(p-q)f(q)\,dq, \quad \text{a.e. } p\in\mathbb{T}^d. \tag{A.1} \]

The same argument as in the proof of Lemma 2 shows that the equivalence class associated with the function $f$ has a representative $\tilde f$ such that the function

\[ \psi(p) = \varepsilon(p)\tilde f(p) \tag{A.2} \]

is Hölder continuous, $\psi\in B(\kappa)$. Hence the representative $\tilde f$ is continuous away from the origin and, since from Hypothesis 1 it follows that $\liminf_{p\to 0}\varepsilon(p)|p|^{-2} > 0$, the following asymptotic representation

\[ \tilde f(p) = \frac{\psi(0)}{\varepsilon(p)} + O(|p|^{-2+\kappa}), \quad p\to 0, \]

holds. Since $\tilde f\in L^2(\mathbb{T}^d)$ and (3.8) holds, the Hölder continuous function $\psi$ must vanish at the origin, that is, $\psi(0) = 0$. Comparing (A.1) and (A.2) one concludes that $-1$ is an eigenvalue of the operator $G(\varepsilon(0))$ on $C(\mathbb{T}^d)$, $d = 3,4$, associated with the eigenfunction $\psi$ with $\psi(0) = 0$.

“If Part.” Assume that the operator $G(\varepsilon(0))$ has an eigenfunction $\psi$ associated with the eigenvalue $\lambda = -1$,

\[ G(\varepsilon(0))\psi = -\psi, \tag{A.3} \]


such that $\psi(0) = 0$. Following the strategy of the proof of Lemma 2 one gets that $\psi\in B(\kappa)$. Introduce the function

\[ f(p) = \frac{\psi(p)}{\varepsilon(p)}, \quad p\neq 0. \]

Clearly, an argument as above shows that the following asymptotic representation

\[ f(p) = O(|p|^{-2+\kappa}), \quad p\to 0, \]

holds. Since (3.8) holds, one proves that $f\in L^2(\mathbb{T}^d)$ and then (A.3) means that the operator $h(0)$ has a nontrivial kernel, completing the proof.

B. Coexistence of a Threshold Eigenvalue and a Virtual Level

The main goal of this Appendix is to show by an explicit example that Case IV is not empty.

Example 2. Let $\hat h_{\lambda,\mu}$, $\lambda,\mu\in\mathbb{R}$, be the discrete Schrödinger operator of the form

\[ \hat h_{\lambda,\mu} = -\Delta + \hat v_{\lambda,\mu}, \]

where $\Delta$ is the discrete Laplacian from Example 1 and

\[ \hat v_{\lambda,\mu}(s) = \begin{cases} \mu, & s = 0, \\ \frac{\lambda}{2}, & |s| = 1, \\ 0, & \text{otherwise}. \end{cases} \]

The Fourier transform of the interaction can be explicitly computed,

\[ v(p) = \frac{1}{(2\pi)^{3/2}}\Bigl(\mu + \lambda\sum_{i=1}^{3}\cos p_i\Bigr), \]

and, hence, for the Birman-Schwinger kernel one gets the representation

\[ G(p,q;0) = \frac{1}{(2\pi)^3}\,\frac{\mu + \lambda\sum_{i=1}^{3}\cos(p_i-q_i)}{\varepsilon(q)}, \quad p,q\in\mathbb{T}^3, \tag{B.1} \]

where $\varepsilon(q)$ is given by (2.4) and we have used the equality $\varepsilon(0) = 0$. Introduce the notations

\[ a = \frac{1}{(2\pi)^3}\int_{\mathbb{T}^3}\frac{dq}{\varepsilon(q)}, \qquad c = \frac{1}{(2\pi)^3}\int_{\mathbb{T}^3}\frac{\cos q_i\,dq}{\varepsilon(q)}, \quad i = 1,2,3, \]

\[ s = \frac{1}{(2\pi)^3}\int_{\mathbb{T}^3}\frac{\sin^2 q_i\,dq}{\varepsilon(q)}, \qquad b = \frac{1}{(2\pi)^3}\int_{\mathbb{T}^3}\frac{\cos^2 q_i\,dq}{\varepsilon(q)}, \quad i = 1,2,3, \]

\[ d = \frac{1}{(2\pi)^3}\int_{\mathbb{T}^3}\frac{\cos q_i\cos q_j\,dq}{\varepsilon(q)}, \quad i,j = 1,2,3,\ i\neq j. \]

We remark that since the function $\varepsilon(q) = \varepsilon(q_1,q_2,q_3)$ is invariant with respect to the permutations of its arguments $q_1$, $q_2$ and $q_3$, the integrals $c$, $s$, $b$, $d$ above do not


depend on the particular choice of the indices $i$, $j$. A simple computation shows that the following relations:

\[ a - c = \frac{1}{6}, \tag{B.2} \]

\[ b + 2d = 3c, \qquad a = b + s, \tag{B.3} \]

and

\[ s = \frac{1}{6} - \frac{2}{3}(b-d) \tag{B.4} \]

hold.

Lemma 6. $a > \frac{11}{51}$. In particular, $c > 0$.

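Proof. We start with the representation

\[ \int_{-\pi}^{\pi}\frac{dq}{A-\cos q} = \int_{-\pi/4}^{\pi/4}\frac{dq}{A-\cos q} + \int_{-\pi/4}^{\pi/4}\frac{dq}{A+\cos q} + \int_{-\pi/4}^{\pi/4}\frac{dq}{A-\sin q} + \int_{-\pi/4}^{\pi/4}\frac{dq}{A+\sin q}, \quad |A| > 1, \tag{B.5} \]

which yields

\[ a = \frac{1}{(2\pi)^3}\int_{\mathbb{T}^3}\frac{dq}{\varepsilon(q)} = \frac{1}{2(2\pi)^3}\int_{\mathbb{T}^3}\frac{dq}{3-\cos q_1-\cos q_2-\cos q_3} = \frac{1}{2(2\pi)^3}\int_{\mathbb{T}^2} dq_1\,dq_2\int_{-\pi/4}^{\pi/4} dq_3\, f(q_1,q_2,q_3), \tag{B.6} \]

where

\[ \begin{aligned} f(q_1,q_2,q_3) &= \frac{1}{3-\cos q_1-\cos q_2-\cos q_3} + \frac{1}{3-\cos q_1-\cos q_2+\cos q_3} \\ &\quad + \frac{1}{3-\cos q_1-\cos q_2+\sin q_3} + \frac{1}{3-\cos q_1-\cos q_2-\sin q_3}. \end{aligned} \]

Note that the function $f$ is well defined on $(-\pi,\pi]^3\setminus\{0\}$. One easily checks that for fixed $q_1$, $q_2$, the function $f(q_1,q_2,q_3)$ as a function of the argument $q_3$, $q_3\in[-\frac{\pi}{4},\frac{\pi}{4}]$, attains its minimal value at the end points of the interval $[-\frac{\pi}{4},\frac{\pi}{4}]$ and hence

\[ f(q_1,q_2,q_3) > \frac{2}{3-\cos q_1-\cos q_2-\frac{\sqrt 2}{2}} + \frac{2}{3-\cos q_1-\cos q_2+\frac{\sqrt 2}{2}}, \quad q_3\in\Bigl(-\frac{\pi}{4},\frac{\pi}{4}\Bigr). \tag{B.7} \]

Combining (B.6) and (B.7) proves the inequality

\[ a > \frac{1}{(2\pi)^3}\,\frac{\pi}{2}\int_{\mathbb{T}^2} dq_1\,dq_2\left(\frac{1}{3-\frac{\sqrt 2}{2}-\cos q_1-\cos q_2} + \frac{1}{3+\frac{\sqrt 2}{2}-\cos q_1-\cos q_2}\right). \]

Applying the trick (B.5) two more times (first getting rid of the variable $q_2$ and then of $q_1$) one arrives at the estimate

\[ \begin{aligned} a &> \frac{1}{(2\pi)^3}\Bigl(\frac{\pi}{2}\Bigr)^2\int_{\mathbb{T}} dq_1\left(\frac{2}{3-2\frac{\sqrt 2}{2}-\cos q_1} + \frac{4}{3-\cos q_1} + \frac{2}{3+2\frac{\sqrt 2}{2}-\cos q_1}\right) \\ &> \frac{1}{(2\pi)^3}\Bigl(\frac{\pi}{2}\Bigr)^3\left(\frac{4}{3-3\frac{\sqrt 2}{2}} + \frac{12}{3-\frac{\sqrt 2}{2}} + \frac{12}{3+\frac{\sqrt 2}{2}} + \frac{4}{3+3\frac{\sqrt 2}{2}}\right) = \frac{11}{51}, \end{aligned} \]

completing the proof.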
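The relations (B.2)–(B.4) and the bound of Lemma 6 lend themselves to a quick numerical sanity check. The sketch below assumes $\varepsilon(q) = 2\sum_{i=1}^{3}(1-\cos q_i)$, which is the form implied by (B.6); equation (2.4) itself is not reproduced in this section. The function name and grid size are illustrative choices. The midpoint grid avoids the integrable singularity of $1/\varepsilon$ at $q = 0$, so the individual values are only rough approximations, while the linear relations hold on the grid up to rounding because they are pointwise identities combined with the permutation symmetry of the grid.

```python
import numpy as np

def lattice_integrals(n=64):
    """Midpoint-rule estimates of a, b, c, d, s for the assumed eps(q) = 2*sum_i(1 - cos q_i)."""
    h = 2.0 * np.pi / n
    axis = -np.pi + h * (np.arange(n) + 0.5)   # midpoints, never equal to 0
    q1, q2, q3 = np.meshgrid(axis, axis, axis, indexing="ij")
    eps = 2.0 * (3.0 - np.cos(q1) - np.cos(q2) - np.cos(q3))

    def mean(g):
        # grid average of g/eps approximates (2*pi)^-3 times the integral over T^3
        return float(np.mean(g / eps))

    return {
        "a": mean(1.0),
        "c": mean(np.cos(q1)),
        "s": mean(np.sin(q1) ** 2),
        "b": mean(np.cos(q1) ** 2),
        "d": mean(np.cos(q1) * np.cos(q2)),
    }

r = lattice_integrals()
print("a - c        =", r["a"] - r["c"])                 # relation (B.2)
print("b + 2d - 3c  =", r["b"] + 2 * r["d"] - 3 * r["c"])  # relation (B.3)
print("a - (b + s)  =", r["a"] - (r["b"] + r["s"]))        # relation (B.3)
print("a (estimate) =", r["a"], " vs 11/51 =", 11 / 51)
```

Refining the grid (larger `n`) moves the estimate of $a$ upward, since the midpoint rule undersamples the singularity at the origin.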

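Corollary 2. The set

\[ \Sigma = \Bigl\{\frac{1}{s},\ \frac{1}{b-d}\Bigr\}\setminus\Bigl\{\frac{2a}{c}\Bigr\} \]

is nonempty.

Proof. Assume to the contrary that $\Sigma = \emptyset$, that is,

\[ \frac{1}{s} = \frac{1}{b-d} = \frac{2a}{c}. \tag{B.8} \]

Solving (B.3), (B.4) and (B.8) simultaneously in particular yields

\[ a = \frac{5}{24} < \frac{11}{51}, \]

which is impossible due to Lemma 6.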

Theorem 3. Assume that $-\lambda\in\Sigma$ and

\[ \mu = -\frac{1+3\lambda c}{a+\frac{\lambda c}{2}}, \]

then the Hamiltonian $h_{\lambda,\mu}$ has both a virtual level and a threshold eigenvalue.

Proof. In accordance with Proposition 1 one needs to show that the integral operator $G(0)$ given by (B.1) on the Banach space $C(\mathbb{T}^3)$ has two eigenfunctions, $\psi$ and $\varphi$, associated with an eigenvalue $-1$:

\[ G(0)\psi = -\psi \quad\text{and}\quad G(0)\varphi = -\varphi \]

such that

\[ \psi(0)\neq 0 \quad\text{and}\quad \varphi(0) = 0. \]

The space of all odd (resp. even) functions $C_o(\mathbb{T}^3)$ (resp. $C_e(\mathbb{T}^3)$) is an invariant subspace for the integral operator $G(0)$. The restrictions $G_o$ (respectively $G_e$) of $G(0)$ on the subspace $C_o(\mathbb{T}^3)$ (respectively $C_e(\mathbb{T}^3)$) have the kernel functions

\[ G_o(p,q) = \frac{\lambda}{(2\pi)^3}\sum_{i=1}^{3}\frac{\sin p_i\sin q_i}{\varepsilon(q)}, \qquad G_e(p,q) = \frac{1}{(2\pi)^3}\,\frac{\mu+\lambda\sum_{i=1}^{3}\cos p_i\cos q_i}{\varepsilon(q)}. \]


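The matrix of the restriction $G_o|_S$ of $G_o$ onto its three-dimensional invariant subspace $S\subset C_o(\mathbb{T}^3)$ spanned by the functions $\sin p_1$, $\sin p_2$, and $\sin p_3$ in the basis $e_i = \sin p_i$, $i = 1,2,3$, is a diagonal matrix of the form

\[ G_o|_S = \begin{pmatrix} \lambda s & 0 & 0 \\ 0 & \lambda s & 0 \\ 0 & 0 & \lambda s \end{pmatrix} \tag{B.9} \]

while the matrix of the restriction $G_e|_C$ of $G_e$ onto its four-dimensional invariant subspace $C\subset C_e(\mathbb{T}^3)$ spanned by the functions $1$, $\cos p_1$, $\cos p_2$, and $\cos p_3$ in the basis $f_i = \cos p_i$, $i = 1,2,3$, $f_4 = 1$ is given by

\[ G_e|_C = \begin{pmatrix} \lambda b & \lambda d & \lambda d & \lambda c \\ \lambda d & \lambda b & \lambda d & \lambda c \\ \lambda d & \lambda d & \lambda b & \lambda c \\ \mu c & \mu c & \mu c & \mu a \end{pmatrix}. \tag{B.10} \]

From (B.10) it follows that if $\lambda$, $\mu$ and $\gamma$ satisfy the relations

\[ \lambda(b+2d+c\gamma) = -1, \tag{B.11} \]

\[ \mu(3c+a\gamma) = -\gamma, \tag{B.12} \]

then

\[ \begin{pmatrix} \lambda b & \lambda d & \lambda d & \lambda c \\ \lambda d & \lambda b & \lambda d & \lambda c \\ \lambda d & \lambda d & \lambda b & \lambda c \\ \mu c & \mu c & \mu c & \mu a \end{pmatrix} \begin{pmatrix} 1 \\ 1 \\ 1 \\ \gamma \end{pmatrix} = -\begin{pmatrix} 1 \\ 1 \\ 1 \\ \gamma \end{pmatrix}. \]

Given $\lambda\in\mathbb{R}$, $\lambda\neq -\frac{2a}{c}$, solving Eqs. (B.11) and (B.12) with respect to $\mu$ and $\gamma$ yields

\[ \gamma(\lambda) = -\frac{1}{\lambda c} - 3, \tag{B.13} \]

\[ \mu(\lambda) = -\frac{\gamma(\lambda)}{3c+a\gamma(\lambda)} = \frac{1+3\lambda c}{3\lambda c^2-3\lambda ac-a} = -\frac{1+3\lambda c}{a+\frac{\lambda c}{2}}, \tag{B.14} \]

where we used (B.2) and (B.3). Therefore, if $\lambda\neq -\frac{2a}{c}$ and $\mu = \mu(\lambda)$ satisfies (B.14), the operator $G(0)$ has an eigenfunction $\psi(p) = \gamma(\lambda) + \sum_{i=1}^{3}\cos p_i$ and, moreover,

\[ \psi(0) = \Bigl(\gamma(\lambda) + \sum_{i=1}^{3}\cos p_i\Bigr)\Big|_{p_1=p_2=p_3=0} \neq 0, \]

as follows from (B.13). Thus, the Hamiltonian $h_{\lambda,\mu(\lambda)}$ has a virtual level.

Next, from the matrix representation (B.9) for $G_o|_S$ one gets that if $\lambda s = -1$, then for any $\mu\in\mathbb{R}$ the operator $G_o$ has a three-dimensional eigensubspace spanned by the functions $\sin p_i$, $i = 1,2,3$, associated with an eigenvalue $-1$ of multiplicity three. In particular, $G(0)\varphi = -\varphi$ with $\varphi(p) = \sin p_1$ and hence $\varphi(0) = 0$.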


Similarly (cf. (B.10)), if $\lambda(b-d) = -1$, then for any $\mu\in\mathbb{R}$ the operator $G_e|_C$ has two linearly independent eigenfunctions $\cos p_1-\cos p_2$ and $\cos p_1-\cos p_3$ associated with an eigenvalue $-1$ of multiplicity two. In particular, $G\varphi = -\varphi$ with $\varphi(p) = \cos p_1-\cos p_2$, and hence $\varphi(0) = 0$.

Therefore, if $\lambda = -\frac{1}{s}$ or $\lambda = -\frac{1}{b-d}$, then for any $\mu\in\mathbb{R}$ the operator $h_{\lambda,\mu}$ has an eigenvalue at the bottom of its (absolutely) continuous spectrum.

Taking $-\lambda\in\Sigma = \bigl\{\frac{1}{s},\frac{1}{b-d}\bigr\}\setminus\bigl\{\frac{2a}{c}\bigr\}$ (which is nonempty by Corollary 2) and $\mu = \mu(\lambda) = -\frac{1+3\lambda c}{a+\frac{\lambda c}{2}}$ one proves the coexistence of a virtual level and a threshold eigenvalue for the Hamiltonian $h_{\lambda,\mu(\lambda)}$.
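The algebra behind (B.11)–(B.14) can be checked with a few lines of linear algebra. In the sketch below the numerical values of $a$ and $d$ and the coupling `lam` are placeholders chosen only for illustration (they are not values from the paper); $c$ and $b$ are then fixed by the relations (B.2) and (B.3), which is all the eigenvector computation uses.

```python
import numpy as np

# Placeholder values (assumptions, not taken from the paper):
a = 0.2527               # rough numerical value of the integral a
d = 0.03                 # arbitrary illustrative value of the integral d
c = a - 1.0 / 6.0        # relation (B.2)
b = 3.0 * c - 2.0 * d    # relation (B.3)
lam = -7.0               # any nonzero lambda with lambda != -2a/c

gamma = -1.0 / (lam * c) - 3.0                       # (B.13)
mu = -(1.0 + 3.0 * lam * c) / (a + lam * c / 2.0)    # (B.14)

# Matrix (B.10) in the basis cos p_1, cos p_2, cos p_3, 1.
G_e = np.array([
    [lam * b, lam * d, lam * d, lam * c],
    [lam * d, lam * b, lam * d, lam * c],
    [lam * d, lam * d, lam * b, lam * c],
    [mu * c,  mu * c,  mu * c,  mu * a],
])
psi = np.array([1.0, 1.0, 1.0, gamma])   # psi(p) = gamma + cos p_1 + cos p_2 + cos p_3

residual = G_e @ psi + psi               # zero iff G_e psi = -psi
psi_at_zero = gamma + 3.0                # value of psi at p = 0
print("residual of G_e psi = -psi:", residual)
print("psi(0) =", psi_at_zero, " -1/(lam*c) =", -1.0 / (lam * c))
```

The value $\psi(0) = \gamma(\lambda) + 3 = -1/(\lambda c)$ is nonzero for every finite $\lambda$, which is the condition used for the virtual level.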

Remark 9. We were not able to find out whether the set $\Sigma$ is a one- or a two-point set and hence we cannot explicitly compute the multiplicity of the zero-energy eigenvalue (more information about numerical values of the integrals $a$, $b$, $c$, and $d$ is needed). However, if $\Sigma$ contains two elements, then the Hamiltonian $\hat h_{\lambda,\mu(\lambda)}$, $\mu(\lambda) = -\frac{1+3\lambda c}{a+\frac{\lambda c}{2}}$, has a virtual level and a threshold eigenvalue of multiplicity two or three depending on the choice of $-\lambda\in\Sigma$ ($\lambda = -\frac{1}{b-d}$ or $\lambda = -\frac{1}{s}$ respectively).

If $|\Sigma| = 1$, it might happen that the Hamiltonian $\hat h_{\lambda,\mu(\lambda)}$, $-\lambda\in\Sigma$, has a virtual level and a threshold eigenvalue of multiplicity two, three or even five depending on which of the cases

(i) $\frac{c}{2a} = s \neq b-d$,
(ii) $\frac{c}{2a} = b-d \neq s$,
(iii) $\frac{c}{2a} \neq s = b-d$

takes place respectively.

Acknowledgements. K. A. Makarov thanks F. Gesztesy and V. Kostrykin for useful discussions. He is also indebted to the Institute of Applied Mathematics of the University Bonn for its kind hospitality during his stay in the summer 2003. This work was also partially supported by the DFG 436 USB 113/4 Project and the Fundamental Science Foundation of Uzbekistan. S.N. Lakaev and Z.I. Muminov gratefully acknowledge the hospitality of the Institute of Applied Mathematics of the University Bonn. We are indebted to the anonymous referee for a number of constructive comments.

Communicated by B. Simon

Commun. Math. Phys. 262, 117–135 (2006) Digital Object Identifier (DOI) 10.1007/s00220-005-1385-7

Communications in

Mathematical Physics

PROP Profile of Poisson Geometry

S.A. Merkulov

Matematiska Institutionen, Stockholms Universitet, 10691 Stockholm, Sweden. E-mail: [email protected]

Received: 20 January 2005 / Accepted: 1 February 2005
Published online: 8 July 2005 – © Springer-Verlag 2005

Abstract: It is shown that some classical local geometries are of infinity origin, i.e. their smooth formal germs are (homotopy) representations of cofibrant (di)operads in spaces concentrated in degree zero. In particular, they admit natural infinity generalizations when one considers homotopy representations of the (di)operads in generic differential graded spaces. Poisson geometry provides us with the simplest manifestation of this phenomenon.

0. Introduction

The first instances of algebraic and topological strongly homotopy, or infinity, structures were discovered by Stasheff [St] long ago. Since that time infinities have acquired a prominent role in algebraic topology and homological algebra. We argue in this paper that some classical local geometries are of infinity origin, i.e. their smooth formal germs are (homotopy) representations of cofibrant PROPs $P_\infty$ in spaces concentrated in degree zero; in particular, they admit natural infinity generalizations when one considers homotopy representations of $P_\infty$ in generic differential graded (dg) spaces. The simplest manifestation of this phenomenon is provided by Poisson geometry (or even by smooth germs of tensor fields!) and is the main theme of the present paper. Another example is discussed in [Mer2].

The PROPs $P_\infty$ are minimal resolutions of PROPs $P$ which are graph spaces built from very few basic elements, genes, subject to simple engineering rules. Thus to a local geometric structure one can associate a kind of code, a genome, which specifies it uniquely and opens a new window of opportunities for attacking differential geometric problems with the powerful tools of homological algebra.

Formal germs of geometric structures discussed in this paper are pointed in the sense that they vanish at the distinguished point. This is the usual price one pays for working with (di)operads without “zero terms” (as is often done in the literature). As the structural equations behind the particular geometries we study in this paper are homogeneous, this restriction poses no problem: say, a generic non-pointed Poisson structure, $\nu$, in $\mathbb{R}^n$ can be identified with the pointed one, $\hbar\nu$, in $\mathbb{R}^{n+1}$, $\hbar$ being the extra coordinate.

We introduce in this paper a dg free dioperad whose generic representations in a graded vector space $V$ can be identified with pointed solutions of the Maurer-Cartan equations in the Lie algebra of polyvector fields on the formal manifold associated with $V$. The cohomology of this dioperad cannot be computed directly. Instead one has to rely on some fine mathematics such as Koszulness [GiKa, G] and distributive laws [Mar1, G]. One of the main results of this paper is a proof of Theorem 3.2 which identifies the cohomology of that dg free dioperad with a surprisingly small dioperad, $\mathsf{Lie^1Bi}$, of Lie 1-bialgebras, which are almost identical to the dioperad, $\mathsf{LieBi}$, of usual Lie bialgebras except that the degrees of the generating Lie and coLie operations differ by 1 (compare with Gerstenhaber versus Poisson algebras). The dioperad $\mathsf{Lie^1Bi}$ is proven to be Koszul. We use the resulting geometric interpretation of $\mathsf{Lie^1Bi}_\infty$ algebras to give their homotopy classification (see Theorem 3.4.5) which is an extension of Kontsevich’s homotopy classification [Ko1] of $L_\infty$ algebras.

As a side remark we also discuss graph and geometric interpretations of strongly homotopy Lie bialgebras using Koszulness of the latter which was established in [G].

1. Geometry ⇒ PROP profile ⇒ Geometry$_\infty$

Let $P$ be an operad, or a dioperad, or even a PROP admitting a minimal dg resolution. Let $P\mathrm{Alg}$ be the category of finite dimensional dg $P$-algebras, and $D(P\mathrm{Alg})$ the associated derived category (which we understand here as the homotopy category of $P_\infty$-algebras, $P_\infty$ being the minimal resolution of $P$). For any locally defined geometric structure Geom (say, Poisson, Riemann, Kähler, etc.) it makes sense to talk about the category of formal Geom-manifolds.
Its objects are formal pointed manifolds (non-canonically isomorphic to $(\mathbb{R}^n,0)$ for some $n$) together with a germ of formal Geom-structure at the distinguished point.

Definition 1.1. The operad/dioperad/PROP $P$ is called a PROP-profile, or genome, of a geometric structure Geom if
• the category of formal Geom-manifolds is equivalent to a full subcategory of the derived category $D(P\mathrm{Alg})$, and
• there is no sub-(di)operad of $P$ having the above property.

Definition 1.2. If $P$ is a PROP-profile of a geometric structure Geom, then a generic object of $D(P\mathrm{Alg})$ is called a formal Geom$_\infty$-manifold.

Presumably, a Geom$_\infty$-structure is what one gets from Geom by means of the extended deformation theory.

Local geometric structures are often non-trivial and complicated creatures: the general solution of the associated defining system of nonlinear differential equations is not available, and it is often a very hard job just to show existence of non-trivial solutions. Nevertheless, if such a structure Geom admits a PROP-profile, $P = Free(E)/Ideal$ $^1$, then Geom can be non-ambiguously characterized by its “genetic code”: genes are, by definition, the generators of $E$, and the engineering rules are, by definition, the generators of $Ideal$. And that code can be surprisingly simple, as Examples 1.3–1.5 and Table 1 illustrate.

$^1$ Any operad/dioperad/etc. can be represented as a quotient of the free operad/dioperad/etc., $Free(E)$, generated by a collection of $\Sigma_m$-left/$\Sigma_n$-right modules $E = \{E(m,n)\}_{m,n\geq 1}$, by an $Ideal$. Often there exists a canonical, “common factors canceled out”, representation like this.

Table 1.

Columns: (1) genome $P$; (2) generic representation of $P_\infty$ in $\mathbb{R}^n$; (3) generic representation of $P_\infty$ in a graded vector space $V$.

Row 1.
(1) $P$ is the G-operad. Genes: two corollas, ◦ and •. Engineering rules: three graph relations [corolla diagrams omitted].
(2) Smooth formal Hertling-Manin structure in $\mathbb{R}^n$ [HeMa].
(3) Smooth formal Hertling-Manin$_\infty$ structure in $\hat V$ [Mer1].

Row 2.
(1) $P$ is the dioperad TF. Genes: two corollas, ◦ and •. Rules: two graph relations [corolla diagrams omitted].
(2) Smooth formal section of $\otimes^2 T_{\mathbb{R}^n}$ (variants: of $\wedge^2 T_{\mathbb{R}^n}$ or of $\odot^2 T_{\mathbb{R}^n}$) vanishing at 0.
(3) Structure, $(\hat V, ð\in T_{\hat V})$, of a smooth dg manifold together with a smooth section $\phi$ of $\otimes^2 T_{\hat V}$ (variants: of $\wedge^2 T_{\hat V}$ or of $\odot^2 T_{\hat V}$) vanishing at 0 and satisfying $\mathrm{Lie}_ð\,\phi = 0$.

Row 3.
(1) $P$ is the dioperad $\mathsf{Lie^1Bi}$. Genes: two corollas, ◦ and •. Rules: three graph relations [corolla diagrams omitted].
(2) Smooth formal Poisson structure in $\mathbb{R}^n$ vanishing at 0.
(3) Structure, $(V\oplus V^*[1], ð)$, of a smooth dg manifold together with an odd symplectic form $\omega_{odd}$ on $V\oplus V^*[1]$ such that the homological vector field ð is hamiltonian and vanishes on $0\oplus V^*[1]$.

Notations: For a graded vector space $V$, $\hat V$ stands for the formal graded manifold (non-canonically) isomorphic to the formal neighbourhood of 0 in $V$, and $T_{\hat V}$ stands for the tangent bundle on $\hat V$.

1.3. Hertling-Manin’s geometry and the G-operad. A Gerstenhaber algebra is, by definition, a graded vector space $V$ together with two linear maps,

\[ \circ : \odot^2 V \to V,\quad a\otimes b \mapsto a\circ b, \qquad [\ \bullet\ ] : \odot^2 V \to V[1],\quad a\otimes b \mapsto (-1)^{|a|}[a\bullet b], \]

satisfying the identities,
(i) $a\circ(b\circ c) - (a\circ b)\circ c = 0$ (associativity);
(ii) $[[a\bullet b]\bullet c] = [a\bullet[b\bullet c]] + (-1)^{|b||a|+|b|+|a|}[b\bullet[a\bullet c]]$ (Jacobi identity);
(iii) $[(a\circ b)\bullet c] = a\circ[b\bullet c] + (-1)^{|b|(|c|+1)}[a\bullet c]\circ b$ (Leibniz type identity).


S.A. Merkulov

The operad whose algebras are Gerstenhaber algebras is often called the G-operad. It has a relatively simple structure, $Free(E)/Ideal$, with $E$ spanned by two corollas, $\circ$ and $[\,\bullet\,]$ (each with two inputs and one output), and with engineering rules (i)–(iii). The minimal resolution of the G-operad has been constructed in [GetJo] and is often called a $G_\infty$-operad. The derived category of Gerstenhaber algebras is equivalent to the category whose objects are isomorphism classes of minimal $G_\infty$-structures on graded vector spaces $V$.

Let $(M, *)$ be the formal pointed graded manifold whose tangent space at the distinguished point is isomorphic to a vector space $V$, and let us choose an arbitrary torsion-free affine connection $\nabla$ on $M$. With this choice a structure of $G_\infty$ algebra on a graded vector space $V$ can be suitably described as
• a degree 1 smooth vector field $ð$ on $M$ satisfying the integrability condition $[ð, ð] = 0$ and vanishing at the distinguished point $*$ (if $ð$ has a zero of second order at $*$, then the $G_\infty$-structure is called minimal);
• a collection of homogeneous tensors,

\[ \left\{ \mu_{n_1,\dots,n_k} : T_M^{\otimes n_1} \otimes T_M^{\otimes n_2} \otimes \cdots \otimes T_M^{\otimes n_k} \to T_M[k+1-n_1-\cdots-n_k] \right\}_{n_i, k \geq 1,\ n_i + k \geq 2} \]

satisfying an infinite tower of quadratic algebraic and differential equations.

The first two floors of this tower read as follows: the data $\{\mu_n\}_{n\geq 1}$ (with $\mu_1 := Lie_ð$) makes the tangent sheaf $T_M$ into a sheaf of $C_\infty$ algebras (see footnote 2) satisfying an “integrability” condition, $[\mu_\bullet, \mu_\bullet]_{G_\infty} = Lie_ð\, \mu_{\bullet,\bullet}$, for a certain bi-differential operator $[\ ,\ ]_{G_\infty}$ whose leading term is just the usual vector field bracket of values of $\mu_\bullet$. It is also required that each tensor $\mu_{\bullet,\dots,\bullet} : T_M^{\otimes\bullet} \otimes \cdots \otimes T_M^{\otimes\bullet} \to T_M$ vanishes if the input contains at least one pure shuffle product,

\[ (v_1 \otimes \cdots \otimes v_k) \sqcup\!\sqcup (v_{k+1} \otimes \cdots \otimes v_n) := \sum_{\text{shuffles } \sigma \text{ of type } (k,n)} (-1)^{Koszul(\sigma)}\, v_{\sigma(1)} \otimes \cdots \otimes v_{\sigma(n)}, \qquad v_i \in T_M. \]

A change of the connection $\nabla$ alters the tensors $\mu_{\bullet_1,\dots,\bullet_k}$, $k \geq 2$, but leaves the homotopy class of the $G_\infty$-structure on $V$ invariant. If the vector space $V$ is concentrated in degree 0, i.e. $V \simeq \mathbb{R}^n$, then a $G_\infty$-structure on $V$ reduces just to a single tensor field $\mu_2 : T_M^{\otimes 2} \to T_M$ which makes the tangent sheaf into a sheaf of commutative associative algebras, and satisfies the differential equations, $[\mu_2, \mu_2]_{G_\infty} = 0$.

Footnote 2. $C_\infty$ stands for the minimal resolution of the operad of commutative associative algebras.
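The shuffle product entering the vanishing condition above can be made concrete. The following is a minimal sketch, assuming the Koszul sign picks up a factor $(-1)^{|a||b|}$ each time a letter $b$ of the second word is moved in front of a letter $a$ of the first; the function name `shuffle_product` and the `degree` callback are illustrative and do not come from the paper.

```python
from itertools import combinations

def shuffle_product(left, right, degree=lambda v: 0):
    """Return the signed shuffles of two words as (sign, word) pairs.

    `left` has length k and `right` has length n - k; `degree` returns the
    degree |v| of a letter (by default every letter is even).
    """
    k, n = len(left), len(left) + len(right)
    terms = []
    for left_slots in combinations(range(n), k):
        slots = set(left_slots)
        left_iter, right_iter = iter(left), iter(right)
        # Interleave the two words, keeping the internal order of each.
        word = tuple(next(left_iter) if p in slots else next(right_iter)
                     for p in range(n))
        # Koszul sign: one factor (-1)^{|a||b|} for every letter b of `right`
        # that now stands in front of a letter a of `left`.
        exponent = 0
        for pos_a, a in zip(left_slots, left):
            right_before = [w for p, w in enumerate(word[:pos_a])
                            if p not in slots]
            exponent += sum(degree(a) * degree(b) for b in right_before)
        terms.append(((-1) ** exponent, word))
    return terms
```

For example, `shuffle_product(("v1", "v2"), ("v3",))` lists the three interleavings of $v_1 \otimes v_2$ with $v_3$, all with sign $+1$ when every letter is even.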


The explicit form for the bracket $[\ ,\ ]_{G_\infty}$ can be read off from the $G_\infty$ operad structural equations rather straightforwardly (see [Mer1] for details),

\begin{align*}
[\mu_2, \mu_2]_{G_\infty}(X, Y, Z, W) ={}& [\mu_2(X, Y), \mu_2(Z, W)] \\
& - \mu_2([\mu_2(X, Y), Z], W) - \mu_2(Z, [\mu_2(X, Y), W]) \\
& - \mu_2(X, [Y, \mu_2(Z, W)]) - \mu_2([X, \mu_2(Z, W)], Y) \\
& + \mu_2(X, \mu_2(Z, [Y, W])) + \mu_2(X, \mu_2([Y, Z], W)) \\
& + \mu_2([X, Z], \mu_2(Y, W)) + \mu_2([X, W], \mu_2(Y, Z)).
\end{align*}

The resulting geometric structure is precisely the one discovered earlier by Hertling and Manin [HeMa] in their quest for a weaker notion of Frobenius manifold; they call it an F-manifold structure on $V$. Hertling-Manin’s geometric structures arise naturally in the theory of singularities [He] and in deformation theory [Mer1].

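1.4. Germs of tensor fields. A TF bialgebra is, by definition, a graded vector space $V$ together with two linear maps,

\[ \delta : V \to \otimes^2 V, \quad a \mapsto a_1 \otimes a_2, \qquad\qquad [\,\bullet\,] : \odot^2 V \to V[1], \quad a \otimes b \mapsto (-1)^{|a|}[a \bullet b], \]

(depicted by the corolla $\circ$ with one input and two outputs and by the corolla $\bullet$ with two inputs and one output, respectively) satisfying the identities,
(i) $[[a \bullet b] \bullet c] = [a \bullet [b \bullet c]] + (-1)^{|b||a|+|b|+|a|} [b \bullet [a \bullet c]]$ (Jacobi identity);
(ii) $\delta[a \bullet b] = a_1 \otimes [a_2 \bullet b] + [a \bullet b_1] \otimes b_2 + (-1)^{|a||b|+|a|+|b|} \left([b \bullet a_1] \otimes a_2 + b_1 \otimes [b_2 \bullet a]\right)$ (Leibniz type identity).

There are obvious versions of the above notion with $\delta$ taking values in $\wedge^2 V$ and $\odot^2 V$, i.e. with the gene $\circ$ realizing either the sign or the trivial representation of $\Sigma_2$.

The dioperad whose algebras are TF bialgebras is denoted by TF. This quadratic dioperad is Koszul, so that one can construct its minimal resolution using the results of [G, GiKa, Mar1]. It turns out that the structure of a TF$_\infty$-algebra on a graded vector space $V$ is the same as a pair of collections of linear maps,

\[ \{\mu_n : \odot^n V \to V[1]\}_{n\geq 1} \quad \text{and} \quad \{\phi_n : \odot^n V \to V \otimes V\}_{n\geq 1}, \]

satisfying a system of quadratic equations which are best described using a geometric language. Let $M$ be the formal graded manifold associated to $V$. If $\{e_\alpha, \alpha = 1, 2, \dots\}$ is a homogeneous basis of $V$, then the associated dual basis $t^\alpha$, $|t^\alpha| = -|e_\alpha|$, defines a coordinate system on $M$. The collection of tensors $\{\mu_n\}_{n\geq 1}$ can be assembled into a germ, $ð \in T_M$, of a degree 1 smooth vector field,

\[ ð := \sum_{n=1}^{\infty} \frac{1}{n!} (-1)^{\epsilon}\, t^{\alpha_1} \cdots t^{\alpha_n}\, \mu_{\alpha_1 \dots \alpha_n}^{\beta} \frac{\partial}{\partial t^{\beta}}, \]

where

\[ \epsilon = \sum_{k=1}^{n} |e_{\alpha_k}| \Big(1 + \sum_{i=1}^{k} |e_{\alpha_i}|\Big), \]

the numbers $\mu_{\alpha_1 \dots \alpha_n}^{\beta}$ are defined by

\[ \mu_n(e_{\alpha_1}, \dots, e_{\alpha_n}) = \mu_{\alpha_1 \dots \alpha_n}^{\beta}\, e_\beta, \]

and we assume here and throughout the paper summation over repeated small Greek indices. The other collection of linear maps, $\{\phi_n\}$, can be assembled into a smooth germ, $\phi \in \otimes^2 T_M$, of a degree zero contravariant tensor field on $M$,

\[ \phi := \sum_{n=1}^{\infty} \frac{1}{n!} (-1)^{\epsilon}\, t^{\alpha_1} \cdots t^{\alpha_n}\, \phi_{\alpha_1 \dots \alpha_n}^{\beta_1 \beta_2} \frac{\partial}{\partial t^{\beta_1}} \otimes \frac{\partial}{\partial t^{\beta_2}}, \]

where

\[ \epsilon = |e_{\beta_2}|(|e_{\beta_1}| + 1) + \sum_{k=1}^{n} \sum_{i=1}^{k} |e_{\alpha_k}||e_{\alpha_i}| \]

and the numbers $\phi_{\alpha_1 \dots \alpha_n}^{\beta_1 \beta_2}$ are defined by

\[ \phi_n(e_{\alpha_1}, \dots, e_{\alpha_n}) = \phi_{\alpha_1 \dots \alpha_n}^{\beta_1 \beta_2}\, e_{\beta_1} \otimes e_{\beta_2}. \]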

Proposition 1.4.1. The collections of tensors, $\{\mu_n : \odot^n V \to V[1]\}_{n\geq 1}$ and $\{\phi_n : \odot^n V \to V \otimes V\}_{n\geq 1}$, define a structure of TF$_\infty$-algebra on $V$ if and only if the associated smooth vector field $ð$ and the contravariant tensor field $\phi$ satisfy the equations,

\[ [ð, ð] = 0 \quad \text{and} \quad Lie_ð\, \phi = 0, \]

where $[\ ,\ ]$ stands for the usual bracket of vector fields and $Lie_ð$ for the Lie derivative along $ð$.

If $V$ is finite dimensional and concentrated in degree zero, then a TF$_\infty$-structure in $V$ is just a germ of a smooth rank 2 contravariant tensor on $V$ vanishing at 0.


1.5. Poisson geometry and the dioperad of Lie 1-bialgebras. A Lie 1-bialgebra is, by definition, a graded vector space $V$ together with two linear maps,

\[ \delta : V \to \wedge^2 V, \quad a \mapsto a_1 \wedge a_2, \qquad\qquad [\,\bullet\,] : \odot^2 V \to V[1], \quad a \otimes b \mapsto (-1)^{|a|}[a \bullet b], \]

satisfying the identities,
(i) $(\delta \otimes Id)\delta a + \tau(\delta \otimes Id)\delta a + \tau^2(\delta \otimes Id)\delta a = 0$, where $\tau$ is the cyclic permutation (123) represented naturally on $V \otimes V \otimes V$ (co-Jacobi identity);
(ii) $[[a \bullet b] \bullet c] = [a \bullet [b \bullet c]] + (-1)^{|b||a|+|b|+|a|} [b \bullet [a \bullet c]]$ (Jacobi identity);
(iii) $\delta[a \bullet b] = a_1 \wedge [a_2 \bullet b] - (-1)^{|a_1||a_2|} a_2 \wedge [a_1 \bullet b] + [a \bullet b_1] \wedge b_2 - (-1)^{|b_1||b_2|} [a \bullet b_2] \wedge b_1$ (Leibniz type identity).

The dioperad whose algebras are Lie 1-bialgebras is denoted by Lie$^1$Bi. The superscript 1 in the notation is used to emphasize that the two basic operations, $\delta$ (the corolla $\circ$ with one input and two outputs) and $[\,\bullet\,]$ (the corolla $\bullet$ with two inputs and one output), have homogeneities which differ by 1. Similarly one can introduce the notion of Lie $n$-bialgebras: a coLie algebra structure on $V$ plus a Lie algebra structure on $V[-n]$ plus an obvious Leibniz type identity. The homotopy theory of Lie $n$-bialgebras splits into two stories, one for $n$ even, and one for $n$ odd. The even case (more precisely, the case $n = 0$) has been studied by Gan [G]. In this paper we study the odd case, more precisely, the case $n = 1$.

The dioperad Lie$^1$Bi is Koszul. Hence one can use the machinery of [G, GiKa, Mar1] to construct its minimal resolution, the dioperad Lie$^1$Bi$_\infty$. The structure of a Lie$^1$Bi$_\infty$ algebra on a graded vector space $V$ is a collection of linear maps,

\[ \{\mu_{m,n} : \odot^n V \to \wedge^m V[2-m]\}_{m\geq 1, n\geq 1}, \]

satisfying a system of quadratic equations which can be described as follows. Let $M$ be the formal graded manifold associated to $V$. If $\{e_\alpha, \alpha = 1, 2, \dots\}$ is a homogeneous basis of $V$, then the associated dual basis $t^\alpha$, $|t^\alpha| = -|e_\alpha|$, defines a coordinate system on $M$. For a fixed $m$ the collection of tensors $\{\mu_{m,n}\}_{n\geq 1}$ can be assembled into a germ, $\Gamma_m \in \wedge^m T_M$, of a smooth polyvector field (vanishing at $0 \in M$),

\[ \Gamma_m := \sum_{n=1}^{\infty} \frac{1}{m!\,n!} (-1)^{\epsilon}\, t^{\alpha_1} \cdots t^{\alpha_n}\, \mu_{\alpha_1 \dots \alpha_n}^{\beta_1 \dots \beta_m} \frac{\partial}{\partial t^{\beta_1}} \wedge \cdots \wedge \frac{\partial}{\partial t^{\beta_m}}, \]

where

\[ \epsilon = \sum_{k=1}^{n} |e_{\alpha_k}| \Big(2 - m + \sum_{i=1}^{k} |e_{\alpha_i}|\Big) + \sum_{k=1}^{m} (|e_{\beta_k}| + 1) \sum_{i=k+1}^{m} |e_{\beta_i}| \]

and the numbers $\mu_{\alpha_1 \dots \alpha_n}^{\beta_1 \dots \beta_m}$ are defined by

\[ \mu_{m,n}(e_{\alpha_1}, \dots, e_{\alpha_n}) = \mu_{\alpha_1 \dots \alpha_n}^{\beta_1 \dots \beta_m}\, e_{\beta_1} \wedge \cdots \wedge e_{\beta_m}. \]
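The co-Jacobi identity (i) can be checked mechanically in the simplest situation. The following is a minimal sketch, assuming $V$ is three-dimensional and concentrated in degree zero (so no Koszul signs appear), with the cobracket dual to the $so(3)$ bracket, $\delta(e_k) = \sum_{i,j} \varepsilon_{ijk}\, e_i \otimes e_j$. The structure constants and the helper names `levi_civita` and `co_jacobiator` are illustrative choices, not taken from the paper.

```python
from collections import defaultdict
from itertools import product

def levi_civita(i, j, k):
    """Totally antisymmetric symbol on indices taken from {0, 1, 2}."""
    return (i - j) * (j - k) * (k - i) // 2

def co_jacobiator(c, dim):
    """Nonzero coefficients of (1 + tau + tau^2)(delta (x) Id) delta e_k.

    `c[(i, j, k)]` is the coefficient of e_i (x) e_j in delta(e_k).
    The result maps k to a dict {(p, q, r): coefficient of e_p (x) e_q (x) e_r}.
    """
    result = {}
    for k in range(dim):
        total = defaultdict(int)
        for (i, j, kk), c1 in c.items():
            if kk != k:
                continue
            for (p, q, ii), c2 in c.items():
                if ii != i:
                    continue
                # (delta (x) Id) delta e_k contributes e_p (x) e_q (x) e_j;
                # add the term together with its two cyclic rotations.
                for key in ((p, q, j), (j, p, q), (q, j, p)):
                    total[key] += c1 * c2
        result[k] = {key: v for key, v in total.items() if v != 0}
    return result

# Cobracket dual to so(3): delta(e_k) = sum_{i,j} eps_{ijk} e_i (x) e_j.
so3 = {(i, j, k): levi_civita(i, j, k)
       for i, j, k in product(range(3), repeat=3)
       if levi_civita(i, j, k) != 0}
```

With these conventions `co_jacobiator(so3, 3)` returns an empty dictionary for every `k`, which is the statement that the linear bivector field with coefficients $\varepsilon_{ijk}\, t^k$ is a Poisson structure on $\mathbb{R}^3$ vanishing at 0.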


Proposition 1.5.1. A collection of tensors, $\{\mu_{m,n} : \odot^n V \to \wedge^m V[2-m]\}_{m\geq 1, n\geq 1}$, defines a structure of Lie$^1$Bi$_\infty$-algebra on $V$ if and only if the associated smooth polyvector field,

\[ \Gamma := \sum_{m\geq 1} \Gamma_m \in \wedge^\bullet T_M, \]

satisfies the equation $[\Gamma, \Gamma] = 0$, where $[\ ,\ ]$ stands for the Schouten bracket of polyvector fields.

In particular, if $V$ is concentrated in degree zero, then the only non-zero summand in $\Gamma$ is $\Gamma_2 \in \wedge^2 T_M$. Hence a Lie$^1$Bi$_\infty$-algebra structure on $\mathbb{R}^n$ is nothing but a germ of a smooth Poisson structure on $\mathbb{R}^n$ vanishing at 0.

1.6. On the content of the rest. Section 2 is a reminder about PROPs, dioperads and Koszulness [G, GiKa, Mar1]. In Sects. 3 and 4 we prove Koszulness of the dioperads Lie$^1$Bi and TF, apply the machinery reviewed in Sect. 2 to give explicit graph descriptions of their minimal resolutions, Lie$^1$Bi$_\infty$ and TF$_\infty$, prove Propositions 1.4.1 and 1.5.1 and introduce and study the notion of Lie$^1$Bi$_\infty$ morphisms. Section 5 is a comment on a geometric description of algebras over the dioperad of strongly homotopy Lie bialgebras, and their strongly homotopy maps.

2. PROPs and Dioperads [G]

Let $S_f$ be the groupoid of finite sets. It is equivalent to the category whose objects are natural numbers, $\{m\}_{m\geq 1}$, and whose morphisms are the permutation groups $\{\Sigma_m\}_{m\geq 1}$. A PROP $P$ in the category, dgVec, of differential graded (shortly, dg) vector spaces is a functor $P : S_f \times S_f^{op} \to$ dgVec together with natural transformations,

\[ \circ_{A,B,C} : P(A, B) \otimes P(B, C) \to P(A, C), \qquad \otimes_{A,B,C,D} : P(A, B) \otimes P(C, D) \to P(A \otimes C, B \otimes D), \]

and the distinguished elements $Id_A \in P(A, A)$ and $s_{A,B} \in P(A \otimes B, B \otimes A)$ satisfying a system of axioms [A] which just mimic the obvious properties of the following natural transformation,

\[ E_V : (m, n) \to Hom(V^{\otimes n}, V^{\otimes m}), \]

canonically associated with an arbitrary dg space $V$. The latter fundamental example is called the endomorphism PROP of $V$.

Given a collection of dg $(\Sigma_m, \Sigma_n)$-bimodules, $E = \{E(m, n)\}_{m,n\geq 1}$, one can construct the associated free PROP, $Free(E)$, by decorating vertices of all possible directed graphs with a flow by the elements of $E$ and then taking the colimit over the graph automorphism group.
The composition operation ◦ corresponds then to gluing output legs of one graph to the input legs of another graph, and the tensor product ⊗ to the disjoint union of graphs. Even for a small finite dimensional collection E the resulting free PROP can be a monstrous infinite dimensional object. The notion of dioperad was introduced by Gan [G] as a way to avoid that free PROP “divergence”. In the above setup, a free dioperad on E is built on graphs of genus zero, i.e. on trees. More precisely, a dioperad P consists of data:


(i) a collection of dg $(\Sigma_m, \Sigma_n)$ bimodules, $\{P(m, n)\}_{m\geq 1, n\geq 1}$;
(ii) for each $m_1, n_1, m_2, n_2 \geq 1$, $i \in \{1, 2, \dots, n_1\}$ and $j \in \{1, \dots, m_2\}$ a linear map

\[ {}_i\circ_j : P(m_1, n_1) \otimes P(m_2, n_2) \to P(m_1 + m_2 - 1, n_1 + n_2 - 1); \]

(iii) a morphism $e : k \to P(1, 1)$ such that the compositions

\[ k \otimes P(m, n) \xrightarrow{\ e \otimes Id\ } P(1, 1) \otimes P(m, n) \xrightarrow{\ {}_1\circ_i\ } P(m, n) \quad \text{and} \quad P(m, n) \otimes k \xrightarrow{\ Id \otimes e\ } P(m, n) \otimes P(1, 1) \xrightarrow{\ {}_j\circ_1\ } P(m, n) \]

are the canonical isomorphisms for all $m, n \geq 1$, $1 \leq i \leq m$ and $1 \leq j \leq n$.

These data satisfy associativity and equivariance conditions [G] which can be read off from the example of the endomorphism dioperad $End_V$ with $End_V(m, n) = Hom(V^{\otimes n}, V^{\otimes m})$, $e : 1 \mapsto Id \in Hom(V, V)$, and the compositions given by

\[ {}_i\circ_j : P(m_1, n_1) \otimes P(m_2, n_2) \to P(m_1 + m_2 - 1, n_1 + n_2 - 1), \qquad f \otimes g \mapsto (Id \otimes \cdots \otimes f \otimes \cdots \otimes Id)\, \sigma\, (Id \otimes \cdots \otimes g \otimes \cdots \otimes Id), \]

where $f$ (resp. $g$) is at the $j$th (resp. $i$th) place, and $\sigma$ is the permutation of the set $I = (1, 2, \dots, n_1 + m_2 - 1)$ swapping the subintervals, $I_1 \leftrightarrow I_2$ and $I_4 \leftrightarrow I_5$, of the unique order preserving decomposition, $I = I_1 \sqcup I_2 \sqcup I_3 \sqcup I_4 \sqcup I_5$, of $I$ into the disjoint union of five intervals of lengths $|I_1| = i - 1$, $|I_2| = j - 1$, $|I_3| = 1$, $|I_4| = m_2 - j$ and $|I_5| = n_1 - i$.

If $P$ is a dioperad, then the collection of $(\Sigma_m, \Sigma_n)$ bimodules, $P^{op}(m, n) := (P(n, m)$, transposed actions of $\Sigma_m$ and $\Sigma_n)$, is naturally a dioperad as well. If $P$ is a dioperad with $P(m, n)$ vanishing for all $m, n$ except for $(m = 1, n \geq 1)$, then $P$ is called an operad.

A morphism of dioperads, $F : P \to Q$, is a collection of equivariant linear maps, $F(m, n) : P(m, n) \to Q(m, n)$, preserving all the structures. If $P$ is a dioperad, then a $P$-algebra is a dg vector space $V$ together with a morphism, $F : P \to End_V$, of dioperads.

We shall consider below only dioperads $P$ with $P(m, n)$ being finite dimensional vector spaces (over a field $k$ of characteristic zero) for all $m, n$.

The endomorphism dioperad of the vector space $k[-p]$, $p \in \mathbb{Z}$, is denoted by $\langle p \rangle$. Thus $\langle p \rangle(m, n)$ is $sgn_n^{\otimes p} \otimes sgn_m^{\otimes p}[p(n - m)]$, where $sgn_m$ stands for the one dimensional sign representation of $\Sigma_m$. Representations of the dioperad $P\langle p \rangle := P \otimes \langle p \rangle$ in a vector space $V$ are the same as representations of the dioperad $P$ in $V[p]$.

If $P$ is a dioperad, then $\Lambda P := \{sgn_m \otimes P(m, n)[2 - m - n] \otimes sgn_n\}$ and $\Lambda^{-1} P := \{sgn_m \otimes P(m, n)[m + n - 2] \otimes sgn_n\}$ are also dioperads.
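The bookkeeping behind the composition ${}_i\circ_j$ can be sketched in a few lines. The following is an illustrative sketch only, assuming that “swapping the subintervals” means exchanging the adjacent blocks $I_1, I_2$ and $I_4, I_5$ while keeping the internal order of each block; the function names are hypothetical.

```python
def block_swap_permutation(i, j, m2, n1):
    """One-line notation of sigma on (1, ..., n1 + m2 - 1).

    The interval is cut into blocks of lengths i-1, j-1, 1, m2-j, n1-i,
    and the blocks I1 <-> I2 and I4 <-> I5 are exchanged.
    """
    if not (1 <= i <= n1 and 1 <= j <= m2):
        raise ValueError("need 1 <= i <= n1 and 1 <= j <= m2")
    lengths = [i - 1, j - 1, 1, m2 - j, n1 - i]
    blocks, start = [], 1
    for length in lengths:
        blocks.append(list(range(start, start + length)))
        start += length
    i1, i2, i3, i4, i5 = blocks
    return i2 + i1 + i3 + i5 + i4

def composite_arity(m1, n1, m2, n2):
    """Arity (outputs, inputs) of a composite of P(m1, n1) with P(m2, n2)."""
    return (m1 + m2 - 1, n1 + n2 - 1)
```

The five block lengths always add up to $n_1 + m_2 - 1$, which is why the decomposition of $I$ is unique once $i$ and $j$ are fixed.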


2.1. Cobar dual. If $T$ is a directed (i.e. provided with a flow which we always assume in our pictures to go from the bottom to the top) tree, we denote by
• $Vert(T)$ the set of all vertices,
• $edge(T)$ the set of internal edges; $det(T) := \wedge^{|edge(T)|} span_k(edge(T))$;
• $Edge(T)$ the set of all edges, i.e. $Edge(T) := edge(T) \sqcup \{$input legs (leaves)$\} \sqcup \{$output legs (roots)$\}$; $Det(T) := \wedge^{|Edge(T)|} span_k(Edge(T))$;
• $Out(v)$ (resp. $In(v)$) the set of outgoing (resp. incoming) edges at a vertex $v \in Vert(T)$.

An $(m, n)$-tree is a tree $T$ with $n$ input legs labeled by the set $[n] = \{1, \dots, n\}$ and $m$ output legs labeled by the set $[m] = \{1, \dots, m\}$. A tree $T$ is called trivalent if $|Out(v) \sqcup In(v)| = 3$ for all $v \in Vert(T)$.

Let $E = \{E(m, n)\}_{m,n\geq 1}$ be a collection of finite dimensional $(\Sigma_m, \Sigma_n)$ bimodules with $E(1, 1) = 0$. For a pair of finite sets, $I, J \in Objects(S_f)$, with $|I| = m$ and $|J| = n$, one defines

\[ E(I, J) := Hom_{S_f}([m], I) \times_{\Sigma_m} E(m, n) \times_{\Sigma_n} Hom_{S_f}(J, [n]). \]

The free dioperad, $Free(E)$, generated by $E$ is defined by

\[ Free(E)(m, n) := \bigoplus_{(m,n)\text{-trees } T} E(T), \qquad \text{where} \quad E(T) := \bigotimes_{v \in Vert(T)} E(Out(v), In(v)), \]

and the compositions ${}_i\circ_j$ are given by grafting the $j$th root of one tree into the $i$th leaf of another tree, and then taking the “unordered” tensor product [MSS] over the set of vertices of the resulting tree.

Let $P = \{P(m, n)\}_{m,n\geq 1}$ be a collection of graded $(\Sigma_m, \Sigma_n)$ bimodules. We denote by $\bar{P}$ the collection $\{\bar{P}(m, n)\}_{m,n\geq 1}$ given by $\bar{P}(m, n) := P(m, n)$ for $m + n \geq 3$ and $\bar{P}(1, 1) = 0$. The collection of dual vector spaces, $\bar{P}^* = \{\bar{P}(m, n)^*\}_{m,n\geq 1}$, is naturally a collection of $(\Sigma_m, \Sigma_n)$-bimodules with the transposed actions. We also set $\bar{P}^\vee = \{\bar{P}(m, n)^\vee := sgn_m \otimes \bar{P}(m, n)^* \otimes sgn_n\}$.

Let $P$ be a graded dioperad with zero differential. The cobar dual of $P$ is the dg dioperad $DP$ defined by
(i) as a dioperad of graded vector spaces, $DP = \Lambda^{-1} Free(\bar{P}^*[-1]) = Free(\Lambda^{-1} \bar{P}^*[-1])$;
(ii) as a complex, $DP$ is non-positively graded, $DP(m, n) = \bigoplus_{i=0}^{m+n-3} DP^{-i}(m, n)$, with the differential given by dualizations of the compositions ${}_\bullet\circ_\bullet$ and edge contractions [G, GiKa],

\[
\begin{array}{ccccccc}
DP^{3-m-n}(m, n) & \xrightarrow{\ d\ } & DP^{4-m-n}(m, n) & \xrightarrow{\ d\ } & DP^{5-m-n}(m, n) & \xrightarrow{\ d\ } \cdots \xrightarrow{\ d\ } & DP^{0}(m, n) \\
\| & & \| & & \| & & \| \\
\bar{P}^\vee(m, n) & \xrightarrow{\ d\ } & \displaystyle\bigoplus_{|edge(T)|=1} \bar{P}^* \otimes Det(T) & \xrightarrow{\ d\ } & \displaystyle\bigoplus_{|edge(T)|=2} \bar{P}^* \otimes Det(T) & \xrightarrow{\ d\ } \cdots \xrightarrow{\ d\ } & \displaystyle\bigoplus_{\text{trivalent trees } T} \bar{P}^* \otimes Det(T)
\end{array}
\]

where the sums are taken over $(m, n)$-trees.
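The grading in (ii) is governed by the number of internal edges: a corolla (no internal edges) sits in the lowest degree $3 - m - n$, and trivalent trees, which have the maximal number $m + n - 3$ of internal edges, sit in degree 0. A minimal sketch of this bookkeeping follows; the function name `cobar_degrees` is illustrative.

```python
def cobar_degrees(m, n):
    """List (degree, number of internal edges) for the complex DP(m, n).

    A summand indexed by (m, n)-trees with e internal edges is placed in
    degree e - (m + n - 3), so trivalent trees land in degree 0.
    """
    if m < 1 or n < 1 or m + n < 3:
        raise ValueError("need m, n >= 1 and m + n >= 3")
    top = m + n - 3
    return [(edges - top, edges) for edges in range(top + 1)]
```

For instance, $DP(2, 2)$ has only two terms, in degrees $-1$ (the corolla) and $0$ (trees with one internal edge).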


Remark 2.2. The vector space $DP$ is bigraded: one grading comes from the grading of $P$ as a vector space and another one from trees as in (ii) just above. The differential preserves the first grading and increases the second one by 1. The $\mathbb{Z}$-grading of $DP$ is always understood to be the associated total grading. In particular, $\deg_{DP} \bar{P}^\vee(m, n) = \deg_{Vect}(\bar{P}^\vee(m, n)[m + n - 3])$.

2.3. Koszul dioperads. A quadratic dioperad is a dioperad $P$ of the form

\[ P = \frac{Free(E)}{Ideal\langle R \rangle}, \]

where $E = \{E(m, n)\}$ is a collection of finite dimensional $(\Sigma_m, \Sigma_n)$-bimodules with $E(m, n) = 0$ for $(m, n) \neq (1, 2), (2, 1)$, and the $Ideal$ in $Free(E)$ is generated by a collection, $R$, of three sub-bimodules $R(1, 3) \subset Free(E)(1, 3)$, $R(3, 1) \subset Free(E)(3, 1)$ and $R(2, 2) \subset Free(E)(2, 2)$. The quadratic dual dioperad, $P^!$, is then defined by

\[ P^! = \frac{Free(E^\vee)}{Ideal\langle R^\perp \rangle}, \]

where $R^\perp$ is the collection of the three sub-bimodules $R^\perp(i, j) \subset Free(E^\vee)(i, j)$ which are annihilators of $R(i, j)$, $(i, j) = (1, 3), (2, 2), (3, 1)$.

Clearly, $DP^0 = Free(E^\vee)$ so that there is a natural epimorphism $DP^0 \to P^!$. Its kernel is precisely $\mathrm{Im}\, d(DP^{-1})$. Hence $H^0(DP) = P^!$. The quadratic dioperad $P$ is called Koszul if the above morphism is a quasi-isomorphism, i.e. $H^i(DP) = 0$ for all $i < 0$. In that case the dioperad $DP^!$ provides us with a minimal resolution of the dioperad $P$ and is often denoted by $P_\infty$. Algebras over $P_\infty$ are often called strong homotopy $P$-algebras; their most important property is that they can be transferred via quasi-isomorphisms of complexes [Mar2].

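2.4. Koszulness criterion. An $(m, n)$-tree $T$ is called reduced if each vertex has
• either an outgoing root or at least two outgoing internal edges, and/or
• either an incoming leaf or at least two incoming internal edges.

For a collection, $E = \{E(m, n)\}_{m,n\geq 1}$, of $(\Sigma_m, \Sigma_n)$-bimodules define another collection of $(\Sigma_m, \Sigma_n)$-bimodules by summing over reduced trees only,

\[ Free_{red}(E)(m, n) := \bigoplus_{\text{reduced } (m,n)\text{-trees } T} E(T). \]

Let $P$ be a quadratic dioperad, i.e. $P = Free(E)/Ideal\langle R \rangle$ for some generators $E = \{E(1, 2), E(2, 1)\}$ and relations $R = \{R(1, 3), R(2, 2), R(3, 1)\}$. With $P$ one can canonically associate two quadratic operads, $P_L$ and $P_R$, such that

\[ P_L = \frac{Free(E(1, 2))}{Ideal\langle R(1, 3) \rangle}, \qquad P_R = \left( \frac{Free(E(2, 1))}{Ideal\langle R(3, 1) \rangle} \right)^{op}. \]

Let us denote by $P_L \diamond P_R^{op}$ the collection of $(\Sigma_m, \Sigma_n)$-bimodules given by

\[ P_L \diamond P_R^{op}(m, n) := \begin{cases} P_L(1, n) & \text{if } m = 1,\ n \geq 1; \\ P_R^{op}(m, 1) & \text{if } n = 1,\ m \geq 1; \\ 0 & \text{otherwise.} \end{cases} \]

Theorem 2.4.1. [G, Mar1, MV]. A quadratic dioperad $P$ is Koszul if the operads $P_L$ and $P_R$ are Koszul and

\[ P(i, j) = Free_{red}(P_L \diamond P_R^{op})(i, j) \]

for $(i, j) = (1, 3), (2, 2), (3, 1)$. Moreover, in this case $P(m, n) = Free_{red}(P_L \diamond P_R^{op})(m, n)$ for all $m, n \geq 1$.

3. A Minimal Resolution of Lie$^1$Bi

First we present a graph description of the dioperad Lie$^1$Bi; it will pay off when we discuss Lie$^1$Bi$_\infty$. By definition (see Sect. 1.5), Lie$^1$Bi is a quadratic dioperad,

\[ \text{Lie}^1\text{Bi} = \frac{Free(E)}{Ideal\langle R \rangle}, \]

where
(i) $E(2, 1) := sgn_2 \otimes 1_1$ and $E(1, 2) := 1_1 \otimes 1_2[-1]$, where $1_n$ stands for the one dimensional trivial representation of $\Sigma_n$; let $\delta \in E(2, 1)$ and $[\,\bullet\,] \in E(1, 2)$ be basis vectors; we can represent both as directed (see footnote 3) plane corollas,

[diagram: $\delta$ is the corolla $\circ$ with one input leg labeled 1 and two output legs labeled 1, 2; $[\,\bullet\,]$ is the corolla $\bullet$ with two input legs labeled 1, 2 and one output leg labeled 1]

with the following symmetries: the corolla $\circ$ changes sign when its two output labels are interchanged, while the corolla $\bullet$ is unchanged when its two input labels are interchanged;
(ii) the relations $R$ are generated by the following elements:

[diagram: the sum over the three cyclic permutations of the output labels (1, 2, 3) of the two-vertex tree built from two corollas $\circ$, an element of $Free(E)(3, 1)$ encoding the co-Jacobi identity]

[diagram: the sum over the three cyclic permutations of the input labels (1, 2, 3) of the two-vertex tree built from two corollas $\bullet$, an element of $Free(E)(1, 3)$ encoding the Jacobi identity]

[diagram: a five-term signed combination of two-vertex trees built from one corolla $\circ$ and one corolla $\bullet$, an element of $Free(E)(2, 2)$ encoding the Leibniz type identity (iii) of Sect. 1.5]

Footnote 3. In all our graphs the direction of edges is chosen to go from the bottom to the top.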


Proposition 3.1. Lie$^1$Bi is Koszul.

Proof. We have Lie$^1$Bi$_L = Lie \otimes \{1\}$ and Lie$^1$Bi$_R = Lie$, where $Lie$ stands for the operad of Lie algebras and $\{m\} := \{\{m\}(n) := sgn_n^{\otimes m}[m(n - 1)]\}_{n\geq 1}$ for the endomorphism operad of $k[-m]$. As $Lie$ is Koszul [GiKa], both the operads Lie$^1$Bi$_L$ and Lie$^1$Bi$_R$ are Koszul as well. Next, a straightforward analysis of all calculational schemes in Lie$^1$Bi represented by directed trivalent $(i, j)$-trees with $i + j = 5$ shows that they generate no new relations, so that Lie$^1$Bi$(i, j)$ coincides with the value at $(i, j)$ of the sum over reduced trees of Sect. 2.4 applied to the collection built from $Lie \otimes \{1\}$ and $Lie^{op}$, for $(i, j) = (1, 3), (2, 2), (3, 1)$. Hence by Theorem 2.4.1, the dioperad Lie$^1$Bi is Koszul.

Proposition 1.5.1 is a straightforward corollary of the following

Theorem 3.2. The minimal resolution, Lie$^1$Bi$_\infty$, of the dioperad Lie$^1$Bi can be described as follows:
(i) As a dioperad of graded vector spaces, Lie$^1$Bi$_\infty = Free(E)$, where the collection, $E = \{E(m, n)\}$, of one dimensional $(\Sigma_m, \Sigma_n)$-modules is given by

\[ E(m, n) := \begin{cases} sgn_m \otimes 1_n[m - 2] & \text{if } m + n \geq 3; \\ 0 & \text{otherwise.} \end{cases} \]

(ii) If we represent a basis element of $E(m, n)$ by the unique (up to a sign) planar $(m, n)$-corolla,

[diagram: a corolla with $m$ output legs labeled $1, 2, \dots, m$ and $n$ input legs labeled $1, 2, \dots, n$]

with skew-symmetric outgoing legs and symmetric ingoing legs, then the differential $d$ is given on generators by

[diagram: $d$ applied to the $(m, n)$-corolla is a signed sum of two-vertex trees with a single internal edge, indexed by decompositions $I_1 \sqcup I_2 = (1, \dots, m)$ of the output legs and $J_1 \sqcup J_2 = (1, \dots, n)$ of the input legs]

where $\sigma(I_1 \sqcup I_2)$ is the sign of the shuffle $I_1 \sqcup I_2 = (1, \dots, m)$.

Proof. Claim (i) follows from the fact that $(\text{Lie}^1\text{Bi})^!(m, n) = 1_m \otimes sgn_n[1 - n]$ and Remark 2.2. Claim (ii) is a straightforward though tedious graph translation of the initial term,

\[ ((\text{Lie}^1\text{Bi})^!)^\vee(m, n) \xrightarrow{\ d\ } \bigoplus_{\substack{(m,n)\text{-trees } T \\ |edge(T)|=1}} ((\text{Lie}^1\text{Bi})^!)^* \otimes Det(T), \]

of Definition 2.1 of the differential $d$ in $D(\text{Lie}^1\text{Bi})^!$.


3.3. A geometric model for Lie$^1$Bi$_\infty$ structures. Let $V$ be a finite dimensional graded vector space. Then the graded formal manifold, $\mathcal{M}$, modeled on the infinitesimal neighbourhood of 0 in the vector space $V \oplus V^*[1]$ has an odd symplectic form $\omega$ induced from the natural pairing $V \otimes V^*[1] \to k[1]$. In particular, the graded structure sheaf $O_\mathcal{M}$ on $\mathcal{M}$ has a degree $-1$ Poisson bracket, $\{\ \bullet\ \}$, such that $\{f \bullet g\} = (-1)^{|f||g|+|f|+|g|} \{g \bullet f\}$ and the Jacobi identity is satisfied. The odd symplectic manifold $(\mathcal{M}, \omega)$ has two particular Lagrangian submanifolds, $L' \subset \mathcal{M}$ and $L'' \subset \mathcal{M}$, associated with, respectively, the subspaces $0 \oplus V^*[1] \subset V \oplus V^*[1]$ and $V \oplus 0 \subset V \oplus V^*[1]$.

Proposition 3.3.1. A Lie$^1$Bi$_\infty$ algebra structure in a graded vector space $V$ is the same as a degree two smooth function $\Gamma \in O_\mathcal{M}$ vanishing on $L' \cup L''$ and satisfying the equation $\{\Gamma \bullet \Gamma\} = 0$.

Proof. The manifold $\mathcal{M}$ is isomorphic to the total space of the shifted cotangent bundle, $T^*_M[1]$, of the manifold $M$ of Proposition 1.5.1. Hence smooth functions on $\mathcal{M}$ are the same as smooth polyvector fields on $M$, and the Poisson bracket $\{\ \bullet\ \}$ on $\mathcal{M}$ is the same as the Schouten bracket on $M$.

3.4. Lie$^1$Bi$_\infty$ morphisms. Let $(V, \{\mu_{m,n}\})$ and $(V', \{\mu'_{m,n}\})$ be two Lie$^1$Bi$_\infty$ algebras, with associated geometric data $(\mathcal{M}, \omega, \Gamma)$ and $(\mathcal{M}', \omega', \Gamma')$.

Definition 3.4.1. A Lie$^1$Bi$_\infty$ morphism $F : V \to V'$ is, by definition, a symplectomorphism, $F : (\mathcal{M}, \omega) \to (\mathcal{M}', \omega')$, which maps the Lagrangian submanifolds $L'$ and $L''$ of $\mathcal{M}$ into the corresponding Lagrangian submanifolds of $\mathcal{M}'$ and satisfies $F^* \Gamma' = \Gamma$.

Thus a Lie$^1$Bi$_\infty$ morphism $F : V \to V'$ is a pair of collections of linear maps,

\[ \{F_{m,n} : \odot^m V \otimes \wedge^n V^* \to V'[-n]\}_{m\geq 1, n\geq 0}, \qquad \{\bar{F}_{m,n} : \odot^m V \otimes \wedge^n V^* \to V'^*[1 - n]\}_{n\geq 0, m\geq 1} \qquad (\star) \]

satisfying the system of equations, $F^*(\omega') = \omega$ and $F^* \Gamma' = \Gamma$. In particular, the equation $F^* \Gamma' = \Gamma$ says that the linear maps,

\[ F_{1,0} : (V, \mu_{1,1}) \to (V', \mu'_{1,1}) \quad \text{and} \quad \bar{F}_{0,1} : (V^*, \mu^*_{1,1}) \to (V'^*, \mu'^*_{1,1}), \]

are morphisms of complexes, while the equation $F^*(\omega') = \omega$ says that the composition,

\[ F^*_{0,1} \circ \bar{F}_{1,0} : V \to V, \]

is the identity map.

Definition 3.4.2. A Lie$^1$Bi$_\infty$-morphism $F : V \to V'$ is called a quasi-isomorphism if the morphisms of complexes

\[ F_{1,0} : (V, \mu_{1,1}) \to (V', \mu'_{1,1}) \quad \text{and} \quad \bar{F}_{0,1} : (V^*, \mu^*_{1,1}) \to (V'^*, \mu'^*_{1,1}) \]

induce isomorphisms in cohomology.


Remark 3.4.3. One might get the impression that the notions introduced above make sense only for finite dimensional Lie$^1$Bi$_\infty$ algebras. However, this is no more than an artifact of the geometric intuition we tried to rely on in our definitions. In fact, everything above (and below) makes sense for infinite dimensional representations as well. For example, one can replace $(\star)$ by

\[ \{F_{m,n} : \odot^m V \to \wedge^n V \otimes V'[-n]\}_{m\geq 1, n\geq 0}, \qquad \{\bar{F}_{m,n} : \odot^m V \otimes V'[n - 1] \to \wedge^n V\}_{n\geq 0, m\geq 1}, \]

and reinterpret the equations defining the Lie$^1$Bi$_\infty$ morphism accordingly. For example, with this reinterpretation it is the morphism $F_{1,0} \circ \bar{F}_{0,1}$ which is the identity map.

3.4.4. Contractible and minimal Lie$^1$Bi$_\infty$-structures. Let $V$ be a graded vector space and $(\mathcal{M} = V \oplus V^*[1], \omega)$ the associated odd symplectic manifold (as in Sect. 3.3). There is a one-to-one correspondence between differentials, $d : V \to V$, and quadratic degree 2 functions, $\Gamma_{quad}$, on $(\mathcal{M}, \omega)$, vanishing on $L' \cup L''$ and satisfying $[\Gamma_{quad} \bullet \Gamma_{quad}] = 0$. If $H^*(V, d) = 0$, the associated data, $(\mathcal{M}, \omega, \Gamma_{quad})$, is called a contractible Lie$^1$Bi$_\infty$ structure on $V$.

A Lie$^1$Bi$_\infty$ structure, $(\mathcal{M}, \omega, \Gamma)$, on $V$ is called minimal if $\Gamma = 0 \bmod I^3$, where $I$ is the ideal of the distinguished point, $L' \cap L''$, in $\mathcal{M}$. Put another way, the formal power series $\Gamma$ in some (and hence any) coordinate system on $\mathcal{M}$ begins with cubic terms at least.

Theorem 3.4.5. (Homotopy classification of Lie$^1$Bi$_\infty$-structures, cf. [Ko1, Ko2]). Each Lie$^1$Bi$_\infty$ algebra is isomorphic to the tensor product of a contractible Lie$^1$Bi$_\infty$ algebra and a minimal one.

Proof. Let $(\mathcal{M}, \omega, \Gamma)$ be the geometric equivalent of any given Lie$^1$Bi$_\infty$ algebra. To prove the statement we have to construct coordinates, $(x^a, y^a, z^\alpha, \psi_a, \phi_a, \xi_\alpha)$, on $\mathcal{M}$ such that
(i) $\omega = \underbrace{\sum_a (dx^a \wedge d\psi_a + dy^a \wedge d\phi_a)}_{\omega_1} + \underbrace{\sum_\alpha dz^\alpha \wedge d\xi_\alpha}_{\omega_2}$,
(ii) $L'$ is given by $x^a = y^a = z^\alpha = 0$ while $L''$ is given by $\psi_a = \phi_a = \xi_\alpha = 0$,
(iii) $\Gamma = \underbrace{\sum_a y^a \psi_a}_{\Gamma_1} + \Gamma_2(z^\alpha, \xi_\alpha)$ for some formal power series $\Gamma_2(z^\alpha, \xi_\alpha)$ which begins with cubic terms at least.

For then $(\mathcal{M}, \omega, \Gamma) \simeq (\mathcal{M}_1, \omega_1, \Gamma_1) \times (\mathcal{M}_2, \omega_2, \Gamma_2)$ with the first factor being a contractible Lie$^1$Bi$_\infty$ structure while the second factor is a minimal one.

We shall establish existence of the above coordinates by induction. As the first step of the induction procedure we choose arbitrary linear coordinates, $\{t^A\}$, $A \in \{1, \dots, \dim V\}$, on $V$ and the associated dual coordinates, $\{\chi_A\}$, $|\chi_A| = -|t^A| + 1$, on $V^*[1]$. The odd symplectic form is given in these coordinates as $\omega = \sum_A dt^A \wedge d\chi_A$, $L'$ is given by $t^A = 0$ while $L''$ is given by $\chi_A = 0$. Then,

\[ \Gamma = \sum_{A,B} C_A^B\, t^A \chi_B \mod I^3, \]

for some constants $C_A^B$ which are nothing but the coefficients of the differential, $d : V \to V$, associated to the quadratic bit of $\Gamma$. As we work over a field of characteristic zero, we can choose a cohomological decomposition of $V$ with respect to this differential, $V = H(V, d) \oplus B \oplus B[-1]$, so that $d$ vanishes on $H(V, d)$ and $B[-1]$ and, on the remaining summand $B$, it is equal to the natural isomorphism $B \to B[-1]$. Let $\{z^\alpha\}$ be some linear coordinates in $H(V, d)$, $\{y^a\}$ linear coordinates on $B$ and $\{x^a\}$, $|x^a| = |y^a| - 1$, the associated (via the natural isomorphism) linear coordinates in $B[-1]$. Denote by $(\xi_\alpha, \psi_a, \phi_a)$ the coordinates on $V^*[1]$ dual to $(z^\alpha, x^a, y^a)$. In the resulting coordinate system on $\mathcal{M}$ the conditions (i)–(ii) are satisfied, while the condition (iii) is satisfied modulo $I^3$.

Assume by induction that we have constructed a coordinate system on $\mathcal{M}$ in which conditions (i)–(iii) are satisfied mod $I^{N+1}$. Then we have,

\[ \Gamma = \sum_a y^a \psi_a + \Gamma_{\leq N}(z^\alpha, \xi_\alpha) + \Gamma_{N+1}(x, y, z, \psi, \phi, \xi) \mod I^{N+2}, \]

where $\Gamma_{\leq N}$ is a sum of polynomials of degrees from 3 to $N$ and $\Gamma_{N+1}$ is a polynomial of degree $N + 1$. The equation $[\Gamma \bullet \Gamma] = 0 \bmod I^{N+2}$ implies $\delta \Gamma_{N+1} = 0$, where $\delta$ is the following differential on $O_\mathcal{M}$,

\[ \delta : O_\mathcal{M} \to O_\mathcal{M}, \qquad f \mapsto \Big[ \sum_a y^a \psi_a \bullet f \Big]. \]

Let $B \in O_\mathcal{M}$ be an arbitrary polynomial of degree $N + 1$ with $|B| = 1$. It gives rise to a symplectomorphism, $F : \mathcal{M} \to \mathcal{M}$, given as $\exp v_B$ with the vector field $v_B$ defined by

\[ v_B : O_\mathcal{M} \to O_\mathcal{M}, \qquad f \mapsto [B \bullet f]. \]

One has,

\[ F^* \Gamma = \sum_a y^a \psi_a + \Gamma_{\leq N}(z^\alpha, \xi_\alpha) + \Gamma_{N+1} + \delta B \mod I^{N+2}. \]

Thus $\Gamma_{N+1}(x, y, z, \psi, \phi, \xi)$ is a $\delta$-cycle defined up to a $\delta$-coboundary. As the cohomology of $\delta$ in $O_\mathcal{M}$ is equal to $k[[z^\alpha, \xi_\alpha]]$, one can always find $\Gamma_{N+1}$ such that it is a function of $\{z^\alpha, \xi_\alpha\}$ only.

Corollary 3.4.6. If $F : V \to V'$ is a Lie$^1$Bi$_\infty$ quasi-isomorphism, then there exists a Lie$^1$Bi$_\infty$ quasi-isomorphism $G : V' \to V$ such that on the cohomology level $[F_{1,0}] = [\bar{G}_{0,1}]^*$ and $[G_{1,0}] = [\bar{F}_{0,1}]^*$.

Proof. The proof is exactly the same as the proof of the analogous statement for $L_\infty$ algebras in [Ko1].
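The cohomological decomposition $V = H(V, d) \oplus B \oplus B[-1]$ used in the proof of Theorem 3.4.5 can be illustrated numerically. The following is a minimal sketch, assuming the grading is forgotten and $d$ is given as a square matrix with $d^2 = 0$; then $\dim B = \dim B[-1] = \mathrm{rank}\, d$ and $\dim H = \dim V - 2\, \mathrm{rank}\, d$. The helper names are illustrative.

```python
from fractions import Fraction

def mat_mul(a, b):
    """Product of two matrices given as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)]
            for row in a]

def rank(matrix):
    """Rank over the rationals by Gauss-Jordan elimination."""
    rows = [[Fraction(x) for x in row] for row in matrix]
    r = 0
    ncols = len(rows[0]) if rows else 0
    for col in range(ncols):
        pivot = next((i for i in range(r, len(rows)) if rows[i][col] != 0), None)
        if pivot is None:
            continue
        rows[r], rows[pivot] = rows[pivot], rows[r]
        for i in range(len(rows)):
            if i != r and rows[i][col] != 0:
                factor = rows[i][col] / rows[r][col]
                rows[i] = [x - factor * y for x, y in zip(rows[i], rows[r])]
        r += 1
    return r

def decomposition_dims(d):
    """Dimensions of H, B and B[-1] for a square matrix d with d*d = 0."""
    n = len(d)
    if any(len(row) != n for row in d):
        raise ValueError("d must be a square matrix")
    if any(entry != 0 for row in mat_mul(d, d) for entry in row):
        raise ValueError("d is not a differential: d*d != 0")
    r = rank(d)
    return {"H": n - 2 * r, "B": r, "B[-1]": r}
```

In the notation of the proof, the $z^\alpha$ coordinates are as many as `dims["H"]`, and the $x^a$ and $y^a$ coordinates are as many as `dims["B"]` each; a contractible structure corresponds to `dims["H"] == 0`.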


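4. Minimal Resolution of the Operad TF

By definition (see Sect. 1.4), TF is a quadratic dioperad

\[ \text{TF} = \frac{Free(E)}{Ideal\langle R \rangle}, \]

where
(i) $E(2, 1) := k[\Sigma_2] \otimes 1_1$ and $E(1, 2) := 1_1 \otimes 1_2[-1]$; we represent two basis vectors of $k[\Sigma_2] \otimes 1_1$ by planar corollas

[diagram: two corollas $\circ$ with one input leg labeled 1 and two output legs, labeled 1, 2 in the first corolla and 2, 1 in the second]

and a basis vector of $E(1, 2)$ by the symmetric corolla,

[diagram: the corolla $\bullet$ with one output leg labeled 1 and two input legs labeled 1, 2, unchanged when the input labels are interchanged]

(ii) the relations $R$ are generated by the following elements:

[diagram: the sum over the three cyclic permutations of the input labels (1, 2, 3) of the two-vertex tree built from two corollas $\bullet$, an element of $Free(E)(1, 3)$ encoding the Jacobi identity]

[diagram: the two-vertex tree with $\bullet$ below $\circ$ minus four two-vertex trees with $\circ$ below $\bullet$, an element of $Free(E)(2, 2)$ encoding the Leibniz type identity (ii) of Sect. 1.4]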

Proposition 1.4.1 follows immediately from the following

Proposition 4.1. The minimal resolution, TF$_\infty$, of the dioperad TF can be described as follows:
(i) As a dioperad of graded vector spaces, TF$_\infty = Free(E)$, where the collection, $E = \{E(m, n)\}$, of $(\Sigma_m, \Sigma_n)$-modules is given by

\[ E(m, n) := \begin{cases} k[\Sigma_2] \otimes 1_n & \text{if } m = 2,\ n \geq 1; \\ 1_n[-1] & \text{if } m = 1,\ n \geq 2; \\ 0 & \text{otherwise.} \end{cases} \]

(ii) If we represent two basis elements of $E(2, n)$ by planar $(2, n)$-corollas, and the basis element of $E(1, n)$ by a planar $(1, n)$-corolla,

[diagram: two corollas $\circ$ with $n$ input legs labeled $1, 2, \dots, n$ and two output legs, labeled 1, 2 and 2, 1 respectively, and one corolla $\bullet$ with $n$ input legs labeled $1, 2, \dots, n$ and one output leg labeled 1]

with symmetric ingoing legs, then the differential $d$ is given on generators by,

d

•
1 2

n−1

kk•H55H kkk 55HHH k k = s•
J1 1<

1<

<< 2 <
n−1

2

<<