This page intentionally left blank
The Phases of Quantum Chromodynamics This book discusses the physical phases of qu...
53 downloads
802 Views
2MB Size
Report
This content was uploaded by our users and we assume good faith they have the permission to share this book. If you own the copyright to this book and it is wrongfully on our website, we offer a simple DMCA procedure to remove your content from our site. Start by pressing the button below!
Report copyright / DMCA form
This page intentionally left blank
The Phases of Quantum Chromodynamics This book discusses the physical phases of quantum chromodynamics (QCD) in ordinary as well as in extreme environments of high temperatures and high baryon number. A major theme of this book is the idea that, to understand the dynamics of QCD in ordinary circumstances, one needs to master them in extreme environments. QCD is thought to be characterized in ordinary circumstances by quark confinement through the formation of flux tubes and chiral-symmetry breaking. These properties are believed to be lost in environments of extreme conditions and new phases, the quark–gluon plasma and color superconductivity, are thought to exist. The book is aimed at graduate students and researchers entering the fields of lattice-gauge theory, heavy-ion collisions, nuclear theory, and high-energy phenomenology, as well as astrophysicists interested in the phases of nuclear matter and their impact on our current ideas of the interiors of dense stars. It is suitable for use as a textbook on lattice-gauge theory, effective Lagrangians, and field-theoretical modeling for nonperturbative phenomena in QCD. J O H N K O G U T obtained his Ph.D. from Stanford University in 1971. He is a Fellow in the American Physical Society and has held Guggenheim (1987–8) and Sloan Foundation (1976–8) Fellowships. He is co-author of over 200 original papers in elementary-particle, high-energy, and condensed-matter physics, and is the author of one previous book. He is currently Professor of Physics at the University of Illinois, Urbana-Champaign. M I K H A I L S T E P H A N O V obtained his Ph.D. from the University of Oxford in 1994. He was a Postdoctoral Research Associate at the University of Illinois, Urbana-Champaign from 1994 to 1997 and then spent two years at the State University of New York at Stony Brook. He is at present Associate Professor at the University of Illinois at Chicago and RHIC Fellow at the RIKEN-BNL Center. He has published over 50 original papers on particle and nuclear physics, lattice quantumfield theory, matter under extreme conditions, and heavy-ion collisions. His research is supported by a DOE Outstanding Junior Investigator Award, as well as by an Alfred P. Sloan Foundation Fellowship.
CAMBRIDGE MONOGRAPHS ON PARTICLE PHYSICS NUCLEAR PHYSICS AND COSMOLOGY 21 General Editors: T. Ericson, P. V. Landshoff
1. K. Winter (ed.): Neutrino Physics 2. J. F. Donoghue, E. Golowich and B. R. Holstein: Dynamics of the Standard Model 3. E. Leader and E. Predazzi: An Introduction to Gauge Theories and Modern Particle Physics, Volume 1: Electroweak Interactions, the ‘New Particles’ and the Parton Model 4. E. Leader and E. Predazzi: An Introduction to Gauge Theories and Modern Particle Physics, Volume 2: CP-Violation, QCD and Hard Processes 5. C. Grupen: Particle Detectors 6. H. Grosse and A. Martin: Particle Physics and the Schr¨odinger Equation 7. B. Andersson: The Lund Model 8. R. K. Ellis, W. J. Stirling and B. R. Webber: QCD and Collider Physics 9. I. I. Bigi and A. I. Sanda: CP Violation 10. A. V. Manohar andd M. B. Wise: Heavy Quark Physics 11. R. K. Bock, H. Grote, R. Fr¨uhwirth and M. Regler: Data Analysis Techniques for High-Energy Physics, Second edition 12. D. Green: The Physics of Particle Detectors 13. V. N. Gribov and J. Nyiri: Quantum Electrodynamics 14. K. Winter (ed.): Neutrino Physics, Second edition 15. E. Leader: Spin in Particle Physics 16. J. D. Walecka: Electron Scattering for Nuclear and Nucleon Structure 17. S. Narison: QCD as a Theory of Hadrons 18. J. F. Letessier and J. Rafelski: Hadrons and Quark–Gluon Plasma 19. A. Donnachie, H. G. Dosch, P. V. Landshoff and O. Nachtmann: Pomeron Physics and QCD 20. A. Hofmann: The Physics of Synchrotron Radiation 21. J. B. Kogut and M. A. Stephanov: The Phases of Quantum Chromodynamics
The Phases of Quantum Chromodynamics: From Confinement to Extreme Environments
JOHN B. KOGUT University of Illinois, Urbana-Champaign
MIKHAIL A. STEPHANOV University of Illinois, Chicago
Cambridge, New York, Melbourne, Madrid, Cape Town, Singapore, São Paulo Cambridge University Press The Edinburgh Building, Cambridge , United Kingdom Published in the United States of America by Cambridge University Press, New York www.cambridge.org Information on this title: www.cambridge.org/9780521804509 © John B. Kogut & Mikhail A. Stephanov 2004 This book is in copyright. Subject to statutory exception and to the provision of relevant collective licensing agreements, no reproduction of any part may take place without the written permission of Cambridge University Press. First published in print format 2003 - isbn-13 978-0-511-06371-8 eBook (NetLibrary) - isbn-10 0-511-06371-7 eBook (NetLibrary) - isbn-13 978-0-521-80450-9 hardback - isbn-10 0-521-80450-7 hardback
Cambridge University Press has no responsibility for the persistence or accuracy of s for external or third-party internet websites referred to in this book, and does not guarantee that any content on such websites is, or will remain, accurate or appropriate.
Contents
1
Introduction
2 2.1 2.2 2.3 2.4 2.5 2.6 2.7 2.8 2.9
Background in spin systems and critical phenomena Notation and definitions and critical indices Correlation-length scaling and universality classes Properties of the Ising model The Kosterlitz–Thouless model Coulomb gas, duality maps, and the phases of the planar model Asymptotic freedom in two-dimensional spin systems Instantons in two-dimensional spin systems Computer experiments and simulation methods The transfer matrix in field theory and statistical physics
10 10 14 17 19 24 30 36 37 41
3 3.1
Gauge fields on a four-dimensional euclidean lattice Lattice formulation, local gauge invariance, and the continuum action Confinement and the strong-coupling limit Confinement mechanisms in two and four dimensions: vortex and monopole condensation
53
3.2 3.3 4 4.1 4.2 4.3 4.4
1
Fermions and nonperturbative dynamics in QCD Asymptotic freedom and the continuum limit Axial symmetries and the vacuum of QCD Two-dimensional fermionic models of confinement, axial symmetries, and θ vacua Instantons and the scales of QCD
v
53 57 63 74 74 76 77 84
vi
Contents
5 5.1 5.2 5.3 5.4 5.5 5.6 5.7 5.8 5.9 5.10
Lattice fermions and chiral symmetry Free fermions on the lattice in one and two dimensions Fermions and bosons on Euclidean lattices Staggered Euclidean fermions Block derivatives and axial symmetries Staggered fermions and remnants of chiral symmetry Exact chiral symmetry on the lattice Chiral-symmetry breaking on the lattice Simulating dynamical fermions in lattice-gauge theory The microcanonical ensemble and molecular dynamics Langevin and hybrid algorithms
93 93 101 104 107 109 111 117 126 127 132
6 6.1 6.2
The Hamiltonian version of lattice-gauge theory Continuous time and discrete space Quark confinement in Hamiltonian lattice-gauge theory and thin strings Relativistic thin strings, delocalization, and Casimir forces Roughening and the restoration of spatial symmetries
136 136
6.3 6.4 7 7.1 7.2 7.3 7.4 7.5 7.6 7.7 8
Phase transitions in lattice-gauge theory at high temperatures Finite-temperature transitions at strong coupling Simulations at nonzero temperature Pure gauge-field simulations at nonzero temperature Restoration of chiral symmetry and high temperature Hadronic screening lengths Thermal dilepton rates and experimental signatures for the quark–gluon plasma A tour of the three-flavor QCD phase diagram
8.3 8.4 8.5
Physics of QCD at high temperatures and chemical potentials The thermodynamic background Hadron phenomenology and simple models of the transition to the quark–gluon plasma A tour of the T –µ phase diagram The quark–gluon plasma and the energy scales of QCD The extreme environment at a relativistic heavy-ion collider
9 9.1 9.2
Large chemical potentials and color superconductivity Color superconductivity and color–flavor locking Calculating the gap at asymptotically large µ
8.1 8.2
145 146 150 158 158 162 165 170 172 174 176 182 182 202 207 223 226 236 236 239
Contents
vii
9.3 9.4
Lowest excitations of the CFL phase Comments and some further developments
260 278
10
Effective Lagrangians and models of QCD at nonzero chemical potential QCD at finite µ and the sign problem The random-matrix model of QCD Two-color QCD and effective Lagrangians QCD at nonzero isospin chemical potential Pion propagation near and below Tc
280 280 282 295 307 316
10.1 10.2 10.3 10.4 10.5
11 Lattice-gauge theory at nonzero chemical potential 11.1 Propagators and formulating the chemical potential on a Euclidean lattice 11.2 Naive fermions at finite density 11.3 The three-dimensional four-Fermi model at nonzero T and µ 11.4 Four-flavor SU(2) lattice-gauge theory at nonzero µ and T 11.5 High-density QCD and static quarks 11.6 The Glasgow algorithm 11.7 The Fodor–Katz method for high T, low µ 11.8 QCD at complex chemical potential
324
12
Epilogue
355
References
357
Index
363
324 326 330 335 345 347 349 350
1 Introduction
The phases of QCD have presented a challenge to theoretical physics for many years. Originally it was considered folly to think about topics such as confinement, chiral-symmetry breaking, and different states of matter in relativistic field theories. Anything beyond traditional perturbation theory was met with skepticism. However, the development of semi-classical methods in field theory, the lattice-gauge formulation of the subject, and the generalizations of analytic methods of analysis from two-dimensional systems, such as duality and transitions driven by topological excitations, have changed all that. This book is a look at the fundamentals of QCD from this perspective. After introducing lattice-gauge theory, beginning with fundamentals and reaching important recent developments, it will emphasize the application of QCD to the study of matter in extreme environments. Effective Lagrangians, which incorporate the constraints of low-energy dynamics and their symmetry realizations, will also be developed to provide a complementary, and insightful, perspective. Application of perturbative methods will also be presented in the regime of their validity. Why extreme environments? A major theme of this book is the idea that, to understand the dynamics of QCD in ordinary circumstances, one needs to master them in extreme environments. For example, to appreciate confinement, one can heat the system until the thermal fluctuations prevent the formation of the thin flux tubes which give rise to a linearly confining potential. In doing so, the special ingredients in the theory’s dynamics which favor a vacuum pressure that squeezes flux into thin continuous tubes are emphasized. In addition, the essential ingredient of Gauss’ law and exact color gauge symmetry will be seen to lie at the heart of the confining features of ordinary QCD. Another example, which is central to this book, is afforded by chiralsymmetry breaking, the fact that the theory with massless quarks develops 1
2
Introduction
dynamical constituent quark masses consistent with its global chiral symmetries. Again, in ordinary environments quark–anti-quark attractive forces, generated by mechanisms such as perturbative gluon exchanges and nonperturbative instanton forces and even those of flux tubes and confinement, favor the appearance of a condensate of quark–anti-quark pairs in the system’s vacuum. This condensate then gives the vacuum an indefinite chirality, which supports dynamically generated masses for its constituents as well as its colorless physical states. When such a vacuum is heated, the violent thermal fluctuations will melt the condensate and lead to restoration of chiral symmetry. The corresponding thermodynamic state, or thermal vacuum, however, is anything but simple. In this hot environment affecting the theory of massless quarks and gluons, we expect plasma formation and screening of color. The hot theory is qualitatively different from its cold relative: color screening replaces color confinement, massless quarks and gluons replace dynamical symmetry breaking, and a rich mass spectrum of colorless states with a large level spacing is replaced by a spectrum of screened states with a smaller level spacing. Another extreme environment that lies on the experimental horizon is that of cold, dense matter. An example is afforded by the interior of a neutron star, where the baryon number density is far higher than that of ordinary nuclear matter. This environment is not well understood in theories like QCD, for reasons which will be discussed at length within this book. Since the discovery of asymptotic freedom in theories like QCD, it has been realized that matter should become weakly interacting if the Fermi energies of the quarks, controlled by the chemical potential µ, are large in comparison with the confinement scale, which is of the order of a few hundred MeV (or 1 fm−1 ). It has only recently fully been appreciated that even this weakly coupled state is rich in intricate phenomena due to pairing instabilities similar to the phenomenon of superconductivity in metals. Reliable calculations are possible in this regime controlled by the smallness of the gauge coupling of strong interactions. However, this control is very quickly lost once the densities decrease, and weak-coupling approximations are not justified. From the lower-density side, we can reliably predict that, as the chemical potential of the baryon charge approaches the mass of the lightest baryon, MN (i.e., the quark chemical potential approaches MN /3), a transition to a new state of matter occurs, since it becomes energetically favorable for the ground state to contain baryons. We also have empirical evidence leading us to believe that this state is the state of nuclear matter, chunks of which form nuclei of heavy elements, and that astronomically large chunks, held together by gravity, form neutron stars. But how does this matter, which we know is not very dense on the scale of QCD (one baryon per 6 fm3 ) transform into asymptotically free quark matter, as the density, or µ, increases? What happens to the baryons, whose very existence seems doubtful in such dense quark matter? Is there a transition, and, if there is, at what µ and of what type? A hypothetical peek into
Introduction
3
the interior of a neutron star would reveal the answers. Such questions are still in the realm of speculation in QCD research at this time. One of the goals of this book is to provide the tools and background for a new generation of researchers to study this problem. The challenge of QCD in cold, dense environments is a major theme of this book. We believe that there is a new field of science awaiting us there. It is the condensed matter or even the chemistry of QCD. The point is that, as cold QCD is subjected to a chemical potential, there will be the possibility of the creation of new states of extended QCD matter, analogous to the creation of new states of molecular matter in atomic physics. The phases of molecular matter are extraordinarily rich because molecules themselves have intricate threedimensional symmetries and can interact among themselves via cooperative phenomena like those involving induced dipole–dipole forces, which generate energy scales several orders of magnitude smaller than the fundamental atomic energy scale, the Rydberg. These effects conspire to generate phase diagrams of molecules that are very complex and interesting, and determine the character of our environment. In the case of QCD, the quarks and gluons have internal symmetries described by their color and flavor indices. There are small symmetry breakings arising from the bare mass splittings between the quarks. There is a rich physical mass spectrum of colorless states with a level spacing of several hundred MeV. How will such a physical system respond to a chemical potential comparable to its level spacing? Such an environment will encourage the production of new, nontrivially ordered states of fundamental matter. Exotic phases consisting of meson condensates, superfluids, crystals, etc. have been proposed. From this point of view, it would be naive to expect reliable predictions here very soon, since such states of matter and their stability will depend on grasping the dynamics of QCD to within an accuracy of a few MeV. A goal, ambitious at that, for this book, is to encourage a new generation of physicists to tackle this new field of the extremely condensed matter, or chemistry, of QCD, which may very well determine our environment on the cosmic scale. Why is so little known about QCD at nonzero µ and what are the present speculations about new states of matter at vanishing temperature and nonvanishing chemical potential? Lattice-gauge theory has made little progress on this subject because its action for the SU(3) color group in the presence of a nonzero baryon chemical potential becomes complex and not amenable to reliable methods of analysis, such as computer simulations. This problem will be dealt with at length in the text. We will review the random-matrix analysis of the failure of the quenched “approximation” of QCD to model it adequately at nonzero chemical potential. We will also see that models with positive fermion determinants can be analyzed by lattice and effective-Lagrangian methods. In fact the SU(2) color case can be studied in detail and new states of strongly interacting superfluids are predicted.
4
Introduction
QCD of three colors, but with the chemical potential for the isospin charge I3 , is very similar to the SU(2) color case (and in fact also similar to the quenched approximation to QCD – as we shall see using the random-matrix approach). The pions condense into a pion superfluid at an easily predictable critical chemical potential. We will also view the SU(3) theory in an environment where the chemical potential is asymptotically large and there is a huge Fermi sphere of weakly interacting colorful quarks. In this environment, which is amenable to the weak-coupling approach, one can identify a QCD analog of Bardeen–Cooper– Schriefer (BCS) fermion pairing and superconductivity – color superconductivity. This QCD phenomenon is much richer than its condensed-matter (QED) counterpart. It also has much more integrity in it, since in QED the dominant forces between electrons are repulsive and an intricate phonon-mediated interaction is needed in order to provide the necessary attraction. One needs more than just electrons and photons for traditional BCS fermion pairing. In QCD quarks and gluons and their mutual interactions are the only necessary ingredients. The attraction provided by gluons is long-ranged (on the typical scale of 1/µ in such quark matter), thus providing a stronger effect than one could expect from a short-ranged attraction. The phenomenon of color superconductivity is also richer than its QED counterpart due to there being a larger set of global symmetries, the chiral symmetries of QCD. An especially interesting theoretical case is the limit in which all three quarks are light compared with their chemical potential µ. In this regime the BCS instability leads to breaking of global (axial SU(3) and baryon-number U(1)), as well as local (color) symmetries. This phenomenon known as color– flavor locking (CFL) is an interesting example of a mechanism by which chiral symmetry can be broken spontaneously within the domain of weak-coupling perturbation theory. Since the pattern of breaking in CFL is somewhat similar to the vacuum of QCD, the lowest excitations – Goldstone bosons – are also similar, since their existence is dictated by the symmetries of the theory and the Goldstone theorem. What is new in CFL is that the properties of the Goldstone particles are calculable. Imagine calculating the pion decay constant from the first principles in QCD. This turns out to be possible in the CFL phase! The high-temperature and high-chemical-potential environments will emphasize the property of asymptotic freedom of QCD, the fact that it is weakly coupled and perturbative at short distances while being strongly coupled and full of interesting nonperturbative symmetry realizations at large distances. One of the goals of lattice-gauge theory is to attain an understanding, both theoretical and numerical, of the physics on the multiple length scales of QCD: weak-coupling perturbative behavior at short distances, semi-classical nonperturbative behavior at intermediate distances, and strong-coupling confinement and color screening at large distances. We will cover the basics of lattice-gauge theory, with a strong
Introduction
5
emphasis on models in lower dimensions and spin models of greater simplicity, in order to develop the tools, insight, and experience to grapple with these phenomena in realistic four-dimensional settings. Duality, topological excitations and related order–disorder phase transitions, transfer matrices, and Hamiltonian methods will all be studied, with the goal of providing the reader with the skills to attack modern problems in QCD from several different vantage points. A major development in modern field theory has been an understanding of the subtleties of fermion fields and their symmetries. This book spends considerable time on this subject since the lattice has played an important role in many of these topics. The most significant accomplishment of lattice-gauge theory was the demonstration that gauge theories could be formulated nonperturbatively. This means that local color symmetries could be defined even in an ultraviolet-regulated spacetime, and local gauge symmetries would remain exact. The important ingredient in this advance was the realization that the gauge group, rather than the generators, would provide the building blocks of the theory. The lattice action, for example, is constructed using a spacetime lattice of points and links, and gauge fields are represented by group elements on links and Fermi fields by Grassmann variables on sites. The notion of exact local gauge invariance, namely that the local color of an excitation tranforms under local color rotations and that the physics should be expressible in terms of invariants, has exact, geometrical expressions in this framework. This feature lies at the heart of the theory’s ability to discuss confinement outside of particular calculational schemes and to discuss other associated nonperturbative phenomena quantitatively. Within this context one also discusses fermions and their special symmetries, such as flavor transformations and axial flavor and axial singlet transformations. This leads to nonperturbative formulations and calculations of chiral-symmetry breaking and dynamical mass generation. All of this is important to the phenomenology of QCD and its relation to the very successful constituent-quark model. The mechanisms of chiral-symmetry breaking in ordinary strongly interacting physics as well as in extreme environments of high temperature and chemical potentials have been revealed by lattice and semi-classical studies after decades of little progress when the only tool one had was perturbation theory. The interplay of chiral-symmetry breaking, semiclassical excitations, and confinement itself is a major theme in QCD and this book. It is particularly interesting that these two successes, exact gauge invariance and chiral symmetries of fermions, lead to a near catastrophe in the subject. Naively it appears that global fermion symmetries can be formulated exactly on spacetime lattices. At the same time local gauge interactions can also be formulated exactly. This then presents the opportunity to “gauge” the fermion chiral symmetries and make models in which any global chiral symmetry of interest and any local gauge symmetry would also be exact, but such a “success” is too
6
Introduction
much of a good thing. We know from perturbation theory that axial symmetries are broken by the perturbative anomalies of gauge fields. For example, the neutral pion does decay into two photons and this is caused by the existence of the famous triangle graphs coupling a pion to two photons. In more basic terms, this result means that the axial flavor current that describes the neutral pion is no longer conserved in the presence of a quantized electromagnetic field. Classically there is a conservation law but quantum fluctuations destroy it. No one can argue that the neutral pion’s dominant decay mode is into a state of two photons and few physicists would want to say that the standard formulation of axial symmetries and their currents does not describe the properties of light quarks and composite pions. It looks as though lattice-gauge theory can make quantum theories with too much symmetry! Actually this problem is “solved” by another lattice shortcoming. If one considers the Dirac equation on the lattice, one finds surprises. In particular, there are typically too many species predicted from the discrete-lattice version of the Dirac equation in the theory’s continuum limit and these extra species have just the right handedness and number to cancel out the anomaly that leads to the nonconservation of the global chiral current. This is the famous, or infamous, problem of “species doubling” on the lattice. It shows that the continuum limit of the lattice Dirac operator describes more species of fermions with more global symmetries than intended. In the early days of lattice-gauge theory this restriction led to formulations of lattice fermions that eliminated the unwanted extra fermion species by explicitly breaking most of the chiral symmetries. However, the breaking was constructed such that the continuum limit of the model would have the desired fermion species, gauge invariance, and anomalies, so that no contradictions could be encountered. This book will spend considerable space on these topics because they are central to understanding QCD. The two most useful lattice-fermion methods, which compromise chiral symmetries on the lattice but are engineered to have the correct continuum limits, are called “Wilson” and “staggered” fermions. They will be discussed in detail because they are easy to use both analytically and numerically. These formulations of the Dirac equation couple only nearest-neighbor sites on the lattice, and their actions can be “improved” in the sense that next-nearest-neighbor terms can be added to the action, so that the deviations of the lattice action from the continuum action can be systematically reduced. These improvements should prove important in numerical work on the phases of QCD. The unpleasant feature of these lattice-fermion methods, namely that chiral symmetries are compromised on the cutoff system while gauge symmetries are not, is addressed in more sophisticated lattice-fermion actions. In particular, we will introduce the domain-wall fermion method which extends the lattice to five dimensions. In the new dimension there is a domain wall where massless, chiral fermions can exist. This scheme produces a four-dimensional lattice-gauge
Introduction
7
theory with full chiral symmetry, the correct anomaly structure and local gauge invariance. Another, closely related approach to this problem in clashing symmetries changes the definition of lattice chiral transformations to accomodate the nonzero lattice-spacing cutoff. In this scheme, Ginsparg–Wilson fermions, the cutoff system has an exact chiral-like symmetry and all of its important physical consequences but the price one pays here is that the continuous symmetry differs from the continuum chiral symmetry by lattice terms and there is apparent nonlocality in the four-dimensional action. Our interest in these new lattice-fermion developments is restricted to their impact on understanding the nonperturbative, dynamical symmetry-breaking physics of QCD, which is a left–right-symmetric theory. It is hoped that the methods will produce useful nonperturbative formulations of the left–rightasymmetric models, such as the standard model of electroweak interactions, and other chiral gauge theories. As we discuss these issues, we will introduce lattice-numerical methods with an emphasis on algorithms that can solve fermion field theories nonperturbatively. Recent developments in the field at high temperature and low chemical potential and their relevance to relativistic heavy-ion colliders (RHICs) will be emphasized. Since lattice-gauge theory is not covered in graduate-physics curricula, we spend considerable time with the fundamentals of this subject. In fact, the chapters on lattice-gauge theory here could be used as the student’s first introduction to the subject. The perspective is toward QCD in extreme environments, but all the fundamentals such as local gauge invariance, continuum limits and asymptotic freedom, confinement mechanisms, lattice fermions, computer algorithms, and chiral-symmetry breaking are covered here in an elementary and self-contained fashion. The use of model lattice-gauge-theory systems, field theories in two and three dimensions, occurs throughout the book to illustrate and introduce concepts with a minimum of formalism. Several sections of the book within the lattice-gauge-theory mantra give background to the goals of latticegauge theory by working through continuum-model field theories. For example, the importance of confinement, topology, the anomaly and U(1)A problem, and the existence of θ vacua are introduced and illustrated within 1 + 1 quantum electrodynamics. This development emphasizes the need for lattice fermions to capture the chiral (flavor-singlet) and gauge symmetries of QCD correctly – if a failure occurred here, extra unphysical massless modes would populate the theory’s physical spectrum. The importance of topological excitations in field theories is illustrated through the two-dimensional planar spin model, the Abelian Higgs model, and the O(3) model. Much of the material presented here is adopted from the senior author’s elementary reviews of the subject that were written when lattice-gauge theory was in its infancy [1].
8
Introduction
The lattice approach to QCD provides a microscopic, fundamental formulation of the theory of quarks and gluons. The emphasis is on local gauge symmetries, their expression at the microscopic level, and their implications at large distances, as well as global symmetries, such as flavor and chiral symmetries. Eventually lattice QCD will provide concise solutions for the physics on all length scales. At this time, however, the field proceeds more modestly and in this book, for example, we supplement the lattice approach with the study of effective Lagrangians. In this approach one embodies the local and global symmetries of QCD in an effective field theory that states how the low-energy excitations of the solution of QCD must interact on the basis of our knowledge of the microscopic physics and the realizations of its fundamental symmetries. The successes here include an understanding of Goldstone and pseudo-Goldstone physics of pions, kaons, etc. Since light degrees of freedom control the critical behavior of QCD at its second-order phase transitions, this approach has much to say about extreme environments. In particular, at high temperature it predicts the critical behavior of the quark–gluon-plasma transition. Its generalization to nonzero chemical potential is also constrained by symmetries and conservation laws and will be developed in detail. We will see that it is particularly illuminating for model systems, such as SU(2) color models, in which the transition to a diquark condensate occurs at µc = m π /2, within the domain of applicability of chiral-perturbation theory. Unfortunately, it has less to predict about SU(3) QCD in cold but baryon-rich environments. A theoretical framework for QCD at baryon chemical potentials of the order of the mass of the nucleon is lacking, as will be discussed at length in the chapters that follow. One of the challenges here is to find a quantitative, nonperturbative framework to describe diquark condensation leading to breaking of color symmetry and color superconductivity. One of the main themes of the book is to emphasize the use of methods that, starting from the defining principles of QCD as a quantum-field theory, are able to provide systematically controllable results. Historically, only perturbation theory lived up to this claim. Although the small-coupling expansion provides indispensable information, its domain of validity ends before, and often well before, the regime of interest, where nonperturbative phenomena such as confinement, chiral-symmetry breaking, and phase transitions set in. Lattice-field theory is an instrument that allows one to go into these domains with precision controlled by the lattice spacing. Hence, lattice-field theory takes a prominent place in this book. The method of effective Lagrangians is another very powerful tool, which allows one to embody concisely the information about the global symmetries of the theory and of the ground state. A systematically controllable expansion, now organized by powers of the small energies and momenta of the lowest excitations, can then provide nontrivial dynamical information, which in some cases is even exact. Sometimes the synonym model-independent is used to distinguish such systematically controlled, or exact, results from the
Introduction
9
results obtained using phenomenologically or theoretically motivated models. Such models have made and continue to make significant contributions to our qualitative and semiquantitative understanding of nonperturbative phenomena. Unfortunately, or perhaps fortunately, for the coming generation of researchers, at present, there is still a large domain of the QCD phase diagram that is not amenable to first-principles approaches. It is our belief that this will change and we hope that this book will help in that. In writing a book of this sort it is hard to know where to start and where to stop. We assume that the reader has had or is taking an introductory field-theory course, so that path integrals, gauge fields, Feynman diagrams, and the quark model have been seen before. In addition, we use the language and methods of statistical physics, so some preparation in critical phenomena, Landau meanfield theory, and scaling laws would be useful. For example, we will freely move between the languages of field theory and statistical mechanics and we will assume that the reader can easily grasp the formal correspondences between the two subjects. We cover them briefly in our discussions of the transfer matrix for model lattice systems. Throughout the text we refer the reader to textbooks and review articles for additional background. The student might be studying introductory field theory or critical phenomena concurrently with a reading of this more specialized book. The student need not be an expert in these subjects when he or she begins this book, but we hope that they will emerge an expert at the end.
2 Background in spin systems and critical phenomena
2.1 Notation and definitions and critical indices The language and formalisms of lattice-gauge theory are based on those of traditional statistical mechanics. Let’s begin by introducing terminology and ideas in the context of the statistical mechanics of two-dimensional spin systems. This will lay the basis for more elaborate and challenging constructions in fourdimensional lattice-gauge theory. Consider the two-dimensional Ising model on a square lattice whose sites are labeled by a vector of integers n = (n 1 , n 2 )
(2.1)
Place a “spin” variable s(n) at each site and suppose that s can have only two possible values: “up,” s = +1, and “down,” s = −1. Nearest-neighbor spins are defined to interact through an energy S = −J s(n)s(n + i) (2.2) n,i
where i denotes one of the two unit vectors of the square lattice, as shown in Fig. 2.1. We choose the coupling J to be positive, so that aligned spins are favored. In the context of Euclidean field theory, the energy of the system would be called the “action.” This terminology will be clarified as we progress from statistical mechanics, with its partition function, order parameters, and susceptibilities, to Euclidean field theory, with its path integral, condensates, and propagators. The Ising-model variables and energy have two particularly important features. First, the energy has only short-ranged interactions. In fact, the interactions involve only nearest-neighbor spins. The locality of the interactions will play a crucial role in determining the phase diagram and the critical behavior, 10
2.1 Notation
11
.
.
.
.
.
.
.n
.
.
.
.
.
i
Figure 2.1. Labeling the sites and links of a square two-dimensional lattice. properties of the system in the immediate vicinity of its phase transition, of the model. Second, the model has a global symmetry: its energy is unchanged if the entire system is flipped upside down. The symmetry group is just Z2 , the global interchange of “up” and “down”. This system of spins can be placed into an external, uniform magnetic field B. The energy acquires another term, S = −J s(n)s(n + i) − B s(n) (2.3) n
n,i
The external field tends to align all the spins. It distinguishes between the “up” and “down” directions and breaks the global Z2 symmetry of the original model. The statistical properties of the Ising model follow from its partition function, Z= exp(−β S) (2.4) configs
where the Boltzmann factor P = exp(−β S) records the relative probability of a particular configuration of spins and β ≡ 1/(kT ). The sum over configurations in this equation is rather daunting since, for a system of N spins, there are 2 N different configurations. The thermodynamics of the model follows from the configurational sum. For example, the free energy is F = −kT ln Z
(2.5)
The mean magnetization per site, M, can be expressed as an expectation value with respect to the probability weight P = exp(−β S), 1 1 M= s(n) = s(n) exp(−β S)/Z (2.6) N n N n configs which can also be written in terms of the free energy, given above, as M=
1 ∂F N ∂B
(2.7)
12
Background
We will discuss the symmetry-breaking significance of M below when we consider the physics of the Ising model in some detail. Another measure of the response of the spins in the model to an external magnetic field is given by the susceptibility per site, χ=
∂M ∂B
(2.8)
This susceptibility is particularly illuminating for infinitesimal magnetic field B → 0. Using the explicit form for the magnetization in terms of a configurational average, as stated above, we can derive χ=
1 1 2 stot − stot 2 = [(stot − stot )2 ] N kT N kT
(2.9)
where stot = n s(n). This equation shows that the zero-field susceptibility is a measure of the fluctuations in the spins comprising the model. This result is closely related to the “fluctuation-dissipation” theorem. χ can also be written in the form 2 1 χ= (2.10) s(n)s(m) − s(n) N kT n,m n which shows that χ is a measure of the range of the correlations between the spins. For example, if we consider the system at a temperature at which it has no net magnetization, s(0) = 0, then χ=
1 (n) kT n
(2.11)
where (n) is the spin–spin correlation function, (n) = s(n)s(0)
(2.12)
and we used the translational invariance of the system to simplify the sums over the lattice sites. This algebra teaches us a lesson that is of central importance in statistical physics and will play an important role in lattice-gauge theory: susceptibilities can become singular and can diverge in the thermodynamic limit of a macroscopic system, if the system develops sufficiently long-range correlations. Note that we are always considering the Ising model, whose energy is constructed from local interactions of just one lattice spacing. Nonetheless, the dynamics may generate a spin–spin correlation function that falls off so slowly across the lattice that the sum in the equation for χ diverges. This behavior lies at the heart of second-order phase transitions and continuum limits of lattice-gauge theory, so we will return to it several times more.
2.1 Notation
13
Consider the phenomenology of phase transitions, in particular, the behavior of the system for temperatures at and near a second-order phase transition. The magnetization M is a function of T and B. It is interesting to let B tend to zero for a fixed temperature T . If M remains nonzero, the system is said to experience “spontaneous magnetization.” This occurs only for T below a critical value Tc , the Curie temperature in the Ising model. The existence of a nonzero magnetization indicates that the solution of the system does not share the global up–down symmetry of its energy expression. One says that the global Z2 symmetry is “spontaneously broken” and that M serves as an “order parameter” to label the phases and symmetries of the system. M is nonzero for T < Tc , but it vanishes continuously at a second-order phase transition, T → Tc . In the Ising model, as in many physical systems, the order parameter vanishes as a power in the limit T → Tc , M ∼ (Tc − T )βmag
(2.13)
where βmag is the magnetization critical exponent, which has been calculated for many solvable models, including the Ising model in two dimensions, and has been measured in experiments and in computer simulations. It takes the value 18 in the two-dimensional Ising model, and is 12 in mean-field theory, a simple but extraordinarily useful approximation that will be discussed extensively below. Another quantity that is particularly important and informative in the context of critical phenomena is the spin–spin correlation function (n). At sufficiently high temperatures, we expect thermal fluctuations to dominate the tendency of distant spins to align and correlate. Very general considerations lead to the expectation that (n) should fall off exponentially with the distance between the spins for any T > Tc , (n) ∼ exp(−|n|/ξ(T )),
|n| 1
(2.14)
where ξ(T ) is the system’s “correlation length.” ξ gives a measure of the sizes of patches of correlated spins in the system. For high T , ξ(T ) will be comparable to a lattice spacing. As T approaches Tc , ξ(T ) should grow large and eventually diverge when T = Tc . In many models the singularity is power-behaved and we write ξ(T ) ∼ (T − Tc )−ν
(2.15)
where ν is another critical index of fundamental significance in the theory of phase transitions. In the two-dimensional Ising model ν = 1, whereas in meanfield theory ν = 12 . Since the correlation length diverges at the critical point Tc , the correlation function (n) is no longer an exponential there. Instead, it falls slowly, as a power of the distance n between the spins, (n) ∼ |n|d−2+η ,
T = Tc
(2.16)
14
Background
where d is the dimensionality of the lattice and η is another critical index, which is 14 in the two-dimensional Ising model. At a temperature T below the critical point where the system develops a spontaneous magnetization, we must define the correlation fuction as (n) = s(n)s(0) − s(0)2
(2.17)
in order that it fall to zero as |n| grows large. With this definition (n) measures real correlations between individual spins and it falls off exponentially in |n| for all T < Tc . Finally, there is the specific heat, C = −T
∂2 F ∂T 2
(2.18)
which is also expected to be singular at the critical point. In fact, if the singular piece of the specific heat is power-behaved, we introduce the critical index α and write C ∼ (T − Tc )α
(2.19)
It is amusing that α = 0 in the two-dimensional Ising model and the actual critical singularity in the specific heat is a logarithm. The final critical index we will discuss is another one specific to T = Tc and is called δ. It measures the response of the system at the critical point to an infinitesimal external magnetic field, M ∼ B 1/δ ,
T = Tc
(2.20)
In the two-dimensional Ising model δ = 15, so the response of the magnetization to the external field is very abrupt. There is much more to be said about basic critical behavior of classical spin models, and we will introduce more ideas as we turn to applications below. However, the lesson we should take from this discussion is that, at the critical point, the correlation length diverges and this causes large fluctuations and singular behavior in various thermodynamic functions. This occurs even though the basic coupling in the system is short-ranged. 2.2 Correlation-length scaling and universality classes One of the most striking features of the experimental study of physical systems at and/or near their critical points is that apparently different physical systems have identical critical singularities [2]. Various ferromagnets have the same critical indices as various liquid crystals, etc. Systematic experiments concerning
2.2 Scaling and universality classes
15
a wide range of materials have led to the idea that physical systems can be arranged into “universality classes” and that within each class each substance has a common critical behavior. These classes are sensitive to just a few features of each physical system. In fact, the classes are labeled by the following parameters. 1. The dimension of the physical system. 2. The symmetry group of the local variables in the system’s energy. 3. The symmetry of the coupling between the local variables. 4. The range of couplings in the energy. For example, the two- and three-dimensional Ising models have different critical behaviors. In addition, the Z2 Ising model has different critical indices than the three-state Pott model. Models with sufficiently long-range coupling in their energy expressions belong to the mean-field universality class whereas shortrange-coupled models made from the same degrees of freedom formulated on identical lattices do not. One of the triumphs of K. G. Wilson’s renormalizationgroup approach to critical phenomena was that it provided a natural framework for describing different universality classes through the notion of fixed points of renormalization-group transformations and scaling dimensions of operators [2]. At the descriptive, phenomenological level, one traces the singular behavior of diverse physical systems down to the divergent characters of their correlation lengths. The “correlation-length-scaling hypothesis” claims that the divergence of this single length is responsible for the singular behavior of each of the system’s thermodynamic functions. This is a crucial hypothesis because it pinpoints the source of the phase transition in a single, singular quantity. Insofar as scaling laws and singular behavior of thermodynamic functions are concerned, the hypothesis states that only the single correlation length matters. In particular, the lattice spacing of the model, although it has the same dimensions as ξ , drops out of the picture. The long-range correlations between the microscopic degrees of freedom are the essential ingredients in understanding how the system can engage in a continuous phase transition. The droplet picture of continuous phase transitions captures the essence of this phenomenon. It is a visualization of a scale-invariant field theory. As the critical temperature is approached, T → Tc , the correlation length ξ(T ) becomes very large and long-range correlations begin to appear in the configurations. The droplet picture states that regions of “all up” and regions of “all down” spins, in the case of the Z2 Ising model, begin dominating the configurations. The sizes of such “droplets” range from zero to ξ(T ) itself. It is important to appreciate that no particular size scale dominates: there are considerable numbers of droplets
16
Background
Figure 2.2. Droplets on all length scales at a second-order phase transition. on all possible length scales, as shown in Fig. 2.2. This is the idea of “scale invariance” in the context of field theory or statistical mechanics. It is the physical phenomenon that underlies “critical opalescence,” the fact that a liquid–gas system near a second-order phase transition appears “milky” when it is irradiated with ordinary light because droplets scatter the light on all possible length scales. Percolation phase transitions also constitute a precise realization of these ideas and show how singular thermodynamic functions with power-law scaling behaviors result. In the context of field theory, one is developing a physical picture that underlies conformal field theory with anomalous dimensions. The reader is referred to the huge statistical mechanics literature for model building and examples of critical phenomena based on these ideas [3]. The scaling relations that follow from the correlation-length-scaling hypothesis are very important and very general. The idea is that, since there is but one relevant length in the problem, ξ(T ), which controls the long-range correlations and the singular behavior of thermodynamic functions, various of the singularities are related by dimensional-analysis arguments. This also implies that there are precise relations among the various critical indices characterizing the phase transition. These are called “hyperscaling relations” and read βmag = 12 ν(d − 2 + η) γ = ν(2 − η) α = 2 − νd δ = (d + 2 − η)/(d − 2 + η)
(2.21) (2.22) (2.23) (2.24)
One can check that these relations are true for the two-dimensional Ising model we have reviewed, and they have been checked for a wealth of other two- and three-dimensional statistical models and materials. It is interesting that mean-field-theory critical exponents do not satisfy these scaling results except in four dimensions. Since mean-field theory replaces fluctuating quantites with their supposed averages, this failure of an otherwise extremely important and useful model should come as no surprise.
2.3 Properties of the Ising model
17
2.3 Properties of the Ising model The two-dimensional Ising model played a most significant role in the development of the modern theory of critical phenomena. Its exact solution established the existence of a critical point with power-law singularities that were not given by Landau mean-field predictions. This result begged for understanding, both theoretical and intuitive, and it inspired several generations of physicists to make major contributions to the field. Many of these contributions, such as high-temperature and low-temperature expansions, duality, correlation-length scaling, finite-size scaling, the operator-product expansion, the renormalization group, and the expansion, had applications much broader than the specific Ising model and led to fundamental advances. We wrote down the action for the model above. To study its thermodynamics, we consider its partition function,
Z (K ) = ··· e K i j σi σ j (2.25) σ N =±1
σ1 =±1
where K = J/(kT ), the notation i j indicates a sum over nearest-neighbor sites i and j, and N is the total number of lattice sites in the square array. Instead of doing sophisticated constructions and calculations, let’s investigate the possibility that this system displays spontaneous magnetization at low temperatures, σi = 0, by “following our noses” and expanding Z (K ) beginning with this “guess” for the ground state. In other words, we begin with all spins aligned, as would be the case at vanishing temperature, and expand Z (K ) in the number of flipped spins. Choose the T = 0 ground state to be all spins “up.” When one spin is flipped, four bonds are broken, since each spin has four nearest neighbors on a symmetric, two-dimensional lattice. If two spins are flipped, then six bonds are broken if those spins are nearest neighbors and eight bonds are broken otherwise. The first several terms in this “low-temperature expansion” are visualized in the Fig. 2.3 and the corresponding expression for Z (K ) is Z (K )e−2N K = 1 + N e−8K + 2N e−12K + 12 N (N − 5)e−16K + · · ·
(2.26)
where the factors of N record the number of ways the graph can occur on .
.
.
.
.
.
.
.
.
.
.
.
.
.
.x
.
.
. x
. x
.
.
.x
.
.
.x
.
.
.
.
.
.
.
.
.
.
.
.
.
.
(a)
(b)
(c)
Figure 2.3. The first few terms in the low-temperature expansion of the twodimensional Ising model. The crosses label flipped spins.
18
Background
Figure 2.4. The first few terms in the high-temperature expansion of the twodimensional Ising model. the lattice. This expression is not very illuminating, so let’s turn to the hightemperature properties of the model. In that case K is very small and we can write each bond term in the partition function in the convenient form exp(K σ ) = cosh K + σ sinh K
(2.27)
where σ is a generic Z2 variable (σ = ±1). Applying this relation to the original expression for the partition function gives ··· (cosh K + σi σ j sinh K ) Z (K ) = σ1 =±1 i j
σ N =±1
= (cosh K )2N
σ N =±1
···
(1 + σi σ j tanh K )
(2.28)
σ1 =±1 i j
This form of the partition function is now a suitable basis from which to develop an expansion for Z (K )(cosh K )−2N in the small variable tanh K . To identify nonzero contributions to this expansion, we use the trivial identities σ = 0, 1=2 (2.29) σ =±1
σ =±1
Therefore, as we expand (1 + σi σ j tanhK ), only those terms without a free spin variable will contribute. The product i j indicates that we place “bonds,” σi σ j tanh K , on the links of the lattice and only if the bonds form closed paths will the term contribute. In addition, no link can be covered more than once. The several terms in the expansion are shown in Fig. 2.4 and the expression for Z (K ) is Z (K )(cosh K )−2N 2−N = 1 + N (tanh K )4 + 2N (tanh K )6 + 12 N (N − 5)(tanh K )8 + · · ·
(2.30)
This expression looks as uninformative as the low-temperature expansion. In fact, they look identical, if we identify tanh K = exp(−2K ∗ )
(2.31)
2.4 The Kosterlitz–Thouless model
19
and compare the two series term by term, Z (K ∗ ) Z (K ) ∗ N = 2K N (e ) 2 (cosh2 K ) N
(2.32)
The last two expressions can be written in more symmetric fashion, as a short exercise in hyperbolic functions shows: sinh(2K ) sinh(2K ∗ ) = 1
(2.33)
Z (K ) Z (K ∗ ) = N /2 sinh (2K ∗ ) sinh N /2 (2K )
(2.34)
and
In other words, we have discovered that the low-temperature properties of the model map onto its high-temperature properties: if we understand the magnetized phase, we understand the disordered phase equally well. We have discovered a “duality map” relating the two phases of the model. One says that the Ising model is self-dual and, if we assume that there is but one phase transition in the model, then it must occur at the self-dual point, that temperature at which K = K ∗ , or √ or e2K c = 2 + 1 (2.35) sinh2 (2K c ) = 1 This argument is correct and we have found the critical point exactly. It is interesting that this self-dual property of the model was known long before it was solved. In fact, the duality map can be derived elegantly without the use of the expansions used here, and more detail about the mapping between correlation functions in the low- and high-temperature phases can be extracted. Duality maps are extremely important in lattice-gauge theory and field theory in general, and we will see more of them later. 2.4 The Kosterlitz–Thouless model Up to this point we have used the Ising model as our means to illustrate notions of statistical mechanics that will play an important role in lattice-gauge theory. However, continuous variables will become more important, so we turn to the simplest case of them now. We replace the Z2 Ising variables on the twodimensional square lattice with two-dimensional, planar vectors of unit length, s1 (n) = cos θ (n),
s2 (n) = sin θ (n)
(2.36)
where θ(n) is an angular variable that resides on each site n. The energy of the entire system becomes s(n) · s(n + µ) = −J cos(θ (n) − θ (n + µ)) (2.37) S = −J n,µ
n,µ
20
Background
where µ labels the two directions between nearest-neighbor sites. As was the case for the Ising model, when J is chosen positive the system is ferromagnetic. We might expect this model to have an order–disorder transition at some sufficiently low T , but, although there is a phase transition in this model, it is particularly subtle and relevant to various lattice-gauge theories in higher dimensions. In fact, the planar model experiences a phase transition without a local order parameter, as we shall see shortly. This model has a global continuous rotation symmetry: all the spins can be rotated through a common angle α, and the energy remains unchanged. The analogous global symmetry was broken at low temperatures in the Ising model. Will there be a similar ordering of spins in this model? To gain an idea about the possible phases of the planar model, consider it first at high temperatures T . We will discuss low temperatures in due course. Let’s use the language of two-dimensional field theory, as a foretaste of later discussions of lattice-gauge theory, and call the expression S in the previous equation the “action” of the Euclidean field theory. We write it in a slightly more convenient fashion, 1 i(θ(n)−θ(m)) S = −J e cos(θ(n) − θ (m)) = − J + h.c. (2.38) 2 n,m n,m where n, m is a sum over nearest-neighbor sites of the two-dimensional lattice. At high temperatures we expect that the system will be disordered and the spin–spin correlation function will be short-ranged. This is easy to verify. Consider the correlation function iθ(0) −iθ(n) = e e dθ (m) ei(θ(0)−θ(n)) m
J × exp − cos(θ (n) − θ (m)) Z kT n,m
(2.39)
The exponential of the action S can be expanded in powers of J/(kT ) for sufficiently high temperatures, which will allow us to obtain a useful upper bound on the correlation function. We use the elementary integrals 2π 2π dθ (m) = 2π, dθ (m) eiθ(m) = 0 (2.40) 0
0
So, the leading contribution to the correlation function will occur in the expansion of the exponential when a string of e−i(θ(k)−θ(l)) factors extends from site 0 to site n, because otherwise there will be an uncancelled phase eiθ( j) on a site, which, when it is integrated over, will average to zero. If the sites 0 and n are on an axis, the first nonzero contribution appears at order [(J/(kT )]|n| , and the
2.4 The Kosterlitz–Thouless model correlation function behaves as i(θ(0)−θ(n)) e ∼ [J/(kT )]|n| = exp[−|n| ln(kT /J )]
21
(2.41)
which falls to zero exponentially as |n| grows. In the language of scaling and the correlation length, we see that ξ(T ) ∼ 1/ ln(kT /J ). Now we turn to the low-temperature behavior of the correlation function. In this limit θ(n) is expected to vary slowly over the lattice, so the argument of the cosine in the action should be very small. Then the cosine can be expanded and only the quadratic term need be accounted for, 1 S≈ J (µ θ (n))2 (2.42) 2 n,µ where µ denotes a discrete difference, µ θ (n) ≡ θ (n + µ) − θ (n). We note further that the action in this approximation resembles that of a free, massless scalar field. The only subtlety is that θ is an angular variable, ranging from 0 to 2π. However, if it were pinned to zero at some site on the lattice, we would expect its values everywhere on a finite lattice to be near zero, rendering its periodic nature irrelevant. Certainly we must be careful in the thermodynamic infinite-volume limit, however. If the action becomes a quadratic form and θ remains small over the entire lattice, then the model is integrable. The correlation function becomes i(θ(0)−θ(n)) J e Z ∼ dθ (m) ei(θ(0)−θ(n)) exp − (µ θ (n))2 2kT m n,µ (2.43) which can be evaluated explicitly because it involves only Gaussian integrals. If (n) denotes the massless scalar propagator on the finite system, then i(θ(0)−θ(n)) kT e ∼ exp (n) (2.44) J where the propagator behaves as 1 ln |n| (2.45) 2π on the finite L × L, but large, L n 1, system. A physical interpretation follows when we substitute back into the expression for the correlation function, kT /(2π J ) i(θ(0)−θ(n)) 1 ∼ (2.46) e |n| (n) n1 ∼ −
We see that the correlation fuction always falls to zero, even for infinitesimal temperature T . This means that the two-dimensional planar model never magnetizes. Apparently, the fluctuations in two dimensions are considerable enough
22
Background
at any nonzero T to average any particular spin to zero. This follows formally from this expression because the expectation value of the product of two spins should approach the product of their expectation values as |n| grows large, i(θ(0)−θ(n)) i(θ(0) −iθ(n) e ∼ e e (2.47) Since the correlation function falls to zero in this limit, we must have iθ(0) e =0
(2.48)
precisely. This result, the absence of spontaneous magnetization at low temperatures, is in accord with general results in this field that state that continuous global symmetries cannot break down in two-dimensional systems with just local, short-range interactions [4]. In addition, the result indicates that the model is critical for any low temperature because the spin–spin correlation function is power-behaved, with a critical index η = kT /(2π J ) that varies continuously for low T . In other words, in contrast to the cases discussed above, this model has a line of critical points rather than an isolated Tc . We learn that, for any T that is sufficiently low, the correlation length ξ is divergent. If we put our low- and high-T analyses together, we conclude that the model must have at least one phase transition at an intermediate temperature between a low-temperature region where the spin–spin correlation function is a power and a high-temperature region where the spin–spin correlation function is an exponential. The correlation length ξ(T ) is infinite in the low-temperature phase and finite otherwise. This behavior is distinct from the Ising model, for which the correlation length is finite both in the magnetized phase and in the unmagnetized phase and diverges only at Tc . The planar model has a phase transition, but no global order parameter because s(0) = 0 in both phases. What is the nature of the planar model’s phase transition? An intriguing and very influential answer was suggested by Kosterlitz and Thouless [5], who pointed out that the periodic nature of the angular variables θ (n) allowed for singular spin configurations – vortices – which could appear in the system at sufficiently high temperatures and disorder it, changing the correlation function from a power to an exponential. They began their considerations with the Gaussian, low-temperature approximation to the action S, but they supplemented it with the periodicity of the angular variables θ (n), ∇ 2 θ = 0,
0 < θ < 2π
(2.49)
The solutions to this periodic Gaussian system can be labeled by their winding number, · ds = 2πq, ∇θ q = 0, ±1, ±2, . . . (2.50) In the spin-wave analysis we presented above at low temperature, we ignored the periodicity of the θ (n) variables because they should vary very slowly across
2.4 The Kosterlitz–Thouless model
23
x
Figure 2.5. A vortex in the two-dimensional planar model. the lattice. We were effectively dealing with just the q = 0 topological sector of the theory. A spin configuration with q = 1 is shown in the Fig. 2.5. It can be depicted as a “porcupine” or a “vortex.” Clearly the configuration is singular, has a “center” about which the spins vary rapidly on the scale of the lattice spacing, and disorders the system over arbitrarily large distances. Kosterlitz and Thouless presented a famous plausibility argument suggesting that vortex condensation drives the phase transition between the low- and high-temperature phases at a nonzero Tc . The argument begins by evaluating the action for the vortex in Fig. 2.5, using the Gaussian approximation S ≈ π J ln(L/a)
(2.51)
where L is the linear dimension of the lattice, and a is the short-distance cutoff, the lattice spacing. Since S diverges whenever L/a → ∞, one might think that vortices are irrelevant in the continuum system. However, to estimate their importance to the thermodynamics, we must consider their contribution to the free energy, F = S − T E, where E is the entropy. We have used field-theory notation and have written S for “action” where one would have written “energy” in statistical mechanics. We see a competition between energy and entropy in the free-energy expression that decides whether the vortex is physically significant. To make this argument quantitative, we must estimate the entropy of a free vortex. This is a counting problem because the entropy is the logarithm of the multiplicity of the vortex configuration. Since the vortex is specified by its location, as is clear in Fig. 2.5, its multiplicity is just the number of sites of the lattice, entropy = k ln(L/a)2 . Therefore, F = (π J − 2kT ) ln(L/a)
(2.52)
24
Background
which changes sign when the temperature is raised to Tc = π J/(2k)
(2.53)
Therefore, for all temperatures greater than Tc , vortex condensation is favored and the ground state of the system should have a macroscopic occupation of them. Since vortices disorder the spins over macroscopic distances as well, we have a mechanism for describing the system’s phase transition. This simple exercise brings up as many questions as it answers. It ignores all interactions and doesn’t explain whether vortex condensation is a catastrophic instability or a phase transition to a new ground state. It is, in fact, a phase transition and it is instructive to study the model more and see how this works out. This will prove to be an inspiring exercise because the issue of topological causes for disordering a spin system in two dimensions will prove to be closely related to discovering the mechanisms of confinement and chiral-symmetry breaking in four dimensional lattice gauge theory. 2.5 Coulomb gas, duality maps, and the phases of the planar model Now let’s take the ideas of the last section and make them quantitative. The motivation for this comes from lattice-gauge theory: many of the concepts that we can illustrate straightforwardly in two-dimensional spin models, like the planar model, generalize to four-dimensional lattice-gauge theory. The condensation of topological configurations in two-dimensional spin systems will find analogs in the condensation of various topological excitations in four-dimensional gauge theories. The analogs of disordered spin configurations will become quark confinement and chiral-symmetry breaking. To carry through this program, we introduce a slightly simpler, for the purposes of analysis, formulation of the two-dimensional planar model, namely the periodic Gaussian model. To motivate the use of this model, consider a single link in the expression for the partition function of the planar model, e−β[1−cos(θ(n)−θ(m))] =
∞
eil(θ(n)−θ(m)) Il (β)
(2.54)
l=−∞
where we have written out the Fourier series of the periodic-link contribution to Z and Il (β) is the Bessel function of imaginary argument. Now specialize to low temperatures (large β) at which Il (β) is well approximated by a Gaussian, e−β[1−cos(θ(n)−θ(m))] ≈
∞
eil(θ(n)−θ(m)) e−l
2 /(2β)
(2.55)
l=−∞
The right-hand side of this expression has the same “essential” ingredients as the original planar model: it has local, ferromagnetic couplings between spins
2.5 Coulomb gas, duality maps, and phases
25
that preserve the periodic nature of the action. It has the great advantage that it is a simple Gaussian and can be analyzed elegantly. The model was introduced by Villain and will be analyzed now for all temperatures β. Since it has the same essential features as the original planar model, we expect its critical behavior to be the same. However, because the nasty nonlinearities of the cosine interaction have been simplified, we will be able to rewrite the partition function into two factors: one describing the spin waves and another describing the vortices. The partition function for the vortices will be given by the twodimensional Coulomb gas and will quantify the descriptive analysis of Kosterlitz and Thouless reviewed above. The partition function of the periodic Gaussian model now reads, introducing discrete link variables n µ (r ), ∞ 2 Z= dθ (r ) ein µ µ θ e−n µ (r )/(2β) (2.56) r
r,µ n µ (r )=−∞
where µ is the discrete-difference operator, µ f (n) = f (n + µ) − f (n). We can do the integrals over θ (r ) because θ enters only a phase factor. Each such integral generates a constraint, µ n µ (r ) = 0, so the partition function becomes Z=
∞
r,µ n µ (r )=−∞
δµ n µ (r ),0 e−n µ (r )/(2β) 2
(2.57)
It is easy to solve the constraint and obtain a simpler form for Z . The constraint states in discrete form that n µ (r ) is a divergence-free integer-valued vector field. This implies that it is the discrete curl of an integer-valued scalar field, n µ (r ) = µν ν n(r ) Using the result n 2µ (r ) = (µ n(r ))2 , the partition function becomes ∞ 1 2 exp − (µ n(r )) Z= 2β r,µ n µ (r )=−∞
(2.58)
(2.59)
It is interesting to step back for a moment and see what all this trickery has accomplished. We started with the planar model at temperature T and mapped it onto a new model, our last expression for Z , at temperature T −1 . This is an example of a duality transformation. The last expression for Z is called the “interface-roughening” model, and is of considerable fame in the statistical mechanics of crystal growth. Think of a crystal of steps, each of height n(r ), growing on a table top. At small values of β, the energetics of the system favors a flat crystal, all the n(r ) at a common value, with the finite temperature fluctuations inducing lumpy waves on it. However, as β decreases a point at which the localization of the surface is lost is reached and it becomes “rough,” with n(r )
26
Background
diverging logarithmically as the linear dimensions L of the system are taken large, and the step–step correlation function behaving as n(r )n(0) ∼ ln |r |. We will see shortly how to establish these results. One of the useful points about a duality mapping is that the high-temperature region of one model is mapped onto the low-temperature region of another. The relevant degrees of freedom are also mapped onto one another, so one can learn which variables expose the physics in the clearest way. In the case developed here, we also learn that two apparently different models, with totally different physical settings, one a conventional spin model and the other a crystal-growth model, have the same critical behavior. We are not done with our maps in this case. It would be convenient to obtain a representation of the planar model in which the integer-valued field n(r ) were replaced by an ordinary scalar field φ(r ). The trick to do this is the Poisson summation formula, ∞ ∞ ∞ g(n) = dφ g(φ)e2πimφ (2.60) n=−∞
m=−∞
−∞
where g is an arbitrary but sensible function. The identity is rather trivial: to establish it, just do the sum over m before the integral over φ. Applying it to the case at hand, ∞ ∞ 1 2 Z= dφ(r ) exp − (µ φ(r )) + 2π i m(r )φ(r ) 2β r,µ −∞ r r m(r )=−∞ (2.61) The symbols in this expression have simple interpretations in terms of the original model: the field φ(r ) is the spin wave and the m(r ) are the vortices. The expression for Z states that the vortices act as sources of the spin waves, which are ordinary massless fields. The identification of the m(r ) with vortex variables becomes clear if we integrate out the spin waves. The integration over φ in the expression for Z is a simple Gaussian, so we obtain ∞ 2 exp −2π β m(r )G(r − r )m(r ) (2.62) Z = Z SW m(r )=−∞
r,r
where Z SW is the spin-wave contribution produced by the Gaussian integrations and G(r − r ) is the lattice propagator for a massless field. This expression for the partition function is surprisingly simple, physical, and important. It shows that the spin waves are independent of the vortices in the periodic Gaussian model because Z has factored into two terms, a spin-wave piece Z SW that has no phase-transition physics in it, and the vortex piece displayed above. The vortex piece shows that vortices interact through the “potential”
2.5 Coulomb gas, duality maps, and phases
27
G(r −r ). The vortices are pointlike excitations with possible charges m ranging from −∞ to +∞. To extract the physics from this expression we must understand G(r − r ) better. The propagator G(r − r ) appeared in the calculation when we did the Gaussian integral and had to invert the discrete, lattice Laplacian on a twodimensional square array, 2 G(r ) = δr,0
(2.63)
where 2 = 2x + 2y and 2x is the simplest discrete form of the second derivative, 2x G(r ) = G(r + x) + G(r − x) − 2G(r )
(2.64)
We solve for the propagator G by passing to momentum space, as in the corresponding calculation in continuum field theory, π dk x π dk y eik · r G(r ) = (2.65) −π 2π −π 2π 4 − 2 cos k x − 2 cos k y We will need some of the properties of G(r ). First, it is easy to check that, at large |r | 1, G(r ) is well approximated by the continuum massless propagator, G(r ) ≈ −
1 1 ln(r/a) − , 2π 4
|r | 1
(2.66)
The behavior of G(r ) for small r can also be determined from the integral above, G(0) ≈
1 ln(L/a) 2π
(2.67)
where L is the linear dimension of the lattice. G(0) is important because it contributes to the partition function through the r = r term in the sum, so it represents the “self-interaction mass” of each vortex. Its logarithmic dependence on the infrared cutoff L and the ultraviolet cutoff a was anticipated earlier in our review of the Kosterlitz–Thouless ideas. We also see that the r = r terms in the double sum of Z indicate that the vortices interact through spin-wave exchange, which generates the two-dimensional Coulomb potential G(r − r ). Since the possible charges m are positive or negative, we have a gas with opposite charges attracting and same-sign charges repelling: a Coulomb gas in which the charges are quantized and vary from −∞ to +∞. It is instructive to write G(r ) as two terms, one of which is infrared-finite and the other of which is not, G(r ) = [G(r ) − G(0)] + G(0) ≡ G (r ) + G(0)
(2.68)
28
Background The sum over the vortices in the partition function can then be simplified, 2 ∞ Z = Z SW exp−2π 2 βG(0) m(r ) m(r )=−∞
r
× exp −2π β 2
m(r )G (r − r )m(r )
(2.69)
r,r
The
first exponential eliminates all vortex configurations that are not neutral, ) = 0, in the thermodynamic limit L → ∞. Only configurations for r m(r
which r m(r ) = 0 contribute to Z . Therefore, we can write Z = Z SW exp −2π 2 β m(r )G (r − r )m(r ) (2.70) r,r
m(r )
where the prime on the sum means “neutral configurations of vortices only.” It is also useful to use the large-r expression for the propagator G, since it is actually quite accurate even for r near the ultraviolet cutoff, π 2β Z = Z SW exp m(r )m(r ) + πβ m(r ) ln(|r − r |/a)m(r ) 2 m(r ) r,r r,r (2.71) where the prime on the double sum over sites means that the r = r term is omitted. Using the neutrality condition, we can simplify the first double sum in the expression for Z , 0=
r,r
Now,
m(r )m(r ) =
r,r
m(r )m(r ) +
m(r )2
(2.72)
r
π 2β 2 exp − m (r ) + πβ m(r ) ln(|r − r |/a)m(r ) Z = Z SW 2 r m(r ) r,r
(2.73) The first term in the exponential records the chemical potential of each vortex and the second gives the logarithmic interaction between different ones. This Coulomb-gas representation of the planar model gives a precise underpinning to the ideas of Kosterlitz and Thouless about the model’s phases and phase transition. The only possible configurations of vortices must be neutral. If β is very large, then the chemical-potential term suppresses the vortex contribution to the physics, leaving just the spin waves. In fact, the strong logarithmic
2.5 Coulomb gas, duality maps, and phases
29
potential between vortices suggests that any vortex–anti-vortex pairs that populate the ground state are tightly bound together into neutral molecules. However, as β ≡ J/(kT ) is reduced and becomes of the order of unity, one would expect that the probability of finding vorticies would not be suppressed and they should contribute to the properties of the ground state. Since they experience long-range interactions, however, they cannot be treated as free excitations. In effect, as the temperature T grows, the neutral molecules of vortex–anti-vortex pairs unbind and the vortices form a plasma that screens external and internal charges and dynamically generates a finite screening length. A renormalization-group analysis is needed in order to derive this fact, which also uncovers the character of the phase transition: it is an infinite-order transition with the correlation length growing as T approaches Tc from above as ξ(T ) ∼ exp[b 1/(T /Tc − 1)] (2.74) which is the famous essential singularity of the subtle planar-model transition [6]. This result has been obtained by various methods in addition to using the traditional langauge of the Coulomb gas of screening lengths and plasmas. A field-theoretical approach follows from the expression for the partition function in terms of φ and m, ∞ ∞ 1 2 Z= dφ(r ) exp − (µ φ(r )) + 2π i m(r )φ(r ) 2β r,µ −∞ r r m(r )=−∞ (2.75) We have already realized that the coupling of the vortices to the spin waves will generate a mass, effectively a chemical potential, for the vortices. To accomodate this effect we generalize the partition function to one with an additional variable y, ∞ dφ(r ) Z (β, y) = −∞ r
×
∞ m(r )=−∞
1 (µ φ(r ))2 + ln y m 2 (r ) 2β r,µ r m(r )φ(r ) (2.76) + 2πi
exp −
r
If y is very small so that the vortices are heavy, then only the terms in the sum with m(r ) = 0, ±1 should be numerically significant, and the partition function
30
Background
can be simplified,
eln y m
2 (r )+2πim(r )φ(r )
≈ 1 + eln y (e2πiφ(r ) + h.c.)
m(r )=0,±1
≈ 1 + 2y cos(2π φ(r )) ≈ e2y cos(2πφ(r ))
(2.77)
The partition function becomes ∞ 1 2 dφ(r ) exp − (µ φ(r )) + 2y cos(2π φ(r )) Z (β, y) ≈ 2β r,µ −∞ r r (2.78) Finally, to restore conventional field-theoretical notation, we rescale φ → √ φ/ β, so ∞ Z (β, y) ≈ dφ(r ) −∞ r
1 2 × exp − (µ φ(r )) + 2y cos(2π βφ(r )) 2 r,µ r
(2.79)
which we recognize as the quantum sine-Gordon equation. The periodicity of the original planar model shows up in this expression in the cos interaction term. A renormalization-group analysis of this model in the β–y plane produces the phase diagram and scaling laws of the model in an elegant fashion, with all the universal features the same as those obtained from the Coulomb-gas representation. We have dwelt on this model in considerable detail because it illustrates many physical and technical issues that arise in four-dimensional lattice-gauge theory. These include topological excitations as the source for disorder and a phase transition, duality maps to relate the high- and low-temperature features of the model in terms of weakly fluctuating variables, and correlation-length scaling. 2.6 Asymptotic freedom in two-dimensional spin systems Another class of two-dimensional spin systems that will teach us what to expect of four-dimensional gauge theories consists of the nonlinear sigma models. We consider a square, two-dimensional lattice again with a unit vector spin at each site, coupled to its nearest neighbors through an inner product, 2 S=− s(n) · s(m), |s | = 1 (2.80) g n,m
2.6 Asymptotic freedom
31
The simplest case we will concentrate on is the O(3) model, in which we can visualize each spin as having its end on a site of the square lattice and its head on a unit sphere. We easily check that this action is the simplest lattice version of the continuum O(3) “sigma” model, 1 2 Z= ds (n) exp − 2 d r ∂µ s · ∂µ s (2.81) 2g n We will ultimately be interested in understanding this model’s phase diagram as a function of the coupling g. Is there a region where there are long-range correlations, and another where there are only short-range correlations? What are the disordering mechanisms that control the behavior of the spin–spin correlation function at long distances? We take a renormalization-group view to answer these questions. We are interested in extracting the long-distance physics from this model. Equivalently, we want the low-energy, low-momentum behavior. The theory has fluctuations and excitations on all length scales from the ultraviolet to the infrared cutoff. The long-distance/low-energy character of the model is not apparent from the shortdistance, fundamental formulation of the model given by the partition function. The partition function gives us no clue about the theory’s possible phases, the possibility of dynamical symmetry breaking, and generation of a correlation length. We need to integrate out the high-energy fluctuations of the theory, to find the relevant low-energy degrees of freedom and their interactions. For example, in the case of quantum chromodynamics, we would like to start with the theory of quarks and gluons and end up with an effective Lagrangian whose degrees of freedom are the known mesons and baryons, coupled together by the residual nuclear force. Physics is certainly far from this goal, but this program, the renormalization group, has had its share of successes in simpler models [2]. In the context of the O(3) nonlinear sigma model in two dimensions, let’s integrate out high-frequency modes and find the effective degrees of freedom and interactions for which perturbation theory is adequate. This is an extremely modest calculation, on the scale of our discussion above, but we will find an interesting result, asymptotic freedom, along the way. We will see that the curvature of the sphere of the O(3) spins causes the interactions between them to be stronger at low energies than at high energies. This is the same trend as that found in QCD, which is responsible for its parton description at short distances and its confining features at long distances. QCD will be discussed below. Here we wish to see that the O(3) spin system in two dimensions is asymptotically free. This means, in our statistical-mechanics language, that the theory’s only phase transition is at zero coupling, for which it becomes disordered. The theory is more strongly coupled at long distances than at short, as our perturbative asymptotic-freedom calculation will show, and it will dynamically generate a correlation length that will allow its spin–spin correlation function to be
32
Background
exponentially small at large distances. Actually, since our calculation will be perturbative, we won’t be able to discuss very large scales in a controlled fashion, because the effective coupling will be too large there. Since asymptotic freedom is a consequence of the geometry of this model, we will parametrize the spin variable s judiciously. s has low- and high-frequency fluctuations and we choose a parametrization in which two components, s1 and s2 , are both slowly varying and large, O(1), while the third component, s3 , is √ rapidly varying and small, O( g). We will verify in the course of the calculation that such an arrangement is possible if we are content to consider only very small couplings g. Since each spin has unit length, we can use the parametrization 2 s1 = 1 − s3 sin θ, s2 = 1 − s32 cos θ (2.82) Then the Lagrangian becomes 1 1 (∂µ s3 )2 2 2 2 L= 1 − s3 (∂ν θ ) + (2.83) (∂µ s) = 2g 2g 1 − s32 √ If we anticipate that, for small g, s3 ∼ O( g), we can approximate L by L=
1 (∂ν s3 )2 + 1 − s32 (∂ν θ )2 + s32 (∂µ s3 )2 + · · · 2g
(2.84)
Finally, it is convenient to rescale s3 to eliminate the overall factor of g, h(r ) ≡ √ s3 (r )/ g, so 1 1 2 2 2 2 2 (∂ν h) + L= (2.85) − h (∂ν θ ) + gh (∂µ h) + · · · 2 g This expression illustrates the good features of this parametrization of the sphere. As h fluctuates it affects the contribution of the θ field to the action. However, fluctuations in θ do not react back onto h. Now consider the theory with a momentum-space cutoff, | p| < . Introduce a high-momentum slice, < | p| < , and a low-momentum slice, 0 < | p| < , and organize the partition function accordingly, 1 (∂ν θ )2 d2 x Z = Dθ( p) exp − 2g 0< p< 1 1 1 2 2 2 2 2 2 2 2 Dh( p)e− 2 (∂ν h) d x e 2 h (∂ν θ) d x− 2 gh (∂ν h) d x + ··· (2.86) × < p<
For small g the second term in the last exponential can be dropped. Then the integral over h( p) is a Gaussian and can be computed exactly,
2.6 Asymptotic freedom
1
< p<
Dh( p)e− 2
(∂ν h)2 d2 x
1
e2
h 2 (∂ν θ)2 d2 x
33 1
= Ne2
h 2 (∂ν θ)2 d2 x
d2 p 1 2 2 2 d x (∂ν θ ) d x = N exp 2 2 2 (2π ) p 1 2 2 ln( /) (∂ν θ ) d x = N exp 4π
(2.87)
(2.88)
where the overall integration constant N will not be important. On putting this result back into the partition function, we see that the θ term in the effective action is 1 1 1 L = (2.89) − ln( /) (∂ν θ )2 + · · · 2 g 2π On comparing this expression with the θ term in the original action, [1/(2g)](∂ν θ)2 , we find that integrating out high-frequency fluctuations has caused coupling-constant renormalization, 1 1 1 = − ln( /) g g 2π
(2.90)
Although this parametrization of the model does not leave its O(3) symmetry manifest, it is good enough for this leading-order calculation. In fact, it is better to convert this result into a traditional differential equation for the running coupling constant. Let the high-momentum slice be infinitesimal, = + δ, so that 1 1 1 1 ln(/ ) = −ln(1 + δ/) ≈ −δ/ and − dg = d = − g g g g2 (2.91) Therefore, 1 dg = − g2 (2.92) d 2π Or, if we write this in terms of a real-space cutoff, like the lattice spacing a,
dg 1 2 (2.93) = g da 2π These equations state that the theory is asymptotically free. We see that the effective coupling is a decreasing function of the momentum cutoff. Alternatively, in terms of lattice models, we learn that a smaller coupling must be used on a finer lattice to obtain the same low-energy, long-wavelength physics. The long-distance features of the theory would be described by a strongly coupled theory, if we can believe the indications of the weak-coupling calculation done a
34
Background
here. The calculation suggests that the O(3) model should be disordered and have a finite correlation length ξ . Let’s develop this idea more thoroughly. Suppose that an asymptotically free theory has a finite correlation length ξ . In the language of high-energy physics, we would speak of the “mass gap” m, which is inversely related to the length ξ , m = 1/ξ . If the system is actually disordered and correlation-length scaling applies to it, then its correlation functions should be determined by ξ and behave as ∼e−|r −r |/ξ = e−m|r −r | . Since m characterizes the low-energy, longdistance features of the theory, it should be unaffected when we integrate out high-frequency modes in the lattice theory. In other words, the “bare” lattice theory with a momentum cutoff and coupling g should give rise to the same m as the lattice theory with momentum cutoff and coupling g , as long as g is related to g as stated above. It is more convenient to speak of the real-space cutoff a instead of . So, we will consider the couplet (a, g) instead of (, g). By dimensional analysis m can be written as 1 F(g) (2.94) a However, m is a physical quantity that does not depend on the cutoff scheme, so m=
d m=0 (2.95) da which implies that a and g must be related. We have already seen that the relation is given by asymptotic freedom in the case of the O(3) sigma model, and now let’s see that this determines the functional dependence of F(g). Substituting the dimensional-analysis statement for m into the last equation implies that 1 dg F(g) + F (g) =0 (2.96) 2 a da but we have determined how g must depend on the ultraviolet cutoff a. Let’s use standard and general notation for that result, the Callan–Symanzik relation, −
dg = β(g) (2.97) da where β(g) is the Callan–Symanzik function (not to be confused with 1/(kT ) or the critical index for the magnetization, or the many other uses for the symbol β!). Now the differential equation for F becomes a
d F(g) = F(g)/β(g) dg which can be integrated,
F(g) ∼ exp
dg β(g )
(2.98)
(2.99)
2.6 Asymptotic freedom
35
If we choose g 2 1, we can evaluate this integral for the O(3) model, F(g) ∼ exp(−2π /g)
(2.100)
There are several things to note about these curious results. First, the fixed points of the renormalization-group flows g(a) are of central importance. We see that the field theory can be “renormalized” at the fixed point because here F diverges and ξ = 1/m can be a macroscopic, physical length that is independent of the short-distance cutoff a. When we formulate other, more-challenging lattice-field theories we are particularly interested in understanding how their parameters depend on the lattice spacing and we are particularly interested in finding fixed points of the renormalization group because it is only at these points that the long-distance physics “decouples” from the cutoff and it is only at these points that a continuum limit can exist. Second, we need the renormalization-group trajectory g(a) in the vicinity of the fixed point in order to determine the theory’s scaling laws, to show how ξ depends on the coupling, and to understand its physical length scales. Recall that, in the case of the planar model, the scaling law was ξ(T ) ∼ exp[b 1/(T /Tc − 1)] (2.101) and the fixed point of the renormalization group was at a nonzero value Tc = 0. In the case of the planar model, the fixed point separated an ordered phase from a disordered phase and we needed to know the behavior of the model in the immediate vicinity of the transition in order to infer its long-distance physics. For the O(3) model, the phase transition occurs at zero coupling, a special feature of asymptotic freedom, and the model has only a disordered phase. We haven’t really proven this point, of course, since we have done only weak-coupling analysis, but results from numerical studies back up this point of view. In fact, in the case of the O(3) model, numerical searches for other fixed points of the renormalization group have revealed none. Third, the dependence of the correlation length on the lattice coupling g is nonanalytic. The correlation length cannot be obtained by perturbing about weak, vanishing coupling g = 0. There is an instability at g = 0 that dynamically generates a finite correlation length. The weak-coupling calculation suggests that, even if the ultraviolet physics is charaterized by weak interactions, the infrared physics is strongly coupled. This point of view underlies modern thoughts on quark confinement in non-Abelian gauge theories. Those models are also asympototically free, so they are well approximated by free fields at short distances and the parton model works quite well, but at large distances they are strongly coupled, Coulomb’s law is replaced by flux-tube formation and confinement results. There will be much more discussion on these points in the chapters that follow, needless to say!
36
Background
Before turning to other topics, it is interesting to understand why the O(3) model was asymptotically free. This was a crucial result because it led to the scaling law for the correlation length and is an important part of the result that the model has just one phase, a disordered one with a dynamically generated correlation length. When we computed the effective action above by integrating out the high-frequency, small-amplitude fluctuations, the local curvature of the sphere produced an effective spin variable with |s | < 1. To restore the condition that the spin have unit length required a rescaling, s → s /|s |, which was absorbed into a change of the coupling constant, g → g /|s |2 . Therefore, the effective coupling increases as a result of integrating out the high-frequency fluctuations which feel the local curvature of the non-Abelian-group manifold. When we turn to non-Abelian gauge theories, we will again find asymptotic freedom and it will be traced to the non-Abelian self-interactions among the vector gauge fields. Other interactions, such as fermion–gauge couplings, will show the more-familiar screening characteristics, known from quantum electrodynamics, and will oppose asympototic freedom. Recall that, in quantum electrodynamics, fermions with positive charges screen those with negative charges, and the effective coupling at large distances is reduced from one’s classicalphysics expectations. In fact, if there were “too many” colored quarks in QCD, screening would beat out asympototic freedom and, instead of the theory being weakly coupled at short distances and strongly coupled at long distances, it would behave in the opposite fashion, like quantum electrodynamics and Yukawa models. Asymptotic freedom is paramount in understanding Nature. 2.7 Instantons in two-dimensional spin systems Without topological excitations the planar model reduces to a free, infraredsensitive scalar field. The topological excitations, vortices, provide a disordering mechanism that drives a phase transition to a high-temperature disordered phase in which the model has a finite correlation length. The phase transition is particularly interesting because it is driven by vortex condensation, has an essential singularity rather than the power-law singularity of the more-familiar Ising model, and has no global order parameter at any temperature. We will see that the planar model has much in common with phenomena in four-dimensional gauge theories. The O(3) spin model is never well approximated by a free field. Asympototic freedom implies that it is strongly coupled and disordered for all couplings, if, as suspected, there are no additional phase transitions besides that at g = 0. Does this model have topological disordering mechanisms as well? Indeed it does. The model has instantons, named by G. ’t Hooft and discovered by A. M.
2.8 Computer experiments
37
Figure 2.6. The instanton in the two-dimensional O(3) model. Polyakov in this case [7]. These spin configurations, which constitute local minima of the action, carry a conserved topological charge, and disorder the spins over all distance scales from zero to infinity. The instantons can be viewed as vacuum tunneling events in Minkowski space, or as stationary field configurations in the two-dimensional Euclidean formulation of the model. Take the L × L lattice and identify the points on the boundary to compactify the base space into a sphere. Then an instanton represents a mapping of that sphere onto the sphere of the O(3) target space. We know from elementary topology that such maps are labeled by their winding number, which ranges over the positive and negative integers. The winding number is given by 1 Q= s · (∂1 s × ∂2 s) (2.102) 4π A field configuration with Q = 1 and action S = 2π/g is drawn in Fig. 2.6 and is given by the expression 2ρx1 2ρx2 x 2 − ρ 2 (2.103) s = , , x 2 + ρ2 x 2 + ρ2 x 2 + ρ2 As shown in Fig. 2.6, at large distances all spins point “up,” in the center the spin points “down,” and, at intermediate distances x ∼ ρ, the spins are horizontal, pointing toward the center. The parameter ρ sets the size scale of a particular instanton and, in a partition-function formulation of the model, one would encounter an integral over all scale sizes. Clearly the instantons of the O(3) model tend to disorder the spins. Since asymptotic freedom does the same, the instantons here do not bring in any qualitatively new physics. However, we shall see that instantons are very significant in QCD and appear to play a central role in its mass spectrum. 2.8 Computer experiments and simulation methods Another tool for understanding simple statistical-mechanics models is provided by computer simulation. In addition to estimating order parameters, correlation
38
Background
functions, and other thermodynamic quantities, this approach provides a theoretical laboratory in which one can investigate how the model “works.” In other words, we might generate “typical” spin configurations contributing to a partition function and see whether they have special properties, such as topological excitations, and, if they do, then estimate some of their characteristics. To carry out such a program we need an efficient method of generating typical spin configurations. One very general method that applies to systems made of classical variables is use of the Metropolis algorithm. The idea is to replace the computation of the expectation value of a quantity A that depends on all the spins of the system, ! −S A = dθ e A dθ e−S (2.104) which involves a very high dimensional integral, by an average over field configurations θ(r ), ! A(θ(r )) 1 (2.105) A = i
i
in which the configurations θ (r ) are distributed according to the Boltzmann law e−S . This approach to calculating expectation values is called “importance sampling” because it replaces the full multi-dimensional integral by a sample of configurations that are highly likely. We will illustrate importance sampling by discussing the Metroplis algorithm for generating an ensemble of configurations that are distributed according to the Boltzmann weighting. Suppose that we have a trial configuration on a computer. Then the Metropolis algorithm gives a procedure for generating another configuration by replacing one θ (r ) variable at a time. The new configuration replaces the old one if the rules of the algorithm are satisfied. After all the sites have been sampled, one has “swept through” the lattice once. The aim of the algorithm is to generate, after many sweeps, perhaps, a new, independent field configuration θ (r ), which is a new member of the Boltzmann distribution. The value A(θ (r )) can be calculated for this configuration and a contribution to the average, A, recorded. Then the Metropolis algorithm is applied many more times to θ (r ) and another configuration, a new, statistically independent member of the Boltzmann distribution, θ (r ), is generated. Then A(θ (r )) is computed and recorded. This procedure is repeated again and again until A has been estimated to some sufficient, desired accuracy. The Metropolis algorithm consists of the following procedure. We have a configuration in an Ising system, for example. This configuration may but need not be an important member of the Boltzmann distribution for the parameter set of interest. In any case, take the spin variable at a particular site n and flip it. Compute the change in the action, S. If it is less than zero, the new configuration
2.8 Computer experiments
39
is accepted and the old configuration is replaced. If S is positive, the new configuration is accepted with the conditional probability exp(−S). This step can be done by picking a random number x between zero and unity and comparing it with exp(−S): if exp(−S) > x, the configuration is accepted, otherwise it is not. This Metropolis procedure is performed for a particular site and then another site is picked and the procedure continues until all the spins have been considered. This procedure is guaranteed to bring the system into thermal equilibrium and to produce a Boltzmann ensemble. The procedure is “local,” that is, to update a spin at a particular site, one need know only those spins which are directly coupled to it. In the case of the nearest-neighbor coupled Ising model, this is just a group of four nearest neighbors. The procedure is very straightforward and easy to implement. One can even update several spins at a time, as long as they are outside each others’ immediate spheres of influence. Of course, since the procedure is local, it may take many sweeps to generate a statistically independent member of the Boltzmann ensemble. Correlations between configurations in the Markov process must be monitored carefully in order to estimate the statistical errors in the procedure’s estimate for A. Let’s review the argument that states that the Metropolis algorithm works, in the sense that the probability of finding a configuration θ (r ) after N sweeps is PN (θ (r )) N →∞ ∼ e−S(θ(r ))
(2.106)
To do this, consider the relationship between the N th and the (N + 1)th sweeps. Let W (θ → θ ) be the probability of the transition θ → θ . Then the procedure generates a new distribution, PN +1 (θ), given by W (θ → θ )PN (θ ) + 1 − W (θ → θ ) PN (θ ) (2.107) PN +1 (θ ) = θ
θ
[PN (θ )W (θ → θ ) − PN (θ )W (θ → θ )] = PN (θ) +
(2.108)
θ
A stationary distribution will satisfy PNS (θ )W (θ → θ ) = PNS (θ )W (θ → θ )
(2.109)
which is the statement of detailed balance. An important property of the procedure is that it always makes configurations that are closer to satisfying detailed balance, so we are guaranteed that thermalization will occur eventually. To see this, suppose that W (θb → θa ) PN (θa ) > PN (θb ) W (θa → θb )
(2.110)
then, by substituting into the evolution equation for PN , we find PN +1 (θa ) < PN (θa ) and PN +1 (θb ) > PN (θb ).
40
Background Finally, the Metropolis algorithm chooses the transition to be W (θ → θ ) = 1,
if S(θ ) > S(θ )
−(S(θ )−S(θ))
W (θ → θ ) = e
,
if S(θ ) < S(θ )
(2.111) (2.112)
This implies that W (θb → θa ) = e−(S(θa )−S(θb )) W (θa → θb )
(2.113)
and substituting back into the detailed-balance equation gives PNS (θ ) = P eq (θ ) ∼ e−S(θ)
(2.114)
as desired. Note that we never need to calculate the entire action S(θ ) in order to implement the algorithm – all we need are the nearest neighbors of the updated variable. Locality in the action guarantees that the computer time needed for a single sweep of the lattice grows in proportion to its size. The final ingredient in the Metropolis scheme is the specification of the transition, θ → θ . The exact procedure depends on the character of the variables in the configuration, namely whether they are Ising spins, unbounded scalar fields, phase angles, vectors, or matrices. The essential ingredient is that one must choose a scheme that will cover the space of the local variable uniformly. For example, in the case of a model with SU(3) matrices at each site, one might set up a table of random SU(3) matrices and their Hermitian adjoints. Then, in the Metropolis algorithm one might specify a site, change its variable by multiplication by a randomly chosen matrix from the table and proceed with the rest of the algorithm as presented above. This procedure works because one can prove that repeated application of matrices from the table to any SU(3) matrix brings it arbitrarily close to any other member of the group. In fact, one could even bias the table of matrices toward the identity, so that the changes in the configuration are small at each step, thus improving the probability that the change in the matrix at each site is “accepted,” and thus improving the efficiency of the algorithm for dealing with an ordered phase. Classical statistical-mechanics models have been studied in great detail using the Metropolis algorithm and other, similar, algorithms. Many practical problems can arise in these studies. 1. There are considerable correlations between successive configurations in these local procedures, so the lattice must be swept through many times before an independent measurement of the observable A can be made. This problem is particularly severe near a critical point, where the correlation length becomes large. Typically, the number of sweeps needed to generate a new configuration under these conditions diverges as a power of
2.9 The transfer matrix
41
T − Tc . Statistical analyses of the set of configurations generated must be carried out in order to deal with this problem. In the vicinity of a first-order transition there are often metastable states that also slow the Metropolis algorithm’s approach to an equilibrium configuration. Tunneling between metastable states is often seen in computer simulations. Nonlocal, global changes in the configuration can be substituted for the local changes discussed above in order to deal with these problems, but such algorithms must be carefully crafted for the particular model being studied, in order to keep the “acceptance probability” high enough to make the procedure successful. “Cluster algorithms” are of this class and have proven to be very worthwhile for particular, simple models [8]. 2. The Monte Carlo procedure produces statistical estimates √for observables and the uncertainties in the averages fall to zero as 1/ N , where N is the number of independent configurations in one’s equilibrated ensemble, according to the central limit theorem. This character of the procedure must be faced after problem 1 has been dealt with. This point emphasizes how computer-intensive accurate Monte Carlo studies must be. 3. Since the simulations are done on systems of finite size, one must monitor finite-size effects effects carefully, because they can interfere with one’s real interest, the thermodynamic limit of the model. This limitation is particularly severe near a critical point, where the correlation length diverges. Since the critical points are especially important, simulations of a range of lattices are usually needed in order to extract universal features of the model such as critical indices and amplitude ratios. Finite-size scaling or renormalization-group methods may be needed in order to extract critical features of the model reliably. Again, vast computer resources are usually needed in quantitative studies. One of the real challenges in lattice-gauge theory is the simulation of fermion systems. Since Fermi fields do not commute, they represent another hurdle for computer simulations. As we shall discuss later, lattice-gauge theory has excellent, general algorithms for simulating fermions in many, but, alas, not all, environments. 2.9 The transfer matrix in field theory and statistical physics The purpose of this section is to discuss alternative formulations of lattice-gauge theory and statistical mechanics of spin systems that might help in their solutions. It should be clear to the reader that the partition-function formulation of four-dimensional statistical systems is analogous to the Euclidean path-integral formulation of field theories. We have seen already, and will see again as gauge
42
Background
theories are introduced, that the temperature T of the statistical system corresponds to the coupling g of the field-theory problem. High temperature corresponds to strong coupling and low temperature corresponds to weak coupling, all other things being fixed. There are other equally popular and effective formulations of field theories besides the Euclidean path integral. The Hamiltonian formulation, in which one deals with time evolution, a Schr¨odinger-like equation, operator equations of motion and matrix elements, etc., is very important. How is this formulation related to those we have already seen? If we begin with the Ising model in two dimensions, then it is traditional to formulate it on a symmetric lattice. In the light of universality, other cutoff procedures are equally valid if one’s real interest is in the critical behavior of the model. In lattice-gauge theory we shall be interested in the continuum limits of cutoff systems where the correlation length grows large relative to the system’s lattice spacing, so we must concern ourselves with the systems’ critical points and scaling laws. Since the scaling laws are universal, they should be the same for any lattice formulation of a given system as long as we use the same degrees of freedom and local couplings in each case. This gives us tremendous freedom. The lattice could be very, very anisotropic and the critical behavior, or, equivalently, the continuum limit in the language of field theory, would be unchanged. In fact, the Hamiltonian version of a four-dimensional Euclidean lattice-gauge theory uses a spatial lattice as a continuum cutoff, but treats ‘time’ as a continuous variable, so that a Schr¨odinger equation and a system of equal time-commutation relations can be set up. The goal of this section is to illustrate how this works in simple cases. Since the physics of interest is independent of the lattice, the ultraviolet cutoff, there is enormous freedom here that we should learn to use to our advantage.
2.9.1 The simple harmonic oscillator A great deal of the mathematics and physics we wish to illustrate comes up in the context of nonrelativistic quantum mechanics, so we start here. We want to find the relation between the Feynman path integral and the Schr¨odinger equation for a one-dimensional-potential model problem. Consider the Lagrangian for a one-dimensional simple harmonic oscillator, L = 12 (x˙ 2 − ω2 x 2 )
(2.115)
In the path-integral formulation of quantum mechanics, one considers the probability amplitude that the particle is initially at the spacetime point (xa , ta ) and ends up at the spacetime point (xb , tb ). The amplitude for this transition is
2.9 The transfer matrix postulated to be
i Z= exp – Sm h paths
43
(2.116)
where the sum is over all world lines between the initial and final points and Sm is the Minkowski-space action for a particular path, tb Sm = L dt (2.117) ta
Therefore, each path contributes to the transition amplitude through a weight exp[(i/ h) L dt]. There are two challenges in this formulation of quantum mechanics. First, we must clarify what the “sum over all paths” means in quantitative terms. Next, we need to pinpoint the important paths in the sum and deal with the constructive and destructive interference expected among the complex contributions of individual paths, exp[(i/ h) L dt]. To begin, introduce a temporal lattice and call the spacing between time slices , ti+1 − ti =
(2.118)
Next, formulate the problem in imaginary time, so that the oscillating weights for individual paths become damped, real numbers: t = −iτ
(2.119)
Now the Minkowski-space action becomes, continued to imaginary time τ , # " 2 dx 1 i 2 2 (2.120) + ω x dτ –h Sm = − 2–h dτ To make contact with conventional statistical physics with its positive partition function, with positive weights, define the Euclidean action, # " 2 dx 1 2 2 S= (2.121) + ω x dτ 2 dτ For discrete time slices, the action is approximated by " # xi+1 − xi 2 2 2 S= + ω xi 2 i and the path integral becomes Z=
∞
−∞
i
1 dxi exp − – S h
(2.122)
(2.123)
44
Background
It is interesting to interpret these formulas before proceeding to the transfer matrix. The path integral and the discrete action define a statistical mechanics problem in one dimension. The lattice sites are labeled with an index i and on each site there is a variable xi that can vary from −∞ to +∞. The action couples nearest-neighbor to the sum sites and each configuration {xi } contributes – over configurations, dxi , through a positive weight, exp[(−1/ h)S]. We can interpret this weight as a Boltzmann factor if we identify –h and temperature T . This is a useful correspondence because, in statistical physics, temperature is a measure of fluctuations through the heat bath and in quantum mechanics –h is a measure of fluctuations through the uncertainty relation. In quantum mechanics the theoretical limit –h → 0 picks out the unique classical-physics path, with fluctuations suppressed to zero. In statistical physics the T = 0 case produces a unique, completely “frozen” configuration. Now let’s return to the theme of this discussion, the development of the transfer matrix. This will replace the one-dimensional classical statistical-physics problem with a zero-dimensional quantum problem by viewing the sums over configurations on the temporal axis labeled by sites i as quantum evolution from one state at i to the next at i + 1. The nearest-neighbor coupling is important here because it allows us to write the partition function as Z= [dxi T (xi+1 , xi )] (2.124) i
where
1 1 1 1 2 (2.125) + ω2 xi2 (xi+1 − xi )2 + ω2 xi+1 T (xi+1 , xi ) = exp − – 2h 2 2
Since T (xi+1 , xi ) has the same form for any two neighboring sites, we can write it as a matrix element of an operator, the transfer matrix. Set up a quantum basis at each site. Let there be position and momentum operators xˆ and pˆ with the commutator [ p, ˆ x] ˆ = −i–h. Eigenstates of xˆ satisfy x|x ˆ = x|x
(2.126)
x |x = δ(x − x)
(2.127)
are normalized,
and are complete,
1=
dx |xx|
(2.128)
Now we can construct the operator Tˆ with the matrix elements x |Tˆ |x = T (x , x)
(2.129)
2.9 The transfer matrix
45
where T (x , x) is given by Eq. (2.125). Identify Tˆ as the time-evolution operator of quantum mechanics (imaginary time) evaluated over the interval . Now Z can be written in the simpler form, using completeness, Z = dxi xi+1 |Tˆ |xi =
i
xb |Tˆ |xn−1 dxn−1 xn−1 |Tˆ |xn−2 · · · dx1 x1 |Tˆ |xa
= xb |Tˆ n |xa
(2.130)
where n − 1 is the number of τ slices between the τa and τb initial and final conditions. If we impose periodic boundary conditions and sum over all possible initial conditions, we have Z = tr Tˆ n
(2.131)
Finally, we need an explicit expression for the transfer matrix. Recalling the matrix element expressing the fact that the Fourier transform of a Gaussian is another Gaussian, − 1 pˆ 2 1 2 2 (2.132) x |e |x ∼ exp − – 2 (x − x) 2 h we can derive the exact result, 1 1 1 2 2 2 2 2 ˆ T ∼ exp − – 2 ω xˆ exp − – pˆ exp − – 2 ω xˆ (2.133) 2h 4h 4h Since xˆ and pˆ do not commute, the operator ordering here is important. This expression for the evolution operator is probably not familiar. However, if we take the “time-continuum” limit and expand the exponentials, the operator ordering is unimportant to lowest order in , → 0, and we easily identify the Hamiltonian of the harmonic oscillator, Tˆ → 1 − – Hˆ (2.134) →0 h where Hˆ = 12 ( pˆ 2 + ω2 xˆ 2 )
(2.135)
Furthermore, the Hamiltonian Hˆ and the path integral produce the Schr¨odinger equation. The Feynman interpretation of the path integral Z means that, if the wavefunction of the particle is ψ(x, τ ), then, at a later time, ψ(x , τ ) = Z (x , τ ; x, τ )ψ(x, τ ) dx (2.136)
46
Background
Choosing τ − τ = to be infinitesimal, the path integral simplifies to ˆ ˆ Z = x |T |x = x | exp − – H |x h (2.137) ≈ x |1 − – Hˆ |x = δ(x − x) − – H δ(x − x) h h Then Eq. (2.136) becomes (2.138) ψ(x , τ ) = ψ(x, τ ) − – H ψ(x , τ ) h or, –h ∂ ψ(x, τ ) = −H ψ(x, τ ) (2.139) ∂τ which is the Euclidean version of Schr¨odinger’s equation. H in this equation is just the x-space realization of the Hamiltonian. 2.9.2 The transfer matrix for the Ising model These ideas generalize to field theories in higher dimensions. One can establish correspondences between mass gaps and correlation lengths, free energies and vacuum energies, and correlation functions and composite propagators in statistical physics and field theory [9]. Here we will work through a specific example because it will bring up issues of immediate relevance to QCD in extreme environments. Consider the two-dimensional Ising model again, but let the couplings in the two directions be independent, S=− [βτ σ (n + τˆ )σ (n) + βσ (n + x)σ ˆ (n)] (2.140) n
where we have chosen one direction τˆ as the temporal direction and the other xˆ as the spatial direction. The variables σ (n) are classical Ising variables, with the two possible values of ±1. We are interested in constructing this model’s transfer matrix, taking the τ continuum limit and finding the Hamiltonian and then understanding its phases from this perspective. This will give us an interesting and illuminating series of insights into the model with very little effort. Since Hamiltonian lattice-gauge theory is an important formulation of QCD, especially in its physical content, this exercise will provide good background for later developments. To begin, write the action in the form that makes the transfer matrix easier to identify, S= {βτ [σ (n + τˆ ) − σ (n)]2 − βσ (n + x)σ ˆ (n)} (2.141) n
2.9 The transfer matrix
47
which differs from our original expression by an unimportant constant. Next consider two neighboring spatial rows and label the spins in one row σ (m) and those in the next row s(m). The action can now be written as a sum over these rows, S= L(n 0 + 1, n 0 ) (2.142) n0
where 1 1 [s(m) − σ (m)]2 − β [σ (m + 1)σ (m) + s(m + 1)s(m)] L = βτ 2 2 m m (2.143) In setting up the transfer matrix note that, if there are M sites on each spatial row, then there are 2 M spin configurations. Therefore, the transfer matrix will be a 2 M × 2 M matrix. Let’s consider some of its elements, with an eye toward the τ -continuum limit. For a diagonal element, σ (m) = s(m) and Eq. (2.143) reduces to, σ (m + 1)σ (m) (2.144) L(0 flips) = −β m
and, if there are n spin flips between the rows, 1 L(n flips) = 2nβτ − β [σ (m + 1)σ (m) + s(m + 1)s(m)] (2.145) 2 m Next we must determine how the parameters βτ and β must behave so that the transfer matrix has the form Tˆ ≈ 1 − τ Hˆ
(2.146)
in the τ -continuum limit. To do this, consider several matrix elements, σ (m + 1)σ (m) ≈ 1 − τ Hˆ 0 flips Tˆ (0 flips) = exp β m
# 1 Tˆ (1 flip) = exp(−2βτ ) exp β σ (m + 1)σ (m) + s(m + 1)s(m) 2 m ≈ τ Hˆ 1 flip
"
(2.147)
and we read off these equations that β ∼ τ and exp(−2βτ ) ∼ τ . This means that β and exp(−2βτ ) are proportional and we are free to define the temporal
48
Background
variable τ as τ = exp(−2βτ )
(2.148)
and introduce a coupling λ so that β = λτ
(2.149)
We learn from these details that, in order to define a smooth τ -continuum theory, the couplings must be adjusted so that the temporal coupling grows large while the spatial coupling becomes weak. If we rewrite Eq. (2.147) using the variable τ , we can identify the quantum Hamiltonian version of the transfer matrix, Tˆ (0 flips) ≈ 1 + τ λ σ (m + 1)σ (m) ≈ 1 − τ Hˆ 0 flips m
Tˆ (1 flip) ≈ τ ≈ Hˆ 1 flip
(2.150)
with multiple spin flips vanishing in the τ -continuum limit. This simplification of the model will prove to be very useful and insightful. To write Hˆ in operator form, we need to set up a quantum basis of states at each site and we need the operators that implement transitions between those states. We represent spin “up” and spin “down” at site m as column vectors, 1 0 |up = , |down = (2.151) 0 1 Then we place a Pauli matrix σˆ 3 on each site to measure whether a spin is up or down and we place a σˆ 1 on each site to flip that spin, 1 0 0 1 , σ1 = (2.152) σ3 = 0 −1 1 0 This notation has been chosen so that the Hˆ expression is particularly elegant, Hˆ = − σˆ 1 (m) − λ σˆ 3 (m + 1)σˆ 3 (m) (2.153) m
m
It cannot be stressed too much that we are interested in just the universal features of the Ising model in this case, and the universal features of latticefield theories in general. So, the precise relations between the parameter λ and the couplings of the anisotropic Ising model, βτ and β, are not primary. The qualitative character of λ is important, however. It is clear from its derivation that 1/λ is a measure of temperature. The τ -continuum Hamiltonian formulation of the Ising model is “hot” when λ is small and “cold” when λ is large. More
2.9 The transfer matrix
49
detail on the connection between the isotropic and the anisotropic Ising models can be found in the literature [1]. 2.9.3 Self-duality and kink condensation through the eyes of the transfer matrix It is interesting and relevant to the search for new types of condensates in QCD in extreme environments to discuss the physics of the Ising model from the perspective of the τ -continuum limit. This search is aided by finding an operator interpretation of the self-duality property of the Ising model. In this discussion we will be using operators everywhere, but we will drop the “hats” from the symbols for notational simplicity. To begin, construct the system dual to Hˆ given above. Sites of the original lattice will be associated with links of the dual lattice and vice versa. We can think of the dual lattice as another one-dimensional array of sites that is shifted by half a spatial lattice length relative to the original lattice. Define operators µ1 and µ3 on the dual lattice, µ1 (n) = σ3 (n + 1)σ3 (n), µ3 (n) = σ1 (m) (2.154) m
So, µ1 (n) tells us whether spins on adjacent sites are aligned, and µ3 (n) flips all the spins to the left of n. These dual operators have several crucial properties. • They satisfy the same Pauli spin algebra as σ1 and σ3 . • The Hamiltonian retains its form when it is written in terms of the dual variables, H (σ ; λ) = λH (µ; λ−1 ). To check the first point, recall that the Pauli matrices σ1 and σ3 anti-commute on the same site and commute otherwise, and the square of any Pauli matrix is the identity. To check that µ1 (n)µ3 (n) = −µ3 (n)µ1 (n)
(2.155)
we note that, when it is written out in terms of the original Pauli matrices, only one factor of σ1 and one factor of σ3 are on the same site, and their anticommutation produces the desired result. The fact that µ1 and µ3 commute on different sites follows from the fact that an even number of σ1 ’s and σ3 ’s is interchanged when µ1 and µ3 are interchanged. To establish the second point, we note that σ1 (m) = µ3 (m + 1)µ3 (m) so the Hamiltonian can be written
(2.156)
50
Background H =−
µ3 (n)µ3 (n + 1) − λ
n
=λ −
µ1 (n)
n
µ1 (n) − λ−1
n
µ3 (n)µ3 (n + 1)
(2.157) (2.158)
n
which is the desired expression of the self-duality of the model. Note that this result implies that each eigenvalue of the Hamiltonian satisfies E(λ) = λE(λ−1 ), which relates high- and low-temperature properties of the model. In particular, apply it to the theory’s mass gap, which vanishes at any critical point the theory might have. If we suppose that the theory has just one critical point, we learn that it must lie at λc = 1
(2.159)
Explicit calculations on the model confirm this result. In particular, perturbationtheory expansions of the mass gap M(λ) in the coupling λ are easy to carry out to high orders [9] and produce the result M(λ) = 2|1 − λ|
(2.160)
exactly. In other words, all the perturbation-theory graphs vanish beyond second order, and an exact result emerges. The result determines the correlation-length critical index ν exactly, ν = 1. This success illustrates the power and use of universality. The mass gap determines the propagator in the model which corresponds to the correlation function in the isotropic formulation of the model. By dimensional analysis, the mass gap corresponds to the reciprocal of the correlation length of the isotropic model, so the linear vanishing of the mass gap in the vicinity of the critical point translates into the linear divergence of the correlation length as the critical point is approached [9]. In many field-theory applications, it is advantageous to use a Hamiltonian rather than a Euclidean path integral. Further analysis of the quantum Hamiltonian gives the other critical indices of the traditional Ising model. In fact, it is elementary to solve the quantum Hamiltonian version of the Ising model explicitly [9]. We end this discussion of the Ising model with some observations about order–disorder transitions, kink condensation, and the magnetization. The magnetization M = σ3 serves as the global order parameter for the Ising model. It is nonzero at low temperature, vanishes continuously as the temperature passes through the critical point with a critical index of 18 , and remains identically zero for all higher temperatures. The magnetization indicates that the “up–down” global symmetry of the system’s Hamiltonian is spontaneously broken in the low-temperature phase. The self-duality of H allows us to view the magnetization and order in the low-temperature phase from a different, interesting perspective. Place the system
2.9 The transfer matrix
51
into an external magnetic field h and describe it with the perturbed Hamiltonian 1 H +h σ3 (n) λ n
(2.161)
Apply the duality transformation to this expression and find 1 σ3 (n) = H (µ; λ−1 ) + h µ1 (m) (2.162) H +h λ n n m
n
m
where |0λ is the vacuum of H (σ ; λ) and |0λ−1 is the vacuum of H (µ; λ−1 ). Sinceσ1 and σ3 are equivalent to µ1 and µ3 , this equation tells us that the operator m
m
It follows from the self-duality of the model that the computation of the kink’s mass in powers of λ−1 is identical to that of the flipped spin states’ mass in powers of λ. Therefore, the mass gap formula, 2|1 − λ| applies to the kink in the low-temperature phase. Since the high-temperature phase of the the model is a “kink condensate,” it provides an illuminating interpretation of the short-range character of the the spin–spin correlation function in this phase. Since the disorder operator flips spins over an infinite region of space, a condensate of kinks randomizes the spin variables and leads to short-range correlations in this variable. The kinks have a topological significance. Applying a kink operator to the lowtemperature ground state produces a state that interpolates between the two
52
Background
degenerate ground states of the low-temperature phase. Kinks alter the boundary conditions of the system. In fact, by inspecting the extremities of the spatial lattice one can determine whether there is an even or odd number of kinks in the system. The concepts of topological disorder and operator condensation are very important in field theory, statistical physics, and applications to QCD in extreme environments.
3 Gauge fields on a four-dimensional euclidean lattice
3.1 Lattice formulation, local gauge invariance, and the continuum action In its standard formulation, lattice-gauge theory replaces the continuum spacetime of Euclidean quantum-field theory with a four-dimensional lattice of spacetime points and links. If the lattice is hypercubic, then its lattice spacing a provides an ultraviolet cutoff. Although such a spacetime cutoff is not convenient in most analytic calculations, it has several extraordinary advantages. First, it allows strong-coupling calculations both through expansions and by simulation methods. These approaches provide a framework for formulating and developing physical pictures of confinement, chiral-symmetry breaking, and other nonperturbative phenomena that are so challenging in ordinary continuum formulations of quantum-field Theory. The various fields of the lattice version of quantum chromodynamics exist on the sites and links of the regular lattice. In the standard formulation, the fermion fields reside on the sites and the gauge fields reside on the links [10]. The fermion fields, matter fields in general, carry the color quantum number of the gauge group SU(3). It is convenient, therefore, to imagine that there is a color frame of reference at each site so that the fermion field with color index α, ψα , can be visualized. (This geometrical point of view dates back to the original Yang–Mills paper which introduced non-Abelian gauge fields into high-energy physics when they generalized the notion of isospin symmetry from the familiar global symmetry of nuclear physics to a local symmetry of high-energy field theory [11].) The gauge fields are then the connections in this network of color coordinate systems and with them one can implement the notion of local colorgauge invariance in a precise and elegant fashion. Let’s illustrate the lattice approach with a construction of the pure Yang–Mills field and action. To keep the algebra simple we will specialize to SU(2) fields, but the generalization to SU(3) and, in fact, SU(N ) will be evident. 53
54
Gauge fields
Uµ(n) = exp(iBµ(n)) n+µ
n
Figure 3.1. The link variable on a four-dimensional Euclidean lattice. Consider a four-dimensional hypercubic Euclidean lattice with a spacing a. On each link place an SU(2) matrix, as shown in Fig. 3.1, Uµ (n) = exp(iBµ (n))
(3.1)
where Bµ (n) = 12 agτi Aiµ (n) = 12 ag τ · Aµ (n) and τ consists of the three Pauli matrices. Each link carries a direction, (n, µ) with µ = 1, 2, 3, or 4 and connects site n with site n + µ. If we denote a link in the backward direction, we associate with it Uµ−1 (n), U−µ (n + µ) = Uµ−1 (n)
(3.2)
We note that the [Uµ (n)]i j are SU(2) rotation matrices. Following Yang and Mills again, we imagine a color frame of reference at each site of the lattice approximation to continuum spacetime. Uµ (n) is defined to be the local rotation that takes a color frame of reference at site n to that at site n + µ. We want to write down a system of rules of dynamics, an action in particular, which will be invariant with respect to the orientations of these color frames. The symmetry should be a local one – if a color frame is rotated at one site, then the action and “physics” in general should be unaffected. The SU(2) color symmetry will become a local, gauge symmetry. A rotation in color space can be done at each site with an SU(2) rotation matrix, 1 G(χ (n)) = exp −i τ · χ(n) (3.3) 2 When G acts on the sites of the lattice, the link variables (“connections”) should transform as Uµ (n) → G(n)Uµ (n)G −1 (n + µ)
(3.4)
because they relate the color frames at sites n and n + µ. Note the n and n + µ in this equation, and the local character of the symmetry group. Writing out the transformation law with indices gives 1 −i 1 τ · χ(n+µ) [Uµ (n)]i j → kl e−i 2 τ · χ(n) e 2 [Uµ (n)]kl (3.5) ik jl
3.1 Lattice formulation
55
Our challenge is to construct a local action, made out of gauge variables Uµ (n), for which this local symmetry transformation is an exact symmetry. A little thought and experimentation convinces us that the action can be built only out of the products of U matrices taken around closed paths. These quantities are locally gauge-invariant because the color indices are all contracted into local group singlets. On a four-dimensional Euclidean lattice, the simplest and shortest closed paths are elementary squares (“plaquettes”). So a candidate action reads S=−
1 2g 2
n,µ,ν
tr Uµ (n)Uν (n + µ)U−µ (n + µ + ν)U−ν (n + ν) + h.c. (3.6)
We shall check that the prefactor, −1/(2g 2 ), is needed in order to establish conventional notation. We need to check the following attributes of this action. 1. Its classical continuum limit ( a → 0 ) reproduces ordinary Yang–Mills theory. 2. Its strong-coupling limit confines quarks. Once these exercises have been done, we shall have a candidate formulation of gauge-invariant dynamics that can be used to study the self-interactions of non-Abelian gauge fields for all cutoffs a and all couplings g. There are many other hurdles that the lattice action must jump over before it becomes of real physical interest, but these two constitute a necessary and nontrivial beginning in those more-demanding discussions. To take the classical continuum limit, we presume that the gauge fields are slowly varying on the length scale of the cutoff. Then we can Taylor expand the fields Bµ (n) in the action, Bν (n + µ) = Bν (n) + a ∂µ Bν (n) + O(a 2 ) B−µ (n + µ + ν) = −Bµ (n + ν) = −[Bµ (n) + a ∂ν Bµ (n)] + O(a 2 ) B−ν (n + ν) = −Bν (n)
(3.7) (3.8) (3.9)
Now the expression for the action can be approximated, UUUU ≈ eiBµ ei(Bν +a ∂µ Bν ) e−i(Bµ +a ∂ν Bµ ) eiBν
(3.10)
Finally, we use the operator identity 1
ex e y = ex+y+ 2 [x,y]+···
(3.11)
56
Gauge fields and, after some easy algebra, UUUU ≈ eia(∂µ Bν −∂ν Bµ )−[Bν ,Bµ ]
(3.12)
However, using the definition Bµ (n) = 12 agτi Aiµ (n) ≡ ag Aµ (n)
(3.13)
we can identify the conventional Yang–Mills field strength Fµν = (∂µ Aν − ∂ν Aµ ) − ig[Aν , Aµ ] in the action, UUUU ≈ eia
2gF µν
(3.14)
where the corrections in the exponent are of higher order in a 2 and will not contribute to the classical continum limit. In particular, we consider only smooth, classical fields that have long wavelengths on the scale of the lattice cutoff a, a 2 g Fµν 1
(3.15)
so then we can simplify the action, tr UUUU ≈ tr eia
2gF µν
2 = tr 1 + ia 2 g Fµν − 12 a 4 g 2 Fµν + ···
2 ≈ tr 1 − 12 a 4 g 2 tr Fµν + ···
(3.16) (3.17)
where tr Fµν = 0 because the trace of a generator vanishes. Finally, we should 2 write tr Fµν in terms of the vector potential Aiµ , 2 tr Fµν = tr 12 τi ∂µ Aiν − ∂ν Aiµ − gkli Akµ Alν × 12 τi ∂µ Aiν − ∂ν Aiµ − gk l i Akµ Alν
(3.18)
where we used [τi , τ j ] = 2iki j τk . Finally, using tr τi τ j = 2δi j , we have 2 tr Fµν =
1 2
∂µ Akν − ∂ν Akµ − gki j Aiµ Aνj
2
(3.19)
which is the square of the gauge-covariant field-strength tensor. Now the action can be written using continuum notation, 2 1 1 S= 2 ∂ Ak − ∂ν Akµ − gki j Aiµ Aνj + O(a 2 ) (3.20) 2 µ ν 2g
Confinement and strong coupling
57
where we used the familiar replacement, n,µν
→
d4 x a 4 µν
(3.21)
which is adequate for smooth fields that vary little on the scale of the lattice spacing a, and a factor of 2 has appeared from the Hermitian conjugate. Now, collecting everything, and naively taking the lattice spacing to zero, i 2 1 d4 x Fµν (3.22) S= 4 which we identify as the usual Euclidean action of the classical Yang–Mills approach. This result has several important properties. 1. The final result is Euclidean O(4) invariant. This is crucial for physical applications. The result is familiar and expected: the analysis was done only for fields that vary negligibly on the scale of the lattice spacing a, for which differences between the cubic and full rotational symmetry are significant. Therefore, classically one expects that “full rotational symmetry will be a property of the continuum limit of the lattice physics.” If one pursued the O(a 2 ) corrections to the lattice action, one would indeed find dependences on the difference between four-dimensional cubic and O(4) symmetry. 2. The final result has the standard gauge invariance of traditional Yang– Mills theory and is proportional to the square of the gauge-covariant i field-strength tensor, Fµν . The local implementation of gauge invariance through elements of the gauge group in the lattice construction guaranteed that the classical continuum limit had the local invariance described by the algebra of the gauge group. There is a great deal more to be said about the continuum limit of the lattice theory in the quantum case. We will return to this subject after we have discussed further properties of the cutoff lattice theory. The major issues center on the construction of operators, products of fields, that can be written down and could contribute to the the action and be compatible with the spacetime and other symmetries the theory possesses. 3.2 Confinement and the strong-coupling limit One of the most interesting and important features of strong-coupling latticegauge theory is the property of confinement. We will learn that the pure
58
Gauge fields
gauge-field dynamics are highly nonlinear and the force between a widely separated quark and anti-quark rises linearly with the distance between them. The non-Abelian version of Gauss’ law implies that one unit of flux emanates from the quark and is absorbed by the anti-quark. Unlike in weak-coupling quantum electrodynamics, the flux does not spread out in the typical dipole spatial pattern, but rather forms a flux tube that is long and thin. How do we see this behavior in Euclidean lattice-gauge theory? ¯ pair at spaceWe begin with a Gedankenexperiment. Imagine creating a QQ time point xµ = 0 and adiabatically separating them to a relative distance R. We ¯ pair back hold this configuration for a time T → ∞. Finally, we bring the QQ together and let them annihilate. The Euclidean amplitude for this process is the ¯ matrix element of the Hamiltonian evolution operator e−H T in the state |QQ ¯ pair are a distance R apart. Note that we are in which the members of the QQ ignoring the contribution to the amplitude coming from separating and rejoining ¯ pair because we are interested in the leading behavior the members of the QQ of the amplitude as T → ∞. The answer can be expressed as a path integral, ¯ −H T |QQ ¯ = QQ|e
D Aaµ
−S+ig
Aaµ Jµa d4 x
e
D Aaµ e−S
(3.23)
where we have written the functional integral without concerns about gauge fixing because they do not effect this generic argument and because the lattice form of this construction is manifestly gauge-invariant and finite, as we will discuss below. The current Jµa should describe the closed worldline C of the ¯ pair. Then our expression for the amplitude creation and annihilation of the QQ simplifies to ¯ QQ|e
−H T
¯ = |QQ
D Aaµ
−S+ig
e
C
Aaµ 12 λa dxµ
D Aaµ e−S
(3.24)
¯ represents a static QQ ¯ pair whose memNow, use the fact that the state |QQ ¯ = ¯ −H T |QQ bers are a distance R apart, so H is purely potential and QQ|e −V (R)T e , which produces the heavy quark potential V (R). Taking the logarithm of our expression gives V (R) = − lim
T →∞
1 ln trP eig T
C
Aaµ 12 λa dxµ
/tr 1
(3.25)
where P (“path-ordered”) guarantees that the operator ordering along the closed path is maintained.
Confinement and strong coupling
59
Our last expression contains the famous “loop-correlation” function which is so basic to this subject, a 1 a Z (J ) = trP eig C Aµ 2 λ dxµ (3.26) which serves as an order parameter of pure gauge-field dynamics, as we shall see shortly. Let’s calculate the loop-correlation function in the strongly coupled lattice version of pure gauge-field dynamics. Clearly such a calculation gives us the heavy-quark potential and provides important insights into strong-coupling gauge-field dynamics. The loop-correlation function is the path-ordered product of the exponential phase factors constructed out of the gauge-field vector potential. These exponential factors indicate the locally gauge-invariant coupling of the colored heavy quark with the colored gauge fields. Similar factors but without the non-Abelian color matrix should be familiar to the reader from quantum electrodynamics – such phase factors are needed in order to make point-split fermion bilinears gauge-invariant and were introduced by J. Schwinger in the earliest days of quantum-field theory. Let’s briefly review this construction since it is so central to lattice gauge theory, confinement, and the axial anomaly. Schwinger began his discussion with the current operator in quantum electrodynamics, ¯ Jµ (x) = ψ(x)γ µ ψ(x)
(3.27)
However, when quantum-field-theory calculations are done using this bilinear form, singularities that must be interpreted and handled with care can appear. To isolate these singularities, Schwinger introduced the point-split version of the quantum electromagnetic current operator, ¯ + )γµ ψ(x) Jµ (x, ) = ψ(x
(3.28)
where is a four vector that must vanish at the end of a physical calculation. The point-split character of this operator causes new problems in a gauge theory like quantum electrodynamics. Recall that, under a local gauge transformation in which the vector potential transforms as Aµ (x) → Aµ (x) + ∂µ (x), a fermion field of charge e transforms through a phase ψ(x) → eie(x) ψ(x)
(3.29)
Clearly our point-split form for the current Jµ (x) is not gauge-invariant. However, Schwinger realized that it could be modified with the addition of a gaugefield phase, ¯ + )γµ eie A(x) · ψ(x) Jµ (x, ) = ψ(x (3.30)
60
Gauge fields
which makes the operator locally gauge-invariant, as desired. It was with this operator that Schwinger originally discussed the axial anomaly of quantum electrodynamics. Schwinger’s discussion has a great deal in common with our earlier discussion of local gauge invariance in lattice-gauge theory. In the non-Abelian version of his discussion, the fermion fields carry a color index ψa (x) and, when a local rotation in color space occurs at x, the Fermi field rotates accordingly, ψa (x) →
b
eig(x) · τ
ab
ψb (x)
(3.31)
We imagine a color frame of reference at each spacetime point, and local gauge transformations as implementing rotations on these coordinate systems. The relative orientation of these coordinate systems is provided by the vector gauge fields and two color coordinate systems separated by a four vector are 1 related by the rotation eig 2 τa Aa (x) · . The construction of a gauge-invariant color bilinear fermion current is now clear, generalizing Schwinger’s work from quantum electrodynamics. In all of this we see again how exponentials of the color vector potential become the building blocks of the gauge-invariant dynamics. Now we can return to the loop-correlation function and write down its lattice realization. The lattice version of the loop-correlation function is clearly
Z (C) =
Uµ (n)
(3.32)
C
where C is some closed path of links on the four-dimensional spacetime lattice. Now we can ask for the behavior of the expectation value of Z (C) in the strongly coupled lattice theory. We need to calculate −S Z (C) = [dUµ (n)] Uµ (n)e [dUµ (n)]e−S n,µ
C
(3.33)
n,µ
We want to confirm that the strongly coupled theory confines quarks through a linear potential and that the gauge field forms a flux tube between the quark and its anti-quark partner. To evaluate the expression for Z (C) we need to evaluate the sum over all the gauge-field configurations, i.e. the integrals n,µ [dUµ (n)]. We could parametrize the group with generalized angles, but all we will need are two properties of these group integrals, called Haar measures in real-analysis texts.
Confinement and strong coupling The first is that these integrals are finite, [dU ] = 1
61
(3.34)
where we have dropped the index µ for convenience. The second is the group property, [dU ] f (U ) = [dU ] f (UoU ) (3.35) where Uo is an arbitrary element of the group and f is a sensible function. Two integrals that we need are [dU ]Ui j = 0 (3.36) which states that the average over the group of a group element vanishes – there is no group-singlet contribution in Ui j . Next we need the integral over the product of two link variables, 1 † [dU ]Ui j Ukl = δil δ jk (3.37) 2 which one can easily check by using the group invariance of the Haar measure and the unitary character of each link variable (this determines the factor of 12 ). Finally, we are interested at this moment only in the strong-coupling case, β = g −2 1, so we can expand each factor of e−S = P e−β tr UUUU ≈ 1 − β tr UUUU in the matrix element of the loop-correlation function CUµ (n). Take a rectangular loop C for clarity and imagine how we could obtain a nonzero contribution to Z (C). If a term in the expansion of Z (C) in powers of β has a single U factor on a particular link, its contribution to the matrix element will be zero. It is clear that factors of β tr UUUU from the expan sion of the action must include each link of the loop operator CUµ (n) so that each link on the curve C has a U and a U † that can average to a nonzero value. Considerations such as these lead us to the conclusion that the first nonzero contribution to Z (C) must have a factor of β tr UUUU for each plaquette making up the interior of the rectangle C. Some algebra and counting then gives the result 1 NC Uµ (n) = = exp(− ln g 2 · Area) (3.38) 2 g C
62
Gauge fields
where NC is the number of plaquettes making up the rectangle bounded by the contour C. Since the contour is a rectangle of base R and height T , we can read off the heavy-quark potential here, √ V (R) = σ |R| (3.39) and identify the string tension in strong coupling, √ σ = ln g 2 + · · ·
(3.40)
This little calculation also illustrates that the excess energy density in the gauge-field configuration is in a flux tube between the quark and the anti-quark. A time slice of the RT contour shows that the extra gauge-field flux and energy are on the line between the sources and sinks of color flux. This calculation can be extended to a power series in the coupling β and one can argue, on the basis of counting diagrams, that the radius of convergence of this strong-coupling calculation is nonzero. This means that there is a finite extent in the variable 1/g 2 where confinement, namely a linearly rising heavyquark potential, is a true property of the theory. Of course, this tells us little about the continuum limit of the theory. There will be more about that challenging topic in the chapters ahead. This famous calculation produces the “area” law for the loop-correlation function. We can label the phase of the large-g 2 region of the lattice theory as “confining.” In this case the loop-correlation function falls off rapidly with the dimensions of the loop C and the phase is referred to as “disordered.” What are other possible, physical behaviors for the expectation value of the loop-correlation function? In a free-field theory in which all the U matrices are unity, we have the “perimeter law” Uµ (n) = exp(−2mT ) (3.41) C
and we recognize the heavy-quark potential V (R) = 2m, which is just the sum of the masses of the quark and anti-quark. Another possibility is given by the single-photon exchange expected in weak-coupling electrodynamics. In this case a perturbation-theory calculation gives Uµ (n) = exp[−αT /(4π R)] (3.42) C
where α = e2 /(4π ) and we have Coulomb’s law V (R) = −α/(4π R). Naturally, this possibility is called the “Coulomb phase.”
Confinement mechanisms
63
3.3 Confinement mechanisms in two and four dimensions: vortex and monopole condensation There are several lattice models and continuum field theories in which confinement can be illustrated in great detail. We will consider one such model because it is particularly simple, and yet brings up many important issues. We shall also be able to build on our work with the planar model to address confinement and screening. Although most researchers believe that QCD confines quarks, the mechanism for this “miracle” is a matter of debate. Is it due to monopole loops hidden in non-Abelian configurations, vortex sheets, etc? The model we will consider is the electrodynamics of the planar model and the action reads S = 2κ 2
cos(µ θ (r ) + Bµ (r )) +
r,µ
1 cos(µ Bν (r ) − ν Bµ (r )) 2g 2 a 2 r,µ,ν (3.43)
Bµ (r ) = ag Aµ (r ), and Aµ (r ) is the lattice version of the vector potential of QED, with its usual normalization conventions. Note that both terms in the action are periodic. The electrodynamics terms are locally gauge-invariant, Bµ (r ) → Bµ (r ) + µ (r ),
θ(r ) → θ (r ) − (r )
(3.44)
The idea behind the model is that there are matter fields on the sites carrying charge g and coupled to the gauge fields Bµ (r ) which reside on links. The matter fields are represented by planar model phases. It will be particularly interesting to see the new, modified role of the vortices of the planar model in this setting and see that their topological structure plays a central role in the confining features of this model. To study confinement, we set up the Wilson loop-correlation function, $ % µ exp ie Aµ dx = C
r
dθ (r ) r ,µ dAµ (r ) exp(−S + ie C Aµ dx µ ) r dθ (r ) r ,µ dAµ (r ) exp(−S) (3.45)
Note that we are labeling the “internal,” dynamical charge g and the “external,” impurity charge e. We will see that the distinction will prove useful. The contour C in the loop-correlation function will be chosen to be a rectangle with temporal extent T much greater than its spatial extent R. As discussed earlier, we view the contour as the worldline of a heavy quark and the energy required to separate a heavy quark from a heavy anti-quark a distance |R| away is obtained from the
64
Gauge fields
loop-correlation function through the relation $ % 1 µ ln exp ie Aµ dx E(R) = − lim T →∞ T C
(3.46)
We shall see that E(R) will have qualitatively different behavior depending on the possible values of the ratio of the “external” and the “internal” charges, e/g. Next we must write a lattice form of the loop integral, C Aµ dx µ . Introduce the vector field Jµ (r ) and let it equal +1 if the link r → r + µ is on the contour C, −1 if the link r + µ → r is on the the contour C, and zero otherwise. Then we can write the loop integral, using Bµ = ag Aµ , as
e Bµ Jµ g C
Aµ dx µ →
e C
(3.47)
Finally, if we define Z (J ) =
r
e dθ (r ) dAµ (r ) exp −S + i Bµ Jµ g C r ,µ
(3.48)
then the loop-correlation function is the ratio Z (J )/Z (0). In order to make analytic progress, we borrow from the tricks we assembled for the planar model. In particular, we make Villain-model replacements for both terms in the action, e−2κ exp −
2 [1−cos(
µ θ−Bµ )]
ein µ (µ θ−Bµ ) e−n µ /(4κ 2
2)
n µ =−∞
1 [1 − cos(µ Bν − ν Bµ )] 2g 2 a 2
∞
→
∞
→
eilµν (µ Bν −ν Bµ )
lµν =−∞
× e−g
2 a 2 l 2 /4 µν
(3.49)
Now, following the manipulations introduced for the planar model, the integrals over θ(r ) and Bµ (r ) can be done and they generate constraints among the integer-valued auxiliary fields. The partition function Z (J ) reads δν lµν (r )+(e/g)Jµ (r )+n µ (r ),0 r,µ,ν lµν (r ) r ,µ
r n ν (r ) r
δµ n µ (r ),0 e−n µ (r 2
)/(4κ 2 )
e−g
2 a 2 l 2 (r )/4 µν
(3.50)
It is best to solve the constraints explicitly and reduce the number of auxiliary,
Confinement mechanisms
65
integer-valued fields to a minimum. Since n µ and Jµ are divergence-free, they can be written as the curls of other integer-valued fields, n µ = µν ∇ν n,
Jµ = µν ∇ν J
(3.51)
Finally, the constraint ∇ν lµν +
e Jµ + n µ = 0 g
(3.52)
can be solved, lµν = m µ (m · ∇)−1
e ν e µ J + n ν − m ν (m · ∇)−1 J + nµ (3.53) g g
where (m · ∇)−1 indicates a discrete line integral along the direction m. On choosing m to be the 2 direction, (m · ∇)−1 f (r1 , r2 ) =
r2
f (r1 , m )
(3.54)
m =−∞
These auxiliary sums will soon disappear from the physics, but these and their generalizations to higher dimensions are important technical details in latticegauge theory. Finally, we write the exponentials in our last expression for the partition function in terms of n and J ,
n µ n µ = (∇µ n) , 2
lµν lµν
e =2 J +n g
2 (3.55)
so Z (J ) becomes 2 2 2 e a 1 g Z (J ) = n(r ) + J (r ) exp − 2 (µ n(r ))2 − 4κ 2 g r r n(r )=−∞ ∞
(3.56) which we recognize as a generalization of the interface-roughening model we discussed earlier in the context of the planar model. We see that, if g 2 a 2 is appreciable, then the sum over n(r ) is quickly convergent and this is a useful form for the partition function. We are more interested in small g 2 a 2 , near the continuum limit, a → 0, so we borrow the Poisson summation formula from our earlier discussion of the planar model in order to obtain a useful expression for small
66
Gauge fields
g 2 a 2 . Recall the identity ∞
g(n) =
n=−∞
∞ m=−∞
∞
dφ g(φ)e2πimφ
(3.57)
−∞
and apply this identity to Z (J ) in order to write it in terms of an ordinary scalar field φ(r ) and vortices m(r ), Z (J ) =
∞
−∞ r
dφ(r )
∞ m(r )=−∞
" exp −
1 g2a2 2 ( φ) − µ 4κ 2 r 2 #
2 e φ + J + 2π i m(r )φ(r ) × g r r
(3.58)
√ where we recognize that φ is a massive scalar field with mass m s = 2κg. Before continuing with our investigation of quark confinement in this model, we should interpret our findings at this point. The construction of the model started with the planar model, which was then coupled in a gauge-invariant fashion to an Abelian gauge field. The planar model had two phases, a lowtemperature spin-wave phase in which the spin–spin correlation function is power-behaved and a high-temperature vortex-condensate phase in which the spin–spin correlation function falls exponentially over the distance between the spins. We learn from this expression for the partition function that, when the gauge field is turned on, the spin wave becomes a massive scalar field. This weak-coupling behavior is analogous to the Higgs mechanism of fourdimensional electroweak models. In those cases one has a Goldstone boson, which, when it is coupled to an otherwise-massless gauge boson, acquires a mass when a global part of the full gauge symmetry is spontaneously broken. This analogy is not perfect, of course, since infrared singularities forbid the existence of Goldstone bosons in two dimensions, but we can state with full accuracy that, for large κ, the planar model has a power-behaved spin–spin correlation function, while for large κ and small electrodynamic coupling ga, the √ theory’s correlation functions would be short-ranged, with m s = 2κg setting the scale in the exponentials. The periodicity of the original action would be irrelevant in this phase of the full model, the string tension would vanish, and all interactions would be short-ranged, characterized by the screening (correlation) length ξ = 1/m s . By contrast, for small κ and nonzero ga, we shall see that the vortices in the lattice model are relevant and cause confinement through the formation of flux tubes. To back up these claims, we take the last expression for the partition function, integrate out the scalar field φ, and obtain, up to an unimportant overall scale
Confinement mechanisms
67
factor, Z (J ) =
∞
exp −4π 2 κ 2
m(r )G(r − r ; m s a)m(r )
r,r
m(r )=−∞
e m(r )G(r − r ; m s a)J (r ) − 4κ 2 π i (g 2 a 2 ) g r,r e2 J (r )G(r − r ; m s a)J (r ) g 2 r,r 1 2 2 e2 2 − g a 2 J (r ) 2 g r
+ 4κ 2 (g 2 a 2 )2
(3.59)
where G(r ; m s a) is the lattice propagator for a massive scalar field, G(r ; m s a) =
π
−π
dk x 2π
π −π
dk y eik · r 2π 4 − 2 cos k x − 2 cos k y + m 2s a 2
(3.60)
To expose the physics here, consider the case J = 0, the partition function itself, ∞ Z (J = 0) = exp −4π 2 κ 2 m(r )G(r − r ; m s a)m(r ) (3.61) r,r
m(r )=−∞
which expresses the partition function in terms of integer-valued fields m(r ) interacting through a short-ranged potential G(r ; m s a). In contrast to the case in the planar model, in which the bare interaction between the vortices was longranged and singular in the infrared, here the mass m s renders the bare interaction short-ranged and well-posed. The term r = r in the sum produces the vortex self-energy, G(0; m s a) ∼ −
1 ln(m s a) 2π
(3.62)
so, if the vortices are dilute so that their short-ranged interactions can be neglected, the partition function becomes simply √ Z (J = 0) ≈ exp 2π κ 2 ln( 2κga) m 2 (r ) (3.63) m(r )
r
Now return to the loop-correlation function Z (J ) and estimate it under these conditions for which the vortex interactions can be ignored. Note that the function J (r ) is +1 inside the loop and zero otherwise. This behavior guarantees
68
Gauge fields
that Jµ = µν ν J is +1 on the left-hand vertical part of the contour C and is −1 on the right-hand vertical part, representing the charge flow of the heavy quark–anti-quark pair. Aside from an overall factor of g 2 a 2 (e2 /g 2 ), the last two terms in the expression for Z (J ) become (m s a)2 G(r − r ; m s a) − RT (3.64) r,r inside C
Let’s show that this expression vanishes to leading order in RT . Consider both R and T large compared with (m s a)−1 . Consider r inside and not near the contour. Then the sum over r can be done because the extremities in the sum can be ignored. Using
G(r ; m s a) =
r
we see that (m s a)
2
r inside C
1 (m s a)2
(3.65)
G(r ; m s a) − RT = (m s a)2
r
1 RT − RT = 0 (m s a)2 (3.66)
to leading order in RT . Similar approximations give r,r
m(r )G(r − r ; m s a)J (r ) =
1 m 2s
m(r )
(3.67)
r inside C
So, the final expression for Z (J ) becomes √ e 2 2 Z (J ) ≈ exp 2π κ ln( 2κga) m (r ) − 2π i m(r ) g r inside C r m(r ) (3.68) We are particularly interested in a parameter set for which the self-energy of each vortex is large, so that the sum over vortices can be truncated after m(r ) = 0, ±1. Then, if r is inside the contour C, e 2 2 exp 2π κ ln(m s a)m (r ) − 2π i m(r ) g m(r ) e 2 ≈ 1 + 2 exp[2π κ ln(m s a)] cos 2π g e 2 (3.69) ≈ exp 2e2πκ ln(m s a) cos 2π g
Confinement mechanisms and we have for the loop-correlation function # " Z (J ) e 2 −1 cos 2π ≈ exp 2(m s a)2πκ Z (0) g r inside C So
' & Z (J ) e 2πκ 2 RT 1 − cos 2π ≈ exp −2(m s a) Z (0) g
from which we read off E(R) =
2 2m 2s (m s a)2πκ −2
e R 1 − cos 2π g
69
(3.70)
(3.71)
(3.72)
where we have restored physical units (i.e. R could be in GeV−1 ). This is the major result of this discussion. On choosing parameters so that the dilute-vortex condition holds, we see that the vortices disorder the system and lead to confinement through a linear potential. The dependence of the coefficient of the confining potential is significant: it is a periodic function of e/g, the ratio of the external and the internal charges. If we take the external charge to match the internal, dynamical charge, we see that the |R| piece of the heavyquark potential vanishes. We interpret this as screening: as the heavy quark and anti-quark are adiabatically separated on the contour C, it is energetically favorable for a dynamical pair to cancel out the charges of the “impurities” locally, screening the long-range confining electric flux entirely. If, however, we choose the dynamical charge to be twice the charge of the impurity, g/e = 2, then the screening cannot occur and there is a confining |R| potential. This is significant because of the absence of the sort of screening that accompanies a plasma (Higgs) system. There, the charge is itself indefinite because of unbounded vacuum fluctuations and any charge impurity would be screened dynamically: an infinitesimal charge, or a large external charge, would all be screened to zero and a long-range confining potential would never occur. In a truly confining system, one expects screening of a different sort. The confinement charge remains an exact, quantized quantum number and the vacuum respects the associated symmetry. We will not address the continuum limit of this model. We are content to demonstrate its confining and nonconfining features as a function of κ for fixed lattice spacing a. There are discussions in the literature about the character of its continuum limit. It is natural to ask whether any of these particular methods of analysis cast light on four-dimensional theories. It is interesting that the disordering mechanisms of the planar model and the electrodynamics of the planar model apply
70
Gauge fields
to the U(1) gauge theory in four dimensions. Since the U(1) gauge group provides the dynamical basis for QED, our exercise might have more significance than planned. In addition, there are speculative discussions of the confinement mechanism in non-Abelian gauge theories that focus on “maximal Abelian subgroups” and the analysis of confinement here is much like the ideas of confinement in the U(1) theory we will now consider briefly. The Abelian U(1) lattice-gauge theory in four dimensions follows the construction of the non-Abelian theories above, but it places phases on each link instead of a non-Abelian group element. The action reads S=β
(1 − cos θµν (r ))
(3.73)
r,µ,ν
where θµ (r ) is an angular variable on the link r → r + µ, and θµν (r ) ≡ ∇µ θν − ∇ν θµ is the discrete form of the field strength. In order to discuss confinement and of the model we consider ( the phases µ the loop-correlation function, exp(ie C Aµ dx ). The analysis of this quantity uses the same tricks as those we presented for the planar model and its electrodynamics above and we shall summarize the resulting formulas below. However, before looking at those technicalities, it is interesting to discuss the physics issues surrounding U(1) gauge theory, which is often referred to as PQED (“periodic” QED). Since the theory contains the photon as its “spin wave,” we would expect the theory to have a weakly coupled phase that would reproduce the ordinary electromagnetic field which could be coupled to electrons. This sector of the theory should provide a lattice-regulated version of textbook QED. Results of various studies indicate that this is the case and that the periodicity of the U(1) gauge group is indeed irrelevant with weak coupling. Certainly our work on the planar model prepares us for this result. However, with strong coupling, under which gauge-field configurations should explore the U(1) group thoroughly, the periodic nature of the group should become relevant. This is indeed the case and it shows up by the appearance of “monopole loops” in the partition function. The strong-coupling phase of PQED is a version of “quantum electromagneto dynamics” (QEMD), in which there are conserved monopole currents and invisible Dirac strings. The theory could be coupled to electrons and, at least when the lattice spacing is nonzero, it would confine those electrons through a linear potential. The electric and magnetic charges satisfy a Dirac quantization condition that might guarantee that the theory would be interacting and sensible in a continuum limit for which electrons would be confined in a monopole condensate. Since the theory has a separate weak-coupling phase in which monopoles are irrelevant and electrons propagate freely with interactions with the photons well described by perturbation theory, there must a phase
Confinement mechanisms
71
mi (r)
Figure 3.2. A closed monopole loop in four-dimensional compact QED. transition in the full PQED theory at some intermediate gauge coupling. If this phase transition were of second order, it might have the properties described above and be very intriguing. Unfortunately, only parts of this scenario have been established. In particular, when the gauge coupling is large and the lattice cutoff is held fixed, the theory has the properties stated above. This can be shown by analyzing the loopcorrelation function by the same tricks as those that were introduced for the planar model. The four-dimensional character of the model and the periodic nature of its interactions produce four-dimensional versions of the constraints and auxiliary fields which lead to an expression for the partition function, ∞ 1 Z (J ) = exp − Jµ (r )V (r − r )Jµ (r ) δ∇µ m µ (r ),0 2β r,r m µ (r )=−∞ r × exp −2π 2 β m µ (r )V (r − r )m µ (r ) r,r
+ 2πi
m µ (r )µνλκ n ν V (r − r ) ∇λ (n · ∇)−1 Jκ (r )
(3.74)
r,r
where V (r ) is the four-dimensional massless propagator. Let’s interpret the various pieces in this formula. Since ∇µ m µ (r ) = 0, the vector field m µ (r ) can be visualized as traveling on closed loops as shown in Fig. 3.2. These are the worldlines of magnetic monopoles. This identification shows up in two aspects in the partition function. First we see that the heavy electron, described by the vector field Jµ (r ), couples to the massless propagator V (r ) of the photon field, with the strength gE2 = 1/(2β), and the magnetic loops interact with the strength
72
Gauge fields
2 gM = 2π 2 β. Therefore, gE gM = π , the familiar reciprocal relation of Dirac. It is a duality relation: if the electric coupling is weak, the magnetic one is strong and vice versa. Also, m µ (r ) interacts with ∇κ F˜µκ , the divergence of the dual of the electromagnetic-field-strength tensor. So m µ (r ) is a source of ∇κ F˜µκ , which we recognize as the magnetic current (∇κ Fµκ is the ordinary electromagnetic current). The qualitative description of the phases of the model parallels our considerations for the two-dimensional periodic models we have presented. At weak coupling, large β, the monopole loops are suppressed by their large activation energy, 2π 2 V (0). They should then be irrelevant for the large-scale behavior of the theory, which should reduce to free, nonperiodic QED, without confinement. However, for strong coupling, small β, the monopole loops are not suppressed; they can grow to become macroscopic in size and there should be a finite probablity of finding a monopole in any 3 cube. These monopoles disorder the system, suppressing Z (J ) and leading to the area law and confinement through a linear potential. One would call this an “electric meissner effect.” One can make a crude estimate for the critical coupling in PQED using an energy–entropy argument. Consider a loop of L steps. Its activation energy is 2π 2 βV (0)L. The number of loops of length L passing through a given point on the lattice behaves roughly as 7 L . The free energy, action minus entropy times temperature, of such a loop is
F = 2π 2 V (0)L − T ln(7 L ) = (2π 2 V (0) − T ln 7)L
(3.75)
which becomes negative at Tc =
2π 2 V (0) ln 7
(3.76)
and a condensate of monopole loops occurs for all couplings T > Tc = gc2 ≈ 1.57, using the numerical result V (0) ≈ 0.155. In summary, according to these estimates, PQED has a monopole condensate and confinement for all g 2 > 1.57. This result compares favorably with Monte Carlo results even though it has neglected the 1/R potentials in the partition function and has overcounted loops, etc. These features appear to be subdominant effects. Computer simulations indicate that the strong-coupling phase is a plasma of condensed monopole loops that generates a finite correlation (screening) length ξ . One would like to know the nature of the critical point and whether it can be used to define a renormalized, interacting field theory of QEMD. The answer to this question is not known. One would also like to know whether this confinement mechanism is related to that governing QCD. As mentioned above, there are speculations, based on gauge fixing and projecting to Abelian subgroups,
Confinement mechanisms
73
that this is true. However, there are other proposals for topologically driven confinement mechanisms in QCD as well. At this time it is not known whether a simple, single mechanism controls confinement in QCD, let alone its physical and mathematical nature. One of the goals of studying QCD in extreme environments is the elucidation of its underlying dynamics, namely the sources of confinement on the one hand and chiral-symmetry breaking (dynamical mass generation for the hadronic mass spectrum) on the other. We return to non-Abelian gauge theories that enjoy asymptotic freedom.
4 Fermions and nonperturbative dynamics in QCD
4.1 Asymptotic freedom and the continuum limit The lattice formulation of pure SU(N ) gauge fields can be analyzed at strong coupling to show confinement, as we have discussed above. It can also be analyzed at weak coupling, even though a spacetime lattice provides an awkward framework for standard perturbation theory. Asymptotic freedom [12] can be derived in various ways. A relatively simple derivation uses the loop-correlation function and the heavy-quark potential as we have discussed here for strong coupling. The heavy-quark potential can also be calculated in ordinary perturbation theory, by an expansion in powers of the gauge coupling g 2 . To lowest order, the static quark and anti-quark exchange a colored gluon and there results an attractive force given by Coulomb’s law, 4g 2 /3 (4.1) 4π R Next, quantum fluctuations dress this result and modify its R dependence with the logarithms of perturbation theory. Fermion vacuum-polarization effects modify the single-gluon propagator and cause partial screening of the color charge, reducing the potential from the classical result. On the other hand, color interactions between the gluons lead to partial anti-screening of the color charges of the quarks. The final, gauge-invariant result for the heavy-quark potential is 4/3 22g 4 2 V (R) = − g + ln(R/a) (4.2) 4π R 16π 4 V (R) = −
and we see a logarithmic correction to Coulomb’s law that makes the attraction between the quark and the anti-quark stronger than would be predicted by a classical calculation. The interesting effect is in the logarithm which involves the 74
4.1 Asymptotic freedom
75
ratio of the physical distance R and the ultraviolet cutoff a, the lattice spacing. It is instructive and conventional to write this result in terms of a running coupling, 22 4 g (a) ln(R/a) (4.3) 16π 2 The left-hand side of this equation should be a physical quantity that is independent of the cutoff a. This means that g 2 (a) must be tuned to the cutoff a in just the correct manner so that g 2 (R) and the heavy-quark potential do not vary with a. This requires that the lattice coupling g 2 (a) must vanish logarithmically in the continuum limit a → 0. Alternatively, one can view the R dependence of g 2 (R) and consider the Callan–Symanzik function g 2 (R) = g 2 (a) +
∂ 2 11 3 g (R) + · · · (4.4) g (R) = − ∂R 16π 2 The minus sign on the right-hand side implies that the running coupling grows as the distance between the quarks increases and their attraction is more than one would have estimated classically. Underlying this discussion is the full modern theory of the renormalization group. This includes the study of how the ultraviolet-sensitive parameters of a field theory must vary with the cutoff procedure such that physical phenomena remain independent of the regularization method. We cannot introduce this subject here, but rather refer the reader to basic texts on the subject [2]. We will be using many concepts of renormalization as we develop the physics of dense matter. To take the continuum limit of the lattice theory we must understand lattice perturbation theory and respect asymptotic freedom, the fact that non-Abelian SU(3) gauge theory with a limited number of quarks in the fundamental representation of the gauge group becomes weakly coupled at short distance at a logarithmic rate. For Nf flavors of quarks, the asymptotic running of the coupling constant reads β(R) = −R
dg(R) = β0 g 3 + β1 g 5 + · · · d ln R where
2 1 β0 = 11 − Nf , 16π 2 3
β1 =
1 16π 2
2
(4.5)
38 102 − Nf 3
(4.6)
and the coefficient β1 is obtained in a two-loop perturbation-theory calculation. We see that the continuum limit lies far from the strong-coupling, coarse-lattice limit in which we have demonstrated confinement and will later demonstrate chiral-symmetry breaking, the Goldstone mechanism, etc. How then can we be assured that the continuum limit of the lattice theory has these physical properties? Using the full apparatus of the renormalization group, we must show that
76
Fermions and nonperturbative dynamics
the confining, chiral-symmetry-breaking features of the theory are analytically connected to the short-distance asymptotic-freedom features of the running coupling constant. Numerical evidence indicating the veracity of this result is accumulating. We shall be assuming throughout this text that all will work out on this front.
4.2 Axial symmetries and the vacuum of QCD The basic features of QCD consist in its phases and symmetries and their modes of realization. The most fundamental aspects of QCD include its asymptotic freedom, which teaches us that high-resolution experiments such as deep inelastic scattering can detect the pointlike charged constituents of the theory, namely spin- 12 quarks in the fundamental representation of the color gauge group. Its presumed infrared-slavery property posits that the theory is strongly coupled at large distances and that it confines quarks via a linear potential that is generated by electric-flux-tube formation. Confinement is a property of the QCD vacuum, a relativistic electric Meissner effect. In addition, the fundamental spin1/2 quarks have small bare masses, but the strong dynamics of QCD breaks the approximate chiral symmetry of these fermions and generates considerable constituent-quark masses dynamically. Dynamical chiral-symmetry breaking has the byproduct of producing an SU(3)flavor multiplet of almost-Goldstone bosons, with the three pions the clearest of the eight mesons filling out the octet, which plays a particularly important role in the low-energy features of hadronic physics. In fact, many of the properties of low-energy hadronic physics are simply consequences of the Goldstone nature of the theory’s pions. The QCD Lagrangian reads, in a standard but abbreviated notation, 1 α µν 3 α ¯ ¯ L = d r ψ(i ∇ / + g /Aα λC )ψ − Fµν Fα + ψ Mψ (4.7) 4 α where λαC are the eight 3 × 3 color matrices, Fµν is the field strength constructed from the non-Abelian potential Aα , and M is the quark-mass matrix. In the case that M vanishes, the eight flavor-bearing, chiral currents, µ ¯ µ γ 5 Fi ψ = ψγ j5,i
(4.8)
are conserved. In this expression the Fi are the eight SU(3) flavor matrices and the flavor, color, and spin indices on the fermion fields have been suppressed. If the QCD interactions generate a quark mass dynamically, then these global symmetries are spontaneously broken. Eight massless Goldstone bosons must then appear in the theory’s spectrum and their properties are constrained by various low-energy theorems, as will be discussed throughout this book.
4.3 Two-dimensional fermionic models
77
One can also check that the ninth axial-vector current associated with flavorsinglet chiral rotations, ¯ µγ 5ψ j5µ = ψγ
(4.9)
is also conserved formally and should be associated with a ninth, flavor-singlet Goldstone boson. In fact, this “prediction” is a disaster. There is no such light state in the spectrum of QCD . . . the candidate η is much too massive and does not have the expected properties of an “approximate” Goldstone particle. In the early days of current algebra, before the development of QCD, it was realized both in model field theories such as the linear sigma model of the baryons and in QED that the flavor-singlet current’s conservation is broken by quantum fluctuations, the famous triangle anomaly. It was hoped that this fact, the need to regulate the underlying field theory, would save the day and explain the heaviness of the would-be Goldstone particle, the η . These hopes were never realized and the puzzle of the flavor-singlet current remained in the field for many years. In fact, other low-energy theorems associated with j5µ , such as the suppression of the low-energy process η → 3π, were derived and were also at odds with experiment. 4.3 Two-dimensional fermionic models of confinement, axial symmetries, and θ vacua Dynamical symmetry breaking is a major theme of this book and we will dissect, model and analyze it repeatedly. However, here we want to discuss the axial flavor-singlet symmetry because, although it is a perfect classical symmetry of massless quarks, it is broken by quantum effects, is sensitive to the strong-coupling, topological excitations in QCD, and also plays an important role in the properties of the theory’s vacuum. We will illustrate the issues at play by considering two-dimensional models, the massless and massive Schwinger models, and will discuss the role of instantons in QCD in setting the scale of this symmetry breaking. The reason for looking at solvable models is their clarity in the face of some subtle issues involving boundary conditions, long-range forces, cluster properties of matrix elements, and gauge invariance . . . all those topics which bedevil the field and are often swept “under the rug.” Some progress in unraveling the puzzle of the flavor-singlet axial symmetry was made by considering the Schwinger model, garden-variety QED in 1 + 1 dimensions [13]. In this case there is only the flavor-singlet symmetry if there is just one electron. Therefore, the formal chiral flavor-singlet symmetry would predict a massless Goldstone boson, which would be an infrared catastrophe in 1 + 1 dimensions. In fact, the Schwinger model √ is solvable and consists of just a free massive neutral boson of mass m = e/ π , where e is the electric charge. The reader should recall that QED in 1+1 dimensions is a super-renormalizable
78
Fermions and nonperturbative dynamics
Figure 4.1. A vacuum-polarization loop in the two-dimensional Schwinger model. theory and its coupling carries the dimensions of mass. The fact that it is superrenormalizable means that interactions can be ignored for distances small compared with 1/e, the reciprocal of the theory’s charge. So, the theory is simple and does not need an elborate renormalization scheme to define it, and we should be able to understand how it evades the disaster of the extra symmetry. If one considers the Feynman diagrams of the model, one finds that the masslessness of the fermions and conservation of current conspire to render most of them zero. The one-loop correction, shown in Fig. 4.1, to the photon propagator is nonzero; however, it is easy to calculate and is physically relevant. As in four-dimensional QED, write the exact photon propagator in the form dictated by vector-current conservation and local gauge invariance, Dµν (k 2 ) =
gµν − kµ kν /k 2 k 2 (1 − "(k 2 ))
(4.10)
The one-loop contribution to the propagator is computed following the methods familiar from four-dimensional calculations in which gauge invariance is used to simplify its kinematic structure. The resulting expression is finite and yields e2 /π (4.11) k2 Note that the form of this answer is dictated by dimensional analysis. The contribution is of second order in e, the charge, so "(k 2 ) is proportional to e2 , which carries the dimensions mass2 . However, "(k 2 ) is dimensionless, so a factor of k 2 must appear in its denominator. The factor of π results from the Feynman rules. On substituting back into the expression for the photon propagator, we find "(k 2 ) =
gµν − kµ kν /k 2 (4.12) k 2 − e2 /π √ and the photon has picked up a mass, m = e/ π , which is Schwinger’s famous result that showed that gauge invariance alone does not guarantee a massless photon. We recognize that the model displays a very simple example of the dynamical Higgs mechanism. Now that the photon is massive, the potential between charged static impurites is short-ranged, of order 2 exp(−m|x|), where Dµν (k 2 ) =
4.3 Two-dimensional fermionic models
79
is the charge of the impurity and is arbitrary. We learn that an arbitrary charge is screened to zero in this model. This is a characteristic of the Higgs mechanism; the vacuum is a relativistic plasma. It is not true confinement as we have discussed above because there is no remnant of a long-range potential for any . We will see, however, that, if the fermion had a nonzero bare mass, confinement rather than the Higgs mechanism would return and there would be a linear potential between impurities separated by a distance |x| if did not match a multiple of e. The one-loop graph exhausts the dynamics of √ the massless Schwinger model. Further analysis shows that the result m = e/ π is exact, and the spectrum of the model consists of a neutral, massive boson. Before we turn to a discussion of the vacuum of the Schwinger model, let’s consider the conservation laws in the theory in the light of its simple solution. Conservation of current gives ∂µ j µ = 0,
¯ µψ j µ = ψγ
(4.13)
However, a conserved vector field can be written as the curl of a scalar, √ (4.14) j µ = µν ∂ν φ/ π In 1 + 1 dimensions, the γ matrices satisfy γ µ γ5 = µν γν
(4.15)
¯ µ γ5 ψ, is simply related to the vector current, so the axial-vector current, jµ5 = ψγ j5µ = µν jν
(4.16)
√ j5µ = ∂ µ φ/ π
(4.17)
√ ∂µ j5µ = φ/ π
(4.18)
This means that jµ5 is related to φ,
and its divergence is
However, if we return to Maxwell’s equation, ej ν = ∂µ F µν
(4.19)
and realize that, in 1 + 1 dimensions, where there is no “space” for magnetic effects, the field strength can be written as Fµν = −µν E, where E is the electric field, and we can identify the auxiliary field φ as √ π E (4.20) φ= e
80
Fermions and nonperturbative dynamics
However, our brief analysis of the graphs of the Schwinger √ model showed us that the electromagnetic field develops a mass of m = e/ π due to vacuum polarization, so E satisfies the massive Klein–Gordon equation, ( − e2 /π )E = 0
(4.21)
and our divergence equation for the axial-vector current becomes e ∂µ j5µ = E (4.22) π which can be written in the explicitly covariant form e ∂µ j5µ = µν F µν (4.23) 2π So, the fermion loop graph breaks the conservation of the flavor-singlet axial current. Recognize this as the (1 + 1)-dimensional version of the triangle anomaly – in 1 + 1 dimensions the lack of conservation of the axial current comes from vacuum polarization and the divergence of the axial current is linear rather than quadratic in the field-strength tensor [14]. We learn that quantum fluctuations, which are not accounted for in classical field-theory discussions of conservation laws, save the day and remove the unwanted symmetry, and its unwanted consequences, from the theory. The theory has no axial-vector symmetry and, therefore, no massless (Goldstone or even light) mesonic state. The φ is the Schwinger model’s analog of QCD’s η state. The absence of the flavor-singlet axial symmetry changes the vacuum structure of the theory profoundly. The chiral charge is the zero-frequency mode of the momentum canonically conjugate to φ, 1 1 0 † Q5 = j5 dx = ψ γ5 ψ dx = √ ∂0 φ dx = √ "φ dx (4.24) π π Canonical quantization for φ then implies
√ [φ(x), Q 5 ] = i/ π
(4.25)
so the axial charge generates translations in the field φ. If α is a constant, eiα
√ π Q5
φe−iα
√ π Q5
=φ+α
(4.26)
Since the chiral group is compact and exp(inπ Q 5 ) leaves a Fermi field unchanged, up to an unphysical sign, for any integer n, the equivalent boson wavefunctions of the model should be unchanged under the translation √ φ →φ+n π (4.27) So, the eigenvalue spectrum of Q 5 consists of the even integers, when one is acting on admissable Bose states. φ is effectively an angle variable and Q 5 is its canonically conjugate “angular momentum.”
4.3 Two-dimensional fermionic models
81
Since the chiral charge changes the φ field’s boundary values √ at spatial infinity, one should write φ as the sum of a massive (m = e/ π ) field φˆ and a constant field θ [15]. The chiral transformation acts only on θ , √
√
ˆ −iα π Q 5 = φˆ eiα π Q 5 φe √ √ eiα π Q 5 θ e−iα π Q 5 = θ + α
(4.28) (4.29)
√ and the chiral charge Q 5 is the momentum conjugate to θ , Q 5 = (1/ π )"θ . The full space of states of the model, | , can be spanned by product states: one ˆ and a second factor |n}, factor |#) describing the state of the massive field φ, where n is an integer, which is an eigenfunction of 12 Q 5 . √ We can make raising ± and lowering operators for the states |n}, a = exp(±2i π θ ), which change the chirality by two units, a ± |n} = |n ± 1}
(4.30)
We can also make the eigenfunctions of the raising and lowering operators a ± . They are coherent states, |θ0 } =
1 π 1/4
e−2i
√
πnθ0
|n}
(4.31)
n
Note that these states |θ0 } have a simple physical interpretation: since φ is proportional to the electric √ field E, the state |θ0 } has a background, constant electric field of value eθ0 / π . In the Schwinger model with massless electrons, it costs no energy to screen any external electric field to zero, so only θ0 = 0 is a physical possibility. This is a consequence of chiral symmetry. θ0 does not enter the theory’s energetics. In any case, what should the vacuum state be? The perturbative vacuum is |0 = |0)|n = 0}. This is not a good candidate for the exact vacuum of the theory because there might be physical processes that change the chirality of the system and lift the system from |n = 0}. These processes cause mixing between the different |n}, so any particular |n} is not a stationary state. The “θ vacua” are eigenstates of these processes, so a candidate for the exact vacuum would be |θ0 = |0)|θ0 } [16]. The complete screening in the model singles out θ0 = 0. These considerations become more thought-provoking if we consider the massive Schwinger model, i.e. let the electron have a nonzero bare mass m 0 so we have an additional source of explicit chiral-symmetry breaking. In the massless Schwinger model just discussed the background electric field could always be removed through a chiral rotation, but that will not be true in the massive Schwinger model and we will see that the energetics and confinement mechanism will be closer to the electrodynamics of the planar model
82
Fermions and nonperturbative dynamics
than to the massless Schwinger model. The θ vacua of the massive Schwinger model will teach us some subtleties we will see again in four-dimensional QCD [17]. The massive Schwinger model is conveniently analyzed using the equivalentboson method (“bosonization”) introduced for the massless model. The scalar field φ is used again and a thorough analysis of Feynman diagrams allows us to establish the equivalences between the model formulated in terms of fermions and one formulated just in terms of φ, √ j µ = µν ∂ν φ/ π √ j5µ = ∂ µ φ/ π √ ¯ ± γ5 )ψ = ce : exp(±2i π φ) : ψ(1 ¯ µ ∂µ ψ = 1 [(∂0 φ)2 − (∂x φ)2 ] ψγ (4.32) 2
1
where c is a√known constant, and the normal ordering is done with respect to ¯ ± γ5 )ψ to the exthe mass e/ π. The equivalence of the chiral bilinears ψ(1 ponentials of the scalar field φ can be demonstrated by considering some massinsertion graphs in the massless Schwinger model in which φ is a free field. The contractions of the exponentials can be done by applying Wick’s theorem and the equivalence can be established directly in real x space.√For short distances compared with the scale of the Schwinger model mass e/ π , the boson propagator depends on x logarithmically. Exponentials of the propagator then produce the power-law behavior of fermion propagators and the equivalence is established after some careful algebra and counting of graphs. If we apply this set of equivalences to the Lagrangian written in the Coulomb gauge and use Maxwell’s equations to eliminate the electric field in favor of the long-range confining Coulomb potential of 1 + 1 dimensions, then √ mce dx [(∂0 φ) − (∂x φ) ] + dx : cos(2 π φ) : 2 2 e − (4.33) dx dx (∂x φ)|x − x |(∂x φ) 4π We know from our discussion of the massless Schwinger model that we must exercise care in dealing with the last term of L: a careless application of integration by parts to render the term local will fail. The correct procedure is again to write φ in terms of a piece φˆ that falls off quickly at spatial ∞ plus a constant piece, the θ term, φ = φˆ + θ (4.34) 1 L = 2
2
2
1 The value of the dimensionless constant c = π −3/2 eγ (see e.g. [18]), where γ = 0.577 . . . is
the Euler constant, is not important for our discussion.
4.3 Two-dimensional fermionic models
83
Now, 1 L= 2
√ mce e2 2 ˆ dx (∂0 φ) − (∂x φ) − φ + dx : cos[2 π (φˆ + θ )] : π 2 (4.35)
2
2
We learn that, as long as the fermion mass m is different from zero, the model is sensitive to θ because of the long-range confining forces. This is different from the massless Schwinger model, as we saw above, and suggests that this model confines rather than screens. To check this point, let’s calculate the energy between heavy, static arbitrary charges in the model. We need to place equal and opposite external charges a distance |R| apart. In the equivalent-boson language, this is done using 0 jext = √ ∂x φext π where φext is chosen to be nonzero between the external charges, √ 1 1 0 = π $ |R| + x $ |R| − x φext 2 2
(4.36)
(4.37)
0 , we see that there are On taking the derivatives of the step function $ to form jext 1 pointlike charges ± at ± 2 |R|. In this environment the potential-energy term in the Lagrangian (4.33) reads ) * ) 0 0 * e2 dx dx j 0 (x) + jext (x) |x − x | j 0 (x ) + jext (x ) (4.38) − 2 e e
On introducing boson fields as before, and doing the integration by parts, we obtain *2 1 e2 ) dx (∂0 φ)2 − (∂x φ)2 − φˆ + φext L = 2 π e √ mce dx : cos[2 π (φˆ + θ )] : + (4.39) 2 Finally, shift (/e)φext into the cosine term, e2 1 dx (∂0 φ )2 − (∂x φ )2 − (φˆ )2 L = 2 π + *, ) √ mce dx : cos 2 π φˆ + θ − φext : + 2 e
(4.40)
and, in the region between the external charges − 12 |R| < x < 12 |R|, L has the
84
Fermions and nonperturbative dynamics
extra energy, proportional to |R|, V (R) =
+ √ ) √ mce √ *,. cos(2 π θ ) − cos 2 π θ − π |R| 2 e
(4.41)
where we have set the fluctuating, dynamical field φ to zero. We learn from this expression that there is a linear confining energy, an unscreened flux tube, for any external charge that is not a multiple of the internal charge e. However, if is a multiple of e, then the long-range confining force is eliminated. The strength of the confining force is proportional to m. This result is interpreted in the same way as in the electrodynamics of the planar model. The nonzero value of m saves the model from experiencing the Higgs mechanism. The impossibility of screening for a general means that the theory is sensitive to the value of θ for the vacuum. The long-range forces in the model and its singlet chiral properties are intimately related. θ becomes a physically meaningful, measurable parameter in the solution of the model. Only for θ = 0 will the theory’s solution have the discrete symmetries that we expect of the strong interactions. How much of this physics generalizes to four dimensions? This important, difficult question will be discussed below. The instanton solutions of QCD play an important role.
4.4 Instantons and the scales of QCD We have discussed models of order–disorder and confinement. When fermions were introduced, axial-singlet symmetries and the vacuum acquired some unexpected, subtle features. Low-dimensional models allowed us to discuss flux-tube formation and symmetry-breaking phenomena that are hard to deal with analytically in four dimensions. Before we turn to a systematic look at lattice-gauge theory, which will require a long detour from the physics of QCD, let’s discuss some of the theoretical and phenomenological features of QCD that lattice-gauge theory addresses in a fundamental, unbiased fashion. In particular let’s discuss the instantons of QCD, because they are believed to play an important role in the vacuum structure, chiral-symmetry breaking, and new phases of QCD at high temperature and chemical potentials which are the ultimate goal of this branch of research. The reader should be aware that the role of instantons in QCD is not settled. However, a great deal of instanton physics will be addressed and their ultimate importance in QCD will be answered in the context of lattice-gauge theory. One of the finest attributes of lattice-gauge theory is that it promises to teach us how QCD works by providing a testbed for theoretical ideas.
4.4 Instantons and scales of QCD
85
To discuss the instantons of QCD, we begin with the theory’s partition function after integrating out the fermions, Nf 1 −S a a Z = DAµ e det(D / + m f ), S = 2 d4 x Fµν Fµν (4.42) 4g f where the determinant of the Dirac operator γµ (∂µ − iAµ ) accounts for the anticommuting fermions. When we come to lattice-gauge theory we will also integrate out the fermions in our first step of analyzing the system’s partition function because a classical computer cannot handle nonclassical variables, noncommuting fermions, in any other way. This expression for the partition function is not easy to deal with because the fermion determinant hides nonlocal interactions induced by exchanges of light fermions. Nonetheless, in four dimensions it will prove to be a useful starting point in many, but certainly not all, cases. In a semi-classical analysis of Z , we seek configurations that minimize the classical action S. We reviewed this exercise above in the case of the O(3) model and plotted its instantons. These solutions to its classical equations of motion tend to disorder its spin configurations and play a role in calculations of its correlation function. In the case of QCD, instantons contribute to the disorder in the system, and contribute to chiral-symmetry breaking, the resolution of the U(1) problem, and the inter-quark forces within hadronic bound states. We will touch on these points as we develop the subject [19]. An instructive way to write the pure gauge action for these purposes is 1 1 a 4 a ˜a a 2 ˜ S = 2 d x ±Fµν Fµν + Fµν ∓ Fµν (4.43) 4g 2 a a where F˜µν = µνρσ Fρσ is the dual field-strength tensor. Since the second term in this expression for S is always positive, the action is a minimum as long as a a the field is self-dual or anti-self-dual, Fµν = ± F˜µν . The action of such a field configuration is then the first term, which is 1 8π 2 a ˜a d4 x Fµν Q= (4.44) Fµν S = 2 |Q|, g 32π 2
Field-theory textbooks show that Q is a topological invariant with integral eigenvalues for finite-action field configurations and can be written as an integral over the divergence of a conserved four vector. The instanton field configurations with Q = 1 are Aaµ (x) =
2ηaµν xν , x 2 + ρ2
a Fµν
2
=
192ρ 4 (x 2 + ρ 2 )4
(4.45)
where the symbol ηaµν was introduced by ’t Hooft [20] and reduces to the completely anti-symmetric symbol aµν as µ and ν range from 1 to 3, is δaµ for
86
Fermions and nonperturbative dynamics
ν = 4, and is −δaν for µ = 4. The parameter ρ characterizes the size of the instanton. The instanton’s topological charge is not localized for this particular gauge-dependent configuration Aaµ (x). One can choose a different gauge in which the topological charge is at a point singularity, 2ηaµν xν ρ 2 (4.46) (x 2 + ρ 2 )x 2 In order to specify the instanton completely, one must give its position xµ , its size ρ, and its color orientation. In addition to instantons, there are also antiinstantons, whose topological charges range over the negative integers. Instantons can be interpreted as tunneling events between vacua whose topological winding numbers differ by a unit. A pure gauge configuration can be written as Ak = iU ( x ) ∂k U † ( x ), in the temporal gauge A0 = 0. In order to enumerate the classical vacua we must classify the gauge transformations U . U provides a mapping from 3-space onto the gauge group SU(N ). Consider gauge transformations U that approach the identity as x → ∞. Then, in the case of the gauge group SU(2), U provides a mapping of R 3 onto R 3 and the equivalence classes of these mappings are well understood. They are labeled by the Pontryagin number, which counts how many times the group manifold is covered and is given by 1 nP = d3 x i jk tr[(U † ∂i U )(U † ∂ j U )(U † ∂k U )] (4.47) 24π 2 This expression can also be written in terms of the gauge potentials, in which case it is called the Chern–Simons characteristic, 1 1 abc a b c 3 a a n CS = (4.48) d x i jk Ai ∂ j Ak + f Ai A j Ak 16π 2 3 Aaµ (x) =
The important feature about these quantities is that their values are unaffected by small, perturbative fluctuations. Their values can be changed only by transformations that can unwrap their mappings. Instantons are precisely this sort of configuration. The topological charge, which was introduced above, can be written as a surface integral, 1 4 a ˜a 4 Q= d x Fµν Fµν = d x ∂µ K µ = dσµ K µ (4.49) 32π 2 where the current K µ carries the Chern–Simons index, 1 1 abc a b c a a Kµ = (4.50) µαβγ Aα ∂β Aγ + f Aα Aβ Aγ 16π 2 3 So, we learn that the topological quantum number n is the difference between the Chern–Simons characteristic at t → ∞ and that at t → −∞, Q = n = d3 x K 0 |+∞ (4.51) −∞ = n CS (t = ∞) − n CS (t = −∞)
4.4 Instantons and scales of QCD
87
This displays the tunneling feature of the instanton. It provides the mechanism for mixing between the different topologically distinct vacua. We learn that the true vacuum in the theory must be a θ vacuum, as introduced in our discussion of the Schwinger model. Recall that the θ vacua are coherent states, |θ = n einθ |n. The rate of tunneling between different components |n of the θ vacuum behaves as 2 2Nc 8π 8π 2 d4 z dρ exp − 2 (4.52) g2 g (ρ) ρ5 where g 2 (ρ) is the running coupling constant evaluated at the size scale of the instanton. This factor suppresses small instantons. The regime of large ρ cannot be analyzed easily because the coupling grows large in that case and our perturbative formulas become unreliable. The realm of large ρ is not understood. Perhaps study of the lattice will eventually clarify this part of the configuration space. Our understanding of instantons will remain incomplete until the large ones are under control, either numerically or, better, analytically. The role of instantons in establishing the physical vacuum when there are light fermions in the theory is similar to the phenomena we studied in the (1 + 1)dimensional Schwinger model. The axial anomaly in 3 + 1 dimensions originates from the famous triangle diagram coupling the divergence of the flavorsinglet axial current to two gluons, ∂µ jµ5 =
Nf a ˜ a F F 16π 2 µν µν
(4.53)
The right-hand side of this equation is proportional to the topological charge density Q/V , so in the background field of an instanton conservation of axial charge will be violated by 2Nf units. This breaking occurs through localized fermion states that are zero modes of the Dirac operator, iD / ψ0 (x) = 0, in the background field of an instanton [20]. The zero mode is left-handed, γ5 ψ0 = −ψ0 . During a tunneling event the instanton pushes a left-handed fermion state from negative to positive energy, changing the topological charge of the theory’s vacuum, |n → |n +1, and changing its axial charge as well. The Dirac sea acts as an infinite source of axial charge and the instanton’s left-handed zero mode taps into this source and gently breaks the global conservation law. The θ vacua are, therefore, states of indefinite axial charge, not dissimilar to the physics of the Schwinger model. The lightest flavor-singlet pseudo-scalar meson, the η , is not a candidate for being a light Goldstone boson because the axial current is not conserved, and the vacuum does not support long-wavelength, low-energy fluctuations in this quantum number. The (3 + 1)-dimensional situation is similar, again, to the Schwinger model in which the long-range Coulomb potential gave the φ a large mass because of the anomaly. Here, however, it is the nonperturbative interactions induced by instantons that play the crucial role.
88
Fermions and nonperturbative dynamics
The importance of the fermion zero modes is hard to over-emphasize. The nonconservation of the axial charge is given by Q 5 = Q 5 (t = +∞) − Q 5 (t = −∞) = d4 x ∂µ jµ5 (4.54) In terms of the fermion propagator, Q 5 = d4 x Nf ∂µ tr(S(x, x)γµ γ5 )
(4.55)
However, the fermion propagator can be written as an eigenfunction expansion. / ψλ = λψλ , then Let ψλ be an eigenfunction of the Dirac operator, D S(x, y) =
ψλ (x)ψ † (y) λ
λ
which gives
(4.56)
Q 5 = Nf
λ
d4 x tr
ψλ (x)ψ † (x) λ
λ
λ
2λγ5
(4.57)
However, for every nonzero λ, γ5 ψλ = ψ−λ , so the λ = 0 terms in this expression give zero upon integration. Only the normalized zero modes contribute to Q 5 , and we have Q 5 = 2Nf (n L − n R )
(4.58)
where n L (n R ) is the number of left (right)-handed zero modes. However, this is exactly how instantons effect the axial charge: every instanton contributes one unit to the topological charge and has a left-handed zero mode, while every anti-instanton has topological charge minus one and a right-handed zero mode. The fermion zero modes are essential in order that the instantons have physical effects in QCD with light quarks. Note that quarks contribute determinants to the QCD partition function, Nf
det(D / +mf)
(4.59)
f
Since nonzero eigenvalues occur in pairs, det(D / + m f ) = mν
(λ2 + m 2 )
(4.60)
λ>0
where ν is the number of zero modes. So we see again that individual instanton tunneling events cannot occur in the limit of massless fermions. We know
4.4 Instantons and scales of QCD
89
the “escape” from this dilemma: during a tunneling event the axial charge of the vacuum must change by two units, so instantons must be accompanied by fermions. If one calculates the tunneling amplitude in the presence of external quark sources, the zero modes in the denominators of the quark propagators cancel out against zero modes in the determinant and leave a nonzero result, in appropriate cases. Below we shall review how instantons cause chiral-symmetry breaking, giving the light quarks substantial dynamical masses. Then, in a self-consistent, but mean-field, approach to instanton and light-quark dynamics, we find a solution in which the instantons have a nonzero density and the quarks have a nonzero dynamical mass. Each nontrivial effect sustains the other. Instantons are believed to have other physical effects in light-quark hadronic physics. In particular, although their role in confinement is unknown because they are not understood in the strong-coupling, long-distance regime, they appear to play an important role in the energetics of the theory at the intermediate distances which are important for much of hadronic spectroscopy. As stated above, it is believed that they are responsible for the spontaneous breaking of chiral symmetry and the associated phenomena of constituent-quark masses and the existence of the Goldstone pions. This occurs when one considers the quark– quark interactions induced by instantons. In fact, in the case of two flavors, four Fermi terms are induced into the low-energy effective Hamiltonian and the breaking of chiral symmetry and the appearance of Goldstone bosons occurs as in the Nambu–Jona Lasinio model. Each tunneling event is accompanied by a change in the chirality of each of the Nf fermion species through the zero modes surrounding each instanton. This induces a 2Nf -fermion interaction. For momenta small compared with the reciprocal of the size of the instanton, ρ −1 , this interaction can be written in a local form. After the instantons’ color orientations and positions have been integrated over, the effective quark interaction becomes
L
Nf =2
=
"
3 4 2 32 + dρ n 0 (ρ) π ρ 32 3 f # a a a a × (u¯ R λ u L d¯R λ dL − u¯ R σµν λ u L d¯R σµν λ dL ) (4.61)
4 mρ − π 2 ρ 3 q¯ f,R q f,L 3
When this interaction is studied in a mean-field approximation, one finds that it induces chiral-symmetry breaking, a nonzero chiral condensate that co-exists with a nonzero instanton density. The effective quark mass is given by m ∗f = m f − 23 π 2 ρ 2 q¯f qf
(4.62)
90
Fermions and nonperturbative dynamics
and the chiral condensate is estimated through the gap equation, M(k) d4 k ¯qq = −4Nc 4 2 (2π ) M (k) + k 2
(4.63)
where M(0) is the constituent-quark mass whose k dependence is determined by the shape of the fermion zero modes. From the known values for the chiral condensate and the constituent-quark mass, ¯qq ≈ −(255)3 MeV3 and M(0) ≈ 320 MeV, we obtain the estimates for the size and density of instantons, ρ≈
1 fm, 3
N /V = 1 fm−4
(4.64)
These quantities play an important role in instanton phenomenology, which is studied in this mean-field scenario. Since this work is a mix of theory and experiment, it may change considerably as more ambitious instanton programs are undertaken or as more is learned about instantons in lattice-gauge-theory simulations, as we will discuss below. Hadronic phenomenology, namely results from spectroscopy and correlation functions, supports these values of the basic features of the instanton ensemble. Unfortunately, the issues of large instantons and confinement are not resolved, so there could be flaws hidden in this ambitious approach. However, accepting these caveats, a qualitative picture of the QCD vacuum emerges from these studies [19]. 1. The instanton size appears to be a well-defined concept. The size is considerably smaller than the typical inter-instanton separation R, and the instanton ensemble is fairly dilute, ρ/R ∼ 13 . The fraction of spacetime where there are strong fields is only a few percent. a 2. The fields inside an instanton are very strong, Fµν 2QCD . The typical 2 2 instanton action is large, S0 = 8π /g (ρ) ∼ 10–15. The use of the semiclassical approach to the instanton vacuum appears to be self-consistent.
3. The interactions between instantons are small compared with S0 , but are not negligible, |δSint | ∼ 2–3 S0 . The dilute-gas picture for the instanton ensemble is not good, and the ensemble is better thought of as a “liquid.” Tests of these ideas through lattice simulations are possible and should be decisive. Such studies are crucial because the instanton correlations in the instanton ensemble are not tractable analytically and probably must be determined numerically. Comparison with hadronic phenomenology, light-quark spectroscopy as well as light-heavy spectroscopy, is also informative and insightful. For example, within this picture, the strongly attractive interaction in
4.4 Instantons and scales of QCD
91
the pseudo-scalar quark–anti-quark channel needed to explain the Goldstone nature of the pion also implies that there is an attractive interaction in the scalar quark–quark channel. Therefore, the instanton picture of the vacuum appears to support the old diquark model of the baryons. It can easily accomodate the splitting between the nucleon and the delta in this fashion. Unfortunately, decisive evidence for diquarks in nucleons is not apparent in lattice simulations and in purely phenomenological works. Since the instanton calculations cannot accomodate confinement, which disfavors the colored-diquark idea, it may be missing some important energetics and be misleading in some cases. Time, aided by lattice-gauge theory, will tell. Instantons are believed to play an important role in determining the behavior of QCD in extreme environments, high temperature and/or high baryon density. These issues will be discussed at length later in this book, but a few words here are appropriate just to set the stage. How does the instanton ensemble change as QCD is heated through the transition to the quark–gluon plasma? Since chiral symmetry is restored at the transition, the influence of the instantons on the quarks must be diminished dramatically. This could occur in many different ways: the density of instantons could fall dramatically and not be sufficient to drive chiral-symmetry breaking, or the correlations between instantons could become relatively more significant, diminishing their long-range effects. In fact, results from lattice studies indicate that the instanton density does not change dramatically at Tc , but there is some indication that instantons and anti-instantons pair up above Tc and form “molecules.” This is analogous to the Coulomb-gas phase transition in the planar model, whereby, below the transition, the vortices are paired up into neutral molecules and the Kosterlitz–Thouless transition occurs when these molecules ionize. The temperature of QCD would correspond to the reciprocal of the temperature in the planar model. Such ideas are being developed by workers active in the field. It is believed that the role of instantons at large chemical potentials is even more central. As the baryon chemical potential increases one expects a transition to a state of matter in which baryon matter condenses in the ground state. Ordinary nuclear matter with a density of ≈0.17 fm−3 is of this sort. However, if one continues to increase the chemical potential, the nucleons in the ground state should begin to overlap and it is expected that abrupt transitions to new states of matter can occur. Intermediate values for the chemical potential might produce a host of new phases in which the quark mobility is qualitatively enhanced over that in confining QCD. Only at very, very large chemical potentials do we know what to expect. There the Fermi surface of the quarks should be so high that asymptotic freedom should apply and admit a weak-coupling analysis. However, such a system should be unstable with respect to a BCS instability and Cooper pairs, scalar diquarks in the 3¯ representation, should form and make
92
Fermions and nonperturbative dynamics
a color superconductor. This scenario was pointed out many years ago [21, 22] and has been considered by astrophysicists because the interiors of neutron stars are expected to be dense enough to be quark superconductors. However, for no good reason, this effect (the size of the pairing gap) was assumed to be tiny. This phenomenon has been rediscovered and reexamined recently [23, 24], and an attempt to estimate the size of the gap at moderate densities was made. In fact, as we reviewed briefly above, instantons have particularly large attractive forces in the 3¯ scalar-diquark channel. The strength of this attraction is such that it can generate BCS gaps of the order of 100 MeV [24] at moderate densities just a few times that of nuclear matter. The study of QCD at chemical potentials and temperatures relevant to the real world is a Holy Grail of lattice-gauge theory. As we learn more about latticegauge theory and effective Lagrangians we will focus on this application, develop some useful methods, discuss problems and limitations, and, it is hoped, point out directions for progress.
5 Lattice fermions and chiral symmetry
5.1 Free fermions on the lattice in one and two dimensions Lattice fermions, their symmetries, and their continuum properties are issues central to lattice studies of dense and hot matter. It is a subtle subject because there are fundamental restrictions on the number of fermion species, their handedness, and gauge invariance. To see what the challenges are, we will illustrate lattice forms of the Dirac equation in various settings. First consider the free Klein–Gordon equation and free Dirac equation on a spatial lattice with continuum time variable. In a Hamiltonian lattice-gauge theory, one would have a three-dimensional lattice and a continuum of time. Excitations in the system could hop from site to site given the rules of the Hamiltonian, the discrete version of the spatial derivatives in the energy, etc. Although Hamiltonian lattice-gauge theory is an important subject, our emphasis here continues to be on the Euclidean version of the theory. However, a short look at Hamiltonian methods is very elementary and enlightening. The details of the problems of lattice versions of the Dirac equation are different depending on whether time is treated as a continuum variable or a discrete one. Euclidean lattice fermions will be discussed in detail below after our introduction. Let there be a free boson field φ(x, t) in 1+1 dimensions, namely one discrete spatial axis and one continuum temporal axis. The continuum Klein–Gordon equation reads 2ψ − m2φ φ¨ = ∇
(5.1)
which implies the energy–momentum relation for plane waves, E 2 = p 2 + m 2
(5.2)
We want to know how well the solutions to the lattice form of these equations approximate the continuum situation. Our first task is to place the Laplacian on 93
94
Lattice fermions and chiral symmetry
the spatial lattice. The standard discretization reads 2 φ → φ(n + 1) + φ(n − 1) − 2φ(n) = (d + + d − − 2)φ(n) (5.3) a2 ∇ where d ± φ(n) = φ(n ± 1) are shift operators. The lattice Klein–Gordon equation now reads a 2 φ¨ = (d + + d − − 2)φ(n) − a 2 m 2 φ
(5.4)
This discrete equation has plane-wave solutions, φ = exp(ikna + iEt)
(5.5)
Here the wave vector k takes values within the Brillouin zone, −π < ka < π, which exhausts all the independent solutions. The lattice energy–momentum relation follows on substituting the plane wave back into the discrete equation of motion, −E 2 φ =
eika + e−ika − 2 φ − m2φ a2
(5.6)
so E 2 = m 2 − 2[cos(ka) − 1]/a 2
(5.7)
For small ka 1, the continuum energy–momentum relation is reproduced, E 2 = m 2 + k 2 + O(k 4 a 2 )
(5.8)
and relativistic propagation is restored for wavelengths that are long on the scale of the cutoff. The energies of the states near the edges of the Brillouin zone behave as O(1/a) and are classically irrelevant as a → 0 in the continuum limit where we are interested only in fixed, finite energy–momentum states. In conclusion, there are no surprises with the lattice form of the Klein–Gordon equation. This result will be in sharp contrast to that for the Dirac equation. Consider the (1 + 1)-dimensional free Dirac equation. For the most part we shall set the Dirac mass m o to 0, and concentrate on chiral symmetry on the lattice, which will prove to be a challenging subject. The two-component spinor and the 2 × 2 Dirac matrices will be written ψ1 1 0 0 1 , γo = ψ= , α = γ5 = (5.9) ψ2 0 −1 1 0 The continuum Dirac equation reads iψ˙ = −iα ∂z ψ = −iγ5 ∂z ψ
(5.10)
The equation is simplified if we consider chiral eigenstates, γ5 χ± = ±χ±
(5.11)
5.1 Free fermions
95
Consider plane-wave solutions of the Dirac equation, ψ± = e−ikz+iEt χ±
(5.12)
Substituting into the Dirac equation gives the energy–momentum relation, E = ±k,
−∞ < k < ∞
(5.13)
We identify massless left- and right-moving fermions and anti-fermions. Now, to begin understanding the subtleties of the lattice Dirac equation, place ψ1 (5.14) ψ= ψ2 on a spatial lattice and replace ∂z by a simple discrete difference, ψ˙ = −
i γ5 [ψ(n + 1) − ψ(n − 1)] 2a
(5.15)
Again, look for plane-wave solutions with definite chirality, ψ± = e−i(−kna+Et) χ± ,
γ5 χ± = ±χ±
(5.16)
Substituting back into the equation of motion gives the energy–momentum relation −E = ±
i ika (e − e−ika ) 2a
(5.17)
which gives E = ± sin(ka)/a
(5.18)
There is something seriously “wrong” with this result! Note that, for small ka 1, it reduces to E ≈ ±k + O(k 3 a 2 )
(5.19)
as expected. However, for ka close to the edge of the Brillouin zone, ka = π − k a, k a 1, we have additional low-energy excitations, E ≈ ∓k + O(k 3 a 2 )
(5.20)
The full energy–momentum dispersion relation is shown in Fig. 5.1. We identify two two-component Dirac particles in the continuum limit. In addition, the total chiral charge of the resulting multiplet is zero. This catastrophe is even better seen by considering the field ψ+ on the lattice. It describes the branch of the dispersion relation labeled γ5 = +1 in Fig. 5.1. There is a right-mover (k ≈ 0) and a left-mover (k ≈ π ). Since ordinary chirality
96
Lattice fermions and chiral symmetry E
1
chirality + 1
k
Figure 5.1. The energy–momentum relation for the naive Dirac equation on a one-dimensional spatial lattice. is just velocity for massless particles in 1+1 dimensions, the low-energy content of the lattice field ψ+ is a pair of fermions whose chiralities add up to zero. This is the phenomenon of “species doubling” which is well known in condensedmatter theory. H. B. Nielsen and M. Ninomiya [25] pointed out that this result is much more general than this specific exercise. They presented a series of “no-go” theorems concerning exact chiral symmetry on the lattice and species doubling. The idea is that species doubling occurs because of the continuity of the excitation branches in the Brillouin zone, independently of the particular lattice approximations as long as the lattice formulation of the theory has exact chiral symmetry associated with a compact, continuous group and the hopping and interaction terms in the lattice action are sufficiently local to insure continuity of the energy–momentum relations for the lattice fermions. For example, one of their theorems considers the action S=
d4 x [−iψ¯˙ L ψL + ψ¯ L σ · (−i ∂ − A(x))ψ L]
(5.21)
Recognize this as a simple theory of a left-handed neutrino coupled to a gauge field. It could be a piece of the standard-model action, for example. Nielsen and Ninomiya’s theorem states that, when this theory is put onto the lattice with exact, continuous chiral symmetry and locality in its interactions and hopping (kinetic) terms, then the low-energy excitations of the theory will include an equal number of left- and right-handed Weyl fermions. The continuum limit of the lattice theory will be left–right symmetric, contrary to plan.
5.1 Free fermions
97
There are several methods to deal with the limitations placed upon us by these observations. The two most useful and straightforward sacrifice chiral symmetry to different degrees to avoid the full consequences of species doubling. 1. Thin the degrees of freedom. 2. Lift the energy at the edges of the Brillouin zone. Let’s discuss each approach in turn by illustration in 1 + 1 dimensions. Approach 1: “staggered fermions” Begin by placing a single-component Fermi field φ(n) on each site. The equal-time anti-comutation relations of φ(n) read [φ † (n), φ(m)]+ = δnm
[φ(n), φ(m)]+ = 0,
(5.22)
and let its dynamics be described with the Hamiltonian H =−
i [φ † (n)φ(n + 1) − φ † (n + 1)φ(n)] 2a
(5.23)
The equation of motion for the Fermi field reads, i ∂φ/∂t = [H, φ], ˙ φ(n) =−
1 [φ(n + 1) − φ(n − 1)] 2a
(5.24)
This doesn’t resemble the two-component Dirac equation we expect in 1 + 1 dimensions until we identify the upper and lower components of the spinor. To do this, consider the spatial lattice to consist of two, interleaved, “even” and “odd” lattices, ψ1 (n) = φ(n), ψ2 (n) = φ(n),
n even n odd
(5.25)
Now the equation of motion can be written in two-component form, 1 [ψ2 (n + 1) − ψ2 (n − 1)] 2a 1 ψ˙ 2 (n) = − [ψ1 (n + 1) − ψ1 (n − 1)] 2a
ψ˙ 1 (n) = −
which gives a discretization of the continuum Dirac equation, 0 1 ∂ψ ψ˙ = −γ5 ∂z ψ = − 1 0 z
(5.26)
(5.27)
which reads, in component form, ψ˙ 1 = −∂z ψ2
and
ψ˙ 2 = −∂z ψ1
(5.28)
98
Lattice fermions and chiral symmetry
We learn from this exercise that the staggered-fermion method avoids the species-doubling problem by doubling the size of the unit cube in real space and effectively halving the size of the Brillouin zone. We must clarify the symmetries of the lattice Hamiltonian and equation of motion. This is necessary because, when we add interactions into the lattice model, the continuum limit its solutions will have only those symmetries built into the system. For example, if there is no remnant of chiral symmetry in the discrete equation of motion, then we cannot expect massless fermions to appear in the continuum propagators of the theory. In fact, the mass of the fermion will be of the order of the reciprocal of the lattice spacing and the fermion will be unphysical. Consider some bilinears of the lattice system: † † † ψ ψ dz = (ψ1 ψ1 + ψ2 ψ2 ) dz → n φ † (n)φ(n) † † ¯ ψψ dz = (ψ1 ψ1 − ψ2 ψ2 ) dz → n (−1)n φ † (n)φ(n) † † ¯ ψγ5 ψ dz = (ψ1 ψ2 − ψ2 ψ1 ) dz
→ n (−1)n [φ † (n)φ(n + 1) − φ † (n + 1)φ(n)] † † ψ † γ5 ψ dz = (ψ1 ψ2 + ψ2 ψ1 ) dz →
n [φ
†
(n)φ(n + 1) + φ † (n + 1)φ(n)]
(5.29)
Now we can classify the symmetries of the lattice Hamiltonian, H =−
i [φ † (n)φ(n + 1) − φ † (n + 1)φ(n)] 2a
1. Translation of the spatial lattice by an even number of sites. The generator of ordinary translations in the continuum theory is the momentum operator, † † † pz = −i ψ ∂z ψ dz = −i (ψ1 ∂z ψ1 + ψ2 ∂z ψ2 ) dz (5.30) which, of course, does not mix the components, ψ1 and ψ2 . So, its lattice version should not mix the two sublattices, pz →
n [φ
†
(n + 2)φ(n) + φ † (n)φ(n + 2)]
(5.31)
is the corresponding lattice generator. Translations by two lattice spacings correspond to continuum translations. The lattice Hamiltonian is, of course, unaffected by such operations.
5.1 Free fermions
99
2. Tranlations by an odd number of sites. Here we have a surprise. Looking at the bilinears above, we see that translations by one site are generated by the chiral charge, ψ † γ5 ψ dz. Clearly the lattice Hamiltonian is exactly preserved by this operation, regardless of the physical dimensions of the lattice spacing. This lattice-fermion method preserves discrete γ5 symmetry for any lattice spacing a. Since γ5 simply interchanges the components of ψ, ψ ψ2 (5.32) γ5 1 = ψ2 ψ1 these are just chiral rotations through π/2 radians, exp(iγ5 π/2) = iγ5 . So, translation through an even number of sites is a pure translation whereas translation through an odd number of sites is a pure discrete chiral transformation. This curious mixing of spacetime and chiral symmetries occurs because of the extended nature of the Brillouin zone. The staggered fermion method in (3 + 1)-dimensional Hamiltonian systems (discrete spatial but continuum time lattices) and in four-dimensional Euclidean systems (symmetric four-dimensional lattices) has similar but even more intricate symmetry considerations because flavor symmetries (multiple species) will also play a role there. Note that the standard mass term, m o n (−1)n φ † (n)φ(n), breaks the discrete chiral symmetry, but not the translational symmetry, of the staggered-fermion method. It is particularly important that the Hamiltonian with m o = 0 has an exact, discrete chiral symmetry no matter what the lattice spacing. This means that interactions that also preserve chiral symmetry on the lattice cannot generate mass terms in the theory’s effective Hamiltonian. In other words, the symmetry is “protected.” This is the reason why staggered fermions are so important in applications to QCD. Now consider another lattice-fermion method that has poorer chiral properties but more convenient spinor properties. Approach 2: Wilson fermions (“brute force”) Place a two-component Dirac spinor at every site of the spatial lattice, but add terms to H so that the energy–momentum dispersion relation does not have secondary minima at ka = ±π , i † ψ (n)α[ψ(n + 1) − ψ(n − 1)] 2a n B ¯ ψ(n)[2ψ(n) − ψ(n + 1) − ψ(n − 1)] + 2a n m ¯ ψ(n)ψ(n) + g2 2a n
H =−
(5.33)
100
Lattice fermions and chiral symmetry
Note that the second term in H is a “boson” kinetic energy multiplied by the lattice spacing a. This term is designed to lift the energy–momentum dispersion relation at the edge of the Brillouin zone, just like a bosonic term, for nonzero lattice spacings, but should become irrelevant in the continuum limit, a → 0, 1 dz 1 ¯ 2 ∇ 2ψ ¯ a ψa ψ(n)[2ψ(n) − ψ(n + 1) − ψ(n − 1)] → a n a a = a ψ¯ ∇ 2 ψ dz → 0,
as
a → 0 (5.34)
for sufficiently smooth fields. Omitting the last term in H for the moment, the equation of motion for ψ is i γ5 [ψ(n + 1) − ψ(n − 1)] 2a B + γo [2ψ(n) − ψ(n + 1) − ψ(n − 1)] 2a
˙ ψ(n) =−
(5.35)
For a plane-wave Ansatz, ψ = ei(−kna+Et) χ, the equation of motion becomes i B γ5 (e−ika − eika )χ + γo (2 − eika − e−ika )χ 2a 2a sin(ka) sin2 (ka/2) χ + 2Bγo χ −Eχ = −γ5 a a
−Eχ = −
(5.36)
So 4 sin2 (ka) 2 sin (ka/2) E = + 4B a2 a2 2
(5.37)
Letting ka → 0, we find that E 2 ≈ k 2 + B 2 k 4 a 2 + · · ·, which is the desired result. In addition, near the edge of the Brillouin zone, ka ≈ ±π , E 2 ≈ 4B 2 /a 2
(5.38)
and we see that the bosonic term in the Hamiltonian has opened a gap in the fermion’s spectrum which diverges in the naive continuum limit. However, we have paid a heavy cost for this success. 1. H explicitly breaks both continuous and discrete γ5 symmetry. There is no remnant of chiral symmetry left. 2. Since there is no symmetry prohibiting mass counterterms in the theory when additional interactions are put into H, delicate fine tuning of parameters will have to be done in the lattice model in order to have finite-mass fermions in the propagators of the continuum model.
5.2 Fermions and bosons
101
5.2 Fermions and bosons on Euclidean lattices Now let’s turn to the propagation of free bosons and fermions in two and four Euclidean dimensions. Again we shall find that bosons are easy to deal with on the lattice, but fermions are not. There will be species doubling again, but the details will be lattice-dependent and the remnants of chiral symmetry different from the Hamiltonian case. Begin with the boson propagator " in four-dimensional Euclidean space, described by the Klein–Gordon equation, (m 2 − x ) "(x − x ) = δ 4 (x − x )
(5.39)
Now place this equation on a four dimensional Euclidean lattice, xµ = n µ a,
n µ = 0, ±1, ±2, . . . ,
µ = 1, 2, 3, 4
(5.40)
Replace the derivatives with finite differences, x =
∂2 1 + → (dµ + dµ− − 2) 2 2 ∂ x a µ µ µ
(5.41)
where dµ± is a shift operator carrying out shifts by one lattice spacing in the direction ±µ. It is convenient to define a dimensionless boson propagator, "n = a 2 "(n). Then the Klein–Gordon equation for the propagator becomes a −2 m 2 "n = ("n+µ + "n−µ − 2 "n )/a 4 = δn0 /a 4 (5.42) µ
or, in dimensionless form, (8 + m 2 a 2 )"n −
("n+µ + "n−µ ) = δn0
(5.43)
µ
Transforming to momentum space allows us to solve for the free propagator, "( p) =
1−K
1 , i pµ a + e−i pµ a ) µ (e
K ≡
1 8 + m 2a2
(5.44)
which can be written "( p) =
1 − 2K
1
µ
cos( pµ a)
(5.45)
The Brillouin zone covers the range −π/a ≤ pµ ≤ π/a for each µ = 1, 2, 3, or 4. If we continue back to Minkowski space, p0 → iE, we can identify the particle spectrum with the poles of "(E, p), "(E, p) =
1
1 − 2K cosh(Ea) − 2K i cos( pi a)
(5.46)
102
Lattice fermions and chiral symmetry
The only poles in "(E, p) occur near the origin of the Brillouin zone, Ea 1, pi a 1. Under these conditions we have "(E, p) ≈
1 1 − 8K − K (E 2 a 2 − p2 a 2 )
(5.47)
which has a pole at E 2 = p2 + (1 − 8K )/(K a 2 )
(5.48)
We recognize this result as the relativistic energy–momentum relation for a particle of mass m 2 = (1−8K )/(K a 2 ). So we could obtain sensible propagation in the continuum limit if we take the limit K → 18 from below, 1 − 8K > 0, and take a → 0 so that m 2 is held fixed in physical units. Now consider the free fermion propagator in four dimensional Euclidean space. The Dirac equation with unit source reads (γ µ ∂µ − m)G(x − x) = δ 4 (x − x),
[γ µ , γ ν ]+ = 2δ µν
The continuum action is 1 4 ¯ µ ∂µ − m)ψ d4 x ψ(γ A= Ld x = 2 so a “naive” lattice action would read 1 4 ¯ ¯ − [ψ(n)γ A= µ ψ(n + aµ ) − ψ(n + aµ )γµ ψ(n)]a 2a n,µ 4 ¯ −m ψ(n)ψ(n)a
(5.49)
(5.50)
(5.51)
n
We can expect from our previous discussion of the lattice Dirac equation that this lattice action will describe not one species, but 16 = 24 , because of the fourdimensionality of the Brillouin zone. Actually this lattice action is very useful in formal developments of the subject, so we will use it many times below. We will also elucidate its relation to other lattice fermion forms that have fewer low-energy species. To reduce the number of species in the low-energy spectroscopy of the lattice fermion action, we can follow the first strategy used with Hamiltonian lattice
¯ actions and add a boson-like term to this expression, r/(2a) n,µ [ψ(n)ψ(n + 4 ¯ + aµ )ψ(n) − 2ψ(n)ψ(n)]a ¯ aµ ) + ψ(n , so the action becomes A=
1 ¯ + aµ )(r + γµ )ψ(n)]a 4 ¯ [ψ(n)(r − γµ )ψ(n + aµ ) + ψ(n 2a n,ν 8r 4 ¯ + m− (5.52) ψ(n)ψ(n)a a n
5.2 Fermions and bosons
103
It√ is more convenient to introduce dimensionless fields, ψ(n) (1/ a 3 )ψ(n), so the action becomes ¯ ¯ + µ)(r + γµ )ψ(n) A= ψ(n)(r − γµ )ψ(n + µ) + ψ(n n,ν
+ (ma − 8r )
¯ ψ(n)ψ(n)
→
(5.53)
n
The fermion propagator then satisfies [(r − γµ )dµ− + (r + γµ )dµ+ + (ma − 8r )]G(n) = δn0
(5.54)
Passing to momentum space, we obtain the propagator G( p) =
1−K
K i pµ + (r − γ )e−i pµ ] µ µ [(r + γµ )e
(5.55)
where the parameter K is −1/(ma − 8r ). The propagator simplifies for some special choices of its parameters. 1. m = 0, r = 0. Then,
1 µ γµ sin pµ G( p) =
=
2 µ γµ sin pµ µ sin pµ
(5.56)
Generalizing our earlier discussion, we see that G( p) describes 24 lowenergy fermion excitations because it has zeros at the origin in momentum space as well as zeros at the edges of the four dimensional Brillouin zone. This fermion method is called “naive fermions.” 2. m = 0, r = 0. Then, G( p) =
2
1
µ γµ sin pµ − 2r µ cos pµ + 8r
(5.57)
For nonvanishing r , we see that the fermion propagator does not have additional zeros at the edges of the Brillouin zone. It is also clear from the action, however, that, even though m = 0, the lattice system has neither continuous nor discrete chiral symmetry: the fact that r = 0 removes the unwanted species at the cost of removing all remnants of chiral symmetry. This “catastrophe” is a consequence of the general principles underlying the Nielsen–Ninomiya theorems. In applications the choice r = 1 is quite handy for this class of fermion methods. In this case the propagator equation becomes [(1 − γµ )dµ− + (1 + γµ )dµ+ − 8]G(n) = δn0
(5.58)
104
Lattice fermions and chiral symmetry
It is easy to see that the operators Pµ± = 12 (1 ± γµ ) are projection operators, (Pµ± )2 = Pµ± and Pµ+ Pµ− = Pµ− Pµ+ = 0. In addition, with r = 1 and letting K → 18 , we obtain the correct continuum form of the lattice propagator, G ≈ 1/ p, for low-momentum modes. Clearly this fermion method has some useful properties, but the absence of chiral symmetry and the protection that lattice chiral symmetry provides to unwanted, unbounded bare-mass generation limits its usefulness. 5.3 Staggered Euclidean fermions Now consider the Euclidean version of staggered fermions. This fermion method is particularly important in applications because it retains a continuous remnant of chiral symmetry and has a precisely massless Goldstone boson, a pion, whenever chiral symmetry is spontaneously broken by the lattice theory’s dynamics. Although the Hamiltonian version of staggered fermions had only a discrete piece of chiral symmetry when the lattice spacing a was nonzero, the Euclidean version has more chiral symmetry insuring the existence of a massless Goldstone boson. Since these aspects of the theory dictate many of its low-energy features, the theoretical and practical importance of these features have proved very important in applications ranging from spectroscopy to finite-temperature transitions and low-energy theorems. We will illustrate Euclidean staggered fermions in two dimensions so everything will be explicit and simple. The generalizations to higher dimensions can then be carried through more formally. We shall see that one of the drawbacks of the staggered fermion methodology is the identification of ordinary flavor symmetries and species (the method is challenging . . . “staggering,” as its critics have complained). To begin, we will choose a standard representation of the Dirac matrices in two dimensions. The Pauli matrices suffice, 0 1 0 −i 1 0 γ0 = = σ1 , γ1 = = σ2 , γ5 = = σ3 (5.59) 1 0 i 0 0 −1 We will begin with the “naive” fermion action and then “thin” its degrees of freedom and construct the “staggered” action which will describe a single relativistic iso-doublet. The naive action for massless fermions is, in dimensionless notation, ¯ S= ψ(i)γ i = (n 0 , n 1 ) (5.60) µ [ψ(i + µ) − ψ(i − µ)], i,µ
To thin the degrees of freedom, we “spin diagonalize” this expression by introducing a Fermi field χ, ψ = γ0|n 0 | γ1|n 1 | χ
(5.61)
5.3 Staggered Euclidean fermions
2
1'
1
2'
2
1'
2
1'
1
2'
1
2'
105
Figure 5.2. The basic Brillouin zone for the two-dimensional staggered-fermion method. and, independently, ψ¯ = χ¯ γ1|n 1 | γ0|n 0 |
(5.62)
Writing the action in terms of χ, we have S=
χ¯ (i)(−1)φ(i,µ) [χ (i + µ) − χ (i − µ)]
(5.63)
i,µ
where the phases are (−1)φ(i,0) = γ1|n 1 | γ0|n 0 | γ0 γ0|n 0 +1| γ1|n 1 | = +1 (−1)
φ(i,1)
=
γ1|n 1 | γ0|n 0 | γ1 γ0|n 0 | γ1|n 1 +1|
= (−1)
(5.64) |n 0 |
(5.65)
So the matrix structure has collapsed to simple signs . . . S is “spin-diagonal,” and one need keep only a single complex field χ (i) at each site. We can expect from our earlier analysis of the Hamiltonian lattice (discrete space, continuum time), that this lattice system will describe two spin- 12 fermion species. The basic Brillouin zone will consist of four lattice sites that are conveniently labeled as “even” and “odd,” as shown in Fig. 5.2. Consider the four sites of each block shown in Fig. 5.2 and label them with γ matrices as suggested by the defining equation for χ above.
106
Lattice fermions and chiral symmetry
1. The origin, site 1 (bottom-left-hand corner of the block): 1 a ψ1 = χ (1), ψ¯ 1a = 1 0 χ¯ (1) 0 2. γ0 , site 2 (upper-left-hand corner of the block): 1 0 a χ(2) = χ (2), ψ¯ 2a = 0 1 χ¯ (2) ψ2 = γ0 0 1 3. γ1 , site 2 (bottom-right-hand corner of the block): 1 0 b χ(2 ) = i ψ2 = γ1 χ (2 ), ψ¯ 2b = −i 0 1 χ¯ (2 ) 0 1
(5.66)
(5.67)
(5.68)
4. γ0 γ1 , site 1 (top-right-hand corner of the block): 1 1 b χ (1 ) = i ψ1 = γ0 γ1 χ (1 ), ψ¯ 1b = −i 1 0 χ¯ (1 ) (5.69) 0 0 where the labels a and b will specify the two species of an iso-doublet. Now, we can write the action in terms of the ψ fields, ¯ (2)(∇0 χ(1) − ∇1 χ (1 )) S = χ(1)(∇ ¯ 0 χ (2) + ∇1 χ (2 )) + χ
+ χ(1 ¯ )(∇0 χ (2 ) − ∇1 χ (2)) + χ(2 ¯ )(∇0 χ (1 ) − ∇1 χ (1)) + other blocks = ψ¯ 1a ∇0 ψ2a − i ∇1 ψ2b + ψ¯ 2a ∇0 ψ1a + i ∇1 ψ1b + ψ¯ 1b ∇0 ψ2b − i ∇1 ψ2a + ψ¯ 2b ∇0 ψ1b + i ∇1 ψ1a + other blocks (5.70) where χ¯ (1)(∇0 χ(2)) means χ¯ (1)(dµ+ − dµ− )χ(2) with µ = 0, etc. The site labeling in the equation follows the conventions used in Fig. 5.2. We can diagonalize the action in the flavor indices and identify up and down quark fields by making appropriate linear combinations. Let /√ /√ u i = ψia + ψib 2, d˜i = ψia − ψib 2 (5.71) so that
√ ψia = (u i + d˜i )/ 2,
√ ψib = (u i − d˜i )/ 2
(5.72)
On substituting into the action, we find S = u¯ 1 (∇0 u 2 − i ∇1 u 2 ) + u¯ 2 (∇0 u 1 + i ∇1 u 1 ) + d¯˜ (∇ d˜ + i ∇ d˜ ) + d¯˜ (∇ d˜ − i ∇ d˜ ) 1
0 2
1 2
2
0 1
1 2
(5.73)
5.4 Block derivatives and axial symmetries
107
and, on collecting terms, S = u¯ Du + d¯˜D˜ d˜
(5.74)
where the “Dirac” operators are D = γ0 ∇0 + γ1 ∇1 ,
D˜ = γ0 ∇0 − γ1 ∇1
(5.75)
Finally, we can perform a linear transformation on d˜to obtain a standard Dirac operator, 0 1 ˜ d (5.76) d = iγ5 γ1 d = iσ3 σ2 d = σ1 d = 1 0 Now the second term in the action can be written ¯ 1 Dσ ¯ 0 ∇0 − γ0 γ1 ∇1 γ0 )d = d¯Dd ˜ 1 d = d(γ d¯˜D˜ d˜ = dσ
(5.77)
So the action becomes S = u¯ Du + d¯Dd
(5.78)
and we identify the physical content of the lattice theory: it consists of two fermions, an iso-doublet.
5.4 Block derivatives and axial symmetries Since the fermion degrees of freedom of the iso-doublet are spread over four lattice sites, each comprising a “block,” it is necessary to distinguish between couplings between blocks, which have a clear continuum meaning, and those within blocks. With this in mind, we introduce “block” derivatives and rewrite the lattice action in a fashion that displays the continuum symmetries and their breakings by finite lattice effects more explicitly. To begin, go back to the lattice action written in terms of the ψ fields, S = ψ¯ 1a ∇0 ψ2a − i ∇1 ψ2b + ψ¯ 2a ∇0 ψ1a + i ∇1 ψ1b + ψ¯ 1b ∇0 ψ2b − i ∇1 ψ2a + ψ¯ 2b ∇0 ψ1b + i ∇1 ψ1a + other blocks (5.79) Let’s examine the discrete differences here in more detail. Consider three sites 1, 2, and 3 in one lattice direction and the definitions of first and second derivatives,
108
Lattice fermions and chiral symmetry
1
2
1
x1
2 x2
1
2 x3
Figure 5.3. Coupled sites in one direction on a two-dimensional staggeredfermion lattice. ∇ f (2) ≡ 12 ( f (3) − f (1)),
∇ 2 f (2) ≡ f (3) + f (1) − 2 f (2) (5.80)
Then various finite differences involving the three sites can be written in these terms as f (3) − f (2) = 12 ( f (3) − f (1)) + 12 ( f (3) + f (1) − 2 f (2)) = ∇ f + 12 ∇ 2 f (5.81) and f (2) − f (1) = 12 ( f (3) − f (1)) + 12 ( f (3) + f (1) − 2 f (2)) = ∇ f − 12 ∇ 2 f (5.82) Now introduce the “block” derivative which couples corresponding fermion fields between adjacent blocks and does not contain any mixing. The action as it is written at present contains terms that couple degrees of freedom entirely inside a block as well as terms connecting adjacent blocks. For example, the action written above contains differences like ψ2a (x2 ) − ψ2a (x1 ), in the notation of Fig. 5.3. However, using the algebra above, ψ2a (x2 ) − ψ2a (x1 ) = 12 ψ2a (x3 ) − ψ2a (x1 ) − 12 ψ2a (x3 ) + ψ2a (x1 ) − 2ψ2a (x2 ) ˆ 2a − 1 ∇ˆ 2 ψ2a = ∇ψ 2
(5.83)
Using block derivatives, the variables ψia and ψib , i = 1, 2 can be thought to live at the centers of each block. The variables are now organized to account for the multiple flavors which appear in the continuum limit. Introducing the block derivatives into the expression for the action written in terms of ψ’s gives S = ψ¯ 1a ∇ˆ 0 ψ2a − 12 ∇ˆ 02 ψ2a − i ∇ˆ 1 ψ2b − 12 ∇ˆ 12 ψ2b + ψ¯ 2a ∇ˆ 0 ψ1a + 12 ∇ˆ 12 ψ1a + i ∇ˆ 1 ψ1b − 12 ∇ˆ 12 ψ1b − ψ¯ 1b ∇ˆ 0 ψ2b + 12 ∇ˆ 02 ψ2b − i ∇ˆ 1 ψ2a + 12 ∇ˆ 12 ψ2a + ψ¯ 2b ∇ˆ 0 ψ1b − 12 ∇ˆ 12 ψ1b + i ∇ˆ 1 ψ1a + 12 ∇ˆ 12 ψ1a (5.84)
5.5 Staggered fermions and chiral symmetry
109
We can collect the terms linear in ∇ and identify the u and d fermions to write 1 ˆ2 a i ˆ2 b i ˆ2 b ¯ ˆ a a 1 ˆ2 a ˆ ˜ ˜ ˜ ¯ ¯ S = u¯ Du + d D d + ψ1 − ∇0 ψ2 + ∇1 ψ2 + ψ2 ∇0 ψ1 − ∇1 ψ1 2 2 2 2 1 ˆ2 b i ˆ2 a 1 i ∇0 ψ2 − ∇1 ψ2 + ψ¯ 2b − ∇ˆ 02 ψ1b + ∇ˆ 12 ψ1a (5.85) + ψ¯ 1b 2 2 2 2 ˆ using the identities discussed The awkward term d¯˜Dˆ˜ d˜ can be written d¯Dd earlier. The last four terms can also be simplified by rewriting them in terms of u and d and by introducing flavor-isospin notation, u f = (5.86) d and the flavor matrices, T0 =
0 −1 1 0
,
T1 =
0 −i −i 0
(5.87)
After some algebra, we have the final result 1 S = f¯ Dˆ f + f¯γ5 Tµ ∇ˆ µ2 f (5.88) 2 This form of the action has the good features discussed above. The first term is free of flavor mixing and displays the two species of the continuum limit. The second term is an irrelevant boson-kinetic-energy term with the fermion remnants, the matrices γ5 Tµ . We shall see below that the matrix character of this term insures the existence of a continuous remnant of chiral symmetry at all lattice spacings while it breaks explicitly the axial-flavor-neutral symmetries. 5.5 Staggered fermions and remnants of chiral symmetry One of the best features of Euclidean staggered fermions is that they possess a continuous remnant of chiral symmetry even on the strongly cutoff lattice. This allows one to study some aspects of spontaneous symmetry breaking and lightpion physics even on a coarse lattice. Just as in our discussion of the Hamiltonian, we will be discussing even and odd, alternating, lattice sites. To begin, reconsider the action written in terms of the χ variables, and imagine a global U(1) × U (1) invariance group such that the first U(1) acts on even sites and the second U(1) acts on odd sites, χ (n) → Ue χ (n),
n even;
χ (n) → Uo χ (n),
n odd (5.89)
χ(n) ¯ → χ¯ (n)Uo+ ,
n even;
χ¯ (n) → χ¯ (n)Ue+ ,
n odd (5.90)
and
110
Lattice fermions and chiral symmetry
Looking back at our expression for the action written in terms of χ , we see that this is clearly a symmetry because only even (odd) sites are coupled to odd (even) sites. It is instructive to write this symmetry in terms of the conventional Fermi fields, ψ1 → Uo ψ1 ,
u1 u2
ψ 2 → U e ψ2
(5.91)
ψ1a + ψ1b = ψ2a + ψ2b 1 1 u1 u1 Uo u 1 = (Uo + Ue ) + (Uo − Ue ) → Ue u 2 u2 −u 2 2 2 (5.92)
Similarly,
d˜1 d˜2
1 1 d˜ d˜1 → (Uo + Ue ) ˜ + (Uo − Ue )γ5 ˜1 d2 d2 2 2
(5.93)
However, this expression can be written in terms of d by recalling that d˜ = σ1 d and γ5 = σ3 , so d1 d1 d1 d˜1 + U− γ5 σ1 (5.94) = σ1 = U+ σ1 ˜ d2 d2 d2 d2 where we have introduced U± = (Uo ± Ue )/2. Now, d = σ1U+ σ1 d + σ1U− γ5 σ1 d
(5.95)
Finally, since σ1 γ5 σ1 = −γ5 , we can write d = U + d − U − γ5 d
(5.96)
which is identical to the transformation equation for u except for the last sign. Using the isospin notation, u f = d we have f = U+ f + U− γ5 T3 f where T3 is the third component of isospin, 1 0 T3 = 0 −1
(5.97)
5.6 Exact chiral symmetry
111
This transformation law is a symmetry of the action. We learn that the original Ue × Uo symmetry operation contains a vector piece, 1D × 1F , fermion-number conservation, and an axial piece, γ5 × T3 , one component of axial isospin. The second piece forbids a mass term in S. If interactions preserve this symmetry, as they will in our applications, then the mass counterterms cannot be generated by those interactions. This feature means that fine tuning of the fermion action will not be necessary in order to keep the fermion excitations light even in the presence of such interactions. In addition, if the symmetry breaks dynamically (spontaneously), as it will in applications, then, in more than two dimensions, a massless pion will appear in the theory’s spectrum in accordance with Goldstone’s theorem. Note that the irrelevant operator in the block-derivative expression for the action breaks the full axial-isospin symmetries to a smaller set of continuous symmetries. It breaks the axial-flavor-neutral symmetries explicitly. In addition to the continuous Uo × Ue symmetries, there are discrete symmetries of interest in S. For example, if we translate the system by an odd number of sites in either direction and transform phases appropriately, the action will be unchanged. Therefore, these symmetries are the “square roots of translations” and can be identified with γ5 , Ti , and their products. We refer the interested reader to the literature for more details. 5.6 Exact chiral symmetry on the lattice Both the original Wilson method and the staggered-fermion method suffer from having too little symmetry on a lattice with a fixed spacing a. However, their continuum limits should have the correct number of species with the correct chiral and flavor symmetries. The approach to the continuum limit can be sped up considerably by improving both actions with next-to-nearest-neighbor couplings whose strengths are engineered to suppress O(a 2 ) symmetry-breaking effects. With these improvements physical, accurate results from simulations can be expected on modestly sized lattices. In addition, there are more inventive lattice-fermion approaches that possess more symmetries even on the cutoff lattice, have the correct axial anomaly structure, and may yield nonperturbative formulations of chiral gauge theories such as the standard model of electroweak interactions. Our emphasis here remains on vector theories like QCD, so we will not discuss left–right-asymmetric models, but we shall see that the desire to formulate such models on the lattice in a consistent, nonperturbative fashion is having very significant influence on QCD research. 5.6.1 Domain-wall fermions Domain-wall methods to decribe light fermions have been known to workers in statistical physics for over 60 years [26], but were rediscovered in lattice theory
112
Lattice fermions and chiral symmetry
in the 1990s [27]. Inspiration in high-energy physics came from efforts by physicists interested in grand unification to understand global anomalies within chiral gauge theories for which the renormalizability of the field theory requires that the gauge currents remain anomaly-free. For example, the standard model of electroweak interactions has an anomalous global baryon current, but has anomaly-free local chiral gauge currents. Engineering such a feat on the lattice would be highly nontrivial in the light of the species-doubling phenomena we have already discussed for Wilson and staggered fermions. Here we will be less ambitious and use the domain-wall method to formulate QCD, a vector gauge theory with no left–right asymmetries, with exact chiral and flavor symmetries even at strong coupling on a lattice with a large cutoff length a. Such a formulation has enormous advantages over traditional Wilson and staggered approaches. Having the Goldstone bosons of broken chiral symmetries on strongly coupled lattices for analytic as well as numerical purposes is a great help. One might worry that this is too much of a good thing in the sense that having massless modes on a finite lattice will lead to huge artificial finite-size effects. In fact, when m π L ≈ 1 (L is the linear extent of the lattice), finite-size effects are considerable, but chiral perturbation theory has precise predictions for them, which is another great help. To begin, let’s outline the strategy used here. In condensed-matter physics one finds models in which there are interfaces between different phases of matter and on the interface conduction electrons can propagate as massless modes. We begin with four-dimensional Euclidean lattice-gauge theory and we append to it a fifth dimension in which a fermion propagates with a spatially dependent mass. Let the mass term in the fifth dimension have the spatial dependence of a kink. We shall see that, in this case, the free Dirac equation will have a massless chiral zero mode bound to the four-dimensional domain wall. There will also be the doublers with opposite chirality on the wall, as we have seen in our discussion of naive fermions above. However, the doublers can be removed by the Wilson term. At this point we have the degrees of freedom of a four-dimensional chiral gauge theory as long as the massive fermion modes which are not bound to the defect decouple and provided that the gauge fields just propagate along the defect in the four dimensions of the original lattice-gauge-theory formulation. On the lattice we must specify the extent of the fifth-dimensional world or give it periodic boundary conditions. For open boundary conditions, there will be an interface at one end of the fifth-dimensional world and an anti-interface at the other end with L s lattice spacings between them. This means that we can arrange for one handedness of fermion on the interface and the opposite handedness on the anti-interface L s lattice spaces away in the fifth direction. These components then comprise a single Dirac fermion in which the left- and right-handed components do not mix in the limit L s → ∞. Therefore, the fermion system has exact chiral symmetry on the lattice! This is perfect for QCD applications.
5.6 Exact chiral symmetry
113
Extensions of the domain-wall idea to chiral rather than vector gauge theories are controversial and will not be discussed here. To illustrate how the domain-wall method works, consider the free fivedimensional Dirac equation coupled to a domain wall [28]. Denote the five coordinates z µ = ( x , s), where x is the coordinate of our familiar four-space. Taking the fifth dimension to have infinite, open extent, we make a domain wall by considering a spatially dependent mass term with a kink, m(s) → ±m ± s→±∞
(5.98)
where m ± > 0. The Dirac operator is then + γ5 ∂s + m(s) D5 = /∂ + m(s) = γ · ∇
(5.99)
If we choose m(s) to vanish at s = 0, we expect massless modes bound to the 0 and would describe massless defect. These zero modes satisfy D5 #0 = γ · ∇# propagation along the defect. In detail, we choose the zero modes to have the form, #0± = ei p · x φ± (s)u ±
(5.100)
where the u ± are constant four-component chiral spinors, γ5 u ± = ±u ± , and the functions φ± (s) satisfy the one-dimensional differential equation [±∂s + m(s)]φ± (s) = 0.
(5.101)
The solutions to this equation are
s φ± (s) = exp ∓ m(s ) ds
(5.102)
0
Note that only one of these solutions, φ+ (s), is normalizable and admissable. We learn that there is a single, positive-chirality fermion bound to the defect. This continuum analysis is easily transplanted to the lattice. If we discretize the Dirac operator in the manner of Wilson and use a five-dimensional Wilson term, we can arrange the system to have the same low-energy degrees of freedom as our simple continuum analysis presented above. Again, only a single positivechirality mode will persist. Of course, on a finite lattice, we would choose periodic or open boundary conditions in the fifth dimension. For example, take a lattice of length L s is the s direction and choose periodic boundary conditions. Then, given a domain wall at s = 0, there must be an anti-domain wall at s = ±L s /2. For every zero mode at s = 0, there is a zero mode of opposite chirality at s = ±L s . The vector character of the five-dimensional theory is explicit. The zero modes at s = 0 and s = ±L s have exponentially small overlap, so for finite L s there is an exponentially small breaking of chirality. This breaking is completely decoupled from the restoration of the spacetime symmetries
114
Lattice fermions and chiral symmetry
of the four-dimensional lattice system because one could choose different lattice spacings in the s direction and in the four physical directions. The breaking of chiral symmetry arises only through the finiteness of the fifth dimension. When gauge fields and the full structure of lattice-gauge theory are incorporated into this framework, we continue to use the same plaquette action in the four-dimensional world, but we do not let the gauge fields extend into the fifth dimension. Now the physical content of the four-dimensional theory brings up some issues challenging our understanding of the Nielsen–Ninomiya theorems. In particular, at low energies it appears that the domain-wall formulation has produced an effective four-dimensional theory of fermion zero modes coupled to gauge fields, which in general must be expected to be anomalous and cannot be simultaneously locally gauge-invariant and chirally invariant. The resolution to this puzzle was, in fact, provided long before the application of these ideas to lattice-gauge theory. In the context of grand unified theories, it was understood that heavy fermions that live away from the domain walls do not fully decouple from the four-dimensional dynamics. They provide the anomalous currents which flow into the fifth dimension and account for the “apparent” nonconservation of the flavor-singlet chiral current in our four-dimensional world [28]. So, the extra dimension is the “loophole” in the Nielsen–Ninomiya theorem that the domain-wall fermions have exploited. Let’s record the domain-wall fermion action as it is used in practical calculations. Some explicit chiral-symmetry breaking is inserted into the model in the form of a conventional bare-quark mass m. The chiral limit of the system is found by first taking L s → ∞ and then taking m → 0. The fermion action reads ¯ / x,y δs,s + D Sf = − ψ(D / s,s δx,y )ψ (5.103) x,y,s,s
where † δx−µ,y − (M − 4)δx,y ] (5.104) D / x,y = 12 [(1 + γµ )Ux,µ δx+µ,y + (1 − γµ )U y,µ
and D / s,s
PR δ1,s − m PL δ L s −1,s − δ0,s = PR δs+1,s + PL δs−1,s − δs,s −m PR δ0,s + PL δ L s −2,s − δ L s −1,s
s=0 0 < s < L s − 1 (5.105) s = Ls − 1
where PR,L = (1 ± γ5 )/2 are the chiral projection operators. The effectiveness of this action in carrying out the original plan of the domain-wall philosophy is discussed in the literature [29]. Domain-wall fermions have been used for simulations of QCD at nonzero temperature. They could also be used at nonzero chemical potential, following the rules we will introduce in the context of naive and staggered fermions in the
5.6 Exact chiral symmetry
115
next chapter. Since domain-wall fermions are much more computer intensive than staggered fermions, simply because the algorithm runs in five dimensions rather than four and the fifth dimension cannot be taken to be small, their use in applications is still in its infancy. However, for applications in which chiral and flavor symmetries are paramount, we expect that the method will have a bright future. 5.6.2 The Ginsparg–Wilson relation Long ago, in the early days of lattice-gauge theory, Ginsparg and Wilson, who were engaged in a renormalization-group analysis, suggested that lattice Dirac operators D should be constructed to satisfy the relation [30], γ5 D + Dγ5 = a Dγ5 D
(5.106)
because it guarantees that the fermion propagator anti-commutes with γ5 at nonzero distances on the lattice with spacing a, thereby respecting an aspect of chiral symmetry. In fact this relation has a more fundamental and useful interpretation [31]. Instead of considering the usual chiral rotation ψ → eiγ5 ψ,
¯ iγ5 ψ¯ → ψe
(5.107)
consider a version adapted to a lattice with a given spacing a, 1
ψ → eiγ5 (1− 2 a D) ψ,
¯ i(1− 12 a D)γ5 ψ¯ → ψe
(5.108)
If we take to be infinitesimal, then one can easily check that Eq. (5.106) follows. Equation (5.108) can also be generalized to flavor-non singlet chiral transformations by including a flavor matrix into the transformation law in the usual way. This “success” looks too good to be true. It appears that we now have a gaugeinvariant regularization scheme that has no chiral anomalies. We know from the Nielsen–Ninomiya lattice arguments that this is not possible, assuming that the lattice Dirac operator has reasonable properties that we discussed above. So, can we find a reasonable expression for D that satisfies Eq. (5.106)? A particularly simple candidate is [32] DN =
1 [1 − A(A† A)−1/2 ], a
A = 1 − a DW
(5.109)
where DW is the Wilson form of the lattice Dirac operator, having an irrelevant boson-like piece to suppress fermion doublers at the edges of the Brillouin zones, as discussed and recorded above. We can check that Eq. (5.109) satisfies Eq. (5.106) by direct sustitution. However, DN looks pathological because it involves the reciprocal of the square root of an operator. However, DN is not
116
Lattice fermions and chiral symmetry
nonlocal. If we transform to momentum space, we can see this explicitly. Use the abbreviations pˆ = (2/a) sin(apµ /2) and p˜ = (1/a) sin(apµ ) and then calculate the Fourier transform, −1/2 1 1 ˜ p) = 1 − 1 − a 2 pˆ 2 − iaγµ p˜ µ 1 + a 4 pˆ 2 pˆ 2 (5.110) a D( 2 2 µ<ν µ ν This expression has the right properties. It is analytic and periodic in pµ within the Brillouin zone, it reduces to iγµ pµ for low momenta, it is invertable for all nonzero momenta, and in x space it falls off exponentially at a rate proportional to 1/a. Since it does not anti-commute with γ5 , it does not have the usual chiral symmetry appropriate to continuum field theory and the Nielsen–Ninomiya nogo theorem does not apply to it. Nonetheless, since the symmetry it does have is a continuous chiral transformation relevant to the lattice, we need to understand the fate of the anomaly [31]. Consider calculating an expectation value of an operator O involving fermion fields, ¯ O F = dψ(x) dψ(x) O e−SF (5.111) x
Now apply the lattice chiral symmetry Eq. (5.108) for infinitesimal , and find that the variation in the expectation value is not zero because the measure in the fermion integral is not invariant, δ O F = −a tr{γ5 D} O F
(5.112)
where the trace goes over all fermion fields. In the case of free fermions, the trace vanishes and the symmetry is exact, as expected. In addition, for flavornonsinglet chiral rotations, we obtain zero because of the presence of the group generator in the trace. However, in other interacting theories −a tr{γ5 D} is the chiral anomaly and it will not vanish in general. A formal, but brief, derivation of the anomaly equation can be given in this context [31]. More explicit derivations also exist. Begin by writing the left-hand side of Eq. (5.106) for the operator z − D in the case that D satisfies Eq. (5.106), a(z − D)γ5 (z − D) = z(2 − az)γ5 − (1 − az)[(z − D)γ5 + γ5 (z − D)] (5.113) Since we want an expression for the anomaly, multiply this equation by (z − D)−1 , taking z outside the spectrum of D, and trace it, −a tr{γ5 D} = z(2 − az) tr{γ5 (z − D)−1 }
(5.114)
5.7 Chiral-symmetry breaking
117
Finally, divide throughout by z(2 − az) and integrate over an infinitesimal circle that encloses only the spectral value 0 of D, −a tr{γ5 D} = 2 tr{γ5 P0 } = 2Nf × index(D) where
(5.115)
dz (z − D)−1 (5.116) 2π i We identify Eq. (5.115) as a formal statement of the axial anomaly, expressed in terms of zero modes of the Dirac operator, which is particularly appropriate for topological analyses, and was touched on above in our discussion of instantons. This very curious fermion method is being studied theoretically and numerically and it looks promising on all fronts. The action associated with DN involves direct couplings between arbitrarily separated sites and is not local. The importance of these direct couplings will depend on the gauge fields involved, so little can be said of a general, generic nature. However, results of some studies suggest that it will be possible to effect the inversions of the lattice Dirac operator to implement simulation studies based on this fermion method. Several more orders of magnitude of computer time will be needed compared with the use of staggered fermions. The method could be applied to finite-temperature studies of QCD and QCD with a chemical potential, if all these preliminaries work out. The usefulness of having another fermion method with good chiral properties cannot be underestimated in this field. Looking beyond vectorlike theories such as QCD, there is the hope that the Ginsparg–Wilson-fermion method will allow precise and practical formulations of the standard model of electroweak interactions as well as other chiral gauge theories. Then their nonperturbative physical content could be analyzed. In the case of the standard model, such an advance would allow simulations of the early universe in which electroweak symmetry breaking was restored. Perhaps early-universe studies of the baryon asymmetry in the universe would yield to simulations. Perhaps. P0 =
5.7 Chiral-symmetry breaking on the lattice Lattice-gauge theory provides a good framework for studying the dynamical breaking of chiral symmetry and soft-pion phenomenology. Condensation phe¯ nomena and the calculation of order parameters such as ψψ are practical jobs for computer simulations of QCD and much has been learned about this subject from numerical simulations over the years. Perhaps the simplest argument for the spontaneous breaking of chiral symmetry was abstracted by A. Casher from models in 1 + 1 dimensions [33]. The argument motivates the belief that vector theories that confine quarks necessarily
118
Lattice fermions and chiral symmetry
spontaneously break chiral symmetry. Begin with a theory whose Lagrangian is chirally symmetric and whose quarks have vanishing bare masses. Suppose also that the confining force is spin-independent and that bound q¯q pairs exist in an s-wave state. View these states semi-classically as in the quark model. Consider a turning point in the worldline of the bound quark. The quark must turn around, but its spin does not because the confining force does not cause spin flips. Therefore, its chirality (chirality equals helicity for massless quarks) must change sign. So chirality is not a good quantum number and we must have dynamically induced chiral-symmetry breaking! What is the physical mechanism behind this miracle? When the quark turns around, it must pick up a unit of chirality from the vacuum. Therefore, the vacuum must have indefinite chirality, as a condensate of q¯ q pairs would. Chirality must be spontaneously broken in order to support the bound states assumed. In fact, the formation of bound states, rather than confinement, is central to Casher’s argument. In QCD it appears that chiral-symmetry breaking occurs at intermediate length and coupling scales on which the instanton liquid describes the dynamics. Confinement and its accompanying linear potential work on larger length scales. The Casher physical picture of chiral-symmetry breaking is rather different than that presented by the model which began the field of soft pions and chiralsymmetry breaking, the Nambu–Jona Lasinio model [34]. These models were inspired by BCS superconductivity in which short-range forces between electrons cause the formation of an ee condensate giving rise to superconductivity in various low-temperature metals. The original BCS models of superconductivity were essentially (1 + 1)-dimensional Gross–Neveu (four-Fermi) models which were asympototically free and led to superconductivity even in the weakcoupling limit. The point was that arbitrarily weak short-range couplings between electrons at the top of a filled, sharp Fermi surface are always sufficient to destabilize the free-electron state and lead to a lower-energy state in which the electrons are paired up (Cooper pairs) into charge-two loosely bound states that can condense. The four-dimensional four-Fermi models considered by Nambu and Jona Lasinio to model chiral-symmetry breaking required a critical coupling: only above the critical coupling, g 2 > gc2 , would chiral symmetry be broken. Unlike the BCS model, there is no filled Fermi sea in the quantum vacuum, just a filled Dirac sea, so there is no inherent instability for a weak interaction to play upon and cause condensation. Instead, the short-range four-Fermi attractive potential must be increased in strength before quark–anti-quark pairs will condense into the vacuum and spontaneously break chiral symmetry. Unfortunately, the four-Fermi model in four dimensions is unrenormalizable, so the Nambu–Jona Lasinio model suffers from cutoff dependence and various other pathologies. However, there is a great deal of truth in their enormously important work.
5.7 Chiral-symmetry breaking
119
There is an important rigorous relationship between the spectrum of the quark propagator and the chiral condensate that was inspired by Casher’s qualitative argument. Consider ρ(λ) dλ, the mean number of eigenvalues of the fermion propagator contained in the interval dλ, per unit volume. The theorem of Banks and Casher relates this density of zero modes to the chiral condensate [35], 0|qq|0 ¯ = −πρ(0)
(5.117)
To derive this result, consider the fermion Green function S F in a background gauge field F, S F (x, y) = q(x)q(y) ¯ F =
u n (x)u †n (y) n
m − iλn
(5.118)
where u n (x) and λn are eigenfunctions and eigenvalues of the Euclidean Dirac operator, D / u n (x) = λn u n (x)
(5.119)
Note that, except for zero modes, the eigenfunctions occur in pairs, u n and γ5 u n , with opposite eigenvalues. So, setting x equal to y and integrating over x, we have, 1 1 2m d4 x 0|q(x)q(x)|0 ¯ (5.120) F = − 2 V V λn >0 m + λ2n Averaging over all gauge-field configurations and taking the thermodynamic limit V → ∞, ∞ ρ(λ) 0|qq|0 ¯ = −2m dλ 2 (5.121) m + λ2 0 This is a formal expression since the integral is not well behaved at large λ. However, we are interested just in the possibility of spontaneous symmetry breaking, which is controlled by the low-λ behavior of ρ(λ). In fact, if ρ(λ) behaves as λα , then the right-hand side of this equation is propotional to m α . Therefore, a nonzero result arises if and only if ρ(λ) tends to a nonzero limit for λ → 0. On completing the integration over λ in this case, we obtain the Banks–Casher result. Since the fermion propagator is central to lattice-gauge-theory algorithms and the density of eigenvalues is sensitive to topological excitations in QCD, this result has played an important role in diverse studies of chiral-symmetry breaking. There are many related ideas about the mechanism of chiral-symmetry breaking in the modern field-theory literature. For example, in gauge theories of grand unified theories (GUTs), one develops a rough picture of spontaneous
120
Lattice fermions and chiral symmetry
symmetry-breaking patterns by modeling the theory’s complex dynamics with simple, single-gauge-boson exchange between quarks and anti-quarks. On the basis of some experience with dynamical calculations using gap and Schwinger– Dyson equations, one supposes that a condensate forms and the relevant symmetry breaks dynamically if the gauge-boson coupling exceeds a critical value of the order of unity, gc2 /(4π ) ≈ O(1). In these models with many quark and boson species, one considers all the possible condensates and symmetry-breaking patterns and one considers single-boson exchange and hypothesizes that there will be condensation and symmetry breaking in that channel where single-boson exchange yields the strongest force [36]. Quarks in high representations of the gauge group are, therefore, predicted to condense preferentially. Models addressing the hierarchy problem of GUTs have been proposed on this basis. Since the dynamics of these theories are so rich, other seemingly totally different mechanisms have also been proposed. For example, instantons can lead to condensation and spontaneous symmetry breaking and these effects must, if possible, be accounted for. In the context of QCD in baryon-rich environments, the role of instantons appears to be especially important at moderate densities. Lattice-gauge theory can add controlled, accurate calculations to all of these scenarios. Large-scale simulations of QCD have given good evidence of chiralsymmetry breaking in fairly realistic settings and these calculations are becoming more sophisticated with time. Soft-pion dynamics can also be simulated, so the low-energy theorems that accompany spontaneous symmetry breaking can be simulated, checked, and understood as well. These calculations must make contact with the continuum limit of the lattice model in order for them to have real value and, for the present generation of simulations, this requires large lattices and weak gauge coupling, according to asymptotic freedom. It is interesting and instructive to see chiral-symmetry breaking in a simple and short lattice calculation [37]. This can be done for strong coupling using expansion methods, as we will illustrate now. These calculations offer a realization of Casher’s argument: the lattice model of QCD confines quarks at strong coupling and it also breaks chiral symmetry. Consider the lattice action for a “naive”-fermion formulation of QCD, S =
1 2 +
¯
r,ν ψ(r )γµ n µ U (r, n µ )ψ(r
1 2g 2
plaq
tr UUUU + h.c.
+ nµ) + m
¯
r ψ(r )ψ(r )
(5.122)
where the sum over the unit vector n µ includes all the sites r + n µ neighboring r . We will use this naive-fermion action (species “doubling” would produce 16 continuum quarks) because the notation is very simple and clear, and it will be adequate to illustrate several points.
5.7 Chiral-symmetry breaking
121
Now suppose that the lattice coupling is large, g 2 1. Then the tr UUUU term can be dropped. The U matrices lose their real dynamics and become random matrices whose functional integrals are easy. We will discuss this aspect of the calculation below after we have seen that the model breaks chiral symmetry dynamically. We have added a small fermion-mass term as a symmetry-breaking field to the action so that we can search for spontaneous symmetry breaking in the usual fashion – by calculating the condensate and other quantities at nonzero m and then letting m → 0 and seeing whether the condensate survives. We organize the action into two terms, S = S0 + Sint
(5.123)
¯
+ nµ) (5.124)
where S0 = m
¯
r ψ(r )ψ(r ),
Sint =
1 2
r,ν ψ(r )γµ n µ U (r, n µ )ψ(r
The hopping term, Sint , will have to be treated to all orders if we are to obtain useful results. The calculation will resemble a self-consistent mean-field theory expressed in graphical terms. In order to evaluate the graphs, we will need two basic contractions. The first deals with the fermion piece of the action, b 1 ψβ (r )ψ¯ αa (r ) = δαβ δ ab m
(5.125)
where α and β are color indices ranging from 1 to N for SU(N ) color, and a and b are the spinors’ Dirac indices ranging from 1 to 2d/2 (d is the dimensionality of spacetime). The expectation value in the equation is taken with respect to S0 . When Sint is treated as a perturbation, i.e. e S0 +Sint is expanded in powers of Sint , local gauge invariance implies that only closed fermion loops contribute. On reading off Sint , we see that each link contributes a factor − 12 γµ n µU (r, n µ ) to the amplitude. We will meet the contraction 1 † (5.126) [dU ]Ui j Ukl = δil δ jk N in these strong-coupling expansions. The final graphical rule we need is the prescription, from Fermi statistics, that each closed fermion loop is accompanied by a factor of −1. Now we can begin constructing graphs. The zeroth-order contribution for ¯ ψ(0)ψ(0) is N C/m, where we have summed over the indices on the Fermi fields and have defined C = 2d/2 for clarity.
122
Lattice fermions and chiral symmetry
0 Figure 5.4. Dominant graphs contributing to chiral-symmetry breaking on the four-dimensional lattice for strong coupling, large N , and large spacetime dimensionality. To second order in Sint , the fermion can hop to a nearest neighbor and back, so it contributes (N C/m)[2d/(4m 2 )], where the factor 2d counts the number of nearest neigbors. Note that the fermion path encloses zero area and the integral † [dU ]Ui j Ukl = (1/N )δil δ jk has been used. To proceed to higher orders we must take another limit to control the topologies of the graphs which dominate the calculation. In the limit of a large number of colors, N → ∞, we will argue that only graphs with many zero-area “petals,” as shown in Fig. 5.4, dominate. A graph with n petals contributes ¯ 0 n d n NC N C d ψψ An = = m 2m 2 m 2m N C
(5.127)
where we have identified the zeroth-order estimate for the chiral condensate, ¯ 0 , on the right-hand side of this expression. ψψ At this point we could just select this set of graphs, let n → ∞, and compute ¯ ψ(0)ψ(0). However, a more interesting, dynamical, and self-consistent calculation can be done. Recognize that the end of each “petal” of each graph is the source of an additional “flower.” When this is done, the perturbative expansion at N → ∞ becomes a fermion mean-field theory. With this improvement the graphs An become, simply on replacing the zeroth¯ 0 by its “exact” value determined by self-consistency, order ψψ n ¯ N C d ψψ An = m 2m N C
(5.128)
5.7 Chiral-symmetry breaking
123
¯ Summing all these graphs for ψψ gives n ∞ ¯ d ψψ CN ¯ ψψ = m n=0 2m 2 N C CN (5.129) d ¯ ψψ m− 2N C which expresses the self-consistency of this mean-field √approach. In the limit ¯ m → 0, we find a nonsingular result, ψψ = −iC N 2/d. Finally, mapping back to Minkowski space, ψ → ψM , we have 3 2 ¯ (5.130) = −C N ψ¯M ψM = −iψψ d which gives spontaneous chiral-symmetry breaking. Let’s end with a few qualitative comments about this calculation. First, note ¯ the mean-field character of the calculation: the order parameter ψψ is deter¯ mined at one site in terms of ψψ at its nearest-neighbor sites self-consistently. Note that the “petals” in An do not overlap, so our counting factors are good only for large dimensions d, another approximation underlying most mean-field calculations. One should consider this calculation as the leading nontrivial term in a 1/d expansion. Another technicality: vacuum fluctuations – disconnected graphs – did not appear in the calculation. One can check, however, that such graphs cancel out between the numerator and denominator in the formal expres¯ sion for ψψ at large d. The calculation respects the spirit of Casher’s intuitive argument. In particular, the random character of the U matrices at strong coupling was essential here and it was also the ingredient in showing confinement, the area law for Wilson loops. Also, at strong coupling quarks will be bound into s-wave bound states. Another illuminating calculation is the verification of the Goldstone theorem here. We want to check that the spontaneous breakdown of chiral symmetry is accompanied by a massless pion and that the PCAC relation holds. It is interesting that we can obtain these results easily on the lattice at strong coupling. Although the lattice badly violates spacetime symmetries, the staggered-fermion method has enough elements of chiral symmetry to get some qualitative physical effects correct. Of course, these results must be obtained in the theory in the weak-coupling, continuum limit in order to make contact with real physics. That is being done through extensive computer simulations at present, and quantitative agreement between the lattice results and hadronic data is emerging. We need to calculate the pion propagator, =
¯ )γ5 ψ(r ))(ψ(r ¯ )γ5 ψ(r )) D(r − r ) = (ψ(r
(5.131)
124
Lattice fermions and chiral symmetry
Figure 5.5. Graphs contributing to the pion propagator at strong coupling. in the same fermion mean-field scheme as that in which we calculated the order ¯ parameter ψψ. The graphs that dominate are shown in Fig. 5.5 and they can be summed by the same methods as those used to obtain the order parameter ¯ ψψ, yielding D(k) ∼
k→0
4N C 4d ¯ k2 + mψψ NC
(5.132)
We can let the explicit chiral-symmetry-breaking effect m vanish, and see that the pion becomes massless. For m small but nonzero, we can also read off a strong-coupling version of the current-algebra PCAC relation, f π2 m 2π = mψ¯ M ψM
(5.133)
NC 2 m = mψ¯ M ψM 4d π
(5.134)
in the form
Because of these results we can expect a smooth approach to continuumlimit soft-pion physics starting from lattice calculations. In addition, we can ¯ easily experiment in lattice calculations with ψψ and the pion propagator in high-temperature and high-chemical-potential environments to discover new phenomena. We have left one detail undone. We need to check that the graphs shown in Fig. 5.5 actually dominate the calculation when N → ∞. The argument
5.7 Chiral-symmetry breaking
125
0
(a)
(b)
Figure 5.6. Dominant and subdominant graphs at large N .
here is essentially the same as the more familiar argument in the continuum N → ∞ limit of Yang–Mills theory. For example, consider Fig. 5.6(b) with the subdominant graph having a hole in the quark worldline and compare it with Fig. 5.6(a) without the hole. The dominant graph without the hole behaves as N , which simply counts the number of quark colors circulating in the closed infinitely narrow fermion loop. The graph with the hole behaves as O(1) in N . This can be checked explicitly using the U -matrix integral discussed above. Alternatively, one can note that both branches of the internal loop in Fig. 5.6(b) with the hole must be local color singlets separately so there is an additional constraint in this graph, which eliminates one power of N . One more observation here concerns the quenched limit of the theory. This is the limit in which the number of fermion species Nf is taken to zero so that the dynamics becomes that of just the gauge fields. It is clear in the “flower” graphs of the mean-field calculation that the fermion effects consist of a single continuous worldline. It is also clear that the graphs with loops, such as that shown in Fig. 5.6(b), are subdominant in the quenched Nf → 0 limit. So this strong-coupling calculation suggests that the quenched limit of the theory, which is relatively easy to study by simulation methods since there are no quark loops in the dynamics, should be at least a good qualitative guide to the physics of spontaneous chiral-symmetry breaking. This has certainly proven to be the case, even though the quenched limit has some pathologies that ultimately limit its usefulness. In particular, the quenched limit ignores the anomaly and suffers from too much symmetry, a light η , and attendant specious infrared singularities.
126
Lattice fermions and chiral symmetry 5.8 Simulating dynamical fermions in lattice-gauge theory
The basic four-dimensional Euclidean action for the lattice-gauge-theory version of QCD reads S= / (U ) + m]i j ψ j + S0 (U ) (5.135) ψ¯ i [D ij
where the ψi are noncommuting Grassmann fields on site i that represent the fermions, Ai j = [D / (U ) + m]i j is the gauge-covariant Dirac operator and S0 (U ) is the pure gauge-field action, the trace of the product of U matrices around a plaquette in the simplest actions. The precise form of the gauge-covariant discrete-difference operator D / depends on the lattice-fermion method employed. We will be emphasizing and illustrating the use of staggered fermions here because of their good chiral-symmetry properties, their practical advantages in simulations, and their wide use in thermal and chemical-potential studies of lattice QCD. So in this case ψi will be a one-component Grassmann field and the first term of the action reads 4 1 † ¯ ηµ [Uµ (n)ψ(n + µ) − Un−µ (n)ψ(n − µ)] + m ψ(n) ψ(n) 2 µ n (5.136) where the ηµ are the phase factors guaranteeing that ψ satisfies the fourdimensional Dirac equation and carries spin 12 . The reader will recall the appearance of these phases ηµ in our explicit two-dimensional discussion of staggered fermions. The four-dimensional generalization can be found in the literature. The Uµ (n) in the formula are the gauge-field SU(3) rotation matrices that reside on the links of the lattice between sites n and n + µ. A lattice-gauge-theory simulation of QCD would calculate expectation values of gauge- and fermion-field observables in the partition function, Z= dψi dψ¯ j dUµ (n) exp(−S) (5.137) i
j
n,µ
Since the ψ are anti-commuting numbers, a direct simulation of this equation through a Metropolis or heat-bath algorithm familiar from classical statistical mechanics is not practical on a classical computer that can manipulate only ordinary binary numbers. Luckily, the Grassmann variables can be integrated out since they appear only in a quadratic form, Z = dUµ (n) det[D / (U ) + m] exp(−S0 (U )) n,µ
=
n,µ
dUµ (n) exp[−S0 (U ) + tr ln(D / + m)]
(5.138)
5.9 The microcanonical ensemble
127
where we see the infamous fermion determinant. It is not clear that integrating out the fermions has done us much good, since the final expression is a nonlocal effective interaction among all the Uµ (n) matrices. At least one can check that the fermion matrix D / + m has a positive-semi-definite determinant, so the measure does constitute a conventional probabilistic measure. The fermion determinant contains the physics of all closed fermion loops. In particular, it contains the Fermi-statistic rule that associates a factor of minus 1 with each closed fermion loop. Various approaches to evaluating the partition function by importance sampling have been suggested. We will consider the microcanonical, Langevin, and hybrid approaches here because they have been particularly productive in QCD simulations. Many results have been obtained for QCD spectroscopy at zero temperature, as well as the thermodynamics at nonzero temperature. Unfortunately, as we will discuss in greater detail below, the fermion determinant is not real at nonzero chemical potential for the gauge group SU(3). Therefore, most simulation studies have been performed for the gauge group SU(2) or the isospin chemical potential for the gauge group SU(3) and, as we will discuss, the most interesting physics has not been studied by lattice methods. It is to be hoped that improvements will be made, new algorithms will be invented, and this situation will improve. Presumably these developments will grow out of the known successful but limited methods which will be introduced now. 5.9 The microcanonical ensemble and molecular dynamics We begin with the molecular-dynamics approach to problems in equilibrium statistical mechanics because this framework will produce a means to simulate the fermionic partition function above. To begin, we review the idea of this approach. Suppose that we had a sample of material and wanted to measure its bulk thermodynamic properties to high precision. We would prepare a sample of the material and make many sequential measurements of the properties of interest to us. We need many measurements in order to obtain averages with small statistical-error bars because the system is experiencing fluctuations due to its internal dynamics and, perhaps, its external environment. Our averaging procedure consists of taking a time average over the system’s intrinsic dynamics. Under many circumstances we will discuss below, this time average will be identical to a configurational average of the same properties. The configurational average is an idea due to the inventors of statistical mechanics, Maxwell and others, whereby one imagines preparing a very large ensemble of identical copies of the physical system and taking averages by concatenating measurements of each particular member of the ensemble. For the same given external conditions, the time average of the properties of one system can be shown to be identical to the ensemble-average. In most theoretical discussions, one focuses on the
128
Lattice fermions and chiral symmetry
ensemble-average approach because of its mathematical clarity, but in most experimental situations, one measures the time averages of the observables and relies on the molecular dynamics of the system to sweep the one experimental system through the available phase space sufficiently that the idealized ensemble average and the actual time average are identical. Let’s illustrate the molecular-dynamics approach with a simple problem. Consider a boson field φ defined on a lattice with sites labeled i. The theory has an action S(φ) that determines its partition function and equilibrium statisticalmechanical properties. The system, however, does not have a natural dynamics to describe its approach to equilibrium. We are free to invent dynamics as long as the resulting scheme produces the correct equilibrium statistical mechanics. Two schemes that have been studied and analyzed extensively are the moleculardynamics approach to the microcanonical ensemble and the Langevin equation for the canonical ensemble. In the molecular-dynamics approach, we associate the action S(φ) with a “potential” V (φ) ≡ β −1 S(φ) and construct a fictitious “Hamiltonian,” H =T +V =
1 i
2
pi2 + V (φ)
(5.139)
where i labels the lattice sites and pi will be interpreted to be the momenta conjugate to the field φi . However, using this Hamiltonian we could consider the canonical ensemble and the classical statistical mechanics based on the phase space i d pi dφi and the Boltzmann factor exp(−β H ). Since the pi integrals are trivial, this formulation reduces to the original path-integral formulation of the boson-field theory, Z = i dφi exp S(φ). To construct a molecular-dynamics approach to the same problem, identify pi with the momenta conjugate to φi by introducing a “time,” τ , into the problem, pi = dφi /dτ . In the context of computer simulations, τ is essentially computer time. Now the ensemble given by the phase-space measure i d pi dφi and the Boltzmann factor exp(−β H ) defines the usual canonical ensemble of classical statistical mechanics. There is still nothing new or of any advantage here until one passes to the microcanonical ensemble. In this approach the energy of the physical system is fixed, H = E, and the measure in phase space is i d pi dφi δ(H − E). Observables in the system are functions of pi and φi and have expectation values 1 F = d pi dφi δ(H − E)F( pi , φi ) (5.140) Z i If F is just a function of the field φi , then standard arguments show that F calculated in the microcanonical ensemble is identical to F calculated in the canonical ensemble in the thermodynamical limit of large volumes, V → ∞.
5.9 The microcanonical ensemble
129
However F can also be calculated from the evolution with “time” of the classical system. This is the molecular-dynamics approach to the problem. Let (φi (τ ), pi (τ )) label the phase-space point of the physical system. Then a “time” average of F would be 1 T F( pi (τ ), φi (τ )) dτ (5.141) F = lim T →∞ T 0 This “time” average reproduces the expectation value in the microcanonical ensemble stated above if the ergodic hypothesis works for this physical system. Roughly speaking, one must assume that the molecular dynamics of the system carries the phase-space point (φi (τ ), pi (τ )) uniformly over the entire energy shell H = E. One says that the degrees of freedom “mix” under their dynamical evolution. Exceptions to the ergodic hypothesis are afforded by free fields, for which there is, of course, no mixing at all, and weakly coupled systems, for which the mixing is inadequate or incomplete. In computer simulations of molecular-dynamical systems, one must carefully check that adequate mixing occurs. Since molecular-dynamics simulations are done at fixed “energy” rather than fixed “temperature,” we must read off the system’s “temperature” from the results. (In most of our applications, “temperature” is actually “coupling.”) The precise correspondence follows from the equipartition theorem for the kinetic energy, E, K E = 12 β −1 N ∗
(5.142)
where N ∗ is the number of independent, physically excited degrees of freedom in the system. The molecular-dynamics approach to the boson system represents a clear alternative to standard Metropolis and heat-bath methods for simulation of the canonical ensemble. Those approaches move the phase-space point of the system over the entirety (it is hoped) of the phase space at given temperature in small, local jumps. In the molecular-dynamics approach, “time” averages of observables are accumulated over long computer runs. The moleculardynamics approach simulates the microcanonical ensemble at given “energy” and it is (1) fully deterministic, (2) involves ordinary, coupled differential equations, and (3) generalizes to a practical, local algorithm for noncommuting fermions. The molecular-dynamics approach to simulating QCD, whose action was written above, consists of inventing a scheme of “time”-dependent SU(3) gaugefield matrices, and pseudo-fermion fields, so that the “time” averages of these variables reproduce the ensemble averages of observables made of gauge fields and fermions in the original QCD lattice action.
130
Lattice fermions and chiral symmetry
The molecular-dynamics scheme is as follows. Let the gluon link variables Uµ (n) have a “time” dependence τ and let there be boson fields (“pseudofermions”) whose evolution with “time” will generate the fermion determinant. One Lagrangian that will accomplish this reads † † 1 ˙† L = −S0 (U ) + Uµ (n) Pˆ U˙ µ (n) + φi φi φ˙ i [A† A]i j φ˙ j − ω2 2 n,µ ij i (5.143) / (U ) + m]i j is the lattice Dirac operator introduced earlier and where Ai j = [D Pˆ is a projection operator diag(1, 1, 0) which picks out independent variables in the SU(3) matrix U˙ µ . L consists of kinetic-energy terms both for the gauge field and for the “pseudo-fermion” field as well as potential terms for both. Each kinetic-energy term has special features. In the gauge-field piece we must include the projector Pˆ to account for the fact that one cannot treat each element in the 3×3 matrix U as independent because of the constraints imposed by its SU(3) character. One has a great deal of freedom in the parametrization of these matrices as well as freedom in how the constraints are treated. In the pseudo-fermion piece of the kinetic energy, we have placed the positive operator A† A in the role one might have expected a “mass-squared” term. We shall see that the appearance of A† A here will generate the fermion determinant in the numerator of the partition function. Finally, note that L is local because the Dirac operator couples only nearest-neighbor sites. It is useful to construct this system’s Hamiltonian in order to discuss its molecular dynamics and thermodynamics. We have momenta canonically conjugate to the gauge and pseudo-fermion degrees of freedom, pµ = U˙ µ (n),
Pi = [φ˙ † A† A]i
The Hamiltonian then reads † 1 2 † † −1 pi + Pi (A A)i j P j + S0 (U ) + ω2 φi φi H= 2 i i i
(5.144)
(5.145)
and the Hamiltonian equations of motion are d † ˙ = −ω2 φ P˙ † = (A (U )A(U )φ) dτ ∂ ∂ p˙ = U¨ = − † S0 (U ) + φ˙ † (A† (U )A(U ))φ˙ ∂U ∂U †
(5.146) (5.147)
These equations are meant to give the reader an idea of the moleculardynamics equations for this system. In order to implement them numerically, a convenient parametrization for the U matrices must be chosen and one must
5.9 The microcanonical ensemble
131
be careful that the special unitary character of the matrices is preserved by the evolution with “time”. These matters of detail are nicely covered in the literature, but the point to be stressed here is that the Hamilton equations constitute a tractable (well, tractable for a computer) set of ordinary differential equations for which there are well-studied algorithms (such as second-order Runge–Kutta, or conjugate gradient, algorithms) that can solve them with controlled errors. The fermions do introduce an added complication that makes this system of coupled equations more difficult to simulate than a standard molecular-dynamics system that consists of classical, commuting variables. In particular, we see that, to advance the system of equations forward in “time” by one step, one must solve a linear set of equations for φ˙ of the form A† (U )A(U )φ˙ = source. Since A is a sparse matrix, this can be done efficiently by standard algorithms, such as the conjugate-gradient method. The number of operations in such an algorithm grows proportionally to the number of degrees of freedom on the finite lattice, so the computer time needed for the full simulation scales with the size of the system in a conventional fashion and it is practical, although demanding, to simulate large lattices of QCD. Computer timing and issues of the best algorithm etc. all change with hardware and software issues and the interested reader should consult the most up-to-date research articles to appreciate the state of the art. Our last task is to check that the Lagrangian L invented above actually gives the original path integral. The canonical ensemble based on the Hamiltonian above reads Z = DU D p Dφ Dφ † DP DP † exp(−H/T ) (5.148) All the integrals except U enter H quadratically, so the integrals can be done, Z ∝ DU det2 A(U ) exp(−S0 (U )/T ) (5.149) which is the required answer, except for the second, rather than the first, power of the fermion determinant. This can be cured by realizing that A† (U )A(U ) does not couple nearest-neighbor φ fields in the staggered-fermion method. One can think of the four-dimensional lattice of sites as two interleaving lattices of “even” and “odd” sites, and the field φi can be set to zero on one of the sublattices and the dynamics will leave it zero forever. When this is done, det2 A is replaced by det A, and we have our final, acceptable scheme. In retrospect we can appreciate the character of the tricks that allowed a purely bosonic system of equations to simulate a fermionic field theory, complete with the exclusion principle and all of the subtle correlations that are implied. The pseudo-fermion kinetic-energy term in L is 12 mv 2 with m ∝ A† (U )A(U ). When the Hamiltonian is constructed, we have p 2 /(2m) and
132
Lattice fermions and chiral symmetry
the resulting (A† (U )A(U ))−1 here is responsible for the positive power of det(A† (U )A(U )) = det2 A(U ) in the partition function. A crucial feature of this scheme is how it avoids the full, nonlocal character of the fermion determinant. The full fermion determinant is built up in the molecular-dynamics algorithm wherein, for each “time” step, the linear system A† (U )A(U )φ˙ = source ˙ This step is a local operation since A† (U )A(U ) couples only is solved for φ. nearby degrees of freedom. Only by iterating this procedure over many “time” steps does one solve the molecular-dynamics equations and build up the full, nonlocal fermion determinant. Ths last ingredient in the algorithm is the calculation of the coupling constant β. This is done through the equipartition theorem, which states that each term in the system’s kinetic energy has an average value of 12 kT , 1 −1 ∗ 1 ˙† † β N = T = φ˙ A Aφ˙ + (5.150) U (n) Pˆ U˙ µ (n) 2 2 n,µ µ The numerical value of N ∗ depends on the particular parametrization of the U matrices. 5.10 Langevin and hybrid algorithms Molecular-dynamics methods have several drawbacks. The method simulates the microcanonical ensemble of given “energy” and it works only if the ergodic hypothesis applies. It is very inconvenient to have to read off the coupling constant of a simulation after it is complete rather than before. However, this is just an annoyance in comparison with the possibility that the dynamics might not sample all of available phase space and produce equilibrium expectation values. In computer simulations with a finite set of weakly interacting degrees of freedom, violations of the ergodic hypothesis are virtually guaranteed. Luckily the molecular-dynamics algorithm can be improved by learning some lessons from the Langevin equation, a stochastic differential equation that simulates the canonical ensemble. The molecular-dynamics method and the Langevin equation can be merged into a “hybrid” algorithm that will simulate lattice QCD with any number of quarks. First, let’s introduce the Langevin equation, which is famous from Brownian motion, in a simple context of a single Bose variable q. We want to calculate an expectation value, 1 F(q) = dq F(q) exp(−S(q)) (5.151) Z In the microcanonical/molecular-dynamics approach, we have seen that this system would be given by the dynamics, q¨ = −∂ S(q)/∂q
(5.152)
5.10 Langevin and hybrid orbitals and expectation values would be replaced by “time” averages, 1 T F(q) = lim F(q(τ )) dτ T →∞ T 0
133
(5.153)
By contrast, in the Langevin approach the system is subject to external, explicit white noise, q˙ = −∂ S(q)/∂q + η(τ ) η(τ )η(τ ) = 2δ(τ − τ )
(5.154)
and average quantities are calculated using “time” averaging as above. The stochastic differential equation of Langevin causes q(τ ) to execute a forced random walk such that it covers phase space with the Boltzmann weight exp(−S(q)) dq. The molecular-dynamics equations and the Langevin equations look rather contrary, but the fact that there is an intimate relation between them is best shown by replacing each differential equation by a discrete difference equation, as a computer programmer would do. The Langevin equation becomes, denoting χn as noise, qn+1 = qn + χn − 12 2 S (qn ) χn χn = δnn τn+1 − τn = 12 2
(5.155)
whereas the molecular-dynamics equations read qn+1 = 2qn − qn−1 − 2 S (qn ) τn+1 − τn =
(5.156)
which can be written more suggestively as qn+1 = qn + 12 (qn+1 − qn ) − 12 2 S (qn )
(5.157)
So we see that “noise” in the Langevin scheme corresponds to “velocity” in the molecular-dynamics scheme. The “time” differences correspond as τn+1 − τn = 12 2 in the Langevin scheme and τn+1 −τn = in the molecular-dynamics scheme. These two schemes have the following features: 1. The Langevin equation has explicit noise, so it is ergodic by construction. Unfortunately, it samples phase space very slowly in many cases because q(τ ) executes √ a forced random walk, which, if the noise dominates, fills space at a rate ∝ N , where N is the number of “time” steps. 2. Molecular dynamics uses the classical equations of motion,
134
Lattice fermions and chiral symmetry
so it is as efficient as possible at probing the important regions of phase space locally. Also, its “time” step is large, so it moves along its trajectories rapidly. However, it need not be ergodic. For example, for long “times” the algorithm may become trapped in small regions of phase space and be unable to explore the entire energy shell. By comparison, the Langevin method could experience less of a problem in a similar situation because its explicit noise term “kicks” it through phase space. These considerations, qualitative as they are, suggested to early workers in the field of lattice simulations of fermionic problems that a “hybrid” method combining the best features of each algorithm might be worthwhile [38]. In the hybrid scheme, one states that a time step will be either Langevin (with probability p) or molecular dynamic (with probability 1 − p): qn+1 = qn + vn − 12 2 S (qn )
(5.158)
vn = (qn+1 − qn−1 )/(2), with probability 1 − p with probability p vn = χn ,
(5.159) (5.160)
τn+1 = τn +
(5.161)
with
One can optimize this algorithm by choosing the best probability per unit “time” to “refresh” it, i.e. replace the velocity with a random number. Experience has shown that p should be set to approximately twice the frequency of the “slowest mode” in the system. The idea here is that the fields in the system can be characterized by their variations on the four-dimensional spacetime lattice. Modes, with the slowest variations limit the rate at which the system can change and move through phase space. If one “drives” the system with noise at just this frequency, then one is encouraging the slowest modes, which hold back the system, to change efficiently. This is the best one can do within this framework. When these ideas are implemented in lattice-gauge theory, one begins with the Lagrangian, † 1 ˙† L = Uµ (n) Pˆ U˙ µ (n) + φ˙ i [A† A]i j φ˙ j 2 n,µ ij † − ω2 φi φi − β (tr UUUU + h.c.) (5.162) i
Notice that the “velocitites” U˙ and φ˙ appear only in quadratic terms, so, at a chosen “time” step in the evolution of the molecular-dynamics equations, one can replace the U˙ fields, say, by a completely new field configuration chosen from the Boltzmann distribution, exp(− 12 U˙ µ† (n) Pˆ U˙ µ (n)). A similar procedure
5.10 Langevin and hybrid orbitals
135
can be followed for φ˙ but a row in the inverse of the matrix A† A will be required in order to follow standard formulas and one must choose φ˙ in the Boltzmann † distribution exp(−φ˙ i [A† A]i j φ˙ j ). Finally, since even the pseudo-fermion field φ appears only as a quadratic form in L, it can be “refreshed” according to the Boltzmann distribution exp(−ω2 φ † φ). There are variations on this early hybrid algorithm that are very useful. In one case, the pseudo-fermion field is replace by noise. We can see that this is possible because φ enters L just as a quadratic form. The advantage of this algorithm is its apparently greater efficiency as well as the possibility of tuning and treating the number of flavors as a continuous parameter [38]. This is important because in QCD applications we want to simulate two almost massless “up” and “down” quarks, a heavier “strange” quark, etc. This noisy fermion algorithm can effectively treat the power of the fermion determinant in the path integral as a free parameter. This algorithm plays an important role in the chemical-potential simulations, as we will discuss later. Another very important variation is the so-called hybrid Monte Carlo algorithm [39]. Instead of just arbitrarily “refreshing” the variables in the moleculardynamics evolution from “time” to “time,” this algorithm executes a Monte Carlo step with a “guidance” Hamiltonian that has an accept–reject criterion. A consequence of this procedure is that the algorithm now becomes exact, like a traditional Monte Carlo algorithm. This means that the “time”-step errors found on discretizing the molecular-dynamics differential equations are completely removed. Of course, large “time” steps will not work well here because the probability of rejection of the Monte Carlo step will approach unity in that case and the algorithm will not push the system through phase space efficiently. In addition, this algorithm cannot take fractional powers of the fermion determinant because it cannot use noisy pseudo-fermion fields. However, if this algorithm can simulate a model of interest, its lack of “time”-step errors affords it clear advantages.
6 The Hamiltonian version of lattice-gauge theory
6.1 Continuous time and discrete space Another form of lattice-gauge theory that is very intuitively appealing is the theory’s Hamiltonian form. In many problems it is very instructive to make energy estimates, look at wavefunctions, and estimate the masses, sizes, and shapes of bound states etc. that a quantum-mechanical formulation with a Hamiltonian gives. Sometimes these simple things are hard to extract from a path-integral formulation. Mechanisms of confinement, flux-tube dynamics, and chiral-symmetry breaking are also amenable to Hamiltonian analysis. Deconfinement at finite T and transitions as functions of chemical potential will also be discussed in this formulation below. Since there is much more pioneering physics to be done in QCD in extreme environments, it is important to have many approaches available. In the Hamiltonian formulation one uses a spatial lattice and leaves the time variable continuous, as we have illustrated in several 1 + 1 examples above. The Hamiltonian is the generator of time evolution and acts on quantized states. The Hamiltonian could be obtained methodically from the path-integral formulation by calculating the system’s transfer matrix and then taking the time continuum limit [9]. It is more instructive, however, to construct the Hamiltonian from scratch. In fact, we can think about just two spatial points, write the Hamiltonian for that system, and then consider a full three-dimensional spatial lattice. As in the Euclidean formulation, the key to the construction is the requirement that local color-gauge invariance be an exact symmetry for any lattice spacing a. Let there be two sites with the link between them. The fermion field will live on the sites i = 1, 2 and it will be denoted ψα (i), where α is a color index. We will leave the color index implicit in many cases where its use is clear from the context. A local color transformation is implemented through an SUc (3) rotation 136
6.1 Continuous time and discrete space
137
matrix V (i) in the fundamental representation ψ(i) → V (i)ψ(i)
(6.1)
Therefore, we can think of independent color frames of reference at each site i. Each term in the Hamiltonian H must be invariant with respect to independent rotations of each frame. Clearly a single-site mass term satisfies this criterion trivially, m ψ † (i)ψ(i) (6.2) i
However, the construction of a discrete form of the kinetic energy is not so simple because the “obvious” hopping Hamiltonian, Hq = i[ψ † (2)ψ(1) − ψ † (2)ψ(1)]
(tentative)
(6.3)
is not locally gauge-invariant. Following Yang and Mills, we introduce an SUc (3) rotation matrix that relates the color frames at sites 1 and 2. In the fundamental representation it reads α λ (6.4) U3 = exp i Bα 2 where Bα (α = 1, 2, 3, . . . , 8) is an angular variable. The geometrical meaning of U3 determines its color-transformation law, U → V (1)U V (2)−1
(6.5)
Using U we can write the hopping Hamiltonian with local color symmetry, Hq = i[ψ † (2)U ψ(1) − ψ † (2)U −1 ψ(1)]
(6.6)
The U matrices are essential ingredients here, so let’s study them in isolation first. These degrees of freedom relate adjacent color frames, so they occupy the links between frames. The color reference frames at each link can be rotated in color space and, to describe the invariance of the theory with respect to local color-gauge symmetry, we need the generators E α (i), α = 1, 2, 3, . . . , 8, for these rotations at each site i. They are determined by their commutation relations, [E α (i), E β ( j)] = i f αβγ E γ ( j)δi j
(6.7)
where f αβγ are the structure constants of SU(3). Later we will relate the generators E α (i) to non-Abelian electric flux, which will play an important role in
138
The Hamiltonian version
the dynamics of pure gauge fields. However, our emphasis here is on geometrical features of gauge fields. In particular, the transformation law of U under rotations V of the color reference frames gives us the commutation relations of E α (i) with U . Consider an infinitesimal local color rotation written to first order in the generators, i α ( j) E α ( j) (6.8) 2 where α ( j) is an infinitesimal color vector of generalized angles. Substituting this expansion into the transformation relation for U gives V ( j) ≈ 1 +
λα λα [E α (2), U3 ] = −U3 U3 , (6.9) 2 2 Since E α (i) is a color vector and since the U operator relates adjacent color frames of reference, we can show that E α (1) and E α (2) are simply related, [E α (1), U3 ] =
E(2) = −U8 E(1)
(6.10)
where U8 means that U has been expressed in the eight-dimensional adjoint representation of SU(3). This result follows from the relations for commutation of E α (i) with U3 via a standard exercise in Clebsch–Gordon coeffients, making an 8 from the product of a 3¯ and a 3. The important point about this result is that it means that color electric flux maintains its magnitude across links, E(2)2 = E(1)2
(6.11)
Now consider the space of states of the gauge field. There is an SU(3) group at each site, so we must label the states by their irreducible representations. In SU(3) each multiplet is labeled by two invariants, a quadratic and a cubic Casimir operator, C2 and C3 . In addition, within each multiplet we need two additional quantum numbers to label a state uniquely and we will do that with a two-dimensional vector m. (All this should be familiar from ordinary particle physics, in which we assign particles to a representation of SUflavor (3), perhaps an octet or decuplet, and then label the particular particle by its third component of isospin I3 and its hypercharge Y .) A state in our world of two sites would be written 2
|m(i); C2 (i), C3 (i)
(6.12)
i=1
However, the quantum numbers on one site are not independent of those on the other. In particular, since the quadratic Casimir operator is just E 2 and we noted that E(1)2 = E(2)2 , it must be that C2 (1) = C2 (2)
(6.13)
6.1 Continuous time and discrete space
139
In addition, C3 is a second invariant constructed from the group generators, so C3 (1) = C3 (2)
(6.14)
Now we have enough preliminaries to construct the space of states explicitly. We begin with the ground state |0 which should be annihilated by the generators, E(1)|0 = E(2)|0 = 0
(6.15)
To make states in higher multiplets, we apply matrix representations of the ¯ 3) representation of SU(3) × SU(3), U matrix. In particular, to fill out the (3, consider (U3 )lk |0
(6.16)
To check this claim, measure the square of the color electric flux in this state, using the transformation laws given above, (6.17) E 2 (1)U3 |0 = 12 λα 12 λα U3 |0 = C2U3 |0 where C2 is the value of the quadratic Casimir operator in the fundamental 3 representation, C2 = 43 . Before developing more aspects of the lattice theory, let’s predict some of its continuum correspondences. We did this exercise in detail for the Euclidean four-dimensional version of the theory, so the results should not be surprising. First, the generators of local color rotations are related to the Yang–Mills fieldstrength tensor, a 2 0k Eα ( F ( r ) · nˆ = r )n k g α
(6.18)
where nˆ is a unit vector pointing between lattice sites, Fα0k is the Yang–Mills field strength, and g is the coupling constant. The rotation matrix is related to the vector potential Aα ( r ), λα (6.19) r , n) ˆ = exp iag Aα ( r ) · nˆ U3 ( 2 if we choose the class of temporal gauges, A0α ( r ) = 0. It is interesting that constructions of this sort pre-date lattice-gauge theory and even Yang–Mills theory by many years. As discussed in the context of pathintegrals above, when J. Schwinger was first formulating QED he considered point-split fermion currents like
¯ r + )γµ γ5 e(ig A(r ) · ) ψ( lim ψ( r)
→0
(6.20)
140
The Hamiltonian version
By considering local gauge transformations, Schwinger showed that this point-split fermion bilinear is gauge-invariant even for nonzero . This proved essential in Schwinger’s discussions of the axial anomaly, the lack of conservation of the axial-vector current of QED due to quantum fluctuations. In terms closer to our present discussion, we can check that Schwinger’s construction satisfies Gauss’ law and respects the fact that a positively charged fermion is the source of one unit of electric flux and that a negatively charged fermion is the sink of that flux. Consider the state
|s = eig A · |0
(6.21)
˙ in the temporal gauge (A = 0), the exponential exp(ig A · ) is Since E = A 0 We compute a shift operator for E. = g s| E|s
(6.22)
The close analogy of these results to our equations above involving the colorgroup generator E( j) and the U matrix will prove important. Now let’s put quark degrees of freedom onto the two-site world. The quarks reside in the fundamental representation of the color group, so they transform under color rotations as ψ( j) → V ( j)ψ( j)
(6.23)
For an infinitesimal rotation, we again write i V ( j) ≈ 1 + α G fα + · · · 2
(6.24)
where G fα is the fermionic piece of the generator. The infinitesimal form of the rotation equation reads
δψ = G fα , ψ =
i α λα ψ 2
(6.25)
which indicates, after some standard textbook algebra in second quantization, that the fermion piece of the generator at site i is G f (i) = ψ † (i)
λα ψ(i) 2
(6.26)
This means that the total generator at the site i is G α = ψ † (i)
λα ψ(i) + E α (i) 2
(6.27)
6.1 Continuous time and discrete space
141
Local color-gauge invariance can now be stated in a precise, convenient form. A physical state should be invariant with respect to local color rotations and therefore must be annihilated by G α , G α |0 = 0
(6.28)
It is instructive to make some gauge-invariant states and interpret them physically. The vacuum is locally invariant with respect to color rotations and is annihilated by ψ(i)− , ψ † (i)− , and E α (i). The state of a quark and an anti-quark at site i is gauge-invariant if the color indices are contracted into a singlet, † ψ j (i)ψ j (i)|0 ≡ ψ † (i)ψ(i)|0 (6.29) j
However, if we place a quark on one site and an anti-quark on a neighboring site, then gauge invariance requires the presence of a U3 operator between them, † † ψ j (1)Ui(3) (6.30) j ψ j (2)|0 ≡ ψ (1)U3 ψ(2)|0 j
where the U operator enforces Gauss’ law and places one unit of electric flux between the fermion and the anti-fermion. Note that the state also has all of its color indices locally contracted into color singlets, so it is clearly invariant under local color rotations. Our last task in the context of the two-site model is the construction of a gauge-invariant Hamiltonian that will generalize to a full three-dimensional, infinite spatial lattice. We have discussed fermion-mass and hopping terms. We will need to elevate the gauge field to the status of a dynamical variable. Since E α (i) is the gerator of color-frame rotations and since its square, the quadratic Casimir operator for the gauge
variables, is locally gauge-invariant, a sensible candidate term should involve iα E α (i)E α (i). We must pick the appropriate prefactor here so that the Hamiltonian reduces to the known result when the lattice spacing is taken to zero for smooth fields. The result is H=
i g2 2 E (i) + [ψ † (1)U ψ(2) − ψ † (2)U † ψ(1)] + m ψ † (i)ψ(i) 2a i a i (6.31)
This is as far as we can go with two sites. H is clearly missing magnetic effects in the gauge field. Recall from our discussion of the Euclidean version of the theory that these effects come from plaquettes, the product of U matrices around the smallest loops on the lattice. To get the full H we turn to an infinite, three-dimensional lattice. Label sites with a triplet of integers, r = (n 1 , n 2 , n 3 ) and label directed links with r and a unit vector pointing toward one of the six
142
The Hamiltonian version
nearest-neighbor sites. The fermion fields will reside on the sites, ψ( r ). The electric-flux variable, E α ( r ) · n, ˆ will also reside on sites but will point along the various links. The gauge fields U ( r , n) ˆ will live on links. Most of the algebra we did on our two-site world generalizes to the three-dimensional spatial lattice in the obvious fashion. The generator of color rotations now has a single-site fermion term and six contributions from the electric-flux variables on all the links attached to that site. The condition that physical states be locally colorgauge-invariant reads as before, G α |0 = 0
(6.32)
with r ) = ψ † ( r) G α (
λα E α ( r ) · nˆ ψ( r) + 2 nˆ
(6.33)
It is instructive to consider some physical states made by applying gaugeinvariant operators to the vacuum. First consider pure gauge-field examples. We can multiply strings of U matrices together, but, in order to make only gaugeinvariant operators, color indices must be contracted locally. In other words, let be a closed path of links and label the links with increasing integers. Then an operator of the form U () = tr U (1)U (2)U (3) . . . U (n)
(6.34)
is locally gauge-invariant and makes a closed string of electric flux when it operates on the vacuum. The shortest closed path on the lattice consists of the square of four links and, as in the Euclidean version of lattice-gauge theory, these quantities will be used to make the “magnetic” terms in the Hamiltonian. The energy of a closed, nonintersecting loop of n links is proportional to the product of the electric flux on each of the links with n, the number of links. Since the electric flux is quantized according to the possible values of the quadratic Casimir operator of SU(3), [E( r ) · n)] ˆ 2 = 0,
4 10 16 , 3, , ,... 3 3 3
(6.35)
the energy per length of such electric flux tubes is quantized. Next consider states that contain fermions in addition to gauge-field excitations. A meson state might be ¯ τ ψ(i)|0 (6.36) ψ¯ k (i)γ τ ψk (i)|0 ≡ ψ(i)γ k
where the color indices k are summed locally into color singlets. The γ and τ matrices are short for an appropriate Dirac matrix and a flavor matrix that acts on
6.1 Continuous time and discrete space
143
the flavor indices of the fermion field operator that we have left implicit. Baryon states can also be made using fermion operators on a single site, † † klm ψk (i)ψl (i)ψm† (i)|0 (6.37) klm
where the symbol constructs a color singlet from three indices. Excited meson and baryon states can be made using operators in which fermion fields are at different sites and strings of U are inserted to satisfy Gauss’ law. Now we should write down the Hamiltonian for the three-dimensional lattice system. It reads H=
1 g2 [ E α ( r ) · n] ˆ 2− 2 (tr UUUU + h.c.) 2a r,nˆ g a plaquettes 1 [(−1) y χ † ( r )U ( r , n z )χ ( r + n z ) + (−1)z χ † ( r )U ( r , n x )χ ( r + nx ) + 2a r r )U ( r , n y )χ ( r + n y ) + h.c.] + (−1)x χ † (
(6.38)
where we have not included a bare-fermion-mass term. The elaborate fermionhopping term contains the phases of Kogut–Susskind (staggered) fermions in three dimensions [40]. We recorded the Euclidean version of these phases above. With these phases the Dirac equation for a massless iso-doublet is retrieved in the continuum limit. The relative strength of the electric and magnetic gauge terms in the first line of the equation also guarantees that the appropriate, conventional continuum Hamiltonian of QCD will be obtained. The verification of these points follows our earlier demonstration for the Euclidean formulation and that algebra will not be repeated here. The important point, however, is that, for strong coupling on a coarse lattice, the terms are clearly ranked in order of importance in a strong-coupling expansion in terms of the variable x = 1/g 2 . We write H in dimensionless form, W =
2a H = We + x Wq + 2x 2 Wm g2
where We =
[ Eα ( r ) · n] ˆ 2
(6.39)
(6.40)
r,nˆ
Wq =
[(−1) y χ † ( r )U ( r , n z )χ ( r + nz ) + · · ·
(6.41)
r
Wm = −
plaquettes
(tr UUUU + h.c.)
(6.42)
144
The Hamiltonian version
This Hamiltonian has been used for QCD spectroscopy calculations as well as formal developments in the field, such as large-N expansions and mean-field approximations. Let’s gain some familiarity with this Hamiltonian. Suppose that a, the spatial lattice spacing, is fixed and the coupling constant g is large. Suppose also that the dynamical Fermi fields χ are absent. In other words, we are working in the “quenched” approximation in which all the dynamics resides in the gauge fields which provide an environment in which “external” fermions can propagate. Then the static term, We , dominates the energetics. If we place a static source of one unit of electric flux at a site r, then one unit of electric flux must emanate from it, which means that a U ( r , n) ˆ must occupy a link radiating from site r. However, this operator has an open color index associated with the nearest-neighbor site r + n, ˆ so it must be connected to another U ( r + n, ˆ m), ˆ etc. 2 Each U operator costs an energy [g /(2a)]C2 (3) and there is an infinite number of links in the string attached to the color source. More simply, the fixed source gives rise to a flux tube of infinite extent and infinite energy (linearly divergent). A simple generalization from this exercise is that the only finite-energy states in the theory will be color singlets. To illustrate the importance of the magnetic term in the Hamiltonian, consider a state with a source of color in the 3 representation at one site and a sink of color in the 3¯ representation at another site. A gauge-invariant operator making such a state would have a string of U operators extending from the source to the sink. In other words, a straight thin flux tube holds the source and sink together by a linearly confining potential. Now treat the effects of the magnetic piece of the Hamiltonian in ordinary perturbation theory in the expansion parameter x = 1/g 2 . According to Schr¨odinger perturbation theory, the perturbation will act on the unperturbed state to produce a correction. In particular, the magnetic term of the gauge-field action can be applied to the unperturbed string state with one of the links in Wm hitting the same link on which there is a U from the unperturbed state. It is clear that the result is a “fluctuating string.” In addition, the quarkhopping term can act as a perturbation and, if it acts on a link on which there is a U from the original state, the hopping term can annihilate that bit of string and place a quark and an anti-quark down so that the original string is broken. This effect can also be interpreted as “screening” because the perturbation has now produced extended color-singlet “meson” states that do not experience a long-range confining potential. This calculation illustrates a general effect: when dynamical quarks are included in calculations, they screen the long-range linear potential, leaving just color-singlet states and shorter-range interactions. Systematic calculations of light- and heavy-quark spectroscopy, the heavyquark potential, electric-flux distributions, etc. have been done using these methods carried to much higher order in a strong-coupling expansion. Some illustrations will be included below.
6.2 Quark confinement
145
6.2 Quark confinement in Hamiltonian lattice-gauge theory and thin strings One of the goals of lattice-gauge theory is the development of a better understanding of quark confinement. We have considered it briefly above and will return to it several times again in the context of extreme environments. Although computer simulations of SU(3) and simpler lattice-gauge theories provide good evidence for flux-tube formation and confinement, a simple continuum understanding of the phenomenon is lacking. As we have discussed, quark confinement is easy to understand on a coarse, strongly coupled lattice. Of course, such a structure breaks our cherished spacetime symmetries, so the significance of such a result is suspect. Nonetheless, we will discuss spacetime symmetries and their restoration in the weak-coupling limit of lattice-gauge theory, and gain more insight into flux-tube dynamics. ¯ created Begin with the on-axis heavy-quark potential. We place a Q and a Q, † by operators ψ ( n ) and ψ( n + R), at the sites n and R with a string of U matrices between them, 4 5 † n) U ψ( n + R) (6.43) ψ ( path
but is otherwise arbitrary. where the “path” extends from n to n + R, It is easy to calculate the energy of such states for strong coupling, g 2 1. When g 2 1, the leading term in H is Ho =
g2 2 El 2a l
(6.44)
¯ pair, must minimize Ho , so each The vacuum, into which we will place the QQ of its links must be a color-singlet state, Elα |0 = 0
(6.45)
¯ So the QQ state of minimal energy should have the fewest excited links in its path U term. Each excited link will have the energy, in SU(2) gauge theory, E 2U |0 = 12 τ α · 12 τ α U |0 = 34 U |0
(6.46)
which one proves from the SU(2) commutation rule, [E α , U ] = 12 τ α U
(6.47)
¯ lie along an axis with R/a links between them, the energy of So, if the Q and Q their state is g2 3 R √ · · = σR V (R) = (6.48) 2a 4 a
146
The Hamiltonian version
which gives us the strong-coupling limit of the string tension in this Hamiltonian formulation of lattice-gauge theory, √
σ =
3 g2 8 a2
(6.49)
This formula illustrates the renormalization program of lattice-gauge theory in a very primitive fashion. The string tension is a physical quantity that is independent of the cutoff a. However, this formula for the string tension will be independent of a only if g ∼ a: if the string tension were computed on two different lattices with different spacings but the same H , then the coupling on the coarser lattice would have to be larger and proportional to a, in order that σ would be independent of the cutoff. This characterizes confinement without screening: the coupling grows without bound at large distances in order to guarantee that only color-singlet states have finite, bounded energies. This strong coupling behavior, g ∼ a, would have to match with asymptotic freedom, g ∼ 1/ ln a, on a fine lattice so that the physical, continuum theory resides in a single phase. The continuum limit of the Hamiltonian lattice theory would then experience a single phase: confinement at large distances and asymptotic freedom and a parton description at short distances.
6.3 Relativistic thin strings, delocalization, and Casimir forces How does confinement emerge from asymptotic freedom at short distances? How does the flux law change over length scales, so that at small length scales the flux spreads out into the familiar three-dimensional pattern, while at large distances it is squeezed into a thin flux tube? Is a thin tube really a possible configuration of flux? To gain some insight into these longstanding questions, it is instructive to consider a very simple model: a thin structureless string in continuum, relativistic spacetime. Pin down the ends of the string at positions x = − 12 R and x = 12 R. What are the qualitative characteristics of the spatial distribution the string can assume? Will it resemble a “straight flux tube” that could be responsible for confinement through a linearly rising heavy-quark potential? We will see that, as R grows large, translation invariance implies that the string wanders off the axis between its pinned ends and that its wandering transverse to the axis grows without bound at the rate ln R. Let the string be described by a two-component vector field, which gives its location off the axis between its ends, ξ (t, z) for z in the range, − 12 R ≤ z ≤ 12 R. The string is pinned at its ends, as shown in Fig. 6.1, ξ (t, − 12 R) = ξ (t, 12 R) = 0. We want an effective action to describe the low-frequency modes of this string. Make the following assumptions about it.
6.3 Relativistic thin strings
−R/2
ξ(t, z)
147
+R/2
Figure 6.1. The fluctuating thin string with pinned ends.
1. Seff must be local, so it can be written as an integral over a density L, which can depend on ξ (t, z) and a finite number of its derivatives at (t, z), Seff = dz dt L. 2. L itself should be invariant with respect to the following spacetime symmetries: (a) Poincar´e transformations in the (t, z) plane, and (b) O(2) rotations and translations of the string ξ . Now we can write down the terms that can enter the effective Lagranian L. We are interested only in terms that can contribute to its low-energy behavior. Property 2(b) precludes a “mass” term that would be proportional to ξ 2 . So L must be made up of derivatives, ∂µ ξ and ∂µ ∂ν ξ etc., where the derivatives ∂µ refer to t and z, Seff = dz dt [∂µ ξ · ∂ µ ξ + b ∂µ ∂ µ ξ · ∂ν ∂ ν ξ + c(∂µ ξ · ∂ µ ξ )2 + · · ·] (6.50) where the integral extends over the range − 12 R ≤ z ≤ 12 R and all t. If we are interested only in the low-momentum properties of the string, on the scale of an ultraviolet cutoff a, then only the first term in the expression for Seff is significant. The other terms in the expression are irrelevant, i.e. the parameters b and c have dimensions of length to various positive powers, which will become suppression factors by powers of the ultraviolet cutoff in applications. In condensed-matter physics this is the character of a “spin-wave” expansion. So, for studying phenomena with characteristic wavelengths λ a, Seff is well approximated by the action of two massless fields in two dimensions, Seff = dz dt ∂µ ξ · ∂ µ ξ (6.51) We know that massless modes in two dimensions have severe infrared problems that will make some features of the string sensitive to its length R. For example, the string cannot be straight, because this would mean that ξ (z, t) = 0 with small fluctuations. In fact, using Seff it is easy to calculate the variance in
148
The Hamiltonian version
ξ (z, t) for − 12 R ≤ z ≤ 12 R and find that it diverges as the string becomes longer and longer, 2 dk 2 ξ (z, t) ∼ ∼ ln(R/a) (6.52) k2 where the integral over wavelengths is cut off by R in the infrared and a in the ultraviolet. For a physical fluctuating string, its thickness would replace the ultraviolet cutoff a. Since the transverse fluctuations in the position of the string diverge as R grows large, the string is “delocalized.” This calculation is an example of the Mermin–Wagner theorem that we discussed above in the context of two-dimensional spin systems. If ξ (z, t) = 0 with small fluctuations, then translation symmetry would be broken. However, the theorem states that continuous global symmetries cannot break down spontaneously in two dimensions in a theory with only local couplings. We learn from this that, although the flux tube can maintain a thin intrinsic thickness, it must “snake” out into the transverse plane in an unbounded fashion, such that it eventually loses memory of its boundary conditions at ± 12 R. A second physical effect of these massless modes, capillary waves in the language of interfaces in condensed-matter physics, is the existence of a universal 1/R potential in the string channel of the theory [41], V (R) =
√
σR+c−
(d − 2)π/24 + ··· R
(6.53)
The 1/R potential here is a universal one-dimensional Casimir effect and will be derived below. It is a Casimir effect because it records the energy in the transverse fluctuations in the string of length R that depend on the boundary conditions, namely the fact that the string is pinned down at its ends. It is universal because its existence just depends on continuous translation symmetry in spacetime. In fact, its strength depends just on the dimensionality of spacetime; d − 2 counts the number of spatial dimensions transverse to the string. Let’s do the calculation following Casimir’s original method for obtaining his famous effect for a black box immersed in the electromagnetic vacuum. In this case each transverse mode contributes 12 –hω to the ground-state energy, V (R) =
1 n , 2 n
= π n/R,
n = 0, 1, 2, 3, . . .
(6.54)
where we have counted massless normal modes for a field with fixed boundary conditions, ξ (−R/2, t) = ξ (+R/2, t) = 0. The sum over normal modes diverges – the fluctuations shift the ground-state energy. There must be a term that is extensive (proportional to R) and will renormalize the string tension
6.3 Relativistic thin strings
149
additively, and we know from experience with the Casimir effect in threedimensional cavities that there will be other terms with weaker dependences on R that produce attraction and/or repulsion between elements of the boundary. To find and separate these effects, we must regulate the sum with a convergence factor e−αn and let α → 0 at the end of the calculation. In order to think physically about this procedure and give α a physical interpretation, suppose that the string has a small intrinsic thickness of a, a natural ultraviolet cutoff. Then the largest physically sensible value for n is ∼ R/a, the largest energy is ≤ π/a, and the minimal value for α is ∼ a/R. With this regularization procedure, the change in the heavy-quark potential becomes, π −αn π d −αn V (α, R) = ne =− e (6.55) 2R n 2R dα n π (n − 1)α n−2 1 π d = − (6.56) Bn =− 2R dα n eα − 1 2R n n! where Bn are the Bernoulli numbers. Only two terms survive in this result. There is the n = 0 term which is extensive and contributes to the string tension through an additive renormalization. (Since the minimum physical value of α is ∼ a/R, the n = 0 term is proportional to R/a 2 , as expected of the divergent vacuum energy of a scalar field in a 1 + 1 dimensional box of extent R.) The second term comes from n = 2, and is insensitive to α, the regulator, V (R) = −
π 1 π/24 B2 = − 2R 2 R
(6.57)
We obtain a term of this value for each transverse dimension, so, if there are d − 2 of them, the final result is V (R) = −
(d − 2)π/24 R
(6.58)
It is very interesting that results from the most accurate lattice-simulation studies of the heavy-quark potential confirm the presence of the universal 1/R term of the simplest bosonic string model. Simulations in three and four Euclidean dimensions require the d − 2 counting factor as well. The 1/R effect is evident in the lattice data even for relatively “short” distances, of the order of half a fermi. It appears that the thickness of the confining flux tube is quite small, ∼ 13 fm, and the confining geometry is physically significant at separations ≥ 12 fm. These scales in the physics of confinement have not been assimilated into a first principles understanding of the dynamics of QCD at this time.
150
The Hamiltonian version
Figure 6.2. A kink and an anti-kink on a thin string. 6.4 Roughening and the restoration of spatial symmetries How does the delocalization of the string show up in lattice-gauge theory? For strong coupling, the cubic symmetry of the spatial lattice stabilizes the straight string and inhibits the transverse fluctuations that could delocalize it. There is a nonzero gap in the spectrum of the transverse fluctuations of the string. Consider an on-axis string consisting of N links. The string has an energy of [g 2 /(2a)]N in the standard Hamiltonian version of the theory. However, if we put a transverse wave into the string, we have an extra energy of 2 · [g 2 /(2a)] 34 , in the strong-coupling limit. The excited string has a “kink” and an “anti-kink”, as shown in Fig. 6.2. The strong-coupling energy of the single-kink state is m k = [g 2 /(2a)] 34 . The kink should be thought of as a particle – its wavefunction, energy, and momentum are localized and it can travel along the string. It spreads the transverse distribution of the electric flux of the string, but, for strong coupling, the transverse distribution is bounded because the wandering of the string is inhibited by the finite energy barrier, m k . However, as we consider weaker coupling g 2 and approach the continuum limit, m k will decrease. In fact, this mass gap will vanish at a nonzero value of the coupling, gR , before the continuum limit is reached. This critical coupling gR is called the “roughening” point, and has been studied very well in condensed-matter investigations into the physics of interfaces. In the context of Hamiltonian lattice-gauge theory, one can calculate the kink’s mass in strong-coupling perturbation theory and estimate the critical coupling gR at which m k vanishes. In such calculations, the magnetic terms in H smear out the kink state and reduce its energy. At the roughening point gR , there is no barrier to transverse wanderings of the on-axis flux tube. In fact, at this point, the transverse profile of the string widens like ∼ ln(R/a), as predicted by the continuum effective-Lagrangian model. All of this has been studied very well in lattice-gauge theory and in condensed-matter physics of interfaces. It is interesting to consider the roughening transition in the context of two-dimensional interfaces. We might be considering the three-dimensional Ising model at low T in the magnetized phase and choose boundary conditions to form an interface for which all the spins above the x–y plane are “up” and all those below the plane are “down.” We are interested in how the interface at z = 0 fluctuates as the temperature is increased. The fluctuations are frequently studied in the “solid-on-solid” approximation.
6.4 Roughening and restoration of symmetries
151
+
+ +
+
+
+ +
+
+
+ +
+ i∗
Figure 6.3. An interface configuration with an “overhang.” + + + +
+ + +
+ + +
+ +
+ i∗
Figure 6.4. An interface configuration of the “solid-on-solid” variety. One considers the flat, planar interface at low temperature T and neglects all bulk fluctuations that are not connected to the interface. One makes the simplifying assumption that one can ignore those rare fluctuations with overhangs, as shown in Fig. 6.3. More likely fluctuations are shown in Fig. 6.4, where we see that the height of each column of “down” spins above or below the x–y plane is a single-valued function. Call it n i ∗ , where i ∗ labels a dual site of the twodimensional x–y lattice. Since only broken bonds contribute to the action, the partition function of the interface is 4 Z= exp − |n i ∗ − n j ∗ | (6.59) T i ∗ j ∗ ni ∗ Considering sequential slices across the interface, this is clearly a natural lattice version of the thin string discussed above. It is interesting to ask how well this “solid-on-solid” model describes the real interface in the three-dimensional Ising model. This is an “experimental” question, so we turn to computer simulations. The simulations indicate that the roughening temperature TR is about half the bulk transition temperature Tc , the Curie point between bulk magnetization and bulk disorder. Since TR Tc , it is reasonable to ignore bulk fluctuations – the bulk Ising model is quite “cold” and ordered at TR and vacuum fluctuations are rare. In addition, for T ≈ TR ,
152
The Hamiltonian version
fluctuations of the surface are quite rare, with the most likely values of n i ∗ − n j ∗ , for nearest-neighbor dual sites, of 0, ±1. Therefore, the neglect of “overhangs” is a compelling approximation. This means that the solid-on-solid model is well approximated by the “discrete Gaussian” model which we found earlier in our discussion of the planar model, 4 Z= (6.60) exp − |n i ∗ − n j ∗ |2 T ∗ ∗ ni ∗ i j Since this model is dual to the planar model, we know all about interfaces with no additional work. What luck! In particular, the classic work of Kosterlitz and Thouless applies to teach us the following. 1. There are massless modes on the interface for T > TR ≈ 12 Tc . The interface is, therefore, delocalized for all T > TR , with transverse fluctuations in its position that diverge as ln L for an L × L interface. 2. The interface free energy, the analog of√the string tension in QCD, develops an essential singularity, exp(−B/ T − TR ) at the roughening temperature. We learn some general results from these exercises. The string sector of QCD has some very curious features. It does not have a mass gap simply because of the geometry of the string and the fact that spacetime is homogeneous. Although the underlying dynamics of QCD which generate linear confinement are rather special, we believe that there are several consequences, such as delocalization and the universal 1/R potential, that are simple, geometrical, and universal. The roughening transition has additional practical significance for latticegauge theory. For strong couplings above the roughening coupling gR , on-axis flux tubes are smooth, with unlikely fluctuations. At gR the string becomes rough and obtains some of the features of a thin string embedded in a homogeneous, continuum spacetime. Therefore, this sector of the theory and the dynamics of the string, in particular, decouple from the lattice with its cubic symmetry. The string begins to behave as if the underlying spacetime had full rotational and translational symmetry at the roughening point. It is interesting to see how this works out in Hamiltonian lattice-gauge theory. For the purposes of illustration consider (2 + 1)-dimensional SU(3) gauge ¯ be placed off axis as shown in Fig. 6.5. This general contheory. Let the Q and Q figuration will help us discuss the restoration of spatial symmetries. At strong coupling the flux must travel from the quark to the anti-quark by a route of minimal distance. The leading order, g 1, heavy-quark potential is then V (x, y) =
g2 3 (|x| + |y|) 2a 4
(6.61)
6.4 Roughening and restoration of symmetries _ Q
153
(x,y)
Q (0,0)
Figure 6.5. An off-axis flux configuration of minimal energy.
y
x
Figure 6.6. An equipotential surface of the heavy-quark potential at strong coupling. which has the equipotentials shown in Fig. 6.6. V (x, y) has the expected discrete rotational and translational symmetries inherited from the cubic lattice. The rotational symmetry of a continuum calculation has been distorted by the discrete lattice. It is interesting to see how this situation changes as fluctuations develop at weaker coupling and g approaches the roughening transition gR . Consider a
154
The Hamiltonian version H' = i
i
i+1
Figure 6.7. The strong-coupling perturbation effect on a “corner” in an off-axis flux configuration. Hamiltonian perturbation-theory description of this. To begin, we must write down the zeroth-order wavefunction in detail. We must find the path(s) of minimal length between the quark and the anti-quark. However, as soon as the heavy quarks are off axis, we see that there are many paths of equal, minimal length, (|x| + |y|)!/(|x|!|y|!) paths, in fact. Therefore, our supposedly simple perturbation-theory calculation becomes a tricky exercise in degenerate perturbation theory. To calculate the first-order correction to the heavy-quark potential V (x, y), we must diagonalize the perturbation in this subspace. The Hamiltonian reads H = Ho − xs H
(6.62)
where xs = 2/g 4 and Ho =
g2 2 El , 2a l
H =
(tr UUUU + h.c.)
(6.63)
plaquettes
What is the effect of H on paths of flux with corners, as shown in Fig. 6.5? Suppose that H is applied to a “corner” in the inital state, as occurs in Schr¨odinger perturbation theory. On the two links in Fig. 6.7 where the U matrices act, one U matrix describing the initial state and the other from the perturbation, a link in the plaquette operator, one must decompose the product into irreducible representations. Since 3 ⊗ 3¯ = 1 ⊕ 8, there is a singlet piece and the “flipped” flux path shown in Fig. 6.7 occurs with a weight of 13 . These are the important processes since they mix different members of the degenerate subspace. Recognize this as a problem in electronic conductivity – fermions hopping along a chain. To make this point quantitative and useful, make the definitions y link with flux → fermion at site i x link with flux → absence of a fermion at site i So, imagine a box of length L = |x| + |y| containing |y| fermions. It is clear from the perturbation-theory effects discussed above that the perturbation allows a “fermion” to hop one site to a nearest neighbor if that site were initially unoccupied. Therefore, the restriction of H to the degenerate subspace is equivalent
6.4 Roughening and restoration of symmetries
155
y
x
Figure 6.8. Equipotential surfaces of the heavy-quark potential at strong coupling. y
x
Figure 6.9. Equipotential surfaces of the heavy-quark potential at intermediate coupling.
156
The Hamiltonian version y
x
Figure 6.10. Equipotential surfaces of the heavy-quark potential at weak coupling. to the fermion hopping operator, 1 † a ai+1 + h.c., H = 3 i i
|y| =
†
ai ai
(6.64)
i
This one-dimensional hopping Hamiltonian is diagonalized by passing to momentum space, π ni 1 ai = , n = 1, 2, . . . , L sin an φn (i), φn (i) = √ L +1 2(L + 1) n (6.65) Then, H =
2 πn a † an cos 3 n L +1 n
(6.66)
For our application, the first |y| fermion levels are filled, so the heavy-quark potential is " # |y| 2 g2 3 πn (6.67) V (x, y) = cos (|x| + |y|) − xs 2a 4 3 n=1 L +1 =
g2 3 (|x| + |y|) 2a 4
6.4 Roughening and restoration of symmetries −
4 3g 4
sin
157
π(|x| + 1) π(|y| + 1) sin 2(|x| + |y| + 1) 2(|x| + |y| + 1) − 1 π sin 2(|x| + |y| + 1) (6.68)
It is interesting to plot the equipotentials of this V (x, y), especially since the formula is rather opaque(!). The equipotentials are shown in Figs. 6.8–6.10. The restoration of rotational symmetry is surprisingly compelling for such a short calculation. This calculation is sensible only for couplings stronger than gR because it assumes analyticity. gR turns out to be rather small in these gauge models [42]. Higher-order corrections can also be calculated and they do not destroy the good agreement of this low-order calculation. Computer simulations also show that restoration of rotational symmetry occurs. It is also amusing that V can be expanded for large L and power-law corrections to the linear potential can be found. These are nonuniversal because this calculation starts in the strongcoupling region where continuous spacetime symmetries are lost. Nonetheless, the fluctuating string of degenerate perturbation theory illustrates how powerlaw corrections to the linear potential occur because of the geometry of the confining string while local excitations in the bulk, such as glueballs, baryons, and mesons, all have considerable gaps in their spectra of states.
7 Phase transitions in lattice-gauge theory at high temperatures
7.1 Finite-temperature transitions at strong coupling When QCD is heated sufficiently it will undergo a phase transition from a state of colorless strongly interacting particles to a quark–gluon plasma. The properties of this phase transition and properties of the high-temperature plasma are the subjects of a great deal of modern research in QCD. We have seen that pure gauge-field lattice dynamics confines quarks at strong coupling. If we use the quenched approximation in which there are no internal quark loops, then external static quarks experience a linear confining potential and the only finite-energy states must be color singlets consisting of confined quarks and/or anti-quarks. It is not difficult to show that, if this theory is heated, it will experience a transition to a quark–gluon plasma [43]. The point is that, as the temperature is increased, thermal fluctuations will become large enough to melt the linear flux tube which had produced confinement. At high temperature the vacuum itself becomes a tangle of closed flux loops, bubbling and fluctuating. When a static quark is placed into this vacuum, the extra flux line emanating from it blends into a vacuum already rich in flux and the resulting state does not cost an energy that linearly diverges. In the hot gauge-field state, there is no barrier to generating more or less flux and complete screening of the external, static quark color can occur, no matter what representation of SU(3) it lies in. This physical picture has been confirmed in computer simulations of latticegauge theory of pure gauge fields and of full QCD. It has been confirmed in the scaling window of the continuum limit as well as for strong coupling, for which simple statistical-mechanics models indicate its correctness. To illustrate the argument with a minimum of analysis, consider the pure gauge theory based on the group U(1). This model is called “PQED,” periodic QED, and is especially easy to analyze. It confines at strong coupling, as we have shown above, because the periodicity of the U(1) gauge variable quantizes 158
7.1 Finite-temperature transitions
159
the model’s electric flux on each link. This was the crucial element in the demonstration of confinement at strong coupling for lattice-gauge theories, not the non-Abelian nature of the gauge group. The non-Abelian character of the gauge group is important in taking the continuum limit using asymptotic freedom and finding that the theory’s continuum limit is a unique confining one. The continuum limit of PQED is more problematical, but there are thought to be two distinct alternatives: (1) for weak lattice coupling the theory reduces to ordinary continuum QED in the continuum limit and has massless photons and no confining forces, and (2) for strong lattice coupling the periodicity of the U(1) variables is relevant and the theory contains massless photons and magnetic monopoles that lead to confinement. It may be that the monopoles survive the continuum limit and produce a strong-coupling theory that confines strongly interacting “electrons.” We are content here to consider the U(1) theory at strong coupling and show that it loses its confining nature at high temperature. In this case the gauge variables are just phases, U ( r , n) ˆ = exp (iφ( r , n)) ˆ
(7.1)
The generator of U (1) gauge rotations at sites, E( r , n), ˆ satisfies [φ( r , n), ˆ E( r , n)] ˆ =i
(7.2)
The periodicty of the gauge variables U implies that the spectrum of possible eigenvalues of the generator E( r , n) ˆ consists of the integers. At strong coupling the Hamiltonian is dominated by its electric-flux term, H=
g2 2 r , n) ˆ E ( 2a r,nˆ
(7.3)
and the theory confines fermions that carry charge. The partition function describing the thermodynamics at temperature T , β = 1/(kT ), is Z (β) =
exp(−β H )
(7.4)
and the only subtlety here is the sum over physical states . These states must satisfy Gauss’ law, as stated above, which means that, in the absence of matterfield sources (static charges) of flux, the sum of the generators on the links emanating from each site on the lattice must vanish, E( r , n)|0 ˆ =0 (7.5) nˆ
160
Phase transitions
where the sum goes over the six lattice directions from site r. In order to incorporate this constraint into the expression for the partition function, we exponentiate it, π 1 δ E( r , n) ˆ = exp iα E( r , n) ˆ (7.6) 2π −π nˆ nˆ Now the partition function reads, up to unimportant constants, π exp(−β H ) dα( r ) exp iα( r) E( r , n) ˆ Z (β) = E links
sites
−π
(7.7)
nˆ
We can write this in a more useful form as π βg 2 E( r , n) ˆ 2 dα( r) exp − Z (β) = 2a −π r links E + i[α( r ) − α( r + n)]E( ˆ r , n) ˆ
(7.8)
The sum over the integers E defines the periodic Gaussian function, an old friend. To evaluate the partition function we need one form of the Poisson summation formula, which reads ) π *1/2 1 2 2 (7.9) exp(cn + iαn) = pexp − α c 4c n where pexp denotes the periodic Gaussian, defined by pexp(−γ α 2 ) ≡ exp[−γ (α + 2π m)2 ]
(7.10)
m
It is a superposition of Gaussians with period 2π . It is a very convenient function because Gaussians are easy to integrate, while its periodicity in φ → φ + 2π is manifest. In addition, it has the general features of exp(−2γ ) exp(2γ cos φ) for large γ . Using this identity, the partition function becomes π a 2 Z (β) = (7.11) dα( r) pexp − (α( r ) − α( r + n)) ˆ 2βg 2 −π links where again we have dropped unimportant constants of integration. What are the features of this model? It is a system of nearest-neighbor Abelian spins coupled together through the periodic Gaussian and closely resembles the planar Heisenberg model of statistical-mechanics fame, π a Z (β) = dα( r) exp − 2 cos(α( r ) − α( r + n)) ˆ (7.12) βg −π links
7.1 Finite-temperature transitions
161
In fact, the periodic Gaussian expression is the so-called Villain approximation to the planar Heisenberg model. Note that the temperature of the original problem in quarks and gluons maps onto the inverse temperature of the Villain model. This is yet another duality relationship. In other words, as the temperature of the lattice-gauge theory increases, the temperature of the Villain model decreases. High-temperature features of the gauge model map onto lowtemperature features of the spin model and vice versa. The Villain model is known to have a low-temperature magnetized phase and a high-temperature disordered phase with a second-order phase transition between them. In particular, (1) for small a/(βg 2 ), the system has a short-range, exponential spin–spin correlation function, iα(0) iα( ∼ e−µr e r ) r →∞ (7.13) e and (2) for large a/(βg 2 ), the system is ordered and has a spin–spin correlation function that approaches a constant at large separations as 2 iα(0) βg iα( r) (7.14) ∼ exp e e r →∞ r The lattice theory will inherit a second-order phase transition from its dual spin system. To obtain a physical interpretation of the transition, let’s place static charges into the gauge system and calculate their force law. This means that we place charges ±g on sites r = 0 and r = R and recalculate the partition function. The sources are introduced into the formalism by changing Gauss’ law appropriately at the two sites, n)|0 E(0, ˆ = |0 (7.15) nˆ
and
n)|0 E( R, ˆ = −|0
(7.16)
nˆ
When we redo the partition-function analysis above, these changes at the two with the result that an extra sites alter the δ function at r = 0 and r = R, factor of exp(−iα( R)) exp(iα(0)) (7.17) is introduced everywhere. Our partition function is now proportional to a spin– spin correlation function, π a 2 = Z (β, R) eiα(0) e−iα( R) dα( r) pexp − (α( r ) − α( r + n)) ˆ 2 2βg −π links iα(0) −iα( R) (7.18) = Z (β) e e
162
Phase transitions
As reviewed above, we know the behavior of these correlation functions in the dual-spin model, so we can obtain the extra free energy of the lattice-gauge system due to the sources, = −[ln Z (β, R) − ln Z (β)]/β = F(β, R)
1 iα(0) −iα( R) ln e e β
(7.19)
Using Eqs. (7.13) and (7.14), we see that the extra energy due to the static charges in the gauge theory at low gauge temperatures is ∼ µR F(β, R) R→∞ β
(7.20)
whereas for high temperatures it is 2 ∼ g F(β, R) R→∞ R
(7.21)
So we learn that the gauge theory experiences a deconfining phase transition at some critical temperature at which the heavy-quark potential switches from linear confinement to ordinary Coulomb behavior. This is an interesting, nontrivial result. However, it is not the whole story for several reasons. First, the calculation was done for strong coupling, far from the lattice theory’s continuum limit. Results from additional analytic and numerical work indicate, however, that the important conclusion that there is deconfining transition in the model survives greater sophistication. Second, these Abelian models should be replaced by non-Abelian gauge fields. This complication can also be handled analytically at strong coupling and the deconfinement transition remains. Third, the gauge model does not contain matter fields that could screen the color of static sources and replace the linear confining potential at low temperatures by a short-range Yukawa potential and the Coulomb potential at high temperatures by a short-range Debye screened potential. Introducing simple matter fields into the strong-coupling considerations above confirms these expectations. In fact, dynamical matter fields act as external symmetry-breaking fields in the dual-spin model and smooth out the phase transition illustrated here and replace it with a crossover from Yukawa to Debye behavior. The sharp transition is eliminated. However, as we will discuss below, there is a phase transition in full QCD at nonzero temperature, but it is associated with chiralsymmetry breaking rather than confinement and the heavy-quark potential. 7.2 Simulations at nonzero temperature When four-dimensional lattice-gauge theory is formulated at nonzero T , one considers asymmetric lattices. Let the spatial box have N sites in the x, y, and
7.2 Simulations at nonzero temperature
163
z directions and denote the lattice spacing a. The physical volume of the box is then Vs = (a N )3 . The temporal direction is special and, as we know from conventional continuum field theory, has an extent proportional to the reciprocal of the temperature T . In fact, if we denote the number of lattice sites in the temporal direction Nt and the lattice spacing in this direction at , then the inverse temperature is T −1 = at Nt
(7.22)
Simulation studies are done using periodic boundary conditions imposed on the bosonic degrees of freedom and anti-periodic boundary conditions for the fermions. Of course, it is crucial to use such boundary conditions in the temporal direction in order to get the spin-statistics relations correct. In most simulations one chooses the spatial box to be large and varies the temporal extent to change T . There are many ways to do this and the best choice might depend on the goals of a particular simulation. For example, most simulations are done with the physical lengths of the lattice links the same in the spatial and temporal directions, a = at , but with Nt N . Then T is varied by varying the coupling β = 6/g 2 of the SU(3) lattice action. As β is taken larger (weaker coupling), the lattice system finds itself closer to the continuum limit where the correlation length of the model diverges in units of the lattice spacing a. This means that at Nt is effectively smaller when it is measured in physical units of the correlation length, so the temperature, the reciprocal of at Nt , has effectively been increased. Therefore, in many publications, authors plot observables against β and state that increasing β effectively heats the system. To make this connection quantitative we must return to the Callan–Symanzik function and calculate the dependence of the lattice coupling on the cutoff a. Recall that the idea of the Callan–Symanzik equation is the realization that, in a renormalizable field theory, the coupling defined at a certain length scale must depend on that length scale in order that physical quantities will, in fact, turn out to be cutoff-independent. For example, suppose that we calculate the proton’s mass M in a lattice version of QCD with a lattice coupling g. Then, by dimensional analysis, the mass would have the form M=
1 f (g 2 ) a
(7.23)
because a, the lattice spacing, is the only dimensional parameter in the theory. However, since M must be independent of a, this equation implies that g must depend on a in a precise fashion. The Callan–Symanzik equation makes this connection precise in the realm where perturbation theory applies.
164
Phase transitions
Through two loops in perturbation theory, the explicit lattice perturbationtheory calculations yield a
∂g = β0 g 3 + β1 g 5 ∂a
(7.24)
where the coefficients are
2Nf 11N 1 − β0 = 16π 2 3 3 2 34N 1 10N Nf (N 2 − 1)Nf β1 = − − (16π 2 )2 3 3 N
(7.25)
for SU(N ) gauge fields and Nf flavors of light quarks. The important point is, of course, that both β0 and β1 are positive if the number of quark species is small enough. This guarantees that the theory is weakly coupled at short distance – as a is taken smaller, then g must also be taken smaller, in accord with the Callan–Symanzik equation. Stated in another fashion, in order to keep the physics independent of the cutoff, for example, to keep the proton mass fixed at 938 MeV/c2 , the coupling must be tuned to the lattice spacing a according to the Callan–Symanzik differential equation. One can formally integrate the Callan–Symanzik equation, g dg (7.26) a = exp − β(g ) and use the perturbative expression over that region where g is small, 1 2 −1 (β0 g 2 )−β1 /(2β0 ) a = exp − 2 2β0 g
(7.27)
where the integration constant has the dimensions of a mass and sets the scale of lengths. Since the temperature T has the dimensions of mass, it must be proportional to , T = C
(7.28)
where the numerical constant C relates the two scales. Other physical quantities with the dimensions of mass should satisfy analogous relations, but each will have its own particular dimensionless constant C. This scaling result guarantees that ratios of quantities with the same dimensions are simple numbers, independently of the cutoff procedure. So we could imagine, as is done in practice, a lattice simulation that determines T and m ρ in terms of the cutoff a at a lattice coupling small enough that the Callan–Symanzik equation applies. Then the ratio of T to m ρ would be a property of the continuum
7.3 Pure gauge-field simulations
165
limit of the lattice theory and, knowing m ρ from experiment, we could predict the unknown temperature T . In practice, it is convenient to consider other more-elegant cutoff procedures and relate the lattice coupling g and its associated scale parameter to those of other schemes, such as Pauli–Villars and dimensional regularization. These intricate details can be found in the literature. In the course of these developments, one proves that the coefficients β0 and β1 are identical in each regularization scheme. We learn from this summary several lessons that are particularly important for lattice calculations and simulations. To make contact with continuum physics, lattice calculations using the simple actions discussed here must involve calculations at weak coupling, in the “scaling window” where the two-loop Callan– Symanzik equation is accurate. Only then can we expect the cutoff a to decouple from the physical quantities of interest. At finite temperature T , we anticipate the need for (1) weak coupling, to find the scaling region, (2) large temporal extents, Nt 1, so that discreteness effects are under control, and (3) much larger spatial extents than temporal extents, N Nt , so that finite-size effects associated with the size of the spatial box do not interfere with the desired temperature effects coming from Nt . It is a sobering fact that few, if any, simulations to date satisfy these criteria. There are many schemes for dealing with these potential problems. One can “improve” the lattice action so that the scaling window expands and continuum physics can be obtained from simulations at larger couplings on smaller lattices. The idea here, as we have seen in past chapters, is that there are O(a 2 ) deviations of the lattice action from its continuum counterpart even for smooth fields and weak (or vanishing) coupling. By using more sophisticated finite-differencing formulas these errors can be eliminated. Although the lattice action becomes more complicated and the differencing schemes will necessarily involve next-tonearest-neighbor couplings, the computational gains have been found to be very substantial. The lattice-spacing dependences of quantities such as the heavyquark potential are dramatically reduced. By including pertubative corrections in this improvement scheme errors of order O(g 2 a 2 ) can also be removed both from the pure-gauge piece and from the fermion-hopping piece of the action. Such improvements should make simulations aimed at finding the universality classes of various finite-T transitions practical and reliable. 7.3 Pure gauge-field simulations at nonzero temperature Simulations of pure glue are interesting, practical, and relevant. In the early days of lattice-gauge theory, theorists noted that there are eight adjoint colored gauge fields in QCD while there are only two light fundamental quarks. So “most” of the dynamics of QCD is due to gauge-field effects. In fact, asymptotic freedom is
166
Phase transitions
a property of non-Abelian gauge fields whereas Fermi fields tend to screen color and counter the unique effects of the interacting gauge fields. Therefore, it was felt sensible to generate equilibrated gauge-field configurations at various couplings and then study quark-matrix elements in this background. This approach constitutes the “quenched” approximation to the theory. Light-quark boundstate spectroscopy, chiral-symmetry breaking, and hadronic matrix elements are relatively easily studied in this context and much has been learned. Of course, we have also learned that this “approximation” has some serious limitations. These include the fact that it is misleading at nonzero-baryon-number chemical potential, as we will discuss in Chapter 10. It cannot be used to study QCD’s phase diagram in the T –µ plane. In addition, it suffers from other pathologies that derive from the fact that its configurations omit the axial anomaly and therefore display more symmetry (a massless η ) than they should. Unfortunately, an “approximation” that has more symmetry than the real theory can have qualitative errors and this occurs in the quenched approach in the limit of light quarks. With these caveats, it is interesting and useful to address the thermodynamics of pure gauge fields. At low T we expect that the theory confines quarks through the formation of electric-flux tubes that produce a linear confining potential, as described earlier. In addition, the pure gauge-field theory is expected to generate a mass gap dynamically and have a rich spectrum of color-singlet excitations. The idea is that the flux tube has an energy per unit length, the square root of the string tension σ , so the heavy-quark potential reads V (R) =
√
σR−
π/12 + ··· R
(7.29)
The string tension σ is generated dynamically and it can be related to the √ theory’s scale parameter , σ = C. The constant of proportionality C has been measured in lattice simulations. Since glueballs are created from the vacuum by applying operators consisting of closed loops of U operators, they can be thought of as rings of electric flux in the strong-coupling limit, g large, discussed earlier. Their masses and properties are, therefore, closely tied to the formation and scale of the flux-tube mechanism of confinement. Estimates of the glueball masses are made in lattice-simulation studies. They are relatively heavy states that are also expected to exist as fairly sharp resonances in QCD, in which they would be able to decay into lighter mesons. Their experimental discovery would be a triumph both for QCD in general and for lattice-gauge theory in particular. As the pure gauge theory is heated, we expect a phase transition to a gluonic plasma. In this phase there should be complete color screening. For example, if a quark in the fundamental representation of color SU(3) were placed into the hot vacuum, its color field should be short-ranged and screened exponentially. In particular, the string tension should vanish at the critical point between the
7.3 Pure gauge-field simulations
167
low- and high-temperature phases. We obtained these results at strong coupling above and they are expected to be true in the continuum limit of that analysis. Computer simulations in the scaling window support this view. These qualitative considerations suggest an order parameter for the transition: at low T the theory is expected to confine fundamental quarks through fluxtube formation, whereas at high T , for which asymptotic freedom applies and predicts that regions of fixed volume should have free-field properties decorated by small perturbative corrections that vanish logarithmically as the temperature grows large, quarks should propagate in a color plasma. This difference can be made precise by considering the excess free energy for a static quark charge. The worldline for such a quark would be purely timelike and it would couple to the gauge-field background through a U matrix, as in the hopping fermion actions studied earlier. So, an order parameter would be the Wilson line (called the Polyakov loop in this context when the trace has been taken) [44], Nt L( x ) = tr U0 ( x , x0 ) (7.30) x0 =1
As discussed above in the context of quark confinement, since this quantity represents a heavy, static quark impurity in the system, the expectation value of the Wilson line, suitably averaged over the spatial box, records the excess free energy due to the quark, : : : 1 : : : −Fq (T )/T e ∝ L = : L( x ) (7.31) : : N 3 x : This quantity is an order parameter for the transition to the plasma because it vanishes identically in the low-T phase in the thermodynamic limit, while it is nonzero in the plasma. At low T , the excess free energy that occurs when a static quark is placed in the vacuum diverges as the linear dimension of the spatial box, because the quark is the source of a flux tube. However, in the high-temperature phase the plasma of gluons can screen the color of any impurity, so the excess free energy is finite. In order to classify the phase transition, we need to determine the global symmetry group that is broken. The order parameter is locally gauge-invariant, so we seek a global aspect of color SU(3) rotations that will survive the trace in the definition of the order parameter. The triality of the color matrix is a natural candidate that is familiar from the old-fashioned quark model. In particular, consider the center of the color SU(3) group which consists of the three complex roots of unity, Z 3 . If we make a Z 3 transformation on all the timelike links on a given temporal hyperplane, U0 ( x , x0 ) → zU0 ( x , x0 ) (x0 specified),
z ∈ Z3
(7.32)
168
Phase transitions
Note that this transformation leaves the pure gauge-field action unchanged because the action contains only closed loops of U matrices that pierce the special hyperplane twice (once in the +t direction giving a z factor and once in the −t direction giving a z ∗ factor) or not at all (purely spatial loops). The transformation affects the Wilson line, L → zL
(7.33)
so, when L develops a ground-state expectation value, the Z 3 symmetry is broken. The significance of the center of the global gauge group has been demonstrated nicely in computer simulations that have plotted the Monte Carlo “time” evolution of the phase of the Wilson line and shown that it tunnels among the three different possible values for z. In fact, the first-order character of the deconfining transition has been confirmed in such studies. More insight into the order of the finite-temperature deconfinement transition for SU(3) gauge fields follows from effective-Lagrangian descriptions of the transition. Conventional wisdom suggests that this transition can be described by a local, bosonic spin model in three dimensions with an order parameter that experiences the same breakdown of symmetry. Let’s motivate this approach. The modern renormalization-group approach to phase transitions states that the universality class of a phase transition depends only on a few, crucial features of the system. The dimensionality, symmetries, and range of interactions of the order parameter are paramount. The relevant dimensionality here is three dimensions rather than four because we are studying the equilibrium statistical mechanics of a system with three spatial dimensions. There is no real temporal development. More formally, when the system is defined in Euclidean space, the “temporal” direction actually is just a recipe for incorporating the temperature into the thermodynamics. The temporal direction is fixed and “small,” aT = Nt−1 , relative to the size of the spatial box. Because of the periodic boundary conditions in the temporal direction, the possible frequencies that can be fit into the system are quantized. These are the Matsubara frequencies, ωn = 2nπ T (bosons),
ωn = (2n + 1)π T (fermions)
(7.34)
where n = 0, 1, 2, 3,. . . . For bosonic fields the lowest Matsubara frequency is zero, producing a field that does not vary in the temporal direction. The other Matsubara frequencies are nonzero and large at nonzero T . It is reasonable to suppose that the modes with nonzero Matsubara frequencies are not excited at a second-order phase transition. This is because the important modes are essentially massless and only their long-wavelength excitations are statistically significant at a second-order phase transition. In this situation, there is little probability of exciting nonzero Matsubara frequencies. The system is then effectively three-dimensional and one says that it is “dimensionally reduced.”
7.3 Pure gauge-field simulations
169
Back to the deconfinement transition. Given the plausibility of dimensional reduction for the high-temperature behavior of the pure gauge model, we are led to consider three-dimensional bosonic field theories with Z 3 global symmetries as their prototypes. The simplest of these is the three-state Potts model. It is just the obvious generalization of the Ising model (Z 2 ) to an order parameter with a threefold symmetry (Z 3 ). It is known that the three-state Potts model has a first-order transition in three dimensions. Explicit simulations have shown that the deconfinement transition in the SU(3) gauge field case is, in fact, of first order. As stated above, threefold metastability has been found in the Monte Carlo “time” evolution of the phase of the Wilson line, so there is considerable confidence that the Z 3 symmetry and the three-state Potts model are relevant. The bulk thermodynamics of the finite-T transition in the pure gauge model has been studied by simulation methods in considerable detail. Since the simulations involve only bosonic degrees of freedom, they have been done far better than their light-quark counterparts. It has been found that the pressure of the pure SU(3) gauge system stays small for all temperatures below Tc . This is certainly expected, because the only relevant degrees of freedom in the low-T phase are glueballs, which are, as indicated above, quite heavy and lead to an exponential suppression of pressure and energy density at low T . Above Tc simulations show that the pressure rises rapidly and reaches two thirds of its asymptotic ideal-gas value by a temperature of twice the critical value. However, the approach to the limiting free ideal-gas values for the pressure, entropy, and energy density is quite slow from this point onward. The old hope that high temperature perturbation theory could descibe the quark–gluon plasma accurately for temperatures of several times Tc seems too naive. Even though the energy density jumps discontinuously at the first-order plasma transition with a very substantial latent heat of about 1.5Tc4 , the system is not well approximated by the ideal-gas equation of state, p = /3, for temperatures expected to be relevant to heavy-ion collisions and controlled lattice-gauge simulations. One finds a delayed rise in the pressure compared with the energy density, so the velocity of sound in the plasma is predicted to be considerably below its ideal-gas value. According to the Casher argument relating chiral-symmetry breaking to bound-state formation and confinement, it is natural to wonder whether chiral symmetry is restored at the deconfinement transition at nonzero T . This is not necessarily true because the forces of attraction between the quark and the anti-quark may be strong enough even on the hot, plasma side of the deconfinement transition to lead to ¯qq condensation. However, for quarks in the fundamental representation of the gauge group, it does appear that the screened forces are not sufficient to produce a condensate and chiral symmetry is restored when deconfinement occurs. This has been confirmed in computer simulations of the quenched theory, i.e. we study light quarks propagating in the dynamical
170
Phase transitions
ensembles of pure gauge fields, as described above. In fact, the deconfinement and chiral-symmetry transitions are coincident and of first order. So the fact that the expectation of the Wilson line jumps from zero to a finite value at the deconfinement temperature Td , while the chiral condensate jumps from a sizable value to zero at the same Tc = Td , strongly suggests that the gluonic forces which cause confinement, flux-tube formation, and bound-state formation are also the cause of chiral-symmetry breaking. 7.4 Restoration of chiral symmetry and high temperature When light quarks are put into the dynamics so that realistic QCD with Nf flavors is being simulated, this scenario must change dramatically. In particular the Z 3 symmetry of the pure gauge-field model is not a symmetry at any temperature of the action with fermions. This is obvious because of the factor of U in the quark hopping term. The Z 3 symmetry is explicitly broken in the action, so the Wilson line should be nonzero under all conditions. This means that the excess free energy due to the static quark in the Wilson line is always finite. There is a simple interpretation for this formal result. Once there are light quarks in the dynamics, then they can screen any impurity such as the Wilson line. So the Wilson line is no longer the source of an “infinitely” long flux tube; instead, the light dynamical quarks terminate the tube and render its effects short-ranged. The static quark now gives rise to a Yukawa potential due to the screening provided by the light quarks. Since the long-range confining forces are eliminated, it is no longer obvious that there is any chiral-symmetry breaking in lattice QCD. In fact, simulations indicate that chiral-symmetry breaking survives the Yukawa screening at vanishing T . Casher’s physical picture continues to hold, apparently because the forces are strong enough to make a rich bound-state spectrum. It appears that, for dynamical quarks with small but nonzero masses, screening is weak enough to permit chiral-symmetry breaking to survive. Simulations suggest that the linear potential extends out to sufficient distances from its source, one light quark in a q¯ q state, so that binding can occur over that distance scale and make the linear potential relevant to the spectroscopy of the two-body states. The Casher argument applies as in the model with no screening. However, for distances greater than the size of a typical hadronic bound state, screening due to light quarks becomes overwhelming, eliminating the long-range potential and producing color-neutral bound states. This physical picture, complete with its particular length scales, finds additional support in finite-T simulations. Simulations of lattice QCD with two or three light quarks show that the Wilson line does jump dramatically at the same temperature Tc as that at which chiral symmetry is restored and the q¯ q condensate vanishes exactly. This observation can be made more precise by considering
7.4 Restoration of chiral symmetry
171
the susceptibilities for the Wilson line and the chiral condensate, ∂ ¯ ψψ (7.35) ∂m These quantities peak where the Wilson line and the chiral condensate, respectively, are changing rapidly. Plots of the susceptibilities pick out narrow regions that, within statistical accuracy, correspond to identical critical temperatures. In the limit of massless quarks, we expect that a good order parameter for the transition from hadronic matter to the quark–gluon plasma would be the chi¯ ral condensate ψψ. The continuum QCD Lagrangian has the algebraic, chiral symmetries of the massless Dirac equation, SU(Nf )L × SU(Nf )R , corresponding to independent γ5 rotations of the left- and right-handed components of Nf Dirac fermions. This symmetry should be broken dynamically by Casher’s argument down to the algebraic symmetry SU(Nf )L+R when the quarks develop a mass. This spontaneous symmetry breaking produces Nf2 − 1 Goldstone particles, which, in the case of Nf = 2, corresponds to three pions. In an effective three-dimensional Lagrangian description of this transition, one would want to include the relevant light bosonic degrees of freedom, which, in the case of chiral symmetry and Nf = 2, would be the three pions and the σ particle. In the chirally symmetric phase, these four states would be degenerate and would comprise a chiral vector. In the broken phase, the three pions would become massless Goldstone particles and the σ would be a massive excitation. In the generalization of this picture to arbitrary Nf , the order parameter is conveniently chosen to be an Nf × Nf matrix, & ≡ (&i j ), of quark–anti-quark bilinears that parametrize the symmetry breaking, &i j ∼ ψ¯ i (1 + γ5 )ψ j . Under an SU(Nf )L × SU(Nf )R transformation of the quarks, & → UL &UR . The lowenergy dynamics of the bosonic degrees of freedom are controlled by the threedimensional effective Lagrangian that has the same global symmetries as the fundamental QCD Lagrangian, χ L = N 3 (L 2 − L2 ),
χm =
1 1 π2 g1 (tr(&† &))2 L eff = − tr(∂µ &† ∂ µ &) − tr(&† &) + 2 2 3 π2 + (7.36) g2 (tr(&† &)2 ) + c(det & + det &† ) 3 where all the terms should be familiar and expected, except, perhaps, the last determinantal interaction. This term is needed in order to account for the fact that triangle graphs in QCD break the axial U(1)A anomaly, and with the help of instantons, lift the η mass to a large value. These subtle points were discussed in Chapter 4 above. A renormalization-group analysis of this Lagrangian predicts that it undergoes a second-order transition for Nf < 3, but a first-order transition otherwise [45]. The one caveat concerns the fate of the η meson at Nf = 2. If the U(1)A symmetry breaking, which is due to the triangle anomaly and instantons,
172
Phase transitions
becomes rather weak at Tc , then the η would become another low-lying bosonic excitation whose inclusion in the dynamics could induce a fluctuation-driven first-order transition. Simulations suggest that this accidental “restoration” of symmetry does not occur. In the case of two light flavors, this effective Lagrangian suggests that the O(4) spin model determines the universality class of the chirality restoring finite-T transition in two-flavor QCD. In this ideal case of massless quarks, measurement of the properties of the QCD phase transition would provide a nontrivial test of the universality hypothesis.1 Of course, the actual situation in QCD is rather complicated because the theory has two very light flavors, the up and down quarks, whose Lagrangian masses are just a few MeV, and a third, strange quark whose Lagrangian mass is of the order of 100 MeV. Simulations favor the argument that this quark spectrum leads to a chirality-crossover phenomenon rather than a real phase transition. However, since the massless, two-flavor chirality-transition point would lie “close” to the physical situation, its O(4) critical indices might be relevant to the character of the real system’s crossover. Lattice-gauge-theory simulations have not determined the critical indices of the two-flavor model. Improved gauge and fermion actions are being pursued with this goal in mind.
7.5 Hadronic screening lengths A simple and appealing way to study and understand the chiral-symmetryrestoring transition in QCD is to consider its spectrum and screening lengths for temperatures above and below Tc . Since chiral symmetry is restored at Tc , the pion triplet is not expected to be massless in the plasma. The character of the pion excitations above Tc is not known, but it is suspected that they are quasiparticles in the plasma phase, at least for temperatures near Tc . If this is the case then, since chiral symmetry is restored, there should be a “σ” quasi-particle in the plasma phase degenerate with the pion triplet. The fact that the system undergoes a second-order phase transition at Tc indicates that the σ and the pions were massless at Tc and their masses increase continuously as T increases from Tc . This can be seen in lattice simulations by measuring the chiral susceptibility, defined in the equation above, and finding a divergence at Tc . This perspective suggests that other aspects of restoration of chiral symmetry can be studied on the lattice by measuring susceptibilities for other quark–anti-quark bilinears. One defines hadronic correlation functions in a given
1 Although there is little doubt that it holds, this is by no means a proven fact in a theory like
QCD.
7.5 Hadronic screening lengths quantum-number channel, and their associated susceptibility, 1/T χH = dτ d3r G H (τ, r)
173
(7.37)
0
where the meson correlation function is G H (τ, r) = χ¯ (0)γ H χ (0)χ¯ (τ, r)γ H χ (τ, r)
(7.38)
where γ H is the appropriate collection of γ matrices from which to construct the composite operator with the desired quantum numbers. The susceptibilities are essentially the inverses of the squared masses, m 2H . High-quality calculations of these susceptibilities then expose the symmetries of both phases [46]. For example, the divergence of the π and σ susceptibilities at Tc shows that the σ becomes massless as the critical point is approached from below and chiral symmetry is restored in the quark–gluon-plasma phase. The two susceptibilities are identical in the hot phase, as expected when chiral symmetry is restored. This also suggests that there are π and σ quasi-particles in the hot phase in the vicinity of Tc . Another approach to understanding the symmetries of both phases is afforded by hadronic screening lengths. Consider two composite hadronic operators, A(τ, r) and B(τ, r), made out of the appropriate products of fields – a quarkand-anti-quark field with particular spin and flavor for mesons, and three quark fields to make a particular baryon. If we average these operators over the temporal axis τ and the transverse plane, 1/T L L T ¯ A(z) = lim dτ dx dy A(τ, x, y, z) (7.39) 4L 2 L→∞ 0 −L −L and calculate the correlation function, ¯ B(0) ¯ ¯ ¯ − A(0) B(0) S AB (z) = A(z)
(7.40)
which we then fit to a screening form, S AB (z) |z|→∞ ∼ c exp(−m AB (T )|z|)
(7.41)
we can extract the screening masses m AB . This program has been carried out both for the pure SU(3) gauge model and for QCD with several flavors of light dynamical quarks. One finds a spectrum of states characterizing chiral-symmetry breaking in the hadronic phase: the π is essentially massless while its chiral partner the σ is heavy, the ρ and its partner the a1 are massive with different masses, and the even-parity N+ and odd-parity N− baryons are even more massive and split. In the plasma phase, the π and the σ
174
Phase transitions
become massive and degenerate, as do the other chiral partners. Since the computer signals for these correlation functions are large and relatively free of noise, the simulation suggests that the plasma has a rich nonperturbative spectrum of states for T in the vicinity of Tc . Additional evidence for the nontrivial character of the plasma phase comes from studies of the Debye screening mass, m D . To leading order in perturbation theory, 3 Nf m D = 1 + g(T )T (7.42) 6 where g(T ) is the running coupling of QCD evaluated at the relevant mass scale, the temperature T . In the region of the transition, g(T ) is of the order of unity and the accuracy of expansions in g(T ) is questionable. In fact, the measured Debye screening mass is roughly three times the perturbative prediction for accessible values of T . We learn that even the short-distance properties of the plasma are highly nonperturbative for accessible temperatures, even though asympototic freedom assures us that, at truly high T , the bulk thermodynamics of the plasma should be given by free-field estimates decorated with small, calculable perturbative corrections. Sorting out the physics of the QCD plasma is an active field of research for lattice-gauge theorists, high-energy phenomenologists, and experimentalists. 7.6 Thermal dilepton rates and experimental signatures for the quark–gluon plasma The hadronic correlation lengths discussed in the previous section provide information about the response of QCD to time-independent, static perturbations. In other words, only zero-frequency components of the correlation functions are involved. It is also interesting, and even more relevant from the experimental point of view, to address the question of the response to time-dependent perturbations. The difficulty here, in comparison with static correlation-length measurements, is that one needs information about the correlation function at real time intervals, whereas lattice field theory is formulated in Euclidean time. In the frequency language, we need the correlation function, or the corresponding spectral function, for a continuum of real values of the frequency, while the Euclidean theory provides us only with discrete Matsubara modes. This difficulty can, in some cases, be overcome, and we shall discuss here the example of the spectral function relevant for dilepton rates. Dilepton rates have been discussed for many years as an informative probe into the dynamics of the quark–gluon plasma. In such a process, a quark and an anti-quark in the plasma annihilate into a virtual photon, which then materializes as a lepton–anti-lepton pair. The probability of such an event is clearly
7.6 Thermal dilepton rates
175
proportional to the thermal expectation value of the product of the electromagnetic current with itself. The expression for the cross section will be given below. The production rate will be very sensitive to the temperature of the plasma (the larger the temperature, the larger the rate), so the dileptons are preferentially produced in the early stages of a heavy-ion collision. Since they are only weakly coupled to nuclear matter through the fine-structure constant, they have long mean free paths, which allow them to escape from the system without significant final-state interactions. These properties make electromagnetic signals particularly good probes into the physics of the early stages of the formation of the quark–gluon plasma. It is particularly interesting to focus on hadronic-resonance contributions to the dilepton rate. If the Q 2 of the virtual photon (dilepton pair) matches the mass of a vector-meson state of QCD, we expect an enhancement of the dilepton rate. Since this production occurs inside a volume of quark–gluon plasma, the positions of the peaks and their widths depend sensitively on the interaction of the hadronic state with the plasma itself. By varying Q 2 we can concentrate on different length and time scales of the interactions and screening mechanisms within the quark–gluon plasma. The strength of the ρ signal, for example, tells us how the light-quark dynamics is affected by the medium, whereas the charmonium signal is sensitive to the screening of the interactions between heavier charmed quarks. Experimental and lattice-simulation data are converging on decisive results here. In fact, some lattice results can be interpreted as evidence for the experimental generation of the quark–gluon plasma at the RHIC. The dilepton rate, which is a function of frequency ω and momentum p, can be written in the form [47] dW 5α 2 1 = σV (ω, p; T ) 2 2 ω/T dω d p 27π ω (e − 1)
(7.43)
where the spectral function σV (ω, p; T ) is related to the current–current corre† on the periodic x ei p · x JVµ (τ, x)JV,µ (0, 0), lation function, G V (τ, p; T ) = d lattice as ∞ cosh[ω(τ − 1)/(2T )] (7.44) dω σV (ω, p; T ) G V (τ, p; T ) = sinh[ω/(2T )] 0 On an N 3 × Nτ lattice, one can measure the correlation function G V for each “time” slice, τk T = k/Nτ , k = 1, 2, . . . ,Nτ , and then attempt to invert the relation between the spectral function and the correlation function. There are two approaches to executing this inversion. First, one might use a model-dependent functional form that depends on several parameters and determine σV (ω, p; T ). This approach has had only limited success. Alternatively, one might use the maximum-entropy method (MEM) [48] and find the “most
176
Phase transitions
probable” spectral function. Since the lattice simulation gives the correlation function only at a discrete set of points τk , each with statistical errors, the problem of finding the continuous function σV of the continuous variable ω is illposed. The MEM can produce only a “most probable” answer with a measure of its reliability. The method, however, is well–known and quite successful in optics, where it is used for image reconstruction. The MEM is well grounded in traditional probability theory and uses the χ 2 likelihood function to constrain the maximization of the Shannon–Jaynes entropy function in order to produce the “most likely” spectral function. As one might guess, it requires many data points, each with small errors, to predict a spectral function reliably. At least several dozen values of τk are typically needed, so large lattices are essential, which restricts the application of the method to either quenched QCD studies or simple models at present. This is a pity because collision broadening, which is sensitive to dynamical quarks and Fermi statistics, is expected to play an important role in determining the shape of the spectral function in heavy-ion experiments. In addition, the MEM does not predict the spectral function with uniform confidence over its range of ω values. Typically, the low-ω range is poorly determined. This is also a pity because this is frequently the most significant part of the power spectrum in heavy-ion collisions. Simulations have shown that the ρ and φ peaks in the dilepton spectra are suppressed in the presence of the quark–gluon plasma, in qualitative accord with the experimental trends [47]. However, the dilepton signal is still somewhat larger than that found for free but hot quarks and gluons, at least for ω/T between 4 and 8. Simulations have been done above the transition, at 1.5Tc and 3Tc , so measureable attraction in these light-quark states at these high temperatures is somewhat surprising. Results such as these suggest that, although the bulk thermodynamic behavior of the quark–gluon plasma might be close to the Stefan–Boltzmann limit, detailed hadronic structures are not completely washed out. Additional simulation results at 0.9Tc and 1.2Tc have been obtained for the charmonium states. The s-wave states had a diminished (perhaps by a factor of two) signal above the transition where potential models predicted that there would be no signal at all because the gluon plasma would screen the heavyquark potential so thoroughly that the c– c¯ state would become unbound [49]. The p-wave signal, however, was reduced by a factor of about seven across the transition, suggesting that screening had removed these charmonium states from the spectrum. Recent data are shown in Figs. 7.1 and 7.2. 7.7 A tour of the three-flavor QCD phase diagram In Fig. 7.3, a very busy affair, we present a survey of the phase diagram of lattice QCD in temperature, light-quark masses, and the strange-quark mass. For
7.7 The three-flavor phase diagram
177
1
S0 state
0.25
0.9Tc 1.2Tc
0.2
σ/ω
2
0.15 0.1 0.05 0 2
4
6
8 10 ω (GeV)
12
14
16
18
Figure 7.1. S-wave charmonium signals above and below the phase transition in quenched QCD. 3
0.35
P1 state 0.9Tc 1.2Tc
0.3
σ/ω
2
0.25 0.2 0.15 0.1 0.05 0 2
4
6
8 10 ω (GeV)
12
14
16
18
Figure 7.2. P-wave charmonium signals above and below the phase transition in quenched QCD. experimental purposes and applications in general we are interested in very light “up” and “down” current quark masses and a considerably heavier “strange” quark. To handle a realistic spectrum of quark masses, with “up” and “down” quarks light enough to accommodate a realistic pion, and to make contact with continuum scaling laws will take more computer power and better, improved actions than have been used to date. These improvements are occurring gradually in the field. The presentation here represents the current state of the art and is probably incomplete. Perhaps someone will invent and apply qualitatively better approaches to this problem, which will yield a far more decisive version of our figure. We will leave to the references discussions of lattice sizes, bare-quark masses, the chosen fermion method, the action used, etc. There are important questions of lattice engineering at play here, but they are best not covered explicitly.
178
Phase transitions
Figure 7.3. Phases of finite-T QCD as the light-fermion spectrum changes. The physical point of QCD with a realistic spectrum of quarks is labeled with question marks. Anyway, let’s take our tour. Begin at the lower-left-hand corner of Fig. 7.3. Note that the common lightquark mass labels the horizontal axis and the strange-quark mass labels the vertical one. Recall that theory and simulation both indicate that, when the three quarks are massless, there will be a first-order chiral-symmetry-restoration transition between cold hadronic matter and the hot quark–gluon plasma. This result is sensitive to there being three light quarks, as discussed using effective Lagrangians above. In the case of two light quarks, the upper-left-hand side of the phase diagram, the transition is expected to be of second order and in the universality class of the three-dimensional O(4) spin system. The numerical evidence for this universality class is not yet compelling but should improve in the not-too-distant future. Along the left-hand vertical axis there should be a tricritical point separating the region of first-order transitions from the line of second-order transitions.
7.7 The three-flavor phase diagram
179
Now let’s enter the phase diagram along the diagonal from the lower-lefthand corner to the upper-right-hand corner. As the quark masses increase, the first-order chiral transition must weaken. Simulations indicate that the transition becomes a crossover phenomenon and the hadronic state is actually continuously connected to the plasma. The crossover, however, appears quite abrupt in lattice simulations. The boundary between the first-order region and the crossover region is expected to be described by the three-dimensional Ising model, as predicted by effective Lagrangians for analogous liquid–gas transitions. This boundary and its transitions do not have an immediate interpretation in terms of the degrees of freedom of QCD. Instead the transition is governed by the long-wavelength energy and order-parameter fluctuations in the system which are determined phenomenologically. There is some lattice evidence accumulating for the accuracy of this viewpoint and the Ising critical indices [50]. Let’s focus more on the following pressing question: where exactly does QCD, with its particular quark masses, fit into this diagram? As stated above, present-day lattice simulations put it into the crossover region, but rather close to the boundary with the first-order chiral region, as shown in greater detail in Fig. 7.4. This figure is labeled by the scale of the light iso-doublet of quark masses, M, and the temperature of the simulation T . The strange-quark mass is taken to be a higher multiple of M, typically (20–40)M in simulation studies. Numerical simulations show that the quantitative features of the change between hadronic and plasma observables are quite clear, and the crossover is quite abrupt. It appears that the high-temperature phase transition relevant to nature will be controlled by the physics of chiral-symmetry breaking rather than by the physics of confinement and deconfinement. If realistic QCD lies in the crossover region but close to the chiral critical point as shown in Fig. 7.4, then there may be some interesting and curious experimental signatures of the transition. The chiral critical point, labeled “C” in Fig. 7.4, resembles a gas–liquid point in a generic phase diagram and is expected to be described by the three-dimensional Ising model. Therefore, QCD should display its version of “critical opalescence” near that point. An SU(3) flavor-singlet, positive-parity meson should become massless at that point. Perhaps it will be the lightest state in QCD’s spectrum in the plasma phase near the critical temperature! Such an odd possibility, a single 0+ flavor-singlet meson lighter than the familiar pion multiplet, is a matter of speculation and interesting experimental scenarios. We have not heard the last word on this subject and the reader is referred to the literature for additional remarks on this possibility. More simulations with improved actions will be run on the lattice in order to determine where QCD with its particular quark masses actually lies on Fig. 7.4. Smaller light-quark masses and larger lattices, which are needed in order to control finite-size effects, are required.
180
Phase transitions confined
deconfined
D
M
QCD? C
hadronic matter
quark–gluon plasma T
Figure 7.4. A schematic phase diagram of (2 + 1)-flavor QCD. The masses of the “up” and “down” quarks are M, the label of the y axis and the strange quark is a much higher multiple of M, such as (20–40)M. The temperature T labels the horizontal axis. “C” labels the chiral critical point and “D” labels the deconfinement point. The dashed line, which passes close to point “C,” labels the “realistic” QCD situation, we believe. As we progress up the diagonal in Fig. 7.3 toward the upper-right-hand corner we must meet another boundary to another first-order regime governed by the pure SU(3) gauge-field dynamics. This is the region of heavy-quark masses. As the quark masses become comparable to the reciprocal of the ultraviolet cutoff a, the lattice spacing, they decouple from the low-energy dynamics of the gauge fields and the full theory becomes quenched. The finite-temperature transition is then one of confinement–deconfinement with the Wilson line acting as an order parameter. The global symmetry which is broken in the confined phase and restored as the system is heated and goes through a first-order transition into the unconfined phase is Z (3), the center of the gauge group, as we reviewed in more detail above. Theoretical considerations predicted a first-order phase transition, mapping the transition onto the three-state Potts model in three dimensions. Chiral symmetry is also broken at the first-order transition of the quenched model, with the chiral condensate of massless quarks jumping to zero at that temperature at which heavy quarks would be liberated. It is interesting to compare the critical temperatures for the three transitions that occur at the corners of the phase diagram Fig. 7.3. In the lower-left-hand
7.7 The three-flavor phase diagram
181
corner where three massless quarks experience a chiral-symmetry-restoring transition into the quark–gluon plasma, the critical temperature is the lowest, ∼155 MeV. In the upper-left-hand corner where two massless quarks experience a chiral-symmetry-restoring transition into the quark–gluon plasma, the critical temperature is somewhat higher, ∼175 MeV. In the upper-right-hand corner where the eight gluons of the SU(3) gauge group experience a deconfining transition from a state of closed, confining string into the gluon plasma, the critical temperature is the highest, ∼270 MeV. These systematics, namely that the addition of more and more light quarks leads to more and more screening and a lower critical temperature in each case, seem plausible.
8 Physics of QCD at high temperatures and chemical potentials
8.1 The thermodynamic background Let’s begin our discussion of QCD in extreme environments that are rich in baryons by reviewing some basic concepts of thermodynamics. We want to remind the reader of the partition function, its relation to Euclidean field theory, and the physics and formalism of the chemical potential in simple settings. These discussions will set the stage for more-challenging applications to QCD. The reader should appreciate that QCD is a remarkable, yet frustrating theory. On the one hand, we know the theory’s “first principles” thoroughly, in the sense that we have a well-defined prescription for calculating any quantummechanical amplitude governed by strong interactions. The principles are simple and beautiful. However, the phenomena which result from application of these principles are very complicated. For example, asymptotic freedom indicates that a natural language, both for physical ideas and for quantitative estimates, for the short-distance behavior of QCD is that of weakly interacting quarks and gluons. For larger distances and times, this description becomes inadequate in several ways. First, the running coupling grows large and perturbative estimates are no longer reliable. Second, the assumption that physical quantities should be analytic functions of the coupling becomes untenable. Quarks and gluons are not weakly fluctuating degrees of freedom at these length scales. Other dynamical variables, perhaps instantons, color-bearing monopoles, and flux tubes, provide a more relevant description of the physics. These facts indicate that reliable calculations in QCD are beyond our existing mathematical abilities. Most discussions and applications of the theory, starting from the seminal phenomena of color confinement and dynamical mass generation, require physical assumptions inferred from experiments and cannot be derived from mathematical, defensible self-contained arguments. In this book, however, we deliberately limit ourselves 182
8.1 The thermodynamic background
183
to results that can be obtained using purely theoretical means with a minimum of phenomenological input. As we turn to the properties of QCD in extreme environments, we shall be considering QCD in equilibrium under conditions characterized by finite densities of matter or energy. The theoretical language appropriate for these environments is that of thermodynamics. The primary mathematical object which encompasses all the thermodynamic information in the theory is the thermodynamic partition function. This function can be written as a path integral in QCD. The calculation of this integral from first principles is a very difficult mathematical task, and in the majority of cases it is impossible within our current abilities. There are limiting cases, for example QCD in the limit of large density, for which this calculation can be done, at least to leading order, in an expansion in a controllably small parameter – the effective strong coupling at the relevant energy scale. Another example is the limit of low density and small quark masses (the chiral limit), for which a controllable expansion in momenta and quark masses can be done – chiral perturbation theory. Lattice simulations and calculations on a computer allow us, in principle, to calculate the partition function (or its derivatives) for any values of parameters. The QCD partition function is an infinite-fold integral that can be rendered finite by limiting the number of quantum-field degrees of freedom through an ultraviolet cutoff (the lattice spacing) and by an infrared cutoff (the lattice volume). Though the results obtained in this way are numerical, in many cases they serve as a basis for our intuition into the behavior of QCD in parameter regimes that are inaccessible analytically. The most basic phenomena, confinement and chiral-symmetry breaking, are prime examples. Lattice QCD simulations often play a role similar to an experiment. Analytic calculations based on controllable expansions as well as lattice calculations based on discretization of the partition function are often referred to as first-principles calculations. 8.1.1 Thermodynamic ensembles In thermodynamics we do not attempt to describe the state of a system by specifying the values of all its quantum numbers. Rather, we allow the system freedom to occupy probabilistically any one of a set of microscopic states with given macroscopic properties. This set of states is refered to as a thermodynamic ensemble. Such a description is more appropriate for a system with a large number of degrees of freedom, in a situation in which we neither know, nor care to know, the precise behavior of every degree of freedom. Examples of quantities that characterize a system in a thermodynamic description are global, bulk quantities, such as the total volume, total energy, and total value of a conserved charge, such as the baryon number, for example. As long as the system is in a state with given values of such global quantities, we do not care which particular
184
High temperatures and chemical potentials
microscopic state it is in. In fact, in an experimental situation, it is even impossible to find out which microscopic state it is in, because of the vast number of degrees of freedom involved. In this situation we describe the state of the system by a probability distribution. At first glance there seems to be considerable ambiguity in the choice of the probability distribution. However, the power of thermodynamics is that this choice is largely insignificant, as long as we remain truly within our macroscopic, or thermodynamic, description. The most straightforward choice of an ensemble is to pick all the states with given values of the global quantum numbers (energy, charges, etc.) and assign them all the same probability (to be more careful, one should consider a small interval around the given values of global quantities). This is the so-called microcanonical ensemble. It corresponds literally to a system that is completely isolated from the environment, so that no conserved quantum numbers in it can change (fluctuate). Another ensemble, which is more convenient theoretically, is the so-called canonical ensemble, which was introduced by Gibbs, in which the energy is allowed to fluctuate in the following way: each state has a probability weight exponentially decreasing with its energy E: exp(−E/T ). The parameter T is the temperature. The higher T , the greater the likelihood that the system is in a higher-energy, excited state. Now, what would the probability distribution for the energy of the system be in this case? This probability is proportional to the number of states with a given energy, (E), times the factor exp(−E/T ): P(E) ∼ (E)e−E/T .
(8.1)
If the system has a large number of degrees of freedom, the number of states (E) should grow exponentially with the energy. This is indicated by simple combinatorics: the total energy is a sum of many small contributions, the number of which grows with the total energy. It should also grow with the total size of the system. As a result, we have two competing factors in Eq. (8.1), one that increases with E and one that decreases. Both factors are exponential, and the numbers in the exponents are large, being proportional to the total number of degrees of freedom. In this situation the probability distribution will be sharply peaked around a value of E at which the rates of change of these two factors balance each other: d ln (E) 1 = dE T
(8.2)
The width of the peak, E, grows only as the square root of the number of degrees of freedom. Thus the peak becomes relatively narrower and narrower, E/E → 0, as the volume of the system, V , grows. In the thermodynamic limit, V → ∞, the probability distribution is sharply peaked and is not significantly different from the microcanonical ensemble.
8.1 The thermodynamic background
185
Physically, the canonical ensemble describes a system in a situation in which it is a small part of a huge system (which is called the “thermodynamic reservoir”), which in turn is isolated. The total energy of the huge system is constant, but its energy can be distributed between our subsystem and the rest. The probability of our subsystem having energy E will then be a product of the number of states (E) and the number of states of the rest of the huge system res (E tot −E). Since the energy of our system is a tiny fraction of the total energy of the reservoir, E E tot , we can expand ln res (E tot − E) in powers of E and retain only the linear term. Thus we arrive at the distribution Eq. (8.1) with the temperature now dictated by the reservoir: 1 d ln res (E tot ) = T dE tot
(8.3)
The function of energy ln (E) ≡ S(E) is the entropy of the system. We see from Eq. (8.2) that this function determines the relation between the energy of the system and the temperature which is required so that the mean (peak) energy of the canonical ensemble is equal to this energy, 1 dS = T dE
(8.4)
Since the derivative d ln res (E tot )/dE tot = dSres (E tot )/dE tot = 1/Tres defines the temperature of the reservoir, Eq. (8.3) indicates that the temperatures of the reservoir and of our system are equal – a familiar consequence of thermal equilibrium. 8.1.2 The partition function and Lagrangian Typically, calculations are much easier using the canonical ensemble rather than the microcanonical ensemble. In particular, the partition function Z≡ e−E/T (8.5) all states
can be rewritten in terms of a path integral. The path-integral formulation makes it easier to set up perturbative expansions, on the one hand, and lattice discretizations and computer simulations, on the other. The relation to the path integral arises if we write the partition function in the following way: Z= α|e−H/T |α (8.6) α
where α labels states and, in principle, is a collection of values for all quantum numbers, while H is the Hamiltonian. In each term in the sum we recognize the
186
High temperatures and chemical potentials
amplitude of a quantum transition with the following peculiarities: (a) the time t during which this transition is allowed to take place is imaginary; 1/T = i t, and (b) the final and the initial states are the same. Applying standard textbook techniques to rewrite these amplitudes as path integrals, we arrive at 1/T Z= (8.7) Dα exp − dτ LE α(0)=α(1/T )
0
where the integral in the exponent is over the imaginary time τ and the Euclidean Lagrangian LE is obtained from the Lagrangian after substitution of time by −iτ , i.e. L(t → −iτ ) = −LE (τ ).
(8.8)
The minus sign in Eq. (8.8) makes the typical kinetic-energy term (dq/dt)2 transform into a positive term (dq/dτ )2 in the Euclidean Lagrangian, protecting the path integral from being afflicted by swarms of wiggly trajectories. The boundary conditions on the trajectories in Eq. (8.7) reflect the condition that the initial and final states are the same. A significant advantage of the path-integral representation (8.7) is that the variables of integration may be commuting numbers, so these would be ordinary integrals. Of course there are infinitely many integration variables, so a reduction to a finite number of degrees of freedom, through a lattice’s real-space cutoff, for example, is needed before the problem is ready for numerical methods. Another advantage is that it uses the Lagrangian rather than the Hamiltonian, which is often more convenient and powerful, as in the case of gauge theories. In addition, we have seen in earlier chapters that, even in the case of fermion systems in which anti-commuting variables are unavoidable, the path-integral approach is useful and practical for numerical methods. 8.1.3 Conserved charge and chemical potential So far we have focused on a thermodynamic description that uses the energy as the only parameter that describes the thermodynamic state of the system. There are, however, other global quantities that are of interest to us, which we can and must use to define ensembles. Imagine that we want to describe a volume filled with gas. If we specify only the total energy contained in this volume, the system can achieve such a state by having either many particles with less energy per particle, or fewer but more energetic particles. Experimentally we can distinguish such situations easily (cold versus hot, dense versus dilute), so we would like to have a global parameter that can control this “dimension” in the space of the thermodynamic states the system may occupy. In our example the total number of particles is such a quantity. In this way we can construct an ensemble in which the total number of particles is fixed. Similarly to our discussion of the
8.1 The thermodynamic background
187
distribution of energies, we will have the freedom to choose the distribution of the particle number, which again will not affect the thermodynamic properties of a macroscopic system. In nonrelativistic applications of thermodynamics, it is common to use the total number of particles to label an ensemble. However, in relativistic theories for environments in which particle–antiparticle creation is a common process, the number of particles need not be a quantity that we could fix by isolating the system – this number will still fluctuate due to pair creation and annihilation. For example, in a nonrelativistic plasma the number of electrons is practically constant. This is not true if the plasma is so hot that electron–positron pairs can be created. Even in the relativistic case, however, a conserved charge, such as the number of particles minus the number of antiparticles, can be defined. It does not fluctuate in a closed system and we can always use such a charge to define an ensemble of states with a given value of charge. Of course, in the nonrelativistic limit the charge simply counts the number of particles. Following our logic in Section 8.1.1 closely, we can either consider the total charge, N , as fixed, or allow states with all values of the charge into the ensemble. This case is called the grand canonical ensemble. Here we weight the states according to their charge N with an exponential factor, eµN /T . The quantity µ, which has the dimension of energy, is the chemical potential of the charge N . This nomenclature can be understood if we consider the chemical-potential factor together with the energy factor: E − µN −E/T µN /T e (8.9) ×e = exp − T One can think that the chemical potential shifts the energy of a given state by an amount proportional to the charge of the state. If the charge is electric, µ plays the same role as electrostatic potential −φ. Positive chemical potential µ favors states with larger positive total charge N . As in the case of the canonical versus microcanonical ensemble, for which the distribution of the energy was sharply peaked, the distribution of charge is also sharply peaked in a grand canonical ensemble. The combined energy and charge distribution is given by P(E, N ) ∼ (E, N )e−E/T eµN /T
(8.10)
The number (or more properly, the density) of states at a given energy and charge (E, N ) varies exponentially with changing E or N . CPT invariance tells us that S and can depend only on the absolute value of N . Since we expect that the mean (i.e. peak) value of the charge is zero in a system without a chemical potential, the number of states (E, N ) must have a peak at N = 0 and decrease at large N . The sharp peak in P(E, N ) appears when the variation of with E
188
High temperatures and chemical potentials
and N is balanced by the exponential factors in Eq. (8.10): 1 ∂S = T ∂E
and
µ ∂S =− T ∂N
(8.11)
where S = S(E, N ) = ln (E, N ). As we have just discussed, S decreases with growing |N |. Thus the chemical potential has the same sign as the average value of N . We can think of the chemical potential as a parameter that controls the total charge in the system through Eq. (8.11). The grand canonical ensemble corresponds to a situation in which our system can exchange charge, and energy, with the reservoir. The chemical potential and the temperature of our system are equal to those in the reservoir. 8.1.4 The grand canonical partition function and Lagrangian The grand canononical partition function reads Z= e−E/T eµN /T
(8.12)
all states
This can also be written as Z= α|e−H/T eµN /T |α = tr e−(H −µN )/T
(8.13)
α
where N in this equation is the charge operator. Note that conservation of charge, i.e. commutativity of N with H , saves us from concerns about operator ordering in the exponent. We can again use the correspondence between the amplitudes in Eq. (8.13) and Euclidean path integrals to derive the expression for Z in terms of the Lagrangian LE . How does µ enter? We can answer this question without performing an explicit calculation. As we have already noted, the chemical potential µ plays the same role as an electrostatic potential would have played were N just an electric charge. In addition, we know how the electric charge couples to the electrostatic potential in the Lagrangian. To keep the discussion generic, let us just say that the particles carrying the charge must be described by a complex field, which we shall call # (either fermion or boson fields). The conservation of the charge in the Lagrangian description corresponds to a symmetry (by Noether’s theorem). This (gauge) symmetry is the phase rotation of the complex field # carrying that charge. The coupling of the chemical (or electrostatic) potential to such a charge in the Lagrangian is obtained by extending the time derivative of the field #, ∂ ∂ #→ + iµ # (8.14) ∂t ∂t
8.1 The thermodynamic background
189
Another intuitively helpful (more mnemonic than rigorous) way to look at it is to note that the replacement # → #eiµt
(8.15)
produces the same effect as Eq. (8.14), since, due to the invariance of the Lagrangian with respect to global (spacetime-independent) phase rotations of #, the only effect of a time-dependent phase rotation is through the time derivatives. One can then consider the effect of µ as incrementing the frequency of the phase rotation of every state by an amount proportional to the number of # quanta in it. This is equivalent to decreasing the energy of the state, according to the Schr¨odinger picture in which |α(t) = e−iEt |α(0). This is exactly what we want to achieve by introducing µ.1 An inquisitive reader might wonder whether the chemical potential could be completely eliminated from the Lagrangian in this way. If so, where does the µ dependence in the partition function in the Lagrangian path-integral formulation come from? Yes, the dependence on µ can be completely eliminated from the Lagrangian. Insofar as processes in the vacuum are concerned, this has to be so, since the origin of the energy scale can be chosen arbitrarily. Because the charge is conserved, the system will never know that the energy changes if the charge is changed. However, in the grand canonical ensemble the relative probability weights of different charge states composing the ensemble depend on µ and the chemical potential is observable. Correspondingly, the dependence on µ, although it can be eliminated from the Euclidean Lagrangian, is transferred to the boundary conditions on the fields in Eq. (8.7). The substitution (8.15), which in Euclidean time is # → #eµτ
(8.16)
has to be accompanied by a change in the boundary conditions on the fields in the path integral (8.7), since #(1/T ) → #(1/T )eµ/T . The latter product should be equal to #(0) in the case of bosons, and −#(0) in the case of fermions, as we shall discuss in more detail below.
1 Another intuitive argument. Consider an infinitesimal variation δµ of µ. According to
Eqs. (8.14) and (8.15), this is equivalent to an infinitesimal phase change of the field #: = δµ t. The variation of the Lagrangian, to linear order in and up to total derivatives, should be equal to j µ ∂µ , where j µ is the Noether current, conservation of which is a consequence of the invariance with respect to the phase of # (we are repeating a step from a proof of Noether’s theorem). Thus δ L = j 0 δ µ. The infinitesimal variation of the Hamiltonian should be equal, up to a sign, to the variation of the Lagrangian and, as we must expect, it is proportional to the density of the charge j 0 .
190
High temperatures and chemical potentials 8.1.5 Derivatives of the partition function
The partition function Z (T, µ) serves as a generating function for all other thermodynamic quantities. Consider some familiar examples. The logarithmic derivatives of Eq. (8.12) with respect to T and µ give the average energy and charge: E − µN = −T 2
∂ ln Z ; ∂T
N = T
∂ ln Z ∂µ
(8.17)
It is convenient to introduce the following thermodynamic potential: '(T, µ) = −T ln Z For a macroscopic system we can write ' in the form S E/T µN /T ' = −T ln dE dN e e e = −T S + E − µN
(8.18)
(8.19)
since the integrand is sharply peaked. Therefore the entropy can be expressed in terms of ' (and, therefore, in terms of the generating functional Z ), ∂' (8.20) ∂T A simple way to obtain this relation is to differentiate Eq. (8.19) with respect to T and note that the implicit dependences coming through E and N cancel out because E and N are extremal, i.e. Eqs. (8.11) hold. At zero temperature, ' = E −µN is the total energy, corrected by the chemical potential, of the ground state at given µ. The ground state is the state with the minimal value of E − µN . In general, if ' depends on an extra parameter (e.g. an order parameter characterizing the structure, or the phase of the system), the value of this parameter in equilibrium will minimize ' (i.e. the partition function is maximized). The derivative of −' with respect to the volume produces the pressure S=−
∂' (8.21) ∂V For a macroscopic, translation-invariant system, ' is proportional to the volume V . Therefore, ∂'/∂ V = '/V , and Eq. (8.21) implies that p=−
' = − pV
(8.22)
Equation (8.19) also implies that the energy of the system is E = T S + µN − pV
(8.23)
which shows that the energy of a system can be increased by supplying heat (with N and V fixed), by adding particles (with S and V fixed), or by squeezing it (with S and N fixed).
8.1 The thermodynamic background
191
8.1.6 An example: fermions Consider a system of charged, massive noninteracting particles of spin 12 , obeying Fermi statistics. The spectrum of such a system is easy to understand by looking at each momentum state separately: each momentum state may be occupied orunoccupied. The difference in the energies between these alternatives is ω p = p2 + m 2 . In addition, there is also a twofold degeneracy due to spin: each momentum state p can be occupied by a spin-“up” or “down” fermion. Finally, since the particle is charged, there are antiparticle states with the same momentum. The partition function can be organized in a convenient form. First sum over the states of particles with spin up, and then take the fourth power to account for the same contribution from spin-down particle states and spin-up/down antiparticle states: 4 Z (T ) = 1 + e−ω p /T (8.24) p
The chemical potential simply shifts ω p by −µ for particles and by +µ for antiparticles: 2 2 1 + e−(ω p −µ)/T 1 + e−(ω p +µ)/T (8.25) Z (T, µ) = p
The free energy and its derivatives can now be evaluated. For example, 1 ∂' 1 (8.26) N =− =2 − ∂µ e(ω p −µ)/T + 1 e(ω p +µ)/T + 1 p has a simple meaning. The first term in the large parentheses is the average number of particles with momentum p (the occupation number of this mode), and the second is the average number of antiparticles. The factor of 2 accounts for spin. Note that the occupation numbers decrease exponentially with growing ω p , because of the penalty imposed by the Gibbs factor e−E/T . When the occupation numbers are small, quantum statistics has no effect, and we shall see below that the Bose particle occupation numbers are the same in this regime. When µ = 0 the average charge N is zero at any temperature. When the temperature is low compared with µ, the occupation number has the profile of a step function when it is plotted against ω p , as shown in Fig. 8.1. This means that modes with ω < µ are almost always occupied, whereas the modes with ω > µ are almost always empty. At zero temperature, there are no fluctuations, and the ground state of the system at finite µ is very simple: all momentum states within the Fermi surface, ω p < µ, are filled with particles and all momentum states outside the Fermi surface are empty. In our isotropic
192
High temperatures and chemical potentials np 1
0
pF
|p|
Figure 8.1. The occupation number n p of momentum modes (the expression in large parentheses in Eq. (8.26)) of a very cold Fermi gas, T µ. The width of the crossover region from 1 to 0 is of order T /vF (vF is the velocity of fermions on the Fermi surface). example, the Fermi surface is a sphere with radius (Fermi momentum) pF = 2 2 µ − m . There are no antiparticles in this ground state. In the infinite-volume limit the sum over the momentum modes becomes an integral over phase space: d3 p →V (8.27) (2π )3 p where the factor of V /(2π )3 is the density of momentum modes per unit cell in momentum space. At zero or very small T , the density of the charge is therefore proportional to the volume of the Fermi sphere: N 4π pF3 8π 2 (8.28) =2 = (µ − m 2 )3/2 V 3 3 Note that, for µ < m, the density of the charge n is zero. There is no Fermi sphere in this case. This is a reflection of a generic property of the partition function at zero T which states that the ground state of the system is the state having the minimal value of the potential ' = E −µN . This state exponentially dominates the sum defining the partition function in the T → 0 limit. At µ = 0 it is the state of lowest energy, the vacuum. At finite µ the ground state cannot change unless another state, with a finite value of charge N can compete for the lowest E − µN . The ground state will not change unless µ exceeds a value, µ0 , which is the minimum energy per unit charge among all charged states: E (8.29) µ0 = min N n≡
In our example, µ0 = m, and the state which competes with the vacuum and becomes the ground state is the one with a single fermion at rest: E/N = m/1. In fact, at µ = m there are two such states due to spin. At higher µ, the degeneracy of states grows rapidly due to the isotropy of momentum space in this
8.1 The thermodynamic background
193
example. Adding or removing a fermion on the Fermi sphere | p| = pF does not cost E − µN . This degeneracy is crucial to the phenomenon of superconductivity. Basic quantum mechanics tells us that even an arbitrarily small perturbation, which has matrix elements between two degenerate states, will have a significant (nonperturbative) effect due to mixing of these two states. In our case, an arbitrarily small attractive interaction will cause a nonperturbative rearrangement of the ground state. The ground state will become a coherent superposition of states with an arbitrary number of fermion pairs of opposite momenta | p| = pF , Cooper pairs, added to (or subtracted from) the filled Fermi sphere. This coherent state, the Cooper-pair condensate, is responsible for superconductivity according to the BCS theory. We shall discuss how this phenomenon is realized explicitly in the context of QCD in Section 9.1. The path-integral (Lagrangian) formulation We can also calculate the partition function using the path-integral formulation. For the free case it is somewhat more difficult than the combinatoric method, but it is indispensible once interactions become important. It allows us to develop perturbation theory and to set up lattice formulations and simulations. Our free massive fermion should be described by the Dirac Lagrangian, and, including the chemical potential using Eq. (8.14), we have µ ¯ L = −#(iγ ∂µ − γ 0 µ − m)#
(8.30)
¯ = # † γ 0 . Note that the term where # is a four-component Dirac spinor, and # ¯ 0 # is the density of the charge multiplied by the chemical potential, as it µ#γ should be. The next step is to obtain the Euclidean Lagrangian by substituting t → −iτ in Eq. (8.30): : ¯ γµE ∂µ − γ 0 µ − m # LE = −L:t→−iτ = # (8.31) where we have introduced the notation γµE for the “Euclidean” Dirac gammamatrices: γ0E = γ 0 ,
γkE = iγ k
(8.32)
All four matrices γµE are now Hermitian and their anti-commutation relations are also Euclidean-space covariant, {γµ , γν } = δµν . In the following we shall very often omit the index E from the Euclidean matrices, if it is clear from the context that we are working in Euclidean space. Now we can write the path integral: Z (T, µ) =
D# exp − 0
1/T
dτ
¯ γµE ∂µ − γ 0 µ − m # d x # 3
(8.33)
194
High temperatures and chemical potentials
The short-hand notation D# hides two important properties of this integral. First, the components of the spinor # are not complex numbers, but are Grassmann numbers (numbers that anti-commute with each other). We shall see that one can always reduce the integral over Grassmann variables to an ordinary Gaussian integral. This will rely on the fact that, even in the interacting case, the Lagrangian is bilinear in fermion fields, or can be reduced to a bilinear by introducing auxiliary bosonic fields. All we need to know about the integrals over Grassmann numbers is that the Gaussian integral is equal to the determinant of the matrix of the bilinear form appearing in the exponent. For our integral in Eq. (8.33) we obtain the determinant of the Dirac operator: Z (T, µ) = det γµE ∂µ − γ 0 µ − m (8.34) The second property that our notation in Eq. (8.33) hides is that the boundary conditions on the Grassmann field # are anti-periodic, #(1/T ) = −#(0)
(8.35)
which replaces the periodic boundary conditions for bosonic fields. We can think of this property as a rule that is required in order to obtain the correct partition function for fermions. It correctly reproduces the effects of Fermi statistics. The integral in Eq. (8.33) is a functional integral and the determinant in Eq. (8.34) is a functional determinant. These quantities are not properly defined until they are regularized and their ultraviolet infinities have been removed. For example, lattice discretization would make the Dirac operator a finite dimensional matrix, whose eigenvalues can be computed by a variety of methods, some numerical. In the free case we can calculate this determinant analytically. To start, we need to diagonalize the Dirac operator, which can be done in the free case using a Fourier transform. The eigenvalues of a linear combination of Dirac matri√ ces are also easy to find: aµ γµE = ± aµ aµ , i.e. two positive and two negative eigenvalues. Thus we find [( p0 − iµ)2 + p2 + m 2 ]2 (8.36) Z= p
p0
The values of p are continuous in the V → ∞ limit. However, at finite T , the values of p0 are discrete: p0 = ±2π T n 1/2 , where n 1/2 = 12 , 32 , . . ., due to the boundary conditions (8.35). These are called Matsubara frequencies. We can take the product over p0 . Let us write ( p0 − iµ + iω p )2 ( p0 − iµ − iω p )2 (8.37) Z= p
p0
8.1 The thermodynamic background where ω2p = p2 + m 2 . We can now use the formula2 ia 1+ = cosh(πa) n 1/2 ±n 1/2 to obtain Z=
2 2 e2ω p /T 1 + e−(ω p −µ)/T 1 + e−(ω p +µ)/T
195
(8.38)
(8.39)
p
up to a constant (infinite) factor. The first exponent on the right-hand side is the contribution of the zero-point energy of each mode, equal to −ω p /2 , the analog of the zero-point energy for bosons, but with the opposite sign. It is the contribution of the filled Dirac sea, which can be removed by the shift of the origin of the energy. As expected, the final result agrees with Eq. (8.25). A cold nonideal Fermi gas The chemical potential has some interesting effects on the ground state of the interacting Fermi gas. We briefly discuss two: restoration of chiral symmetry and BCS superconductivity. We shall try to keep our discussion here as generic as possible. Consider a cold Fermi gas with µ > m. Consider what would happen if the mass of the fermion were an adjustable parameter. How much would be gained in the value of the potential E − µN by changing the mass? d3 p µ4 m 'µ (m) = E − µN = 2 (8.40) (ω p − µ) = 2 f 3 π µ | p|< pF (2π ) where the dependence on m is in the dimensionless function √1−x 2 f (x) = t 2 ( t 2 − x 2 − 1) dt,
(8.41)
0
The integral can be computed easily, but for our discussion we need only some of its qualitative features, which can be seen in Fig. 8.2. Decreasing the mass is advantageous because itdecreases the potential 'µ . Qualitatively, the radius of the Fermi sphere pF = µ2 − m 2 is larger for smaller m and each extra fermion contributes negatively to E − µN . 2 A way to derive this formula is to take logarithmic derivatives with respect to a in Eq. (8.38)
and then consider the sum rule for the residues of the following function: 1 tan(π x) = 0 2 2 res x + a
196
High temperatures and chemical potentials f (x) 0
1
x = m/µ
Figure 8.2. The dependence of the potential 'µ = E − µN on the mass at fixed µ: 'µ = (µ4 /π 2 ) f (m/µ). In fact, to the extent that we understand the origin of fermion masses in our world, the mass is always a dynamical quantity. For example, in the electroweak theory, the mass of the electron is proportional to the expectation value of a dynamical field, the Higgs field. The masses of the baryons in QCD are also of dynamical origin. In other words, chiral-symmetry breaking, which is a necessary companion of fermion-mass generation, occurs via some dynamical mechanism. In the vacuum the value of the mass is a minumum of a certain effective potential. The curvature of this potential depends on the scale of the physics responsible for mass generation. As an example, we can consider a Yukawa theory with small coupling y and an expectation value of the scalar field (the analog of the Higgs) given by σ . The mass of the fermion is then given by m = yσ
(8.42)
The effective potential for the mass of the fermion, 'm (m), is then equal to the effective potential for the Higgs/sigma field 'σ (σ ). Thus the curvature of 'm (m) is proportional to the mass of the Higgs/sigma field: ∂ 2 'σ (σ ) m 2σ d2 'm (m) = = dm 2 y 2 ∂σ 2 y2
(8.43)
As we have seen above, the chemical potential µ > m puts “pressure” on the mass to decrease since that decreases 'µ . This “pressure” is countered by the dynamical mechanism of mass generation, since it costs energy to shift the mass away from the minimum of 'm . The balance is achieved at a smaller value of m, which decreases with growing µ. This new value is given by the minimum of 'm (m) + 'µ (m) as shown in Fig. 8.3. The size of this effect depends on the curvature of 'm . For example, the mass of the electron, given by the Higgs mechanism at a scale of order 100 GeV, will not be moved by any chemical potentials we could hope to achieve. For our practical purposes, we can think of this mass as a constant. However, the mass of a baryon is generated by a mechanism at a scale of the order of 1 GeV, and such chemical potentials are achievable, if not in a laboratory, then in the cores of neutron stars.
8.1 The thermodynamic background
197
Ω µ=0
m0
0
m
µ1 > m0
µ2 > µ1
Figure 8.3. The potential ' = 'm + 'µ for the dynamically generated fermion mass for various values of µ. The vacuum value of the mass is m 0 . At µ = µ2 the mass vanishes. If µ is sufficiently large, the mass will eventually reach zero, as shown in Fig. 8.3. Chiral symmetry will then be restored. This can happen due to either a second-order (smooth) or first-order (abrupt) transition. The order of the transition depends on the shape of the potential 'm . Typically, such a potential has the shape of a double well. If the (negative) curvature of 'm at the origin is not large, the potential 'm + 'µ may develop a second minimum at the origin, which may become lower than the minimum at m > 0, causing a first-order transition to occur. We can estimate the critical value of µ in the regime where the Yukawa coupling y is very small. In this regime the transition is of second order since the curvature of 'm at the origin is large, proportional to the curvature in the minimum (8.43). The transition occurs when the curvature of 'µ at m = 0 overcomes the negative curvature of the potential 'm at the origin. The latter is proportional to µ2 , while the former is of order m 2σ /y 2 . Thus µc ∼ m σ /y
(8.44)
Although this formula cannot be applied for moderate values of y, we can see, on a qualitative level, that, for a theory with y ∼ 1, we might expect a transition at a value of µ of the order of the mass-generation scale m σ . In QCD, this scale is of the order of 1 GeV. The transition leads to the restoration of chiral symmetry. It happens, however, in a region of parameters where no controllable calculational
198
High temperatures and chemical potentials
methods exist (even present numerical-simulation methods fail in the case of QCD at nonzero µ). Another interesting phenomenon in a cold nonideal Fermi gas is Cooper pairing and condensation leading to superconductivity. It occurs if there is an arbitrarily small attraction between two fermions on the Fermi surface. We shall discuss this phenomenon in considerable detail in the context of QCD in Section 9.1. 8.1.7 An example: bosons Consider now a system of free charged massive particles of spin 0, obeying Bose statistics. The spectrum is easy to understand, looking at each momentum mode separately. There could be any number n p of bosons in any given mode, contributing n p ω p to the energy. In other words, each mode behaves as a harmonic oscillator with frequency ω p (we throw away the ω p /2 contribution to the vacuum energy, which changes the partition function by a factor that is irrelevant here). The same states are available to the antiparticles. The partition function factors into a product of partition functions for harmonic oscillators, 2 1 Z (T ) = (8.45) 1 − e−ω p /T p A chemical potential shifts the frequency of each oscillator by −µ for particles and by µ for antiparticles, 1 1 (8.46) Z (T, µ) = −(ω p −µ)/T −(ω p +µ)/T 1−e 1−e p Now we can calculate various derivatives. In particular, 1 ∂' 1 N =− − = ∂µ e(ω p −µ)/T − 1 e(ω p +µ)/T − 1 p
(8.47)
The total charge is the difference between the number of particles and the number of antiparticles. The mean occupation numbers of high-frequency modes vanish exponentially when ω p T . As we expected, there is no difference between fermion and boson occupation numbers at large ω p /T . Of course, N = 0 when µ = 0. Unlike in the free-fermion case, there is a problem when |µ| > m. This can be seen already in the partition function. The sum of probability weights e−n(ω−µ)/T over n for modes with ω < µ fails to converge. Thus the value of µ for the free gas must be restricted to |µ| < m.3 3 In nonrelativistic applications the rest energy of a particle is subtracted from the chemical po-
tential, µnr = µ − m. For a free nonrelativistic ideal Bose gas µnr < 0.
8.1 The thermodynamic background np
199
np
1 µ<m
µ
=m
1 0
|p|
0
|p|
Figure 8.4. Occupation numbers of momentum modes in a cold ideal Bose gas without (µ < m) and with (µ = m) Bose condensation. The occupation of the mode p = 0 in the latter case is given by N − N T , as discussed in the text. It is interesting to compare the free Bose gas at zero T with the free-fermion gas. Similarly to the case of fermions, as a consequence of a generic argument, no change in the ground state can occur unless µ (or −µ) exceeds m. However, µ cannot be taken larger than m for the free Bose gas, as we have just noted. So how do we describe a state of a fixed number N of free Bose particles? The character of this state at T = 0 is not difficult to understand. Whereas fermions would fill all available momentum states from zero to the Fermi surface in order to minimize the energy at fixed N , all N bosons would simply occupy the single lowest state of p = 0. Consider now Eq. (8.47) at finite T . The value of N in Eq. (8.47) increases as µ increases toward its limiting value m. On transforming the sum over p into the momentum-space integral (8.27), we can calculate this sum in the limit V → ∞ and find that it reaches a finite T -dependent value as µ → m. For small temperatures N T = constant × V (T m)3/2 . Thus N T = 0 at T = 0. This means that the integral does not account for the particles in the ground state. What if we consider an ensemble with fixed N (a canonical ensemble)? As long as N < N T we can find the corresponding µ, which could reproduce this given value of N in the grand canonical ensemble. At N = N T , the value of µ is m. Further increase of N proceeds by putting those extra N − N T bosons into the p = 0 state. This process is called Bose condensation. It occurs at fixed T , if the density n = N /V exceeds the critical density n T = N T /V ; or at fixed density, if the temperature is low enough that n T < n. The occupation numbers in a cold Bose gas are illustrated in Fig. 8.4. When Bose condensation occurs, the occupation number of the p = 0 mode becomes macroscopic. We can consider this mode as an oscillator with frequency ω p=0 = m. Since the field φ is complex, the oscillator is twodimensional with O(2) symmetry. Its energy N m is so much bigger than the
200
High temperatures and chemical potentials
level spacing m that its motion is classical. It performs a rotation with frequency m. The amplitude of this oscillation, φ0 , or rather, the radius of the rotation, can easily be found by comparing the classical expression for the energy, V m 2 |φ0 |2 , to its quantum value, N m: 3 3 N n |φ0 | = = (8.48) Vm m The path integral The path integral for the partition function of the free Bose gas can be obtained by rather straightforwardly following the steps in Section 8.1.4. The Minkowskispace Lagrangian4 L = φ ∗ [−(∂0 + iµ)2 + ∂ 2 − m 2 ]φ (8.49) leads to the Euclidean Lagrangian, LE = φ ∗ [−(∂τ − µ)2 − ∂ 2 + m 2 ]φ The partition function is given by the path integral 1/T 3 dτ d x LE Z (T, µ) = Dφ exp −
(8.50)
(8.51)
0
The boundary conditions on the field configurations in the integral are periodic: φ(1/T ) = φ(0)
(8.52)
The field φ is a complex number at every point in space and the integrals are taken over its real and imaginary parts (two integrals per point in space). Since the path integral in Eq. (8.51) is Gaussian, it can be evaluated easily, Z (T, µ) = det−1 − (∂τ − µ)2 − ∂ 2 + m 2 (8.53) The power of the determinant is twice the usual inverse square root obtained from a Gaussian integration because φ is complex. The determinant can be calculated in the same way as the determinant for the fermions. The Matsubara frequencies are equal to p0 = 2π T n, where n is now an integer. A helpful trick is to realize that shifting µ by the value iπ T is equivalent to the needed 2π T × ( 12 ) shift of the Matsubara frequences; see Eq. (8.37). Thus we can just pick the final result for the fermions (8.39) and adjust µ, as well as the total power (by a factor of − 12 ), −1 −1 1 − e−(ω p +µ)/T Z= e−ω p /T 1 − e−(ω p −µ)/T (8.54) p
4 We added a total derivative to the more common form of the Lagrangian: |(∂ + iµ)φ|2 − 0 2 − m2. |∂φ|
8.1 The thermodynamic background
201
which agrees with Eq. (8.46), up to the contribution of the zero-point energies, which were not included in Eq. (8.46). Note that at µ = m, the constant Fourier mode of the field φ has zero curvature in the Lagrangian (8.50). The integral diverges if µ = m or greater. At µ = m we can think of the φ = constant mode as completely unconstrained by the Lagrangian (8.50). This reflects the phenomenon of Bose condensation discussed above. A cold nonideal Bose gas Consider the effect of a small interaction on the ground state (T = 0) of the Bose gas. If the interaction is attractive, the system of a macroscopic number of bosons will collapse, i.e. the density will increase, until some repulsive interaction intervenes. A more interesting case is a weakly repulsive Bose gas. Unlike for the ideal Bose gas, it is now possible to increase µ above m. The number N (or the density) of the particles in the p = 0 mode will now be controlled by the repulsion. It is easy to derive the leading term in the dependence of N on µ − m. Consider the interaction term −λ|φ|4 in the Lagrangian density, where small and positive λ provides weak (hard core) repulsion. Set µ = m. Without interaction, any occupation number N of the p = 0 mode will give the same E − µN . There is a degeneracy of the ground state at µ = m, which, as in the case of fermions, invalidates naive perturbation theory. With the repulsive interaction present, we can calculate the first correction to the energy of the ground state: E I = V λ|φ0 |4 , where φ0 is the amplitude of the p = 0 mode of the field. As we found above in √ Eq. (8.48), |φ0 | = N /(V m). Thus we find that the energy now depends on N in the following way: E = m N + EI = m N +
λ N2 V m2
(8.55)
Setting the chemical potential to a value µ > m can balance the energy cost at a given N if we choose µ to satisfy ∂ λ ∂ 2 mN + N − µN = 0 (8.56) (E − µN ) = ∂N ∂N V m2 Thus we obtain the relation between N and µ valid for small densities and shown in Fig. 8.5: m2 N = (µ − m), V 2λ
µ>m
(8.57)
The density grows linearly with the chemical potential. The limit λ = 0 is singular. The density blows up if µ exceeds m unless λ > 0.
202
High temperatures and chemical potentials n(µ)
m
µ
Figure 8.5. The density n = N /V of a cold Bose gas with weak repulsion. In contrast to the case of fermions, an interaction is required in order to prevent collapse of the Bose gas for µ > m. In the case of fermions this role is played by the exclusion principle, without any interaction, as seen in Eq. (8.28). 8.2 Hadron phenomenology and simple models of the transition to the quark–gluon plasma It is interesting to make a very simple model of the high-temperature transition to the quark–gluon plasma based on just the composite character of the colorless hadronic states of QCD and counting degrees of freedom [51]. This discussion is probably more useful and reliable at high temperature than at high chemical potential, but we shall apply it equally over the whole phase diagram in the T –µ plane. We will ignore subtleties expected in the physics at low T and high µ just for the sake of orientation. The fact that the µ axis of the phase diagram might be a wealth of physics in phase transition, superconductivity, and exotic states will be addressed below. For the moment just imagine that the world of the T –µ plane is relatively simple (and maybe relatively boring) and that there is a single transition line separating hadronic matter from the quark–gluon plasma, as shown in Fig. 8.6. We shall see that including just knowledge of the number of relevant degrees of freedom in each phase as well as some phenomenology about the formation of hadronic bound states in the low-T and low-µ phase predicts a first-order phase-transition line with very substantial discontinuities. An underlying assumption in this model is that interactions are weak in an environment of high temperature and/or high chemical potential. Asymptotic freedom suggests this point. For example, at high T , average momentum transfers will be of the order of T , so the relevant running coupling should be characterized by the energy scale T and should be logarithmically small. Similarly, at high chemical potential, the quark Fermi surfaces are high, so µ should characterize their momentum transfers and interactions should again be logarithmically weak. Under closer inspection, we will find limitations in both of these arguments. For example, in an environment of high µ, quarks can engage in
8.2 Hadron phenomenology
203
T Tc
quark–gluon plasma
hadronic matter
µc
µB
Figure 8.6. The phase diagram of the simple two-component model of QCD.
glancing collisions and these need not involve high momentum transfers. Since sharp Fermi surfaces are unstable against weak, residual interactions, we expect nonperturbative physics to occur even if the coupling is small. Perturbation theory and free-field behavior will give poor estimates and qualitatively wrong physical pictures in these cases. Some limitations on the applicability of perturbative estimates can be read off expressions for the perturbative quark and gluon masses. For example, the perturbative gluon mass is g 2 µ2 Nf g 2 T 2 2 m g = Nf + Nc + (8.58) 6π 2 2 9 where Nc is the relevant number of colors and Nf is the relevant number of flavors. Certainly perturbation theory is not reliable once m g ∼ T in an environment of vanishing µ. However, this occurs when αs ∼ 0.18, a tiny value. We learn that pertubative estimates will be of very limited use at high T . In contrast, at nonvanishing µ and T = 0, we find m g ∼ µ when αs ∼ 2.40. This suggests that perturbative estimates are much more valuable in dense quark matter. Of course, this argument overlooks the special kinematics of fermions in the vicinity of a spherical Fermi surface and it ignores nonstatic effects that are unaffected by the Debye screening included in Eq. (8.58). These failures will prove crucial in our discussions of QCD in dense environments. With these caveats, let’s jump into a simple two-component model. Armed with a broarder perspective, we hope that its failures will teach us as much as its successes.
204
High temperatures and chemical potentials
First consider the axis with µB = 0. At low temperatures, hadronic matter’s thermodynamics should be dominated by its lightest bosons, the pions. Let’s make the simplification that the pions are the most important degrees of freedom all the way to the transition line and that their interactions are not as significant as their kinetic energies. On the other side of the transition line, the relevant degrees of freedom are the quarks and gluons themselves in a high-temperature plasma state. We shall assume that the interactions are weak, following the suggestion of asymptotically free QCD, and can be ignored in the bulk thermodynamics of the system, at least for the purpose of ball-park estimates. With more sophistication, perhaps, we are assuming that the interactions, after suitable dressing of the quarks and gluons into quasi-particles, are just residual and weak. We will see from lattice simulations and other considerations that the quark–gluon plasma in the vicinity of the transition is not approximated well by free, essentially massless fields. The screening and damping mechanisms expected in a plasma are certainly at work in this one. Only at truly asymptotic temperatures, if at all, will we be able to approximate the system by free fundamental fields. With all these simplifications, the description of the transition will reduce to an exercise in counting. The energy density and pressure of the gas of massless pions are = 3·
π2 4 T , 30
P = 3·
π2 4 T 90
(8.59)
where the factor of three counts the three types of pions. The energy density and pressure of the gas of the quark–gluon plasma are estimated similarly, = 37 ·
π2 4 T + B, 30
P = 37 ·
π2 4 T −B 30
(8.60)
The number 37 counts the effective number of degrees of freedom including gluons and quarks. For gluons there are eight colors and two spin states, and for quarks there are three colors, two spins, two flavors, and particles and antiparticles. So we find 37 = 2 × 8 + ( 78 × 2 × 2 × 2 × 3). The quantity B, which adds to the energy density and subtracts from the pressure, accounts in an average fashion for interactions that are responsible for the change in the vacuum structure between the low- and high-temperature phases. The constant B was originally introduced in the context of the bag model of hadronic structure which, in one version, envisioned protons as consisting of three essentially massless, confined quarks in a bubble of chirally symmetric vacuum in an otherwise chirally asymmetric environment. For example, the energy of a bag that contains quarks reads E(R) =
4π 3 C R B+ 3 R
(8.61)
8.2 Hadron phenomenology P
205
quark–gluon plasma
pions
T
T4
4 c
−B
Figure 8.7. Pressure versus T in the simple two-component model of QCD. where the first term is the bulk energy that accounts for a bubble of radius R of symmetric vacuum and the second term records the energy of the confined, massless quarks in the bubble. The constant C depends on the number of quarks in the particular state of interest. If one considers the proton and minimizes the total energy with respect to R, one finds a proton radius of R0 ≈ 0.7 fm and a bag constant B ≈ 175 MeV fm−3 , or B 1/4 ≈ 192 MeV. In the equation above for the energy and pressure of the high-temperature vacuum, we now see that B adds to the energy, accounting for the presence of the chirally symmetric phase, and subtracts from the pressure. We can now compare the pressures in the two phases as a function of the temperature and decide which is the stable state. The relevant plot is shown in Fig. 8.7 and some simple algebra gives the critical temperature in terms of the bag constant, Tc =
45 17π 2
1/4 B 1/4 ≈ 0.72B 1/4
(8.62)
We learn that, for temperatures above Tc the quark–gluon plasma is favored thermodynamically over the pion gas because it has the larger pressure. The numbers work out rather well. For B 1/4 ≈ 200 MeV, we have Tc ≈ 150 MeV, which is near the numerical results coming from large-scale simulations of lattice QCD.
206
High temperatures and chemical potentials S quark–gluon plasma
Spl _ pions
Sh
_ 3 Tc
T
3
Figure 8.8. Entropy versus T in the simple two-component model. We see that the transition occurs in this simple model as a consequence of counting weakly interacting degrees of freedom – the quark–gluon plasma is favored at high T because there are so many more quarks and gluon species than there are pions. These are the lightest modes in each phase, and should be the most significant thermodynamically. This point is made especially well by considering the entropy, S = ∂ P/∂ T , as a function of T , as shown in Fig. 8.8. The bag constant B does not enter the entropy S, but just affects the estimate of Tc . The jump in the entropy at Tc is directly proportional to the change in the number of light degrees of freedom relevant to the transition at Tc . Next we want to generalize these considerations into the T –µB plane. This requires additional assumptions, which are almost certainly too naive. Anyway, let’s try to obtain a zeroth-order expectation for the phase boundary between the hadronic and quark–gluon-plasma phases throughout the whole T –µB plane. Begin by noticing that the total pressure at Tc is very small in the finite-T considerations above. To a good approximation the transition occurs when the kinetic pressures of the quarks and gluons match the bag constant B. If we take P = 0 as the condition for the transition, then our estimate for Tc changes from the value 0.72B 1/4 obtained above accounting for the pions in the hadronic phase to [90/(37π 2 )]1/4 B 1/4 ≈ 0.70B 1/4 . This “success” encourages us to try this simple criterion throughout the T –µB plane. The kinetic pressure of the quarks and gluons at nonzero T and µB reads 37 µ2 ) µ2 * (8.63) P(µB , T ) = π 2 T 4 + B T 2 + B2 90 9 9π Setting this quantity to B, P(µc , Tc ) = B, predicts a line of transitions with the reasonable shape of Fig. 8.6.
8.3 A tour of the T –µ phase diagram
207
This simple model certainly suggests that counting degrees of freedom plays an important role in the physics of the transition between hadronic matter and the quark–gluon plasma. However, it is not reliable for predicting the order of the transition . . . a first-order transition is almost impossible to avoid in this sort of scenario and we know that the order of the transition in real QCD depends sensitively on the quark content of the theory . . . and it certainly misses a great deal of physics on and near the µB axis. Nonetheless, it probably has some truth and utility away from the transition line, especially away from the µB axis, as long as one is asking just for estimates of some bulk thermodynamic quantities that are sensitive to counting degrees of freedom and not sensitive to small interaction effects. In the next section we will try to improve upon this foundation and later in the book we will become much more ambitious as we apply QCD and QCD-like models to the physics in the T –µ plane. 8.3 A tour of the T –µ phase diagram Before we begin our detailed study of QCD and related theories at large chemical potentials, let’s review the state of the art of the field’s expectations for the T –µ phase diagram. The phase diagram we have discussed so far, Fig. 8.6, is simply based on counting degrees of freedom and entropy, and ignores most of the interesting dynamics of QCD. Although counting and comparing degrees of freedom in adjacent phases is very illuminating, we know from other physical systems that these types of arguments miss phases and orders of transitions. It is a sobering thought that “simple” materials such as H2 O have a host of phases, which, in this case, dictate the very character of earth’s environment. The symmetry of the H2 O molecule and the tiny residual interactions between polar water molecules lead to the formation of many forms of ice, for example, which influence the earth’s weather patterns near its poles. It is a very challenging problem in condensedmatter physics to predict all the phases of ice, their symmetries, and the order of the transitions between them. The problem is simply too hard for our present mastery of computational quantum mechanics. From this perspective, the opinion that the phase diagram of QCD will be “solved” by field theorists any time soon might be too cavalier. A full understanding of the QCD phase diagram will be based on mastering small, residual interactions in QCD in exotic environments. The phase diagram, especially that at low temperatures, will be very rich, and it will be sorted out only through the interplay of theory and experiment. The “experiments” will be those of heavy-ion colliders and physics based on time scales of order 10−23 s, as well as observations of distant compact stars over periods of millions of years.
208
High temperatures and chemical potentials
The study of the phases of QCD at moderate µ and small T might lead to a new field: the condensed matter or even chemistry of QCD. With these “warnings” in mind, let’s jump right in! Our purpose in this section will be to assemble available knowledge, both theoretical and experimental, about QCD and apply it to the construction of the phase diagram in the T –µ plane. Knowledge of the underlying QCD degrees of freedom allowed us, in the previous section, to make entropy estimates and to draw a rough sketch of the phase diagram based on them. The natural next step is to use the information about the symmetries of QCD, in particular, chiral symmetry, to refine this picture. Many studies of the QCD phase diagram have concentrated on modeling the properties of the chiral phase transition (see, e.g., [52–54]). In this section, we shall discuss the properties of this transition in a more model-independent way. We shall see that chiral symmetry plays a crucial role. We shall also include in our analysis the effects from other phase transitions, such as the nuclear-matter liquid–gas transition, our knowledge of which comes mostly from experiment. The aim of our analysis is to transform this knowledge into the determination of a phase diagram for QCD in the T –µ plane. Such an analysis is especially important as an extension of Monte Carlo studies, given the technical problems that these encounter with finite baryon charge density. 8.3.1 Symmetry, order parameters, and phase transitions Perhaps the central theme of our analysis is the use of QCD symmetries to delineate the phases. This does not provide us with much quantitative information about the phase transitions. However, the information this provides is more robust than results obtained from any specific dynamical model of the transition. The transitions which can be studied in this way are transitions between phases, or thermodynamic states of the system, in which the symmetry is realized in one of two different ways: as an exact symmetry or as a spontaneously broken symmetry. Consider as an example the behavior of a ferromagnet. The microscopic interactions respect the global O(3) rotation symmetry. However, the state of the system at zero temperature is not O(3) symmetric – one direction, the direction of spontaneous magnetization, is special. At finite temperature, the thermodynamic state of the system still breaks the O(3) symmetry until the Curie temperature is reached, whereupon the magnetization vanishes, and the microscopic O(3) symmetry becomes an exact symmetry of the thermodynamic state. The transition between the O(3) symmetric and broken states must correspond to a singularity in the thermodynamic quantities. In order to show that this is the case, another concept becomes useful – the concept of the order parameter. For the purpose of this argument, the order parameter is a quantity that must have two properties: (i) it has to vary under the transformations of the symmetry, and
8.3 A tour of the T –µ phase diagram
209
(ii) it must be nonzero in a state with spontaneously broken symmetry. The mean magnetization is such a quantity in the ferromagnet. The symmetry of the thermodynamic state at high T means that all quantities that are not invariant under O(3) rotations must have zero mean (or expectation) values. The magnetization as a function of temperature, therefore, cannot be an analytic function across the Curie temperature Tc . Indeed, a function constant in a finite interval of T must be constant everywhere within the domain of analyticity. The axial-flavor symmetry of QCD in the limit of two massless quarks, SU(2)A , plays the role of such a symmetry. The order parameter is the expecta¯ tion value of the chiral condensate ψψ. This symmetry is restored at finite T , and we must therefore expect a thermodynamic singularity – a phase transition. What if the symmetry is not exact, as is the case for SU(2)A of QCD when the quarks are light but not massless? We can go back to the ferromagnet and ask this question. If there is no symmetry, there is no argument based on the order parameter. Any quantity, such as the magnetization, cannot be expected to vanish in any finite region of the temperature – there is no symmetry to protect it. Indeed, the magnetization at T > Tc is simply proportional to the applied magnetic field (for small field), which breaks the O(3) symmetry. The fate of the transition in this case depends on the order of the transition in the limit of the exact symmetry. In the case of the ferromagnet, the transition at the Curie temperature is of second order and it disappears for arbitrarily small magnetic field. The singularities in the thermodynamic quantities (e.g. in the magnetic susceptibility) round up and become sharp peaks, and the order parameter crosses over from the region where it is large to the region where it is small. Since we expect the transition in QCD with two massless quarks to be of second order, we should expect a similar crossover phenomenon to replace the true phase transition when the small finite quark masses are taken into account. On the other hand, if the transition was first order when the symmetry was exact, we can expect that the discontinuities which it created in the thermodynamic quantities are smooth functions of the symmetry-breaking parameters. In this case, the first-order transition should persist, at least for small explicit symmetry breaking. Finally, let us comment on the properties of phase transitions which determine their order. In some sense we shall be giving the definition of second- and firstorder phase transitions here. Although for some readers this may constitute a repetition of well-known facts, it may benefit others who might have different notions about the meaning of the order of a transition. We shall define a firstorder phase transition as a point in the space of thermodynamic parameters (such as T or magnetic field H for the ferromagnet or T and µ in QCD) where two distinct phases, or thermodynamic states, of the system can co-exist. In other words, the system has a choice of thermodynamic state. A typical example is a glass of water with ice. Unless the temperature is exactly 0 ◦ C, either all the
210
High temperatures and chemical potentials T Tc
H
Figure 8.9. The phase diagram of a ferromagnet. The width of the line between T = 0 and Tc represents qualitatively the strength of the discontinuity in the magnetization as a function of the magnetic field H across H = 0. water should freeze or all the ice should melt. At zero temperature, water has a choice, to be solid or to be liquid. Let us look at the T –H phase diagram of a ferromagnet in Fig. 8.9. Along the line H = 0 below Tc , the ferromagnet has a choice of, in fact, infinitely many states differing by the direction of the magnetization. At any nonzero H the state with M parallel to H is the only truly stable state. Therefore, the line between T = 0 and Tc at H = 0 is a first-ordertransition line. The transition between different phases occurs as a function of the magnetic field. Note also that each of the co-existing phases is completely free of any singularities at a first-order line. The transition occurs when the system jumps, or rearranges itself from one phase into another. Such a transition can even be prevented by going through the transition point slowly – as in supercooling the water. In contrast, a second-order phase transition is characterized by a singularity in thermodynamic functions. This happens practically always because the second-order transition is a special point on a line of first-order phase transitions – the end-point. It is a point where two or more different phases fuse into one. There is no phase co-existence here, because there is only one phase at a second-order transition. Second-order transitions are also often called critical points. Although the same term is often used for the first-order transitions, in this section we try to avoid this confusion and use this term only for second-order, singular, phase transitions. In the T –H plane of the ferromagnet in Fig. 8.9 one can clearly see that phase co-existence ends at Tc . The magnetization vanishes in all phases. There is only one phase at Tc – the symmetric phase. The system has no choice. 8.3.2 Definitions As we “warned” above, one can expect the phase diagram of QCD to have many subtleties. We can predict some of these by noticing that there are several small
8.3 A tour of the T –µ phase diagram
211
parameters, whose interplay may produce subtle, competing effects that will lead to new states of matter: the up- and down-quark masses are small on the natural mass scale of QCD. In addition, the strange-quark mass is comparable to the QCD scale, and it might produce additional exotic states in the phase diagram (see, e.g., Fig. 7.3). In realistic environments, one must also take electromagnetism into account. We shall not even try to attack the full problem. Instead, we shall strip as much from it as we possibly can while still keeping the most basic QCD features in. We shall find that even such a “stripped” version of QCD (F. Wilczek coined the name “QCDLite” for it) presents a theoretical challenge. Thus we consider pure SU(3) QCD (i) with electroweak interactions turned off and (ii) with two massless quarks. There is then an exact SU(2)L × SU(2)R × U(1)B global symmetry of the action, which is spontaneously broken down to SU(2)V × U(1)B at zero and sufficiently low temperatures by the formation of ¯ a condensate, ψψ. Many features of QCD indicate that this is a reasonable approximation, e.g. the lightness of pions and the success of current-algebra relations. (We will comment below on the inclusion of electromagnetic interactions and strange quarks.) This theory is described by a grand canonical partition function, which, written as a path integral, is formally −'(T,µ)/T Z ≡e = D A Dψ¯ Dψ exp(−SE ). (8.64) The Euclidean action, SE , is given by
1/T
SE =
dx0 0
Nf 1 µ ¯ / +mf + ψ f ∂/ + A d x tr(Fµν Fµν ) − γ0 ψ f 2g 2 Nc f =1
3
(8.65) where Nf = 2 is the number of flavors, Nc = 3 is the number of colors, and m f = m = 0 is the quark mass. The Euclidean matrices γµ are Hermitian. ¯ Note that, with our choices of sign, positive m and µ induce positive ψψ and ¯ ψγ0 ψ. The normalization of µ differs from the normalization customary in lattice calculations by a factor 1/Nc (i.e. the baryon charge of a quark). We should be using the symbol µB ; however, in this section we shall omit the subscript in order to avoid unnecessary clutter. On integrating over the fermion fields we can also write ) 1 * ) µ * / +mf + Z = D A exp − 2 tr (Fµν Fµν ) det D (8.66) γ0 2g Nc As indicated, this system is characterized by equilibrium values of T and µ. So the system can be viewed as being in thermodynamic equilibrium with a large
212
High temperatures and chemical potentials
reservoir of entropy and baryon charge that has these values of T and µ. The total energy and baryon charge of our system fluctuate. Of course, the relative magnitude of these fluctuations is negligible for an open system of macroscopic size. The relation between the chemical potential, µ, and the average number density (per unit volume) of baryons, n, is the same as that between the temperature, T , and the average entropy density (per unit volume), s: nV =
ψ¯ f γ0 ψ f = −
f
∂' ; ∂µ
sV = −
∂' ∂T
(8.67)
where ' is the thermodynamic potential defined in Eq. (8.64). It can also be seen that ' = − pV , where p is the pressure. In other words, pressure, temperature, and chemical potential are not independent variables for our system. Their variations are related by d p = s dT + n dµ (8.68) Both T and µ (as well as p) are intensive parameters. For a system in thermodynamic equilibrium, these quantities are the same for any of its smaller subsystems. In contrast, the extensive densities s and n can differ for two subsystems even when they are in equilibrium with each other. This happens in the phase-co-existence region, e.g. a glass containing water and ice. It is more convenient to describe the phase diagram in the space of intensive parameters, T and µ. In particular, the first-order phase transition which we shall encounter is characterized by one value of µ but two values of n – the densities of the two co-existing phases. Another reason for working with these coordinates is that first-principles lattice calculations are performed using T and µ as independent variables that can be controlled while the densities are measured. The results of relativistic heavy-ion-collision experiments are also often analyzed using this set of parameters [55]. 8.3.3 Zero temperature We begin by considering the phase diagram as µ is varied along the line T = 0. Strictly speaking, we are not dealing with thermodynamics here since the system is in its ground state. This fact leads to a simple property of the function n(µ) that we have discussed already. Let us write the partition function as the Gibbs sum over all quantum states, α, of the system: Z=
α
) E − µN * α α exp − T
(8.69)
where each state is characterized by its energy, E α , and its baryon charge, Nα . In the limit T → 0, the state with the lowest value of E α − µNα makes an
8.3 A tour of the T –µ phase diagram
213
exponentially dominant contribution to the partition function. When µ = 0, this is the state with N = 0 and E = 0, i.e. the vacuum or α = 0. Let us introduce µ0 ≡ min(E α /Nα ) α
(8.70)
As long as µ < µ0 , the state with the lowest value of E α − µNα remains the vacuum, α = 0. Therefore, we conclude that, at zero temperature, n(µ) = 0
for
µ < µ0
(8.71)
What is the value of µ0 ? As we have seen, for a theory with a free massive fermion (baryon) this value is equal to the mass of the fermion; µ0 = m. The function n(µ) is singular at the point µ = m, Eq. (8.28). The existence of some singularity at the point µ = µ0 , T = 0 is a robust and model-independent prediction. This follows from the fact that a singularity must separate two phases distinguished by an order parameter, e.g. n. The function n ≡ 0 cannot be continued to n = 0 without a singularity, as we pointed out in Section 8.3.1. What is µ0 for the case of QCD, and what is the form of the singularity? The answers to these questions are somewhat different in QCD and in the real world (QCD+) which includes other interactions, most notably electromagnetic interactions. Since QCD is our focus here and QCD+ is the ultimate goal of our understanding, we shall consider both cases. It is important to understand their differences if we are to extract physically useful predictions from lattice calculations, which are performed for QCD rather than QCD+. The energy per baryon, E/N , can also be written as m N − (N m N − E)/N , where m N = m p ≈ m n is the nucleon mass. Therefore, the state which minimizes E/N is that for which the binding energy per nucleon, = (N mN − E)/N , is a maximum. Empirically, we know that this state is a single iron nucleus at rest with N = A = 56 and ≈ 8 MeV. However, in QCD without electromagnetism the binding energy per nucleon increases with A. This is a consequence of the saturation of nuclear forces and can be seen from the Weizsacker formula. Without electromagnetism, only the bulk and surface energy terms are significant for large A: (A) ≡
Am N − m A ≈ a1 − a2 A−1/3 A
(8.72)
with a1 ≈ 16 MeV and a2 ≈ 18 MeV [56]. As A → ∞, saturates at the value a1 . This corresponds to the binding energy per nucleon in a macroscopically large sample of nuclear matter as defined by Fetter and Walecka in [56]. We conclude that, in QCD, the density jumps at µ = µ0 ≈ m N − 16 MeV to the value of the density of nuclear matter, n 0 ≈ 0.16 fm−3 . Therefore, in QCD there is a first-order phase transition, characterized by a discontinuity in the function n(µ) at µ = µ0 (see Fig. 8.10(a)).
214
High temperatures and chemical potentials
n2
n2
n1
n1
n0
n0 nFe
µ0
µ1 (a)
µ0
µ1
(b)
Figure 8.10. Schematic representations of the dependence of the baryon charge density on the chemical potential at T = 0 (a) in QCD (µ0 ≈ m N − 16 MeV) and (b) in QCD+ (µ0 ≈ m N − 8 MeV).
In QCD+, the Coulomb forces change the situation near µ0 . The contribution of the Coulomb repulsion to (A) is negative: −0.7 MeV× Z 2 /A2/3 , and it is responsible for the experimentally observed maximum in (A) at A ≈ 56. Isospinsinglet nuclear matter (A = ∞) is unstable at zero pressure due to Coulomb repulsion. Neutron matter with Z A is also unstable at zero pressure, and we are left to consider a gas of iron nuclei. In order to ensure electric neutrality, we must add electrons. Such a gas is clearly unstable at small densities and forms a solid – iron. Therefore, there is a discontinuity in the value of n(µ) at µ0 ≈ m N −8 MeV. This discontinuity is equal to the density of normal matter (i.e. iron) and is about 10−14 times smaller than in QCD. For very small µ−µ0 , n(µ) has structure, fine on the scale of QCD, which reflects the properties of normal matter under pressure. Then, for µ − µ0 = O(10–200 MeV), we traverse the domain of nuclear physics with the possibility of various phase transitions. In particular, the transition to neutron matter (Z A) is probably similar to the transition in QCD at µ = µ0 . (See Fig. 8.10(b).) In this domain, one may encounter such phenomena as crystallization of nuclear matter [57, 58], superconducting phases of neutron matter and quark matter [21–24], and, due to the strange quark in QCD+, kaon condensation [57, 59] and a transition to strange-quark matter [60, 61]. Moving along the µ axis to the right is equivalent to increasing the pressure: p = n dµ. Thus, this picture is roughly what one might encounter on moving toward the center of a neutron star from the iron crust at the surface. Our knowledge of n(µ) is scanty for densities of order one to ten times n 0 and µ − µ0 = O(10–200 MeV) both in QCD and in QCD+. We can only be sure that n(µ) is a monotonically increasing function, which follows from the requirement of thermodynamic stability.
8.3 A tour of the T –µ phase diagram
215
The behavior of n(µ) again becomes calculable in the region of very large µ QCD . In that case, the Pauli exclusion principle forces the quarks to occupy ever-higher momentum states, and, due to asymptotic freedom, the interaction of quarks near the Fermi surface is (logarithmically) weak. The baryon charge density is proportional to the volume of a Fermi sphere of radius µ/3, n(µ) ≈ Nf (µ/3)3 /(3π 2 ). At low temperatures, only quarks near the Fermi surface contribute to the Debye screening of the gauge fields. The square of the screening mass, m 2D , is proportional to the area of the Fermi surface: m 2D ∼ g 2 µ2 .This means that color interactions are screened on lengths O(1/(gµ)) = O( ln(µ/QCD )/µ). This motivates the conclusion that nonperturbative phenomena such as chiral-symmetry breaking should not occur at sufficiently large µ. Therefore, in QCD with two massless quarks one should expect at least one other phase transition, at a value of µ that we define as µ1 – a transition characterized by the restoration of chiral symmetry. The situation in QCD+, with a strange quark, is somewhat more subtle. As has been observed by Alford, Rajagopal, and Wilczek [62] at sufficiently large µ one must reach a phase in which chiral symmetry is broken by a completely different mechanism than that by which it is broken in the QCD vacuum, the mechanism which the authors named color-flavor locking (CFL). We shall discuss the CFL phase of QCD+ in Chapter 9. The mass of the strange quark again crucially affects the phase diagram. It is not known whether this transition to CFL occurs from the chirally symmetric phase µCL > µ1 , or before the chirally symmetric phase even sets in at zero T .5 Sending m s to infinity relieves us of this question. What is the value of µ1 in QCD, and is it finite? Very little reliable information about the phase transition at µ1 is available. However, results from several different approaches agree on the conclusion that the value of µ1 is finite and that µ1 − µ0 is of the order of the typical QCD scale QCD ≈ 200 MeV ≈ 1 fm−1 . For example, equating the quark pressure minus the MIT bag constant to the pressure of nuclear matter yields such an estimate (see, e.g., [64]). The simple model of Section 8.2, √ which is similar, but neglects the pressure on the hadronic side, gives µ1 = 3 π B 1/4 ≈ 1.1 GeV. Here, we should also point out another interesting distinction between QCD and QCD+: the effect of the strange quark in QCD+ is to decrease the value of µ1 compared with that of QCD. It has even been conjectured that this effect might be sufficient to drive µ1 below µ0 , which would make normal nuclear matter metastable [60, 61]. Another model that predicts the phase transition at finite µ1 is the Nambu–Jona Lasinio model, which focuses on the degrees of freedom associated with the spontaneous chiralsymmetry breaking and leads to a similar estimate for µ1 [53].
5 Perhaps there is no transition at all, as conjectured by Sch¨afer and Wilczek [63].
216
High temperatures and chemical potentials
What is the order of this phase transition? The MIT bag model predicts that it is a first-order transition since the density, n, of the baryon charge is discontinuous. Unfortunately, analysis of the Nambu–Jona Lasinio model shows that the order of the transition depends on the values of parameters, most notably, on the value of the cutoff. A larger cutoff leads to a second-order transition, a smaller cutoff to a first-order transition [53]. A random-matrix model at T = 0 predicts a first-order phase transition [65]. We shall discuss the random-matrixmodel predictions in more detail in Chapter 10. Here we shall use more general methods to analyze features of the phase diagram of QCD at finite density and temperature. An additional, qualitative argument for the first-order nature of the chiral phase transition at µ1 can be drawn from an analogy between QCD and a metamagnet, such as a crystal of ferrous chloride, FeCl2 . At temperatures below the N´eel temperature, TN , and at zero magnetic field, H , such a crystal is antiferromagnetically ordered, i.e. the staggered magnetization, φst , has a nonzero ex¯ pectation value, φst = 0. Analogously, ψψ = 0 in QCD below Tc . The magnetic field H is not an ordering field for the staggered magnetization because it couples to a different order parameter (i.e. normal magnetization, φ, with E = −H φ) and induces nonzero φ. Similarly, the chemical potential ¯ 0 ψ, and the term µψγ ¯ 0 ψ does not introduce explicit induces a nonzero ψγ breaking of chiral symmetry. At some critical value of H , ferrous chloride undergoes a first-order phase transition, and the staggered magnetization vanishes: φst = 0. One could naturally expect that in QCD a similar competition be¯ tween the low-temperature spontaneous ordering, ψψ = 0, and the ordering ¯ 0 ψ = 0 induced by µ would result in a first-order phase transition. This ψγ analogy can be continued into the T –µ plane or the T –H plane in the case of the antiferromagnet. The antiferromagnet has a well-known tricritical point in this plane. Its analog in QCD will be discussed in Section 8.3.5. Following the arguments of the two preceding paragraphs, we base our subsequent analysis of the phase diagram of QCD with two massless quarks on the following expectations: (i) µ1 ∼ µ0 + O(200 MeV) and (ii) the transition is of first order. 8.3.4 Finite T and µ We shall use two order parameters to analyze the phase diagram of QCD at ¯ nonzero T and µ: the chiral condensate ψψ (per flavor) given by ¯ ψψV =−
1 ∂' Nf ∂m
(8.73)
and the density of the baryon charge n given by Eq. (8.67). We have already used n to show that there is a singularity at µ = µ0 and T = 0. It was important for
8.3 A tour of the T –µ phase diagram
217
that argument that n is exactly zero for all µ < µ0 . At nonzero T , however, n is not strictly 0 for any µ > 0. For example, for very small µ and T one finds a very dilute gas of light mesons, nucleons, and antinucleons with calculable density µ )2m N T *3/2 −m N /T n(T, µ) ≈ e (8.74) T π Since it is nonzero, at finite T the density n cannot be used to predict phase transitions as it can at zero T . Nevertheless, we can use a continuity argument to deduce that the first-order phase transition at T = 0, µ = µ0 has to remain a first-order phase transition for sufficiently small T . Therefore, there must be a line emerging from the point T = 0, µ = µ0 . One can think of this transition as boiling the nuclear fluid. The slope of this line can be related to the discontinuities in the entropy density, s (or the latent heat per volume T s), and in the baryon density, n, across the phase transition line through the generalized Clapeyron–Clausius relation: dT n =− dµ s
(8.75)
This relation follows from the condition that the pressure, temperature, and chemical potential should be the same in the two phases on a phase-coexistence curve and Eq. (8.68). In analogy with ordinary liquid–gas transitions, the gaseous phase has a lower particle density (whence n < 0) and lower entropy density6 (whence s < 0). Therefore, the slope dT /dµ must be negative. We further expect that the slope is infinite at T = 0 since s(T = 0) = 0, and hence s(T = 0) = 0. As there is no symmetry-breaking order parameter that distinguishes the two phases, there is no reason why these two phases cannot be connected analytically. As in a typical liquid–gas transition, it is natural to expect that the first-order phase-transition line terminates at a critical point with the critical exponents of the three-dimensional Ising model.7 The temperature of this critical point can be estimated from the binding energy per nucleon in cold nuclear matter, T0 = O(10 MeV). (See Fig. 8.11.) Signatures of this point are seen in heavy-ion collisions at moderate energies (i.e. ≈ 1 GeV per nucleon), and the critical properties of this point have been studied through measurements of the yields of nuclear fragments [64, 66]. In particular, the reported critical exponents are in agreement with those of the three-dimensional Ising model [66]. Additional phase transitions that might occur at T = 0 would give rise to additional phase-transition lines. One could expect two generic situations. If 6 The entropy per particle is greater in the gaseous phase, but the entropy per volume s is smaller
because the particle density is much smaller. 7 The Ising nature of the universality class follows from the fact that the transition can be modeled
as an Ising lattice gas.
218
High temperatures and chemical potentials
there is a breaking of a global symmetry (e.g. translational symmetry in the case of crystallization of nuclear matter), the phase-transition line must separate such a phase from the symmetric phase at higher temperature without any gaps in the line. Otherwise, the transition can terminate at a critical point. At very high T QCD , we have a plasma of quarks and gluons with a logarithmically small effective coupling constant, g(T ), and we can again calculate the density of the baryon charge n: −1 d3 p | p| − µ/3 n(T, µ) ≈ 4 +1 exp − {µ → −µ} (8.76) (2π )3 T We expect that the chiral condensate is zero at very high T since the effective coupling is weak, according to asymptotic freedom. Therefore, a phase transition must separate the quark–gluon-plasma phase from the low-temperature phase. This transition has been studied extensively at µ = 0 using a variety of methods. In particular, as we discussed in Chapter 7, lattice calculations have established the value of Tc as approximately 170 MeV [67]. Arguments based on universality suggest that this transition is of second order with the critical exponents of the three-dimensional SU(2)L × SU(2)R ∼ O(4) universality class [68]. Results from lattice calculations are consistent with this scenario [69].8 Here, we assume that this description is true and try to understand what happens to this transition when µ is not zero. For massless quarks, the low-temperature hadronic phase and the quark– ¯ gluon-plasma phase can be distinguished by the expectation value of ψψ, since this is identically zero in the quark–gluon-plasma phase and nonzero in the hadronic phase with spontaneously broken chiral symmetry. Therefore, when quark masses are strictly zero, a phase transition must separate these two phases, i.e. these phases cannot be connected analytically in the T –µ plane at m = 0. Therefore, a line of phase transitions must begin from the point T = Tc , µ = 0 and continue into the T –µ plane (see the left-hand side of Fig. 8.11). As discussed above, restoration of chiral symmetry at T = 0 is most likely to proceed via a first-order phase-transition. Therefore, the transition must remain of first order as we continue along a line into the T –µ plane. The slope of this line can again be related to the discontinuity in the baryon charge and the entropy density, Eq. (8.75). Since we expect that both density and entropy will be larger in the quark–gluon-plasma phase, the slope of this line, dT /dµ, should be negative. This first-order-transition line cannot terminate because the order parameter, ¯ ψψ, is identically zero on one side of the transition. The minimal possibility is 8 A sufficiently light third quark would drive the transition first order [68]. However, lattice cal-
culations also indicate that the strange quark is not sufficiently light for this to occur [70].
8.3 A tour of the T –µ phase diagram
219
T3 , µ3
crossover
0
T0 µ0
µ1
8
T0
TE, µE
0
µ0
µ1
8
Tc
8
8
m=0
Figure 8.11. Schematic plots of the phase diagram of QCD with two massless (left) and light (right) quarks. The SU(2)A symmetry of QCD is restored at the second-order phase-transition line begining at T = Tc and µ = 0 (left). This transition is smoothened into a crossover when m = 0 (right). The first-order phase transition (thick line) remains. that it merges with the second-order phase-transition line coming from T = Tc , µ = 0 (see the left-hand side of Fig. 8.11). The point where the two lines join is a tricritical point. Such a point exists in many physical systems (e.g. for the FeCl2 antiferromagnet), and universal behavior in the vicinity of this point has been studied extensively. In the next section, we review those properties of a tricritical point which follow from universality. 8.3.5 Universal properties of the tricritical point By analogy with an ordinary (bi)critical point, where two distinct co-existing phases become identical, one can define the tricritical point as a point where three co-existing phases become identical simultaneously. A tricritical point marks an end-point of three-phase co-existence. In order to see this in QCD, it is necessary to consider another dimension in the space of parameters – the quark mass m – see Fig. 8.12. This parameter breaks chiral symmetry explicitly. In such a three-dimensional space of parameters, one can see that there are two surfaces (symmetric with respect to m → −m reflection) of first-order phase transitions emanating from the first-order line at m = 0. On these surfaces or wings with m = 0, two phases co-exist: a low-density phase and a high-density phase. There is no symmetry distinguishing these two phases since chiral symmetry is explicitly broken when m = 0. Therefore, each surface can have an edge that is a line of critical points. These lines, or wing lines, emanate from the tricritical point. The first-order phase-transition line can now be recognized as a line where three phases co-exist: the high-T and -density phase and two ¯ low-density and -T phases with opposite signs of m and, hence, also of ψψ. This line is called, therefore, a triple line.
220
High temperatures and chemical potentials T
critical line, m = 0 m tricritical point, m = 0 line of end-points, m = 0 µ
triple line
surface of first-order transitions
Figure 8.12. Phase transitions in the three-dimensional T –µ–m space in the vicinity of the tricritical point. The plane m = 0 is a symmetry plane. Chiral symmetry is exact only in this plane, and it is only here that the low- and high-temperature phases must be separated by a transition. One can also view this plane as a first-order ¯ phase-transition surface, since ψψ has a discontinuity across it. Then, the second-order phase-transition line and the triple line provide a boundary for this surface. Critical behavior near the tricritical point can now be inferred from universality. The upper critical dimension for this point is 3. Since critical fluctuations are effectively three-dimensional for the second-order phase transition at finite T , we conclude that behavior near this point is described by mean-field exponents with only logarithmic corrections. The effective Landau–Ginsburg theory ¯ for the long-wavelength modes, φ ∼ ψψ, near this point requires a φ 6 potential, which has the form (in the symmetry plane m = 0) 'eff = '0 (T, µ) + a(T, µ)φ 2 + b(T, µ)φ 4 + c(T, µ)φ 6
(8.77)
with c > 0. The φ 6 term is necessary in order to create three minima corresponding to the three co-existing phases. This explains why the critical dimensionality is 3, since, for this dimension, the operator φ 6 becomes a marginal operator. When b > 0, the transition occurs when a = 0 and is a second-order transition similar to that seen in a φ 4 theory. This corresponds to the second-order line. When b < 0 the transition occurs at some positive value of a and is of first order. This is the triple line. When both a and b vanish, we have a tricritical point.
8.3 A tour of the T –µ phase diagram
221
In particular, the following exponents in the symmetry plane m = 0 are readily found using mean-field φ 6 theory (as noted above, results from renormalization-group studies [71] show that the actual singularities include additional, logarithmic corrections). The discontinuity in the order parameter ¯ φ = ψψ along the triple line as a function of the distance from the critical point µ3 , T3 (measured as either T3 − T or µ − µ3 ) behaves like ¯ ψψ ∼ (µ − µ3 )1/2
(8.78)
The discontinuity in the density, n = d'eff /dµ, across the triple line behaves like n ∼ µ − µ3 . (8.79) The critical behavior along the second-order line is everywhere the same as at the point µ = 0, T = Tc (which is an infrared-attractive fixed point). Therefore, ¯ ψψ vanishes on the second-order line with three-dimensional O(4) exponents. ¯ At the tricritical point, however, the exponent with which ψψ vanishes is given by Landau–Ginzburg theory as ¯ ψψ ∼ (T3 − T )1/4
(8.80)
When m = 0, the potential 'eff (φ) can also contain terms φ and φ 3 that break φ → −φ symmetry explicitly. (The term φ 5 can be absorbed by a shift of φ.) The potential 'eff (φ) still has three minima, and a first-order phase transition can occur when two adjacent minima are equally deep. These transitions form a surface of first-order phase transitions – the wings. The two minima (and an intermediate maximum) can also fuse into a single minimum. This happens on the wing lines at the edge of the surface of first-order phase transitions. The critical behavior along the wing lines is given by the three-dimensional Ising exponents, as is usual at the end-points of first-order liquid–gas-type phase transitions not associated with restoration of a symmetry. In particular, the discontinuity in ¯ ψψ and n vanishes with exponent β ≈ 0.31. These discontinuities are related to the slope of the wing surface at constant T through a relation similar to Eq. (8.75): ¯ dµ ψψN f =− (8.81) dm n There are many other universal properties in the vicinity of a tricritical point that can be derived from the above φ 6 Landau–Ginzburg effective potential. One can, for example, show that the m = 0 second-order line, the wing lines, and the triple lines approach the triple point with the same tangential direction: The second-order line approaches from one side while the wing lines and the triple line approach from the opposite side. For a more detailed description of the properties of tricritical points, see [71].
222
High temperatures and chemical potentials 8.3.6 Summary and remarks
We reviewed an analysis (qualitative and, in some cases, quantitative) of the salient features of the phase diagram of QCD with two light or massless quark flavors at finite temperature and baryon chemical potential. The most important features of this phase diagram are summarized in Fig. 8.11. The phase diagram can certainly have a much richer structure. The phase transitions shown in Fig. 8.11 are distinguished by the fact that a good order parameter can be associated with each of them. Here the term “good order parameter” implies the existence of some quantity whose expectation value is identically zero in some finite region of parameter space or in one phase and is some function of parameters in the other phase. Two such phases must be separated by a nonanalytic boundary, i.e. a phase-transition. What is crucial here is the identical vanishing of an order parameter or its strict independence of the parameters of the theory. Usually, this is ensured by the existence of some symmetry with respect to which this order parameter transforms nontrivially. ¯ For the chiral phase transition, a good order parameter is the value of ψψ, which spontaneously breaks the global SU(2)L × SU(2)R chiral symmetry to ¯ ¯ SU(2)V . Hence, the phase with ψψ = 0 and the phase with ψψ = 0 cannot be connected without crossing a phase-transition line in the T –µ plane. The transition from n ≡ 0 to n = 0 along the T = 0 line provides another example of a phase transition associated with a good order parameter. The phases n ≡ 0 and n = 0 cannot be analytically connected; i.e. one must pass through a nonanalytic boundary (phase transition) on passing from one to the other. Since this is a first-order phase transition, continuity requires that there is also a firstorder-transition line for some T = 0. This line can, however, terminate since n is no longer a good order parameter when T = 0. Of course, the existence of a good order parameter is a sufficient but not a necessary condition for a phase transition. Other phase-transition lines associated with more subtle phenomena are also possible. In fact, one should expect many subtle surprises in this phase diagram, especially once the finite strange-quark mass is introduced. The most interesting feature of the phase diagram of Fig. 8.11 is the presence of a tricritical point. Because the critical dimensionality for such a point is equal to 3, critical behavior near this point is given by mean field theory plus logarithmic corrections. The tricritical point lies in the region expected to be probed by heavy-ioncollision experiments. Discovery of experimental signatures of this point is an exciting prospect for these experiments. Since quark masses are not precisely zero, we should consider a slice of a three-dimensional phase diagram, Fig. 8.12, with m = 0, shown on the right-hand side in Fig. 8.11. A qualitative difference between the phase diagram for m = 0 and that for m = 0 is the absence of
8.4 The quark–gluon plasma
223
the second-order phase-transition line associated with the restoration of chiral symmetry. This symmetry is explicitly broken for m = 0. However, continuity from m = 0 ensures that the first-order finite-density transition is still present at m = 0. This transition line is terminated by an ordinary critical point. Criticality at this end-point is not associated with restoration of chiral symmetry, and excitations with the quantum numbers of pions do not become massless there. Criticality at this point is associated with the fact that a correlation length in the channel with the quantum numbers of the sigma meson becomes infinite. (Hence, it is plausible to infer that this point has the critical behavior of the three-dimensional Ising model.) The experimental signatures of the critical point should therefore be associated with this phenomenon.
8.4 The quark–gluon plasma and the energy scales of QCD Experimental, phenomenological, and theoretical developments point to the possibility that the physics of QCD is characterized by three distinct length scales. At sufficiently short distances, less than 15 fm, calculations and theory confirm asymptotic freedom, which is essential to exposing the underlying gluon and quark degrees of freedom of the theory. At more moderate distances, from 15 fm to 12 fm, for which the running coupling is in the intermediate range, the description of the theory changes qualitatively. On these scales one can argue that semiclassical instanton configurations are breaking the anomalous U(1)A symmetry and, through properties of their ensemble, are breaking chiral symmetry and are producing the constituent-quark masses. Finally, on greater length scales, phenomenology and lattice methods confirm confinement, the fact that hadronic states are color singlets, and that the confining dynamics comes through thin, discernable but breakable, flux tubes. One of the major purposes of studying QCD in extreme environments is to understand these phenomena in greater detail. In other words, one of the major purposes of studying QCD in extreme environments is to understand it in ordinary environments. Scattering experiments in which the quark–gluon plasma is created afford us the chance to look into the workings of QCD with higher spatial and temporal resolutions than can be obtained in ordinary collisions. As emphasized by E. Shuryak [72], the mass scales of the quark–gluon plasma are different and, importantly, smaller than those of the more familiar hadronic phase. The hadronic phase breaks chiral symmetry; the quark–gluon plasma does not. The hadronic phase confines quarks; the quark–gluon plasma does not. The binding mechanism in the hadronic phase is nonperturbative whereas the screening mechanism in the quark–gluon plasma is perturbative. The perturbative screening masses in the quark–gluon plasma are M gluon ≈ 0.4 GeV and
224
High temperatures and chemical potentials
Quarks and gluons
E/T 4
Counts degrees of freedom
Mesons
170 MeV
Figure 8.13. The energy density /T 4 versus T in a typical lattice simulation of QCD. M quark ≈ 0.3 GeV for temperature above but near Tc , at which the plasma first appears. The finer level spacings in the plasma act as a fine-resolution grid to the dynamics in the hadronic phase. A collision that starts in the hadronic phase and ends in the plasma phase is especially sensitive to the levels and dynamics of the hadronic phase because of the relative wealth of open channels in the plasma phase. This point shows up in several well-known phenomena. 1. The energy density, . Lattice simulations [73] have shown that /T 4 grows rapidly as T passes through Tc = 170 MeV. On the low-temperature side of Tc , the energy density is accounted for by the presence of light mesons and on the hightemperature side it is accounted for by quarks and gluons. The number of degrees of freedom grows rapidly as T passes through Tc , and, although interactions are essential for T ≥ Tc , the Stefan–Boltzmann estimate for the high-temperature energy density does not overshoot lattice-simulation data by more than 10%–20%. 2. The chiral condensate. ¯ The chiral condensate ψψ falls rapidly as T passes through Tc (Fig. 8.14). The hadronic phase is characterized by a large constituentquark mass, massive baryons and light mesons, whereas the plasma phase has light quarks. 3. The heavy-quark potential. When T < Tc the heavy quark potential V (r ) rises linearly at large distances before pair production breaks the flux tube which apparently forms between the heavy source and sink of color. However, when T > Tc the
8.4 The quark–gluon plasma
225
Chiral condensate
Heavy nucleon Light pion
Light quarks T
170 MeV
¯ Figure 8.14. ψψ versus T in a typical lattice simulation of QCD.
V(r) linear confinement
COLD
screening HOT
r
Figure 8.15. The heavy-quark potential above and below the T transition in QCD. linear component of the potential is replaced by a screening form that saturates as r grows (Fig. 8.15). 4. The velocity of Sound. Lattice simulations have not yielded the velocity of sound vs in the two phases, but interesting results are expected. Since vs2 = ∂ P/∂, one expects this quantity to have a dramatic dip in the vicinity of Tc where a change in temperature converts hadronic matter to screened quarks and gluons rather than changing the pressure P. The velocity of sound is an excellent signal for the transition theoretically and should influence the evolution of hadronic matter to the quark–gluon plasma in heavy-ion collisions. Lattice calculations of vs are demanding because the pressure
226
High temperatures and chemical potentials 2 vs
1/3
relativistic gas
nonrelativistic gas
170 MeV
T
Figure 8.16. The velocity of sound versus T expected in QCD. estimates require a numerical integration of lattice data over a range of couplings. At small T vs2 should rise from zero as in a nonrelativistic gas and at large T vs2 should approach 13 , which is characteristic of a relativistic gas, as shown in Fig. 8.16.
8.5 The extreme environment at a relativistic heavy-ion collider Relativistic heavy-ion colliders, such as the RHIC facility at Brookhaven National Laboratory, are providing insight into the behavior of QCD at high energy density and small but nonzero baryon chemical potentials. Let’s consider some of the theoretical ideas and methods of analysis that go into the highenergy collisions of large nuclei. This is a vast subject so our discussion will be hardly more than a few snapshots. However, since the goal of heavy-ion collisions is the production and study of the quark–gluon plasma, we should consider the challenges and potential rewards that this experimental subject faces. RHIC collisions can be analyzed in four stages, which are thought to be characterized by distinct time scales. First, viewed from the center-of-momentum frame (Fig. 8.17), there are rightmoving (positive pz ) and left-moving (negative pz ) nuclei. Each of them can be described by infinite-momentum-frame wavefunctions that give the probability amplitudes of each nucleus to consisting of ensembles of quarks and gluons. The parton model, suitably improved with asymptotic freedom to describe its short distance features, should provide a framework for these initial states. Second, there is the collision between the constituents in each nucleus. Experiments will shed light on the typical momentum transfers and multiplicities of these underlying collisions. Third, there is the development of a thermalized plasma of quarks and gluons.
8.5 Relativistic heavy-ion colliders
NUCLEUS
227
NUCLEUS
Figure 8.17. Two nuclei described in a parton picture about to collide in the center-of-momentum frame. Fourth, there is the development of the final states of hadronic debris from the hot soup of colliding and produced constituents. Consider each stage of the collision in turn. Begin with the lightcone wavefunctions of the projectiles. That for the rightmover is n d2 K i dxi |# = ψn (x1 , K 1 ; . . . ; xn , K n ) xi n i ) * ) * × δ2 xi |x1 , K 1 ; . . . ; xn , K n (8.82) K i δ 1 − Recall from infinite-momentum-frame [74] or lightcone perturbation theory [74] that the wavefunction for a strongly interacting particle is written in a basis of the theory’s fundamental constituents, quarks and gluons. In Eq. (8.82) we have a sum over components of the wavefunction of a nucleus, each component consisting of n partons. By the rules of Schr¨odinger perturbation theory, written using lightcone variables or, equivalently, written in an infinite-momentum frame, the probability amplitude for this component is given by the n-body wavefunction ψn whose arguments are the transverse momenta and longitudinal fractions of each parton. The transverse momenta of the constituents add up to that of the nucleus, zero in this case, and the longitudinal fractions add up to unity. Each longitudinal fraction xi is a positive variable, 0 < xi < 1, according to the rules of infinite-momentum-frame perturbation theory. This condition, that each xi is positive, eliminates vacuum structure from the description of the wavefunction and makes this formulation of the collision particularly physical and direct. In the context of deep inelastic scattering, for example, one views a nucleon in the infinite-momentum frame and probes its structure with a pointlike photon, thereby measuring the longitudinal momentum distributions of the partons in the nucleus, as described by the parton model of Feynman [75] .
228
High temperatures and chemical potentials
dN/dr
Leftmovers
Central plateau Rightmovers
ln s
r
Figure 8.18. The rapidity distribution of partons in a nucleus–nucleus collision. The bulk features of the wavefunctions of the projectiles are conveniently displayed on a rapidity plot, r = 12 ln[( p0 + pz )/( p0 − pz )]. The rapidity is particularly handy because it simply translates under a boost along the z axis. One can plot the density of partons of both projectiles on a rapidity plot. Deep inelastic experiments at a fixed Q 2 , the four momentum squared of a photon probing the structure of the nuclei, indicate that the plot is essentially flat and boostinvariant. The extent of the rapidity plot is ln s, where s is the usual scattering variable, four times the square of the center-of-momentum energy, describing the collision. Experiments indicate that rapidity distributions for partons have some simple, statistical features (Fig. 8.18) [75]. 1. Short-range correlations in rapidity. The parton distributions lose memory, with a correlation length of 1–2 units, in rapidity. Screening in rapidity is an essential feature. Therefore, features like the quantum numbers of each projectile are limited to the edges of the rapidity plot. Unfortunately, this means that, in very-high-energy heavy-ion collisions, most of the partons are not influenced by the baryon number of the projectiles. The collisions can be characterized by a high energy density and temperature but not high chemical potential µB . 2. The “central region” is universal and “vacuum-like.” The length of the central region grows like ln s.
8.5 Relativistic heavy-ion colliders
229
dN = dx/x
Figure 8.19. A multiperipheral graph in a parton gφ 3 field theory. The graph fills rapidity space uniformly, dN ∼ dx/x ∼ d ln x ∼ dr . Although first-principles calculations of lightcone wavefunctions are beyond us, models having these features have been known for decades. Multiperipheral graphs of QCD (Fig. 8.19) can populate the rapidity axis uniformly and have suggested that gluons are the most likely parton species in the central region [76]. Asymptotic freedom affects these considerations profoundly. Recall that, in the naive parton model, partons interact only softly with one another, never exchanging high transverse momenta [75]. However, if we consider the structure of the wavefunctions at high spatial resolution where the running coupling is small and perturbative asymptotic freedom applies, then exchanges of high transverse momentum and short-distance additional structure are inevitable [77, 78]. Call the gluon-distribution function in momentum space x G(x, Q 2 ). This is the distribution function that would be resolved by a probe with resolving power 2 δX⊥ ∼ 1/Q 2 . Models and theoretical analysis of scattering data suggest that x G(x, Q 2 ) approaches a constant for small x and fixed Q 2 (Fig. 8.20). However, as Q 2 is increased one discovers that gluons can split perturbatively into three or four gluons of lesser longitudinal fractions xi . The simplest perturbative Feynman diagrams of QCD apply here and give the probability amplitude of a parton of longitudinal fraction unity and transverse momentum zero consisting of several other partons whose longitudinal fractions sum to unity and whose transverse-momentum distributions are power-law-behaved and given by the features of the low-order pertubation-theory graph relevant for this process. So, for sufficiently small x, x G(x, Q 2 ) must be an increasing function of Q 2 . This is a general feature of the asymptotically free parton model [77] and perturbative evolution equations make this simple physical idea quantitative [78]. For quarkdistribution functions that are accessible to deep inelastic electron and neutrino scattering, these ideas have been tested well phenomenologically.
230
High temperatures and chemical potentials
2 xG(x,Q )
2 Increasing Q
x
0.10
0.01
Figure 8.20. The longitudinal momentum distribution for increasing Q 2 .
2 Low Q
High Q
2
Figure 8.21. Variation in the transverse-plane distribution of partons for increasing Q 2 . If we view the parton density on the plane transverse to the collision axis pz , 2 then at low Q 2 we see just a few gluons of size δ X ⊥ ∼ 1/Q 2 , but as Q 2 is taken larger we resolve many more gluons of smaller size (Fig. 8.21). This progressive development has led to the idea of “gluon saturation” in the lightcone wavefunction of a nucleus [79]. To understand this idea, consider the wavefunction and imagine that it is coming at you and you have a snapshot of the transverse plane [79]. Call the
8.5 Relativistic heavy-ion colliders
231
transverse size of the nucleus RA . Each parton of transverse momentum Q in the wavefunction occupies an area π/Q 2 , as suggested by the uncertainty principle. The parton’s cross section is proportional to the square of the running coupling and its geometrical size, σ ∼ αs (Q 2 )π/Q 2 . However, partons will overlap in the transverse plane when their number NA becomes comparable to π RA2 /σ , which is Q 2 RA2 /αs (Q 2 ). One supposes that, when NA is this large, the overlap stops further growth in the density and “saturation” has been achieved [79]. The transverse momentum of the partons at saturation is Q 2s ∼ αs (Q 2 )NA /RA2 ∼ A1/3 . It is hoped that the A dependence in this and other similar equations will be testable in heavy-ion collisions. The number of partons at the saturation level is Ns ∼ Q 2s RA2 /αs (Q 2s ) ∼ A and this is the number of partons of size scale 1/Q s that could materialize in the quark–gluon plasma. In addition, Q 2s sets the scale for the transverse momenta of partons in the central region of the rapidity plot, K⊥ 1 dN 1 = 2 F σ dr d2 K ⊥ Q 2s αs Q s
(8.83)
These formulas can be made more precise by considering longitudinal fractions [79]. The discussion here is somewhat simplified, but a useful first guide. When numbers for the various nuclei and the energies available at experimental facilities are substituted here, one finds that Q 2s ∼ 1–2 GeV. This means that applying perturbative QCD is suspect and competitive nonperturbative effects should be expected. This is potentially the realm of lattice-gauge theory. The relationship of Euclidean lattice calculations and the Minkowski scattering formulation appropriate to heavy-ion collisions must be addressed. Stages of a heavy-ion collision The types of physics of interest in the four different stages of a heavy ion collision are thought to be quite distinct. 1. Wavefunctions approach one another. As reviewed above, the phase-space parton density is quite large, ∼ 1/αs (Q 2s ). Classical and semi-classical estimates should apply. In particular, the color per volume is large, so the lack of commutivity, [Q a , Q b ] = i f abc Q c , appears to be a relatively small effect, and the gluon field can be treated classically for many purposes [79]. At sufficently large resolution Q 2 , the partons of that stage fill the transverse plane. The nucleus resembles a “black” absorbing disk. As Q 2 increases further, additional partons are generated at the edges of the disc and cause it to expand. If multiperipheral dynamics is a good guide here,
232
High temperatures and chemical potentials then the expansion occurs at a logarithmic rate. This becomes the basis for high-energy nulceus–nucleus cross sections to grow at a rate proportional to ln2 s and saturate the Froissart bound [80].
2. Interactions. The gluon-saturation picture suggests that the characteristic time of interactions is quite small, tint ∼ 1/Q s ∼ 0.1–0.3 fm/c and the energy density is very, very large, int ∼ Q 4s /αs (Q 2s ) ∼ 20 GeV fm−3 . Such a large energy density would place the system deep in the quark–gluon-plasma phase. This is good news! Estimates of int in the past were much more modest, of the order of several GeV fm−3 , at most, and estimates of tint were much larger. In this scenario, it is more plausible that thermalization and the creation of a plasma is likely in heavy-ion collisions. 3. Thermalization. After the collision the quark–gluon matter expands and thermalizes. The experimental data and models suggest that the time scale for this to occur is ttherm ∼ 0.5–1.0 fm/c. RHIC data, especially those showing the spatial distribution of particles produced in non-head-on collisions, strongly suggest that the interactions at early times are strong and the quark–gluon plasma appears very early in the evolution of the final-state [81]. The basic parton–parton interactions must be strong and involve high multiplicities. Perturbative processes alone are not sufficient to explain the data. 4. Hydrodynamic expansion. The quark–gluon plasma maintains thermalization and expands until decoupling sets in at tdec ∼ 10 fm/c. In the expansion and production of the final-state hadrons, the fastest particles are produced last, as in the “inside–outside” cascade of electron–positron-annihilation processes [82]. The expansion is essentially one-dimensional, along the collision axis, until the latest stages of the development of the final state. A particle’s rapidity turns out to be strongly correlated with the spacetime rapidity, 12 ln[(x0 + x3 )/(x0 − x3 )], of its point of creation. Signatures of the quark–gluon plasma RHIC experiments are complicated and challenging to interpret. Several signatures for the production of the quark–gluon plasma are thought to be the following. 1. Jet quenching. The heavy-ion collisions show little or no sign of jets for p⊥ < 4 GeV. This is interesting in the light of the fact that, if one scales single-particle
8.5 Relativistic heavy-ion colliders
233
Figure 8.22. Gluons annihilating into quarks within the plasma phase. hadron spectra from proton–proton data to A–A data, one overshoots the jet data very significantly [81]. It has been argued that this is evidence for the existence of an extended space-time region of the quark–gluon plasma. The idea is that two colliding gluons in the plasma annihilate into an energetic quark–anti-quark pair with the quark and anti-quark having sizable transverse momenta, but their interactions with the plasma medium sap the energy out of the energetic quark and its partner, eliminating (“quenching”) those jet-like characteristics (Fig. 8.22). Of course, very-high-energy jets are unaffected by the medium because of the short-range nature of the most prevalent interactions on the rapidity axis, and jets with energies much larger than 4 GeV are seen. 2. Suggestions of deconfinement and restoration of chiral symmetry. Lepton pairs have long been cited as effective probes into the internal dynamics of the quark–gluon plasma. In particular, if the ρ and φ existed in the plasma, then the leptonic decays ρ → ee and φ → ee would easily be seen, as in proton–A collisions. These peaks are missing, however [81]. It is tempting to interpret this experimental result as evidence that the light hadrons have “melted” in the hot plasma. 3. Screening and suppression of J/ψ. One of the first theoretical suggestions for a signal of the existence of the quark–gluon plasma concerned the heavy quark states J/ψ. It was argued, and backed up with potential model calculations, that quasi-free quark and gluon thermal screening would reduce the attractive forces between the heavy quarks sufficiently to eliminate the binding energies and the J/ψ
234
High temperatures and chemical potentials states themselves [49]. In fact, the suppression of these states, which is again implied by the absence of the associated lepton pairs, is much more dramatic than that expected in hadron-absorption models.
RHIC experiments are in their infancy but hold great promise for elucidating the dynamics of QCD on a wide range of length and time scales. Heavy-ion collisions and the critical end-point What can heavy-ion-collision experiments say about the phase diagram of QCD? On looking at the phase diagram in Fig. 8.11 one observes that, since the transition between hadron and quark matter is not a proper phase transition, but rather a crossover, the strength of the signatures of the quark–gluon plasma should depend on the strength of this transition. Of course, in the ideal limit of T → ∞ the quark–gluon plasma is qualitatively different from the hadron gas, but at the intermediate temperatures probed by heavy-ion colliders the distinction depends on the sharpness of the crossover. The too-well-known fact that hadronic models, if pushed to the extreme, can almost reproduce or at least mimick the signatures of the quark–gluon plasma should not come as a surprise – on the basis of our present-day knowledge of QCD, we must expect that the hadron gas smoothly transforms into the quark–gluon plasma as the temperature rises. There is no good order parameter distinguishing the two states and no phase transition. Is there a property of the phase diagram that has distinct qualitative, not simply quantitative, features whose signatures cannot be mimicked? The most prominent feature of the phase diagram in Fig. 8.11 is the first-order transition line terminating in a critical point at TE , µE . The end-point is especially attractive since it is the point where thermodynamic properties of QCD are singular. What is the location of this singularity, the coordinates TE , µE ? This question has an answer, but we do not know it. If we did, it would appear as one of the fundamental quantitative properties of the QCD phase diagram in this book. Lattice efforts to determine TE , µE will be discussed later. The strategy for discovering the QCD critical end-point [83, 84] is based on the fact that fluctuations of the sigma field acquire an anomalously large correlation length.9 The divergence of this correlation length is the reason for the thermodynamic singularity. Such fluctuations induce characteristic correlations between particles that can interact by exchanging sigma quanta. Examples are pions, which are the most convenient particles to observe due √ to their abundance. By changing the parameters of the collision, notably s, one can vary 9 This correlation length is limited by the finite size and duration of the process to some 3 fm or
so [84, 85], which is still very large compared with the correlation length in this channel in the vacuum, and with typical correlation scales at these temperatures.
8.5 Relativistic heavy-ion colliders
235
the location in the√ phase diagram probed by the heavy-ion-collision experiment. As a function of s, the signatures of the critical point must appear and disappear as the critical point is approached and then passed during such a scan. Measurements of particle ratios allow one to determine the values of the temperature and the chemical potential achieved during the final stages of the collision (before the system breaks up). By matching the two measurements one can determine the location of the critical point on the phase diagram of QCD.
9 Large chemical potentials and color superconductivity
9.1 Color superconductivity and color–flavor locking Lattice-gauge theory has produced unique insights and predictions for QCD at high temperature, but it has had little to say about QCD in environments rich in baryons. There are several reasons for this impasse. One of them is the fact that the fermion determinant is complex in this environment, so the standard algorithms of the subject fail. This point will be demonstrated later. Other more conventional methods, such as weak-coupling approximations, apparently apply in this environment and suggest that the theory experiences color superconductivity at asymptotically large baryon chemical potentials. There is the hope that the transition to this exotic state of matter actually occurs at a chemical potential of just a few hundred MeV. The subject of color superconductivity produces interesting conjectures for the QCD phase diagram in the T –µB plane where T is small (below a few tens of MeV) and µB is very large. In this chapter we will discuss the major points, both strengths and weaknesses, of this approach and refer the reader to the growing literature for more recent, developing events. Once lattice-gauge theory develops an algorithm that applies to this environment, these ideas will be put to stringent tests. However, for much of the rest of this chapter, we must ignore lattice-gauge theory and focus on weak-coupling methods and the BCS theory of superconductivity and its application and extensions to QCD. We shall see that these weak-coupling nonperturbative methods suggest a fascinating array of phenomena – superconductivity, exotic pairings and condensates, novel spectroscopy, etc. It is interesting to recall that, in the earliest days of this subject, physicists believed that QCD at large µB would be easy to understand. The chemical potential µB carries the dimensions of energy, so one argued that the relevant coupling would be g(µB ), which would vanish logarithmically as µB → ∞. Therefore, 236
9.1 Color superconductivity
237
the nonperturbative phenomena, such as the formation of bound states, chiralsymmetry breaking, and confinement itself, would disappear in an environment rich in baryons. At high chemical potential, the argument went, QCD would become a weakly coupled plasma and the phase diagram of the theory would be described by the two-component model, Fig. 8.6. Physicists born and raised in the era of the BCS explanation of superconductivity knew better [21–24]. If one imagined that the ground state at asymptotic µB was a free-quark Fermi sphere, then one knows that even an infinitesimal attractive residual interaction between quarks touches off an instability with respect to a lower-energy ground state consisting of Cooper pairs. Actually BCS superconductivity occurs in a very sophisticated fashion in metals at low temperatures. There the strong Coulomb repulsion between electrons is screened by the nuclei, leaving the weak, but attractive, exchange of low-energy, lowmomentum sound waves (phonons) to act between the electrons on the opposite sides of the Fermi surface and lead to the condensation of large, floppy Cooper pairs. These doubly charged pairs condense in the new ground state and further shield it from electromagnetic fields. In the language of field theory, the photon develops a mass in the true ground state through the Meissner effect and there are energy gaps for all the excitations in the superconducting state. The energy gap, , the energy required to break up a Cooper pair, is the source of the system’s superconductivity. Can some of these ideas be applied to QCD? It has been argued that QCD should experience color superconductivity through a BCS mechanism at asymptotic µB and it is hoped that this mechanism persists down to reasonable values of the chemical potential, producing color superconductivity inside “typical” neutron stars. One imagines the free-quark model of three massless quarks and three colors at asympototically large µB . Sharp Fermi surfaces would appear. A free colored gluon would not propagate easily through such a system. It would experience Debye screening, Landau damping, etc. and single-gluon exchange between quarks on the top of the Fermi surfaces would destabilize them and lead to pairing. Since single-gluon exchange is attractive in the 3¯ channel, one would expect, perhaps, that 3¯ diquark condensation would be energetically favorable with respect to a sharp Fermi surface of uncorrelated quarks, and color superconductivity would occur. This physical picture has been analyzed more critically and there are surprises and subtleties, although color superconductivity is expected with a sustantial mass gap of several tens of MeV. First, one might question the applicability of asymptotic freedom to the dynamics on the surface of the Fermi sphere. The point is that high-energy quarks can scatter through glancing collisions, with small momentum transfers that are not controlled by g(µB ) 1. One needs to develop, at least, a self-consistent approach to the dynamics here, comparable in
238
Large chemical potentials
quality to discussions of the field-theoretical approach to BCS superconductivity. There one looks for self-consistent superconducting solutions to the coupled Schwinger–Dyson equations for the fermion propagator, the gluon propagator, and the fermion–gluon vertex. We will discuss this program for color superconductivity further below, after we have introduced more of the most elementary features expected in the solution. If single-vector-gluon exchange is the dominant destabilizing interaction at large chemical potential, then the wavefunction of the ground-state diquark condensate has some simple and striking group-theoretical features, called color– flavor locking [62]. Right-helicity fermions couple with right-helicity fermions through single-gluon exchange and left-helicity fermions couple with lefthelicity fermions in such a way that the ground state is a scalar, aα aα bβ bβ ψiL ( p)ψ jL (− p) = − ψiR ( p)ψ jR (− p) = ( p) ab αβρ i jρ (9.1) where α and β are color indices, i and j are flavor indices, and a and b are spinor indices. It is crucial here that there are three flavors and three colors. In fact, the β β condensate wavefunction carries the indices αβρ i jρ = δaα δb − δbα δa and the color and flavor indices are “locked” together. We see that color and flavor must be rotated together to preserve the condensate. This curious “locking” feature of the ground state is similar to other broken-symmetry states in condensedmatter and high-energy physics. In 3 He physics, one has the famous B phase in which spin and orbital rotations are locked together by the condensate. Closer to home, the chiral-symmetry-breaking condensate of QCD at low T and µB , the condensate of left-handed quarks and right-handed anti-quarks, locks the SU(3)L and SU(3)R flavor rotations. It is particularly curious that the condensates in Eq. (9.1) lock the flavor chiral groups, SU(3)L and SU(3)R , together through the intermediary symmetry SU(3)color . Apparently, color–flavor locking provides a new mechanism for the dynamical breaking of chiral symmetry. In this picture, chiral symmetry is not restored at large µB . Why should this pattern of condensates be favored over any other? We shall see that, in a small-coupling (large µ) regime, the pattern (9.1) is a solution of a self-consistent equation. In another language, this means that it is at least a local minimum of the ground-state energy functional. Whether it is a global minimum has not (yet) been rigorously proven, but a simple physical argument can be given: this pattern involves quarks of all nine color–flavor combinations, providing gaps for all of them. This makes the energy of such a ground state lower than that of other possibilities, which leave some quarks ungapped. Before considering some of the dynamics here in greater detail, it is interesting to discuss two features of this new ground state. First, note that the con¯ so it drives a dynamical Higgs densate carries color; the wavefunction is a 3, mechanism. A purist would complain that the notation here is breaking the exact,
9.2 The gap at asymptotically large µ
239
local color symmetry of QCD. In fact, the notation should be understood in the same way as that in which the Higgs mechanism of the standard electroweak model is described: at weak coupling we choose a gauge to do calculations and find the important correlations. This notation is extremely convenient and useful as long as we are considering just weak couplings. Second, baryon number is broken in the superconducting state as well. Of course, ordinary superconductivity breaks electric charge in the same fashion. This does not mean that the state is violating conservation of charge in a fundamental sense, but rather that the condensate makes the charge quantum number indefinite and, by virtue of Gauss’ law, electric charge flows easily into and out of macroscopic volumes. Third, chiral symmetry is broken in the color–flavor-locked phase, but in a very different fashion than in ordinary hadronic matter. In ordinary hadronic matter, right-handed quark fields and left-handed anti-quark fields are correlated by strong attractive forces, a condensate forms, and a mass gap opens in the fermion propagator. When this state is subjected to an environment having an increasing chemical potential, there is less of a tendency to maintain that mass gap. The reason is that, if there is a mass gap, then, as the fermion states are occupied at nonzero, increasing chemical potential, the new fermions must be placed in states of relatively high energy. Eventually the energy cost is too great and the gap vanishes so that the additional fermions can reside at lower energy. In other words, the attractive forces that favor the formation of a gap are overwhelmed by the energetics of high-energy, additional fermions in states above the gap. So, at some critical µB , one would expect restoration of chiral symmetry and the gap to vanish and a sharp Fermi surface to appear in the system. However, this is not the end of the story, because that sharp Fermi surface is unstable with respect to the weakest attraction between the fermions at the top of the Fermi surface, and this new attraction causes its own chiral-symmetry breaking through color–flavor locking. Clearly, even though both phases break chiral symmetry, the mechanisms are totally different (see, however, Section 9.3.4 for another look at this question). Now on to the development of color superconductivity. 9.2 Calculating the gap at asymptotically large µ We shall compute the color-superconducting gap at asymptotically large µ. The Green functions describing quark pairing, and the equations which we need to obtain and solve, contain, besides an energy–momentum dependence, a complicated spin, color, and flavor matrix structure. Instead of writing down the equations for such Green functions straightforwardly and only then trying to simplify them in the limit µ → ∞, we shall choose another strategy. We shall first simplify the Lagrangian and then derive the simplified form of the equation we need
240
Large chemical potentials
from it. In this way we can separate the problem into much more manageable and physically intuitive small steps. 9.2.1 The effective action for quarks near the Fermi surface The key idea is to identify the fields and their corresponding excitations which will dominate the effects of interest when µ is large. These are, typically, the lowest-energy excitations. In the almost free gas of quarks at large µ, they are the excitations corresponding to adding a quark (or a hole) in close proximity to the Fermi surface, where the kinetic energy of the quark | p| is almost cancelled out by the chemical potential µ. In the following we shall identify the fields which are responsible for these excitations and write down the effective action for them. First consider the full Dirac Lagrangian of quarks in QCD in Euclidean space: ¯ µ + ig Aµ )γµ − µγ0 ]# #[(∂
(9.2)
The quark fields # carry Dirac, color, and flavor indices, which we have not written out in Eq. (9.2), and the gluon field is a matrix in color indices. The kinetic term and free spectrum The free-quark kinetic term (i.e. Aµ = 0) for a mode of a given four momentum p in Euclidean space is i (# † )iαa [i p0 + α · p − µ]αβ #βa
(9.3)
¯ 0 and αi = iγ0 γi . First, we see that the Dirac matrix where, as usual, # † = #γ [i p0 + α · p − µ]αβ does nothing to color a and flavor i indices. Color and flavor just “flow through,” and the free action is a sum of actions for quarks of each of Nc × Nf color and flavor combinations. We shall suppress color and flavor indices in the following. Since we are dealing with massless quarks, it is convenient to use the chiral representation of the 4 × 4 Dirac matrices: 0 σi (9.4) αi = 0 −σi where σi are the usual 2 × 2 Pauli matrices. The upper and lower blocks in Eq. (9.4) correspond to spinors of right and left chiralities, respectively. As with color and flavor, the chirality also flows through – the kinetic terms do not mix right and left spinors. We are now left with only 2 × Nc × Nf actions that involve only two-component spinors. Pick the upper two components of the Dirac spinor (right chirality), so the action becomes (# † )α [i p0 + σ · p − µ]αβ #β where the indices α and β now run over two values only.
(9.5)
9.2 The gap at asymptotically large µ
holes
Fermi surface
E
241
ψ quarks µ
0 ψ
|p|
−µ χ
Figure 9.1. The spectrum of free massless quarks at finite µ. The branches with negative energy E correspond to antiparticles, or holes, with energy |E| (dashed lines). The branches corresponding to the fields ψ (E = | p| − µ) and χ (E = −| p|−µ) are marked correspondingly. The dotted line denotes the Fermi surface | p| = µ. For each vector p we can diagonalize the matrix α · p. Denote the eigenvector which satisfies α · p# = +| p|# by ψ. The other eigenvector, corresponding to α · p# = −| p|#, will be denoted χ . Both ψ and χ are only single-component fields. The action for the field ψ is ψ † (i p0 + | p| − µ)ψ
(9.6)
This action describes an excitation with energy E = −i p0 given by E ψ = | p| − µ (see Fig. 9.1). These are excitations corresponding to a free quark, when E ψ > 0, and a free hole, when E ψ < 0. The quark excitations can occur only outside of the Fermi surface | p| = µ, due to the Pauli principle, while the holes can exist only inside. Similarly, the dispersion relation for excitations of χ is E χ = −| p| − µ (see Fig. 9.1). What are these excitations? Their energy is always negative, therefore they describe only antiparticles, with energies | p|+µ. These energies are at least µ, because of the penalty that the chemical potential imposes on an extra antiquark in the system. Thus, if we are interested in the processes which involve energies E much smaller than µ, the effect of such virtual excitations of χ will be suppressed typically by a power of E/µ due to a large energy denominator.1 1 We shall also encounter interesting cases in which the effects of the antiparticles appear at
leading order, because the contribution of particles vanishes or because the contribution of the antiparticles is additionally enhanced by a power of µ.
242
Large chemical potentials
The interaction term and the action Now consider the interaction term in Eq. (9.2): i (# † )iαa [i(A0 )ab + Aab · α ]αβ #βb
(9.7)
In this case only the flavor index “flows through.” The gluon field Aµ is a color Nc × Nc matrix, with color indices a and b. The Dirac matrix is still blockdiagonal in Dirac indices, αβ, i.e. chirality also flows through – see Eq. (9.4). Thus we can split the interaction term into identical 2LR × Nf terms each involving a 2ψχ × Nc spinor. Let us keep this in mind, but not write the color indices explicitly: · σ ]αβ #β ( p) (# † )α (k)[iA0 (q) + A(q)
(9.8)
The indices α and β run over two values only. We also restored the momentum arguments of the fields. For simplicity choose the coordinate system such that p = (0, 0, | p|). Then the fields ψ and χ (eigenvectors of σ · p) correspond to upper and lower components of the spinor #. For a moment neglect the momentum of the gluon, i.e. set q = 0. Then, the momentum of the fermion is not changed, p = k, so # † A · σ # = ψ † A3 ψ − χ † A3 χ + {χ † [A1 + iA2 ]ψ + h.c.}
(9.9)
We see that all terms, except the first one, involve the field χ , which produces contributions typically suppressed by a large energy denominator, as discussed earlier. In the gap equation, diagrams involving the field χ contribute only to the next order in the expansion in powers of E/µ, where E is the typical relevant energy. Therefore, we can neglect these terms for the calculation of the gap to leading order. Finally, what is the effect of nonzero q? Equation (9.9) is valid when the orientation of the eigenvector ψ(k) is the same as the orientation of the eigenvector ψ( p) in the 2-spinor space. For k = p these two eigenvectors are not aligned and additional terms are generated compared with Eq. (9.9). We can estimate the relative magnitude of these terms by calculating the overlap between ψ( p) and ψ(k), which is easiest to do using projectors: ψ( p) =
1 + v p · σ #, 2
where
v p ≡
p | p|
(9.10)
The vector v p is the velocity of the quark of momentum p. We can write for the overlap 1 + vk · σ 1 + v p · σ # 2 2 vk − v p † # · σ ψ( p) = ψ( p)ψ( p) + 2
ψ(k)† ψ( p) = # †
(9.11)
9.2 The gap at asymptotically large µ
243
If q is much smaller than µ, and we shall see that these are the gluon momenta that we need in our leading-order calculation, we can expand vk − v p = O(| q |/| p|). Since | p| ∼ µ for the quark excitations we are focusing on, we conclude that corrections to Eq. (9.9) due to q = 0 are suppressed by a power of q/µ. Thus, they are negligible in our leading-order calculation. where v = v p . On putting together the Now, we can also write A3 = v · A, kinetic term (9.6) and the interaction term for the fields ψ from Eq. (9.9) as well as the coupling to A0 , we finally obtain the effective Euclidean-space action for the quark excitations near the Fermi surface [86]: ψ † ( p)[(i p0 + p )δ pk + g(iA0 (q) + v · A(q))]ψ(k) (9.12) S= p
k
In Eq. (9.12) we used momentum modes of the fields, instead of the usual coordinate-space fields, to simplify the fermion kinetic term and to make Feynman-diagram rules easier to derive. We use the following short-hand notations: d4 p p ≡ | p| − µ, ≡ , δ pk ≡ (2π )4 δ 4 ( p − k) (9.13) (2π )4 p Note that the interaction with the gluon field has the very simple form of the One coupling of a charge density ψ † ψ to A0 and current density ψ † vψ to A. should bear in mind that the action (9.12) is applicable to modes of the field ψ describing quark/hole excitations near the Fermi surface, and that the momentum q carried by the gluon should be much smaller than µ in order to ensure that both p and k are in the vicinity of the Fermi surface. We shall see below that such soft gluons give the leading contribution to the gap due to a logarithmic singularity when |q| → 0. 9.2.2 The strategy of the calculation and the meaning of the gap How do we determine whether the phenomenon of BCS-like fermion pairing/condensation is occurring and how do we quantify the strength of this phenomenon? The general strategy is based on the following observation. If pairing occurs the ground state of the system is modified compared with the “naive” perturbative one. In the Lagrangian or path-integral formulation of the field theory, we can detect it by evaluating certain Green functions, which are vacuum expectation values of fields. We look at functions that would be zero in the perturbative vacuum. In our case we shall look at the following two-point function: ψ( p)ψ(− p). This fermion–fermion propagator is “anomalous” in the sense that it must be zero in the naive vacuum, by virtue of conservation of fermion number, but if the vacuum contains a condensate of fermion pairs, such
244
Large chemical potentials
a fermion-number-violating process becomes possible and the Green function becomes nonzero. In a more common case of spontaneous symmetry breaking, the vacuum expectation value of a scalar field is also such an anomalous (onepoint, in this case) Green function. However, how can the Green function be nonzero if we are able to calculate it only in perturbation theory? The answer is that we must go beyond perturbation theory to evaluate this function. In principle, such a calculation requires resummation of an infinite number of Feynman diagrams. The standard method of doing this was introduced in the context of relativistic Lagrangian field theory by Nambu and Jona Lasinio. We assume some trial form of our anomalous Green function and use it in all perturbation-theory calculations. How do we determine whether our particular assumption is correct? We do that by calculating the very same Green function itself. By equating the calculated value of the function to the trial function we obtain a self-consistency equation for the Green function – often called the gap equation, terminology borrowed from BCS. This idea is akin to the mean-field approximation in spin systems. Obviously, the gap equation always has a trivial solution, corresponding to the naive perturbative value of the Green function. This corresponds, in Hamiltonian language, to the fact that the trivial vacuum, although it is not the most energetically favorable, is still a local extremum of the energy. The nontrivial solution, if and when it exists, signifies that another such extremum exists. Whether this is a more favorable vacuum can be verified by calculating the energy (given by vacuum, or bubble, graphs). This procedure can, in principle, be carried out to arbitrary order in perturbation theory, but it is seldom practical beyond the leading order. The equation is nonlinear in the Green function already at leading order, and its solution is a function of the coupling constant g, which is often nonanalytic at g = 0. This reflects the fact that no finite number of Feynman graphs can reproduce this effect. It also means that corrections to the leading-order value of the Green function are not necessarily organized as an expansion in powers of g. However, it is possible to show that they become smaller as g → 0, thus justifying the leading-order result. From the point of view of the perturbative expansion of the path integral, we can also think of this procedure in the following equivalent way (which is the original view of Nambu and Jona Lasinio). The perturbative expansion is set up by separating out a bilinear, “free” part L0 from the bare Lagrangian L: L = L0 + LI . The remaining, “interacting” part of the Lagrangian, LI , is then used to generate vertices, while L0 provides propagators. Since the anomalous function is a two-point function, using it in perturbative calculations amounts to adding a certain bilinear term into the Lagrangian L0 . To prevent changing the path integral, one must incorporate the same bilinear into LI with the opposite sign – a counterterm. This counterterm generates an additional two-point vertex.
9.2 The gap at asymptotically large µ
245
The result of the calculation will not depend on how we separate the Lagrangian into the interacting and free parts. Let us now choose a trial separation, giving a nonzero value for the anomalous Green function. This can be achieved by bringing a term p ψ( p)ψ(− p) into L0 . We can now calculate corrections to ψ( p)ψ(− p) in our modified perturbation theory. If we can find a value for p that makes the corrections vanish, order by order in perturbation theory, we will have found the exact value of p . Now we are almost ready to write the gap equation, but before doing this let’s clarify the meaning of the parameter p . This will not be necessary to derive or solve the equation, but it will help us understand what we are trying to calculate. At this point, we should pay some attention to the color–flavor j structure of the condensate ψai ψb . As we discussed in the introduction to this section, the pairing occurs between the quarks in the anti-symmetric color (as j well as flavor) channel. Thus the fields ψai and ψb must always have different j colors and different flavors. Denote them ψ1 = ψai and ψ2 = ψb . The bilinear Lagrangian, consisting of the bilinear parts of Eq. (9.12) for fields ψ1 and ψ2 with the term p ψ1 ( p)ψ2 (− p) + h.c. added, can be written as
ψ1 ( p) ψ2∗ (− p)
†
i p0 + p p
− p −i p0 + p
ψ1 ( p) ψ2∗ (− p)
(9.14)
Thus, the anomalous propagator is given in terms of the function p as ψ1 ( p)ψ2 (− p) =
p02
p + 2p + 2p
(9.15)
The normal propagator is also modified: †
†
ψ1 ( p)ψ1 ( p) = ψ2 ( p)ψ2 ( p) =
−i p0 + p + 2p + 2p
p02
(9.16)
The poles of this propagator determine dispersion relations for excitations. The energy of propagating excitations, E = −i p0 , satisfies −E 2 + 2p + 2p = 0. For simplicity, let us neglect the dependence of p on p. This is usually (and we shall see it explicitly in our case) a good approximation in the range of values of p0 or p of order p . Denote the value of p for momenta p near the Fermi surface by 0 . Then we have two branches: E = ± 2p + 20 = ± (| p| − µ)2 + 20 (9.17) There is a gap between these two branches of the spectrum, equal to 20 (see Fig. 9.2). It takes a finite energy 0 to excite either a quark or a hole in the system.
246
Large chemical potentials ψ2∗
E
0
E
ψ1
0
| p| − µ
uψ1 + vψ2∗ ∆0 ∆0
|p| − µ
Figure 9.2. The spectrum of the quark and hole excitations near the Fermi surface. The left-hand graph is a zoom-in on the region around | p| = µ, E = 0 in Fig. 9.1. The spectrum of the field ψ2∗ is inverted due to charge conjugation. On the right-hand side, the spectrum is given by Eq. (9.17). The excitations are now quanta of a field, which is a linear superposition of ψ1 and ψ2∗ , as indicated. The excitations of the fields ψ1 and ψ2∗ mix with each other because of the anomalous propagator (9.15). The fields, corresponding to the energy eigenstates in Eq. (9.17), can be found by diagonalizing the matrix (9.14). For the positive-energy branch it is given by ψ+ = u p ψ1 + v p ψ2∗ ,
with
vp = tan φ p up
and tan(2φ p ) =
p (9.18)
Near the Fermi surface, p = 0, the mixing is maximal, u p = v p , as it should be because the energy levels cross there (see Fig. 9.2).
9.2.3 The gap equation The fermion propagator: two flavors First consider the case of two flavors: Nf = 2. The calculation in the case of three flavors is similar, but requires a little more care to set up because of its more complicated color–flavor mixing. As we discussed, the condensate should be anti-symmetric in flavor and color. Such a pairing can be achieved if the anomalous term in the Lagrangian has the following flavor–color form: p i j ab3 ψai ( p)ψbi (− p)
(9.19)
We have arbitrarily chosen the color direction of the condensate to be 3. Let us use letters u and d to denote fields of the two flavors: u a = ψa1 and da = ψa2 , where a is the color index. Then the term (9.19) induces pairing similar to that described by Eq. (9.14) for pairs of fields u 1 d2 and u 2 d1 . The propagators in each pair are the same as in Eqs. (9.15) and (9.16), with ψ1 ψ2 substituted for
9.2 The gap at asymptotically large µ
247
q k a i
ab3 ij
b
a
j
i
p A Taa
−p a b 3 ij
−k TbbA
b j
Figure 9.3. The diagrammatic representation of the gap equation. The “flow” of momentum as well as color and flavor indices is indicated. u 1 d2 or u 2 d1 . Therefore, the anomalous Green function can be written as i p j i j ab3 (9.20) ψa (− p)ψb ( p) = 2 p0 + 2p + 2p The case of three flavors is a little more complicated because the anomalous term has the form j
p i jk abk ψai ( p)ψb (− p)
(9.21)
and mixes each flavor with two other flavors, e.g. there is a term u 1 d2 as well as u 1 s3 , where sa = ψa3 . Below we shall discuss a trick that helps us determine the propagator in this case. First consider the simpler case of two flavors. The equation Using the Lagrangian (9.12), introducing the bilinear term with a trial function p as in Eq. (9.19), and applying the self-consistency condition to leading order in the coupling g, we obtain the following gap equation: p 2 2 Dµν ( p − k)vµ ( p)vν (− p) (9.22) k = − g 2 2 2 3 p p0 + p + p where the factors vµ ( p) ≡ (i, v p ) and vν (− p) = (i, − v p ), come from the fermion-gluon vertex ψ † vµ Aµ ψ in Eq. (9.12) and Dµν is the gluon propagator. Equation (9.22) is graphically represented in Fig. 9.3. The fermion propagator has the form given in Eq. (9.20) and we have removed the identical factors i j ab3 from the right- and left-hand sides of Eq. (9.22). The fact that the flavor and color tensor factors cancel out in Eq. (9.22) is not entirely trivial, and signifies that our choice of the flavor and color structure in Eq. (9.19) was correct – had we chosen a different structure, we might not have been able to satisfy the gap equation. The flow of the flavor indices (see Fig. 9.3) is trivial, because the quark–gluon vertices are “transparent” to flavor. However, color indices require the following algebraic calculation, the result of which is the factor − 23 in Eq. (9.22).
248
Large chemical potentials A Taa
a
b
a
k
p
−k
−p TbbA
b
Figure 9.4. The one-gluon exchange between quarks of opposite momenta. Flow of color and momentum is indicated. On glueing the quark lines on the right with an anomalous quark propagator one obtains the diagram in Fig. 9.3 The color indices in the quark–gluon vertex in Fig. 9.3 come from the interaction term in Eq. (9.12), ψ † vµ Aµ ψ, if we recall that Aµ is a matrix in color space, which can be written in the basis of eight color generators T A as Aµ = AµA T A , where the index A = 1, . . ., 8 runs over eight gluon colors. The traceless 3 × 3 matrices T A are normalized as 1 tr(T A T B ) = δ AB (9.23) 2 The color factors in the diagram on the right-hand side of the gap equation in Fig. 9.3 then give 1 1 Nc + 1 2 A A δab δa b − Taa Tbb a b 3 = δaa δbb a b 3 = − ab3 = − ab3 2 Nc 2Nc 3 (9.24) In Eq. (9.24) we have used the identity2 1 1 TaaA TbbA = δab δa b − δaa δbb 2 2Nc
(9.25)
The factor − 23 in Eq. (9.24) carries information about the strength and the sign of the interaction between the two quarks due to the exchange of a gluon. This can be seen by drawing the quark lines in Fig. 9.3 in the way shown in Fig. 9.4. If we assume that p is positive, the right-hand side of the gap equation (9.22) is manifestly positive – the gluon propagator is positive in Euclidean space and symmetric with respect to µ ↔ ν, i.e. −Dµν vµ ( p)vν (− p) = D00 + Di j vi v j > 0
(9.26)
2 This identity is easy to derive using the fact that any N × N matrix can be represented in the c c √ basis of matrices T A , completed with the unit matrix (normalized by a factor 1/ 2Nc ). This means that, for an arbitrary matrix X , we can write X √ = 2 tr(X T A ) T A , where the summation over A = 0, 1, . . ., 8 also includes the matrix T 0 = 1/ 2Nc .
9.2 The gap at asymptotically large µ
249
where v ≡ v p ≡ p/| p|, as before. Like the cancelling out of the color tensors, the consistency of the overall sign in Eq. (9.22) is also an indication that our choice of the trial anomalous term (9.19) was correct. Indeed, suppose for a moment that we had chosen the trial gap term to be symmetric in color (this would ¯ Then, on repeating the correspond to a color representation 6 as opposed to 3). algebra in Eq. (9.24) with a symmetric tensor instead of ab3 , we would have found that, although the tensor structure is preserved, the factor is now positive, (Nc − 1)/(2Nc ) = + 13 . This is because the members of a pair of quarks in a symmetric color representation repel each other. Also note that the signs of the electric and the magnetic terms in Eq. (9.26) being the same reflects the fact that the quarks which pair, according to Eq. (9.19), are moving in opposite directions, and thus, as in electrodynamics, if they attract electrically, they also attract magnetically. Pointlike interaction For a moment imagine that the gluon does not propagate, i.e. it provides a pointlike interaction, both in space and in time. In other words, consider a propagator that is independent of the 4-momentum p − k: Dµν = δµν /2 , where is some constant mass parameter. Then Eq. (9.22) takes the form p 4 g2 k = (9.27) 2 2 3 p p0 + 2p + 2p Since the right-hand side does not depend on k now, we conclude that k ≡ is constant. The integral over p can be done. We see that, when µ, the integrand is appreciably nonzero only for p0 and p of order , and that the integral diverges logarithmically when → 0. This justfies our use of the effective action (9.12) as long as we limit ourselves to logarithmic precision. Within this precision, we can write the integral in the following way, by using the fact that it is dominated by a thin shell around the Fermi surface: d p0 d p d' p d4 p µ2 ≡ = (9.28) 4 2 (2π ) 2π 2π 4π p We find, to logarithmic precision, )µ* µ2 4 g 2 µ2 4 g 2 = d p = ln 0 2π 2 3 2 2π 2 3 2 2 p 2 + 2
(9.29)
0
where we cut off the intergal in the ultraviolet at a scale of order µ, since at this scale our approximate action (9.12) breaks down. The nontrivial solution to the gap equation (9.29) is thus 3π 2 2 1 = µ exp − (9.30) 2µ2 g 2
250
Large chemical potentials
with exponential precision. This result – an exponentially small value of the gap exp(−constant/g 2 ) is familiar from BCS theory, in which the role of an approximately pointlike interaction is played by phonon exchange. We should note two features of the solution (9.30). First, the gap is nonanalytic in the coupling g, so it is not expandable in powers of g. Second, the gap is exponentially small at small coupling µ (we assume that µ < ). Both features will remain qualitatively true when we include the effect of the non-locality of gluon exchange. Long-range magnetic-gluon exchange In QCD, we have to take into account the momentum q = p − k dependence of the propagator Dµν , i.e. the long range and the temporal retardation of the interaction. This can be done to logarithmic precision in Eq. (9.22). If we take ≈ | p| ≈ µ, the tree-level propagator Dµν (q) ∼ 1/q 2 and specialize to |k| we can write 1 (9.31) Dµν ( p − q) ∼ ( p0 − k0 )2 + 2µ2 (1 − cos θ ) The integral over this angle in Eq. (9.22) where θ is the angle between p and k. is logarithmically divergent at θ = 0 when p0 = k0 and is cut off at θ ∼ q0 /µ. This means that small-angle scattering of the quarks dominates, and it will saturate the result to logarithmic precision. The right-hand side of the gap equation now has an additional logarithmic divergence, a collinear one, apart from the usual BCS logarithmic divergence in Eq. (9.29). This will turn out to change the coupling-constant dependence of the gap. To calculate this dependence correctly, one needs to go beyond the free-gluon propagator. The reason is that the cutoff θ ∼ q0 /µ may become irrelevant if q0 is too small, and we must take into account the effect of the gluon polarization tensor "µν in the medium of quarks. Although the polarization tensor is of order g 2 , its effect on small-angle scattering can become dominant if q0 is sufficiently small, typically when q02 < "µν . For example, Debye screening will provide the cutoff for the scattering angle θmin ∼ | q |min /µ ∼ g, which becomes relevant when q0 < gµ. We shall need to consider the gluon propagator only in the regime q0 | q |, for, as we shall see, the logarithmically dominant contribution to the gap equation is accumulated in that regime. In this case, the propagator, with the effect of the polarization included and to the precision we require, can be written in Coulomb gauge as3 1 D00 (q) = (9.32) 2 | q | + g 2 µ2 The actual coefficient of the Debye mass term g 2 µ2 is not unity as in Eq. (9.32) 3 We shall show how to derive this result and discuss the issue of gauge invariance below.
9.2 The gap at asymptotically large µ
251
but its precise value does not affect the result in the logarithmic approximation. To the same precision, Di j (q) =
δi j − qˆi qˆ j | q |2 + g 2 µ 2
q0 | q|
(9.33)
where qˆi ≡ q/| q |. The last term in the denominator in Eq. (9.33), whose actual numerical coefficient is also unimportant, describes Landau damping of the spacelike (| q | q0 ) gluon modes. Note that it causes screening of the slowly varying magnetic fields and vanishes for static fields (q0 = 0) – this is dynamical screening. It cuts off fields at the wave vector | q | < (g 2 µ2 q0 )1/3 . For q0 gµ this screening scale is much lower than gµ. Thus the collinear logarithmic divergence of the magnetic interaction is cut off much closer to the singularity than is the electric, Coulomb, interaction. In other words, the leading contribution to the collinear logarithmic singularity is from the magnetic gluons, as long as q0 gµ. Retaining only the contribution of the magnetic gluons, one arrives at p 1 2g 2 (9.34) k = q0 2 2 2 3 p p0 + p + p | q |2 + g 2 µ2 | q| The electric gluons contribute significantly only when q0 > gµ. We shall return to the contribution of electric gluons later and see that their effect is indeed subleading in the limit g → 0. The integral in Eq. (9.34) is over the energy p0 , magnitude of momentum | p|, or p = | p| − µ, and direction of momentum ' p and can be approximated as in Eq. (9.28). Taking the integral over p first, using the method of residues: d' p p 1 2g 2 µ2 (9.35) d p0 k = q 2 3 2π 4π 2 p 2 + 2 | 2 + g 2 µ2 0 q | p 0 | q| The gap on the right-hand side, which was initially a function both of p0 and of | p| ( p = | p| − µ) is now a function of p0 alone, since, as a result of integration over p we have put p on the mass shell: p02 + 2p + 2p = 0, i.e. | p| is a function of p0 in Eq. (9.35). The integral over the angle ' p can also be taken, in logarithmic approximation, giving p 1 2g 2 µ2 µ k = (9.36) d p0 ln 2 3 2π 2 (g 2 µ2 q0 )1/3 2 p02 + 2p 2µ The argument of the collinear logarithm in Eq. (9.36) is equal to 1/θmin , where θmin ∼ | q |min /µ ∼ (g 2 µ2 q0 )1/3 /µ is the scattering angle at which magneticgluon exchange is cut off by Landau damping. The integral over p0 adds one
252
Large chemical potentials
more logarithmic divergence, which has the same origin as in the usual BCS case. One can divide the interval spanned by | p0 | into four regions, by order of magnitude: (0, 0 ), (0 , |k0 |), (|k0 |, gµ), and (gµ, µ). In the first region, the integral does not contribute to the BCS logarithm (the singularity is regularized by in the denominator). In the second and third regions, we can replace |q0 | = | p0 − k0 | by the largest of the momenta: |k0 | in the second region and | p0 | in the third, i.e. to logarithmic precision we have |q0 | = max(| p0 |, |k0 |). To the √ same precision we can also neglect compared with p0 in p02 +2p =| p0 |. The last region (gµ, µ) gives a parametrically small contribution, as we shall see shortly, and we can ignore it in the leading logarithmic approximation. Within the same precision we can also neglect the power of g under the logarithm in Eq. (9.36): p p 1 µ g 2 1 k0 µ µ k = + (9.37) d p0 ln d p0 ln 2 6π 3 0 p0 k0 3 k0 p0 p0 It is convenient to introduce logarithmic variables: µ µ µ x = ln , y = ln , x0 = ln , k0 p0 0 where 0 ≡ p0 =0
(9.38)
The variables x and y measure the “order of magnitude” of k0 and p0 on the scale between 0 and µ. The value 0 is the value of the actual gap near the Fermi sphere (see Fig. 9.2). As we shall see at the end of this calculation, the gap is exponentially smaller than µ when g → 0, i.e. the total number of e-foldings between 0 and µ is proportional to 1/g in this limit (see Eq. (9.45)). This fact should make it clear why the contribution of the scale interval (gµ, µ) is small. It spans only ln(1/g) e-foldings, which is parametrically a negligible number compared with 1/g (see also Fig. 9.9). Then g 2 1 x0 1 x (x) = dy (y)x + dy (y)y (9.39) 6π 2 3 x 3 0 where we introduced (x) ≡ k with x from Eq. (9.38). This integral equation can be transformed into a differential equation with boundary conditions and solved. First note that Eq. (9.39) implies that (0) = 0
(9.40)
– i.e. the pairing strength, or gap, vanishes far from the Fermi surface, at the scale of order µ. Taking the derivative of Eq. (9.39) gives x0 g2 dy (y) (9.41) (x) = 18π 2 x
9.2 The gap at asymptotically large µ
253
This equation implies another boundary condition: (x0 ) = 0
(9.42)
– the gap does not vary with energy near the Fermi surface. Now, on differentiating once again, we find (x) = − which has a two-parameter solution,
g2 (x) 18π 2
g (x) = A sin √ x + φ 3 2π
(9.43) (9.44)
The boundary conditions (9.40) and (9.42) should be used here. Equation (9.40) means that φ = 0. The condition (9.42) means that the gap (9.44) reaches its maximum at x = x0 , i.e. A = (x0 ). It may seem that we cannot fix the amplitude A because the differential equation we are solving, Eq. (9.43), as well as the boundary conditions (9.40) and (9.42) are homogeneous. However, Eq. (9.42) means that the argument of the √ sine, gx/(3 2π ), is equal to π/2 at x = x0 , i.e. 3π 2 x0 = √ 2g
(9.45)
However, the value of x0 is the number of e-foldings between the scales of the gap 0 and µ, Eq. (9.38), i.e. [87–89] 3π 2 −x0 (9.46) = µ exp − √ 0 = µe 2g Since 0 = (x0 ) = A, the energy dependence of the gap reads finally g µ (9.47) p = 0 sin √ ln | p0 | 3 2π which is sketched in Fig. 9.5. We see that the appearance of the collinear logarithm in addition to the usual BCS one changed the power of 1/g in the exponent of the gap from 1/g 2 in Eq. (9.30) to 1/g in Eq. (9.46). Another notable consequence of the nonlocality of the interaction is the nonnegligible energy dependence of the gap in Eq. (9.47). Electric gluons and higher-order corrections What is the effect of the electric gluons described by the propagator (9.32)? As we discussed, the collinear divergence in the Coulomb propagator (9.32) is regulated by Debye screening at the scale | q | ∼ gµ, i.e. electric gluons contribute
254
Large chemical potentials
∆0
0
ln g1
p0 µ gµ (log scale) Far from Fermi surface
x0 =
c g
x = ln pµ0
∆0 Near Fermi surface
Figure 9.5. The energy dependence of the gap in the leading logarithmic approximation, Eq. (9.47). Note also that the energy dependence in the tiny interval (gµ, µ) should be corrected for the effects of the electric gluons, if the terms of order ln(1/g) are to be retained together with terms of order 1/g as in Eq. (9.48). to the collinear logarithm only when q0 is in the interval (gµ, µ). This means that at least p0 or k0 must lie in this interval. In terms of the logarithmic variable x, Eq. (9.38), the size of the corresponding interval is ln(1/g), compared with the total scale interval x0 = 1/g. One can therefore expect the effect of the electric gluons to (a) modify the function (x) significantly in the small interval (0, ln(1/g)) and (b) shift the value of x0 in Eq. (9.45) by a correction of order ln(1/g). We leave the detailed calculation as an exercise to the reader. We note only that the corrections of order ln(1/g) from the magnetic gluons, which we neglected, must also be included at this level of precision. The result is [87] 3π 2 1 x0 = √ − 5 ln (9.48) g 2g where we neglected terms parametrically smaller than ln(1/g). Terms ln(1/g) and higher produce pre-exponential multiplicative corrections to the value of the gap at the Fermi surface: 3π 2 −x0 −5 (9.49) 0 = µe = µg exp − √ 2g We also note that, since electric gluons provide extra attraction, they increase the gap, as is implied by the sign of the correction in Eqs. (9.48) and (9.49). Of course, Eq. (9.49) is good only up to an unknown constant factor. The estimates of higher-order corrections to the gap equation indicate that neither the coefficient of 1/g nor that of the logarithm in Eq. (9.48) is modified
9.2 The gap at asymptotically large µ
255
by such corrections [90]. Higher-order corrections contribute only additional sub-leading terms, which are finite as g → 0, to Eq. (9.48). Consider, for example, corrections that can be absorbed into the renormalization of the coupling g. These corrections amount to a replacement g → g + O(g 3 ln(µ/)). On performing such a shift of the coupling in Eq. (9.48) and accounting for the fact that the large renormalization logarithm ln(µ/) is of order 1/g, we see that x0 is modified by at most O(g 0 ). The precise value of the next-order, constant, term in Eq. (9.48) is unknown at the time of writing. This is especially unfortunate, because a constant term in Eq. (9.48) translates into a constant factor in Eq. (9.49) that determines the absolute size of the gap. The case of three flavors As we have already mentioned, the case of three flavors is a little more involved because the anomalous, or pairing, term (9.21) in the Lagrangian pairs each quark flavor with two others. In order to obtain the anomalous quark propagator one can use the following trick. Instead of ψai , use a different basis for the Nf × Nc = 9 quark fields, ψ A , such that ψai
8 √ = 2 (T A )ia ψ A ,
ψA =
i.e.
√ A 2 (T )ai ψai
(9.50)
ia
A=0
The term (9.21) is diagonal in the new basis: j
i jk abk p ψai ( p)ψb (− p) = 2[tr T A tr T B − tr(T A T B )] p ψ A ( p)ψ B (− p) = (3δ A0 δ B0 − δ AB ) p ψ A ( p)ψ B (− p) (9.51) ≡ − Ap ψ A ( p)ψ A (− p) where in the last term we introduced & for A = 1, . . ., 8 A ≡ −2 for A = 0
(9.52)
Since the normal term in the Lagrangian remains diagonal in the new basis, i
A
A
ψ † a ψai = 2 tr(T A T B ψ † ψ B ) = ψ † ψ A
(9.53)
we can now easily find the propagator (both the normal and the anomalous one): A
ψ † ( p)ψ B ( p) = δ AB ψ A ( p)ψ B (− p) = δ AB
p02
−i p0 + p + 2p + ( Ap )2
A p02 + 2p + ( Ap )2
(9.54)
256
Large chemical potentials
We see that all nine quarks ψ A acquire gaps. These gaps are not all equal, however. Eight of the quarks acquire the same gap but one has the gap 2. This fact creates a complication when we try to close the gap equation, as we shall see soon. Rewrite the anomalous propagator in Eq. (9.54) in the original basis of ψai : 8 j ψai ψb = 2 TiaA T jbB
A p02 + 2p + ( Ap )2 A=0 = − i jk abc 2 p0 + 2p + 2p 2 1 2 − δia δ jb − 2 3 p02 + 2p + 42p p0 + 2p + 2p
(9.55)
The last term, proportional to δia δ jb , corrects for the fact that, in the first term, proportional to i jk abc , we have put all denominators equal. We see that, although the anomalous term in the Lagrangian (9.21) is proportional to i jk abc , the anomalous propagator (9.55) is not. After we have substituted the propagator into the gap equation in Fig. 9.3, the right-hand side of this equation will not be proportional to i jk abc . The first term in Eq. (9.55), sandwiched by the generators T A from the gluon vertices, will give the term i jk abc , as in the two-flavor case (simply because it is anti-symmetric in color indices, see Eq. (9.24)). However, the second term, δia δ jb , will produce a mixture of terms symmetric and antisymmetric with respect to a ↔ b. This means that already at the one-loop level the anomalous term is a mixture of an anti-symmetric term (9.21) and a symmetric term, j δai δb + (a ↔ b). In other words, the quarks pair in a mixture of color 3¯ and color 6 states. To account for this we should set up two equations for two gaps. However, to the leading logarithmic precision with which we are working, this is not needed. Indeed, the last term in the propagator (9.55), which is responsible for the colorsymmetric admixture in the gap equation, does not produce the BCS logarithmic divergence ln(µ/). Thus the symmetric (color 6) pair condensate is suppressed by a factor 1/ ln(µ/) ∼ g, and its effect can be neglected to the order with which we are working at large µ. In this case the gap equation closes and we obtain the same equation for as in the two-flavor case. Debye screening and Landau damping Here we shall derive the expression for the gluon propagator Dµν which we needed for the calculation of the gap. We can use the same effective action (9.12)
9.2 The gap at asymptotically large µ
257
ψ
χ (a)
(b)
Figure 9.6. Quark loop diagrams contributing to the gluon polarization tensor "µν . Thin lines denote propagation of the field ψ, while the thick line is the propagator of the field χ – see Fig. 9.1. When the momentum running in the loop is near the Fermi surface, and the energy is small, the χ field is far off shell and its propagator can be effectively shrunk to a point, producing a local contribution to "µν . Only diagram (a) contributes to "00 , see Eq. (9.56), while "i j receives contributions both from (a) and from (b), see Eqs. (9.59) and (9.63).
to calculate the one-loop polarization operator: 1 1 "00 (q) = N g 2 i p + i( p + q 0 p 0 0 ) + p+q p θ ( p ) − θ ( p+q ) d' d' v · q N g 2 µ2 N g 2 µ2 = = d p 2π 2 4π iq0 + p+q − p 2π 2 4π iq0 + v · q 2 2 q0 q0 Ng µ 2 = m (9.56) f f = D 2π 2 | q| | q| where the number N = 2Nf Nc counts the two chiralities, Nf flavors, and Nc colors of quarks running in the loop in Fig. 9.6(a), and the Debye mass is given by m 2D =
g 2 µ2 N g 2 µ2 = N N f c 2π 2 π2
q | is given by The dimensionless function f of the ratio q0 /| x +i ix = 1 + π x + O(x 2 ) f (x) = 1 + ln 2 x −i
(9.57)
(9.58)
The difference θ( p ) − θ ( p+q ) appears in Eq. (9.56) because the integral is zero when both poles are on the same side of the real axis. It has a simple meaning as the difference n p+q − n p of two occupation numbers, step-functions at zero temperature. We also used the fact that | q | | p| ∼ µ to neglect | q |/µ in p+q − p = v · q. In other words we are considering the response of a very dense medium, in which the typical Fermi momenta of particles are large (hard) with respect to a very slow (soft) variation of the external field A0 . Such a regime, and the corresponding approximation, when the momenta of particles inside the
258
Large chemical potentials
loop are much harder than the external momenta, is known as hard dense loops (HDL). Note that, in this regime, | q | µ, only the fermions in a thin shell around the Fermi surface contribute to "00 – this shell is cut out by the θ functions in Eq. (9.56). This has a simple reason: screening requires rearrangement of charge carriers, which is impossible due to Pauli blocking anywhere except in the vicinity of the Fermi surface. For the same reason the value of "00 is proportional to µ2 , i.e. to the area of the Fermi surface. We neglected the modification to the fermion propagator due to the presence of the gap (9.16). This does not change the result within the precision we are after. The effect of is to “smear” the θ functions in Eq. (9.56) into distributions n p with a typical width of the crossover near | p| = µ from 0 to 1 of order . In the regime in which the photon propagator is employed in our calculation of the gap, | q | , the width of this crossover is negligible. For the spacelike components of the polarization operator, the contribution of the near-surface modes described by the effective action (9.12) is, similarly, d' v · q g 2 µ2 ˜ "i j = N vi v j 2π 2 4π iq0 + v · q 1 2 3 2 1 1 1 1 2 − f − x f qˆi qˆ j = m D − + f + x f δi j + 6 2 2 2 2 2 q0 x ≡ (9.59) | q| ˜ i j since The correct value of the polarization operator, "µν , cannot be equal to " the transversality of the polarization operator qµ "µν = 0 requires that "i j = −qˆi qˆ j a 2 "00 + G(δi j − qˆi qˆ j )
(9.60)
where the function G is arbitrary, not constrained by the requirement of transversality. In other words, only the spatially transverse part of "i j is independent; the longitudinal part is related to "00 . Using Eqs. (9.59) and (9.56), we find that ˜ does not satisfy Eq. (9.60), but, if "=" ˜ ij + "i j = "
m 2D δi j 3
then Eq. (9.60) is satisfied. The function G is given by 1 1 1 G = m 2D − + f + x 2 f 2 2 2
(9.61)
(9.62)
Why are we missing the term (m 2D /3)δi j in Eq. (9.59)? As we shall see, this term comes from the contribution of the modes in the bulk of the Fermi sphere. These
9.2 The gap at asymptotically large µ
259
modes are already integrated out when we use the effective action (9.12) for near-surface modes. The effect of these modes is local, i.e. momentumindependent, and can by accounted for by a counterterm in the effective action: (m 2D /6)Ai Ai . This counterterm comes from the diagram in Fig. 9.6(b), where one of the two fermion lines describes the propagator of the field χ . The vertices of this diagram arise from the last terms in the curly brackets in Eq. (9.9). As we have seen, the excitations created by the field χ carry large negative energy, −| p| − µ, i.e. describe antiparticles that can be created virtually for a very short time if the available energy is small compared with µ. Thus, on the scale of the effective theory the thick line in Fig. 9.6(b) can be effectively shrunk to a point, creating a local counterterm. The diagram can be evaluated easily using the method of residues to do the integral over p0 first: 1 1 2N g 2 (δi j − vi v j ) | − µ) i p0 + (| p| + µ) p i p0 + (| p d3 p 1 m 2D − v v ) = (9.63) = 2N g 2 (δ δi j ij i j 3 | 3 | p|<µ (2π ) 2| p The overall factor of two accounts for the possibility of either line in the diagram being thick. The last integral is restricted to momenta inside the Fermi sphere – only for those momenta will the two propagator poles be on opposite sides of the real axis. The denominator 2| p| is the difference between the energies of the particle and antiparticle states at the same momentum p. The factor δi j − vi v j accounts for the fact that, according to Eq. (9.9), only two components of the gluon field (A1 and A2 – orthogonal to v = (0, 0, 1)) can flip a positive-energy quark into a negative-energy quark. Finally, to obtain the gluon propagator Dµν , we need to invert the matrix: 1 q 2 δµν − qµ qν + "µν + q¯µ q¯ν ξ
(9.64)
where the last term is the gauge-fixing term and q¯µ = (0, q). We choose the Coulomb gauge.4 On inverting Eq. (9.64) and setting ξ → 0, we obtain D00 =
q2
1 ; + "00
Di j = (δi j − qˆi qˆ j )
q2
1 +G
(9.65)
q |, the function In the regime relevant for our calculation of the gap, q0 | "00 tends to a constant, the square of the Debye mass (9.56). The electrostatic 4 To the order in which we are working the gap is gauge-invariant. This is because, in our calcu-
lation, both quarks p and k in Fig. 9.3 are on the mass shell. A gauge-dependent term qµ qν in the propagator will produce a factor iq0 + q · v in the vertices, which vanishes, since it is (up to O(| q |/µ)) equal to (ik0 + k ) − (i p0 + p ), which is zero when p and k are on the mass shell. Note that only the gauge invariance of the value of the gap on the mass shell is ensured.
260
Large chemical potentials
gluon fields are screened on the length scale of 1/m D , according to Eq. (9.65). The function G is given by, expanding Eq. (9.62), 2 q0 π 2 q0 +O (9.66) G(q) = m D 2 | q| | q |2 Note that, if we rotate back to Minkowski space, q0 → iq0 , this term becomes imaginary, as it should since it is the Landau damping term. Since Landau damping provides screening only for the dynamical magnetic fields, we might worry that the effect of screening due to the Meissner effect is significant, since it screens even static fields. We neglected this effect when we replaced the full propagator, including the anomalous one, by the free-fermion propagator in the calculation of "i j . The contribution of the Meissner effect, however, is sub-leading, as are many other contributions that we neglected in our leading-order calculation. The reason is that the integral over q0 in the gap equation is (logarithmically) dominated by the region where q0 . Thus, dynamical screening operates on the scale of | q | ∼ (q 0 g 2 µ2 )1/3 (g 2 µ2 )1/3 , which is much larger than the scale g on which Meissner screening becomes significant. Thus the momentum region where the Meissner effect is significant does not contribute to the collinear logarithm and can affect the result only in the next order in 1/ ln(µ/) ∼ g. 9.3 Lowest excitations of the CFL phase Our discussion in the previous section was concerned with the ground state of QCD at large baryochemical potential µ. We saw that, in this regime, the state in which the quarks simply fill all available levels inside the Fermi sphere is unstable. An arbitrarily small coupling to gluons creates a more energetically favorable state, which we can characterize by the presence of an anomalous Green function ψψ. We also saw that the spectrum of quarks acquires a gap of order . In this section we shall address the following question: what are the lowest-lying excitations of such a ground state? Motivation One reason why we need to understand the properties of the exitations is because some potentially observable consequences of the color superconductivity depend on the spectrum of excitations. For example, the rate of cooling of a neutron star depends on the specific heat, which, in turn, is a measure of the number of excitable degrees of freedom at a given temperature. At small temperatures, the lowest-lying excitations dominate the specific heat. There is also another, not completely unrelated, conceptual reason why we want to understand the excitations. The reader must already have noted that most of our discussion of the ground state has been phrased in terms of quantities
9.3 Lowest excitations of the CFL phase
261
that are not gauge-invariant, and thus unobservable. This is not an uncommon situation for a gauge theory. In perturbation theory, the language based on quarks and gluons (or leptons and vector bosons, in electroweak theory), is appropriate in the situations in which the effective gauge coupling is small, as it is in our case. The reason for this is that, as long as the coupling is small, the actual physically observable (i.e. gauge-invariant) states have the same properties as the elementary excitations of the quark and gluon fields, e.g. insofar as energy– momentum relations and conserved charges are concerned. However, to make a conceptually sensible physical prediction, we have to phrase our results in terms of the gauge-invariant quantities.5 The situation we are facing is very similar to the Higgs phenomenon in the electroweak theory. In this case, we cannot, strictly speaking, make a sensible prediction that involves the value of the expectation value vH of the Higgs field, because it is not a gauge-invariant, and thus not an observable, quantity. An example of a quantity that can be observed/measured is the mass of electroweak bosons m W . This mass is proportional to the Higgs expectation value to leading order in perturbation theory (which is the main purpose of the Higgs mechanism). At higher orders this relation becomes gauge-dependent. Thus, we can still use the Higgs expectation value as a parameter in the theory to leading order, but not to all orders. To avoid this limitation we find other quantities, which are observable, and can also be expressed through the Higgs expectation value, in a gauge-dependent way, of course. The key is that, if we write the relation between such two observable quantities, eliminating the Higgs expectation value, the gauge dependence will also disappear from such a relation, to all orders in perturbation theory. In electroweak theory, an example of such an observable is the Fermi coupling constant G F (or the lifetime of the muon). The relationship between G F and vH is not gauge-invariant to all orders, but the relationship between m W and G F is. Symmetries and Goldstone excitations Let us begin with the considerations of symmetry. We need to begin by comparing the symmetries of the microscopic, i.e. QCD, Lagrangian with the symmetries of the ground state. If we find that a global symmetry of a Lagrangian is broken in the ground state, the Goldstone theorem will tell us that there are massless particles, associated with the broken-symmetry generators. If the symmetry is only approximate, the masses of the corresponding “almost” Goldstone 5 The lattice theory also makes this point, in its own way. According to Elitzur, an attempt to
measure a non-gauge-invariant quantity on the lattice must give zero identically. For example, we cannot measure the anomalous Green function ψψ. Of course, this is not a limitation of the lattice theory; rather it is a demonstration that the physical requirement of gauge invariance of observed phenomena is built into the foundation of the theory.
262
Large chemical potentials
particles will be small. These Goldstone particles will then be low-energy excitations. As we discussed, the Lagrangian symmetry SU(3)L × SU(3)R × SU(3)color breaks to SU(3)L+R+color . The color SU(3)color does not count as a global symmetry and we thus have 16(= 8 + 8) generators of symmetry in the Lagrangian, but only eight in the ground state. Thus, we can expect eight (= 16 − 8) Goldstone particles. We can also guess that these eight particles fill a multiplet under the remaining SU(3) symmetry group. In addition, the global U(1)B symmetry, which is responsible for conservation of baryon charge, is also broken in the ground state, which gives us one Goldstone particle. The U(1)A symmetry requires special consideration. The Cooper-pair condensate breaks this symmetry also. However, due to the anomaly, this symmetry of the Lagrangian is not a symmetry of the quantum theory at all. In other words, the vacuum degeneracy corresponding to the rotation of the angle parametrizing U(1)A – which is also known as the θ angle – is lifted. However, it turns out that, at large µB , the effect of the anomaly on the symmetry of the ground state becomes weak, and the degeneracy of the ground state with respect to U(1)A is restored. In other words, the dependence on the angle θ disappears. This happens because the instanton configurations responsible for the θ dependence are suppressed [91]. These configurations of the gauge field carry long-range color electric fields, which are screened by the Fermi sea of quarks. We can therefore consider U(1)A as an approximate symmetry (and parametrically so, in the sense that, at µB → ∞, this symmetry becomes exact). This means that we should expect in total ten Goldstone bosons, one octet and two singlets.6 9.3.1 The effective Lagrangian for Goldstone modes Our next step is to determine the effective Lagrangian which describes the Goldstone excitations which we have predicted. Our guiding principle is again based on symmetry: any symmetry of the microscopic Lagrangian has to be correctly reproduced in the effective Lagrangian. If we limit ourselves to the terms of lowest order in momenta, the above symmetry considerations allow a small number of terms. These considerations leave only a finite number of undetermined constants in the effective Lagrangian. 6 It is important to recognize that the U(1) boson is exactly massless whereas the U(1) boson B A
has a small but nonzero mass. This mass is actually calculable [92]. This difference between the bosons becomes even more significant if one considers global properties, such as vortices. A vortex of U(1)B phase, similarly to an Abrikosov vortex in liquid helium, carries only the energy associated with the length of the vortex. However, in the U(1)A case, there is additional energy associated with a two-dimensional surface attached to the vortex. Properties of such nontopological domain walls are also calculable at high µ [92].
9.3 Lowest excitations of the CFL phase
263
We recall that the ground state of the CFL phase is characterized by the following diquark condensates: bj ∗ bj ∗ X ia ∼ i jk abc ψL ψLck and Y ia ∼ i jk abc ψR ψRck (9.67) where the complex conjugation was added for convenience so that X and Y transform under SU(3)c × SU(3)L × SU(3)R as (3,3,1) and (3,1,3), respectively: X → UL XUcT
and
Y → UR Y UcT
(9.68)
Note that left quarks pair with left quarks and right with right, and the orientations of these condensates X and Y are free for us to choose. The low-energy excitations in the CFL phase are given by the slow rotations of the phases of X and Y . Therefore, we can factor out the norm of the condensates and consider unitary matrices X and Y . Together they give us 9 + 9 = 18 degrees of freedom. Eight of them are eaten by the gluons through the Higgs mechanism, and the surviving ten become the low-energy excitations we have anticipated. For simplicity, we shall at first ignore the overall U(1) phases of X and Y and assume that det X = det Y = 1. The Lagrangian should be symmetric under the SU(3)c × SU(3)L × SU(3)R rotations (9.68). This condition fixes the Lagrangian to the leading (second) order in derivatives [93]: Leff =
f π2 tr[(X † ∂0 X )2 + (Y † ∂0 Y )2 ] + spatial gradients 2
(9.69)
The spatial gradients enter in a similar way but with a different constant instead of f π . The cross term tr[(X † ∂0 X )(Y † ∂0 Y )] is allowed by the symmetries, but, as we shall see in Section 9.3.2, it is suppressed in weak coupling (i.e. at large µ). Roughly speaking, to leading order in g 2 , left and right quarks decouple from each other. So far we have ignored the fact that SU(3)c is a local symmetry. To take this into account, we must replace the derivatives in Eq. (9.69) by covariant derivatives, f π2 tr[(X ∂0 X † − g A0 )2 + (Y ∂0 Y † − g A0 )2 ] + · · · 2 f2 = π tr[(X ∂0 X † − Y ∂0 Y † )2 + (X ∂0 X † + Y ∂0 Y † − 2g A0 )2 ] + · · · 4 (9.70)
Leff =
The second term is responsible for the Higgs effect: the vector-like fluctuations, i.e. fluctuations with equal changes in X and Y , δ X = δY , become longitudinal components of the gluon Aµ . The gluons acquire a mass of order O(g f π ) (electric and magnetic masses are, in general, different). We shall
264
Large chemical potentials
see that f π ∼ µ, and thus the gluon mass is much larger than the momentum scales p that we are considering ( p < 2 gµ), so gluons decouple from the low-energy theory. The axial-like fluctuations of the phases, δ X = −δY , can be written as fluctuations of the phases of a new unitary matrix, = XY †
(9.71)
and the effective Lagrangian takes the form (in Euclidean space) Leff =
f π2 tr ∂0 4
∂0
†
+ vπ2 ∂i
∂i
†
(9.72)
which is the usual Lagrangian of the nonlinear sigma model except that the speed of the mesons, vπ , can be different from the speed of light. The matrix is a singlet in color, transforms under SU(3)L × SU(3)R as †
→ UL UR
(9.73)
and describes the meson octet. What happens if we take into account the U(1) phases of X and Y ? Then it is possible to add into the chiral Lagrangian a term proportional to [tr(X † ∂0 X )]2 + [tr(Y † ∂0 Y )]2
(9.74)
It is possible to add a cross term, [tr(X † ∂0 X )][tr(Y † ∂0 Y )] but it is suppressed in weak coupling for the same reason that the cross term was omitted from Eq. (9.69). We shall make the consequences of the term (9.74) clear by splitting X and Y into SU(3) and U(1) parts, X = X˜ e2iθ+2iφ
and
Y = Y˜ e−2iθ+2iφ
(9.75)
where X˜ and Y˜ are SU(3) matrices, and the angle φ is the variable conjugate to the baryon charge, normalized to 1 for a single quark. The normalization of the U(1)A phase θ is fixed analogously. Consequently, the field defined in Eq. (9.71) now has the form = ˜ e4iθ ,
˜ = X˜ Y˜ †
(9.76)
In terms of ˜ , φ, and θ , the lowest-order chiral Lagrangian consistent with the symmetries has the form Leff =
f π2 tr ∂0 ˜ ∂0 ˜ † + vπ2 ∂i ˜ ∂i ˜ † + 12 f η2 (∂0 θ )2 + vη2 (∂i θ )2 4 (9.77) + 12 f H2 (∂0 φ)2 + vH2 (∂i φ)2
The difference between the decay constants of η and H mesons from f π arises
9.3 Lowest excitations of the CFL phase
265
from the additional allowed term (9.74), whereas the difference between f η and f H arises from the cross term [tr(X † ∂0 X )][tr(Y † ∂0 Y )], and, therefore, is small at weak coupling. We shall use f η = f H in the rest of the book. The elementary meson fields π A , η , and H are defined as A A λ π η H ˜ = exp i , θ=√ , φ=√ (9.78) fπ 24 f η 24 f H where λ A (A = 1, . . ., 8) are Gell-Mann matrices normalized so that tr(λ A λ B ) = 2δ AB . 9.3.2 Decay constants of the Goldstone bosons Let us now show that the decay constants of the Goldstone bosons, f π , f η , and f H , can all be computed in the high-density regime and are all proportional to the chemical potential µ. We shall demonstrate the method for f H , whose calculation is the simplest. Let us imagine that the symmetry U(1)B generated by conserved baryon charge is not a global symmetry, but instead a local one. So we introduce into the theory a gauge field Aµ coupled to the baryon current. Owing to the Higgs mechanism, the gauge bosons acquire a finite mass, which can be computed both in the microscopic theory, in which it is expressed in terms of the chemical potential, and in the effective theory, in which it is proportional to f H . Matching the two results determines f H in terms of µ. In order to compute f H we have to deal only with the part of the effective chiral Lagrangian (9.77) that contains the phase φ. Since φ is the phase of the U(1)B rotation, which is now a local symmetry, one should replace the derivatives by covariant ones, L = 12 f H2 (∂0 φ + e A0 )2 + vH2 (∂i φ + e Ai )2 (9.79) where e is a small auxiliary coupling constant. From Eq. (9.79) we find that the gauge bosons acquire finite mass terms, which are different for A0 and Ai , m 2A0 = 24e2 f H2
(9.80)
m 2Ai = 24vH2 e2 f H2
(9.81)
On the other hand, the masses in Eqs. (9.80) and (9.81) can be computed from the microscopic theory, in which m 2A0 has the meaning of the Debye mass of the electric Aµ field, while m 2Ai is the Meissner mass of the magnetic components of Aµ . To compute these masses, it is most convenient to use the low-energy effective theory containing only fermion modes near the Fermi surface that we derived in Section 9.2.1.
266
Large chemical potentials
(a)
(b)
Figure 9.7. The leading-order diagrams contributing to decay constants. To leading order, the only contribution to m 2A0 is from the sum of two one-loop diagrams shown in Fig. 9.7, which, for zero external momentum, is equal to 9 (i p0 + p )2 d p0 d p µ2 2A 2 2 − 2 (9.82) + 2π (2π ) A=1 ( p0 + 2p + 2A )2 ( p02 + 2p + 2A )2 where we use the formula for the phase space near the Fermi surface (9.28). The overall factor of two in Eq. (9.82) comes from summation over left- and righthanded quarks in the internal loop. We find that the two diagrams in Fig. 9.7 give equal contributions to the squared Debye mass, which is given by µ2 (9.83) 2π 2 This value is the same as the Debye mass that Aµ would have in the absence of the superconductivity – see Eq. (9.57). The origin of this coincidence can be made clear by the following argument. What we have computed is simply the 00 component of the polarization operator of Aµ , "00 . Since A0 is coupled to the baryon charge n = ψ † ψ, "00 has the following interpretation in linear-response theory: the baryon charge density generated by an external uniform field A0 is given by m 2A0 = 18e2
n = "00 (0)A0
(9.84)
However, the uniform A0 field is simply a shift of the chemical potential, or the Fermi energy. Therefore, "00 is equal to e2 ∂n/∂µ, i.e. the density of states near the Fermi surface, which is exactly the right-hand side of Eq. (9.83). It is easy to see that ∂n/∂µ is the same in the normal phase and superconducting phase, provided that the gap in the latter is small. On comparing Eq. (9.83) with the result from the effective theory, Eq. (9.80), one finds the decay constant f H , 3 µ2 (9.85) 4 2π 2 An important remark is in order here. Equation (9.85) tells us that f H depends only on the chemical potential µ, not on the gap . This might seem to contradict the fact that, at = 0, the U(1)B symmetry would be restored and no f H2 =
9.3 Lowest excitations of the CFL phase L
267
R
Figure 9.8. A higher-order diagram contributing to the decay constants. One loop contains a left-handed quark; the other has a right-handed quark. Goldstone boson is expected in the theory. The explanation is that, as decreases, the domain of applicability of the effective Lagrangian p < 2 shrinks and disappears at = 0. Therefore, the persistence of f H does not contradict the restoration of U(1)B symmetry at = 0. To find the Meissner mass, we have to evaluate the same two diagrams of Fig. 9.7 and add the “counterterm” mass given by the diagram in Fig. 9.6(b). In the case of the Meissner mass the vertices are attached to Ai and A j . Instead of being equal, the two diagrams in Fig. 9.7 now cancel each other out, since the vertex factor of the first diagram is vi v j , whereas that in the second diagram is vi (−v j ). Therefore, the Meissner mass in the superconducting phase is equal to the “counterterm” Meissner mass (9.63), which (squared) is equal to m 2A0 /3. On comparing this with Eqs. (9.81) and (9.80), one finds that the velocity of the H boson is given by vH =
m Ai 1 =√ m A0 3
(9.86)
i.e. it is equal to the speed of sound in relativistic fluids. This result is not completely surprising, since the U(1)B phase φ of the condensate is the variable conjugate to the baryon density ψ † ψ, whose fluctuations give rise to sound waves. Therefore, H quanta can be considered to be phonons. However, they are not hydrodynamic phonons since they exist outside the hydrodynamic regime (as in our case at zero temperature where the mean free path diverges). One can repeat the same calculation for the η meson and see that, to leading order, all diagrams remain the same. Thus we find that f η = f H , and the η meson also propagates with the speed of sound. There is, however, no symmetry that requires the decay constants of H and η to be equal; in fact, they are not equal in higher orders of perturbation theory. For example, the diagram drawn in Fig. 9.8 gives contributions of opposite signs to f H and f η . The computation of f π and vπ is completely analogous to that of f H . We could introduce a fictitious field coupled to the flavor currents, but we can also make use of the existing coupling to the gluons to compute f π . The only additional complication is that the interaction vertex now has a nontrivial structure. In our
268
Large chemical potentials
color–flavor basis (9.50), the interaction vertex has the form 2
8 8
tr(T A T B T C ) ψ † A A B ψ C
(9.87)
A,C=0 B=1
A straightforward calculation shows that the first diagram in Fig. 9.7 gives ( 34 )g 2 µ2 /(2π 2 ) and the second diagram contributes −[(3 + 4 ln 2)/18]g 2 µ2 / (2π 2 ) to the Debye mass. Taking into account the factor of two arising from left- and right-handed quarks, the result reads m 2A0 =
21 − 8 ln 2 g 2 µ2 18 2π 2
(9.88)
The mass in Eq. (9.88) is not equal to the Debye mass in the normal phase, in contrast to the previous case of the fictitious U(1) boson coupled to the baryon current. Notice that, if the gluons are coupled to the left-handed quarks alone, the squared Debye mass would be halved, which is exactly what one obtains by throwing away Y from the Lagrangian (9.70). However, if we add the X Y cross term to the Lagrangian, this will no longer be true. Therefore, this cross term, though it is not forbidden by the symmetry, has a small coefficient in weak coupling. This can be seen also from the fact that the coupling of the left- and right-handed flavor currents is zero to leading order. In higher orders of perturbation theory it receives contributions from diagrams like those in Fig. 9.8, where one vertex corresponds to the left-handed flavor current and the other to the right-handed one. In the effective theory, the gluon mass is given by m 2A0 = g 2 f π2 . Therefore, one can determine the decay constants of the mesons in the pseudo-scalar octet [94], f π2 =
21 − 8 ln 2 µ2 18 2π 2
(9.89)
The ratio f π2 / f η2 = f π2 / f H2 = 2(21 − 8 ln 2)/27 ≈ 1.14 is quite close to unity. This fact is reminiscent of the OZI rule [95]. The Meissner mass is equal to 21 − 8 ln 2 g 2 µ2 g 2 µ2 1 1 3 + 4 ln 2 2 − − = m Ai = 2 (9.90) 2π 2 2 4 54 54 2π 2 where the first term in the parentheses in the intermediate expression comes from the bare Meissner mass, while the last two terms come from the two diagrams in Fig. 9.7. Again, in contrast to the case of U(1)B , the contributions of the two diagrams do not cancel each other out. It is somewhat surprising that the squared Meissner mass is a third of the squared Debye mass, as it is for the U(1)B case.
9.3 Lowest excitations of the CFL phase
269
Since the ratio of these two masses is the velocity of the Goldstone bosons, we find that √ the velocities of all Goldstone modes in our theory equal the speed of sound 1/ 3 to leading order of perturbation theory.7 9.3.3 Meson masses We now turn on finite small quark masses and compute the resulting masses of the Goldstone bosons. By introducing bare-quark masses into the microscopic theory, we break the SU(3)L × SU(3)R symmetry. Less symmetry means more freedom: more terms are allowed in Leff . However, constraints on the form of these terms can still be imposed if we note that the bare-quark mass term †
L = #L M#R + h.c.
(9.91)
with M being the 3 × 3 mass matrix, could be made invariant under SU(3)L × SU(3)R if the matrix M is not passive under this symmetry but also transforms together with #L,R in the following way: #L → UL #L ,
# R → U R #R
and
†
M → UL MUR
(9.92)
Any term in the effective Lagrangian, written as a function of and M, must respect the extended symmetry (9.73) and (9.92). At large µ the U(1)A symmetry is effectively restored, which imposes an additional constraint on the possible form of mass terms in the chiral Lagrangian. Under the U(1)A transformation, the microscopic degrees of freedom transform as #L → eiζ #L ,
#R → e−iζ #R
(9.93)
where ζ is an arbitrary pure phase. Equation (9.93) implies the following transformation law of X , Y , and : X → e−2iζ X,
Y → e2iζ Y
and
→ e−4iζ
(9.94)
The quark-mass term (9.91) is invariant under the U(1)A symmetry if one requires that M transforms as M → e2iζ M
(9.95)
under U(1)A . Thus, any mass term in the effective Lagrangian must be invariant both under the SU(3)L × SU(3)R symmetry and under the U(1)A symmetry extended by the transformation of M. Since the bare-quark masses are assumed to be small, we want to construct the mass terms of lowest possible order in M. To first order, the only candidate is the √ 3, have been found by Bogolyubov and Anderson for phase waves in BCS superconductors and by Leggett for spin waves in superfluid 3 He [96].
7 Similar results, v = v / F
270
Large chemical potentials
mass term of the chiral Lagrangian at zero chemical potential, tr(M † ) + h.c. This term is invariant under SU(3)L × SU(3)R ; however, it is not invariant under the U(1)A symmetry. Therefore, we must consider terms of higher order in M. At order M 2 we find that the following term is allowed by the symmetries:8 Leff = −c det M tr(M −1 ) + h.c.
(9.96)
There are two other possible terms allowed by the symmetries:9 {tr[(M
† 2
) ] + [tr(M
†
)]2 } det
+ h.c.
and
tr(M
†
M† )
(9.97)
However, only the coefficient of the term (9.96) is nonzero to leading order in the expansion in powers of g, or 1/ ln(µ/), as we shall see below. Meson spectroscopy in CFL: inverse mass ordering The term (9.96) gives rise to an interesting pattern of meson masses in the CFL phase. To write the (mass)2 matrix for the Goldstone bosons we expand the field using 3 × 3 Gell-Mann matrices λa : a a iπ a λa π a π b λa λb iπ λ =1+ − + ··· (9.98) = exp fπ fπ 2 f π2 √ where a = 1, . . ., 9, tr(λa λb ) = 2δ ab , λ9 = 2/3 and π 9 = η f π / f η . Since the kinetic term is conventionally normalized we can read off the (mass)2 matrix from the mass term (9.96) (except for the trivial rescaling of the η field): C c det M tr(M −1 λa λb ) π a π b = det M tr(M −1 λa λb ) π a π b 2 fπ 2
(9.99)
where C=
20 108 2c = f π2 21 − 8 ln 2 µ2
(9.100)
which comes from Eq. (9.89), which we have derived, and Eq. (9.114), which we shall derive below. 8 One way of looking at this term is to realize that, under SU(3) , both M and L
transform as fundamental 3-plets. We require two powers of M and one of in order to satisfy the U(1)A neutrality. We can construct an SU(3)L singlet out of a product of M, M, and if we antisymmetrize with respect to the first index of each of these matrices (the index on which SU(3)L acts). By antisymmetrizing also with respect to the second index we obtain an SU(3)L × SU(3)R singlet, a b c abc Maa Mbb cc , which coincides with Eq. (9.96) up to a factor of two. 9 Taking a minus sign in the curly brackets in the first term would give the term in Eq. (9.96).
9.3 Lowest excitations of the CFL phase
271
With the quark-mass matrix M = diag(m u , m d , m s ), the 9 × 9 meson-mass matrix (9.99) decomposes into a diagonal 6 × 6 matrix and a nondiagonal 3 × 3 ¯ 0: matrix. The former gives rise to the mass formulas for π± , K± , K0 , and K m 2π± = C(m u + m d )m s m 2K± = C(m u + m s )m d m 2K0 ,K¯ 0 = C(m d + m s )m u
(9.101)
The remaining 3 × 3 matrix corresponding to the π0 , η, and η mesons is more complicated. Unlike in the QCD vacuum, in the CFL phase the π0 , η, and η excitations mix strongly. The mixing pattern is easy to understand if we neglect the small difference between f π and f η . Then the mass matrix (9.99) ¯ s¯s, which is related to π0 , η, η by has the simplest form in the basis u¯ u, dd, √ √ ¯ ¯ − 2¯ss)/ 6, the quark-model relations π0 √= (¯uu − dd)/ 2, η = (¯uu + dd ¯ + s¯s)/ 3. In this basis λu¯ u = diag(1, 0, 0), λdd and η = (¯uu + dd = ¯ diag(0, 1, 0), λs¯s = diag(0, 0, 1), and the matrix in this basis becomes diagonal. The mixing among π0 , η, and η is thus ideal, and the masses of the mixed states are m 2u¯ u ∼ = 2Cm d m s ,
∼ m 2dd ¯ = 2Cm u m s ,
m 2s¯s ∼ = 2Cm u m d (9.102)
¯ mesons are of the same order of magnitude as those of The masses of u¯ u and dd √ ± π and K mesons. The u¯ u meson√is the heaviest and is approximately 2 times ¯ is 2 times heavier than neutral kaons. The s¯s, in heavier than K± , while the dd contrast, is particularly light. In reality, f π = f η , so the mixing is not completely ideal. The reader will notice that the mass ordering in the CFL phase is completely reversed compared with what one sees in the QCD vacuum. While this is quite surprising, it can easily be explained. The key point is that the mesons, which are the fluctuations of in Eq. (9.71), should be thought of as bound states of a triplet anti-diquark X and an anti-triplet diquark Y † , i.e. as q¯ q¯ qq states, rather than q¯ q states. Consider, for example, the lightest meson in the CFL phase, which in the normal quark model would be labeled as the s¯s state. Now the s quark should be replaced by the u¯ d¯ anti-diquark, and the s¯ anti-quark should be ¯ replaced by the ud diquark, so this meson is represented as u¯ dud. Since such a meson does not contain the strange quark, it is not surprising that its mass does not depend on m s , as in Eq. (9.102). For all other mesons, one can write ¯ s, d → u¯ s¯, and the quark structure as well, by making the replacements u → d¯ ¯ s → u¯ d. Since one replaces the heaviest quark by the lightest anti-diquark and vice versa, the inverse mass ordering can be expected. It is important to note, however, that the mesons in the CFL phase have the same quantum numbers (up to mixing) as do the mesons in the QCD vacuum.
272
Large chemical potentials X
X
M + M
M Y†
+
X↔Y
+
h.c.
M Y†
Figure 9.9. Lowest-order diagrams contributing to the M and dependence of the vacuum energy in the CFL phase. Arrows indicate the direction of flow of the fermion (baryon) charge. This direction is reversed only at the X and Y vertices. Calculation of the coefficient of the mass term In order to determine the coefficient of the mass term we can use the same strategy as we did in determining the decay constants. We need to choose a quantity that can be calculated in two independent ways, using the effective Lagrangian and the microscopic theory (QCD), and match the results. In the present case it is convenient to calculate the shift of the vacuum energy due to the quark mass. The mass-dependent term in the (Euclidean) effective Lagrangian (9.96) is equal to the shift of the energy of the ground state as a function of the quark-mass matrix M and of the given orientation of the slow field . In order to calculate such a shift in the microscopic theory, we need to identify quark–gluon-vacuum bubble diagrams that depend on M and . The diagrams we are interested in must carry two M vertices as well as one X vertex and one Y † vertex, since = X Y † . The diagrams of this type, which are of lowest order in g, are shown in Fig. 9.9. The gluon line is needed in this diagram for the following reason. Let us go clockwise around the quark loop in Fig. 9.9 starting from the Y † vertex. This vertex sits on the propagator of the low-energy field ψR in our notation of Section 9.2.1. The insertion of M has two effects on the quark field. It flips the chirality of the field, R → L, according to Eq. (9.91). However, there is another effect. The left-chirality field which we obtain is not ψL , but χL – the field whose excitations describe antiparticles.10 On the other hand, the insertion of X , which 10 This can be seen using the representation of the α Dirac matrices (9.4). The fields ψL and ψR satisfy the eigenvalue equation α · pψ = | p|ψ, which gives them the necessary dispersion
relation E = | p| − µ. The fields χ satisfy α · pχ = −| p|χ. If we chose the z axis along the vector p, the components of the Dirac 4-spinor are # ≡ (#L ; #R ) = (ψL , χL ; χR , ψR ). Thus † † we have a mass term #L #R = ψL χR + · · ·.
9.3 Lowest excitations of the CFL phase
273
Figure 9.10. Diagrams that combine into the composite-vertex, simplifying diagrams in Fig. 9.9.
Figure 9.11. The same diagrams as in Fig. 9.9, but using the composite vertex from Fig. 9.10. we need to make, has to be on a ψL line, since X describes the color–flavor orientation of the anomalous term ψL ψL . The insertion of the gluon vertex is necessary in order to provide the flip χL → ψL . The diagrams in Fig. 9.9 can be grouped into a simpler diagram, which has a composite vertex – a sequence of an M and a gluon insertion, as illustrated in Fig. 9.10. With the composite vertex defined in Fig. 9.10, the sum of the diagrams in Fig. 9.9 can be replaced by the diagrams in Fig. 9.11. We could also have arrived at this diagram by integrating out the χ fields in the path integral and using the resulting effective theory for the ψ fields. It turns out that the diagram in Fig. 9.11 is easy to evaluate despite its being a two-loop diagram. Let us first determine what expression corresponds to the composite vertex in Fig. 9.10. Both of the diagrams which contribute to this vertex contain an antiparticle (i.e. χ) propagator, represented by a thick line. If the external fermion lines are on shell, E ≡ i p0 = | p| − µ (and we shall use this vertex only for on-shell external quark lines), this internal antiquark propagator is off shell by E − (−| p| − µ) = 2| p|, which we can approximate, within our precision, i.e. up to terms of order E/µ, as 2µ. By using four-component spinors, using Eq. (9.7) for the quark–gluon vertex, and writing the quark-mass term (9.91) as # † γ0 M#, we obtain for the sum of the two contributioins to the
274
Large chemical potentials
composite vertex LψL ψR A =
g † 0 M + Mγ0 (iA0 + α · A)]# # [(iA0 + α · A)γ 2µ
(9.103)
Since the matrices γ and α are anti-diagonal and diagonal in the L and R 2 × 2 blocks, 0 1 σ 0 γ0 = and α= (9.104) 1 0 0 −σ the effective vertex given by LψL ψR A flips the chirality of the quark. Since γ0 α + α γ0 = 0, the contribution of A cancels out – the magnetic gluon cannot be emitted from this vertex because the two amplitudes in Fig. 9.10 interfere destructively. Although # is a four-component Dirac spinor, only two independent components enter Eq. (9.103) – those describing ψL and ψR fields. Because the incoming, #, and outgoing, # † , fields in general carry different momenta, their ψ components occupy different entries in the four-component spinor. We can take care of this by using projectors: Pp± =
1 ± α · v p 2
(9.105)
Thus, we can finally write for the composite ψL ψR A vertex, using momentum components of the fields, ig ig † + # Pk γ0 Pp+ M A0 # = # † γ0 Pk− Pp+ M A0 # µ µ
(9.106)
Another simplifying feature of this vertex is that it vanishes if the gluon does not since P − P + = 0. This can be understood as a carry any momentum, i.e. p = k, p p consequence of conservation of angular momentum: the emitted electric gluon carries away no angular momentum, while the spin of the quark has to flip if its momentum is unchanged, since its chirality flips. Now we can write the expression corresponding to the diagram in Fig. 9.11: p k 1 1 ig 2 E vac = − 2 2 2 2 2 2 2 2 µ − k| p k p0 + p + p k0 + k + k | p 8 1 × tr(Pk− Pp+ ) tr tr(M T A X M T A Y † ) + h.c. 2 A=1
(9.107)
We have collected all the color- and flavor-carrying matrices under the double (color and flavor) trace at the end of the expression. The first factor of 12 is due to the symmetry of the diagram in Fig. 9.11. The factor of 12 in front of
9.3 Lowest excitations of the CFL phase
275
tr(Pk− Pp+ ) accounts for the fact that the diagram shown in Fig. 9.11 includes only the ψR → ψL part of the vertex (9.103) – the other part, ψL → ψR , is in the complex-conjugate diagram denoted by h.c. Evaluating the trace over Dirac indices gives tr (Pk− Pp+ ) = 1 − v p · vk = 1 − cos θ
(9.108)
and we see that the dependence on the angle θ between p and k cancels out between this trace and the Coulomb propagator: 1 2 | p − k|
=
2µ2 (1
1 − v p · vk )
(9.109)
As a result, the integrals over p and k factorize: 2 8 p 1 1 g 2 tr tr(M T A X M T A Y † ) + h.c. E vac = 2 2 2 + 2 2 µ 4µ p + p 0 p p A=1 (9.110) The integral can be evaluated within our (logarithmic, or O(g)) precision: ∞ p d p0 ( p0 ) µ2 ≈ 2 2 2 2 2π 0 p p0 + p + p p02 + 2 µ d p0 µ2 g µ ≈ 0 sin √ ln 2 2π 0 p0 p0 3 2π √ 2 µ 3 2π = 0 (9.111) 2 2π g where we took into account the energy dependence of from Eq. (9.47) and also Eq. (9.46) for 0 (in particular, the fact that, for p0 = 0 , the argument of the sine in Eq. (9.111) is equal to π/2). Note that this integral receives its value from the large interval of energies between and µ, in close similarity to our calculation of the gap. The logarithmic enhancement of the integral is clear from the factor 1/g in the result which, according to Eq. (9.46), is equal to ln(µ/ p0 ) up to a numerical factor. Finally, let us evaluate the color and flavor traces in Eq. (9.110): 8
tr tr(M T A X M T A Y † )
A=1
= = =
8
M ii TaaA X ck i jk abc M j j TbbA (Y † )kc i
j k
A=1 − 23 M ii X ck i jk a b c M j j (Y † )kc i j k a b c − 43 M ii M j j (X Y † )kk i jk i j k = − 83 det
a b c
M tr(M −1 )
(9.112)
276
Large chemical potentials
where we began by applying Eq. (9.24) to the underlined factors. Note that we have obtained the term which matches the effective Lagrangian (9.96). The other two terms in Eq. (9.97) can appear only in a higher order in g. On putting together Eqs. (9.110), (9.111), and (9.112) we obtain E vac = −
320 det M tr(M −1 ) + h.c. 2π 2
(9.113)
By comparing this with the dependence of the vacuum energy on M and in the effective theory given by Eq. (9.96), we determine the value of the coefficient c [94, 97]: c=
320 2π 2
(9.114)
This completes our calculation in this section. We can now insert the constant c into the effective Lagrangian Eq. (9.96), and obtain the expressions for the meson masses (9.101) and (9.102), using the constant C = 2c/ f π2 as in Eq. (9.100). To add to our discussion of the spectrum in Section 9.3.3, since C ∼ 2 /µ2 , the masses become smaller and smaller as µ increases. The values of the masses depend on the value of the gap , which, unfortunately, we know only up to a factor. 9.3.4 Continuity of quark and nuclear matter? It is amusing to notice that, insofar as the quantum numbers of the particles are concerned, the spectrum of the lowest-lying pseudo-scalar excitations is the same as one would expect in superfluid nuclear, or rather hypernuclear, matter. The breaking of chiral symmetry produces the usual pseudo-scalar octet, while the breaking of the singlet U(1)B , which is superfluidity, is responsible for the scalar singlet. In hypernuclear matter, the candidate for such a state is the H dibaryon – a bound state of , which, similarly to the H boson in the CFL phase, has the quark content udsuds. For the Goldstone-type excitations the similarity is a consequence of the similar patterns of symmetry breaking in the ground state, SU(3)L × SU(3)R × U(1)B → SU(3)V , and the Goldstone theorem. It is also interesting that the fermion excitations are also similar. Since the baryon charge is not a good quantum number in the CFL phase, the baryons should be characterized by their flavor quantum numbers under the unbroken SU(3)V symmetry (neglecting quark masses). The fermion excitations in the CFL phase are, on the perturbative level, quarks. There are nine of them, counting color and flavor combinations. It is easy to see that they fall into an octet and
9.3 Lowest excitations of the CFL phase
277
a singlet. For the same reason eight of the gaps are equal whereas one is larger in Eq. (9.52). The lowest-lying baryons also fill out an octet (a heavier singlet might exist, but, if it does, is certainly unstable due to decay into an octet baryon and a pseudo-scalar). Furthermore, the vector excitations in the CFL phase – the gluons, which acquire mass by a Meissner–Higgs-type mechanism, also occupy an octet that is similar to the octet of the ρ, K∗ , etc. in the hadronic phase. All of these observations suggest that the transition to hypernuclear matter is the only transition necessary as the chemical potential increases at T = 0. In other words, in QCD with three light quarks at T = 0, there might only be the transition to hypernuclear matter, with the breaking of the U(1)B symmetry and the existence of superfluidity. The symmetry of the ground state is the same at asymptotically high µ and no additional phase transitions are necessary in order to connect to this asymptotic regime, as pointed out in [63]. Since the operator of the electric charge is one of the SU(3)V generators (the charges of u, d, and s sum to zero), the charge content of the octets is the same in the CFL phase and in hypernuclear matter. However, how can the quarks appear as integer-charge excitations? This is possible because, in addition to flavor, quarks carry color. In the CFL phase neither color nor flavor is a good quantum number of an excitation, but their difference is, since simultaneous equal color and flavor rotations preserve the ground-state condensate (9.1). There are eight such transformations, and one of them is the electric charge of the CFL phase. The electric charges in the octet are thus given of the eigenvalues by differences of the actual electric charge Q flavor = diag 23 , − 13 , − 13 and the color Q color = diag 23 , − 13 , − 13 , which are always integral. The modified electric-charge generator Q flavor − Q color is not broken, and, therefore, if a gauge field is coupled to this charge it will remain massless, as the photon should be. However, since the corresponding transformation, involving both color and flavor rotations, is different from the transformation in the vacuum, the corresponding modified photon field is a mixture of the usual photon and the field coupled to the diag 23 , − 13 , − 13 color generator. The admixture of the latter is of the order of the ratio of electromagnetic and strong couplings e/g (this is the analog of the Weinberg angle in electroweak theory). This also means that the modified photon couples slightly more weakly (by a relative amount of order (e/g)2 ) in the CFL phase. It is also interesting that, since all charged excitations which can be excited by a passing photon are gapped, a chunk of CFL matter would be transparent, like a diamond.11 11 In a realistic environment, e.g. in the interior of a cold compact star, one would need to be
sure that no electrons are present in the CFL matter. This is the case, due to the condition of electric neutrality [98], provided that condensation of charged kaons does not occur (see [99] and Section 9.4).
278
Large chemical potentials 9.4 Comments and some further developments
In this chapter we concentrated on finding the properties of the QCD ground state and the lowest excitations using first-principles perturbative calculations. The success of such an approach is due to the asymptotic freedom of QCD which makes the expansion in powers of g(µ) systematically meaningful at large µ. We have seen that such calculations are nontrivial even at lowest order and the dependence on the coupling constant g is far from analytic. This is due to the nonperturbative nature of the phenomenon of BCS pairing. How far in µ does one have to go to achieve reasonable precision in such calculations? A straightforward analysis [89,100] suggests that this value of µ may need to be as large as 108 MeV. This means that, although they are systematically correct, the results obtained using first-principles perturbative calculations need not be practically reliable at values of µ of the order of a few hundred MeV – typical values expected to pertain to the interiors of neutron stars. The same region of moderate µ is also of great theoretical interest – the phase transitions are expected to occur there. Although perturbative calculations cannot address these issues, these first-principles results establish the existence of effects that one might expect to persist, in qualitatively similar ways, at moderate µ. Phenomenologically motivated models of strong interactions, such as Nambu–Jona Lasinio-type theories with interactions modeled by one-gluon or instanton effects [23, 24], allow one to study these phenomena in the region of interesting µ. In particular, phase transitions can be studied as a function of µ as well as of T [101]. Another parameter, which we kept small in our calculations, is the value, or values, of the quark masses. Our calculation in Section 9.3.3 is based on an expansion in m q . This expansion may be a good guide for realistic up- and downquark masses, but we must be concerned about higher-order corrections in the expansion for the strange-quark mass which is, physically, not small. The most important such corrections to the meson masses have been analysed in [97, 99]. It turns out that these additional mass terms lower the masses of the kaons (K0 and K+ – kaons with an antistrange quark) further in the CFL phase, thus making 1/3 condensation of these mesons possible once m s ∼ m u,d 2/3 . One can turn this condition around to determine the value of µ at which such condensation occurs in the phase diagram of QCD [102]. In addition, if the value of m s is large enough, the pairing of strange with up and down quarks is not energetically favorable, because of the mismatch in the Fermi surfaces. Again, analysis based on models such as the Nambu–Jona Lasinio-type models in [103] is necessary in order to address this regime. The corresponding unlocking transition from the CFL phase to the two-flavor pairing phase occurs at a value m s ∼ 1/2 µ1/2 . For the physical value of m s , such a condition determines the boundary of the CFL phase in the QCD phase diagram.
9.4 Comments and further developments
279
The mismatch of the Fermi surfaces of the paired quarks could also trigger the following interesting phenomenon, as pointed out in [104]. It has been known, at least on a theoretical level since the work of Larkin, Ovchinnikov, Fulde, and Ferrell [105], that, in the BCS theory of pairing, a mismatch of the electron up- and down-spin Fermi surfaces does not lead immediately to an unpairing. Rather, there is a critical value of the mismatch δµ at which there occurs a transition to a pairing with nonzero total momentum of a pair – essentially, because each of the pairing fermions sits close to its own Fermi sphere. The resulting variable-phase condensate breaks translational invariance. It has also been known that it is more energetically favorable to condense several such waves breaking all (continuous) translational symmetries. The resulting state has all the properties of a crystal. In the context of QCD such a phase has been termed a crystalline color superconductor. The study of such a crystal in QCD is still going on, but preliminary analysis indicates that the face-centered cubic crystal is energetically preferred [106]. Lattice-field theory as a means of addressing nonperturbative problems from first principles would be indispensible for obtaining reliable information about all such phenomena at intermediate µ and realistic m s . Unfortunately, as we have already pointed out, this regime is still off bounds to a direct Monte Carlo simulation, for the reasons we discuss in detail in Section 10.1. The application of these results to the physics of compact stars is a developing subject. Such an appliction would require, besides reaching into the regime of relevant intermediate µ and realistic m s , also the inclusion of electromagnetism, which plays an essential role in the energetics of the QCD phases (see, e.g., [107]). A growing number of potential manifestations of color-superconducting phenomena is being investigated. They include the mass–radius relation, spindown effects of gravitational r-mode instabilities, glitches, and cooling rates. The interested reader should find the recent reviews on the subject a valuable source of information. Further references include [108, 109].
10 Effective Lagrangians and models of QCD at nonzero chemical potential
10.1 QCD at finite µ and the sign problem As discussed briefly in earlier chapters, the QCD partition function at nonzero baryon chemical potential cannot be evaluated by standard Monte Carlo simulation methods. This is so because the integrand of the Euclidean path integral is not positive for all configurations of the fields if the color gauge group is SU(3). This problem is known as the sign problem. Later in this book we shall look at the state of the art of developing new approaches and algorithms that might provide some limited simulation information on QCD at nonzero µB and T . The sign problem is one of the most pressing bottlenecks in computational physics. In addition to limiting our knowledge of QCD, it occurs in condensedmatter physics and has undermined simulation studies of correlated electron systems and high-Tc superconductors. The grand canonical partition function is formally, when it is written as a path integral, −'(T,µ)/T Z ≡e = D A Dψ¯ Dψ exp(−SE ) (10.1) The Euclidean action, SE , is given by 1/T Nf * ) 1 ¯ f (∂/ + i A / + m f − µγ0 )ψ f dx0 d3 x tr(F F ) − ψ SE = µν µν 2g 2 0 f =1 (10.2) where Nf = 2 is the number of flavors, Nc = 3 is the number of colors, and m f = m is the quark mass. The Euclidean matrices γµ are Hermitian. On integrating over the fermion fields we can also write 1 / + m f − µγ0 ) (10.3) Z = D A exp − 2 tr(Fµν Fµν ) det( D 2g 280
10.1 Finite µ and the sign problem
281
The first factor, the exponent, is manifestly positive. The problem arises because of the second factor, the determinant of the Dirac operator. This determinant becomes complex when µ = 0. Let us first check that this determinant is positive when µ = 0. A note on terminology: we shall use the term “matrix” and “operator” interchangeably when referring to the Dirac operator. In a discretized version of the theory, e.g. on a lattice, or in a random-matrix approximation, it is a finite, albeit large, dimensional matrix. In a continuum theory it is an operator acting on Dirac 4-spinor functions. / is anti-Hermitian; therefore, if Consider the case m = 0 first. The matrix D m = 0 and µ = 0, the Dirac determinant is at least real. In fact, it is positive because of the vectorlike nature of the Dirac fermion. It is easy to see this explicitly in the Weyl (chiral) representation of the gamma-matrices. In this representation all four gamma-matrices are block diagonal in 2 × 2 blocks: 0 1 0 iσ γ0 = , γ= (10.4) 1 0 −iσ 0 All four gamma-matrices are anti-Hermitian (we are working in Euclidean space) and the upper block is the complex conjugate of the lower block. There/ has the form fore the matrix D 0 iX /= , iX = D0 + iσ · D. (10.5) D iX † 0 – the upper block is the anti-Hermitian conjugate of the lower block. The deter/ = det(X X † ) > 0. minant of such a matrix is manifestly positive: det D The mass enters on the diagonal in Eq. (10.5) and does not spoil the argument / + m) = det(X X † + m 2 ). above: det( D Now consider µ = 0. Then m iX − µ / + m − µγ0 = (10.6) D m iX † − µ and / + m − µγ0 ) = det[(X + iµ)(X † + iµ) + m 2 ] det( D
(10.7)
is not positive definite. It would be if we formally chose µ to be imaginary [110]. However, for real µ, the physical situation, the determinant is in general complex. This means that we cannot straightforwardly apply Monte Carlo methods to Eq. (10.3). 1 1 The problem could be called the “complex-phase” problem, but, at least in principle, one could
reduce such a problem to a sign problem by combining the gauge-field configurations into pairs such that the phase of the determinant cancels out explicitly. This leaves an indefinite sign for the integrand.
282
Effective Lagrangians and Models
One natural step would be to investigate the partition function (10.3) in the quenched approximation. That is, use the Monte Carlo weight given by the pure gluon action and set the determinant to unity. One could then argue that the limit Nf → 0 is smooth and we could obtain results that are in qualitative agreement with real QCD, i.e. with Nf = 2 or 3. Though such an approach is very fruitful for finite-temperature QCD, it fails when µ = 0. The essence of the problem is that there are many theories that have the same Nf → 0 limit. We need to know which one of these theories the quenched case approximates. The answer to this question is crucial, since the different theories have qualitatively different quark contents and physical properties. It turns out that the quenched case approximates a “wrong” theory. It also turns out that this “wrong” theory also has physical significance (a theory of finite isospin density) and we shall discuss it at the end of this chapter. A random-matrix model of the QCD partition function becomes very useful when we want to understand the failure of the quenched approximation. We shall introduce this model and see what it actually predicts for real QCD with fermions. We shall see that these predictions agree with the physical expectations outlined in Chapter 8. Owing to the simplicity of the model, we shall be able to carry out the explicit analytic calculations needed to investigate the nature of the quenched limit. 10.2 The random-matrix model of QCD Random-matrix theory provides an effective description of those degrees of freedom in QCD which are responsible for the spontaneous breaking of chiral symmetry. In this respect, it is similar to Landau–Ginzburg effective theory. Random-matrix theory is based on the observation that spontaneous chiralsymmetry breaking is related to the density of small eigenvalues of the Dirac operator (λ QCD ). This relationship is expressed quantitatively by the Banks– ¯ Casher formula (5.117), ψψ = πρev (0). Here, ρev (0) is the density of small (but nonzero) eigenvalues (per unit λ and per unit four-volume, V4 ) of the Euclidean Dirac operator in the thermodynamic limit V4 → ∞. The dynamics of these eigenvalues can be described using a random matrix (of infinite size) in place of the Dirac operator. This approximation can be shown to give exact results in a certain limit – the mesoscopic limit [111, 112]. The random-matrix approximation consists of replacing the matrix X in the expression (10.5) for the Dirac matrix by a completely random matrix. It is surprizing that many properties of the small eigenvalues are correctly reproduced in this case. This means that small eigenvalues are very insensitive to the precise form of correlations in the matrix X introduced by its dependence on fluctuating gauge fields Aµ . We shall not attempt to prove this, but rather take a pragmatic
10.2 The random-matrix model
283
approach and see what happens if we do make such an approximation. The virtue of the random-matrix model is that it is solvable analytically. 10.2.1 A random-matrix-model description of the QCD phase diagram We write
Z RM =
N D X exp − 2 tr(X X † ) det Nf (D + m) σ
(10.8)
/ − µγ0 : where D is the 2N × 2N matrix approximating the Dirac operator D 0 iX − iC (10.9) D= 0 iX † − iC The random matrix X has dimensions N × N . The total dimension of D is 2N . This is the number of small eigenvalues, which is proportional to V4 . In QCD we expect N to be approximately equal to the typical number of instantons (or anti-instantons) in V4 ; therefore, N /V4 ≈ n inst ≈ 0.5 fm−4 [113]. The matrix C is deterministic and describes the effects of temperature and chemical potential [114]. In the simplest (and original) T = 0, µ = 0 model [115], the choice C = π T describes the effect of the smallest Matsubara frequency. As noted in [116], it is possible to simulate the effects of the eigenvalue correlations induced by the pairing of instantons and anti-instantons into molecules by choosing a more general form for the diagonal matrix C with elements Ck , which are (increasing) functions of T . In the T = 0, µ = 0 model of [65], C = iµ describes the effect of the chemical potential. Here we consider the more general case T = 0, µ = 0. Although we do not know the detailed dependences of the elements of C on T and µ, we understand that T primarily affects the real (i.e. Hermitian) part of C and µ affects the imaginary (i.e. anti-Hermitian) part. We shall adopt the following approximate form for this dependence: Ck = aπ T + ibµ for one half of the eigenvalues and Ck = − aπ T + ibµ for the other half with a and b dimensionless parameters.2 This form accounts for the fact that there are two smallest Matsubara frequencies,3 +π T and −π T . Such a linear Ansatz for C is certainly very naive, but it probably would be pointless to refine it while accepting the other crude approximations we have already made. This form reflects our understanding of the properties of C sufficiently well. The chiral condensate is calculated as 1 ∂ ln Z RM ¯ ψψ = (10.10) Nf V4 ∂m 2 These parameters are intended to reflect the degree of overlap and correlation between instan-
tons and anti-instantons. One can therefore predict that a and b are smaller than 1. We shall estimate the values of a and b below. 3 Alternatively, this form preserves the relation γ D = 0 at µ = 0. 0
284
Effective Lagrangians and Models
¯ 0 ≈ 2 fm−3 at T = µ = m = 0. The only Current algebra fixes the value of ψψ dimensional parameter remaining in the partition function, Z RM , is the variance of the random matrix, σ . Thus, N n inst ¯ 0 = constant × ψψ ≈ constant × (10.11) V4 σ σ The dimensionless constant will be found below and is equal to 2. This fixes the value of σ ≈ 0.5 fm−1 ≈ 100 MeV. It is convenient to use σ as a unit of mass in the model and also absorb the coefficients πa and b into T and µ. In other words, we measure m in units of σ , T in units of σ/(πa), and µ in units of σ/b. The N → ∞ (i.e. thermodynamic) limit of the partition function, Eq. (10.8), can be found in the following way. Let’s first sketch the strategy of the calculation and then carry out the details below. First, write the determinant in Eq. (10.8) as an integral over Grassmann N vectors (quarks). Second, perform the now Gaussian integration over X . As a result, one is left with an integral over Grassmann N vectors, which appear in the exponent in a quadrilinear form. Third, to make this integration Gaussian, introduce integrations over auxiliary Nf × Nf matrices φ. Finally, on performing the Gaussian integral over Grassmann N vectors, one arrives at Eq. (10.16). The limit N → ∞ in Eq. (10.16) is given by the saddle point of the integrand. Now let’s perform the above steps explicitly. We must introduce Grassmann variables that carry two indices. The index i = 1, . . ., N – the same as the index numbering the rows and columns of X . This index corresponds to color as well as space indices of the original Dirac matrix (from the instanton point of view, it counts instanton zero modes). To account for the power of the determinant, there should be Nf such Grassmann variables, and we label them with the index a = 1, . . ., Nf . These are simply quark-flavor indices. We need two such doubleindexed variables, since the matrix D is 2N × 2N . We denote them #L and #R . The subscripts L and R refer to chirality. Concluding this step, we write (σ = 1 in our units now) Z RM = D X exp[−N tr(X X † )] †a †a × D# exp #L (iX − iC)#La + #R (iX † − iC)#Ra †a †a (10.12) × exp #L m#Ra + #R m#La where we suppressed “color/space” indices i of # and X to focus on flavor indices a. The integration over matrix elements of X can be done now using this formula, which is valid for arbitrary matrices A and B: D X exp[−tr(X X † )] exp[tr(AX ) + tr(B X † )] = exp[tr(AB)] (10.13)
10.2 The random-matrix model
285
which is easy to derive by writing X = Re X + i Im X and integrating over real and imaginary parts of the matrix elements X using standard Gaussian integration rules. Thus in Eq. (10.12) the terms linear in X collapse into a four-fermion term: 1 †a b †b a Z RM = # # #R #L D# exp N L R †a †a × exp − i#L C#La − i#R C#Ra †a †a (10.14) × exp #L m#Ra + #R m#La Note that, in order to write the four-fermion term in Eq. (10.14), we had to permutate some # variables, which affects the signs of the terms above. Now we shall break the four-fermion term into bilinears, but using the Nf × Nf †a †a matrices such as #L #Rb , instead of the N × N matrices such as #La #L . This requires a new auxiliary complex matrix φ ab : † † † Dφ exp[−N tr(φφ )] D # exp[tr(#L φ#R + #R φ † #L )] Z RM = †
†
× exp(−i#L C#L − i#R C#R ) †
†
× exp(#L m#R + #R m#L )
(10.15)
In Eq. (10.15) we suppressed flavor indices a and b. Finally, on integrating over # we obtain, taking into account the fact that C has two kinds of diagonal elements, ±T − iµ, Z RM = Dφ exp[−N tr(φφ † )] φ + m µ + iT φ + m µ − iT N /2 N /2 det × det µ + iT φ † + m µ − iT φ † + m = Dφ exp[−N '(φ)] (10.16) where
1 ln{[(φ + m)(φ † + m) − (µ + iT )2 ] 2 † 2 × [(φ + m)(φ + m) − (µ − iT ) ]}
'(φ) = tr φφ † −
(10.17)
The integration in Eq. (10.16) is performed over 2 × Nf × Nf variables, which are the real and imaginary parts of the elements of the complex matrix φ. In the limit N → ∞ this integral is determined by a saddle point of the integrand or,
286
Effective Lagrangians and Models
alternatively, the minimum of '(φ): lim
N →∞
1 ln Z RM = − min '(φ) φ N
(10.18)
The function '(φ) is an effective potential for the degrees of freedom describing the dynamics of the chiral phase transition. One can see that the value of φ at the minimum, i.e. the equilibrium value φ, gives us the value of the chiral condensate, Eq (10.10): 1 N ¯ ψψ = 2 Re trφ (10.19) Nf V4 σ (cf. Eq. (10.11)). For real m, it is reasonable to expect that the minimum occurs when φ is a real matrix proportional to the unit matrix. With this assumption, we need to find only one real parameter, φ, which minimizes the potential: 1 2 2 2 2 2 ' = Nf φ − ln{[(φ + m) −(µ + iT ) ][(φ + m) −(µ − iT ) ]} (10.20) 2 This is not a simple φ 6 potential, but it has remarkably similar properties. In particular, the condition ∂'/∂φ = 0 gives a fifth-order polynomial equation in φ. The symmetry plane m = 0 Let us first consider the symmetry plane m = 0. Then the equation ∂'/∂φ = 0 always has one trivial root, φ = 0. The remaining four roots are the solutions of a quartic equation, which has the form of a quadratic equation in φ 2 : φ 4 − 2 µ2 − T 2 + 12 φ 2 + (µ2 + T 2 )2 + µ2 − T 2 = 0 (10.21) Above the second-order line, i.e. in the high temperature phase, Eq. (10.21) does not have real roots (since φ 2 < 0 for each pair of roots). This corresponds to the fact that the potential ' has only one minimum at φ = 0 (i.e. the trivial root). On the second-order line, a pair of roots of Eq. (10.21) becomes zero, i.e. the potential is ' ∼ φ 4 near the origin. This means that, on the second-order line, (µ2 + T 2 )2 + µ2 − T 2 = 0
(10.22)
The second-order line ends when the remaining pair of roots also becomes zero, i.e. the potential becomes ' ∼ φ 6 near the origin. This happens when µ2 − T 2 +
1 2
=0
(10.23)
on the second-order line. The condition Eq. (10.23), together with Eq. (10.22), determines the location of the tricritical point in the T –µ plane: 1 √ 1 √ T3 = 2 + 1 ≈ 0.776 and µ3 = 2 − 1 ≈ 0.322 (10.24) 2 2
10.2 The random-matrix model
287
The equation for the triple line is obtained from the requirement that the depth of the minima in ' at φ given by the pair of solutions of Eq. (10.21) (furthest from the origin) should coincide with the depth at the origin φ = 0. The equation for the triple line is therefore 1 1 1 1 + 1 − 16µ2 T 2 2 2 2 2 µ − T + + 1 − 16µ T − ln 2 2 2 2 + ln(µ2 + T 2 ) = 0
(10.25)
In particular, when T = 0, we obtain the elementary equation µ2 +1 + ln µ2 = 0 whose solution is µ ≈ 0.528 [65]. This is the value of µ1 . On setting µ = 0 in the equation for the second-order line, Eq. (10.22), we find Tc = 1 [115]. We recall that the units of T and µ depend on the unknown dimensionless parameters a and b. However, these unknown factors cancel out from the ratios T3 µ3 ≈ 0.78; ≈ 0.61 (10.26) Tc µ1 Taking Tc = 160 MeV and µ1 Nc = 1200 MeV, we find that T3 ≈ 120 MeV and µ3 Nc ≈ 700 MeV.4 Note that the second-order line, Eq. (10.22), marks the location of the points on the phase diagram where the symmetric minimum φ = 0 disappears, i.e. turns into a maximum. On continuing this line below point T3 , we obtain the location of spinodal points (see Fig. 10.1). In the region between this line of spinodal points and the first-order phase-transition line, the chirally symmetric phase φ = 0 can exist as a metastable state. Such a state can be reached by supercooling, and it is unstable against the nucleation of bubbles of the broken phase φ = 0. A similar line with equation 4T µ = 1, the superheating line, and the supercooling line, Eq. (10.22), bound the region around the first-order phase-transition line where the potential '(φ) has three minima. All these lines meet at the tricritical point as shown in Fig. 10.1. Away from the m = 0 plane Away from the symmetry plane m = 0, the expressions for the wing surfaces and the wing lines become rather lengthy. The principle, however, remains simple. The minima of the potential, ', satisfy the equation ∂'/∂φ = 0 and are given by (three out of five) roots of a fifth-order polynomial. On the wing surface, the depth '(φ) in a pair of adjacent minima is the same. On the wing line, the two adjacent minima fuse into one. In terms of the roots of the polynomial, three roots coincide (two minima and one maximum). In other words, the potential is ' ∼ (φ − φ)4 on the wing line and near the minimum. 4 We remind the reader that we denote by µ the chemical potential per quark, therefore µ = B µNc . With these values for Tc and µ1 , the parameters a and b have the values a ≈ 0.2 and
b ≈ 0.13.
288
Effective Lagrangians and Models
second order tricritical point spinodal
spinodal
first order (triple line)
Figure 10.1. The phase diagram in the symmetry plane (m = 0) near the tricritical point with the corresponding shapes of the effective potential ' indicated schematically. The dashed (spinodal) lines bound the region where both phases (chirally symmetric and broken) can exist, one as a stable phase and one as a metastable phase. Within this region the transition from one phase to another occurs through nucleation (e.g. one phase boils), whereas outside, provided that the system has been superheated/supercooled carefully, it would occur via a process known as spinodal decomposition.
1
T
0.8 0.6
m
0.4
0.3
0.2 0
0.2 0.1 0.1
0.2
µ
0.3
0.4
0.5
0.6
0.7
Figure 10.2. The phase diagram of QCD with two light flavors of mass m as calculated from the random-matrix model. The almost-parallel curves on the wing surface are cross sections of this surface with m = constant planes. The units of m are σ ≈ 100 MeV, those of T are Tc ≈ 160 MeV, and those of µ are µ1 Nc /0.53 ≈ 2300 MeV, with the choices of Tc and µ1 from the text. The resulting phase diagram is plotted in Fig. 10.2. One can see, as expected from mean field theory near the tricritical point, that the wing lines together with the triple line approach the tricritical point with the same slope as the second-order line but from the other side. The critical exponents near the tricritical point given by the random-matrix model can also be seen to coincide, as expected, with the mean-field exponents of Eqn. (8.78), (8.79), and (8.80). The corresponding dependences
10.2 The random-matrix model
¯
ψψ
289
1.2 1 0.8 0.6 0.4 0.2
0
0 0 0.2 0.4 0.6 0.8
T
0.2 1 1.2 1.4
0.8
0.4 0.6 µ
¯ ¯ 0 ≈ 2 fm−3 ) as a function of T Figure 10.3. The chiral condensate ψψ (in units of ψψ and µ in the random-matrix model. The units of T and µ are as in Fig. 10.2.
¯ of ψψ on T and µ in the random matrix model are shown in Fig. 10.3.
10.2.2 Chiral-symmetry breaking, Lee–Yang zeros, and an electrostatic analogy As observed by Lee and Yang [117], nonanalytic behavior of a thermodynamic quantity, including, in the present case, the discontinuity in the value of an order parameter in the thermodynamic limit, is caused by the coalescence of zeros of the partition function to form a boundary crossing the relevant parameter axis. In the case of QCD, the signature of spontaneous chiral-symmetry breaking is ¯ the discontinuity in ψψ as m is varied along the real axis and crosses m = 0. This discontinuity is equal to 2πρ(0), where ρ(0) is the density of the zeros on the imaginary-m axis near m = 0 in the thermodynamic limit. Indeed, in finite volume, when the number of degrees of freedom is finite, the partition function (10.3) is an analytic function of its parameters. The parameter we are interested in is the quark mass m, and it is easy to see that the partition function is a polynomial in m. It has a set of discrete zeros. The derivative of the logarithm of Z with respect to m is the order parameter, the chiral condensate: ¯ ψψ =
∂Z ∂m
(10.27)
¯ (To simplify equations that follow we chose a different normalization for ψψ compared with Eq. (10.19).) If ρ(m) is the distribution of the zeros in the
290
Effective Lagrangians and Models
complex plane of m, then ln Z (m) = dx dy ρ(x, y) ln(m − z), and ¯ ψψ(m) =
dx dy
where z = x + iy
ρ(x, y) m−z
(10.28)
(10.29)
It is helpful to look at this equation using the following simple electrostatic analogy. Consider a two-dimensional charge distribution ρ. Then Eq. (10.29) ¯ says that real and imaginary parts of ψψ are components of the 2-vector of the electric field strength created by ρ: ¯ ¯ E = (Re ψψ, −Im ψψ)
(10.30)
This analogy makes it easy to invert the relation (10.29): ρ=
1 1 ∂ ¯ ∇E = ψψ 2π π ∂z ∗
(10.31)
where ∂/∂z ∗ ≡ (∂/∂ x + i ∂/∂ y)/2. From Eq. (10.31) we see that ρ vanishes ¯ if the function ψψ is holomorphic (depends only on x + iy). The function ¯ ρ(z) is a measure of the nonanalyticity of ψψ(z) at a given point z. If the ¯ function ψψ(z) has a cut, the zeros form a one-dimensional distribution in the limit N → ∞. The discontinuity across the cut along the imaginary axis at m = 0 is the signature of spontaneous symmetry breaking, and the size of this discontinuity is related to the linear density of zeros at the origin, ρ(0): ¯ ¯ ψψ(+0) − ψψ(−0) = 2πρ(0)
(10.32)
Note how this relation resembles the Banks–Casher relation (5.117). The difference is that here ρ is the density of partition-function zeros rather than the density of the eigenvalues as in Eq. (5.117). In fact, when the chemical potential is nonzero, the Dirac operator is not even Hermitian, and its determinant is no longer real. As a result, when µ = 0, the density of Dirac eigenvalues can be defined straightforwardly only in quenched QCD, i.e. when the contribution of the (complex) fermion determinant to the measure is approximated by unity. Because the random-matrix model is solvable analytically, we can determine ¯ the locations of the cuts of the function ψψ, and therefore learn how the zeros of the partition function in the complex-m plane evolve with changes in temperature and chemical potential. A few typical cases are illustrated in Fig. 10.4. The results in this figure are actual locations of the zeros of the partition function (10.8) at finite N . It illustrates that the zeros do indeed line up in the limit N → ∞. At zero T and µ, the zeros form a line, which (in the
10.2 The random-matrix model
(a)
(b)
291
(c)
Figure 10.4. Zeros of the partition function of a finite-sized N -random-matrix model (10.8) in the complex-m plane calculated numerically for various values of T and µ. The calculation has been done for Nf = 1, but the N → ∞ limit is independent of Nf . The density of points is ¯ proportional to the strength of the cut (discontinuity in ψψ) in the N → ∞ limit. ¯ N → ∞ limit) means a cut on the Riemann sheet of the function ψψ along the imaginary axis. Raising the temperature pushes the zeros away from the origin along the imaginary axis until the density at the origin vanishes (continuously), the cut breaks in two, as in Fig. 10.4(a), and chiral symmetry is restored (cf. Eq. (10.32)). The chemical potential pushes the zeros away from the origin in the direction of the real axis until the cut splits in two, as illustrated in Fig. 10.4(b). Note that the density ρ(0) is finite just before the split. Therefore, the transition is of first order. Near the tricritical point, the split in the direction of the real axis (due to the chemical potential) occurs at the same time as the density ρ(0) vanishes (due to the effects of the temperature). This is illustrated in Fig. 10.4(c). 10.2.3 The quenched limit of QCD and the random-matrix model Why does the quenched approximation fail at µ = 0? In the quenched approx¯ imation we calculate the order parameter ψψ as the average over configurations with a weight from which the determinant is thrown out: ¯ quench (m) = tr(m − D)−1 ψψ
(10.33)
292
Effective Lagrangians and Models
where . . . denotes the integral over the gauge fields with the weight given by the pure gluon action only, without the determinant. The numerical Monte Carlo calculation, which is now possible for any µ since the determinant is absent, would just mean generating configurations with the positive weight and calculating tr(m − D)−1 averaged over such an ensemble of configurations. We then would like to assume that this simple calculation will give us the same result as the limit Nf → 0 of the complete, unquenched calculation. In fact, it does not. Finite Nf and Nf → 0 Let us first see where the limit Nf → 0 of the unquenched calculation takes us. In the random-matrix model this limit is simple. In fact, there is no dependence on Nf at all, and therefore the partition-function zeros always line up along the imaginary axis and the density ρ(0) at the origin vanishes only when µ exceeds a finite value of µ as we observed above. In other words, the singularities of the ¯ function ψψ(m) are cuts. To see this explicitly, let us consider the simpler case of T = 0. We can write the partition function in the following way: Z = det Nf (m − D)
(10.34)
using the same notations as in Eq. (10.33). In the random-matrix approximation the · denotes integration over X and it can be performed using the same sequence of transformation and auxiliary variables as in Eq. (10.16). The value of ¯ ψψ(m) in the N → ∞ limit is determined by Eq. (10.19), which, in our cur¯ rent normalization becomes simply ψψ = φ, with φ given by the saddle point of the effective potential: '(φ) = tr{φφ † − ln[(φ + m)(φ † + m) − µ2 ]}
(10.35)
(We remind the reader that the real and imaginary parts of φ should be considered as independent complex variables.) Assuming that the saddle point is achieved when φ is a diagonal matrix, we find the following equation for φ: φ + m = φ[(φ + m)2 − µ2 ]
(10.36)
The solution of this cubic equation is an analytic function of m, which has three Riemann sheets, connected by cuts. The correct Riemann sheet is selected by the condition that φ → 1/z when z → ∞, which follows from dx dy ρ = 1 and Gauss’ theorem in the electrostatic analogy of Section 10.2.2. The cuts evolve with µ as shown schematically in Fig. 10.5. The quenched result ¯ quench (z) in the quenched Now, what are the singularities of the function ψψ ¯ quench (z) can be related to approximation? From Eq. (10.33) the value of ψψ
10.2 The random-matrix model µ=0
µ2 > 1/8
µ > µc
(b)
(c)
293
z
(a)
¯ Figure 10.5. The evolution of cuts of the function ψψ(z) with increasing µ. µc = 0.53. 2
2
1
1
0
0 0 0.1 0.2 0.3 0.4 0.5
0 0.2 0.4 0.6 0.8 1
Figure 10.6. Eigenvalues of 20 random 100 × 100 matrices on the complex-z plane at two values of µ2 , 0.06 (left) and 0.40 (right). The line is the boundary of the ρev = 0 region calculated analytically. the averaged distribution of the eigenvalues ρev of the matrix D: ρev (x, y) ¯ , z = x + iy ψψquench (m) = dx dy m−z
(10.37)
This distribution is easy to obtain numerically and, in the random-matrix model, it has the form shown in Fig. 10.6. One sees that, as soon as µ is different from zero, the eigenvalues spread into a two-dimensional distribution. This is not a surprise, since the matrix D loses its anti-Hermiticity properties. However, on comparing Fig. 10.6 with Fig. 10.5 we conclude that singularities of the func¯ quench (m) are qualitatively different from line-like (cut) singularities tion ψψ ¯ of ψψ(m) at any finite Nf . In particular, the linear density of eigenvalues vanishes immediately when µ differs from zero. In other words, chiral simmetry is restored at µ = 0 in the quenched approximation, instead of at finite µ = µc . The limit Nf → 0 is not the same as the quenched approximation at finite µ. What does this mean? It means that the quenched approximation does not approximate, even qualitatively, the behavior of the full theory. This can be further understood if we find a theory that has the quenched approximation as its Nf → 0 limit.
294
Effective Lagrangians and Models
Conjugate quarks The theory which has the quenched approximation as its Nf → 0 limit is given by the following partition function: Z = det Nf /2 (m − D)(m ∗ − D † )
(10.38)
On comparing this partition function with the one given by Eq. (10.34), we note that the determinant in Eq. (10.38) is manifestly positive. This is achieved by introducing a set of conjugate quarks – fermion fields whose Dirac matrix is the Hermitian conjugate to that of normal quarks. For m real, this means that the sign of the chemical potential is reversed, and for µ = 0 conjugate quarks are identical to normal quarks, and are not necessary in order to obtain the correct quenched limit. Instead of providing a formal proof, which can be found in the mathematical literature on non-Hermitian random matrices [118], we shall demonstrate that the partition-function zeros in Eq. (10.38) have the density ρ(z) which coincides with ρev (z) that we find in the quenched calculation using Eq. (10.33) and which we see in Fig. 10.6. The advantage of the random-matrix model (over full QCD calculation) is that we can find ρ(z) analytically. The integration over matrix elements of X (denoted by the brackets) in Eq. (10.38) is similar to the previous calculation we have done in Eqs. (10.34) and (10.16). However, instead of one auxiliary Nf × Nf matrix field φ we have to introduce four Nf /2 × Nf /2 matrices, which we denote a, b, c, and d: z + a µ 0 id µ ic 0 z + a† Z = Da Db Dc Dd det N † ∗ † 0 z +b µ id 0 µ z∗ + b ic† × exp[−N (|a|2 + |b|2 + |c|2 + |d|2 )]
(10.39)
On differentiating with respect to m, we find that the value of the condensate ¯ ψψ can be found by taking the value of a at the saddle point of the integrand in Eq. (10.39). The set of solutions of the saddle-point equation is richer in this case. There is a solution with c = d = 0. In this case the conjugate quarks decouple and ¯ we simply obtain the same holomorphic function ψψ = a = b as before. However, there is another solution in which the condensates c and d are not zero! Then the function φ = a is not holomorphic and therefore ρ = 0. This saddle point dominates the integral at small z for 0 < µ < 1. ¯ The condensates c and d are bilinears of the type ψχ, mixing original ψ and conjugate χ quarks. These condensates do not break the original chiral symmetry but break a spurious (replica-type) symmetry involving both original and conjugate quarks. We shall encounter similar condensates carrying baryon number when we discuss the SU(2) model of QCD with quarks in Section 10.3. In
10.3 Two-color QCD and effective Lagrangians
295
the quenched theory, as in SU(2) QCD, the original chiral symmetry is always restored at µ > 0. The spurious symmetry is spontaneously broken for µ < 1 and is restored for µ > 1. The boundary of the ρev = 0 region is given by y 2 = (µ2 − x 2 )−2 [4µ4 (1 − µ2 ) − (1 + 4µ2 − 8µ4 )x 2 − 4µ2 x 4 ]
(10.40)
It is plotted in Fig. 10.6 for comparison with numerical data. The baryonic condensates c and d inside of the “blob” are given by |c|2 = |d|2 =
µ2 x2 y2 2 − µ − − µ2 − x 2 4(µ2 − x 2 )2 4
(10.41)
On the boundary, Eq. (10.40), they vanish and the two solutions (holomorphic and nonholomorphic) match. In the outer region c = d = 0 and a is the solution of the cubic equation (10.36). Inside the “blob” the value of a is given by ¯ ψψ =a=
iy x 1 −x− 2 2 2µ −x 2
(10.42)
and the density of the partition-function zeros (and of the eigenvalues in the quenched case) (10.31) is 2 x + µ2 1 − 1 (10.43) ρ = ρev = 4π (µ2 − x 2 )2 To appreciate the nontriviality of this result one should notice that expression (10.33) appears to depend only on m, not on m ∗ ! The limit Nf → 0 must be taken with great care. 10.3 Two-color QCD and effective Lagrangians Effective Lagrangians are used in high-energy physics to extract aspects of the low-energy solution of the theory by emphasizing symmetries and their realizations, through the Goldstone mechanism and current algebra. The approach is akin to spin-wave analyses of condensed-matter systems. Although its reach is limited to the lowest momenta and energies in the system, it can produce nontrivial results that are strongly constrained by symmetries and their realizations. Since many such results are nonperturbative, they are difficult to derive from other approaches that begin with the microscopic Lagrangian and its interacting quarks and gluons. For energies and momenta less than or of the same order as the pion mass, the effective-Lagrangian method, and chiral perturbation theory in particular, may be invaluable. Two-color QCD has some peculiar properties, of which some present advantages and others present disadvantages. Since its representations are pseudo-real,
296
Effective Lagrangians and Models
as we will discuss in detail below, its quark–anti-quark meson spectrum is identical to its two-quark “baryon” spectrum. Therefore, its lowest-lying “baryon” has the same mass as its pion and vanishes in the chiral limit. When it is placed into an environment with a chemical potential, it will experience a transition to a diquark condensate at a critical quark chemical potential of µc = m π /2 because at this point the sum of the lowest energy levels of the two quarks, −2µ, cancels out the energy of the lightest “baryon” and triggers condensation. This critical chemical potential vanishes when the bare mass of the theory’s quarks is taken to zero. This is qualitatively different from QCD, in which quark–anti-quark mesons and three-quark baryons are distinct, both through dynamics and through quantum numbers. In particular, a small chemical potential should hardly affect the theory’s meson spectroscopy since mesons carry no baryon number. The nucleons are the lightest, colorless, physical fermion states with the lowest physical baryon number and, when µ exceeds the gap in this channel, 3µc = MN , a transition should occur. At very high chemical potential one expects the BCS mechanism to prevail and 3¯ Cooper pairs to condense off a sharp Fermi surface and form a color superconductor. The mass of the nucleon is large in the chiral limit of the model where m π → 0, so the separation of the two energy scales, chiral symmetry on the one hand and the formation of a color superconductor on the other, should be clear. We shall see that, in the SU(2) version of QCD, one predicts a superfluid phase of diquark condensation at µc = m π /2. In the diquark phase, the baryon number is spontaneously broken, so one expects a single Goldstone boson to appear. Since the diquarks are meson-like and colorless, the phase resembles a superfluid rather than the color superconductor expected in SU(3) QCD. Note that the Goldstone bosons of baryon-number breaking are exactly massless even for a theory with nonvanishing quark masses. In sharp contrast, the color-diquark phase of the SU(3) model is similar to the Higgs phase of the electroweak model because in both cases the condensate breaks a global part of the gauge symmetry. The diquarks of the SU(3) theory transform nontrivially under the gauge group, like the Higgs particle of electroweak physics and the Cooper pairs of electronic superconductivity, and, when they condense, one expects the formation of a vacuum with color screening and no exact Goldstone excitations. The energetics of the superfluid phase of the SU(2) model and the superconducting phase of the SU(3) model appear to be very different. It is particularly interesting that all of the physics in the SU(2) model is within reach of effective Lagrangians and chiral perturbation theory, in particular. The phase transition itself, from hadronic matter to a superfluid phase at µc = m π /2, is also predicted by these methods. This is a unique situation in which an inherently perturbative method of analysis like chiral perturbation theory can address a phase transition in full detail and with good control.
10.3 Two-color QCD and effective Lagrangians
297
We will discuss the SU(2) transition thoroughly and not speculate further on its differences from the physics of the SU(3) model. Perhaps we will learn things relevant to color superconductivity, perhaps not. Superfluid phases have been argued for in the SU(3) model, in addition to its superconductivity, and pion condensation and kaon condensation have been modeled. There is a good chance that some of the SU(2) analysis will apply to some aspects of the physics of QCD. 10.3.1 QCD inequalities and the nature of the ground state We begin with the fermion Lagrangian,
L=
Nf (ψ¯ f γν Dν ψ f + µψ¯ f γ0 ψ f + m q ψ¯ f ψ f )
(10.44)
f =1
where we shall assume that the quark mass is small and consider the chiral limit m q → 0 in much of this discussion. First, we are interested in establishing general constraints on the pattern of symmetry breaking here. What quark bilinear can develop a vacuum expectation value? What are the quantum numbers of the lightest meson? We know from ancient discussions in Euclidean QCD [119] that the pion propagator bounds all others from above. From this one can prove that the pion ¯ is a pseudo-scalar and therefore the condensate must be a scalar ψψ. This ¯ establishes that the condensate is not a pseudo-scalar, like ψγ5 ψ, because this would give rise to a 0+ Goldstone boson. The argument proceeds as follows. Consider the Dirac operator for µ = 0, D = γ (∂ + iA) + m q . which satisfies the identity γ5 Dγ5 = D †
(10.45)
Now consider the composite propagator for a meson. Generate the meson with ¯ the operator M = ψψ, where is a generic flavor or spinor matrix. The expectation value of the composite propagator is M(x)M(0)ψ,ψ,A = tr[S(x, 0)S † (0, x)] A ¯
(10.46)
where we have indicated which integrals in the path integral have been done, and S is the fermion propagator, S = D −1 . Taking = γ5 , we can use Eq. (10.45) to write Eq. (10.46) as tr[S(x, 0)S(0, x)] A . This quantity is an upper bound on the general composite propagator, as we can see using the Schwartz inequality, tr[S(x, 0)S(0, x)] = tr[S(x, 0)γ5 S † (x, 0)γ5 ] ≤ tr[S(x, 0)S † (x, 0)] (10.47)
298
Effective Lagrangians and Models
On taking expectation values, we obtain the desired inequality for the correlation function and, therefore, for the meson masses. Implications and consequences of these correlation-function inequalities have been discussed in the literature to shed light on chiral-symmetry breaking and spectroscopy [119]. What parts, if any, of these observations generalize to nonzero chemical potential? In the case of SU(3) color Eq. (10.45) does not hold true when the chemical potential is nonzero and D = γ (∂ + iA) + µγ0 + m q . However, in the case of SU(2) color there is a generalization of Eq. (10.45) that proves useful. Use the fact that SU(2) is a pseudo-real group. In particular, the second Pauli matrix satisfies T2 Ta T2 = −Ta∗ . By combining this with charge conjugation, C = iγ0 γ2 (C 2 = 1 and Cγν C = −γν∗ ), we can generalize Eq. (10.45) to γ5 C T2 Dγ5 C T2 = D ∗
(10.48)
It is particularly illuminating to consider the propagator of the SU(2) diquark, Mψψ = ψ T C T2 γ5 ψ, which is a 0+ , I = 0 (anti-symmetric in flavor) operator, for which Mψψ (x)Mψψ (0)ψ,ψ T ,A = tr[S(x, 0)C T2 γ5 S T (x, 0)C T2 γ5 ] A = tr[S(x, 0)S † (x, 0)] A
(10.49)
This relation becomes the basis of showing that the correlator of Mψψ = ψ T C T2 γ5 ψ bounds the correlator of any other meson like ψ T C T2 γ5 ψ. Therefore, it is the 0+ , not the 0− , state which is the lightest. In addition, if there is condensation, it has to be that of the diquark operator ψ T C T2 γ5 ψ which leaves parity invariant.
10.3.2 Symmetries and Goldstone bosons in the diquark phase Let’s consider the SU(2) color version of QCD with two flavors, massless “up” and “down” quarks. We want to set up a convenient formalism for it so that we can easily read off that the theory at vanishing chemical potential, µ = 0, in the chiral limit m q = 0, has an SU(4) global symmetry, which, when chiral symmetry breaks dynamically, is reduced to Sp(4), producing five Goldstone bosons. The theory has two “extra” Goldstone bosons compared with SU(3) color and we want to understand their origin in the pseudo-real nature of the SU(2) color group. When the chemical potential is turned on, we will also see that the theory’s global symmetry is reduced to SU(2) × SU(2) × U(1), which breaks upon diquark condensation to SU(2) × SU(2), producing one Goldstone boson corresponding to baryon-number breaking. In addition, there are four pseudoGoldstone bosons, each with a mass of precisely 2µ, to leading order [120]. We begin by writing the fermion piece of the SU(2) color Lagrangian in terms
10.3 Two-color QCD and effective Lagrangians
299
of left- and right-handed quark fields, ψ=
qL qR
(10.50)
and define the extensions of the Pauli matrices, σν = (−i, σk ),
σ¯ ν = (−i, −σk )
(10.51)
Now the Lagrangian can be written †
†
¯ / ψ = qL iσν Dν qL + qR iσ¯ ν Dν qR L = ψiD
(10.52)
where Dν = ∂ν + Ta Aaν is the covariant derivative for SU(2) color. It will prove handy to recall that Ta∗ = −T2 Ta T2 = TaT for the Pauli-matrix generators of SU(2) color and verify that the covariant derivative satisfies DνT = −T2 Dν T2
(10.53)
Now we can define conjugate quarks, q˜ = σ2 T2 qR∗ ,
q˜ † = qRT T2 σ2
(10.54)
The Lagrangian can now be written L = q † iσν Dν q + q˜ † iσν Dν q˜ = # † iσν Dν #
(10.55)
which has the global SU(4) flavor symmetry advertised above. In this expression # is the Weyl spinor, 1 q q2 q (10.56) #= = q˜ q˜ 1 q˜ 2 Note that q and q˜ have opposite baryon numbers but identical chiral U(1)A charges. It is useful to write various quark bilinears in this language to determine their transformation properties. For example, the chiral condensate is 1 0 1 ¯ ¯ # + h.c. (10.57) ψψ = #σ2 T2 −1 0 2 which we identify with the sigma meson. It is a member of an SU(4) 6-plet. The remaining five members of the 6-plet are three pions, a scalar diquark, and a scalar anti-diquark.
300
Effective Lagrangians and Models
The baryon charge reads ¯ 0ψ = ψγ
† qL qL
† + qR qR
†
T †T
= q q + q˜ q˜
†
†
= q q − q˜ q˜ = #
†
1 0
0 # −1 (10.58)
which indicates that adding a chemical potetial to the Lagrangian breaks its algebraic symmetry down to SU(2)L × SU(2)R × U(1)B . When this symmetry breaks through diquark condensation, there will be one Goldstone boson corresponding to the breaking of U(1)B . We will see that the other four “pseudo-Goldstone” bosons form a (2, 2) multiplet of SU(2)L × SU(2)R and acquire a common mass of 2µ. 10.3.3 Effective-Lagrangian construction Now we can construct the effective Lagrangian to describe the low-energy features of this model. We will concentrate on the low-lying mesons and diquarks and restrict their interactions, guided by the symmetry considerations we have laid out above. To do this we will use # because it transforms simply, # → U #, under SU(4). We begin with the quark–quark bilinears that compose the low-energy states, ∼ ## T σ2 T2
(10.59)
which is anti-symmetric both in spin and in color indices and transforms under SU(4) according to → U U † , as a flavor SU(4) 6-plet. The Lagrangian that can describe the low-energy aspects of the theory and is invariant under flavor SU(4) is a nonlinear sigma model, L 1 = f π2 tr(∂ν
†
∂ν )
(10.60)
We want to generalize this discussion to nonzero chemical-potential. The chemical-potential term in the microscopic theory reads 1 0 † ¯ 0 ψ = µ# # (10.61) µψγ 0 −1 which breaks the SU(4) flavor symmetry because it distinguishes between quarks and conjugate quarks. However, it is useful to re-interpret this symmetry. Write this term as µ# † iσν Bν # where Bν is the SU(4) matrix Bν = δ0ν
1 0
0 −1
(10.62)
= δ0ν B
(10.63)
10.3 Two-color QCD and effective Lagrangians
301
and give Bν the SU(4) transformation property Bν → U Bν U †
(10.64)
With this rule, the microscopic Lagrangian # † iσν Dν # + µ# † iσν Bν #
(10.65)
becomes invariant. Now we can turn our attention to the construction of the effective Lagrangian which has the same transformation properties as Eq. (10.65). It must also be invariant under these extended transformations and its dependence on µ must be unique and not involve any additional couplings. Since Bν transforms nontrivially under SU(4), a term linear in Bν cannot appear in the effective Lagrangian. The lowest-order nontrivial term would be proportional to µ2 tr( BνT † Bν ). Such a term will produce pseudo-Goldstone boson masses that are linear in µ, but what should the overall coefficient of this term be? The answer must be unique, because there is no freedom in how to introduce the chemical potential into the microscopic, quark-level Lagrangian. We shall see in our discussions of the lattice version of QCD at nonzero µ that the chemical potential is introduced like a timelike imaginary Abelian gauge field. This inspires us to extend the symmetry properties of Bν to a local, gauge symmetry in the microscopic theory: we require that the theory be invariant under the transformations Bν → U Bν U † +
1 U ∂ν U † µ
(10.66)
The effective Lagrangian must be invariant under this local symmetry, so the derivatives in the effective L of Eq. (10.60) must become (10.67) Dν = ∂ν + µ Bν + BνT † † † T † Dν (10.68) = ∂ν −µ Bν + Bν The effective Lagrangian becomes L eff = f π2 tr(Dν
†
Dν )
(10.69)
Finally, we should incorporate the effect of a bare quark mass into the effective Lagrangian. We need this term in order to pick out the ground state and to discuss the phase transition to a diquark condensate at nonzero µ. The bare mass in the microscopic theory is 1 1 0 −1 T ¯ m ψψ = m# σ2 τ2 # + h.c. = − # T σ2 τ2 M# + h.c. (10.70) 1 0 2 2
302
Effective Lagrangians and Models
where the mass matrix M is given by ˆ M = m M,
Mˆ =
0 1 −1 0
(10.71)
We see that the bare-mass term breaks the SU(4) symmetry. The full SU(4) invariance can be restored if M is given the transformation property # → U #,
M → U ∗ MU † .
(10.72)
This extended symmetry must also be manifest in the effective theory, M → U ∗ MU †
→ U U T,
(10.73)
The lowest-order term induced by the quark mass must therefore have the form L qmass = −G Re[tr(M )] = −mG Re[tr( Mˆ )].
(10.74)
One can view Re tr( Mˆ ) as a generalized cosine of the angle between unitary matrices and Mˆ † . It is maximal when is aligned with Mˆ † . Therefore the direction of minimizing Eq. (10.74) is given by 0 1 † ˆ (10.75) c = M = −1 0 The mass term comes with a phenomenological coefficient, which we denote by G. It is given by the derivative of the vacuum energy with respect to m and is, therefore, proportional to the chiral condensate in the chiral limit m → 0 at µ = 0, ¯ 0 /4 G = ψψ
(10.76)
The resulting Lagrangian with the mass term is L eff = f π2 tr(Dν Dν
†
) − mG Re tr( Mˆ )
(10.77)
Expanded to second order in the pion fields, it yields a spectrum with five degenerate pseudo-Goldstone bosons with masses given by the Gell-Mann–Oakes– Renner relation (10.78) m 2π = mG/ 2 f π2 We can use this relation to trade G for another parameter, m π , and write (10.79) L eff = f π2 tr(Dν Dν † ) − 2m 2π Re tr ( Mˆ ) T
which can be expanded, using the property L eff = f π2 tr(∂ν − 2 f π2 µ2 tr
†
∂ν ) − 4µf π2 tr(∂ν † Bν + BνT † Bν +
†
= − , to read
Bν ) BνT − 2 f π2 m 2π Re tr( Mˆ ) (10.80)
10.3 Two-color QCD and effective Lagrangians
303
10.3.4 Vacuum alignment, diquark condensation, the phase diagram, and scaling laws Our next task is the determination of the ground state and its symmetries as a function of µ [121]. Take the static pieces in L eff and write them in the form x2 f π2 m 2π T † ˆ − tr Bν Bν + Bν Bν − 2 Re tr( M ) (10.81) L static ( ) = 2 2 where we introduced the dimensionless variable x = 2µ/m π . We need to find those fields which minimize the static part of the effective Lagrangian. We have already observed that, when the chemical potential vanishes, the condensate direction is c = Mˆ † . However, when the chemical potential becomes very large, x → ∞, the best becomes iI 0 0 −1 , I = (10.82) d = 0 iI 1 0 This choice of the condensate direction is not unique. It is clear from the effective Lagrangian that we can rotate the condensate by the generator B and the effective Lagrangian remains unchanged. This degeneracy corresponds to the existence of a massless Goldstone boson. Now we need to determine the condensate for intermediate values of the chemical potential. The simplicity of this model produces a simple result: the condensate at intermediate values of x, call it α , is just a rotation from c to d, α
=
c
cos α +
d
sin α
(10.83)
where the “rotation” angle α varies form 0 at x = 0 to π/2 at x = ∞. The proof of this fact consists of some algebra, which will be done at the end of this chapter. Let’s concentrate on the physics first. On substituting α into the static piece of the effective Lagrangian, we find 2 2 2 x L static ( α ) = 2 f π m π [cos(2α) − 1] − 2 cos α (10.84) 2 which determines α as a function of x, α = 0, 1 cos α = 2 , x
when x < 1 when x > 1
(10.85) (10.86)
This is a major result of this model. It establishes that there is a second-order phase transition in the model at x = 1, which means that µc = m π /2. For
304
Effective Lagrangians and Models
µ < m π /2, the vacuum does not respond to the chemical potential at all. At µc = m π /2 a transition occurs and the direction of the condensate starts rotating from c to d . Such a state breaks baryon-number symmetry spontaneously and this breaking is accompanied by a Goldstone boson, a 0+ singlet. To understand the character of the phase transition, we should calculate the ground-state expectation values of some local observables, such as the chiral ¯ condensate ψψ, the diquark condensate ψψ, and the induced baryon number n B . According to lowest-order perturbation theory, ¯ ψψ =−
∂ E vac , ∂m
nB = −
∂ E vac ∂µ
(10.87)
To calculate the diquark expectation value, we need to put a diquark source into the Lagrangian. In the microscopic Lagrangian, such a source reads j T j T iI 0 # + h.c. −i ψ Cγ5 T2 I ψ + h.c. = − # σ2 T2 0 iI 2 2 j = − # T σ2 T2 Jˆ# + h.c. (10.88) 2 where we have defined Jˆ ≡
iI 0
0 iI
(10.89)
and I is the anti-symmetric matrix introduced earlier. This source term should ¯ source term, be added to our m ψψ j 1 ¯ − i (ψ T Cγ5 T2 I ψ + h.c.) = − # T σ2 T2 Mφ # m ψψ 2 2 where Mφ ≡ m Mˆ + j Jˆ = =
m 2 + j 2 ( Mˆ cos φ + Jˆ sin φ) m 2 + j 2 Mˆ φ
(10.90)
(10.91)
where tan φ = j/m. Finally, the effective Lagrangian with both sources is, re¯ peating the steps above when we were just considering the source m ψψ, L eff ( ) = f π2 [tr(Dν Dν
†
) − 2m 2π Re tr( Mˆ φ )]
(10.92)
Now we can complete Eq. (10.87) with the addition of the diquark condensate, ψψ = −
∂ E vac ∂j
(10.93)
10.3 Two-color QCD and effective Lagrangians
305
¯ Table 10.1 The values of the chiral condensate, ψψ, the diquark condensate, ψψ, and the baryon density n B in the two phases of the theory
Phase
¯ ψψ
µ < m π /2
¯ 0 ψψ ) *2 ¯ 0 mπ ψψ 2µ
µ > m π /2
ψψ 3
0
¯ 0 1− ψψ
nB )
mπ 2µ
1
*4
+
0
32µf π2 1 −
)
mπ 2µ
*4 ,
1
condensates
0.8
3 0.6
0.4
2
0.2
0
0
0.5
1 1.5 2 chemical potential/pion mass/2
2.5
3
Figure 10.7. The effective-Lagrangian prediction for the diquark condensation transition, showing magnitudes of the diquark condensate ψψ (1) and the chi¯ ¯ 0 = 4G as a function of 2µ/m π for ral condensate ψψ (2) in units of ψψ zero diquark source. The density of the baryon charge (3) is shown in units of 32 f π2 m π . Our last task is to write the effective Lagrangian at the minimum, E vac = L eff ( α ) = f π2 − 2µ2 tr α BνT α† Bν − 2m 2π Re tr( = −16 f π2 µ2 sin2 α − 4G(m cos α + j sin α)
†
φ
α)
(10.94)
On carrying out the differentiations, we find ¯ ψψ = 4G cos α,
ψψ = 4G sin α,
n B = 32 f π2 µ sin2 α (10.95)
The results for these vacuum expectation values are listed in Table 10.1 and are plotted in Fig. 10.7. Note a few interesting, elementary features. First, the observables are constants in the µ < m π /2 phase. This was assured by the fact that the vacuum does not change its direction for µ < m π /2. Therefore,
306
Effective Lagrangians and Models
the chiral condensate retains its µ = 0 value exactly until µc = m π /2. In addition, both the diquark condensate and the induced baryon number vanish exactly in the low-µ phase. However, at µc = m π /2 all of the observables develop nonanalytic dependences on x. In particular, the diquark condensate rises fromzero with a square-root √ singularity characteristic of a mean-field transition, −2 4 4 µ µ − (m π /2) ∼ 2 µ − m π /2, for µ greater than but very near m π /2. In addition, the induced baryon number rises linearly with µ in the critical region. Finally, in the high-µ phase the chiral condensate falls as x −2 . We have already observed that, in the mean-field approximation, the ground-state vacuum rotates from c to d as x grows beyond unity and we see that property reflected in the ¯ 2 + ψψ2 remains constant throughout the condensates. In fact, the sum ψψ transition. Now we return to some unfinished business. We need to verify that Eq. (10.83) describes the vacuum for all x. To begin, write out , which is an anti-symmetric unitary matrix, in terms of 2 × 2 blocks, A −C = (10.96) CT B where the anti-symmetric matrices A and B satisfy the unitarity constraints A A† + CC † = 1,
B B † + (C † C)T = 1,
AC ∗ = C B † (10.97)
Using these relations the static piece of the effective Lagrangian can be written just in terms of C, 1 1 1 2 2 2 † 2 L eff ( ) = 2 f π m π x tr C − 2 C − 2 −2 x + 2 (10.98) x x x 2 To minimize L eff ( ), we should take C = 1/x , and to satisfy the constraints Eq. (10.97), we can take A = B = iI 1 − 1/x 2 . Then, if we use the notation cos α = 1/x 2 , α will have the form Eq. (10.83). Note that, when x < 1, we have a special situation. The solution C = 1/x 2 is then inconsistent with the constraints Eq. (10.97) and we are forced to the solution C = 1, A = B = 0. So, for all x < 1, for any chemical potential below the critical value µc = m π /2, α reduces to c , the minimum with no chemical potential. There are other interesting features of this model that are worth mentioning. In particular, the spectroscopy of the low-lying pseudo-Goldstone states is constrained by chiral symmetry and the simplicity of the coupling of the model to the chemical potential. For example, in the absence of bare-quark masses, the curvature of the potential which determines these masses is just L curv = 2µ2 tr Bν2 − 2µ2 tr BνT † Bν (10.99)
10.4 Nonzero isospin chemical potential
307
and to find the mass matrix of the pseudo-Goldstone bosons we should expand in small, smooth fluctuations around the vacuum d , =U
dU
T
(10.100)
It is sensible to write U as an exponent of the various generators of SU(4) and consider first those generators which do not change d , call them Ta , and those that do, call them X b . The details are given in the literature [120] and show that there are ten Ta ’s and five X b ’s. One of the broken generators, called X 5 in the literature, is the baryon charge B and the fluctuations it generates in the ground state leave the ground-state energy unchanged. Therefore, its Goldstone particle is massless. This is an exact result for all quark bare masses and chemical potentials, because the baryon number is always a perfect symmetry of the local Lagrangian density. The other four X generators can be identified and their common mass is found to be exactly 2µ, unadorned by strong interaction. This final result is a consequence of the local symmetry which dictated how the chemical potential can enter the effective Lagrangian. 10.4 QCD at nonzero isospin chemical potential We have seen in Section 10.1 that the integrand of the partition function in QCD lacks positivity, which is the major obstacle to current attempts to simulate QCD in this regime using Monte Carlo methods. We have also seen in Section 10.2.3 that the quenched theory is a limit of a theory that has a positive measure/integrand in the partition function. This is due to the presence of conjugate quarks. Although unphysical, as they appear at first, they turn out to be present in two-color QCD, due to the pseudo-reality of the SU(2) group. In this section we discuss a regime of actual three-color QCD that contains conjugate quarks explicitly, and therefore can be simulated on the lattice. It also turns out that this theory can be analysed by the effective-Lagrangian methods of the previous section. This regime is QCD at finite isospin density [122]. Why should this regime be interesting? One of the questions we would like to address by studying QCD at finite baryon-number density is how the transition between hadronic and underlying quark–gluon degrees of freedom occurs as a function of a conserved number density, unlike the temperature transition. In QCD, isospin is such a conserved number, like the baryon charge, and it is natural to inquire what happens in QCD as a function of the isospin density n I , or the isospin chemical potential µI . Let us comment on the relevance of this regime to the real world. Nature provides us with nonzero-µI systems in the form of isospin-asymmetric matter (e.g. inside neutron stars); however, the latter contains both isospin density and baryon-number density. In contrast, the idealized system considered in this paper does not carry baryon number: the chemical potentials of the two light quarks,
308
Effective Lagrangians and Models
u and d, are equal in magnitude, |µI |/2, and opposite in sign. Such a system, strictly speaking, is unstable with respect to weak decays that do not conserve isospin, and, as we shall see, is also not electrically neutral and thus does not exist in the thermodynamic limit. However, since we are interested in the dynamics of the strong interaction alone, one can imagine that all relatively unimportant electromagnetic and weak effects are turned off. Once this is done, we have a nontrivial regime that, as we shall see, is accessible to present-day lattice Monte Carlo methods, while being analytically tractable in various interesting limits. As a result, the system we consider has the potential to improve substantially our understanding of cold dense QCD. This regime carries many attractive traits of two-color QCD, but is realized in a physically relevant theory – QCD with three colors. Our analysis here will closely parallel that of the previous section. 10.4.1 Positivity and QCD inequalities Since the fermion determinant of our theory is real and positive in Euclidean space, some rigorous results on its low-energy behavior can be obtained from QCD inequalities. At finite isospin density, µI = 0, D = γ (∂ + iA) + 12 µI γ0 τ3 + m, and Eq. (10.45) no longer holds, since the operation on the right-hand side of Eq. (10.45) changes the relative sign of µI . However, provided that m u = m d , interchanging up and down quarks compensates for this change of sign (the u and d quarks play the role of mutually conjugate quarks), i.e. τ1 γ5 Dγ5 τ1 = D†
(10.101)
Instead of isospin τ1 in Eq. (10.101) one can also use τ2 (but not τ3 ). Equation (10.101) replaces the now invalid Eq. (10.45) and ensures that det D ≥ 0. On repeating the derivation of the QCD inequalities using Eq. (10.101), we find that ¯ 5 τ1,2 ψ, i.e. a linear the lightest meson, or the condensate, must be in channels ψiγ − + ¯ combination of π ∼ uγ ¯ 5 d and π ∼ dγ5 u states. Indeed, as shown below, in both analytically tractable regimes of small and large µI the lightest mode is a ¯ 5 u. massless Goldstone mode that is a linear combination of uγ ¯ 5 d and dγ 10.4.2 Small isospin densities: the pion condensate When µI is small, chiral perturbation theory can be used to treat the problem. To have a rough sense of how small µI should be, we require that no particles other than pions are excited due to the chemical potential. This gives µI , m ρ as the upper limit of applicability of chiral perturbation theory. For zero quark mass and zero µI , the pions are the massless Goldstone bosons of spontaneously broken SU(2)L × SU(2)R chiral symmetry. In reality, quarks have small masses, which break this symmetry explicitly. Assuming that we
10.4 Nonzero isospin chemical potential
309
have equal quark masses, the symmetry of the Lagrangian is SU(2)L+R . The low-energy dynamics is governed by the familiar chiral Lagrangian, which is written in terms of the matrix pion field ∈ SU(2): L=
1 2 f 4 π
tr(∂µ
∂µ
†
− 2m 2π Re )
This Lagrangian contains only two phenomenological parameters: the pion decay constant, f π , and the pion mass in the vacuum, m π . We will see that interesting physics occurs at µI > m π , and, since m π m ρ , there is a nontrivial range of µI for which the chiral Lagrangian is reliable and useful. The isospin chemical potential further breaks SU(2)L+R down to U(1)L+R . Its effect can be incorporated into the effective Lagrangian to leading order in µI , without introducing additional phenomenological parameters. Indeed, µI enters the QCD Lagrangian in the same way as the the zeroth component of a gauge potential. Thus the finite-µI chiral Lagrangian is obtained by promoting the global SU(2)L × SU(2)R symmetry to a local gauge symmetry: gauge invariance completely fixes the way µI enters the chiral Lagrangian: f π2 m2 f 2 tr(∇ν ∇ν † ) − π π Re tr (10.102) 4 2 The covariant derivative is defined as µI ∇i = ∂i (10.103) ∇0 = ∂0 − (τ3 − τ3 ), 2 which follows from the transformation property of under rotations by the isospin generator I3 = τ3 /2. Using Eq. (10.102) it is straightforward to determine the vacuum alignment of as a function of µI and the spectrum of excitations around the vacuum. We will be interested in negative µI , which favors neutrons over protons, as in neutron stars. The results are very similar to two-color QCD at finite baryon density, Section 10.3. From Eq. (10.102), one finds the potential energy for , Leff =
f π2 µ2I f 2m2 tr(τ3 τ3 † − 1) − π π Re tr (10.104) 8 2 The first term in Eq. (10.104) favors directions of which anti-commute with τ3 , i.e. τ1 and τ2 , while the second term prefers the vacuum direction = 1. It turns out that the minima of Eq. (10.104) at all µI are captured by the following Ansatz: Veff ( ) =
= cos α + i(τ1 cos φ + τ2 sin φ) sin α
(10.105)
On substituting Eq. (10.105) into Eq. (10.104), one sees that the potential energy depends only on α, not on φ: Veff (α) =
f π2 µ2I [cos(2α) − 1] − f π2 m 2π cos α 4
(10.106)
310
Effective Lagrangians and Models
On minimizing Veff (α) with respect to α, one sees that the behavior of the system is different in two distinct regimes. (1) For |µI | < m π , the system is in the same ground state as it is at µI = 0: α = 0, or = 1. This result is easy to understand. The lowest-lying pion state costs a positive energy m π − |µI | to excite, thus at zero temperature no pion is excited. The ground state of the Hamiltonian at such µI coincides with the normal vacuum of QCD. The isospin density is zero in this case. (2) When |µI | exceeds m π the minimum of Eq. (10.106) occurs at cos α = m 2π /µ2I
(10.107)
In this regime the energy needed to excite a π− quantum, m π −|µI |, is negative, thus it is energetically favorable to excite a large number of these quanta. Since pions are bosons, the result is a Bose condensate of π− . If the pions did not interact, the density of the condensate would be infinite. However, the repulsion between pions stabilizes the system at a finite value of the isospin density. This value can be found by differentiating the ground-state energy with respect to µI : m4 ∂Leff nI = − (10.108) = f π2 µI sin2 α = f π2 µI 1 − 4π ∂µI µI For |µI | just above the condensation threshold, |µI | − m π m π , Eq. (10.108) reproduces the equation of state of the dilute nonrelativistic pion gas (cf. Eq. (8.57)), n I = 4 f π2 (µI − m π ) At larger µI , |µI | m π , the isospin density is linear in µI , n I = f π2 µI ,
|µI | m π
From Eq. (10.108) one can find the pressure and the energy density as functions of µI . The fact that the minimum of the potential (10.104) is degenerate with respect to the angle φ corresponds to the spontaneous breaking of the U(1)L+R symmetry generated by I3 in the Lagrangian (10.102). This is not unexpected since the ground state is, in essence, a pion superfluid, with one massless Goldstone mode. Since we start from a theory with three pions, there are, in addition to the massless mode, two massive modes in the superfluid phase. One can be identified with the π0 . The other is a linear combination of π+ and π− , which we denote as π˜ + , since it coincides with π+ at the condensation threshold. The
10.4 Nonzero isospin chemical potential
311
∞
m _____ mπ π+
π0
1
π−
0
1
| µI | _____ mπ
∞
Figure 10.8. A schematic plot of masses (rest energies) of lowest-lying excitations in QCD at finite (negative) µI , in the regime of applicability of chiral perturbation theory: m π , µI m ρ . mass (defined as the rest energy) of these modes can be obtained by expanding the Lagrangian (10.102) around the minimum. The result reads m π0 = |µI |, m π˜ + = |µI | 1 + 3(m π /µI )4 (10.109) At the condensation threshold, m π0 = m π and m π˜ + = 2m π , while for |µI | m π both masses approach |µI | (see Fig. 10.8). ¯ The values of the chiral condensate, uu ¯ + dd, and the pion condensate, uγ ¯ 5 d, follow from (10.107): ¯ = 2ψψ ¯ vac cos α uu ¯ + dd
and
¯ vac sin α uγ ¯ 5 d + h.c. = 2ψψ (10.110)
i.e. the chiral condensate “rotates” into the pion condensate as a function of |µI |. It is also possible to find baryon masses, i.e. the energy cost of introducing a single baryon into the system. The most interesting baryons are those with the lowest energy and highest isospin, i.e. the neutron n and − isobar. There are two effects of µI on the baryon masses. The first comes from the isospin of the baryons, which effectively reduces the neutron mass by 12 |µI |, and the − mass by 32 |µI |. If this were the only effect, the effective − mass would vanish at |µI | = 23 m . For larger µI , baryon or anti-baryon Fermi surfaces would form,
312
Effective Lagrangians and Models
which would lead to a nonzero baryon susceptibility χB ≡ ∂n B /∂µB . However, long before that another effect turns on: the π− ’s in the condensate tend to repel the baryons, lifting up their masses. These effects can be treated in the framework of baryon chiral perturbation theory [123]. For example, the (Euclidean) Lagrangian describing nucleons and their interactions with the pions at finite µI can be written as LN = N¯ γµ ∇µ N + m N ( N¯ L NR + h.c.)
(10.111)
) µI * ∇0 N = ∂0 − τ3 N , ∇i N = ∂i N 2 On diagonalizing this bilinear Lagrangian in the pion background given by = ¯ from Eq. (10.105), one finds the nucleon masses. The results for the neutron and the − isobar read where
mn = mN −
|µI | cos α, 2
m − = m −
3|µI | cos α 2
(10.112)
in the approximation of nonrelativistic baryons. Equation (10.112) can be interpreted as follows: as a result of the rotation (10.105) of the chiral condensate, the nucleon-mass eigenstate becomes a superposition of vacuum n and p states. The expectation value of the isospin in this state is proportional to cos α appearing in Eq. (10.112). With cos α given in Eq. (10.107), we see that the two effects mentioned cancel each other out when m π |µI | m ρ . Thus the baryon mass never drops to zero, and χB = 0 at zero temperature in the region of applicability of the chiral Lagrangian. As one forces more pions into the condensate, the pions are packed closer together and their interaction becomes stronger. When µI ∼ m ρ , chiral perturbation theory breaks down. To find the equation of state in this regime, full QCD has to be employed. As we have seen, this can be done using present-day lattice techniques since the fermion sign problem is not present at finite µI , similarly to two-color QCD. 10.4.3 Asymptotically high isospin densities: the quark–antiquark condensate In the opposite limit of very large isospin densities, or |µI | m ρ , the description in terms of quark degrees of freedom applies since the latter are weakly interacting due to asymptotic freedom. In our case of large negative µI , or n I , the ground state contains an equal number of d quarks and u¯ anti-quarks per unit volume. If one neglects the interaction, the quarks fill two Fermi spheres with equal radii |µI |/2. Turning on the interaction between the fermions leads to the instability with respect to the formation and condensation of Cooper pairs, which is similar to the BCS instability in metals and the diquark pairing at high
10.4 Nonzero isospin chemical potential
313
baryon density in Chapter 9. To leading order of perturbation theory, quarks interact via one-gluon exchange. It is easy to see that the attraction is strongest in the color-singlet channel, thus the Cooper pair consists of an u¯ and a d. The ground state, hence, is a fermionic superfluid. The perturbative one-gluon exchange, however, does not discriminate between the scalar, ud, ¯ channel, and the pseudo-scalar, uγ ¯ 5 d, channel: the attraction is the same in both cases. However, one expects that the instanton-induced interaction, however small, will favor the uγ ¯ 5 d channel over the ud ¯ one. The condensate is therefore a pseudo-scalar and breaks parity, uγ ¯ 5 d = 0
(10.113)
This is consistent with our earlier observation that QCD inequalities constrain the I = 1 condensate to be a pseudo-scalar at any µI . Note that the order parameter in Eq. (10.113) has the same quantum numbers as the pion condensate at lower densities. We shall discuss this coincidence later. As a consequence of Cooper pairing, the fermion spectrum acquires a gap at the Fermi surface, where = b|µI |g −5 e−c/g ,
c = 3π 2 /2,
(10.114) −c/g
where g should be evaluated at the scale |µI |. The peculiar e behavior comes from the long-range magnetic interaction, as in the superconducting gap at large √ µB in Section 9.2.3. The constant c is smaller by a factor of 2 compared with the latter case due to the stronger one-gluon attraction in the singlet q q¯ channel compared with the 3¯ diquark channel. Consequently, the gap (10.114) is exponentially larger than the diquark gap at comparable baryon chemical potentials. As in the BCS theory, the critical temperature at which the superfluid state is destroyed is of order . 10.4.4 Quark–hadron continuity and confinement Since the order parameter (10.113) has the same quantum numbers and breaks the same symmetry as the pion condensate in the low-density regime, it is plausible that there is no phase transition along the µI axis. In this case, as one increases the density, the Bose condensate of weakly interacting pions smoothly transforms into the superfluid state of u¯ d Cooper pairs. 5 Of course, this conjecture needs to be (and can be, due to the positivity) verified by lattice calculations. At first sight, this conjecture seems to contradict common wisdom, which states that there is a “deconfinement” phase transition from the hadron phase to 5 The situation is very similar to that of strongly coupled superconductors with a “pseudogap”
[124], and possibly to that of high-temperature superconductors [125]. This also parallels the continuity between nuclear and quark matter in three-flavor QCD as conjectured by Sch¨afer and Wilczek (see Section 9.3.4).
314
Effective Lagrangians and Models
the quark–gluon-plasma phase. It is logically possible that there exists a firstorder phase transition at an intermediate value of µI . However, there are several nontrivial arguments that make the hypothesis of continuity highly plausible. The first argument arises from considering baryons. One notices that all fermions have a gap at large |µI |, which means that all excitations carrying baryon number are massive. In particular, at zero temperature, the baryonnumber susceptibility χB vanishes. This is also true at small µI . It is thus natural to expect that all excitations with nonzero baryon number are massive at any value of µI , and χB remains zero at T = 0 for all µI . This also suggests one way to verify continuity on the lattice. Another argument comes from considering the limit of a large number of colors Nc . Recall that, in finite-temperature QCD, there is a mismatch, at large Nc , between the number of gluon degrees of freedom, which is O(Nc2 ), and that of hadrons, which is O(Nc0 ). This fact is a strong hint of a first-order confinement– deconfinement phase transition, at which the effective number of degrees of freedom jumps from O(Nc0 ) to O(Nc2 ). It is easy to see, however, that the Nc behavior of thermodynamic quantities is the same in the “hadronic” phase (low µI ) and the “quark” phase (large µI ). Indeed, at very large µI the isospin density n I is proportional to the number of quarks, which is O(Nc ): Nc µ3I nI = (10.115) 3 8π 2 In the small-µI region the isospin density is given by Eq. (10.108). In the largeNc limit, the pion decay constant scales as f π2 = O(Nc ), and thus the isospin density in the pion gas is also proportional to Nc .6 What happens is that the repulsion between pions becomes weaker as Nc increases, so more pions can be stacked at a given chemical potential. As a result, the Nc dependence of thermodynamic quantities is the same in the quark and the hadronic regimes, although for seemingly very different reasons. Now let us return to the question of confinement. Naively, one would think that, at asymptotically large µI , the u¯ and d quarks are packed at a very high density, and the system should become deconfined. At finite temperature, there is no rigorous way to distinguish between the confined and deconfined phases in QCD with quarks in the fundamental representation. However, at zero temperature (and finite µI ), a sharp distinction between the two phases can be made. In the confined phase, all particle excitations carry integer baryon number; the deconfined phase can be defined as the phase in which there exist finite-energy excitations carrying fractional baryon charge. The pion superfluid at small µI 6 With physical values of N , f and m , the values of n given by Eqs. (10.108) and (10.115), c π π I
naively continued into the regime of intermediate µI , cross at µI ≈ 800 MeV. This agrees with the value of µI ∼ m ρ at which one would expect the crossover between the quark and hadron regimes to occur. This is a quantitative indication that a phase transition is not necessary.
10.4 Nonzero isospin chemical potential
315
clearly is in the confined phase. The question to ask is the following: is quark matter at large µI confined or deconfined? It might seem that, at very large µI , there exist excitations with fractional baryon number. These are the fermionic quasi-particles near the Fermi surface, which are related to the original quarks and anti-quarks by a Bogoliubov–Valatin transformation. The opening of a BCS gap makes the energy of these excitations larger than , but still finite. To see that the logic above has a flaw and there are no such excitations, one needs to consider the dynamics of very soft gluons. The crucial observation is that, at large µI , gluons softer than are not screened, either by the Meissner effect or by the Debye effect. The Meissner effect is absent because the condensate does not break gauge symmetry (in contrast to the color-superconducting condensate in Chapter 9). Debye screening is also absent, because on scales lower than there are no charge excitations in the medium: the Cooper pairs are neutral, while the fermions are too heavy to be excited. Thus, the gluon sector below the scale is described by pure SU(3) gluodynamics, which is a confining theory. This means that there are no quark excitations above the ground state: all particles and holes must be confined in colorless objects, namely mesons and baryons, just like in QCD in an ordinary vacuum. If there is no transition along the µI axis, we expect confinement at all values of µI . At large µI , since the running strong coupling αs at the scale of is small, the confinement scale QCD is much less than . In more detail, let us imagine following the running of the strong coupling from the ultraviolet to the infrared. First, αs increases until the scale gµI is reached, whereupon it “freezes” due to Debye screening and Landau damping. The freezing continues until we reach the scale , after which the coupling runs again as in pure gluodynamics. Since the coupling is still small at the scale , it can become large only at some scale QCD much lower than . Thus, at large |µI | there are three different scales separated by large exponential factors, µI QCD . That the scale of confinement is much smaller than the gap at large µI has an important consequence for finite temperature. One can actually predict a temperature-driven deconfinement phase transition at a temperature Tc of the order of QCD . Indeed, at such low temperatures, quarks are unimportant, so the transition must be of first order as in pure gluodynamics. In particular, one expects the temperature dependence of the baryon-number susceptibility to change from e−3/T to e−/T around Tc due to deconfinement. The smallness of the confinement scale QCD compared with the BCS gap allows one to conclude that the binding energy of quarks and anti-quarks is small and the hadronic spectrum follows the pattern of the constituent-quark model, with playing the role of the constituent-quark mass. This means that mesons weigh 2 and baryons weigh 3, approximately. A good analog of the large-µI regime is vacuum QCD with only heavy quarks. As in the latter case, the string
316
Effective Lagrangians and Models
tension and string breaking are determined by parametrically different energy scales (QCD and , respectively). Hence the area law should work up to some distance much larger than −1 QCD , even when fundamental quarks are present. For the same reason one also expects the high-spin excited states of hadrons to be narrow at large µI . 10.5 Pion propagation near and below Tc Some nonperturbative features of QCD, such as the equation of state, can be determined from numerical Monte Carlo techniques. However, many important issues related to real-time behavior and the response of high-temperature strongly interacting matter cannot be systematically studied by such methods. This is because lattice techniques rely on the formulation of quantum-field theory in imaginary time. As a result, changes in the hadron spectrum of QCD caused by the environment created in a heavy-ion collision cannot be obtained in a reliable fashion by computer simulations. This is a pity since one needs the spectrum of QCD in these extreme environments in order to calculate signals for the experimental existence of the quark–gluon plasma, such as the dilepton spectrum (see Section 7.6). In general, an analytic continuation τ → −it is necessary in order to obtain real-time correlation functions from Euclidean correlators. Such a continuation from a numerically known Euclidean time function may be reliable for small τ , or large frequencies, ω T , but becomes problematic if the behavior for long real time, or low frequencies, is required. In simple terms, this is a consequence of the fact that, on the imaginary-frequency axis, the smallest nonzero Matsubara frequency is 2π T . There is no direct information about the correlation function for smaller frequencies (except exactly at ω = 0). New numerical methods to simulate the real-time behavior of QCD are sorely needed. Fortunately, many quantities characterizing the real-time behavior of finitetemperature systems can be related, by exact identities, to static (thermodynamic) functions. The most familiar case is the relation among the velocity of sound u, the pressure p, and the energy density : u = (∂ p/∂)1/2 . A less trivial example is that of spin waves in antiferromagnets: it has long been known [126] that, at long enough wavelengths and at any temperature below the phase transition, there exist low-frequency spin waves that have a linear dispersion curve, whose slope is given exactly in terms of static quantities. In this section, following [127], we point out that, in thermal QCD, the dispersion relation of soft pions can be determined entirely using static quantities. Such quantities can, in principle, be measured on the lattice. Using this observation, we shall see that the pion pole mass, which characterizes the propagation of the collective pion modes, must decrease as one approaches the critical temperature, despite the well-known fact that the pion screening mass increases in the same limit.
10.5 Pion propagation near and below Tc
317
10.5.1 Pion dispersion from static quantities (summary) From the point of view of symmetry properties, QCD at temperatures T below or just above the temperature of the chiral phase transition Tc is similar to a Heisenberg antiferromagnet [128, 129]. With two light quarks (u and d), QCD possesses an approximate chiral SU(2)V × SU(2)A , O(4) symmetry, which is broken spontaneously to SU(2)V , O(3) by the chiral condensate. This is similar to the O(3) → O(2) symmetry breaking in antiferromagnets. Moreover, the ¯ order parameter of QCD, the chiral condensate ψψ, is distinct from the conserved charges (the vector- and axial-isospin charges), which makes the realtime behavior of QCD similar to that of antiferromagnets (but not of ferromagnets, in which the order parameter – magnetization – is a conserved quantity.) By analogy with spin waves in antiferromagnets [126], one can show that, at any T below Tc , the real part of the dispersion relation of soft pions is given by ω2 = u 2 ( p2 + m 2 )
(10.116)
provided that the quark masses are small enough. We shall use the following terminology: u is the pion velocity (although it is the velocity only when m = 0), m is the pion screening mass, and the energy of a pion at p = 0, m p = um, is the pion pole mass. At zero temperature, u = 1, and the pole mass coincides with the screening mass. At nonzero temperature, Lorentz invariance is lost, and u generally differs from 1 [130, 131]. Such pion modes with their modified dispersion relation are termed “quasipions” in [130]. The parameters u and m can be determined by measuring only static (zero-frequency) Euclidean correlators. In particular, m can be extracted from the long-distance behavior of the correlation function of the operator ¯ 5 τ a ψ, π a ≡ iψγ 1 π a (x)π b (0) δ ab = (10.117) dτ dV e−iq · x ¯ 2 f 2 q2 + m 2 ψψ where x = (τ, x), ψ is the quark field, a, b = 1, 2, 3, τ a are isospin Pauli matrices, tr(τ a τ b ) = 2δ ab , and · · · denotes thermal averaging. The integration over the Euclidean time variable τ is taken in the interval (0, 1/T ). In Eq. (10.117) ¯ ψψ is the chiral condensate at zero quark masses. Equation (10.117) also provides the definition of the temperature-dependent pion decay constant f . Less trivial is the relation of u to static correlators. The pion velocity u is equal to the ratio of the above-defined pion decay constant f and the axialisospin susceptibility χI 5 : u 2 = f 2 /χI 5
(10.118)
This is a close analog of the equation c2 = ρs /χm [126] for the velocity of spin waves in antiferromagnets. The axial-isospin susceptibility χI 5 can be defined as
318
Effective Lagrangians and Models
the second derivative of the pressure with respect to the axial-isospin chemical potential (see Eq. (10.120) below), or, equivalently, via the static correlator of the axial isospin charge densities, τa ¯ 0 γ 5 ψ (10.119) δ ab χI 5 = dτ dV Aa0 (x)Ab0 (0) , Aa0 ≡ ψγ 2 The right-hand side of Eq. (10.119) is free of short-distance divergences in the limit of zero quark masses, when Aa0 are densities of conserved charges. The derivation of Eqs. (10.116)–(10.119) at nonzero temperature requires an analysis of the hydrodynamic theory as developed in [132]. Here we shall use an intuitively simpler (but less rigorous) derivation based on the effective Lagrangian approach. This approach does not allow a correct treatment of dissipative effects, but will be sufficient for our purpose. 10.5.2 Derivation Our strategy is to first write down the most general form of the effective Lagrangian of pions, and then relate its free parameters to the correlation functions of QCD by matching the partition function Z = eP V /T and its derivatives in the effective and microscopic theories. The quark part of the QCD Lagrangian at finite axial-isospin chemical potential µI 5 is given by ¯ µ Dµ ψ − (ψ¯ L MψR + h.c.) + µ A30 Lquark = iψγ I5
(10.120)
where M = diag(m u , m d ) is the quark-mass matrix. The chemical potential µI 5 is coupled to the axial-isospin charge A30 defined in Eq. (10.119). For simplicity, we set m u = m d = m q . We shall assume that, in the infrared, the pion thermal width is negligible compared with its energy. This has been seen in explicit calculations at low T [133] and is related to the fact that pions are Goldstone bosons. The dynamics of the pions is described, in this case, by an effective Lagrangian Leff , which we assume to be local, allowing its expansion in powers of momenta. This is equivalent to the assumption that the correlation functions have only pole singularities, as in hydrodynamics. To lowest order, the Lagrangian is fixed by symmetries up to three coefficients, f t , f s , and f m , f t2 f2 f2 tr(∇0 ∇0 † ) − s tr(∂i ∂i † ) + m Re tr(M ) (10.121) 4 4 2 where is an SU(2) matrix whose phases describe the pions. Owing to the lack of Lorentz invariance, f t2 and f s2 are independent parameters. The chemical potential µI 5 enters the lowest-order effective Lagrangian (10.121) through the covariant derivative ∇0 , which is completely determined by symmetries. This can be seen by promoting the SU(2)A symmetry in Leff =
10.5 Pion propagation near and below Tc
319
Eq. (10.120) to a local symmetry and treating µI 5 as the time component of the SU(2)A vector potential, as we discussed earlier in this chapter. The covariant derivative ∇0 necessarily has the form i − µI 5 (τ3 + τ3 ) (10.122) 2 The structure of the Lagrangian (10.121) is analogous to that of the effective Lagrangian at finite (vector) isospin chemical potential µI in Section 10.4. A significant difference between the two cases is that the QCD vacuum breaks the SU(2)A (axial-isospin) symmetry spontaneously. It is important to note, however, that SU(2)A is a symmetry of the Lagrangian (at m q = 0), as good as the SU(2)V symmetry. The conservation of the axial-isospin current Aaµ in the chiral limit puts the consideration of finite µI 5 on solid theoretical ground. The pion dispersion relation following from Eq. (10.121) is given by Eq. (10.116) with ∇0
u2 =
≡ ∂0
f s2 f t2
m2 =
and
m q f m2 f s2
(10.123)
By matching the second derivative of the pressure P with respect to µI 5 in QCD and in the effective theory, we find the relation between f t and χI 5 : χI 5 =
∂ 2P = f t2 ∂µ2I5
(10.124)
Together with the first of Eqs. (10.123) and f = f s (see below), this implies Eq. (10.118). The first derivative with respect to m q gives ¯ −ψψ =
∂P = f m2 ∂m q
(10.125)
By combining this with the second of Eqs. (10.123), we derive the generalization of the famous Gell-Mann–Oakes–Renner (GOR) relation to finite temperature: ¯ f s2 m 2 = −m q ψψ
(10.126)
Finally, we need to show that f = f s . We achieve this by treating M as an extera a nal field, which we parametrize as M(x) = m q eiα (x)τ . By matching derivatives of ln Z, we find π a (x)π b (0) = ¯ 5 τ a ψ, π a ≡ iψγ
δ2 ln Z
m 2q δα a (x) δα b (0)
= f m4 φ a (x)φ b (0)
φ a (x) ≡ Re tr[iτ a (x)/2]
(10.127)
Note that π a is defined in the microscopic theory (QCD), while φ a is a field of the effective theory.
320
Effective Lagrangians and Models
The correlation function of φ a (x) can be calculated by expanding the effective Lagrangian in Eq. (10.121) to second order in φ a . We expect the result to match the correlator of π a only for small momenta, which means that we have to limit ourselves to zero Matsubara frequency and small spatial momenta, e.g. smaller ¯ than the screening mass m σ of the order parameter σ = ψψ. For the static a correlator of π , by integrating Eq. (10.127) over τ and using Eq. (10.125); we obtain π a (x)π b (0) dτ = dτ φ a (x)φ b (0) 2 ¯ ψψ 1 d3 q eiq ·x δ ab 1 e−m|x | ab δ = 2 = (10.128) f s (2π )3 q2 + m 2 f s2 4π | x| We see that, by measuring the large-distance (| x | m −1 σ ) static correlation a 5 a ¯ τ ψ, we can extract two parameters of the function of the operator π = iψγ effective Lagrangian (10.121): the screening mass m and f s , which coincides with the decay constant f defined by Eq. (10.117). The third parameter, f t2 , coincides with the susceptibility χI 5 , which can also be expressed in terms of the static correlation function in Eq. (10.119). From Eq. (10.118) we completely determine the dispersion relation of soft pions.
10.5.3 Critical behavior For the above results to be valid, pions must be the lightest modes. In particular, this requires m m σ . If m q is very small, this condition is satisfied everywhere below Tc , except for a region very close to Tc . As T → Tc from below and m σ → 0, one can ask the following question: what is the critical behavior of the parameters u and m when T remains sufficiently far from Tc that the hierarchy m m σ T is maintained? Since u and m can be related to static correlation functions, one should expect their critical behavior to be governed by the same static critical exponents as those known from the theory of critical phenomena. We begin by considering the critical scaling of the decay constant f = f s . It is defined via the behavior of a static correlator (10.128) at distances larger than m −1 σ . In the range of momenta m | q | m σ we have (see Eq. (10.117))
dτ dV e−iq · x π a (x)π b (0) = δ ab
¯ 2 1 ψψ f 2 q2
(10.129)
On the other hand, at distances short compared with the correlation length, i.e. q | T , the correlator of the order parameter for momenta such that m σ |
10.5 Pion propagation near and below Tc ¯ has the following scaling behavior: ψψ ¯ ¯ dτ dV e−iq · x ψψ(x) ψψ(0) ∼
1 | q |2−η
321
(10.130)
¯ and π a = We also know that, in this regime, the correlators of σ = ψψ 5 a ¯ τ ψ are degenerate, since they are related by the SU(2)A symmetry, which iψγ is restored at Tc . Thus the correlator (10.130) must match with the correlator (10.129) at the scale | q | ∼ m σ . This requires ¯ 2 f 2 = Am −η σ ψψ
(10.131)
The coefficient A cannot be found from scaling arguments, but is finite and regular at Tc . The exponent η is in the universality class of the O(4) sigma model in d = 3 dimensions (to which two-flavor QCD at Tc belongs, as discussed in Section 7.4) and is known: η ≈ 0.03. At T → Tc the scaling laws for the inverse correlation length m σ and the ¯ order parameter ψψ are also known from universality, mσ ∼ t ν ¯ ψψ ∼ tβ
(10.132) (10.133)
where t = (Tc − T )/Tc . Thus we find f 2 ∼ t 2β−νη = t (d−2)ν
(10.134)
where in the last equation the relation 2β = ν(d − 2 + η)
(10.135)
is used (recall that all scaling exponents can be expressed in terms of two independent ones, e.g. η and ν, assuming hyperscaling). From this point on, we set d = 3, so f ∼ t ν/2 . This scaling law is the same as Josephson scaling for the superfluid density in helium [134]. Contrary to naive expectations, f scales ¯ differently from the order parameter ψψ ∼ t β . The difference, however, is numerically small, due to the smallness of η. In the O(4) universality class in d = 3, ν ≈ 0.73 and β ≈ 0.38 [135]. Next, we point out that χI 5 is finite at T = Tc , where it is degenerate with the vector-isospin susceptibility. The singular behavior of χI 5 is dominated by the mixing of Aa0 with operators linear or quadratic in σ or π a in the dimensionally reduced theory describing infrared modes | q | T . Such mixing, however, is forbidden by the O(4) chiral symmetry, as well as by charge conjugation. This is consistent with the lattice result that the vector-isospin susceptibility is finite at Tc [136]. Since f t2 = χI 5 , finiteness of χI 5 invalidates the common assumption that f t → 0 at Tc . Note that, above Tc , there are no propagating soft pion
322
Effective Lagrangians and Models
modes, so the parameters of the effective Lagrangian (such as f t ), and hence Eq. (10.124), lose their meaning, even as χI 5 remains well-defined and finite. This is not surprising if one recalls that, as T → Tc from below, the domain of validity of the Lagrangian (10.121) (| q | m σ ) shrinks away and disappears at Tc . Now we are ready to find the scaling of u. Using Eq. (10.118), the scaling of f in Eq. (10.134), and the fact that χI 5 is finite at Tc , we find u2 ∼ f 2 ∼ t ν
(10.136)
This means that the velocity of pions vanishes at Tc . The scaling of the screening mass m can be found from the GOR relation (10.126) (recall that m q is assumed to be small): m2 = −
¯ m q ψψ ∼ m q t β−ν f2
(10.137)
In the O(4) universality class β < ν, which implies that the static screening pion mass grows (at fixed m q = 0) as T → Tc . This fact agrees with lattice simulations of QCD. The pole mass of the pion, m p , scales differently: m 2p ≡ u 2 m 2 = −
¯ m q ψψ ∼ mqt β χI 5
(10.138)
This means that the pole mass of the pion drops as T → Tc . For the formulas (10.136)–(10.138) to be valid it is necessary that t 1. However, for any m q = 0, these formulas break down when t is so small that the condition m m σ is violated. Using Eqs. (10.132) and (10.137), we see 1/(βδ) 1/(βδ) or smaller. In the regime t m q the that this happens when t ∼ m q “distance” from the critical point (T = Tc , m q = 0) is controlled by m q , but not by t. The scaling with m q of all quantities can be obtained starting from ¯ ψψ ∼ m 1/δ q
at
t =0
(10.139) 1/(βδ)
On comparing this with Eq. (10.133), we see that t and m q have the same scaling dimension. Using the scaling hypothesis we can easily obtain the scaling 1/(βδ) . For example, with m q by replacing t by m q , m 2 ∼ m 1−(ν−β)/(βδ) q
m 2p ∼ m 1+1/δ q
(10.140)
(m p now has the meaning of the typical frequency of the pion mode with zero momentum. This mode may be overdamped in this regime.) Both masses vanish as m q → 0 at T = Tc ; however, for the screening mass m 2 m q , whereas for the pole mass m 2p m q . In particular, near the phase transition m p m.
10.5 Pion propagation near and below Tc
323
The decrease of the pion pole mass may have interesting consequences for heavy-ion collisions. It is the pole mass of a hadron, rather than its static screening mass, that affects the observed spectrum. Within statistical models for hadron production, the drop in the pion pole mass would lead to an overpopulation of pions at low momenta, provided that the chemical freezeout temperature Tch , at which the hadron abundances are fixed, is close to Tc . For a crude estimate of this effect we use QCD ∼ 200 MeV as the typical QCD scale in Eq. (10.140), and Tch ∼ 170 MeV as an estimate for the freezeout temperature [137], which is indeed very close to Tc . The shift of the pole mass near Tc is approximately m ≡ m p − m π ≈ m π [(m q /QCD )1/(2δ) − 1] ≈ −0.3m π , where δ ≈ 5, and the pion multiplicity at small momenta is enhanced by roughly exp(−m/Tch ) ≈ 1.3. This is a noticeable effect, although it is smaller than the known contribution to pion overpopulation due to the feed-down from the decays of resonances [137]. This enhancement is comparable to the effect of the pion chemical potential µπ ∼ 50 MeV induced by pion kinetics after the chemical freezeout [138]. Another potential consequence of the fact that the pion velocity decreases ˇ at Tc is the possibility of Cerenkov radiation of pions by a hard probe moving through the hot medium created in a heavy-ion collision.
11 Lattice-gauge theory at nonzero chemical potential
11.1 Propagators and formulating the chemical potential on a Euclidean lattice Now that we know how lattice-gauge theory is formulated and know a few tricks of the trade, we need to formulate it in the presence of a chemical potential. To gain an appreciation of the idea and to invent an elegant formalism, let’s review some simple problems involving µ [139]. Consider a nonrelativistic field that describes particles that can propagate only forward in time. The finite-T partition function reads β ∗ ∗ 2 ∗ 3 Z = d[φ] d[φ ] exp − [φ (− ∂ − µ)φ + φ ∂τ φ] dτ d x (11.1) 0
Here φ could represent either a boson field or a fermion field. Periodic boundary conditions in the temporal direction apply in the Bose case and anti-periodic in the Fermi case, in which φ would be a Grassmann variable. Letting a − sign apply to the Bose case and a + sign to the Fermi case, we can evaluate Z , Z = [det(∂ 2 − µ + ∂τ )]∓1
(11.2)
We can evaluate the Bose or the Fermi determinant using the thermal Green function, G β , which is the familiar Green function defined on all of space but a temporal strip ranging from 0 to β. Differentiate the partition function Z with respect to the chemical potential, dZ = ± d3 x dτ G β (x, x, 0) (11.3) dµ where G β (x, y, τ ) = (−∂ 2 − µ + ∂τ )−1 x,y,τ 324
(11.4)
11.1 Propagators and the chemical potential
325
It is particularly illuminating to express the thermal Green function, G β , in terms of the ordinary Green function, G, which lives in unbounded space and time R 3 × R. G satisfies the “heat” equation ∂G ∂τ G =0 if 4 G(x, y, 0) = δ (x − y)
(∂ 2 + µ)G(x, y, τ ) =
τ <0 (11.5)
We can use the method of images to implement the boundary conditions and find the series of images for the thermal Green function, ± d3 x dτ G β (x, x, 0) = ±tr(−∂ 2 − µ + ∂τ )−1 = ±β d3 x [G(x, x, 0) ± G(x, x, β) + G(x, x, 2β) ± G(x, x, 3β) · · ·]
(11.6)
This equation has the usual interpretation associated with the method of images: the “heat” arriving at τ = 0 consists of that put in at τ = 0 (G(x, x, 0)), plus the heat that has traveled once around the cylinder of circumference β (G(x, x, β)), etc. The ± signs in this equation record the fact that a fermion picks up a minus sign each time it crosses anti-periodic boundary conditions. We can make the µ dependence in this equation explicit using the following observations. First, since the combination −µ + ∂τ enters the Green function, we have G(x, y, τ )µ = eµτ G(x, y, τ )µ=0 In addition,
(11.7)
G(x, y, τ1 + τ2 ) =
d3 z G(x, z, τ1 )G(z, y, τ2 )
This allows us to simplify the equation above, 2 −1 ±tr(−∂ − µ + ∂τ ) = ±β d3 x [G(x, x, 0) ± eµβ G(x, x, β)] 2µβ +e d3 x G(x, x , β)G(x , x, β) . . .
(11.8)
(11.9)
where these Green functions now refer to µ = 0 and are solutions to the ordinary heat equation. The partition function can now be obtained by integrating over µ. For Bose fields, ln Z = ±βµG(x, x, 0) − tr ln(1 − eβµ G(β))
(11.10)
326
Lattice-gauge theory
For fermions, ln Z = ±βµG(x, x, 0) + tr ln(1 + eβµ G(β))
(11.11)
The first term in these equations arises because of a normal ordering ambiguity in the density and is usually set to zero. If we evaluate the traces in momentum space, we obtain more familiar answers. For Bose fields, d3 k 2 ln Z = − ln(1 − eβµ e−βk ) (11.12) (2π )3 and for Fermi fields,
ln Z = +
d3 k 2 ln(1 + eβµ e−βk ) 3 (2π )
(11.13)
The point we want to emphasize here is the appearance of the chemical potential in the exponential factors, recording each excursion that the field makes around the cylinder [0, β] in the temporal dimension. When we sum over all worldlines, the factor eβµ “encourages” the particle to move forward in “time” 2 and, if µ exceeds the rest mass so that eβµ e−βk > 1, or eβµ e−β E > 1 in more generality, the worldline can wrap around the cylinder an unlimited number of times. For Bose fields, this leads to a catastrophic divergence, but for Fermi fields, it simply fills the Fermi sea. This formalism illustrates how one can fill a Fermi sea with just one fermion. In other words, in a quenched lattice-gauge simulation one can fill a Fermi sea and generate an appreciable baryon number. Unfortunately, the quenched approximation, which is such a useful starting point for lattice-gauge investigations in other fields, suffers from a serious disease at nonzero chemical potential, as we have discussed in Section 10.2.3. 11.2 Naive fermions at finite density We want to discuss lattice-gauge theory of QCD at nonzero chemical potential. We will begin with naive fermions because they are easy to deal with and because the lessons learned here will carry over to staggered fermions. It is convenient to normalize the fermion fields so the fermion piece of the lattice QCD action reads ¯ Sf = ψ(x)ψ(x) x
¯ + K ψ(x)
4
γν [U (x, nˆ ν )ψ(x + nˆ ν ) − U † (x − nˆ ν , nˆ ν )ψ(x − nˆ ν )]
ν=1
(11.14)
11.2 Naive fermions at finite density
327
where the Euclidean γ matrices satisfy γν† = γν
[γκ , γν ]+ = 2δκν ,
(11.15)
In this expression it was convenient to define K in terms of the bare lattice mass m 0 , m0 =
1 2K
(11.16)
and to normalize lattice fields, ψ, in terms of their continuum relatives as 3 1 ψcontinuum ψ= (11.17) 2K The fermion propagator for this action can be written as a sum over worldlines between points x and y, ab || G αβ (x, y) = K γν U (11.18) paths
αβ
ab
This form makes it clear how to introduce the chemical potential into the lattice theory so that it has all the desired properties, allowing us to count fermions appropriately. We just replace the gauge-field factors U on timelike links in the + direction with eµU and replace the gauge-field factors U † on timelike links in the − direction with e−µU † [139, 140]. Since the circumference of the cylinder in the temporal direction is β, this prescription encourages forward propagation with a factor eβµ and inhibits backward propagation with a factor e−βµ . Notice that only those fermions which actually wind around the cylinder pick up the factors e±βµ . Other fermions that “double back” do not contribute any chemical-potential dependence. Now we can write the grand canonical partition function for the lattice theory, ¯ d[ψ] d[U ] exp(−Sf − Sgauge ) = tr(eβµN e−β H ) (11.19) Z = d[ψ] We can now differentiate the action with respect to the chemical potential to obtain the number operator N , ¯ J0 (x) = K [ψ(x)γ ˆ 0 )eµ ψ(x + nˆ 0 ) Q(τ ) = 0 U (x, n τ =constant
τ =constant
¯ + nˆ 0 )γ0 e−µU † (x, nˆ 0 )ψ(x)] − ψ(x
(11.20)
We see that J0 counts paths passing through the link x + nˆ 0 by assigning a +K for upward paths and −K for downward paths. So J0 (x) is literally the
328
Lattice-gauge theory
expectation value of the number of paths through the link x + nˆ 0 . The spatial components of the vector ¯ ¯ + nˆ a )γa U † (x, nˆ a )ψ(x)] Ja (x) = K [ψ(x)γ ˆ 0 )ψ(x + nˆ a ) − ψ(x a U (x, n (11.21) together with J0 form a conserved vector current that counts the flux of worldlines. We can obtain many useful, informative observables from various derivatives of the partition function. For example, J0 =
1 ∂ ln Z βV ∂µ
(11.22)
1 ∂ ln Z βV ∂m
(11.23)
and the chiral condensate is ¯ ψψ =
With these general relations, the Maxwell relation follows, ¯ ∂(ψψ) ∂ J0 = ∂m µ ∂µ m
(11.24)
It is also informative to observe that ' 1 ln Z = = Pf βV V
(11.25)
where Pf is the fermionic contribution to the pressure and ' is the grand canonical potential. By collecting everything, we can relate the chiral condensate to the mass derivative of the pressure, ¯ ψψ =−
∂ Pf ∂m
(11.26)
So, in principle, we could obtain the pressure by integrating the area under the chiral-condensate curve and obtain the equation of state. Care is needed here because one must subtract the chiral condensate at vanishing chemical potential in order to remove the vacuum contribution. It is interesting to illustrate these results for the free-fermion field. At zero T , ¯ one can compute ψψ and J0 and consider their µ dependences. In particular, +π 4 d p m ¯ (11.27) ψψ =
4 4 2 −π (2π ) m 2 + i=1 sin pi
11.2 Naive fermions at finite density and
J0 =
+π
−π
d4 p i sin p0 cos p0
4 4 (2π ) m 2 + i=1 sin pi 2
329
(11.28)
The µ dependence enters these momentum expansions through the substitution p0 → p0 + iµ. It is convenient to change integration variables to z = ei p0 , so after the substitution we have dz 1 ¯ f (z) (11.29) ψψ = 2π i |z|=e−µ z and the charge becomes 1 ¯ 0 ψ = ψγ 2π i where the function f (z) is +π f (z) = −π
m2 +
dz 1 2 1 2 z − 2 f (z) z 4 z
|z|=e−µ
d3 p 3 1 2 (2π )3 i=1 sin pi − 4 (z − 1/z) 1
3
(11.30)
(11.31)
Using the fact that the integrand in the expression for f (z) behaves as 4z 2 as z → 0, and behaves as 4/z 2 as z → ∞, we find the asymptotic behavior of the chiral condensate and the baryon charge density as µ → ±∞, ¯ ψψ →0 and ¯ 0 ψ → ψγ
+π
−π
(11.32)
d3 k = ±1 (2π )3
(11.33)
It is interesting to understand these simple results in more physical terms that will help us later to interpret lattice-gauge-theory simulation data. The reason ¯ why ψψ vanishes in these limits is that ¯ ψψ =−
∂('/V ) ∂m
(11.34)
where '/V is essentially the energy of the filled states, 1 1 ' =− ln Z = V Vβ V
(E(k) − µ)
(11.35)
filled states
Only the states near E = 0 can be affected by variations in m, so, if µ m, as many states rise in energy as fall when m increases, so the chiral condensate
330
Lattice-gauge theory
vanishes. Similarly, when µ is large and negative no states move when m is ¯ varied and ψψ = 0 again. This completes our introductory remarks about incorporating the chemical potential into lattice-gauge theory. Later we will consider simulations of the model and aspects of the fermion determinant at nonzero µ, and find some challenging subtleties. 11.3 The three-dimensional four-Fermi model at nonzero T and µ Before discussing QCD it will prove worthwhile to consider a simpler model in some detail. We choose the four-Fermi model in three dimensions for several reasons. First, it is renormalizable in a 1/N expansion and displays chiralsymmetry breaking. The theory is, of course, nonrenormalizable in a couplingconstant expansion, but that is a problem with perturbation theory, not with the physics, as emphasized by K. G. Wilson in his early work on the expansion [141]. The model’s continuum Lagrangian reads N g 2 ( j) ( j) 2 ( j) ( j) ¯ ¯ L= ψ ψ ψ /∂ ψ − (11.36) 2N j=1 and ψ ( j) is a four-component spinor even though we are working in three spacetime dimensions. In three dimensions one could also consider a model with two-component spinors, but we shall see that the four-component version studied here is more relevant to our ultimate interests in four dimensions. N is the number of fermion species which is the basis for the 1/N expansion and its soluble N → ∞ limit. The theory has a discrete Z 2 chiral symmetry, ψ → γ5 ψ, ¯ 5 , which is broken dynamically for sufficiently strong coupling ψ¯ → −ψγ 2 2 g > gc . The critical coupling gc2 serves as a fixed point of the renormalization group and its existence underlies the good features of the 1/N expansion about this critical coupling. The theory’s critical indices are not those of mean-field theory. To O(1/N ) they are given by [142] 8 , 3N π 2 8 γ = 1+ , Nπ2 ν = 1+
8 , Nπ2 16 η =1− 3N π 2 δ =2+
βmag = 1
(11.37) (11.38)
These indices satisfy the hyperscaling relations expected of a critical system with one dynamical mass scale. In addition, the theory has an interesting finiteT and finite-µ transition that can be studied analytically in the 1/N expansion [143] as well as through lattice simulations. Both approaches will be discussed here. In order to analyze this model by conventional methods, we need to introduce an auxiliary Bose field, σ , so that the Lagrangian can be written as a quadratic
11.3 The three-dimensional four-Fermi model
331
form in the Fermi field, L=
N j=1
N ψ¯ ( j) /∂ ψ ( j) + σ ψ¯ ( j) ψ ( j) − 2 σ 2 2g
(11.39)
Clearly, σ can serve as an order parameter for chiral-symmetry breaking. σ acts as the dynamical mass of the fermion and will prove particularly important in lattice simulations as well as in analytic analyses of the model. Begin with the large-N solution of the model. We wish to calculate the order parameter = σ as a function of the coupling g, chemical potential µ, and temperature T ≡ 1/β. Since the large-N limit suppresses fluctuations around the saddle-point solution of the path integral, this is equivalent to a mean-field treatment. However, the Fermi nature of the underlying degrees of freedom will produce the critical indices listed above, not the mean-field exponents so familiar from commuting classical spin models of statistical mechanics. At large N , the leading contribution to comes from the simple fermionloop tadpole. The gap equation determines self-consistently. It states that the tadpole graph generates the dynamical mass which determines the propagator in the loop graph that produced it, g2 tr Sf (µ, T, ) V where the fermion propagator in Euclidean space reads Sf−1 (k; µ, T, ) = iγ0 (k0 − iµ) + ikν γν + ¯ = −g 2 ψψ =
(11.40)
(11.41)
ν=1,2
where the chemical potential has already been introduced and the temperature appears implicitly through the fact that the allowed values of k0 are given by the Matsubara frequencies for fermions, enforcing anti-periodic boundary conditions in the temporal direction, k0 = (2n − 1)π T
(11.42)
where n is an integer. On collecting these features into the gap equation, we find ∞ 1 1 d2 p = 4T (11.43) 2 [(2n − 1)π T − iµ]2 + p 2 + 2 g2 (2π ) n=−∞ The sum over n can be done by first rewriting it using the Poisson summation formula, ∞ d2 p 1 = 4T g2 (2π )2 m=−∞ ∞ e2π imφ × dφ [(2φ − 1)π T − iµ + iE][(2φ − 1)π T − iµ − iE] −∞ (11.44)
332
Lattice-gauge theory
where E = p 2 + 2 . The integral of φ can be done easily if µ < because the poles in the integrand lie on opposite sides of the integration contour for all values of p, 1 = g2
∞
1 dE 1 1 − β(E−µ) − π e + 1 eβ(E+µ) + 1
(11.45)
If µ > , then the position of the poles, above or below the contour, depends on p and the integral must be redone, giving 1 = g2
1 dE 1 − β(E−µ) + π e +1 µ ∞ 1 dE − β(µ+E) π e +1
∞
µ
1 dE β(µ−E) π e +1 (11.46)
Finally, we can eliminate the coupling constant from this equation in favor of 0 , the order parameter at vanishing T and µ. Replace the right-hand side of this equation with the equation before it after setting T = µ = 0. This produces an implicit equation for in terms of the physical mass scale 0 , with no reference to any ultraviolet-dependent quantity. For µ < we have 0
and for µ >
−
= T ln 1 + e−β(
−µ)
+ ln 1 + e−β(
+µ)
(11.47)
we have 0
− µ = T ln 1 + e−β(µ− ) + ln 1 + e−β(µ+ )
(11.48)
Actually, these two solutions are identical for (µ, T ), and demonstrate that curves of (µ, T ) at constant T are symmetric under reflection across the line = µ. The last two equations produce the complete solution for (µ, T ) in terms of approaches zero smoothly from above, except precisely at µ = 0, 0 . Since we have a line of second-order phase transitions throughout the T –µ plane. To obtain the critical line we set = 0 in the last equation and find 1−
µ
=2
T
0
ln(1 + e−βµ )
(11.49)
0
At vanishing chemical potential, we find a chiral-symmetry-restoring transition at the critical temperature, Tc =
0
2 ln 2
≈ 0.72
0
(11.50)
11.3 The three-dimensional four-Fermi model
333
For vanishing µ the gap equation in the broken phase becomes 0
−
= 2T ln(1 + e−β )
Finally, at vanishing temperature, we find that up to a critical value µc =
=
(11.51) 0
independently of µ
0
(11.52)
where the transition becomes of first order at this isolated point. For small excursions into the µ–T plane, the gap equation predicts : sinh(βµ) ∂ :: , −eβ(µ− ) = lim − (11.53) ∂µ :T →0 T →0 sinh(β ) So the slope of the surface (µ, T ) diverges in an essentially singular fashion as µ → µc and T → 0. This means that the large-N limit predicts a first-order transition for T = 0, which becomes of second order as soon as T > 0. The fact that the critical chemical potential at vanishing temperature is exactly the fermion dynamical mass indicates that interactions other than those that create the dynamical mass itself are relatively negligible at large N . Using similar calculational methods we can predict other features of this model. In particular, the fermion number density n in the broken phase, µ < , reads ∞ −βk T 1 + e−β( −µ) 2T 2 sinh(βkµ) ke n = ln − (−1) (11.54) −β( +µ) π 1+e π k=1 k2 whereas for µ >
,
µ2 − n = 2π ×
2
∞ 2T 2 T 1 + e−β(µ− ) − (−1)k + ln π 1 + e−β(µ+ ) π k=1
1 − e−βkµ cosh(βk ) k2
(11.55)
This expression reduces to the two-dimensional free relativistic Fermi-gas result in the symmetric phase, n =
∞ µ2 2T 2 1 − e−βkµ − (−1)k 2π π k=1 k2
(11.56)
This free Fermi behavior is another manifestation of the simplicity of the large-N limit and is not expected at finite N or in richer models.
334
Lattice-gauge theory
We see that the fermion density is strongly suppressed for low temperatures in the symmetric phase, µ < , jumps discontinuously as soon as µ exceeds , and then continues to rise quadratically in the phase with chirality restored. These results have been verified in lattice simulations. In fact, the theory’s critical behavior and the indices listed above have all been reproduced in computer simulations. With that in mind, let’s consider the lattice formulation of this model following our discussion of lattice QCD at nonzero T and µ given above. To place this four-Fermi model on the lattice we begin with the Bose form of the Lagrangian and use the staggered-lattice-fermion method, N /2 1 S = χ¯ i (x) M˜ x,y χi (y) + χ¯ i (x)χi (x) σ (x) ˜ 8 x x,y i=1 x,y ˜ +
N 2 σ (x) ˜ 4g 2 x˜
(11.57)
where χi and χ¯ i are Grassmann-valued staggered-fermion fields defined on the lattice sites x, the auxiliary scalar field σ is defined on the dual lattice sites x, ˜ and x, ˜ y denotes the set of eight dual lattice sites x˜ surrounding the site y. In this case the dual lattice is just a copy of the original lattice but translated by half a link in each lattice direction so that each dual site lies at the center of each cube of the original lattice. The auxiliary fields are placed on the dual lattice because the symmetry of the construction guarantees that the lattice action better approximates the continuum action for nonzero lattice spacing a [144]. The fermion lattice Dirac operator in the presence of a chemical potential reads 1 1 M˜ x,y = (eµ δ y,x+0ˆ − e−µ δ y,x−0ˆ ) + ην (x)(δ y,x+ˆν − δ y,x−ˆν ) 2 2 ν=1,2
(11.58)
where ην (x) are the staggered-fermion phases (−1)x0 + ··· +xν−1 . The lattice spacing a has been set to unity everywhere. One can simulate this lattice action by using the hybrid Monte Carlo method introduced earlier. The Grassmann fields representing the fermions are replaced by real Bose “pseudo-fermion” fields φ(x), governed by the action S=
N /2 1 N 2 σ (x) ˜ φi (x)(M T M)−1 x yi j φ j (y) + 4g 2 x˜ x,y i, j=1 2
(11.59)
where 1 Mx yi j = M˜ x y δi j + δx y δi j σ (x) ˜ 8 x,x ˜
(11.60)
11.4 Four-flavor SU(2) lattice-gauge theory
335
The sum over N extends only up to N /2 because of fermion doubling: for each pseudo-fermion field on the lattice, two will emerge in the continuum limit. The fact that the pseudo-fermions are real (not complex) fields puts the ratio here at 2 to 1 rather than 4 to 1, as one might have expected. Note that the hopping matrix M is strictly real. When the field φ is integrated over in the path integral one obtains the measure det(M T M) ≡ det M if the determinant of M is positive semi-definite. This condition is true as long as N /2 is an even integer, even if the chemical potential µ = 0. This simplicity relies on the fact that there are no weights on the links. For example in SU(3) lattice-gauge theory there are SU(3) matrices on each link, the fermion determinant will not be real, there will be no positive-semi-definite measure in the lattice action, and therefore no probabilistic interpretation of the Euclidean path integral. This will leave us without a reliable simulation method in lattice QCD at nonzero chemical potential, a point we will ponder in more detail in later sections. In the case of the three-dimensional four-Fermi model, we can calculate observables from simulations of the pseudo-fermion action without difficulty. The expectation value of the auxiliary field σ is immediately accessible, as are its correlation functions. The chiral condensate can be measured through ¯ −ψψ =
1 1 tr Sf = tr M −1 V V
(11.61)
the expectation of the energy density is 1 µ −1 1 ∂ ln Z 1 −µ −1 e Mx,x+0ˆ − e Mx,x−0ˆ = − = tr(∂0 γ0 Sf ) = Vs ∂β V 2V x (11.62) and the expectation of the number density is 1 µ −1 1 1 ∂ ln Z −1 e Mx,x+0ˆ + e−µ Mx,x− = tr(γ0 Sf ) = n = − 0ˆ Vs β ∂µ V 2V x (11.63) where Vs is the spatial volume, β is the inverse temperature, and V = Vs β, the overall volume of spacetime. 11.4 Four-flavor SU(2) lattice-gauge theory at nonzero µ and T SU(2) lattice-gauge theory is of considerable interest at nonzero chemical potential because it has a transition to a superfluid state complete with a diquark condensate predicted by effective-Lagrangian methods [145]. The equivalence
336
Lattice-gauge theory
of quarks and anti-quarks in the two-color model makes mesons and baryons equivalent physically and leads to a critical chemical potential of µc = m π /2. Diquarks are color singlets in this model and, when they condense at µc , a superfluid phase, rather than a superconducting phase, is formed. Many of the predictions of the effective-Lagrangian analysis have been confirmed by lattice simulations and are worth understanding [120]. We will display and discuss some simulation results here, although much more accurate and precise simulations are constantly updating these findings. We will present the lattice simulations of this model as a prototype of such studies in QCD-like models in extreme environments. However, first we will review some facts about the model’s fermion determinant and compare it with SU(3) lattice-gauge theory with a baryon-number chemical potential. Consider the Dirac operator in QCD at vanishing chemical potential, D = γ (∂ + A) + m q . In Euclidean space where the γ matrices are Hermitian and the matrix A is anti-Hermitian, the Dirac operator satisfies the identity γ5 Dγ5 = D †
(11.64)
which guarantees that its determinant is positive. This fact allows us to interpret the Euclidean path integral of QCD as a probabilistic weight and apply algorithms such as the hybrid molecular-dynamics algorithm and the hybrid Monte Carlo algorithm to its simulation. When the chemical potential µ is turned on, the fermion determinant remains positive for SU(2) color, but not, alas, for the physical case of SU(3) color. In the special case of the SU(2) color group, we can use the pseudo-reality of its representations to find a useful generalization of Eq. (11.64). In particular, recall the charge-conjugation matrix, C = iγ0 γ2 (C 2 = 1, Cγµ C = −γµ∗ ) and the second Pauli-matrix generator, T2 (T2 Ta T2 = −Ta∗ ), of SU(2) color. Then defining the product K = γ5 C T2 , and considering the SU(2) Dirac operator at nonzero chemical potential, D(µ) = γ (∂ + A) + µγ0 + m q , we can easily show that K D(µ)K = D ∗ (µ)
(11.65)
which allows us to show that the fermion determinant is again positive for any µ. It is easy to check that the fermion determinant for SU(3) color is not real when µB = 0. An example of an explicit demonstration of this will be given for heavy quarks below. The pseudo-real feature of SU(2) color was essential in establishing the reality of its determinant when µ = 0. We shall see that there is considerable interesting physics at nonzero chemical potential in the SU(2) color model. Some of it may be relevant to the real world. However, there are gross qualitative shortcomings in the model that must be appreciated. First, SU(2) color does not have baryons. Because of its pseudoreality, its two-quark “baryons” are degenerate with its quark–anti-quark mesons
11.4 Four-flavor SU(2) lattice-gauge theory
337
and there are no spin- 12 quark-model states in the theory. Second, the critical chemical potential per quark must be half the mass of the lightest “baryon,” which must be degenerate with the lightest meson, the pion. So, µc = m π /2 and the threshold for restoration of chiral symmetry and diquark condensation vanishes in the chiral limit where the mass of the quark goes to zero and the pion becomes massless. This peculiarity is the reason why chiral perturbation theory has precise predictions for the phase transition here. The chiral-perturbative approach and effective Lagrangians were discussed above in Chapter 10. Of course, in SU(3) QCD, the lightest baryon, the nucleon multiplet, remains massive in the chiral limit, so chiral perturbation theory does not describe its transition at µB = MN /3. In addition, the diquark-condensate phases in the two models are qualitatively different. In the SU(2) theory, the diquarks are colorless, scalar mesons and their condensation is expected to lead to a superfluid state, not so different from a Bose–Einstein condensate. By contrast, the diquarks in the SU(3) QCD theory are color 3¯ states and their condensation breaks the global color symmetry dynamically, not unlike the Higgs mechanism. A condensate of such diquarks should have color-superconducting properties that should be qualitatively different from the properties expected of SU(2) both in their spectroscopies and in their bulk thermodynamics. Color superconductivity was discussed above in Chapter 9. At this time its properties have been “established” by symmetry considerations and weak-coupling estimates. It has not been possible to study this phase in terms of lattice-gauge theory because the theory has no reliable algorithms for phases in which the fermion determinant is complex. It is hoped that this shortcoming will be conquered in the near future. Now we return to the SU(2) model at nonzero temperature and chemical potential and we consider the system’s phase diagram. For fixed quark mass, which insures that the pion has a nonzero mass and chiral symmetry is explicitly broken, simulations give a line of transitions separating a phase with no diquark condensation from one with a diquark condensate. Along this line there is a tricritical point where the transition switches from being of second order (and well described by mean-field theory) at relatively low µ to a first-order transition at an intermediate value of µ. At large µ the line of first-order transitions separates the diquark-condensate phase at low T from a quark–gluon phase at high T . The transition is clear both in diquark observables and in the Wilson line. A schematic phase diagram is shown in Fig. 11.1. (Note that we have not included the line of first-order transitions starting from the µ = 0, finite-temperature transition, which exists for m small enough, and the line of crossovers that should exist for larger m. We are just concentrating on the transition to the diquark condensate instead.) The existence of a tricritical point has a natural explanation in the context of chiral Lagrangians. In fact, since µ plays the role of a second “temperature” in this theory, in that it is a parameter that controls diquark
338
Lattice-gauge theory
Temperature
〈qq〉 = 0
X 〈qq〉 ≠ 0
Chemical Potential
Figure 11.1. A schematic phase diagram in the T –µ plane for the two-color model. The thin (thick) line consists of second (first)-order transitions. X labels the tricritical point. condensation, but does not explicitly break the (quark-number) symmetry, tricritical behaviour is expected. We begin with a discussion of the simulation method and the numerical results. The lattice action of the staggered-fermion version of this theory is 1 T T Sf = χ[D ¯ / (µ) + m]χ + λ[χ τ2 χ + χ¯ τ2 χ¯ ] (11.66) 2 sites where the chemical potential µ is introduced by multiplying links in the +t direction by eµ and those in the −t direction by e−µ [139]. The diquark source term (Majorana-mass term) is added in order to allow observation of the spontaneous breakdown of quark number on a finite lattice. The parameter λ and the usual mass term m control the amount of explicit symmetry breaking in the lattice action. Integrating out the fermion fields in Eq. (11.66) gives λτ2 A pfaffian = det(A† A + λ2 ) (11.67) −AT λτ2 where A≡D / (µ) + m
(11.68)
11.4 Four-flavor SU(2) lattice-gauge theory
339
0.4 0.35
Diquark Condensate
0.3 0.25 0.2 0.15 0.1 0.05 0 0.24
0.26
0.28
0.3 0.32 0.34 Chemical Potential
0.36
0.38
0.4
Figure 11.2. The diquark condensate versus µ, on a 164 lattice with β = 1.85, m = 0.05, and λ = 0.00 (spline fit).
Note that the Pfaffian is strictly positive, so the hybrid molecular-dynamics [38] method can be used to simulate this theory using “noisy” fermions to take the square root, giving Nf = 4. For λ = 0, m = 0, we expect no spontaneous symmetry breaking for small µ. For µ large enough (µ > m π /2 ) we expect spontaneous breakdown of quark number and one Goldstone boson – a scalar diquark. Now consider the simulation results for the Nf = 4 theory measuring the chiral and diquark condensates (χ T τ2 χ), the fermion number density, the Wilson/Polyakov line, etc. First consider measurements on a 164 , “zero-temperature” lattice. The SU(2) model was simulated at β = 1.85, within the theory’s scaling window. The quark mass was m = 0.05 and simulations were done at λ = 0.0025, 0.005, and 0.01 so that the results could be extrapolated to vanishing diquark source, λ = 0. For second-order phase transitions, only in the limit of vanishing λ do we expect real diquark phase transitions. In Fig. 11.2 the diquark condensate, extrapolated to λ = 0, is plotted against the chemical potential µ. There is good evidence for a quark-number-violating second-order phase transition in Fig. 11.2. The dashed line is a power-law fit, confidence level 24%, which picks out the critical chemical potential of µc = 0.2749(2) and a critical index of βmag = 0.54(5), which is compatible with the mean-field result
340
Lattice-gauge theory 0.1
Fermion Number Density
0.08
0.06
0.04
0.02
0 0.24
0.26
0.28
0.3 0.32 0.34 Chemical Potential
0.36
0.38
0.4
Figure 11.3. The fermion number density versus µ, on a 164 lattice, with β = 1.85, m = 0.05, and λ = 0.0025.
βmag = 12 predicted by loop-improved chiral perturbation theory [121], [146]. Note that the quark mass is fixed at m = 0.05 throughout this simulation, so chiral symmetry is explicitly broken and this transition is due to quark-number breaking alone. The fermion number density, shown in Fig. 11.3, also shows the diquark continuous phase transition. The approximate linear dependence of the fermion number density with µ is the expected scaling behavior above the transition in lowest-order chiral perturbation theory [121], [146]. Now consider the phase diagram in the temperature–chemical-potential plane. To begin, we review some small (83 × 4) lattice results. Consider a slice through the phase diagram at a fixed small temperature, for variable µ. In Fig. 11.4 we show the diquark condensate for β = 1.3, m = 0.05, with the data extrapolated to λ = 0 as discussed above. As µ increases we find that a second-order phase transition to a diquark condensate appears at µc = 0.2919(4). The dashed-line fit in Fig. 11.4 has the critical index βmag = 0.50(15), in good agreement with mean-field theory. The fit has a confidence level of 21%. The diquark transition is also clear in a plot of the fermion number density, as expected. However, the chiral condensate varies smoothly over the diquark critical region, with χ¯ χ ranging from 0.80 to 0.50. The Wilson line is also smooth, varying from 0.10 to 0.20, with no obvious sign of a transition. Simulations at β = 1.1 and 1.5 gave similar conclusions. At β = 1.1, there was a continuous diquark transition at µ = 0.2544(3) and at β = 1.5 there was
11.4 Four-flavor SU(2) lattice-gauge theory
341
0.7
Diquark Condensate
0.6 0.5 0.4 0.3 0.2 0.1 0
0.1
0.15
0.2
0.25 0.3 Chemical Potential
0.35
0.4
Figure 11.4. The diquark condensate versus µ, on an 83 × 4 lattice, with β = 1.30. a continuous diquark transition at µ = 0.2950(3). So, as the system is heated (β increases), a larger chemical potential is needed to order the system into a diquark condensate. Consider simulation results at fixed µ and variable β. The diquark condensate was measured as a function of β for µ = 0.40 in Fig. 11.5. There is a clear jump in the condensate at β = 1.55(5), suggesting a first-order phase transition. A discontinuous transition is also strongly suggested by the behavior of the Wilson line shown in Fig. 11.6. If this transition at relatively large µ and T is really of first order, then there should be a tricritical point separating the region of continuous transitions at smaller µ and T . The evidence for a first-order transition at µ = 0.40 was even stronger on a larger (123 × 6) lattice. In Figs. 11.7, 11.8, and 11.9 we show the diquark condensate, the Wilson line, and the chiral condensate on the 123 × 6 lattice. The computer-time evolution of the diquark condensate at µ = 0.40 reveals a two-state signal in Fig. 11.10, giving excellent evidence for a first-order transition. β = 1.97(2) lies in the scaling window of the SU(2) lattice-gauge theory with four species of dynamical quarks, so we can compare our µc = 0.40 with the theory’s spectroscopic mass scales [147]. In fact, m π /2 = 0.31(1), so finite-T effects have apparently raised µc somewhat from its zero-T value, m π /2. Such effects were discussed in Chapter 10 in the context of effective Lagrangians.
342
Lattice-gauge theory
0.7 0.6
Diquark Condensate
0.5 0.4 0.3 0.2 0.1 0 0. 1
1
1.2
1.4
1.6 β
1.8
2
2.2
Figure 11.5. The diquark condensate versus β for µ = 0.40, on an 83 ×4 lattice, with m = 0.05, and λ = 0.000.
0.8 0.7
Wilson Line
0.6 0.5 0.4 0.3 0.2 0.1
1
1.2
1.4
1.6 β
1.8
2
2.2
Figure 11.6. The Wilson line versus β for µ = 0.40, on an 83 × 4 lattice, with m = 0.05, and λ = 0.000.
11.4 Four-flavor SU(2) lattice-gauge theory
343
1 0.9
Diquark Condensate
0.8 0.7 0.6 0.5 0.4 0.3 0.2 0.1 0
1
1.2
1.4
1.6 β
1.8
2
2.2
Figure 11.7. The diquark condensate versus β, on a 123 × 6 lattice, with µ = 0.40, m = 0.05, and λ = 0.005.
0.4 0.35
Wilson Line
0.3 0.25 0.2 0.15 0.1 0.05 0
1
1.2
1.4
1.6 β
1.8
2
2.2
Figure 11.8. The Wilson line versus β, on a 123 × 6 lattice, with µ = 0.40, m = 0.05, and λ = 0.005.
344
Lattice-gauge theory
0.4 0.35
Chiral Condensate
0.3 0.25 0.2 0.15 0.1 0.05 0
1
1.2
1.4
1.6 β
1.8
2
2.2
Figure 11.9. The chiral condensate versus β, on an 123 × 6 lattice, with µ = 0.40, m = 0.05, and λ = 0.005. 0.7
Diquark Condensate
0.6 0.5 0.4 0.3 0.2 0.1 0
0
500
1000
1500 Trajectories
2000
2500
3000
Figure 11.10. The diquark condensate versus computer time, for β = 1.97, µ = 0.40, m = 0.05, and λ = 0.005.
11.5 High-density QCD and static quarks
345
The exact location of the tricritical point could be found in larger-scale simulations. On the small 83 × 4 lattices there is some evidence that it lies between µ = 0.30 and 0.40 since results of simulations at µ = 0.30 were compatible with mean-field theory. 11.5 High-density QCD and static quarks We will discuss algorithms that may develop into systematic simulation methods of SU(3) lattice-gauge theory at nonzero baryon chemical potential below, but, since that is a difficult and longstanding barrier in the field, we begin with a less-ambitious but instructive topic – simulating heavy-quark SU(3) QCD at nonzero baryon chemical potential. The idea is that, if we take the limit of heavy quarks and large chemical potentials, the complex fermion determinant becomes tractable but nontrivial [148]. Of course, this limiting case is far from the interesting physical situation of light (or massless) quarks, spontaneous chiral-symmetry breaking, and chemical potentials of a few hundred MeV, so the lessons learned here might not be germane. First consider how the lattice Dirac operator simplifies in this limit, M(x, y) = 2am q δx,y + [Uν (x)ην (x)δx+ˆν ,y − Uν† (y)ην (y)δx−ˆν ,y ] ν=1,2,3
†
+ [eµa Ut (x)ηt (x)δx+tˆ,y − e−µa Ut (y)ηt (y)δx−tˆ,y ]
(11.69)
To evaluate the determinant of the hopping matrix, organize it according to time slices, † eµa W0 0 · · · e−µa W Nt S0 −µa † −e W0 S1 eµa W1 · · · † −µa (11.70) M(x, y) = 0 −e W1 S2 .. .. .. . . . −eµa W Nt
Si contains the mass term and the spatial hopping terms in the ith time slice and Wi consists of the hopping terms for passage from the ith to the (i + 1)th slice. When we take the limit in which m → ∞ and µ → ∞, we are left with just the 2ma terms along the diagonal and the forward hopping terms eµa Wi connecting adjoining time slices. Spatial points are now decoupled and the fermion determinant becomes a product of determinants on each static worldline, eµa Nc Nt det(L( x ) + C) (11.71) det M = x
where L( x ) is the Wilson (Polyakov) line at spatial site x, Nc is the number of colors, and Nt is the number of time slices. The parameter C is just (2ma/eµa ) Nt .
346
Lattice-gauge theory
Now the determinant can be evaluated for any SU(N ) gauge theory. For SU(2), det(L( x ) + C) = C 2 + C tr L( x) + 1
(11.72)
det(L( x ) + C) = C 3 + C 2 tr L( x ) + C tr L † ( x) + 1
(11.73)
and for SU(3),
We read off that the SU(2) determinant is explicitly real because the trace of an SU(2) matrix is real. However, the SU(3) determinant is explicitly complex because of the different weightings of the tr L( x ) and the tr L † ( x ). Even though the SU(3) determinant is complex, we can evaluate its partition function using standard methods. The partition function reads eµa Nc Nt det(L( x ) + C) e−Sgauge (11.74) Z = [DU ] =
x
[DU ]e−Sgauge +
x )+C)]} x {µa Nc Nt +ln[det(L(
(11.75)
So we can update the spatial links with a Metropolis algorithm borrowed from quenched QCD simulations. The temporal links require the trick of using the magnitude of the determinant and the gauge action as an effective action and then incorporating the phase of the determinant θ into the expectation value of interest, A =
Aeiθ eiθ
(11.76)
where the expectation values are taken with respect to the ensemble of configurations weighted by the magnitude of the determinant. Of course, this method will not be practical if the phase oscillates wildly and/or averages to zero. In this case, as opposed to examples with light fermions known in statistical mechanics and to QCD itself with light quarks, the average of eiθ is free of such problems and useful estimates can be made. One calculates all the usual thermodynamic quantities, including the average quark density, and investigates the fate of the µ = 0.0 finite-T transition, which is of first order for three light quarks in SU(3) QCD, as one turns the chemical potential up. One expects that the first-order transition will be stable against perturbations and that the first-order transition will persist to some intermediate value of 1/C. Instead, plots of the Wilson line against coupling for small values of 1/C indicate that the first-order transition becomes a crossover immediately. This peculiar result is not understood. Perhaps it is related to the limiting procedure µ → ∞ and m → ∞. Certainly it can be tested with light quarks in
11.6 The Glasgow algorithm
347
the case of SU(2) lattice-gauge theory and developments we shall review below suggest that this limited question can also be investigated in three-flavor SU(3) lattice-gauge theory for temperatures near the quark–gluon-plasma transition Tc at µ = 0.0. 11.6 The Glasgow algorithm The original Glasgow algorithm was put forward, tested, and develped by Ian Barbour [149]. The idea is, very roughly, the following. Since we do not know how to simulate lattice QCD at nonzero µ for the baryon number because the action becomes complex, we will isolate the action’s µ dependence in a fugacity expansion. The coefficients of the fugacity expansion will then be computed by using an ensemble of configurations generated at the coupling of interest, but at vanishing µ. This naive idea faces some serious hurdles. Forgetting about the details of the fugacity expansion, the method would be like inferring that the Ising model can exist at low temperature in a state of spontaneous magnetization by accumulating Ising configurations at high temperature. One would have to have an astronomically large ensemble of configurations and hope to find every now and then a statistical fluctuation so that a state with nonvanishing mean magnetization would appear. These configurations might be singled out and used to make up a Boltzmann distribution of configurations at low temperatures that possess a nonvanishing mean magnetization. How efficient could such a “reweighting” procedure be? Since we start in the wrong phase where the spins are fluctuating almost independently of each other, an initial estimate of the probability of finding a configuration having a nonzero mean magnetization would be exponentially small in the number of degrees of freedom of the system, i.e. exp(−cV ), where V is the volume of the system and c is a finite constant. Actually, prospects for this method working at zero T might be much worse than these estimates which are based on analogies with similar strategies intended to work for transitions along the temperature axis. We know from the studies of soluble models and general arguments that many observables are completely unaffected by the chemical potential for µ < µc if our physical system has vanishing temperature. How then could one use a fugacity expansion starting in the low-µ phase and find the existence of a phase transition at µc and a completely different phase at high µ? The nonanalyticities involved in the eµ plane suggest that this is not possible. As simple-minded as this assessment of the method is, quantitative studies of the method generally support its conclusion. Why then is there any interest in such an approach? The hope is that there are generalizations of the method that do not have such serious limitations. We will review below a physical situation
348
Lattice-gauge theory
in which a generalization of the method does, in fact, look promising. We will see that the prospects for a multi-variable generalization of the Glasgow method are much better at a substantial T , near the quark–gluon phase transition at vanishing µ. Glasgow-style methods begin with a fugacity expansion of the fermion determinant. The fermion matrix, which is the discrete form of the Dirac operator on the lattice using staggered fermions, reads 2iM = G + 2im + Vˆ eµ + Vˆ † e−µ
(11.77)
Then the determinant of M reads det(2iM) = e−3µV det(P − eµ )
(11.78)
where V is the lattice size N 3 × Nt and P is the propagator matrix, which is independent of µ, −(G + 2im)Vˆ Vˆ P= (11.79) −Vˆ 0 A basis in which the propagator matrix is diagonal can be chosen, det(2iM) = e−3µV (λk − eµ )
(11.80)
k=1,6V
This representation makes it clear that the zeros of the determinant in the eµ plane are the eigenvalues of the propagator matrix. Since the eigenvalues λk of the propagator matrix satisfy λk+ j = ei2π j/Nt λk , j = 0,1, . . . , Nt − 1, we can simplify this expression using the polynomial decomposition (ei2π j/Nt β − eµ ) = β Nt − eµNt (11.81) j=0,Nt −1
and find det(2iM) = e−3µV
(λkNt − eµNt )
(11.82)
k=1,6N 3
which produces the desired fugacity expansion, det(2iM) = bk f k
(11.83)
k=−3N 3 ,3N 3
where f = eµNt is the fugacity. This final expression produces a polynomial expansion of the partition function Z (µ), Z (µ) = bk ekµT (11.84) k=−3N 3 ,3N 3
where the expectation value is taken in a configuration having µ = 0.0.
11.7 The Fodor–Katz method
349
Tests of this method on Gross–Neveu models, which we reviewed above and can be simulated at nonzero chemical potential by conventional algorithms, indicate that the overlap problem does invalidate it. In the case of the Gross–Neveu model one could test whether the Glasgow method could find the chiralityrestoring phase transition at the known µc by generating configurations in either phase, calculating the fugacity expansion as explained above, and searching for the Lee–Yang zeros in the result. In all cases the method failed to find the known transition, indicating that the coefficients of the fugacity expansion for the partition function cannot be estimated accurately starting from an ensemble of limited, but large, statistics in the wrong phase. It is generally believed that the Glasgow method needs some additional ingredients if it is to succeed. Some interesting suggestions have been made and we will review a particularly promising and relevant one next. 11.7 The Fodor–Katz method for high T, low µ The Glasgow reweighting method appears to be more effective at nonzero T . Suppose that you are interested in the region of the QCD phase diagram where T is near the transition from the hadronic phase to the quark–gluon-plasma phase and the baryon chemical potential is relatively small. This region of phase space is of considerable interest because of its relevance to the heavy-ion experimental program which produces nuclear matter under these conditions, very hot with a small excess of baryon number. Using conventional algorithms one can generate field configurations all along the T axis. For moderate quark masses in three-flavor SU(3) lattice-gauge theory, the transition on the T axis should be smoothed to a sharp crossover. However, if the chemical potential µ were turned up, the crossover would be expected to move to slightly smaller T . Eventually, if µ were increased to µT , a critical point where the crossover becomes a line of first-order transitions would be found. It would be particularly interesting if this critical point were experimentally accessible, as we have discussed in previous chapters. A multi-variable Glasgow algorithm has been suggested and tested on small lattices, which suggests that µT can be found by lattice methods available at present. One begins with the reweighting expression for the partition function [150], Z [α] = D[U ] exp(−Sgauge (U ; α0 )) det M(U ; α0 ) detM(U ; α) (11.85) × exp −Sgauge (U ; α, α0 ) detM(U ; α0 ) which means that configurations are generated at the parameter setting α0 = (β0 , m 0 , µ0 ), but, by evaluating the matrix element in large parentheses, we
350
Lattice-gauge theory
-
Figure 11.11. The crossover, critical point, and first-order line predicted by Fodor and Katz. effectively attempt to reweight those configurations to generate an ensemble of configurations at the new parameter setting α = (β, m, µ). One expects the overlap between the two ensembles to be exponentially small in the volume of the system, but large enough on small lattice volumes to simulate successfully in particular region of the T –µ plane. An intuitive and compelling proposal that the overlap is large along the line of crossovers in the T –µ plane has been suggested [110]. It is necessary to vary both β and µ in order to stay on the line. The idea is that configurations along the line are strongly fluctuating between hadronic behavior and plasma behavior. So, if we just move along the crossover line, the character of the configurations does not change dramatically and the overlap between the configurations at α0 = (β0 , m 0 , µ0 ) and α = (β, m, µ) is large. In practice one identifies βc (µ) by calculating the lowest Yang–Lee zero as one steps from smaller to larger µ. At the tricritcal point the Yang–Lee zero would pinch the real axis in the thermodynamic limit N → ∞. In practice one uses finite-size scaling analysis to infer this behavior. The result of this exercise is shown in Fig. 11.11, which is very encouraging but not yet quantitative because the sizes of lattices were small in the actual simulations and the quark masses were rather large. 11.8 QCD at complex chemical potential Although the QCD partition function becomes complex in the presence of a real, physical baryon chemical potential, the partition function remains real for an imaginary chemical potential. This brings up the possibility that the way to study QCD in a baryon-rich environment is to simulate it with an imaginary chemical potential and then analytically continue observables to real chemical potential.
11.8 QCD at complex chemical potential
351
In the early days of lattice-gauge theory this approach was tried and abandoned because, at low T and variable imaginary µ, the chemical potential had little effect on observables. However, if one is less ambitious and more focused, the approach appears very promising for high temperatures, near the quark–gluonplasma transition, and for small chemical potentials [151]. Write the chemical potential as µ = µR + iµI
(11.86)
and note that, for purely imaginary chemical potential, the effect on the theory is the same as a purely temporal Abelian vector potential coupled to the quark number. Such a theory clearly has a real fermion determinant. The imaginary chemical potential affects a forward-hopping fermion with a pure constant phase, eiµI , and affects a backward-hopping fermion with a phase, e−iµI . There is a maximum µI that can be studied by this method. Consider a fermion path that goes once around the anti-periodic temporal lattice. The fermion worldline picks up a total phase of eiµI Nτ . If this phase lies in the center of the gauge group SU(3), i.e. Z(3), then it has no physical effect because it can be removed by a gauge transformation. This restricts the physical region of µI , µI <
2π 2π = T 3Nτ 3
(11.87)
In other words, the physics of the theory is periodic in the variable µI /T . Let’s write this as a symmetry of the grand canonical partition function, ˆ ˆ Z (V, µ, T ) = tr e(− H −µ Q)/T
(11.88)
where Hˆ is the Hamiltonian operator and Qˆ is the quark-number operator. (In terms of the physical baryon number B and the baryon chemical potential µB , we must remember the factors of three, B = Q/3 and µ = µB /3.) At nonzero temperature Z depends on µ/T , so it is often more convenient to use the variable µ¯ ≡ µ/T
(11.89)
and express Z as Z (V, µ, ¯ T ). Our exercise above, which concluded that the physics is periodic in µI /T , reads Z (µ¯ R , µ¯ I ) = Z (µ¯ R , µ¯ I + 2π/3)
(11.90)
In addition Z is an even function of µ¯ because switching the sign of the chemical potential can always be compensated by a time reflection, Z (µ) ¯ = Z (−µ) ¯
(11.91)
352
Lattice-gauge theory
On combining these two symmetry relations, we see that the partition function is symmetric about every multiple of µ¯ I = π/3. In particular, Z (µ¯ R = 0, µ¯ I = π/3 + µ¯ I ) = Z (µ¯ R = 0, µ¯ I = π/3 − µ¯ I )
(11.92)
and the free energy and other observables must inherit this symmetry. Simulations can certainly be made for small imaginary chemical potential. Since the theory has a very clear transition from its hadronic phase to the quark– gluon plasma at T = Tc and µ = 0, there must be a clear line of transitions (crossovers, actually, for intermediate quark masses) inside the phase diagram for T ≈ Tc and µ¯ small. In order to use simulation data at imaginary chemical potential, we must analytically continue them to real chemical potential. We need to know the radius of convergence of the free energy in the complex-µ¯ plane in order to do this. The key to doing this lies in the Z(3) structure of the theory’s gauge group. Consider again fermion loops that wind once around the temporal lattice. As we discussed before, this quantity is best monitored with the free energy of a heavy quark, or the expectation of the Wilson line, L. In the pure gauge theory, the phase of the Wilson line is one of the members of the center of the gauge group, 0, ±2π/3. These three states have identical free energies F in the pure gauge theory. As a function of the phase of the Wilson line, φ, F(φ) must have barriers between its three minima. The physics of the barriers can be studied by simulating interfaces between the different Z(3) vacua. The first-order transitions between them have been studied in great detail because of their intrinsic interest and their potential applications to the physics of the early universe, etc. When dynamical fermions are restored into the theory, the Z(3) symmetry is explicitly broken and, therefore, the degeneracy of the three minima is lifted. The φ = 0 vacuum becomes the unique minimum and the other two states share a common, higher free energy. At high T one can calculate the µ¯ I dependence of the free energy of the φ = 0 vacuum [152], µ¯ 2I 1 2 4 (φ=0) F (11.93) (µ¯ I ) = − π T 1 − 2 4 π where −π < µ¯ I < π, and F (φ=0) (µ¯ I ) is periodic, F (φ=0) (µ¯ I ± 2π k) = F (φ=0) (µ¯ I ). To determine the analogous results for the other Z(3) vacua, we consider Z(3) transformations as illustrated above, leading us to conclude that F (φ=+2π/3) (µ¯ I ) = F (φ=0) (µ¯ I − 2π/3), F (φ=−2π/3) (µ¯ I ) = F (φ=0) (µ¯ I + 2π/3)
(11.94)
These three free energies are shown in Fig. 11.12. The three curves are copies of
11.8 QCD at complex chemical potential
353
V
φ = −2π/3
φ = 2π/3
φ=0
µ¯ I
Figure 11.12. The ground-state energy for the Z(3) vacua.
V
φ = −2π/3
φ=0
φ = 2π/3
µ¯ I
Figure 11.13. The ground-state energy for the physical vacuum at high T .
each other, just translated by µ¯ I = 2π/3. The physical branch of these curves is found by choosing the minimum free energy at each µ¯ I . This produces the free-energy curve in Fig. 11.13, where we see that the vacuum with φ = 0 is preferred for −π/3 < µ¯ I < +π/3, the vacuum with φ = +2π/3 is preferred for +π/3 < µ¯ I < +3π/3, and the vacuum with φ = −2π/3 is preferred for +3π/3 < µ¯ I < +5π/3, etc. Since the physical free energy has cusps at µ¯ I = 2π(k + 12 )/3, the transitions between the different vacua have latent heats and are of first order. Now we can return to the question of the radius of convergence of the partition function in the complex-chemical-potential plane. Since there is a phase transition at µ¯ I = π/3, this is an upper bound to the radius of convergence. So the method is restricted to values of the real part of the chemical potential for µr < π T /3. In practice, the method can be used to follow the finite-temperature transition out into the nonzero-chemical-potential portion of the phase diagram. Since Tc ≈ 170 MeV, this method is resticted to µB ≤ 500 MeV. Simulations to date using this method have been restricted to small lattices, so much remains to be demonstrated. However, the results are encouraging [151]. The transition between hadronic matter and the quark–gluon plasma has been tracked into the β–µ¯ plane by measuring susceptibilities and following their peaks. It turns out that the peaks in various susceptibilities are quite sharp and
354
Lattice-gauge theory
allow a surprisingly accurate determination of the path of the transition into the phase diagram, βc (µ). ¯ One can argue that βc (µ) ¯ is analytic in µ¯ with the same radius of convergence as the partition function itself, ¯ = an µ¯ 2n (11.95) βc (µ) n=0
The coefficients an have been determined for n = 0,1, and 2 on 83 × 4 lattices, and only the first two terms were needed for a good fit to the data. The analytic continuation from imaginary µ, where all the simulations and fits were done, to real µ is done by trivial substitution. Assuming the applicability of asymptotic scaling, the line of transitions in the T –µ plane was determined to be ) µ *2 Tc (µB ) B = 1 − 0.005 63(38) Tc (µB = 0) T
(11.96)
up to µB ∼ 500 MeV. Note that the µ dependence of the line of transitions is weak indeed. However, as we discussed in the previous section, the physics will eventually change rapidly with increasing µ because there is a tricritical point and a line of first-order transitions to the right of µT . Understanding this quantitatively through simulations is an ongoing theme in this young field. The Z(3) first-order transitions for high temperatures were also found numerically. The calculations appear to be under good control, with solid theoretical underpinnings. Immediate tasks for this method are the following. 1. Check the feasibility of imaginary-chemical-potential simulations for large lattices. 2. Carry out finite-size-scaling studies in the vicinity of the line of crossovers, βc (µ). ¯ 3. Determine the order of the hadronic-matter–quark–gluon-plasma transition as a function of µ. 4. Find the critical point µT , assuming that it is within the radius of convergence of the method, and determine whether it is relevant to the RHIC. 5. Map out the equation of state for all T and µ within the radius of convergence of the method.
12 Epilogue
The problems of quark confinement, chiral-symmetry breaking, and strongly interacting matter in extreme environments are converging and much is being learned, both experimentally and theoretically. We hope that this book helps researchers find common ground in this developing field. The breadth of the field is quite remarkable. Concepts from critical behavior, effective-field theory and random-matrix models, lattice-gauge theory and numerical methods, spin models and duality, equilibrium and nonequilibrium thermodynamics, and even string theory play important roles here. It is impossible to tell which approach will lead to the next major advance in the field. The field depends on a “new culture” in theoretical physics, one in which theorists use lattice-gauge theory as a realistic laboratory to test new ideas through numerical methods and computer simulations. The theorist is no longer limited, in many but, alas, not all cases, to unrealistic models for concrete inspiration. Lattice-gauge simulations have led to the sharpening of several approaches to QCD that have influenced the field very productively. We can look forward to the development of new algorithms that can address SU(3) QCD at nonzero baryon chemical potential and find new states of matter at high densities. Once such an algorithm is in place, the space of variable quark masses (m u , m d , and m s ) and variable chemical potentials (µu , µd and µs ) will become accessible, and the promise of a new “chemistry” will play itself out. Certainly new phases of condensates, crystal structures, and flavor symmetries are likely to be found. Perhaps the conventional formulation of lattice-gauge theory will not be powerful enough to make this progress and a new formulation will have to be invented. Time will tell. One should not underestimate the accomplishments of lattice-gauge theory to date. The development of the thermodynamics of hot QCD, complete with the “best” estimates of Tc at which the quark–gluon plasma is produced, is a milestone in the field. Many more specific predictions relevant to the early universe 355
356
Epilogue
and to RHIC experiments have also been made and are being improved. Much of this work is producing insight into the working of QCD in ordinary environments, shedding light on confinement and chiral-symmetry breaking. Perhaps the lofty goal of producing an approach to QCD that can incorporate asympototic freedom at short distances, chiral-symmetry breaking and light-hadron spectroscopy at intermediate distances, and confinement at large distances is not so far off. Most probably it is, but the process of searching for it may produce surprising rewards. As this book has emphasized, it is the convergence of numerical methods with traditional and developing analytic and theoretical methods which makes this field particularly interesting and productive, at least in the eyes of the authors. The hand-in-hand development of both approaches, theoretical approaches to suggest numerical work and numerical work to challenge theoretical approximations, is new to field theory and will, it is hoped, have a bright and long future. The authors hope that this book will serve the reader well in new pioneering efforts from confinement to extreme environments.
References
[1] J. B. Kogut, Rev. Mod. Phys. 51 (1979), 659; 55 (1983), 775. [2] J. B. Kogut and K. G. Wilson, Phys. Rep. C12 (1974), 75. [3] Phase Transition and Critical Phenomena, edited by C. Domb and M. S. Green (London, Academic Press, 1974). [4] N. D. Mermin and H. Wagner, Phys. Rev. Lett. 17 (1966), 1133. [5] J. M. Kosterlitz and D. J. Thouless, J. Phys. C6 (1973), 118. [6] J. M. Kosterlitz, J. Phys. C7 (1974), 1046. [7] A. M. Polyakov, Phys. Lett. B59 (1975), 79. [8] R. H. Swendsen and J.-S. Wang, Phys. Rev. Lett. 58, 86 (1987); U. Wolff, Phys. Rev. Lett. 62 (1989), 361. [9] C. Stzyleson and J.-M. Drouffe, Statistical Field Theory (Cambridge, Cambridge University Press, 1989). [10] K. G. Wilson, Phys. Rev. D14 (1974), 2455; A. M. Polyakov, Phys. Lett. B59 (1975), 82. [11] C. N. Yang and R. L. Mills, Phys. Rev. 96 (1954), 1605. [12] H. D. Politzer, Phys. Rev. Lett. 30 (1973), 1346; D. J. Gross and F. Wilczek, Phys. Rev. Lett. 30 (1973), 1343. [13] J. Schwinger, Phys. Rev. 128 (1962), 2425. [14] S. Adler, Phys. Rev. 177 (1969), 2426; J. S. Bell and R. Jackiw, Nuov. Cim. 60A (1969), 47. [15] J. B. Kogut and L. Susskind, Phys. Rev. D11 (1975), 3594. [16] J. Lowenstein and A. Swieca, Ann. Phys. (NY) 68 (1971), 172. [17] S. Coleman, R. Jackiw, and L. Susskind, Ann. Phys. (NY) 93 (1975), 267. [18] B. Klaiber, in Lectures in Theoretical Physics, Vol. XA, edited by A. O. Barut and W. Brittin (New York, Gordon and Breach, 1968).
357
358
References
[19] T. Schafer and E. Shuryak, Rev. Mod. Phys. 70 (1998), 323. [20] G. ’t Hooft, Nucl. Phys. B138 (1978), 1. [21] B. C. Barrois, Nucl. Phys. B129 (1977), 390. [22] D. Bailin and A. Love, Phys. Rep. 107 (1984), 325, and references therein. [23] M. Alford, K. Rajagopal, and F. Wilczek, Phys. Lett. B422 (1998), 247. [24] R. Rapp, T. Sch¨afer, E. V. Shuryak, and M. Velkovsky, Phys. Rev. Lett. 81 (1998), 53. [25] H. B. Nielsen and M. Ninomiya, Nucl. Phys. 185 (1981), 20. [26] W. Shockley, Phys. Rev. 56 (1939), 317. [27] D. Kaplan, Phys. Lett. B288 (1992), 342. [28] C. Callan and J. Harvey, Nucl. Phys. B250 (1985), 427. [29] P. Vranas, Nucl. Phys. Proc. Suppl. 94 (2001), 177. [30] P. Ginsparg and K. Wilson, Phys. Rev. D25 (1982), 2649. [31] M. Luscher, Phys. Lett. B428 (1998), 342. [32] H. Neuberger, Phys. Lett. B417 (1998), 141. [33] A. Casher, Phys. Lett. B83 (1979), 395. [34] Y. Nambu and G. Jona Lasinio, Phys. Rev. 122 (1961), 345. [35] T. Banks and A. Casher, Nucl. Phys. B169 (1980), 103. [36] S. Raby, S. Dimopoulos, and L. Susskind, Nucl. Phys. B169 (1980), 373. [37] J. M. Blairon, R. Brout, F. Englert, and J. Greensite, Nucl. Phys. B180 (1981), 439. [38] S. Duane and J. B. Kogut, Phys. Rev. Lett. 55 (1985), 2774; S. Gottlieb, W. Liu, D. Toussaint, R. L. Renken, and R. L. Sugar, Phys. Rev. D35 (1987), 2531. [39] S. Duane, A. D. Kennedy, B. J. Pendleton, and D. Roweth, Phys. Lett. B195 (1987), 216. [40] J. B. Kogut and L. Susskind, Phys. Rev. D11 (1975), 395. [41] M. Luscher, Nucl. Phys. B180 (1981), 317. [42] R. B. Pearson, J. Shigemitsu, and J. B. Kogut, Phys. Lett. 98B (1980), 63; J. B. Kogut, D. K. Sinclair, R. B. Pearson, J. L. Richardson, and J. Shigemitsu, Phys. Rev. D23 (1981), 2945. [43] L. Susskind, Phys. Rev. D20 (1979), 2610. [44] B. Svetitsky and L. G. Yaffe, Nucl. Phys. B210 (1982), 423. [45] R. D. Pisarski and F. Wilczek, Phys. Rev. D29 (1984), 338. [46] J. B. Kogut and C. Detar, Phys. Rev. Lett. 59 (1987), 399. [47] F. Karsch, E. Laermann, P. Petreczky, S. Stickan, and I. Wetzorke, heplat/0110208. [48] M. Asakawa, Y. Nakahara, and T. Hatsuda, Prog. Part. Nucl. Phys. 46 (2001), 459.
References
359
[49] T. Matsui and H. Satz, Phys. Lett. B178 (1986), 416. [50] F. Karsch, in Proceedings of QCD@Work (Washington, American Institute of Physics, 2001), 323. [51] J.-P. Blaizot, hep-ph/0107131. [52] N. Bili´c, K. Demeterfi, and B. Petersson, Nucl. Phys. B377 (1992), 651. [53] S. P. Klevansky, Rev. Mod. Phys. 64 (1992), 649. [54] G. E. Brown, M. Buballa, and M. Rho, Nucl. Phys. A609 (1996), 519. [55] J. Stachel, Nucl. Phys. A610 (1996), 509c. [56] A. L. Fetter and J. D. Walecka, Quantum Theory of Many-Particle Systems (McGraw-Hill, New York, 1971). [57] A. B. Migdal, E. E. Saperstein, M. A. Troitsky, and D. N. Voskresensky, Phys. Rep. 192 (1990), 179. [58] I. Klebanov, Nucl. Phys. B262 (1985), 133. [59] D. B. Kaplan and A. E. Nelson, Phys. Lett. B175 (1986), 57. [60] E. Witten, Phys. Rev. D30 (1984), 272. [61] E. Farhi and B. Jaffe, Phys. Rev. D30 (1984), 2379. [62] M. Alford, K. Rajagopal, and F. Wilczek, Nucl. Phys. B537 (1999), 443. [63] T. Sch¨afer and F. Wilczek, Phys. Rev. Lett. 82 (1999), 3956. [64] L. P. Csernai and J. I. Kapusta, Phys. Rep. 131 (1986), 223. [65] M. A. Stephanov, Phys. Rev. Lett. 76 (1996), 4472. [66] W. Trautmann, Multifragmentation in relativistic heavy ion collisions, in Proceedings the International Summer School on Correlations and Clustering Phenomena in Subatomic Physics, Dronten, the Netherlands, 1996 (to be published by Plenum, New York); e-print: nucl-ex/9611002. [67] HEMCGC and HTMCGC collaborations, Nucl. Phys. Proc. Suppl. B30 (1993), 315. [68] R. Pisarski and F. Wilczek, Phys. Rev. D29 (1984), 338. [69] C. Bernard, T. Blum, C. DeTar et al. Phys. Rev. Lett. 78 (1997), 598. [70] F. R. Brown, F. P. Butler, H. Chen et al. Phys. Rev. Lett. 65 (1990), 2491. [71] I. D. Lawrie and S. Sarbach, in Phase Transitions, Vol. 9, edited by C. Domb and J. L. Lebowitz (New York, Academic Press, 1984). [72] E. Shuryak, hep-ph/0205031. [73] F. Karsch, in 40th Internationale Universit¨atswochen f¨ur Theoretische Physik: Dense Matter (Berlin, Springer, 2002), 209. [74] J. B. Kogut and L. Susskind, Phys. Rep. 8 (1973), 75. [75] R. P. Feynman, Photon–Hadron Interactions (New York, W. A. Benjamin, 1972). [76] L. N. Lipatov, Sov. J. Nucl. Phys. 23 (1976), 338.
360
References
[77] J. B. Kogut and L. Susskind, Phys. Rev. D9 (1974), 3391. [78] G. Altarelli and G. Parisi, Nucl. Phys. B126 (1977), 298. [79] L. V. Gribov, E. M. Levin, and M. G. Ryskin, Phys. Rep. 100 (1983), 1; for a recent review, see E. Iancu, A. Leonidov, and L. McLerran, hep-ph/0202270. [80] E. Ferreiro, E. Iancu, K. Itakura, and L. McLerran, hep-ph/0206241. [81] D. Kharzeev, Nucl. Phys. B119 (2003). [82] J. D. Bjorken, Phys. Rev. D27 (1983), 140. [83] M. A. Stephanov, K. Rajagopal, and E. V. Shuryak, Phys. Rev. Lett. 81 (1998), 4816. [84] M. A. Stephanov, K. Rajagopal, and E. V. Shuryak, Phys. Rev. D60 (1999), 114 028. [85] B. Berdnikov and K. Rajagopal, Phys. Rev. D61 (2000), 105 017. [86] D. K. Hong, Phys. Lett. B473 (2000), 118 (hep-ph/9812510). [87] D. T. Son, Phys. Rev. D59 (1999), 094 019. [88] R. D. Pisarski and D. H. Rischke, Phys. Rev. D61 (2000), 051 501 (nuclth/9907041). [89] T. Schafer and F. Wilczek, Phys. Rev. D60 (1999), 114 033 (hep-ph/9906512). [90] W. E. Brown, J. T. Liu, and H. C. Ren, Phys. Rev. D61 (2000), 114 012 (hepph/9908248); Phys. Rev. D62 (2000), 054 016 (hep-ph/9912409). [91] R. Rapp, T. Schafer, E. V. Shuryak, and M. Velkovsky, Ann. Phys. 280 (2000), 35 (hep-ph/9904353); T. Schafer, Nucl. Phys. B575 (2000), 269 (hep-ph/9909574); Phys. Rev. D65 (2002), 094 033 (hep-ph/0201189). [92] D. T. Son, M. A. Stephanov, and A. R. Zhitnitsky, Phys. Rev. Lett. 86 (2001), 3955. [93] R. Casalbuoni and R. Gatto, Phys. Lett. B464 (1999), 111 (hep-ph/9908227). [94] D. T. Son and M. A. Stephanov, Phys. Rev. D61 (2000), 074 012 (hepph/9910491); Phys. Rev. D62 (2000), 059 902 (hep-ph/0004095). [95] E. Witten, Ann. Phys. 128 (1980), 363. [96] N. N. Bogolyubov, V. V. Tolmachev, and D. V. Shirkov, New Methods in the Theory of Superconductivity (New York, Consultants Bureau, 1959); P. W. Anderson, Phys. Rev. 112 (1958), 1900; A. J. Leggett, Rev. Mod. Phys. 47 (1975), 331. [97] T. Schafer, Phys. Rev. D65 (2002), 074 006 (hep-ph/0109052). [98] K. Rajagopal and F. Wilczek, Phys. Rev. Lett. 86 (2001), 3492 (arXiv:hepph/0012039). [99] P. F. Bedaque and T. Schafer, Nucl. Phys. A697 (2002), 802. [100] K. Rajagopal and E. Shuster, Phys. Rev. D62 (2000), 085 007. [101] J. Berges and K. Rajagopal, Nucl. Phys. B538 (1999), 215. [102] D. B. Kaplan and S. Reddy, Phys. Rev. D65 (2002), 054 042.
References
361
[103] M. G. Alford, J. Berges, and K. Rajagopal, Nucl. Phys. B558 (1999), 219. [104] M. G. Alford, J. A. Bowers, and K. Rajagopal, Phys. Rev. D63 (2001), 074 016. [105] P. Fulde and A. Ferrell, Phys. Rev. 135 (1964), A550; A. I. Larkin and Yu. N. Ovchinnikov, Sov. Phys. JETP 20 (1965), 762. [106] J. A. Bowers and K. Rajagopal, Phys. Rev. D66 (2002), 065 002. [107] M. Alford and K. Rajagopal, JHEP 0206 (2002), 031. [108] K. Rajagopal and F. Wilczek, arXiv:hep-ph/0011333. [109] M. Alford, Ann. Rev. Nucl. Part. Sci. 51 (2001), 131. [110] M. G. Alford, A. Kapustin, and F. Wilczek, Phys. Rev. D59 (1999), 054 502. [111] E. V. Shuryak and J. J. M. Verbaarschot, Nucl. Phys. A560 (1993), 306. [112] A. D. Jackson, M. K. Sener, and J. J. M. Verbaarschot, Nucl. Phys. B479 (1996), 707. [113] E. V. Shuryak, Nucl. Phys. B203 (1982), 93. [114] M. A. Halasz, A. D. Jackson, R. E. Shrock, M. A. Stephanov, and J. J. Verbaarschot, Phys. Rev. D58 (1998), 096 007. [115] A. D. Jackson and J. J. M. Verbaarschot, Phys. Rev. D53 (1996), 7223. [116] T. Wettig, A. Sch¨afer, and H. A. Weidenmuller, Phys. Lett. B367 (1996), 28. [117] C. N. Yang and T. D. Lee, Phys. Rev. 87 (1952), 104; T. D. Lee and C. N. Yang, Phys. Rev. 87 (1952), 410. [118] V. L. Girko, Spektralnaia teoriia sluchainykh matrits (Moscow, Nauka, 1988). [119] D. Weingarten, Phys. Rev. Lett. 51 (1983), 1830. [120] J. B. Kogut, D. Toublan, and D. K. Sinclair, Phys. Lett. B514, (2001), 77. [121] J. B. Kogut, M. A. Stephanov, D. Toublan, J. J. Verbaarschot, and A. Zhitnitsky, Nucl. Phys. B582 (2000), 477. [122] D. T. Son and M. A. Stephanov, Phys. Rev. Lett. 86 (2001), 592. [123] See, e.g., H. Georgi, Weak Interaction and Modern Particle Theory (Menlo Park, Benjamin-Cummings, 1984). [124] A. J. Leggett, J. Physique 41 (1980), C7-19; P. Nozi`eres and S. Schmitt-Rink, J. Low Temp. Phys. 59 (1985), 195. [125] See, e.g., M. Randeria, cond-mat/9710223 and references therein. [126] B. I. Halperin and P. C. Hohenberg, Phys. Rev. 188 (1969), 898; P. C. Hohenberg and B. I. Halperin, Rev. Mod. Phys. 49 (1977), 435. [127] D. T. Son and M. A. Stephanov, Phys. Rev. Lett. 88 (2002), 202 302 (arXiv:hepph/0111100). [128] R. D. Pisarski and F. Wilczek, Phys. Rev. D29 (1984), 338. [129] K. Rajagopal and F. Wilczek, Nucl. Phys. B399 (1993), 395. [130] E. V. Shuryak, Phys. Rev. D42 (1990), 1764.
362
References
[131] R. D. Pisarski and M. Tytgat, Phys. Rev. D54 (1996), 2989. [132] D. T. Son and M. A. Stephanov, Phys. Rev. D66 (2002), 076 011. [133] J. L. Goity and H. Leutwyler, Phys. Lett. B228 (1989), 517. [134] B. D. Josephson, Phys. Lett. 21 (1966), 608. [135] G. A. Baker, B. G. Nickel, and D. I. Meiron, Phys. Rev. B17 (1978), 1365. [136] S. Gottlieb, W. Liu, D. Toussaint, R. L. Renken, and R. L. Sugar, Phys. Rev. Lett. 59 (1987), 2247; S. Gottlieb, U. M. Heller, A. D. Kennedy, S. Kim, J. B. Kogut, C. Liu, R. L. Renken, D. K. Sinclair, R. L. Sugar, D. Toussaint, and K. C. Wang, Phys. Rev. D55 (1997), 6852. [137] P. Braun-Munzinger, J. Stachel, J. P. Wessels, and N. Xu, Phys. Lett. B344 (1995), 43; Phys. Lett. B365 (1996), 1; P. Braun-Munzinger, I. Heppe, and J. Stachel, Phys. Lett. B465 (1999), 15. [138] C. M. Hung and E. V. Shuryak, Phys. Rev. C57 (1998), 1891. [139] J. B. Kogut, H. Matsuoka, S. H. Shenker, J. Shigemitsu, D. K. Sinclair, M. Stone, and H. W. Wyld, Nucl. Phys. B225 (1983), 93. [140] F. Karsch and P. Hasenfratz, Phys. Lett. B125 (1983), 308. [141] K. G. Wilson, Phys. Rev. D7 (1973), 2911. [142] Hands, J. B. Kogut, and A. Kocic, Ann. Phys. 224 (1993), 29. [143] B. Rosenstein, B. J. Warr, and S. H. Park, Phys. Rev. Lett. 62 (1989), 1433; Phys. Rep. 205 (1991), 59. [144] Y. Cohen, S. Elitzur, and E. Rabinovici, Nucl. Phys. B220 (1983), 102. [145] J. B. Kogut, M. A. Stephanov, and D. Toublan, Phys. Lett. B464 (1999), 183. [146] K. Splittorff, D. Toublan, and J. J. M. Verbarschot, Nucl. Phys. B620 (2002), 290. [147] K. D. Born, E. Laermann, F. Langhammer, T. F. Walsh, and P. M. Zerwas, Phys. Rev. D40 (1989), 1664. [148] T. C. Blum, J. E. Hetrick, and D. Toussaint, Phys. Rev. Lett. 76 (1996), 1019. [149] I. Barbour and A. J. Bell, Nucl. Phys. B372 (1992), 385. [150] Z. Fodor and S. D. Katz, Nucl. Phys. Proc. Suppl. 106 (2002), 441. [151] P. De Forcrand and O. Philipsen, hep-lat/0205016. [152] A. Roberge and N. Weiss, Nucl. Phys. B275 (1986), 734.
Index
critical point, 23 crystalline color superconductor, 288
action, 14 anomalies, 9 area law, 68, 75 asymptotic freedom, 8, 35, 80, 214, 229 axial anomaly, 65, 83, 94, 117, 125, 176
dimensional reduction, 179 diquark, 98, 305, 308, 345 diquark condensation, 12, 247, 256, 272, 305, 312, 347 domain-wall fermion, 10, 120 duality, 21, 23, 30, 55, 77, 162, 171
Bardeen–Cooper–Schrieffer, 99, 127, 204, 246, 305, 321 baryon number density, 6, 219 beta function, 40, 81, 174 Bose condensation, 210
effective Lagrangian, 272, 308 effective Lagrangians, 12, 278, 304, 327 entropy, 28, 78, 196, 201, 217 extreme environments, 5, 234
canonical ensemble, 195 chemical potential, 7, 98, 198, 214, 223, 246, 249, 289, 292, 304, 309, 327, 333, 346 chemistry of QCD, 7, 219 chiral condensate, 97, 126, 127, 181, 220, 229, 235, 292, 338 chiral perturbation theory, 304, 317 chiral–symmetry breaking, 5, 83, 96, 126, 132, 206 color superconductivity, 8 color–flavor locking, 226, 248, 279 computer simulations, 43, 135, 173 condensate, 6, 29, 246 confinement, 5, 61, 69, 75, 78, 80, 155, 168 continuum limit, 16, 40, 61, 81 correlation length, 16, 17, 19, 25, 39 critical end-point, 245 critical exponent, 17
Fermi surface, 99, 202, 247, 249, 267, 335 flux tubes, 5, 66, 91, 155 free energy, 15, 177, 362 gap equation, 252, 254, 257 Ginsparg–Wilson, 11, 123 Glasgow algorithm, 357, 359 Goldstone, 72, 82, 96, 112, 132, 271, 305, 311 grand canonical ensemble, 198 Hamiltonian, 47, 52, 100, 145 harmonic oscillator, 48 Higgs, 72, 75, 85, 207, 248, 270, 305 high temperature, 8, 168, 213, 235 high-temperature expansion, 23, 68, 154
363
364
Index
hybrid algorithm, 135, 141, 346 hyperscaling, 20, 340
pion condensate, 319 plaquette, 61
instantons, 42, 91, 98 Ising model, 14, 190
quark–gluon plasma, 98, 168, 184, 185, 188, 213, 229, 234, 237, 325 quenched, 7, 168, 176, 291, 300
Kosterlitz-Thouless, 24, 30, 162 Landau damping, 247, 261 Langevin, 135, 136, 141 lattice fermions, 100 linearly confining potential, 5, 66, 75, 78, 91, 156, 172 local gauge invariance, 9, 59, 66, 69, 145 low-temperature expansion, 22 magnetic gluons, 261 magnetization, 15 Matsubara, 179, 212, 325, 341 mean field, 19, 131, 231, 297, 314 Meissner mass, 276 Metropolis algorithm, 43, 356 microcanonical ensemble, 195 modified electromagnetism, 286 monopole, 69, 77 Nambu–Jona Lasinio model, 96, 127, 254, 339 nuclear matter, 6, 224
random matrix, 7, 291, 292, 297 renormalization group, 19, 36, 81 RHIC, 11, 237 rotational symmetry, 63 roughening, 30, 72, 161 screening, 6, 177, 181, 260, 266, 325 screening lengths, 184 sign problem, 246, 289 species doubling, 10, 344 staggered fermions, 10, 104, 111, 153 SU(3), 46, 352 susceptibility, 14, 181 theta vacuum, 87, 94 topology, 29, 41, 69, 92, 125 transfer matrix, 49, 52 tricritical point, 230, 232, 233, 297, 347 universality, 19, 55, 179, 230, 330 vortices, 27, 41, 69
order parameter, 14, 126, 178, 233 partition function, 14, 21, 30, 47, 70, 91, 135, 169, 194, 289, 333 path integral, 14, 47, 48, 196, 289 phase diagram, 219, 222, 247
Wilson fermions, 104 Wilson line, 178 Wilson loop, 65 zero modes, 95, 122, 125, 127