(It turned out that we exchanged messages in the wrong order in this
thread.
Hence you might want to first read my second
answer to Calvin's question.)
Viewing
message <aiok4u$chg$1@rs04.hrz.uni-essen.de>
From: Urs Schreiber (Urs.Schreiber@uni-essen.de)
Subject: time in ordinary QM (was Boosts)
Newsgroups: sci.physics.research
Date: 2002-08-06 18:01:35 PST
"Calvin Ritchie" <DonRitchie870@csWebmail.com> schrieb im Newsbeitrag
news:F9B19.11203$pg2.906940@bgtnsc05-news.ops.worldnet.att.net...
[...]
> There's a curious side issue related
to this involving commutation of
> time and Hamiltonian. Everyone knows that [H,t]=-i,
Hm, this is not true. Not in the naive sense, at least. There is a way,
however, to make it true in some sense and with some modification,
I think. See
below.
> but, when we write the
> 'time-evolution operator', T=exp(-iHt), and take a partial w.r.t.
t of an
> expression involving T (such as a Heisenberg rep operator), we always
let H
> commute with T. [...]
Comparing this situation with that of systems who have a Hamiltonian
*constraint* instead of a Hamiltonian generator of time evolution might
suggest
a way to resolve the issue.
Instead of treating the time parameter different from the other coordinates
of
configuration space one could instead "parameterize" the dynamics by
letting
"t" be one of the system's coordinates. In quantum mechanics it should
work
like this:
Define a Hilbert space of *kinematical states*, say that of rapidly
decreasing
complex values functions over space *and* time. If M is the configuration
space
of the original system and R is the time "axis" then this could be
S(R x M).
This space does not in general contain any "physical" states, which
I'll come
to in a moment, it does however support, among the usual operators
previously
defined on S(M), the canonical pair t, p_t = -i hbar d/dt which satisfies
[t,p_t] = i hbar .
The usual Hamiltonian H, when trivially extended from S(M) to S(RxM),
commutes
with t and p_t. It's special relation to these two operators is not
in the
kinematics but in the *dynamics*. Hence, on top of the kinematic description
that we have set up we now need to impose the dynamics. This is done
by
"projecting" (in a suitably generalized sense) out the physical component
of a
kinematical state. Since we want physical states to be those that satisfy
the
Schroedinger equation
(p_t + H)|phys> = 0
we define the Hamiltonian constraint
C = p_t + H.
In order for it to have any non-trivial solutions we must extend it
in the
usual way from an operator on S(RxM) to one on the dual space S*(RxM),
the
"extended kinematical state space" and physical states |phys> must
be elements
of this space.
So now we have a system, governed purely by the Hamiltonian constraint
C|phys> = 0,
which expresses the fact that everything must be independent of the
parameter
tau that we (here only implicitly) introduced to parameterize the system
and
make time an ordinary coordinate. This can be made precise by observing
that C
does indeed generate reparametrizations because exp(tau C) sends a
kinematical
state concentrated at one instant of coordinate time t_0 to one concentrated
at
t_0 + tau with the same projection onto all physical states.
Hence the constraint C|phys> = 0 expresses the fact that physical states
are
those supposed to be gauge invariant under reparametrization gauge
transformations. In light of this it is not surprising that one finds
that the
standard scalar product on physical states, by the above definition,
is
ill-defined, since naive evaluation yields infinite values. This is
because the
standard scalar product, as can be checked explicitly, integrates over
all
reparametrization-equivalent versions of a state. So one has to try
harder. The
solution is of course to define a suitable gauge-fixed scalar product
which
avoids the integration over gauge-equivalent states. In the simple
setting
discussed here this gauge-fixed scalar product is easily guessed to
be simply
integration over M at a fixed (but arbitrary) time t.
For me, this construction clarifies the relation between the Hamiltonian
and
the time parameter in QM. The Hamiltonian H is not really the canonical
conjugate to time, but p_t is. When restricted to the physical subspace
p_t|phys> = - H|phys>, but both are different objects.
But since all the above is really an exercise in "physics made difficult"
one
might just as well continue and use heavy machinery, for instance BRST
cohomology, to deal with the issue of gauge fixing. I have once tried
to do
this, because it helped me understand gauge fixing in supersymmetric
quantum
cosmology. By extending the state space once again, this time so that
it admits
a grading, one can construct a nilpotent BRST operator Q from our Hamiltonian
constraint C and study its cohomology. Since gauge fixed physical states
are
those annihilated by the constraint (annihilated by Q), but not themselves
the
result of a gauge transformation (not in the image of Q), they are
precisely
the elements of the cohomology of Q. This cohomology can be characterized
by
constructing a "coBRST" operator Q*, which is the adjoint of Q with
respect to
some, non-physical, positive definite scalar product. It is then known
by
general arguments that the cohomology of Q is the intersection of the
kernels
of both Q and Q*. So Q*|phys> can be regarded as the gauge fixing condition.
Using Q and Q* one can define the gauge fixed scalar product by standard
methods of BRST theory and it does indeed turn out to reduce to the
naive
ordinary scalar product. I give more details on this construction in
section
2.3.3 "BRST-cohomology of an ordinary Hamiltonian" in
http://www-stud.uni-essen.de/~sb0264/sqm.html. The point of this construction
is however its generalization to systems whose Hamiltonian is a Dirac
operator,
like supersymmetric theories.
--
Urs.Schreiber@uni-essen.de
Viewing
message <jxd49.1763$Ke2.154487@bgtnsc04-news.ops.worldnet.att.net>
From: Calvin Ritchie (DonRitchie870@csWebmail.com)
Subject: Re: time in ordinary QM (was Boosts)
Newsgroups: sci.physics.research
Date: 2002-08-08 11:45:17 PST
I had written:
>.... Everyone knows that [H,t]=-i,
to which Urs Schreiber replied:
>Hm, this is not true. Not in the naive sense, at least......
I was, I thought, just re-writing the "common
quantum prescription":
H(=P^0)=id/dt,
P^j=-id/dx^j, etc.
where d means partial, in the most naive sense. Do you also object
to the
second, P^j, expression? If so, I think that I understand part of your
objection, but let's pursue it a bit.
Most of your comments are over my head. Perhaps
you can pull me up a
bit, or bring the comments down bit, and answer and/or comment on the
following questions/points:
You say:
>Comparing this situation with that of systems who have a Hamiltonian
>*constraint* instead of a Hamiltonian generator of time evolution
might suggest
>a way to resolve the issue.
I don't understand what you mean by this distinction.
I understand that
P^0 is the generator of time displacements in the same sense that P^j,
j=1,3, is a generator of space displacements.
.....you then say
>Instead of treating the time parameter different from the other coordinates
>of configuration space one could instead "parameterize" the dynamics
by
>letting "t" be one of the system's coordinates.
In SR, don't we "always" treat time the same
(except for the
metric, -1,1,1,1) as the other coordinates?
It has always bothered me that, in quantizing
the fields, we use the
"equal-time commutation relations", which does seem to place time on
a
special footing. Is that what you're getting at in the remainder of
your
comments?
>Define a Hilbert space of *kinematical states*, say that of rapidly
decreasing
>complex values functions over space *and* time. ......
...plus a lot of things that I understand even less than the above.
In QFT, the one-particle fields are always
parameterized by x^mu,
mu=0,3, by way of the terms exp(iP_mu x^mu), aren't they? Doesn't this
mean
that we're working on a 4-D manifold (with the usual spacetime metric)?
Although I've read Flanders, Schutz, and Frankel
on differential
geometry, I've not yet gotten into topology beyond the mentions in
those.
Nor have I yet worked with the concepts in Frankel enough to feel
comfortable with them. Also, I've read Ryder (why did BB guns just
come to
mind?:-) and Weinberg on BRST, and the like, and can follow their
developments, although I haven't worked enough with them to see how
this
ties into the question of time in the present context.
I'll stop there for now and try to digest some more of your comments.
Submitted afternoon 7 August 2002,
Don Ritchie
DonRitchie870@csWebmail.com
Viewing
message <aj0ftr$duu$1@rs04.hrz.uni-essen.de>
From: Urs Schreiber (Urs.Schreiber@uni-essen.de)
Subject: Re: time in ordinary QM (was Boosts)
Newsgroups: sci.physics.research
Date: 2002-08-11 13:00:49 PST
"Calvin Ritchie" <DonRitchie870@csWebmail.com> schrieb im Newsbeitrag
news:jxd49.1763$Ke2.154487@bgtnsc04-news.ops.worldnet.att.net...
[...]
> I was, I thought, just re-writing the
"common quantum prescription":
> H(=P^0)=id/dt,
> P^j=-id/dx^j, etc.
> where d means partial, in the most naive sense.
Ah, I see. I misinterpreted the situation and so my comment couldn't
make sense to you. You are right, but I think, from looking back at
your original post, where you also used "H" as the generator of
parameter evolution, that you are mixing up two different cases which
need to be distinguished, namely that where time is a coordinate, a
degree of freedom of the system, and that where it is merely the
parameter of evolution. Both cases are related, but different.
That's
partly what I was trying to address in my last post. Let's try to sort
that out:
So what's mechanics all about?
(Heh, as you see I'll start from the very beginning. But I think it
will be worth it and I'll make it as brief as possible) So just
briefly recapitulating: We have some sort of physical system which
can
have some sort of configuration. We enumerate all theoretically
possible configurations by a set of parameters, the canonical
coordinates "x^i". These parametrize what is called configuration
space M. Associated to each canonical coordinate is a canonical
momentum "p_i". Canonical coordinates and canonical momenta together
are coordinates on the phase space over configuration space (which
is
technically the cotangent bundle over config space). There is the
Poisson bracket for functions on phase space, which, among other
things, says that
{x^i, p_j} = delta^i_j .
A single point in phase space is your system in some state and
prepared to move somewhere. For it to move there must be a parameter
along which to move (1). Let's call it tau. (Don't think of that
as
related to "time" yet, it's just a parameter.) The laws of classical
mechanics tell us that given some Hamiltonian, which is a *function
on
phase space* (assume it to be independent of tau, for simplicity),
we
can produce one-parameter families of points in phase space, the
"classical trajectories", by solving Hamilton's equations:
d x^i / d tau = d H / d p_i
d p_i / d tau = - d H / d x^i .
So these solutions are phase-space valued functions of tau and hence
necessarily one-dimensional objects.
That's how ordinary mechanics works. It requires precisely one
parameter of evolution, no matter what.
Now on to quantum mechanics. We realize that the above classical
evolution is really only a first approximation to reality and that
for
some largely unknown reason (but, ok, we can derive everything from
the path integral, for instance) we have to make phase space
non-commutative and all that. In particular the above ingredients are
transformed as follows (roughly, heuristically and with no more detail
than shall be needed below):
The space of states becomes a Hilbert space h, say h = S(M), the space
of complex valued functions square integrable and rapidly decreasing
over configuration space M. Every canonical coordinate x^i becomes
the multiplication operator by x^i on h and every canonical momentum
becomes the differential operator p_j = -i d_j .
Why am I saying all that, even though you know it already? Here is one
point I think is important to make: The partial derivative with
respect to our parameter of evolution, d/d tau, is *not* an operator
on the Hilbert space h. That's because the functions that are
elements of h are functions over configuration space M and hence
depend solely on the canonical coordinates x^i. There is absolutely
no tau-dependence that could be differentiated. The elements of h are
a quantum substitute (roughly) for the classical points in phase
space. You cannot differentiate a point in phase space by tau, either,
that would be nonsense. What you CAN differentiate with respect to
tau
in classical mechanics is the entire classical trajectory, the
one-parameter family of phase-space points, parameterized by
tau. Analogously, what you CAN differentiate with respect to tau in
quantum mechanics is a one-parameter family of elements of h,
parameterized by tau. Of course, such is obtained by solving the
quantum version of Hamilton's equations, the Schroedinger equation
d_tau psi(tau) = (-i) H(x^i, -i d_j) psi(tau) .
Note well here two things:
First: For fixed tau, say tau=12, psi(tau) = psi(12) is a fixed
element of the Hilbert space h. But psi itself is rather a function
psi: R -> h
tau |-> psi(tau)
that is not an element of h, but a one-parameter family of elements
of
h, a map from the reals into h.
Second: The Hamiltonian H in the above equation is a function on the
quantum phase space. It is represented (induced by the above
representation of the canonical coordinates and momenta) on h as some
differential operator. As such it knows nothing of tau. (I have
assumed it to be tau-independent in order to avoid the complication
that H itself is a one-parameter family of operators, which would only
make things look more difficult than they are and would not affect
anything of what I am trying to say here.) In particular, the
"commutator of H with tau" is not even well defined. The commutator
instead commutes two operators that act on h; but d_tau is not an
operator acting on h.
But you are perhaps thinking all along: What is this fuss about d_tau
not being a quantum operator. Don't we have an operator p_0 = -i d_t
in relativistic quantum mechanics?
Yes we do, and everything is fine. We just have to sort out what kind
of object -i d_t really is in "relativistic" quantum mechanics. The
answer is easy, even trivial.
I have taken pain to use the name "tau" for the evolution parameter
instead of the usual "t". Names of variables don't affect the
formalism, but they may affect our thinking about the formalism. The
usual presentation of the non-relativistic Schroedinger equation in
QM
textbooks as
i d_t psi = - (1/2m) d_x^2 psi + V psi
does not make it particularly clear that d_t enters the conceptual
framework here on a rather different footing than d_x does. In
particular, it is conceptually quite different from the d_t that
enters the Klein-Gordon equation as in
-(d_t)^2 psi + (d_x)^2 psi = m^2 psi .
How can that be? The reason is that in the context of the relativistic
Klein-Gordon particle the physical time "t" is not a parameter, but
a
canonical coordinate! Hence it has a canonically conjugate momentum
p_t, which is quantized as usual to yields p_t = -i d_t <=> p^t
= + i
d_t . (So is there no further parameter, as opposed to my above claim
that mechanics requires precisely one external parameter? I'll come
to
that in a moment.)
Let's see in more detail why this is true:
Hamiltonian formalism is fine, but now we need a Lagrangian first: The
Lagrangian of the massive relativistic point propagating in Minkowski
space with metric eta_mn is
dL = - m ds .
Here m is the mass of the point and s is the proper distance along the
worldline of the point as it moves through Minkowski space (its
"world-volume").
While that's a nifty definition, as it stands it does not allow us to
calculate anything. To do that we have to measure the proper distance
somehow and to do that we need a coordinate on the point's
worldline. With hindsight, this coordinate shall be called tau. (Now
guess what...) So every point on the worldline is continuously and
monotonically indexed by a value of tau and we can calculate the
proper distance by the following well known expression
(ds)^2 = eta_mn dx^m dx^n
=>
ds = sqrt( -eta_mn d x^m / d tau d x^n / d tau ) d tau .
(Here the minus sign is because my eta is "space-like", i.e
eta = diag(-1,1,1,1).)
Comparing this Lagrangian with that of a non-relativistic point of
mass m in Euclidean space, which reads
dL = ( delta_ij d x^i / dt d x^j /d t ) dt
one sees, as advertised, that "tau" and "t" play analogous roles. One
also sees that the Minkowski coordinates x^n(tau) (n=0,1,2,3) are the
canonical coordinates of the relativistic point. In fact, the
configuration space of the point is identical to physical space
itself. (But that's a coincidence. It does not hold for systems
consisting of 2 or more points or for fields.) In particular, x^0 =
t,
the time coordinate of the point, is a *canonical coordinate* of this
system. It is not the evolution parameter. Tau is. Tau is the
arbitrary parameter that parameterizes the classical trajectories in
configuration space, which in this simple case is just the same as
the
classical trajectory in physical space. Note also that, while tau
monotonically increases along the world-line, the point may a-priori
move in any direction in configuration space, in particular it may
a
priori take non-monotonically increasing values of x^0. We have to
look at the equations of motion to see if it really does so, though,
but it is an important principal point that arises when physical time
is taken as a canonical coordinate (a degree of freedom of the system)
instead of as the evolution parameter.
To every canonical coordinate there is a canonical momentum p_n(tau)
with the Poisson bracket
{x^n, p_m} = delta^n_m .
Note how I put the indices upstairs on x^n and downstairs on
p_n. That's the way to do it and it's due to the momenta being really
elements of the cotangent space. When you shift indices by means of
the Minkowski metric you get
{x^n, p^m} = eta^nm .
For possible later generalizations to curved space it is crucial to
keep in mind that the p_n come with covariant (down-stairs) indices
and that it is a non-trivial operation to raise them by means of the
metric.
But we want to canonically quantize and so we want a Hamiltonian. To
get that we need the canonical momenta in terms of the
"tau-velocities"
v^n = d x^n / d tau
by way of
p_n = d L / d v^n .
One finds
p_n = m v_n / sqrt(-v^n v_n) .
So we happily go along and compute the Hamiltonian as
H = v^n p_n - L
= m v^n v_n / sqrt(- v^n v_n) + m sqrt(- v^n v_n)
= 0
Oops! Now that's something, the Hamiltonian identically vanishes!
I don't
know if this surprises you, perhaps you know all that, but otherwise
it may
reassure you to know that the equation
H = 0
in a somewhat more involved context, namely that where the original
action is not that of a point in spacetime but that of the quite
analogous situation of the entire universe in *its* configuration
space, has made a lot of people write considerable amounts of paper
full of erudite stuff. However, there is actually no mystery
here. This, interestingly, becomes quite clear in the quantum theory
-once we have figured out to construct this from the strange
Hamiltonian above.
The clue to progress is to have a closer look at the canonical
momenta. They were found to be
p_n = m v_n / sqrt(-v^n v_n) .
Usually the momenta can, a priori, take all kinds of values. But these
cannot. To see this take their square
p^n p_n eta^nm p_n p_m = m^2 eta_nm v^n v^n / (-v^n v_n) = -m^2 .
Hence, due to the normalization factor 1/sqrt(-v^n v_n) they carry,
the canonical momenta sort of always dynamically adjust so as to
square always to -m^2. Because of
p^n p_n = -(p_0)^2 + (p_i)^2 = - m^2
this defines a hyperbola and one says that the momenta have to be "on
mass shell", or simply "on shell". But this means that the system
we
are studying here is not free to explore its phase space but that it
is *constrained* to be on-shell and to satisfy p_n p^n = -m^2 . This
wasn't manifest in the original Lagrangian and because we ignored it
we ran into the strange situation of finding an identically vanishing
Hamiltonian.
The remedy, to make a long story short (hmm...), is to replace our
original Lagrangian with one that is classically equivalent, in that
it produces the same equations of motion, but also explicitly
incorporates the constraint. The way to do that is by introducing a
new independent canonical coordinates, N, which plays the role of a
Lagrange multiplier.
Therefore consider now the Lagrangian
L = p_n v^n - N(p_n p^n + m^2)/2 .
Varying it with respect to N gives the mass-shell constraint, by
construction, and varying with respect to x^n gives
d p / d tau = 0,
which is again the classical equation of motion. So this is indeed
equivalent to our original Lagrangian. (This is also evident when you
insert the definition of p_n into the above expression for L.) Note
that now
H = N(p_n p^n + m^2) / 2,
is not *identically* zero, even though the equations of motion demand
that it be zero.
With the Hamiltonian at hand the system may finally be canonically
quantized, but since I have already written so much I can just as well
take the time to quickly point out what all this constraint business
has to do with the original issue of tau being a parameter along the
worldline:
Namely from the above Hamiltonian one finds
d x^n / d tau = N p^n
and from this
dL = (1/2)( (1/N) (d x^n / d tau) (d x_n / d tau) - N m^2 ) d tau .
Note that N is so far a free variable, but in the combination N d tau
it looks a lot like a measure on the worldline of the
particle. Indeed, let's define a metric on the worldline, with only
one element
g_(tau tau)
and identify the volume element
dvol = sqrt(-g_(tau tau)) d tau
with N dtau, so that
N = sqrt(- g_(tau tau)) .
This way the above Lagrangian takes the form
dL = (1/2) (g^(tau tau) d_tau x^n d_tau x^m eta_nm - m^2) dvol .
This is much handier than the original square-root Lagrangian but also
more illuminating. It makes it manifest that the freedom we have in
choosing our parameterization tau of the worldline of the particle
is
that of choosing a metric on the worldline.
Ok, so much for the digression, now on to the point: We had found the
Hamiltonian
H = N/2 (p_n p^n + m^2) .
The p_n are canonical momenta, so we know how to quantize them
p_n = -i d_n = -i d / d x^n .
This gives
H = H(x^n,p_m) = N/2 (- eta^mn d_n d_m + m^2) .
Following the general prescriptions for doing canonical quantum
mechanics that I outlined at the very beginning, we have the
Schroedinger equation
i d/d tau psi = N/2 (- eta^mn d_n d_m + m^2) psi
that describes how wave functions psi, (elements of a Hilbert space
h
consisting of functions over configuration space which is space*time*
in our case) evolve along with the evolution parameter tau. But we
know that tau has no physical meaning. It is an arbitrary metric on
the worldline of the particle. If we change the parameterization of
the worldline we do not want the physics to be affected by that at
all! So is the above Schroedinger equation meaningless? No, it must
be
correct if quantum mechanics is correct. The point is, however, that
not *all* its solutions are acceptable, because our system is subject
to a constraint, the mass-shell constraint. This says classically that
p^2 + m^2 = 0, which quantum mechanically means that
(- eta^mn d_n d_m + m^2)psi = 0 .
This in turn says that the Hamiltonian has to annihilate physical states
H(x^n, -i d_n) psi = 0 .
Therefore the Schroedinger equation in terms of tau is quite alright,
it needs however be accompanied by the restriction that only the
subset of its solutions which also satisfy the above constraint are
physically realized. This gives
d/d tau psi = 0
(- eta^mn d_n d_m + m^2)psi = 0 ,
and that's good, because the first equation (the left hand side of the
Schroedinger equation) now says that every physical state must be
independent of the arbitrary metric that we put on the worldline,
while the second equation is just the Klein-Gordon equation!
Pooh! I didn't expect it would take me so long to make this point. :-)
But anyway, I hope this makes it quite clear that the p_t = -i d_t
operator that appears in the Klein-Gordon equation is a *canonical
momentum* operator which acts on a Hilbert space of functions over
configuration space, where in this case the configuration space does
include the physical time coordinate as a degree of freedom. It is
not, however, the derivative with respect to the evolution parameter
tau.
Note also that
[x^0, p_0] = [t,-i d_t] = i
[x^0,H] = iN
[x^i, H] = -iN
and, if you wish
[tau, H] = 0
but the last equation does not really make sense, as I discussed above.
The situation is superficially different when you look at
non-relativistic quantum mechanics in the usual formulation, because
then what is written as d/d t is what I have written as d/ d tau and
one must not confuse the d_t derivative in non-relativistic QM with
the d_t derivative as in the above treatment of the Klein-Gordon
equation.
The whole point of my previous post was actually to demonstrate (not
that this is news to the experts) that the kind of "parameterized
dynamics" that is encountered in the classical and quantum mechanics
of the *relativistic* point particle is *not unique to relativity*,
but that any boring-old non-relativistic system may be parameterized
by introducing a coordinate on the system's world-line and making
physical time one of the canonical coordinates.
> In SR, don't we "always" treat time
the same (except for the
> metric, -1,1,1,1) as the other coordinates?
> It has always bothered me that, in
quantizing the fields, we use the
> "equal-time commutation relations", which does seem to place time
on a
> special footing. Is that what you're getting at in the remainder
of your
> comments?
I really wanted to address this issue, but now I am too tired. Maybe
another time. But you can also figure out the answer yourself: The
essential idea is that an ordinary field-theory Lagrangian on a
*fixed* spacetime, say on Minkowski space, gives you a
*non-relativistic*-type of Schroedinger equation for the fields, with
no constraints and no arbitrary parameterization. Field theory is
really no different from quantum mechanics in this respect. You can
consider the quantum mechanics of a single non-relativistic particle
as a field theory on the particle's wordline (with fixed
parameterization), which has "coordinate field" propagating on
it. This is one way to see why you get "equal-time commutation
relations" in field theory, just like you also have equal-time
relations in quantum mechanics. To do away with this you again have
to
parameterize the field's dynamics and ensure that everything is
independent of the parameter. Since a field is to spacetime as a
particle's coordinates are to its wordline, this means that you have
to introduce a *variable metric on spacetime*, just as I did above
for
the particle-worldline. If you then vary the metric on spacetime you
get a Hamiltonian constraint which is then a Klein-Gordon type of
equation. If you do that and also allow for a term sqrt(g)R in the
action of the fields on spacetime, to make the metric really
dynamical, you get general relativity coupled to your fields. One can
do essentially the same construction as for the free relativistic
point particle for this huge system, called the ADM approach. One can
write down the quantum Hamiltonian of it, which will look like a
Klein-Gordon equation. It is not well defined due to
non-renormalizability and all other kinds of problems, but you can
certainly write it down just to see how everything follows the same
pattern.
[...]
> I'll stop there for now and try to digest some more of your comments.
Forget the comments from my first post until you have read the
comments in THIS post! :-)
(1) There are formulations of mechanics which do not involve precisely
one parameter of evolution. For instance you can find recent papers
by
C. Rovelli on the arXive where he is developing field theory with more
than one parameter. But that's not the canonical way to do it.
Viewing
message <ajhbob$f0i$1@rs04.hrz.uni-essen.de>
From: Urs Schreiber (Urs.Schreiber@uni-essen.de)
Subject: Re: time in ordinary QM (was Boosts)
Newsgroups: sci.physics.research
Date: 2002-08-19 19:49:31 PST
We had been talking about the free relativistic point particle, which
is an
example of a parameterized system, and how its quantization gives rise
to the
Klein-Gordon equation. We had mentioned some aspects of field theory
but have
not yet gone into that. In particular, Calvin Ritchie was concerned
about the
appearance of "equal-time commutation" relations in relativistic physics:
"Calvin Ritchie" <DonRitchie870@csWebmail.com> schrieb im Newsbeitrag
news:gYz59.7688$Ke2.648503@bgtnsc04-news.ops.worldnet.att.net...
[...]
> I think I see where you're going with
the "equal-time commutation", and
> will try to work it out for myself. If I have trouble, I'll yell.
If you
> want to expand on it in the meantime, I'll guarantee you at least
one very
> interested reader.
I have the weird plan to explain this using brane physics.
The answer to "Why equal time commutation relations in relativistic
field
theory?" is roughly "Because we have deparameterized the field theory
by
identifying physical time with the parameter of evolution. When we
reintroduce
an independent evolution parameter, then we instead get 'equal parameter
commutation relations', in much the same way as we have 'equal parameter
commutation relations' for the relativistic point particle." Ultimately
this
amounts to discussing the Hamiltonian (ADM) formulation of general
relativity
coupled to some fields. I don't want to do that right now. But I do
want to
make some progress in that direction by building on our expertise with
the
Hamiltonian formulation of the free relativistic point - which is just
the free
0-brane.
So what I want to do is take the action functionals from my last post
in this
thread and straightforwardly generalize them by enlarging the dimension
d of
the world-volume of the object they describe. (Recall that for the
point
particle this world-volume, namely its world-*line* was (d=1)-dimensional.)
The
resulting action is that of a free relativistic p-brane, where p=d-1.
I'll make
an ADM split of the metric on the brane and obtain the ADM-Hamiltonian.
The point of all this is that
1) It's fun.
2) It's easy.
3) It gives further insight into the relation between physical
time and
unphysical parameters.
4) It is a direct preparation for the Hamiltonian formulation
of general
relativity.
I should admit that I am only beginning to learn about branes myself
right now.
So I cannot completely rule out that I am subject to some beginner's
misconception. While I am rather convinced that I have the (easy) calculations
right, it may well be that I fail to properly use some terminology
or misjudge
the true meaning of some formulas. With some luck there'll be an expert
willing
to chime in and fix such things.
After this lengthy introduction I am finally ready to begin:
First we need to adjust our brains to branes. I mean, we need to get
straight
what we are trying to describe and how that is to be forged into formulas.
What were the ingredients for the description of the relativistic point?
First,
there was the point's world-line, a (d=1)-dimensional manifold (a line)
which
was parameterized by a real number tau. We saw that one is more or
less forced
to consider this manifold to be equipped with a metric. In one dimension
a
metric is a pretty trivial thing, it is a tensor with only a single
entry
which, I believe, I had called g_(tau tau). This is too clumsy a notation.
We
will soon think of tau as the 0th coordinate on the world-volume, so
that I now
write
g_(tau tau) = g_00 .
This metric was related to a parameter N by
g_00 = - N^2 .
So this was one ingredient: the wordline equipped with a metric. This
shall be
called the *base space* B from now on. The name is due to the fact
that this
space is the base for the second ingredient, namely the coordinate
fields.
The worldline is not just sitting there, but we imagine it to be embedded
into
physical spacetime. So every point of the wordline has D spacetime
coordinates
X^mu associated with it, where D is the number of spacetime dimensions.
Hence
these X^mu are functions from the worldline B=R (the reals) into physical
spacetime T
X^mu : B -> T .
tau |-> X^mu(tau)
The "fields" X^mu are called the *embedding fields* because they embed
the
worldline into spacetime. Maybe this looks like a particularly convoluted
way
of thinking about an ordinary trajectory in spacetime, but it turns
out to be
very useful.
Finally, the third ingredient, which has already made its appearance,
is
spacetime T itself. However, erudition forces us to invent a name that
better
captures the true nature of the concept involved here, so T shall be
called the
*target space* from now on, since it is the space in which the embedding
fields
take values. The important thing is that the target space comes equipped
with a
metric tensor, too. Since I am using capital letters for target space
objects,
this metric shall be called G_mu nu . It is interesting to consider
metrics
G_mu nu of any sort, but before doing that we should have mastered
the easiest
case, which is G_mu nu = eta_mu nu, the flat Minkowski metric. Hence
this is
what I assume from now on. Also, we will be mostly concerned with properties
of
the base space and then all these target space indices will be a nuisance
since
we will mainly have to deal with base space indices. Therefore I decree
the
convenient shorthand
X^mu Y^nu G_mu nu = X .Y
for any two objects X, Y which carry target space indices.
One might think, looking around oneself, that the dimension of target
space, D,
should be equal to 4. But actually most people that you'll encounter
who know
about branes will insist that D=10 or D=11. For our present purposes,
however,
the exact value of D is completely irrelevant and I'll leave it unspecified.
In
fact, the whole machinery that I am describing here can also be applied
to the
case where the base space coincides with physical spacetime and where
the
target space is some abstract space which may well have any dimension.
This is
in fact the case for ordinary field theory, which, however, I do not
consider
here.
Looking back over the last paragraphs, what has been accomplished? We
have
realized that the theory of the relativistic point that we had been
considering
is conceptually the theory of a set of embedding fields that map some
base
space into some target space:
|
. . .. .. ... /... . . .
| X^mu . . .
. . . . . .|. . . . .
| ---------> target space
/
|
. . .. .. . .. ./. . ... . .
|
. . . . . . . . |. . . . .
base
space
Here the irregular line on the right is supposed to indicate the image
of the
base space under the embedding induced by the embedding fields X^mu.
For the
point particle this is just the trajectory in spacetime. This way we
are
distinguishing between the world-line of the particle in its abstract,
platonic
form, being just a line with a metric g_mn (the base space on the left)
and its
physical realization as a trajectory in spacetime, i.e. in target space
with
metric G_mu nu.
It is now extremely easy to generalize this setup to higher dimensional
base
spaces, which is the motivation for why I am going through all this.
You can
guess what one has to do, but I'll briefly spell it out:
Let base space B be a d dimensional manifold for any d>0. Assume that
this
manifold has the topological structure
B = R x Sigma ,
where R indicates the real axis and Sigma is some compact (p=d-1)-dimensional
manifold. We think of Sigma as our p-brane and of R x Sigma as the
world-volume
of this p-brane as it evolves along a parameter varying on the R factor.
So in
the case of the point particle one has Sigma = {.}, the set containing
a single
0-dimensional point, and B = R x {.} = R is the point's worldline.
If we set
Sigma = S^1, the circle, then B = R x S^1 is a cylinder. This is the
world-*sheet* of a closed 1-brane, a closed string. If we set, for
instance,
Sigma = T^2, the torus, then B = R x T^2 is the world-volume of a closed
2-brane, or membrane, of genus 1. And so on.
For simplicity I'll assume that base space B can completely be
parameterized
by a single coordinate patch with coordinates x^m, m = 0,1,..p . The
metric
tensor with respect to this coordinate patch is g_mn. (For the point
particle
we had p=0, x^0 = tau, and g_00 = -N^2 .)
For a general p-brane the coordinate fields are functions of the d base
space
coordinates x^m
X^mu = X^mu(x^m)
and their values determine how the p-brane moves through target space,
i.e.
through spacetime.
One might wonder whether the two metrics, g_mn and G_mu nu need to be
related
in any way. The answer is yes, which is a result of the physical dynamics
to
which I now, finally, come:
So far we have set up the (classical) kinematics of a p-brane, i.e.
we have
specified which degrees of freedom there are. Now we need to impose
the
dynamics by defining an action S. The action needs to be a function
that gives
a real number for every field configuration on the brane. The fields
are the
coordinate embedding fields and hence the action functional needs to
assign a
real number to every embedding of the brane history B into target
space/spacetime T. If you think about it, there are not so many natural
choices
for such a function. For the point particle we used
S = -m Vol_0 ,
where Vol_0 means the proper volume of the point's worldline, i.e. its
proper
length, as measured with the spacetime metric.
Vol_0 = integral over worldline of sqrt(- d_0 X . d_0 X ) dx^0.
Note that in this post I am using "d_n" for the partial derivative with
respect
to the n-th coordinate x^n on the base space. With my above convention
concerning target space index contractions one has
d_0 X . d_0 X = d_tau X^mu d_tau X^mu G_mu nu = ds^2 / d tau^2 .
The natural idea is to straightforwardly generalize this action principle
to
arbitrary p-branes, which have world-volume Vol_p, by setting the general
action proportional to the brane's world-volume:
S = - T Vol_p .
One conventionally replaces the constant factor m by the constant factor
T,
which is then called the brane "tension". For p=0 one has T=m. T is
just some
dimensionful constant that sets the scale of everything. It indicates
that we
are doing physics and not simply mathematics, but it is not essential
for the
following manipulations.
How does one measure the world-volume for p>0? The rule is: Take a tiny
speck
of the brane's history in spacetime (say of the worldsheet cylinder
for p=1)
with edge lengths dx^m for m=0..p . If the speck were flat, then its
volume
would simply be the product dx^0 dx^1 ... dx^p. Since it is not flat
in general
we have to account for the deformation of this volume element which
one can
show amounts to multiplying by a factor of sqrt(-h), where h is the
determinant
of the *induced metric* h_mn on the brane's spacetime history. What
is the
induced metric? It is just the metric which gives the length of a tiny
line
element of the brane history as measured *with the target space metric*:
h_nm = (d_n X) . (d_m X)
This way the target space induces a metric on the embedded manifold
(technically this is called the "pullback" of G_mu nu to B).
Hence an infinitesimal element of the brane's worldvolume has the proper size
dvol = sqrt(-h) dx^0 dx^1 ... dx^p .
and therefore the above action of the free p-brane reads
S = -T integral over B of sqrt(-h)
Conceptually there is nothing more natural than this action S. Yet,
as for the
point particle, it may be a little awkward to actually work with. One
might
rather have an action functional that involves quadratic expressions
in the
field's derivatives.
I'll call the above action the Nambu-Goto action. As far as I can see
this term
is usually reserved for the case where p=1, but here I want to use
it for all
p>0.
If you recall how we modified the point-particle's action from the Nambu-Goto
form to the form which involved the Lagrange multiplier N, then it
is
relatively easy to see what has to be done for general p to achieve
something
analogous:
Note that the Lagrange multiplier, N, turned out to be related to the
metric
g_00 on the world-line. So the idea might be to use the metric g_mn
in general
as some sort of Lagrange multiplier. Let's try this:
S' = -T/2 integral over B of sqrt(-g) ( g^mn h_mn + Lambda ) .
Note that now there are, as before, the embedding fields (hidden in
h_mn),
which are the physical degrees of freedom. But in addition there are
now also
the components of the base space metric entering the action, roughly
analogous
to Lagrange multipliers, and I have introduced a *constant* term
Lambda. Since
this term is multiplied by sqrt(-g) it is, technically, a so-called
*cosmological constant* on the brane. If you haven't encountered this
before
just accept the terminology, it is not supposed to mean anything deep.
But why do I introduce this term? Because it is needed to make the two
action
functional S and S' classically equivalent, and that's what I am aiming
to
achieve:
To see this we need to look at the equations of motion for g_mn that
derive
from S'. To find them one needs to know the variation of the determinant
of the
metric with respect to g^nm . This turns out to be
delta sqrt(-h) = -(1/2) sqrt(-g) g_nm delta g^nm .
Using this relation we find
0 = delta S' / delta g^mn
= sqrt(-g)( h_mn - (1/2)g_mn( g^mn h_mn + Lambda )
) .
Staring at this equation for a while one realizes that we need
Lambda = 2 - d = 1 - p
and that then the above equation of motion implies
g_mn = h_mn,
i.e. that the metric on the base space is just the metric induced by
the target
space metric.
I'll just show that this does work: For g_mn = h_mn we have
g^mn h_mn = d
and the above equation of motion is actually solved with Lambda as given
above.
But when g_mn = h_mn then the primed action becomes
S' = -T/2 integral of sqrt(-g)(g^mn h_mn
- Lambda)
= -T integral sqrt(-g)
= -T integral sqrt(-h)
= S,
and then both action functionals are classically equivalent. (Look at
this for
p=0 to see that it reproduces the analogous derivation for the point
particle.)
Hence the action that I will be concerned with in the following is
S' = -T/2 integral over B of sqrt(-g)( g^mn h_mn - d + 2 ),
and I'll call this the Polyakov action here, even though, again, this
term
seems to be reserved for the case p=1.
When you do this on a sheet of paper you should want to replace h_mn
by its
definition in terms of the fields
h_mn = d_m X . d_n X
to see that this action has a very familiar look. It just describes
a set of
ordinary scalar fields propagating on a curved manifold with metric
g_mn. And
this is why I can claim that this brane-business can bring us closer
to
understanding general relativity coupled to some fields.
There is possibly more truth to that I can make plausible in the present
context. A currently active research field is so-called "brane-cosmology",
which assumes that the 3-dimensional space that we seem to inhabit
*is* a
3-brane which propagates in some higher dimensional spacetime.
Ok. We are essentially doing field theory on the brane now. Our next
task is to
find the Hamiltonian associated with the Polyakov action for general
p. The
Lagrangian density that we are dealing with is
L = -T/2 sqrt(-g)(g^mn h_mn -d + 2) .
We just have to apply the usual prescription which says that one first
must
calculate the canonical momenta
P = delta L / delta d_0 X
= -T sqrt(-g) g^0n d_n X
(I'll leave the target space index of X^mu and P_mu implicit) and then
compute
the Hamiltonian density as
H = P d_0 X - L
and express all occurrences of d_0 X by the expressions in P. When you
write
that down you find a sum of several terms where various components
of g_mn
appear all over the place. It is not very illuminating. But one can
exhibit the
elegance of this Hamiltonian by writing the metric tensor g_mn in a
particular
way. This is called the ADM split:
The idea is simple: Since base space factors as B = R x Sigma it is
natural to
split the line element on B into parts that come from translations
on Sigma and
parts that come from translations on R. One therefore writes
ds^2 = -N^2 (dx^0)^2 + (N^i dt + dx^i )(N^j dt + dx^j) g_ij .
Looking at this expression you see that N, the same N as we had before
for the
point particle, which is called the "lapse function", measures how
much "proper
time" elapses as you move forward in "parameter time" (along the R
factor),
while the N^i, which constitute what is called the "shift vector",
measure how
much the "spatial coordinates" shift below your feet as you move along
R.
Finally g_ij is the metric purely on the spatial factor Sigma. There
is a
subtlety here when talking about space and time. Actually, only when
we are
on-shell, i.e when the equations of motions hold so that g_mn = h_mn
does this
make proper sense in base and target space.
When the metric in the above form is inserted into the Hamiltonian the
latter
simplifies somewhat. In particular it turns out that N and N^i can
be factored
out from all terms, so that they take their places as Lagrange multipliers,
as
expected. In ordinary field theory this would give the Hamiltonian
in its final
form. However, in the present case the Hamiltonian can be made to look
still a
little more elegant by replacing
N -> N / sqrt(k) .
where I have defined k to be the determinant of the spatial metric
k = det(g_ij).
After doing this there are no square roots of k left in the Hamiltonian.
If we
again use the equations of motion to set
d_i X . d_j X = g_ij
then finally the Hamiltonian takes the pretty form
H = N (1/2)( P^2 / T + T k ) + N^i (P . d_i X) .
This is the central equation of this post. Recall the various abbreviations
that enter it
P^2 = P.P = P^mu P^nu eta_mu nu
P . d_i X = P^mu d_i X^nu eta_mu nu
k = det(g_ij)
To better understand this Hamiltonian I go through the first few values of p:
p=0: This is the relativistic point that we already know. There is no
shift
vector on a point, N^i = 0 and no metric, k = 1. Also, as mentioned
above, the
"0-brane tension" is just the point particle's mass. Thus we have
H = N/2 (P^2/m + m),
as expected. Quantizing this gives the Klein-Gordon equation, as has
been
discussed.
p=1: This is the relativistic string. Now new effects appear. The first
thing
to note is that now we are really talking field theory on the brane.
All fields
in the Hamiltonian depend on the evolution parameter x^0 = tau that
is also
present for the point particle, but furthermore they depend on the
spatial
parameter x^1along Sigma = S^1, i.e. along the string. Hence by varying
N(x^0,x^1) at every point x^1 we get an infinite number of Hamiltonian
constraints
( P(x^1)^2 / T + T k(x^1) ) = P^2 /T + T (X')^2,
(where X' = d_1 X is the spatial derivative of the fields), one at every
point
of the string. This can, roughly, be interpreted as the dynamics of
an infinite
collection of relativistic points as in the p=0 case, however now with
a mutual
interaction which holds all these points together to form the string.
If we
forget that spacetime here is target space and instead look at base
space as
our "spacetime", then this is the first case that allows to address
the issue
of "equal time commutation relations" in quantum field theory. Here,
obviously,
we have "equal tau commutation relations". But tau, the temporal coordinate
on
the string's worldsheet, is just an arbitrary parameter. As for the
relativistic point, no quantum state may depend on tau. Hence it is
completely
unessential that commutations relations are fixed to some tau value.
More
precisely, take any operator A and evolve it along tau with the quantum
version
of H_0 = P^2 /T + T (X')^2
A(tau) = exp(i tau H_0) A(0) exp(-i tau H_0) .
Since all physical states are annihilated by H_0 the expectation value
of
A(tau) is the same for every tau
<A(tau)> = <phys|A(tau)|phys>
= <phys|exp(i tau H_0) A(0) exp(-i tau H_0)|phys> = <phys|A(0)|phys>
.
The fact that I used only H_0 in this example shows that another thing
is new
now that the spatial hyperslice Sigma has a non-vanishing extension:
There is a
further constraint. Namely by varying N^1 in the full Hamiltonian we
find that
P . d_1 X
also has to vanish at every point of the string, or, if you wish, of
our
cylinder spacetime. It can be shown that this expression generates
reparameterization not along tau = x^0 but along x^1. These transformation
are
called diffeomorphisms. Looking back at the full Hamiltonian makes
it clear
that there is one diffeomorphism constraint (at every point and) for
every
spatial parameter x^i. This pattern remains true for every reparameterization
invariant field theory, in particular for general relativity: In Hamiltonian
formulation the dynamics is completely determined by the temporal
reparameterization and the spatial diffeomorphism constraints.
p = 2: This is the relativistic membrane. Conceptually, there are no
new
phenomena here. We again have the temporal reparameterization and the
spatial
diffeomorphism constraints, now two of them. As soon as we know to
handle one
spatial dimension we can handle more of them, in principle, so that
poses no
new problems. But the membrane is far from boring, and that's still
an
euphemism.
When dealing with these extended objects one is soon and often reminded
of the
fact that geometry in 2 dimensions is somehow very special. That's
why the
string is special, its wordsheet is 2-dimensional. And the membrane
is special
because its spatial extension is 2-dimensional. This gives rise to
the
following intriguing facts:
In two dimensions there is a particular way to rewrite the determinant
of the
induced spatial metric as the product of two "symplectic brackets"
{. , .}
k = {X^mu,X^nu}{X_mu, X_nu}/2
where
{X^mu, X^nu} = epsilon^ij d_i X^mu d_j X^nu .
These are called Nambu brackets. With this identical reformulation the
membrane's Hamiltonian now reads
P^2 / T + {X^mu,X^nu}{X_mu, X_nu}/2 .
One can invent similar brackets as above for any dimension and rewrite
the
general p-brane Hamiltonian in this form. But p=2 is special because
only here
can {X^mu,X^nu}, which is an operation between continuous coordinate
fields, be
approximated, in some suitable sense, by the commutator of ordinary
*matrices*.
It can be shown that by replacing the coordinate embedding fields X^mu
in the
entire theory by large square n x n matrices one obtains some sort
of
regularization of the original theory, i.e. a theory that approximates
the
originally infinite number of degrees of freedom (D coordinate fields
at every
point on Sigma) with a large but finite number of degrees of freedom
(the
components of D matrices). This reduces all membrane theory to quantum
mechanics and this is essentially what is known as "Matrix Theory".
Now,
"Matrix Theory" sounds innocent enough, but there are serious suggestions
that
something very deep is hidden in this theory, somehow, namely nothing
less than
M-theory itself, the hypothetical non-pertubative version of string
theory.
I find it fascinating how the simple idea of generalizing the natural
point-particle action S = -m Vol to higher dimensional objects by the
most
natural and compelling generalization of Vol to arbitrary proper volumes
gives
rise to a hierarchy of theories of p-branes that somehow seems to be
the
backbone of all of string theory. You start at p=0, you take the very
first
exit and find ordinary relativistic quantum mechanics. You take the
next exit
and find that to use the quantum lane you are not free to move much
but have to
follow the route prescribed by a number of consistency conditions which
lead
you right to a unique universe of new concepts. If you instead take
the exit at
p=2 you find something that first looks totally different, but after
cruising
for a while you find yourself in precisely the same region that the
exit at p=1
led to. And all that simply by thinking S = -m Vol to the end.
P.S.
I'll be on vacation for the next three weeks, with probably only very
limited
internet access. So don't expect any responses from me during that
time.
--
Urs.Schreiber@uni-essen.de