Working definition of Theory: It is a mechanism to select some fields. Given a space and the set , for a certain bundle , whose sections encompass all field degrees of freedom (metrics, matter fields, gauge connections, etc.), we aim to select a subset by means of any criteria (to be determined). These will be the fields of our theory.
The criteria is usually a variational principle.
Examples
Example 1. A zero-dimensional theory: space is a point.
Space: Let , a single-point manifold (0-dimensional spacetime).
Fields: Sections of a trivial bundle
which simply means: an -valued field at the point . The space of sections is:
Selection Mechanism (Theory): A variational principle defined by a function (note: integration becomes evaluation since is just a point):
where represents a field configuration, , and is a fixed parameter (think of an external source term).
Selected Fields (The Theory’s Solutions):
That is, the theory picks out a unique field configuration: the point in .
Example 2. A toy theory selecting fields supported on the line .
Space: Let be a 2-dimensional manifold, with standard coordinates .
Fields: Sections of the trivial bundle
i.e., scalar fields over .
Total Field Space:
Selection Mechanism (Theory): A variational principle with action
where . We are not being very rigorous at this point.
Selected Fields:
That is, the theory only selects one field, , which is entirely supported on the vertical line , taking value 1 there and vanishing elsewhere.
Example 4. Fixed backgrounds.
Some times, theories are formulated on a spacetime with a prescribed structure, mostly a preferred field (a metric, a connection). These cases can be framed within our definition, by simply adding a term to our relevant bundle and the corresponding one to the criteria (something similar to criteria in Example 2).
Passive transformations
I prefer to call them relabelings. There are two types: relabeling the spacetime , the coordinate changes, or relabeling the target space of the fields, i.e., frame changes also known as gauge transformations. Of course, these relabelings change the description of the criteria , and this was a reason for controversy when coordinates were not distinguished of the objects themselves. This is related to Kretschmann objection: any theory can be converted into general passive covariant if we introduce enough mathematical objects.
Example 1 revisited
In the case of Example 1, since the base space is a single point, there is no coordinate system on to transform. The only possible relabeling is in the target space, i.e., in the fibers of the bundle , which is trivial here. These are the analogues of gauge transformations.
Let’s analyze a change of frame in the target space. For instance, let’s consider a linear change of basis given by an invertible matrix . Suppose we use a new frame , and write the field in this frame: if the field in the original frame is described by the vector , then in the new frame it is described by
Then, the variational principle expressed in terms of becomes:
That is,
The minimizer in this frame is given by
which corresponds, via , to
as expected. So again, the selected field is the same, merely described in new coordinates.
This illustrates the principle behind passive gauge transformations: the theory and its solutions remain unchanged, only their description is transformed.
By the way, this could have been done with any .
Example 2 revisited
Let's analyze a coordinate transformation in Example 2. If we take other coordinates , related to the others by the (passive) transformation
with , our criteria takes the form, in the new coordinates,
The solution to this variational problem is
which is nothing but the same distinguished in the Example 2, but expressed in the coordinate .
Importantly, even if is the rotation, we are not rotating anything, we are only changing the labeling.
Similarly, since the bundle in Example 2 is a -bundle, we can consider a different trivialization. We can consider in each fibre , , the basis , instead of 1 (this corresponds to a gauge transformation in the corresponding principal bundle). For instance, suppose . Given a field , described in -coordinates by , with the new moving frame it will be described by . So the new description for the criteria takes the form:
Obviously, the solution for this functional is
which is the transformed version of the description of the distinguished in Example 2.
Remark. In the passive (coordinate‑change) picture on a natural bundle, a single spacetime diffeomorphism carries out two simultaneous relabelings: it reassigns each point’s coordinates on the base manifold, and—via the Jacobian of —it reassigns the local frame (fiber basis) in exactly the way a gauge frame change would. Only in this passive viewpoint, and only for bundles that are naturally tied to the base (tangent, tensor, spinor, etc.), does one diffeomorphism deliver both a base‑point relabeling and an internal (frame) relabeling.
Active transformations
First, at the level of points, it is true that given a general transformation (diffeomorphism) , it can be seen like a passive transformation and viceversa. Keep an eye: when we consider manifolds endowed with a structure, this is no longer true. It is explained here.
The same happens to gauge transformations: they can be interpreted as a change in the description but also as an active transformation of the field itself.
Example 1 revisited
Let’s now interpret the same change as an active transformation: a transformation of the field itself, rather than its description. That is, we define a new field
and construct a new variational principle:
Explicitly,
The new minimizer is then
which is clearly different from the original minimizer . That is, under the active transformation , the set of selected fields changes.
This shows that active covariance is not automatic — the theory is not invariant under arbitrary active transformations of the target space unless these transformations belong to a special subgroup preserving . That subgroup defines the symmetry group of the theory.
In this particular example, consider orthogonal transformations such that . In that case
which is of the same form as . Therefore, the true gauge symmetry group of the theory is the stabilizer of inside , i.e., those transformations preserving the inner product . These are the transformations under which the variational principle and the selected field remain unchanged.
Example 2 revisited
Let's start with the analogous to coordinate changes. Following along with our previous example, we can consider a diffeomorphism induced by the previous change of coordinates:
Then, induces a new (possibly different) variational principle :
which in this case is
Now we have a different selected set of fields
In the original theory we were singling out the line and now we are pointing its rotated version .
So active covariance is not trivial, it only takes place when we restrict to a particular group of transformations called symmetries of the theory. In Example 2, I think that it can be checked that any transformation of the form
is such that , so they are symmetries of the theory.
In Example 3, general relativity, any diffeomorphism is a symmetry for the theory. This is linked to the fact that the relevant bundle is a natural bundle.
Keep an eye: the symmetries of the theory do not have to leave each element invariant, they can permute the elements of . In this sense, the general covariance of general relativity is the statement that any diffeomorphism of spacetime is a permutation of the set of metrics and matter fields.
Something similar happens to gauge transformations in Example 2. We can consider that is not a change in the basis for the fibers but a true transformation on the fibers. If we give that use to the gauge transformation above, we obtain indeed a different variational principle, which distinguish a different preferred field. The group of gauge transformations that leaves invariant is called the gauge symmetry group of the theory or gauge symmetries of the theory. Keep an eye: a gauge symmetry does not fix every , but leaves invariant.
The gauge symmetry group for this theory can be explicitly computed. It consists of the group of active transformations where the function is restricted to be exactly on the line , and can take the values everywhere else:
This is seen by imposing the symmetry condition for all fields . The term in the action weighted by immediately forces for all . Simultaneously, the Dirac delta term, which constrains the fields on the line , requires that on that line. For this to hold for arbitrary field values , we must have when .
Final reflection
It could look like if the symmetries (normal or gauge) of the theory are determined by the selected set , which in turn is determined by an action (Lagrangian) codified in . I am not sure, but I think that the process is the other way around: we take a criteria to determine (Einstein-Hilbert action, or whatever) that respects the symmetries we observe in experiments.
This "gauge principle" is one of the most powerful ideas in theoretical physics, underlying both general relativity and the Standard Model of particle physics.