Second-order theory as a square: what you gain, what you lose
- Definitions / Notation used
- Square-root logic
- One technical lemma (structure of the second-order equation)
- What you gain by squaring
- What you lose by squaring
- Assumptions vs consequences
- Why this matters
- Key takeaway
- Technical takeaway
First-order equations are incisive: they define a tight solution space and often come with good deformation theory. They are also unforgiving: small changes in modeling choices (boundary conditions, gauge-fixing, numerical discretization) can push you off the solution set in a way that is hard to control. Squaring the first-order object (\Upsilon_\omega) is the standard move that trades “sharp constraints” for “energy-like control.”
Definitions / Notation used
- (\Upsilon_\omega := \bullet_\varepsilon(F_B) − \kappa_1 T) is the first-order field object on (Y) ((\mathrm{ad}(P_H))-valued).
- Define the squared action: (I_2(\omega) := \int_Y ⟨\Upsilon_\omega, *_Y \Upsilon_\omega⟩).
Square-root logic
There are two logically distinct statements:
1) If (\Upsilon_\omega = 0), then (\omega) is a stationary point of (I_2).
This is immediate: the integrand is quadratic in (\Upsilon_\omega), so if (\Upsilon_\omega) vanishes pointwise, any first variation of (I_2) vanishes.
2) If (\omega) is a stationary point of (I_2), then (\Upsilon_\omega = 0).
This is false in general. Stationary points of a square include (\Upsilon_\omega = 0) solutions, but can also include configurations where (\Upsilon_\omega) is nonzero yet satisfies the second-order Euler–Lagrange equation (a covariant “divergence-free” condition). This is the precise sense in which “first-order implies second-order,” but not conversely.
So the square-root slogan here is not mystical: it is a strict inclusion of solution sets: ({\Upsilon_\omega = 0} ⊂ {EL(I_2) = 0}).
One technical lemma (structure of the second-order equation)
Lemma (Second-order Euler–Lagrange has the form “adjoint derivative of (\Upsilon)”).
Under variations of (\omega) that respect the boundary conditions, the first variation of (I_2) can be written schematically as
$$ \delta I_2(\omega) = 2 \int_Y \langle \delta \omega, 𝓛_\omega^†(\Upsilon_\omega)\rangle, $$
so the Euler–Lagrange equation for (I_2) is
$$ 𝓛_\omega^†(\Upsilon_\omega) = 0, $$
where (𝓛_\omega) is the linearization of the map (\omega ↦ \Upsilon_\omega) (hence depends on (A_0), (\varepsilon), the Shiab operator, and the (\sigma)-split metric through (*_Y)), and (𝓛_\omega^†) is its formal adjoint with respect to the (⟨·,·⟩/*_Y) pairing.
Proof sketch.
(I_2 = \int ⟨\Upsilon, *_Y \Upsilon⟩). Varying gives (\delta I_2 = 2 \int ⟨\delta \Upsilon, *_Y \Upsilon⟩). But (\delta \Upsilon = 𝓛_\omega (\delta\omega)) by definition of the linearization. Move (𝓛_\omega) off (\delta\omega) by adjunction to obtain the displayed form (plus boundary terms that vanish under the assumed support/decay or imposed boundary conditions). No componentwise “Ricci tracing” occurs: everything is packaged in the covariant linearization of (\Upsilon).
What you gain by squaring
1) A functional that is naturally suited to numerics.
Even in split signature, (I_2) is the canonical “least-squares” objective: it measures failure to satisfy the first-order equation. This is exactly the structure you want if you are doing continuation methods, Newton–Krylov solvers, or constrained minimization.
2) A direct bridge to EFT thinking.
Expanding (I_2) around a background solution (\omega_0) gives a quadratic form governed by the linearized operator (𝓛_{\omega_0}). That is the entry point to propagators, effective operators, and mode suppression.
3) A cleaner path to quantization heuristics (without claiming success yet).
Path integrals over (\omega) weighted by (exp(−I_2)) (or its Lorentzian analogue) are the standard story. In a split-signature ambient space, the real work is to identify the correct involution/projection that yields a well-behaved quadratic form on the propagating sector. Squaring is necessary, not sufficient, but it is the move that makes the question well-posed.
What you lose by squaring
1) You enlarge the solution space.
The first-order equation (\Upsilon_\omega=0) is a strong geometric constraint. The second-order equation (𝓛_\omega^†(\Upsilon_\omega)=0) allows “harmonic” (\Upsilon_\omega) configurations: nonzero, but divergence-free in the appropriate covariant sense. Whether those extra branches are physically relevant, gauge artifacts, or pathological depends on boundary conditions and the sector ((E)-block vs everything). You do not get to ignore this.
2) You obscure the geometric meaning.
(\Upsilon_\omega=0) is a direct balance law: Shiab-contracted curvature equals (\kappa_1) times torsion. The second-order equation reads like “a differential operator applied to that balance law vanishes.” That is less interpretable. It is not worse; it is just further from the conceptual anchor.
3) You inherit the ambient signature problem in a sharper form.
On a ((7,7)) manifold, inner products are not automatically positive. If you want (I_2) to behave like an energy, you must specify the pairing/involution that selects the physical sector (and, in our instantiation, you will do that through the gravitational block (E) and the pullback-visible modes). Until that is spelled out, any positivity language is propaganda. Here we keep it neutral: (I_2) is the natural square; its analytic character depends on the sector.
Assumptions vs consequences
Assumptions:
- Same geometric and gauge setup as before ((\mathrm{Spin}(7,7)), (\sigma)-split, (A_0) fixed, (•_ε) fixed via (E/\Theta_E), (D\Theta_E=0)).
- Boundary conditions that kill integration-by-parts terms (compact (Y), or decay, or explicit boundary term choices).
- A specified adjoint/inner product structure for defining (𝓛_\omega^†) (this is where split signature matters).
Consequences:
• First-order solutions (\Upsilon_\omega=0) are automatically solutions of the second-order EL equation. • Second-order solutions include (possibly many) additional branches with (\Upsilon_ω \neq 0) but (𝓛_\omega^†(\Upsilon_\omega)=0). • The squared action is the right object for perturbation theory and numerical “residual minimization,” but it does not replace the conceptual primacy of the first-order equation.
Why this matters
If the project is going to produce an EFT corner, a numerical fitting program, or any credible discussion of fluctuations, you will end up linearizing something and controlling error norms. (I_2) is that control functional. The first-order equation is the geometric statement; the square is the engineering interface. Keeping both, and being explicit about what each one buys you, is how we avoid overpromising.
Key takeaway
Squaring gives you a robust second-order theory whose solutions include all first-order solutions, but also potentially more.
Technical takeaway
(I_2(ω) = \int_Y \langle Υ_ω, *_Y Υ_ω⟩), with (\Upsilon_ω = 0 ⇒ EL(I_2): 𝓛_ω^†(\Upsilon_ω)=0), but not conversely.
More from GR