Quantum Kernels are Spectral Tensor Networks

Bridging the Fourier and tensor-network descriptions of quantum machine learning. A new paper (arXiv:2606.20402) shows that entangling tensor kernels are Matrix Product Operator (MPO) factorizations of Fourier coefficient tensors, revealing that spectral compressibility dictates classical representability.

Paper: arXiv:2606.20402
Quantum Kernels as Spectral Tensor Networks banner showing quantum feature map mapping to an MPO chain via MPO factorization

In quantum machine learning, kernels are computed as overlaps of quantum feature states. While the traditional Hilbert space perspective connects these kernels to quantum states and measurements, a dual Fourier perspective exposes their spectral complexity based on data-encoding frequencies.

In their recent work, "Quantum Kernels are Spectral Tensor Networks" (arXiv:2606.20402), authors demonstrate that the finite Fourier expansion of any quantum kernel can be written directly as an Entangling Tensor Kernel (ETK). Specifically, they show that ETK contractions are Matrix Product Operator (MPO) factorizations of the corresponding Fourier coefficient tensors, establishing quantum kernels as spectral tensor networks.

The Spectral Tensor Network Identity

For feature maps where data enters through layers of Pauli-rotation gates interleaved with data-independent unitaries, the induced quantum kernel admits a finite Fourier expansion:

$$\kappa(\mathbf{x},\mathbf{x}') = \sum_{\boldsymbol{\mu},\boldsymbol{\nu}} c_{\boldsymbol{\mu},\boldsymbol{\nu}} e^{i(\boldsymbol{\mu}\cdot\boldsymbol{\xi}(\mathbf{x}) - \boldsymbol{\nu}\cdot\boldsymbol{\xi}(\mathbf{x}'))}$$

Here, the multi-indices $\boldsymbol{\mu}$ and $\boldsymbol{\nu}$ label the gate-level Fourier modes. By collecting these coefficients into an operator $\hat{C}$ and contracting it with product-state Fourier feature embeddings $|\Xi(\mathbf{x})\rangle$, the kernel can be expressed as:

$$\kappa(\mathbf{x},\mathbf{x}') = \langle\Xi(\mathbf{x})|\hat{C}|\Xi(\mathbf{x}')\rangle$$

When $\hat{C}$ is factorized as an MPO, this contraction represents the kernel in ETK form. Consequently, the minimum exact MPO bond dimension ($\chi$) is determined by the operator Schmidt rank across bipartitions of the frequency sites.

Feature-Wise Compression

Repeated uploads of the same classical features introduce frequency degeneracies. Grouping gate-level configurations that generate the same net feature frequency vector $\boldsymbol{\omega}$ yields a grouped Fourier representation:

$$\omega_i(\boldsymbol{\mu}) = \sum_{a \in \mathcal{A}_i} \mu_a$$

For a circuit with $d$ features uploaded $L$ times, grouping compresses the parameter space. The gate-level representation contains $3^{2dL}$ coefficients, whereas the feature-wise frequency representation reduces this to $(2L+1)^{2d}$. This polynomial scaling in $L$ acts as a crucial first compression step before low-rank tensor approximation.

Key Compression Benefits:
  • Exponential to Polynomial: Compresses the local frequency space from gate-level scaling to feature-wise scaling.
  • Schmidt Rank Decay: Operator Schmidt coefficients decay rapidly in layered random circuits, making small bond dimensions highly accurate.
  • Classical Tractability: A small bond dimension $\chi$ ensures that kernel evaluation is classically checkable and tractable.

Kernel Target Alignment as Cosine Similarity

The paper establishes that Kernel Target Alignment (KTA), typically used to optimize kernels on a data set $\mathcal{D}$, corresponds to a metric-weighted alignment in coefficient space. Crucially, when evaluated on a frequency-resolving grid $\mathcal{G}_{\Omega}$, the empirical metric $G$ reduces to the identity matrix $\mathbb{I}$ through discrete Fourier orthogonality.

Under this condition, KTA reduces to the Frobenius cosine similarity between the Fourier coefficient tensors:

$$\text{KTA}(K, K_Q) = \frac{\langle \tilde{C}, \tilde{C}_Q \rangle_F}{\|\tilde{C}\|_F \|\tilde{C}_Q\|_F}$$

This identity bridges the geometric data-space evaluation of KTA with the algebraic tensor-network structure, providing an optimization target for learning the MPO directly.

Classical Simulatability Diagnostics:
The rapid decay of the operator Schmidt spectrum across frequency partitions reveals that many layered quantum kernels are highly compressible. Thus, spectral compressibility serves as a powerful diagnostic tool, identifying whether a quantum kernel's representability remains classically tractable.

Conclusion

By framing quantum kernels as spectral tensor networks, this work demonstrates that a kernel's complexity is not merely dictated by its Hilbert space size or state entanglement. Instead, classical simulatability is governed by correlations in frequency space. Examining this spectral compressibility allows researchers to rigorously assess which quantum kernels are viable candidates for true quantum advantage.