菜单

关于 🐙 GitHub
arXiv 提交日期: 2026-02-16
📄 Abstract - WavePhaseNet: A DFT-Based Method for Constructing Semantic Conceptual Hierarchy Structures (SCHS)

This paper reformulates Transformer/Attention mechanisms in Large Language Models (LLMs) through measure theory and frequency analysis, theoretically demonstrating that hallucination is an inevitable structural limitation. The embedding space functions as a conditional expectation over a {\sigma}-algebra, and its failure to be isomorphic to the semantic truth set fundamentally causes logical consistency breakdown. WavePhaseNet Method The authors propose WavePhaseNet, which explicitly constructs a Semantic Conceptual Hierarchy Structure (SCHS) using Discrete Fourier Transform (DFT). By applying DFT along the sequence dimension, semantic information is decomposed into frequency bands: low-frequency components capture global meaning and intent, while high-frequency components represent local syntax and expression. This staged separation enables precise semantic manipulation in diagonalized space. Dimensionality Reduction GPT-4's 24,576-dimensional embedding space exhibits a 1/f spectral structure based on language self-similarity and Zipf's law. Through cumulative energy analysis, the authors derive that approximately 3,000 dimensions constitute the lower bound for "complete representation." This demonstrates that reduction from 24,576 to 3,000 dimensions preserves meaning and intent while enabling rigorous reasoning and suppressing hallucination. Cohomological Consistency Control The reduced embedding space, constructed via cohomological regularization over overlapping local windows, allows defining a graph structure and cochain complex. This quantifies inconsistencies among local inferences as coboundary-based losses. Applying harmonic projection based on Hodge theory positions cohomology as a computable regularization principle for controlling semantic consistency, extracting maximally consistent global representations.

顶级标签: llm theory natural language processing
详细标签: transformer analysis hallucination reduction frequency decomposition embedding compression semantic consistency 或 搜索:

WavePhaseNet:一种基于离散傅里叶变换构建语义概念层次结构的方法 / WavePhaseNet: A DFT-Based Method for Constructing Semantic Conceptual Hierarchy Structures (SCHS)


1️⃣ 一句话总结

这篇论文提出了一种名为WavePhaseNet的新方法,它利用离散傅里叶变换将大语言模型中的语义信息分解为不同频率成分,从而构建清晰的语义层次结构,理论上证明了模型‘幻觉’是其固有结构缺陷,并通过降维和数学上的同调一致性控制,在保留核心语义的同时有效抑制了幻觉,实现了更严谨的推理。

源自 arXiv: 2602.14419