HyFI:用于脑视觉对齐的双曲特征插值 / HyFI: Hyperbolic Feature Interpolation for Brain-Vision Alignment
1️⃣ 一句话总结
这篇论文提出了一种名为HyFI的新方法,它利用双曲空间的几何特性,将图像中的语义和感知特征进行融合与压缩,从而更有效地将大脑信号与视觉信息对齐,显著提升了从大脑信号中零样本检索图像的准确率。
Recent progress in artificial intelligence has encouraged numerous attempts to understand and decode human visual system from brain signals. These prior works typically align neural activity independently with semantic and perceptual features extracted from images using pre-trained vision models. However, they fail to account for two key challenges: (1) the modality gap arising from the natural difference in the information level of representation between brain signals and images, and (2) the fact that semantic and perceptual features are highly entangled within neural activity. To address these issues, we utilize hyperbolic space, which is well-suited for considering differences in the amount of information and has the geometric property that geodesics between two points naturally bend toward the origin, where the representational capacity is lower. Leveraging these properties, we propose a novel framework, Hyperbolic Feature Interpolation (HyFI), which interpolates between semantic and perceptual visual features along hyperbolic geodesics. This enables both the fusion and compression of perceptual and semantic information, effectively reflecting the limited expressiveness of brain signals and the entangled nature of these features. As a result, it facilitates better alignment between brain and visual features. We demonstrate that HyFI achieves state-of-the-art performance in zero-shot brain-to-image retrieval, outperforming prior methods with Top-1 accuracy improvements of up to +17.3% on THINGS-EEG and +9.1% on THINGS-MEG.
HyFI:用于脑视觉对齐的双曲特征插值 / HyFI: Hyperbolic Feature Interpolation for Brain-Vision Alignment
这篇论文提出了一种名为HyFI的新方法,它利用双曲空间的几何特性,将图像中的语义和感知特征进行融合与压缩,从而更有效地将大脑信号与视觉信息对齐,显著提升了从大脑信号中零样本检索图像的准确率。
源自 arXiv: 2603.22721