arXiv submission date: 2026-04-09
📄 Abstract - EEG2Vision: A Multimodal EEG-Based Framework for 2D Visual Reconstruction in Cognitive Neuroscience

Reconstructing visual stimuli from non-invasive electroencephalography (EEG) remains challenging due to its low spatial resolution and high noise, particularly under realistic low-density electrode configurations. To address this, we present EEG2Vision, a modular, end-to-end EEG-to-image framework that systematically evaluates reconstruction performance across different EEG resolutions (128, 64, 32, and 24 channels) and enhances visual quality through a prompt-guided post-reconstruction boosting mechanism. Starting from EEG-conditioned diffusion reconstruction, the boosting stage uses a multimodal large language model to extract semantic descriptions and leverages image-to-image diffusion to refine geometry and perceptual coherence while preserving EEG-grounded structure. Our experiments show that semantic decoding accuracy degrades significantly with channel reduction (e.g., 50-way Top-1 accuracy falls from 89% to 38%), while reconstruction quality decreases only slightly (e.g., FID rises from 76.77 to 80.51). The proposed boosting consistently improves perceptual metrics across all configurations, achieving up to 9.71% IS gains in low-channel settings. A user study confirms a clear perceptual preference for boosted reconstructions. The proposed approach significantly boosts the feasibility of real-time brain-2-image applications using low-resolution EEG devices, potentially unlocking this type of application outside laboratory settings.
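
To put the two quoted trends on a common scale, the short sketch below computes the relative change between the endpoint values given in the abstract. The helper name `relative_change` is hypothetical and only the four numbers come from the abstract; nothing here reproduces the paper's evaluation code. Note that FID is a lower-is-better metric, so a positive change means slightly worse reconstructions.

```python
def relative_change(before: float, after: float) -> float:
    """Signed relative change from `before` to `after`, in percent."""
    return (after - before) / before * 100.0


# Endpoint values quoted in the abstract (hypothetical variable names).
acc_before, acc_after = 89.0, 38.0    # 50-way Top-1 accuracy, in percent
fid_before, fid_after = 76.77, 80.51  # FID, lower is better

acc_change = relative_change(acc_before, acc_after)
fid_change = relative_change(fid_before, fid_after)

print(f"Top-1 accuracy: {acc_change:+.1f}% relative "
      f"({acc_after - acc_before:+.0f} percentage points)")
print(f"FID: {fid_change:+.1f}% relative (positive means worse)")
```

Under these numbers the accuracy drop is more than half of its starting value, while FID moves by only a few percent, which is the contrast the abstract draws between semantic decoding and reconstruction quality.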

Top-level tags: medical, multi-modal, computer vision
Detailed tags: eeg decoding, visual reconstruction, diffusion models, multimodal llm, cognitive neuroscience

EEG2Vision: A Multimodal EEG-Based Framework for 2D Visual Reconstruction in Cognitive Neuroscience


1️⃣ One-sentence summary

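This paper proposes EEG2Vision, a modular framework that reconstructs the images a person is viewing from low-resolution, low-electrode-count EEG signals and markedly improves image quality through a prompt-guided post-reconstruction boosting mechanism, making real-time "brain-to-image" applications outside the laboratory more feasible.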

Source: arXiv: 2604.08063