菜单

关于 🐙 GitHub
arXiv 提交日期: 2026-05-06
📄 Abstract - GTF: Omnidirectional EPI Transformer for Light Field Super-Resolution

Light field (LF) image super-resolution benefits from Epipolar Plane Images (EPIs), whose line slopes explicitly encode disparity. However, existing Transformer-based LF SR methods mainly attend to horizontal and vertical EPIs, leaving diagonal epipolar geometry underexplored. We present GTF, an omnidirectional EPI Transformer that explicitly models horizontal, vertical, 45-degree, and 135-degree EPIs within a unified reconstruction framework. GTF combines directional EPI processing, MacPI-based prior injection, adaptive directional fusion, and a topology-preserving feed-forward network to better exploit LF geometry. For the NTIRE 2026 fidelity tracks, we use GTF as the main model, while a lightweight GTF-Tiny variant targets the efficiency track. On five standard LF SR benchmarks covering both real-captured and synthetic scenes, GTF reaches 32.78 dB without inference-time enhancement, and stronger inference settings with EPSW and test-time augmentation further improve performance. Under the NTIRE 2026 efficiency constraint, GTF-Tiny attains 32.57 dB with only 0.915M parameters and 19.81 GFLOPs. In the NTIRE 2026 Light Field Image Super-Resolution Challenge, our submissions rank 3rd on Track 1 and Track 3 and 4th on Track 2. Architecture-evolution, channel-width, and inference analyses further support the effectiveness of diagonal EPI modeling, directional fusion, and the lightweight design.

顶级标签: computer vision model evaluation benchmark
详细标签: light field super-resolution epipolar plane images transformer ntire challenge 或 搜索:

全方位EPI Transformer:用于光场图像超分辨率 / GTF: Omnidirectional EPI Transformer for Light Field Super-Resolution


1️⃣ 一句话总结

本文提出了GTF,一种新型Transformer模型,通过同时处理水平、垂直及两个对角线方向的光场极线图像(EPI),显著提升了光场图像的超分辨率重建质量,并在NTIRE 2026挑战赛中获得多个赛道前三名。

源自 arXiv: 2605.04581