Surflo: Consistent 3D Surface Flow Model with Global State

📄 Abstract - Surflo: Consistent 3D Surface Flow Model with Global State

Geometry is invariant to viewpoint, which makes any collection of images a redundant encoding of a single 3D state. Existing feed-forward reconstruction models fail to exploit this: per-view methods emit overlapping, unaligned pointmaps that grow linearly with input count, while global-latent methods commit to a fixed, low-resolution output. We introduce Surflo, which compresses a variable number of unposed RGB views into K latent tokens-one global state-and decodes oriented 3D surface points by independently transporting them from noise onto the surface via flow matching. This frees the output from any fixed grid or token budget: the same latent yields from a few thousand to a million points in a single forward pass. To suppress the local inconsistencies inherent to independent per-point decoding, an inference-time guidance term correlates nearby points by injecting a photometric gradient during ODE integration. Surflo matches or surpasses feed-forward baselines on surface metrics, runs an order of magnitude faster than optimization-based methods that require hundreds of views, and is the only feed-forward approach to combine a global latent with arbitrary-resolution decoding.

Surflo：具有全局状态的一致三维表面流模型 / Surflo: Consistent 3D Surface Flow Model with Global State

1️⃣ 一句话总结

本文提出了一种名为Surflo的新方法，通过将任意数量的图片压缩成一个全局的3D状态，然后像“吹气球”一样从噪声点逐步生成任意密度的三维表面点，并在生成过程中利用图片颜色信息确保各点位置一致，从而快速、高质量地重建出任意分辨率的三维物体表面。

← 返回列表

菜单

AI 帮我研读全文

1️⃣ 一句话总结

密码管理

设置密码

修改密码

移除密码

菜单

AI 帮我研读全文

1️⃣ 一句话总结

获取最新论文摘要