← 返回列表

arXiv 提交日期: 2026-02-16

📄 Abstract - Image Generation with a Sphere Encoder

We introduce the Sphere Encoder, an efficient generative framework capable of producing images in a single forward pass and competing with many-step diffusion models using fewer than five steps. Our approach works by learning an encoder that maps natural images uniformly onto a spherical latent space, and a decoder that maps random latent vectors back to the image space. Trained solely through image reconstruction losses, the model generates an image by simply decoding a random point on the sphere. Our architecture naturally supports conditional generation, and looping the encoder/decoder a few times can further enhance image quality. Across several datasets, the sphere encoder approach yields performance competitive with state of the art diffusions, but with a small fraction of the inference cost. Project page is available at this https URL .

顶级标签: computer vision model training aigc

基于球面编码器的图像生成 / Image Generation with a Sphere Encoder

1️⃣ 一句话总结

这篇论文提出了一种名为‘球面编码器’的新型图像生成框架，它通过将图像映射到球面空间并直接解码生成图片，仅需一次或数次前向计算就能达到与复杂多步扩散模型相媲美的效果，同时大大降低了计算成本。

👋 没兴趣 ☆ 感兴趣 📌 待读

打开原文 PDF

源自 arXiv: 2602.15030

← 返回列表

菜单

AI 帮我研读全文

1️⃣ 一句话总结

密码管理

设置密码

修改密码

移除密码

菜单

AI 帮我研读全文

1️⃣ 一句话总结

获取最新论文摘要