菜单

关于 🐙 GitHub
arXiv 提交日期: 2026-05-27
📄 Abstract - ClothTransformer: Unified Latent-Space Transformers for Scalable Cloth Simulation

Unified and scalable Transformers have recently achieved remarkable success in modeling diverse phenomena traditionally associated with computer graphics, such as 3D visual effects, rendering processes, and motion in videos. In this work, we take a step further by investigating whether modern Transformer techniques can tackle the challenging task of cloth simulation. To this end, we present ClothTransformer, a framework that reformulates cloth simulation as autoregressive sequence modeling in a learned latent space. Existing neural cloth simulators are largely specialized to single scenarios, intrinsically coupled to the mesh discretization, and lack robust collision handling. Our approach addresses these limitations through three contributions: (1) a unified Transformer architecture that handles diverse scenarios -- body-driven garments, robotic manipulation, and free-fall collisions -- under a single model and achieves approximately $4$--$9{\times}$ lower error than prior state-of-the-art methods across all scenarios; (2) a scalable latent-space formulation that compresses arbitrary-resolution meshes into a fixed-size set of latent tokens, making temporal dynamics computation independent of mesh resolution; and (3) a diverse-scenario high-fidelity penetration-free dataset of ${\sim}$493.4k frames spanning all three settings, which enables a differentiable Continuous Collision Detection (CCD) module to suppress penetration artifacts.

顶级标签: machine learning systems multi-modal
详细标签: cloth simulation transformer latent space collision detection dataset 或 搜索:

ClothTransformer:面向可扩展布料模拟的统一潜在空间Transformer / ClothTransformer: Unified Latent-Space Transformers for Scalable Cloth Simulation


1️⃣ 一句话总结

本文提出ClothTransformer,利用Transformer模型在压缩后的潜在空间中自回归预测布料运动,首次用一个统一框架处理人体穿着、机器人操作和自由落体等多种场景,既大幅降低模拟误差,又解决了网格分辨率依赖和穿透碰撞等传统难题。

源自 arXiv: 2605.27852