塑造NeRF几何:基于人类偏好微调的3D感知人脸GAN / Sculpting NeRF Geometry: Human-Preference Fine-Tuning of a 3D-Aware Face GAN
1️⃣ 一句话总结
本文提出一种无需外部形状参考或三维网格的微调方法,直接利用人类偏好奖励信号优化神经辐射场的密度场,从而改进3D人脸生成模型的几何质量,并以一次性标注员的偏好作为概念验证,使生成的人脸在74.4%的双选比较中更受用户喜爱。
Reinforcement learning from human feedback (RLHF) for 3D generation is now established across a number of works, but most existing pipelines optimise explicit surface representations, often by converting radiance fields into meshes and training heavily on surface-supervised data. We instead fine-tune a pretrained 3D-aware generative model directly from a learned reward over radiance-field density ($\sigma$) values, with no externally supplied mesh or shape prior. The reward model requires no pretraining, trains easily on a small set of preference samples, and yields robust improvement in 3D geometry. Working on an unconditional 3D-aware face GAN (EG3D), our reward reads the continuous 3D density field of the neural radiance field (NeRF) directly and supplies a geometry-only learning signal, requiring neither text conditioning, mesh extraction, nor multi-view rendering. A density-consistency constraint keeps the 2D appearance qualitatively similar while the geometry is reshaped, at a measurable but bounded distributional cost (FID-50k rises from 4.09 to 6.66): the fine-tuned generator, trained from the preferences of a single annotator as a proof of concept, produces face geometries preferred by users in 74.4% of pairwise comparisons.
塑造NeRF几何:基于人类偏好微调的3D感知人脸GAN / Sculpting NeRF Geometry: Human-Preference Fine-Tuning of a 3D-Aware Face GAN
本文提出一种无需外部形状参考或三维网格的微调方法,直接利用人类偏好奖励信号优化神经辐射场的密度场,从而改进3D人脸生成模型的几何质量,并以一次性标注员的偏好作为概念验证,使生成的人脸在74.4%的双选比较中更受用户喜爱。
源自 arXiv: 2606.27305