菜单

关于 🐙 GitHub
arXiv 提交日期: 2026-05-14
📄 Abstract - PROVE: A Perceptual RemOVal cohErence Benchmark for Visual Media

Evaluating object removal in images and videos remains challenging because the task is inherently one-to-many, yet existing metrics frequently disagree with human perception. Full-reference metrics reward copy-paste behaviors over genuine erasure; no-reference metrics suffer from systematic biases such as favoring blurry results; and global temporal metrics are insensitive to localized artifacts within edited regions. To address these limitations, we propose RC (Removal Coherence), a pair of perception-aligned metrics: RC-S, which measures spatial coherence via sliding-window feature comparison between masked and background regions, and RC-T, which measures temporal consistency via distribution tracking within shared restored regions across adjacent frames. To validate RC and support community benchmarking, we further introduce PROVE-Bench, a two-tier real-world benchmark comprising PROVE-M, an 80-video paired dataset with motion augmentation, and PROVE-H, a 100-video challenging subset without ground truth. Together, RC metrics and PROVE-Bench form the PROVE (Perceptual RemOVal cohErence) evaluation framework for visual media. Experiments across diverse image and video benchmarks demonstrate that RC achieves substantially stronger alignment with human judgments than existing evaluation protocols. The code for RC metrics and PROVE-Bench are publicly available at: this https URL.

顶级标签: computer vision benchmark model evaluation
详细标签: object removal perceptual metric temporal coherence spatial coherence video editing 或 搜索:

PROVE:一种面向视觉媒体的感知一致性物体移除评测基准 / PROVE: A Perceptual RemOVal cohErence Benchmark for Visual Media


1️⃣ 一句话总结

本文针对图像和视频中物体移除效果的评估难题,提出了一套更符合人类感知的新评测指标(RC-S和RC-T)以及配套的基准数据集(PROVE-Bench),有效解决了现有指标容易误判或偏袒模糊结果的缺陷。

源自 arXiv: 2605.14534