自适应文本匿名化:通过提示优化学习隐私与效用的权衡 / Adaptive Text Anonymization: Learning Privacy-Utility Trade-offs via Prompt Optimization
1️⃣ 一句话总结
这篇论文提出了一种能自动适应不同场景需求的自适应文本匿名化框架,通过优化提示让语言模型在保护隐私和保留文本可用性之间找到最佳平衡点,效果优于传统固定方法。
Anonymizing textual documents is a highly context-sensitive problem: the appropriate balance between privacy protection and utility preservation varies with the data domain, privacy objectives, and downstream application. However, existing anonymization methods rely on static, manually designed strategies that lack the flexibility to adjust to diverse requirements and often fail to generalize across domains. We introduce adaptive text anonymization, a new task formulation in which anonymization strategies are automatically adapted to specific privacy-utility requirements. We propose a framework for task-specific prompt optimization that automatically constructs anonymization instructions for language models, enabling adaptation to different privacy goals, domains, and downstream usage patterns. To evaluate our approach, we present a benchmark spanning five datasets with diverse domains, privacy constraints, and utility objectives. Across all evaluated settings, our framework consistently achieves a better privacy-utility trade-off than existing baselines, while remaining computationally efficient and effective on open-source language models, with performance comparable to larger closed-source models. Additionally, we show that our method can discover novel anonymization strategies that explore different points along the privacy-utility trade-off frontier.
自适应文本匿名化:通过提示优化学习隐私与效用的权衡 / Adaptive Text Anonymization: Learning Privacy-Utility Trade-offs via Prompt Optimization
这篇论文提出了一种能自动适应不同场景需求的自适应文本匿名化框架,通过优化提示让语言模型在保护隐私和保留文本可用性之间找到最佳平衡点,效果优于传统固定方法。
源自 arXiv: 2602.20743