菜单

关于 🐙 GitHub
arXiv 提交日期: 2026-05-27
📄 Abstract - REED: Post-Training Representation Editing for Cross-Domain Linguistic Steganalysis

In real-world scenarios of linguistic steganalysis, tested texts usually come from unseen domains with different vocabularies, topics, writing styles, and steganographic generation patterns, which can significantly degrade the detection performance. Although existing cross-domain steganalysis methods can effectively alleviate this problem through distribution alignment, domain-invariant feature learning, etc., the detection performance is not satisfactory. In this paper, we propose a post-training representation editing method for cross-domain linguistic steganalysis. Specifically, the detector is first trained on source-domain data, and then the feature extractor and classifier are kept frozen, and the intermediate representations are deterministically edited before classification. For domain adaptation, we construct a domain-offset vector from marginal source and target representations. For domain generalization, we derive a source-domain cover-to-stego direction to guide sample-specific editing. Experimental results show that compared with the advanced methods, the proposed method can achieve high cross-domain detection performance, especially in terms of F1-score, while requiring no architecture modification or parameter updates after source-domain training.

顶级标签: natural language processing model training machine learning
详细标签: steganalysis cross-domain representation editing domain adaptation text classification 或 搜索:

REED:面向跨域语言隐写分析的后训练表示编辑方法 / REED: Post-Training Representation Editing for Cross-Domain Linguistic Steganalysis


1️⃣ 一句话总结

本文提出一种名为REED的后训练方法,通过在已训练好的检测模型上编辑中间特征表示,而无需修改模型结构或重新训练,就能有效提升跨领域(不同词汇、主题、写作风格等)的隐写文本检测性能,尤其显著提高了F1分数。

源自 arXiv: 2605.28298