SenFlow: Inter-Sentence Flow Modeling for AI-Generated Text Detection in Hybrid Documents

📄 Abstract - SenFlow: Inter-Sentence Flow Modeling for AI-Generated Text Detection in Hybrid Documents

Sentence-level AI-generated text detection (S-AGTD) for hybrid documents, where humans and LLMs co-author one text, faces two gaps: existing methods classify each sentence in isolation, discarding inter-sentence dependencies, and existing benchmarks omit the newest generation of generators. We construct MOSAIC, a benchmark of 16,000 hybrid documents over PubMed and XSum, generated by DeepSeek-V3.2 and Kimi K2 under stringent quality controls including a perplexity-consistency filter absent from prior benchmarks. We recast S-AGTD as structured prediction over the document sentence sequence and instantiate it as SenFlow, integrating graph-based inter-sentence propagation with linear-chain CRF decoding in a single document-level pass over a sentence graph. SenFlow reaches state-of-the-art performance on MOSAIC, with a +4.15 pp average Macro-F1 margin on cross-domain transfer, the hardest of three protocols of increasing difficulty. We further find that even after the perplexity filter equalizes overt cues, AI insertions retain a generator-dependent sentence-length gap that sentence-level detectors still exploit. Code and data: this https URL

SenFlow：面向混合文档中AI生成文本检测的句间流建模 / SenFlow: Inter-Sentence Flow Modeling for AI-Generated Text Detection in Hybrid Documents

1️⃣ 一句话总结

本文针对人类与AI协作撰写的混合文档，提出了一种名为SenFlow的新方法，通过建模句子之间的依赖关系（而非独立分类每个句子），并结合图传播与序列解码技术，显著提升了AI生成句子的检测准确率；同时，作者构建了包含最新大模型生成内容的高质量测试基准MOSAIC，实验证明该方法在跨领域场景下表现最优。

← 返回列表

菜单

AI 帮我研读全文

1️⃣ 一句话总结

密码管理

设置密码

修改密码

移除密码

菜单

AI 帮我研读全文

1️⃣ 一句话总结

获取最新论文摘要