PingPong: A Natural Benchmark for Multi-Turn Code-Switching Dialogues

📄 Abstract - PingPong: A Natural Benchmark for Multi-Turn Code-Switching Dialogues

Code-switching is a widespread practice among the world's multilingual majority, yet few benchmarks accurately reflect its complexity in everyday communication. We present PingPong, a benchmark for natural multi-party code-switching dialogues covering five language-combination variations, some of which are trilingual. Our dataset consists of human-authored conversations among 2 to 4 participants covering authentic, multi-threaded structures where replies frequently reference much earlier points in the dialogue. We demonstrate that our data is significantly more natural and structurally diverse than machine-generated alternatives, offering greater variation in message length, speaker dominance, and reply distance. Based on these dialogues, we define three downstream tasks: Question Answering, Dialogue Summarization, and Topic Classification. Evaluations of several state-of-the-art language models on PingPong reveal that performance remains limited on code-switched inputs, underscoring the urgent need for more robust NLP systems capable of addressing the intricacies of real-world multilingual discourse.

PingPong：多轮语码转换对话的自然基准 / PingPong: A Natural Benchmark for Multi-Turn Code-Switching Dialogues

1️⃣ 一句话总结

这篇论文提出了一个名为PingPong的新基准数据集，它包含了真实、多线程的多语言混合对话，用于评估和改进自然语言处理模型在处理复杂、自然的语码转换对话方面的能力。

← 返回列表

菜单

AI 帮我研读全文

1️⃣ 一句话总结

密码管理

设置密码

修改密码

移除密码

菜单

AI 帮我研读全文

1️⃣ 一句话总结

获取最新论文摘要