菜单

关于 🐙 GitHub
arXiv 提交日期: 2026-02-02
📄 Abstract - DFKI-Speech System for WildSpoof Challenge: A robust framework for SASV In-the-Wild

This paper presents the DFKI-Speech system developed for the WildSpoof Challenge under the Spoofing aware Automatic Speaker Verification (SASV) track. We propose a robust SASV framework in which a spoofing detector and a speaker verification (SV) network operate in tandem. The spoofing detector employs a self-supervised speech embedding extractor as the frontend, combined with a state-of-the-art graph neural network backend. In addition, a top-3 layer based mixture-of-experts (MoE) is used to fuse high-level and low-level features for effective spoofed utterance detection. For speaker verification, we adapt a low-complexity convolutional neural network that fuses 2D and 1D features at multiple scales, trained with the SphereFace loss. Additionally, contrastive circle loss is applied to adaptively weight positive and negative pairs within each training batch, enabling the network to better distinguish between hard and easy sample pairs. Finally, fixed imposter cohort based AS Norm score normalization and model ensembling are used to further enhance the discriminative capability of the speaker verification system.

顶级标签: audio systems model evaluation
详细标签: speaker verification spoofing detection graph neural networks score normalization model ensembling 或 搜索:

DFKI-Speech 面向WildSpoof挑战的系统:一个用于野外环境的SASV鲁棒框架 / DFKI-Speech System for WildSpoof Challenge: A robust framework for SASV In-the-Wild


1️⃣ 一句话总结

这篇论文提出了一个用于‘野外环境’下防欺骗说话人验证的鲁棒系统,它通过结合一个先进的图神经网络防欺骗检测器和一个多尺度特征融合的说话人验证网络,并采用多种优化技术,有效提升了系统在复杂真实场景中识别真假语音和验证说话人身份的能力。

源自 arXiv: 2602.02286