mmWave Radar Aware Dual-Conditioned GAN for Speech Reconstruction of Signals With Low SNR

📄 Abstract - mmWave Radar Aware Dual-Conditioned GAN for Speech Reconstruction of Signals With Low SNR

Millimeter-wave (mmWave) radar captures are band-limited and noisy, making for difficult reconstruction of intelligible full-bandwidth speech. In this work, we propose a two-stage speech reconstruction pipeline for mmWave using a Radar-Aware Dual-conditioned Generative Adversarial Network (RAD-GAN), which is capable of performing bandwidth extension on signals with low signal-to-noise ratios (-5 dB to -1 dB), captured through glass walls. We propose an mmWave-tailored Multi-Mel Discriminator (MMD) and a Residual Fusion Gate (RFG) to enhance the generator input to process multiple conditioning channels. The proposed two-stage pipeline involves pretraining the model on synthetically clipped clean speech and finetuning on fused mel spectrograms generated by the RFG. We empirically show that the proposed method, trained on a limited dataset, with no pre-trained modules, and no data augmentations, outperformed state-of-the-art approaches for this specific task. Audio examples of RAD-GAN are available online at this https URL.

用于低信噪比信号语音重建的毫米波雷达感知双条件生成对抗网络 / mmWave Radar Aware Dual-Conditioned GAN for Speech Reconstruction of Signals With Low SNR

1️⃣ 一句话总结

这项研究提出了一种名为RAD-GAN的两阶段智能语音重建方法，它能有效利用毫米波雷达信号，即使信号被玻璃墙阻挡且质量很差，也能清晰还原出人声。

← 返回列表

菜单

AI 帮我研读全文

1️⃣ 一句话总结

密码管理

设置密码

修改密码

移除密码

菜单

AI 帮我研读全文

1️⃣ 一句话总结

获取最新论文摘要