面向十亿级多模态生物特征检索 / Towards Billion-scale Multi-modal Biometric Search
1️⃣ 一句话总结
本文介绍了首个基于开源架构的十亿级多模态生物特征检索系统(Bharat ABIS),它通过融合指纹、人脸和虹膜信息,为印度Aadhaar数据库中的15.5亿用户提供高效、准确的去重搜索,在2.2亿身份的测试中仅需0.5%的误识别率即可检出99.7%的真实身份,且单台服务器每秒可处理4000万人的搜索请求。
Searching a multi-biometric database of a billion records for a country-level identity system requires pushing the limits of all aspects of a biometric system, including acquisition, preprocessing, feature extraction, accuracy, matching speed, presentation attack detection, and handling of special cases (e.g., missing finger digits). This is the first paper that gives insights into such a large-scale multimodal biometric search system, called Bharat ABIS, based on open-source architectures. The end-to-end pipeline of Bharat ABIS processes fingerprint, face and iris modalities through modality-specific stages of preprocessing (segmentation), quality assessment, presentation attack detection, and learning an embedding (feature extraction), producing a concatenated template of 13.5KB per person. We present a detailed analysis of the modalities and how they are integrated to create an efficient and effective solution for 1:N search (de-duplication). Evaluations on a demographically stratified gallery of 220 million identities, randomly sampled from 1.55 billion records in India's Aadhaar database, yield an FNIR of 0.3% at an FPIR of 0.5%, for adult probes (over 18 years). We also compare the performance of Bharat ABIS against three state-of-the-art COTS systems on a 20M gallery. Our system achieves a throughput of 100 searches per second on a gallery of 40M on a single server (8xNvidia H100 GPUs, 2TB RAM).
面向十亿级多模态生物特征检索 / Towards Billion-scale Multi-modal Biometric Search
本文介绍了首个基于开源架构的十亿级多模态生物特征检索系统(Bharat ABIS),它通过融合指纹、人脸和虹膜信息,为印度Aadhaar数据库中的15.5亿用户提供高效、准确的去重搜索,在2.2亿身份的测试中仅需0.5%的误识别率即可检出99.7%的真实身份,且单台服务器每秒可处理4000万人的搜索请求。
源自 arXiv: 2605.07655