菜单

关于 🐙 GitHub
arXiv 提交日期: 2026-02-16
📄 Abstract - CT-Bench: A Benchmark for Multimodal Lesion Understanding in Computed Tomography

Artificial intelligence (AI) can automatically delineate lesions on computed tomography (CT) and generate radiology report content, yet progress is limited by the scarcity of publicly available CT datasets with lesion-level annotations. To bridge this gap, we introduce CT-Bench, a first-of-its-kind benchmark dataset comprising two components: a Lesion Image and Metadata Set containing 20,335 lesions from 7,795 CT studies with bounding boxes, descriptions, and size information, and a multitask visual question answering benchmark with 2,850 QA pairs covering lesion localization, description, size estimation, and attribute categorization. Hard negative examples are included to reflect real-world diagnostic challenges. We evaluate multiple state-of-the-art multimodal models, including vision-language and medical CLIP variants, by comparing their performance to radiologist assessments, demonstrating the value of CT-Bench as a comprehensive benchmark for lesion analysis. Moreover, fine-tuning models on the Lesion Image and Metadata Set yields significant performance gains across both components, underscoring the clinical utility of CT-Bench.

顶级标签: medical benchmark computer vision
详细标签: medical imaging computed tomography lesion analysis multimodal vqa dataset 或 搜索:

CT-Bench:一个用于计算机断层扫描中多模态病灶理解的基准数据集 / CT-Bench: A Benchmark for Multimodal Lesion Understanding in Computed Tomography


1️⃣ 一句话总结

这篇论文创建了一个名为CT-Bench的公开基准数据集,它包含了大量带详细标注的CT病灶图像和对应的问答对,用于全面评估和提升AI模型在识别、描述和分析CT图像中病灶的能力,从而推动医疗影像AI的发展。

源自 arXiv: 2602.14879