菜单

关于 🐙 GitHub
arXiv 提交日期: 2026-04-02
📄 Abstract - Retrieval-Augmented Question Answering over Scientific Literature for the Electron-Ion Collider

To harness the power of Language Models in answering domain specific specialized technical questions, Retrieval Augmented Generation (RAG) is been used widely. In this work, we have developed a Q\&A application inspired by the Retrieval Augmented Generation (RAG), which is comprised of an in-house database indexed on the arXiv articles related to the Electron-Ion Collider (EIC) experiment - one of the largest international scientific collaboration and incorporated an open-source LLaMA model for answer generation. This is an extension to it's proceeding application built on proprietary model and Cloud-hosted external knowledge-base for the EIC experiment. This locally-deployed RAG-system offers a cost-effective, resource-constraint alternative solution to build a RAG-assisted Q\&A application on answering domain-specific queries in the field of experimental nuclear physics. This set-up facilitates data-privacy, avoids sending any pre-publication scientific data and information to public domain. Future improvement will expand the knowledge base to encompass heterogeneous EIC-related publications and reports and upgrade the application pipeline orchestration to the LangGraph framework.

顶级标签: llm natural language processing systems
详细标签: retrieval-augmented generation question answering scientific literature domain-specific qa local deployment 或 搜索:

面向电子-离子对撞机科学文献的检索增强问答系统 / Retrieval-Augmented Question Answering over Scientific Literature for the Electron-Ion Collider


1️⃣ 一句话总结

本研究为实验核物理领域开发了一个本地化、低成本的检索增强问答系统,它利用开源大模型和自建的学术文献数据库,能够安全、高效地回答关于电子-离子对撞机的专业问题。

源自 arXiv: 2604.02259