菜单

关于 🐙 GitHub
arXiv 提交日期: 2026-04-10
📄 Abstract - Hidden in Plain Sight: Visual-to-Symbolic Analytical Solution Inference from Field Visualizations

Recovering analytical solutions of physical fields from visual observations is a fundamental yet underexplored capability for AI-assisted scientific reasoning. We study visual-to-symbolic analytical solution inference (ViSA) for two-dimensional linear steady-state fields: given field visualizations (and first-order derivatives) plus minimal auxiliary metadata, the model must output a single executable SymPy expression with fully instantiated numeric constants. We introduce ViSA-R2 and align it with a self-verifying, solution-centric chain-of-thought pipeline that follows a physicist-like pathway: structural pattern recognition solution-family (ansatz) hypothesis parameter derivation consistency verification. We also release ViSA-Bench, a VLM-ready synthetic benchmark covering 30 linear steady-state scenarios with verifiable analytical/symbolic annotations, and evaluate predictions by numerical accuracy, expression-structure similarity, and character-level accuracy. Using an 8B open-weight Qwen3-VL backbone, ViSA-R2 outperforms strong open-source baselines and the evaluated closed-source frontier VLMs under a standardized protocol.

顶级标签: computer vision natural language processing multi-modal
详细标签: visual reasoning symbolic regression scientific ai physics-informed benchmark 或 搜索:

隐藏于眼前:从场可视化图像中推理视觉到符号的解析解 / Hidden in Plain Sight: Visual-to-Symbolic Analytical Solution Inference from Field Visualizations


1️⃣ 一句话总结

这篇论文提出了一种名为ViSA-R2的AI方法,它能像物理学家一样,通过观察二维物理场的可视化图像,自动推理并输出一个精确的、可执行的数学公式,从而将视觉信息转化为符号化的解析解。

源自 arXiv: 2604.08863