菜单

关于 🐙 GitHub
arXiv 提交日期: 2026-05-04
📄 Abstract - Is It Novel and Why? Fine-Grained Patent Novelty Prediction Based on Passage Retrieval

Novelty assessment is a critical yet complex task in the examination process for patent acceptance, requiring examiners to determine whether an invention is disclosed in a prior art document. The process involves intricate matching between specific features of a patent claim and passages in the prior art. While prior work has approached novelty prediction primarily as a binary classification task at the claim level, we argue that this formulation is susceptible to spurious correlations and lacks the granularity required for practical application. In this work, we introduce FiNE-Patents (Fine-grained Novelty Examination of Patents), a novel dataset comprising 3,658 first patent claims annotated with fine-grained, feature-level prior art references extracted from European Search Opinion (ESOP) documents. We propose shifting the evaluation paradigm from simple binary classification to a joint retrieval and abstract reasoning task at the feature level, requiring models to identify specific passages from a prior art document that disclose individual claim features, and to identify which features of a claim make it novel. We implement and evaluate LLM-based workflows that decompose claims into features, analyze each feature against prior art, and finally derive a claim-level novelty prediction. Our experiments demonstrate that these workflows outperform embedding-based baselines on passage retrieval and novel feature identification. Furthermore, we show that unlike trained classifiers, LLMs are robust against spurious correlations present in the claim-level novelty classification task. We release the dataset and code to foster further research into transparent and granular patent analysis.

顶级标签: natural language processing llm machine learning
详细标签: patent analysis novelty detection passage retrieval dataset fine-grained evaluation 或 搜索:

它是否新颖且为何新颖?基于段落检索的细粒度专利新颖性预测 / Is It Novel and Why? Fine-Grained Patent Novelty Prediction Based on Passage Retrieval


1️⃣ 一句话总结

本文提出一种新的专利新颖性判断方法,通过引入细粒度特征级数据集和检索推理任务,利用大语言模型逐条分析专利权利要求与现有技术的匹配情况,从而更准确、透明地预测专利是否具有新颖性,并解释其具体原因。

源自 arXiv: 2605.02392