菜单

关于 🐙 GitHub
arXiv 提交日期: 2026-05-20
📄 Abstract - Assessing socio-economic climate impacts from text data

Recent advances in natural language processing (NLP) and large language models (LLMs) have enabled the systematic use of large-scale textual data from news, social media, and reports to create datasets with socio-economic impacts of climate hazards such as floods, droughts, storms, and multi-hazard events. As the field of text-as-data for impact assessment expands, so does its methodological complexity. Yet research remains fragmented, with no clear guidelines for defining what constitutes an impact, handling temporal and spatial biases, and selecting appropriate modeling and post-processing strategies. This lack of coherence limits transparency and comparability across studies. Here, we address this gap by synthesising common practices, describing key challenges specific to the use of text-as-data methods for analyzing socio-economic impact data, and proposing recommendations to address them. By providing guidance on best practices, we aim to support the construction of robust text-derived socio-economic impact datasets that can more accurately inform disaster risk management and attribution studies.

顶级标签: natural language processing llm
详细标签: climate impact socio-economic text-as-data disaster risk impact assessment 或 搜索:

从文本数据评估社会经济气候影响 / Assessing socio-economic climate impacts from text data


1️⃣ 一句话总结

本文系统总结了如何利用自然语言处理技术从新闻、社交媒体等文本中提取气候灾害(如洪水、干旱)的社会经济影响数据,并针对数据定义、时空偏差和建模策略等关键挑战提出了实用建议,旨在帮助构建更可靠的数据集来支持灾害风险管理。

源自 arXiv: 2605.20793