分析极化地缘政治背景下大语言模型的人物生成与公平性解读 / Analysing LLM Persona Generation and Fairness Interpretation in Polarised Geopolitical Contexts
1️⃣ 一句话总结
这篇论文通过分析五种主流大语言模型在巴以冲突等不同地缘政治背景下生成的人物角色,发现模型会系统性地产生带有社会经济地位刻板印象的描述,并且其内部关于公平性的推理与实际生成结果之间存在不一致。
Large language models (LLMs) are increasingly utilised for social simulation and persona generation, necessitating an understanding of how they represent geopolitical identities. In this paper, we analyse personas generated for Palestinian and Israeli identities by five popular LLMs across 640 experimental conditions, varying context (war vs non-war) and assigned roles. We observe significant distributional patterns in the generated attributes: Palestinian profiles in war contexts are frequently associated with lower socioeconomic status and survival-oriented roles, whereas Israeli profiles predominantly retain middle-class status and specialised professional attributes. When prompted with explicit instructions to avoid harmful assumptions, models exhibit diverse distributional changes, e.g., marked increases in non-binary gender inferences or a convergence toward generic occupational roles (e.g., "student"), while the underlying socioeconomic distinctions often remain. Furthermore, analysis of reasoning traces reveals an interesting dynamics between model reasoning and generation: while rationales consistently mention fairness-related concepts, the final generated personas follow the aforementioned diverse distributional changes. These findings illustrate a picture of how models interpret geopolitical contexts, while suggesting that they process fairness and adjust in varied ways; there is no consistent, direct translation of fairness concepts into representative outcomes.
分析极化地缘政治背景下大语言模型的人物生成与公平性解读 / Analysing LLM Persona Generation and Fairness Interpretation in Polarised Geopolitical Contexts
这篇论文通过分析五种主流大语言模型在巴以冲突等不同地缘政治背景下生成的人物角色,发现模型会系统性地产生带有社会经济地位刻板印象的描述,并且其内部关于公平性的推理与实际生成结果之间存在不一致。
源自 arXiv: 2603.22837