人类与大型语言模型中系统1和系统2语义记忆结构在偏见形成中的作用 / The role of System 1 and System 2 semantic memory structure in human and LLM biases
1️⃣ 一句话总结
这项研究发现,人类通过两种不同的语义记忆结构(快速直觉的‘系统1’和缓慢理性的‘系统2’)来调节隐性偏见,其中系统2结构能有效降低偏见,而当前的大型语言模型缺乏这种内在的、类似人类的认知调节机制。
Implicit biases in both humans and large language models (LLMs) pose significant societal risks. Dual process theories propose that biases arise primarily from associative System 1 thinking, while deliberative System 2 thinking mitigates bias, but the cognitive mechanisms that give rise to this phenomenon remain poorly understood. To better understand what underlies this duality in humans, and possibly in LLMs, we model System 1 and System 2 thinking as semantic memory networks with distinct structures, built from comparable datasets generated by both humans and LLMs. We then investigate how these distinct semantic memory structures relate to implicit gender bias using network-based evaluation metrics. We find that semantic memory structures are irreducible only in humans, suggesting that LLMs lack certain types of human-like conceptual knowledge. Moreover, semantic memory structure relates consistently to implicit bias only in humans, with lower levels of bias in System~2 structures. These findings suggest that certain types of conceptual knowledge contribute to bias regulation in humans, but not in LLMs, highlighting fundamental differences between human and machine cognition.
人类与大型语言模型中系统1和系统2语义记忆结构在偏见形成中的作用 / The role of System 1 and System 2 semantic memory structure in human and LLM biases
这项研究发现,人类通过两种不同的语义记忆结构(快速直觉的‘系统1’和缓慢理性的‘系统2’)来调节隐性偏见,其中系统2结构能有效降低偏见,而当前的大型语言模型缺乏这种内在的、类似人类的认知调节机制。
源自 arXiv: 2604.12816