Six Llamas: Comparative Religious Ethics Through LoRA-Adapted Language Models
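1️⃣ One-sentence summary
By fine-tuning the same large language model on the scriptures of different religions (Christianity, Islam, Judaism, Hinduism, Buddhism), this study finds that the resulting models produce distinguishable response patterns to moral dilemmas that are consistent with the ethical logic of their respective traditions, demonstrating the feasibility of using fine-tuned AI models for the comparative study of cultural ethics.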
We present Six Llamas, a comparative study examining whether large language models fine-tuned on distinct religious corpora encode systematically different patterns of ethical reasoning. We construct six variants of Meta-Llama-3.1-8B: one unmodified control and five LoRA-adapted models, each trained exclusively on the sacred and theological texts of one tradition (Christianity, Islam, Judaism, Hinduism, or Buddhism). All six models are probed with an identical battery of 17 standardized ethical prompts spanning moral dilemmas, game-theoretic scenarios, public policy questions, and moral-psychological self-assessments. To assess robustness and reproducibility, we implement a multi-temperature sampling design spanning ten temperature settings. We compute response consistency metrics, pairwise inter-model agreement rates, temperature sensitivity coefficients across four prompt domains, and run-to-run stability analyses. The findings show that (a) LoRA-adapted models produce ethical reasoning patterns that are systematically differentiated from the base model, (b) these patterns are consistent with the moral logics of their training traditions, (c) they are structured along interpretable dimensions in moral-philosophical space, (d) core ethical positions remain stable across temperature variations for high-consensus dilemmas, with the Trolley Problem achieving 100% consistency across all models and temperatures, (e) tradition-specific divergence intensifies at higher temperatures in morally contested domains, and (f) the base model exhibits the highest overall response consistency (mean 88.3%), suggesting that LoRA adaptation introduces both tradition-specific signal and increased sampling sensitivity. The study offers a proof-of-concept for the condensate comparative method using differentially trained language models as instruments for cultural and ethical analysis, and identifies specific criteria for falsification and planned extensions.
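The abstract names response consistency and pairwise inter-model agreement without giving their formulas. As a minimal sketch only, the Python below assumes that each sampled response has already been coded into a categorical answer, that consistency is the share of samples matching the most frequent answer, and that agreement is the fraction of prompts on which two models share the same most frequent answer. These definitions, the function names, and the toy data are illustrative assumptions, not the paper's stated method.

```python
from collections import Counter
from itertools import combinations


def modal_answer(responses):
    """Return the most frequent categorical answer in a list of responses."""
    return Counter(responses).most_common(1)[0][0]


def consistency(responses):
    """Share of responses equal to the modal answer (1.0 means fully consistent)."""
    top_count = Counter(responses).most_common(1)[0][1]
    return top_count / len(responses)


def pairwise_agreement(answers_by_model):
    """For each model pair, fraction of shared prompts whose modal answers match."""
    result = {}
    for a, b in combinations(sorted(answers_by_model), 2):
        prompts = answers_by_model[a].keys() & answers_by_model[b].keys()
        matches = sum(
            modal_answer(answers_by_model[a][p]) == modal_answer(answers_by_model[b][p])
            for p in prompts
        )
        result[(a, b)] = matches / len(prompts)
    return result


# Hypothetical toy data: model -> prompt -> coded answers across repeated samples.
samples = {
    "base": {
        "trolley": ["pull", "pull", "pull", "pull"],
        "policy": ["yes", "yes", "yes", "no"],
    },
    "buddhist": {
        "trolley": ["pull", "pull", "pull", "pull"],
        "policy": ["no", "no", "yes", "no"],
    },
}

print(consistency(samples["base"]["policy"]))
print(pairwise_agreement(samples))
```

Under these assumed definitions, a prompt on which every sample yields the same coded answer scores a consistency of 1.0, which is one way the reported 100% figure for the Trolley Problem could be read.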
Source: arXiv: 2604.18404