菜单

关于 🐙 GitHub
arXiv 提交日期: 2026-06-03
📄 Abstract - Neetyabhas: A Framework for Uncertainty-Aware Public Policy Optimization in Rational Agent-Based Models

Purpose The WHO's COVID-19 non-pharmaceutical interventions (e.g., lockdowns, vaccinations) effectively curb transmission but impose heavy economic strains. Existing research often neglects individual behaviors and falsely assumes perfect infection tracking and flawless policy execution, failing to account for real-world uncertainties and errors. Methods We propose an integrative approach incorporating uncertainties in both epidemic measurement (infections/hospitalizations) and policy implementation. We built a simulation model of 1,000 individuals making real-time choices regarding mask-wearing, vaccination, and shopping. Concurrently, policymakers deploy interventions (lockdowns, mandates) based on health and economic observations. This framework is driven by hierarchical reinforcement learning agents, utilizing deep Q-networks alongside uncertainty-aware policy gradient variants (DDPG and TD3). Results The simulations effectively managed the epidemic's progression. Masking and vaccinations proved highly effective, significantly reducing both the outbreak's peak height and duration. By integrating individual behaviors, policy uncertainties, and multifaceted interventions, our dynamic control approach successfully mitigated the epidemic's impact. Conclusions Our model overcomes previous research limitations by embedding uncertainty and human behavior into public health policy frameworks. The simulation demonstrates that accounting for individual choices and imperfect data is crucial for designing effective interventions during complex pandemics, with masks and vaccines serving as pivotal tools.

顶级标签: reinforcement learning agents systems
详细标签: uncertainty-aware public policy epidemic simulation hierarchical rl human behavior 或 搜索:

Neetyabhas:理性智能体模型中面向不确定性感知的公共政策优化框架 / Neetyabhas: A Framework for Uncertainty-Aware Public Policy Optimization in Rational Agent-Based Models


1️⃣ 一句话总结

该研究提出一个新的模拟框架,通过将个体行为、测量误差和政策执行中的不确定性纳入模型,利用深度强化学习智能体同时模拟人群的实时选择(如戴口罩、打疫苗)和政府干预(如封锁、强制措施),从而更真实地评估疫情防控策略的效果,并发现口罩和疫苗是降低疫情峰值和缩短持续时间的关键工具。

源自 arXiv: 2606.04562