菜单

关于 🐙 GitHub
arXiv 提交日期: 2026-05-26
📄 Abstract - Self-Improvement Imitation with Biologically Guided Search for Protein Design Under Oracle Budgets

Protein sequence optimization under tight oracle budgets requires methods that explore vast combinatorial spaces while making each evaluation informative. Existing reinforcement learning and off-policy generative approaches often degrade under surrogate noise, and position-agnostic mutation proposals risk disrupting functionally critical residues. We introduce SILO, a trajectory-level self-improvement imitation framework for oracle-budgeted protein design. SILO uses a hierarchical edit policy that decomposes each mutation into a position choice followed by a residue choice. In each active-learning round, the policy samples candidate trajectories via incremental stochastic beam search without replacement (SBS), and a UCB-based proxy ensemble, combined with an alanine-scan fitness score (AFS), selects candidates with functionally relevant edits for in silico oracle evaluation. The policy is then updated by next-action cross-entropy imitation on the round's best oracle-labeled trajectories, avoiding value-function estimation. Across eight reproduced protein fitness landscapes and five strong baselines from prior work, SILO achieves the highest maximum and top-100 mean fitness on 8 of 8 landscapes within our evaluations, often exhibiting faster early-stage improvement. In low-data and noisy-proxy stress tests on two landscapes per setting, SILO remains competitive or best when several baselines degrade. Ablations show that SBS with AFS account for much of the gains, with iterative imitation providing additional improvement. Code is available at: this https URL

顶级标签: machine learning biology
详细标签: protein design imitation learning oracle budget hierarchical editing beam search 或 搜索:

受生物学启发的搜索与自我改进模仿:面向有限预算的蛋白质设计方法 / Self-Improvement Imitation with Biologically Guided Search for Protein Design Under Oracle Budgets


1️⃣ 一句话总结

本文提出了一种名为SILO的新方法,它模仿自我改进的学习过程,结合受生物学启发的搜索策略(如丙氨酸扫描评分),在有限的实验评估预算下高效优化蛋白质序列,显著优于现有方法,尤其适合早期快速获得高性能候选蛋白。

源自 arXiv: 2605.26690