类脑的多智能体决策算法及其在扑克中的应用

BRAIN-LIKE MULTI-AGENT DECISION-MAKING ALGORITHM RESEARCH AND ITS APPLICATION

  • 摘要: 多智能体决策领域存在模型的不同解释性、训练时间长的问题。因此,基于知识库对应用场景进行表达,在知识之间建立因果联系,使用图结构固定小样本学习后生成的新知识。在斗地主场景下设计出多智能体决策算法,实现行为表现类脑的智能水平,具备可解释性。相比于当下最先进的斗地主模型,作为地主身份对战时能取得 40% 的胜率,时间效率提升了 1 728 倍。

     

    Abstract: The multi-agent decision-making suffers from the problems of non-interpretability of models and long training time. Therefore, this paper represented the application scenario based on the knowledge base, established causal links between knowledge, and used a self-growing graph structure to fix new knowledge learned after few-shot-learning. A multi-agent decision-making model was designed in the Doudizhu scenario to achieve a behavioral performance brain-like level of intelligence with explainability. Compared with the state-of-the-art landlord model of today, it can achieve a 40% win rate when playing against each other as a landlord with 1 728 times higher time efficiency.

     

/

返回文章
返回