类脑的多智能体决策算法及其在扑克中的应用

BRAIN-LIKE MULTI-AGENT DECISION-MAKING ALGORITHM RESEARCH AND ITS APPLICATION

摘要: 多智能体决策领域存在模型的不同解释性、训练时间长的问题。因此，基于知识库对应用场景进行表达，在知识之间建立因果联系，使用图结构固定小样本学习后生成的新知识。在斗地主场景下设计出多智能体决策算法，实现行为表现类脑的智能水平，具备可解释性。相比于当下最先进的斗地主模型，作为地主身份对战时能取得 40% 的胜率，时间效率提升了 1 728 倍。

Abstract: The multi-agent decision-making suffers from the problems of non-interpretability of models and long training time. Therefore, this paper represented the application scenario based on the knowledge base, established causal links between knowledge, and used a self-growing graph structure to fix new knowledge learned after few-shot-learning. A multi-agent decision-making model was designed in the Doudizhu scenario to achieve a behavioral performance brain-like level of intelligence with explainability. Compared with the state-of-the-art landlord model of today, it can achieve a 40% win rate when playing against each other as a landlord with 1 728 times higher time efficiency.