AI人工智能导论 (28).pdf
《AI人工智能导论 (28).pdf》由会员分享,可在线阅读,更多相关《AI人工智能导论 (28).pdf(9页珍藏版)》请在淘文阁 - 分享文档赚钱的网站上搜索。
1、19 5. Adversarial Search Contents: 5.1. Games 5.2. Optimal Decisions in Games 5.3. Alpha-Beta Pruning 5.4. Imperfect Real-time Decisions 5.5. Stochastic Games 5.6. Monte-Carlo Methods 20 In normal search普通搜索 The optimal solution would be a sequence of actions leading to a goal state (terminal state)
2、 that is a win. 最优解将是导致获胜的目标状态(终端状态)的一系列动作。 In adversarial search 对抗搜索 Both of MAXand MINcould have an optimal strategy. MAX和MIN都会有一个最优策略。 In initial state, MAXmust find a strategy to specify MAXs move, 在初始状态,MAX必须找到一个策略来确定MAX的动作, then MAXs moves in the states resulting from every possible response
3、by MIN, and so on. 然后MAX针对MIN的每个合理的对应采取相应的动作,以此类推。 Optimal Solution 最优解 5.2. Optimal Decisions in Games Artificial Intelligence : Searching : Adversarial Search 21 For a zero sum game, the name minimax arises because each player minimizes the maximum payoff possible for the other, he also minimizes
4、his own maximum loss. 对于零和博弈来说,其名称minimax的由来是因为每个玩家会使对手可能的最大收益变得最小,还 会使自己的最大损失变得最小。 Minimax Theorem 最小最大定理 5.2. Optimal Decisions in Games For every two-player, zero-sum game with finitely many strategies, there exists a value V and a mixed strategy for each player, such that 对于两个玩家、具有有限多个策略的零和博弈,每个
5、玩家存在一个值V和一个混合策略,使得: (a) Given player 2s strategy, the best payoff possible for player 1 is V, 给定玩家2的策略,则玩家1可能的最好收益是V, (b) Given player 1s strategy, the best payoff possible for player 2 is V. 给定玩家1的策略,则玩家2可能的最好收益是-V。 Artificial Intelligence : Searching : Adversarial Search 22 Given a game tree, the
6、optimal strategy can be determined from the minimax value of each node, write as MINIMAX(n). 给定一棵博弈树,则最优策略可以由每个节点的minimax值来确定,记作MINIMAX(n)。 Assume that both players play optimally from there to the end of the game. 假设两个玩家博弈自始至终都发挥得很好。 Optimal Solution in Adversarial Search 对抗搜索的最优解 5.2. Optimal Deci
7、sions in Games The minimax value of a terminal state is just its utility. MAXprefers to move to a state of maximum value, MINprefers a state of minimum value. 终端状态的minimax值只是其效用。 MAX倾向于移动到一个最大值状态,MIN则倾向于一个最小值状态。 function MINIMAX(s) returns an action if TERMINAL-TEST(s) then return UTILITY(s) if PLAY
8、ER(s) = MAXthen return maxa ACTIONS(s)MINIMAX(RESULT(s, a) if PLAYER(s) = MINthen return mina ACTIONS(s)MINIMAX(RESULT(s, a) Artificial Intelligence : Searching : Adversarial Search 23 MAXs best move at root is a1(with the highest minimax value) 根节点处MAX的最佳移动是a1 (具有最高的minimax值) MINs best reply at B i
- 配套讲稿:
如PPT文件的首页显示word图标,表示该PPT已包含配套word讲稿。双击word图标可打开word文档。
- 特殊限制:
部分文档作品中含有的国旗、国徽等图片,仅作为作品整体效果示例展示,禁止商用。设计者仅对作品中独创性部分享有著作权。
- 关 键 词:
- 人工智能导论
限制150内