人工智能原理人工智能原理 (28).pdf
《人工智能原理人工智能原理 (28).pdf》由会员分享,可在线阅读,更多相关《人工智能原理人工智能原理 (28).pdf(18页珍藏版)》请在淘文阁 - 分享文档赚钱的网站上搜索。
1、Artificial IntelligenceArtificial Intelligence2Artificial Intelligence Part 1.Basics Part 2.Searching Part 3.Reasoning Part 4.Planning Part 5.LearningContents:Artificial Intelligence3Part 2.Searching 3.Solving Problems by Search 4.Local Search and Swarm Intelligence 5.Adversarial Search 6.Constraint
2、 Satisfaction ProblemsContents:Artificial Intelligence4Objectives 教学目的5.Adversarial Search To examine the problems that arise when we try to plan ahead in a world where other agents are planning against us.去考察这样一些会发生的问题,当我们试图在某个环境预先规划时,其他智能体也正在针对我们做规划。Artificial Intelligence55.Adversarial SearchCont
3、ents:5.1.Games 5.2.Optimal Decisions in Games 5.3.Alpha-Beta Pruning 5.4.Imperfect Real-time Decisions 5.5.Stochastic Games 5.6.Monte-Carlo Methods Artificial Intelligence:Searching:Adversarial Search6Search vs.Adversarial Search 搜索与对抗搜索5.1.GamesSearch 搜索Adversarial Search 对抗搜索Single agent 单智能体Multi
4、ple agents多智能体Solution is(heuristic)method for finding goal.解是寻找目标的(启发式)方法Solution is strategy(strategy specifies move for every possible opponent reply).解是策略(指定对每个可能对手回应的行动策略)Heuristics can find optimal solution.启发式法可以找到最优解Time limits force an approximate solution.时间受限被迫执行一个近似解Evaluation function:e
5、stimate of cost from start to goal through given node.评价函数:穿过给定节点从起始到目标的代价估计Evaluation function:evaluate“goodness”of game position.评价函数:评估博弈局势的“好坏”Artificial Intelligence:Searching:Adversarial Search7 Definitions of Game theory 博弈论的定义 Study of strategic decision making.Specifically,study of mathemat
6、ical models of conflict and cooperation between intelligent rational decision-makers.研究战略决策制定。具体来说,研究智能理性决策者之间的冲突与合作的数学模型。An alternative term is interactive decision theory.一个可替代的术语是交互式决策理论。Applications of Game theory 博弈论的应用 Economics,political science,psychology,logic,computer science,and biology.经
7、济学、政治学、心理学、逻辑、计算机科学、以及生物学。Behavioral relations and decision science,including both humans and non-humans(puters).行为关系与决策科学,包括人类与非人类(如计算机等)。Adversarial Search often Known as Games 对抗搜索通常称为博弈5.1.GamesArtificial Intelligence:Searching:Adversarial Search8 Machines(players)need“human-like”intelligence.机器
8、(玩家)需要“类人”的智能。Requiring to make decision within limited time.要求在有限的时间内进行决策。Features of games:博弈的特征:Games are Good Problems for AI 博弈是AI研究的好材料5.1.GamesTwo,or more players(agents)Turn-taking vs.simultaneous movesPerfect information vs.imperfect informationDeterministic vs.stochasticCooperative petitiv
9、e Zero-sum vs.non zero-sum两个、或多个玩家(智能体)轮流、与同步行动完全信息、与不完全信息确定性、与随机合作式、与对抗式零和、与非零和Artificial Intelligence:Searching:Adversarial Search9Zero Sum vs.Non-zero Sum 零和与非零和博弈5.1.Games Zero sum games 零和博弈 Agents have opposite utilities.智能体之间是对立的方式。Pure competition:win-lose,its sum is zero.纯竞争:输赢、其和为零。Non-zer
10、o sum games 非零和博弈 Agents have independent utilities.智能体之间是自主的方式。Cooperation,indifference,competition,.合作、中立、竞争、Win-win,win-lose or lose-lose,its sum is not zero.双赢、输赢、或双输,其和不为零。Artificial Intelligence:Searching:Adversarial Search10 Two members of a criminal gang are arrested and imprisoned.Each pris
11、oner is given the opportunity either to:betray the other by testifying that the other committed the crime,or to cooperate with the other by remaining silent.Here is the offer:有两个犯罪集团的成员被逮捕和监禁。每个囚徒只有二选一的机会:揭发对方并证明其犯罪,或者与对方合作保持沉默。惩罚方式如下:If A and B each betray the other,each of them serves 2 years in p
12、rison.若A和B彼此揭发对方,则每个囚徒监禁2年。If A betrays B but B remains silent,A will be set free and B will serve 3 years in prison(and vice versa).若A揭发B而B保持沉默,则A被释放而B监禁3年(反之亦然)。If A and B both remain silent,both of them will only serve 1 year in prison.若A和B都保持沉默,则他们仅被监禁1年。Example:Prisoners Dilemma 囚徒困境5.1.GamesAr
- 配套讲稿:
如PPT文件的首页显示word图标,表示该PPT已包含配套word讲稿。双击word图标可打开word文档。
- 特殊限制:
部分文档作品中含有的国旗、国徽等图片,仅作为作品整体效果示例展示,禁止商用。设计者仅对作品中独创性部分享有著作权。
- 关 键 词:
- 人工智能原理人工智能原理 28 人工智能 原理 28
限制150内