登录
首页 » matlab » WindyGridWorldQLearning

WindyGridWorldQLearning

于 2013-04-19 发布 文件大小:2KB
0 228
下载积分: 1 下载次数: 31

代码说明:

  Q-learning (Watkins, 1989) is a simple way for agents to learn how to act optimally in controlled Markovian domains. It amounts to an incremental method for dynamic programming which imposes limited computational demands. It works by successively improving its evaluations of the quality of particular actions at particular states. This paper presents and proves in detail a convergence theorem for Q,-learning based on that outlined in Watkins (1989). We show that Q-learning converges to the optimum action-values with probability 1 so long as all actions are repeatedly sampled in all states and the action-values are represented discretely. We also sketch extensions to the cases of non-discounted, but absorbing, Markov environments, and where many Q values can be changed each iteration, rather than just one.

文件列表:

下载说明:请别用迅雷下载,失败请重下,重下不扣分!

发表评论

0 个回复

  • NNMATLAB
    neural network matlab example
    2009-10-17 15:00:29下载
    积分:1
  • xuedingyu_MATLAB
    这是薛定宇的经典书籍:高等数学问题的MATLAB求解~非常有用,非常详细,欢迎大家下载~(This is of Xue Dingyu the classic books: MATLAB higher mathematics problem solving to be very useful, very detailed, welcome to download to)
    2013-01-07 11:53:03下载
    积分:1
  • cat-map
    cat map function to scramble a one dimentson sequence
    2013-12-09 07:47:27下载
    积分:1
  • LRD
    室内无线定位技术,基于比例圆算法的matlab仿真程序(Indoor wireless positioning technology, based on proportional circle algorithm matlab simulation program)
    2013-10-27 01:01:11下载
    积分:1
  • MATLAB-CODES_TIME-HISTORY_RESPONSE-SPECTRUM-INDIA
    MATLAB CODES OF MODAL ANALYSIS OF MULTI-STORIED STRUCTURE, RESPONSE SPECTRUM, TIME HISTORY ANALYSIS USING CENTRAL DIFFERENCE METHOD FOR ELCENTRO GROUND MOTION
    2013-03-21 15:04:36下载
    积分:1
  • FT_analyse
    matlab环境下的时频分析,有仿真结果,有程序,希望对初学者有用(matlab Frequency-time analysis)
    2013-11-28 17:08:39下载
    积分:1
  • liziqun
    提供一个粒子群算法的实例代码,对于初学者和进一步开发者用很大的帮助(Provide a PSO algorithm code examples for beginners and more developers with great help)
    2015-04-18 10:57:55下载
    积分:1
  • 8psk
    8psk in Rayleigh fading
    2009-04-19 14:55:06下载
    积分:1
  • GUI_get_corr_points
    A matlab program, using a image processing. Testing with small images and upload. GO go go.
    2009-11-20 04:02:08下载
    积分:1
  • MATLAB-drawing
    介绍matlab的ppt,介绍了matlab中所使用的各种画图函数,初学者很容易上手。(Matlab introduction of ppt, introduced matlab used in a variety of drawing functions, beginners can easily get started.)
    2008-05-02 00:23:10下载
    积分:1
  • 696516资源总数
  • 106914会员总数
  • 0今日下载