登录
首页 » Matlab » Q学习MATLAB代码

Q学习MATLAB代码

于 2023-04-10 发布 文件大小:2.14 kB
0 109
下载积分: 2 下载次数: 1

代码说明:

强化学习Q学习MATLAB代码,小车实验。通过Q学习贪婪学习所有可能性,采用时间差分方法,找到最优策略

下载说明:请别用迅雷下载,失败请重下,重下不扣分!

发表评论

0 个回复

  • IP113M
    IP113M circuit application
    2009-11-22 20:21:47下载
    积分:1
  • 4
    说明:  about the lms algorethem
    2010-05-25 22:04:18下载
    积分:1
  • ComManTel_Matlab
    分布式空时码的多符号差分检测,这是IEEE IEEE International Conference on Computing, Management and Telecommunications (ComManTel), Vietnam, April 2014的文章的对应仿真,是作者本人写的程序(Description simulation of the following paper: M. R. Avendi and Ha H. Nguyen, Multiple-Symbol Differential Detection for Distributed Space-Time Coding, IEEE International Conference on Computing, Management and Telecommunications (ComManTel), Vietnam, April 2014, Required Products MATLAB MATLAB release MATLAB 7.13 (R2011b) )
    2016-08-20 10:35:39下载
    积分:1
  • Fig5x37
    电力电子、电机控制系统仿真模型 经典仿真源程序 洪乃刚版本(Power electronics, motor control system simulation model version of the classic simulation source Hong Naigang)
    2011-05-09 23:23:11下载
    积分:1
  • 划分数据集Kylberg Sintorn
    说明:  Kylberg Sintorn数据集可用于进行算法的旋转不变测试,然而官网中并没有直接划分训练集与测试集,本代码对该数据集的图片进行重新命名与划分,可用于算法测试中(The Kylberg Sintorn dataset can be used to perform the rotation-invariant test of the algorithm. However, the official website does not directly divide the training set and the test set. This code renames and divides the picture of the dataset, which can be used in algorithm testing)
    2020-03-26 23:08:41下载
    积分:1
  • matlab_s
    matlab 的s函数应用心得,介绍了s函数的用法(matlab s-function application of experience, describes the use of the function s)
    2010-06-10 15:02:16下载
    积分:1
  • dcal
    病态方程组求解的matlab程序,涵盖LU分解、Jacobi迭代、GS迭代、SOR迭代四种方法,通过输入参数M来选去对应的算法。(Sick Equations matlab program, covering LU decomposition, Jacobi iteration, GS iteration, SOR iterative four methods, through M to choose input parameters to the corresponding algorithm.)
    2010-05-06 11:02:41下载
    积分:1
  • LFM信号脉冲压缩时的关键问题仿真
    以下几个matlab程序对雷达常用的线性调频信号(lfm信号)进行脉冲压缩时的关键问题进行了仿真,其中包括旁瓣抑制影响(加窗与不加窗)、多卜勒频移影响,并对时域脉压与频域脉压结果进行了对比分析,供相关技术人员参考。(The following several matlab programs simulate the key issues when pulse compression is performed on radar commonly used chirp signal (lfm signal), including sidelobe suppression effects (window and windowless), Doppler frequency shift effects, and The time-domain pulse pressure and frequency-domain pulse pressure results were compared and analyzed for reference by the relevant technical personnel.)
    2018-04-08 15:04:09下载
    积分:1
  • pso-SVM
    粒子群优化算法pso优化支持向量机svm(Particle swarm optimization algorithm pso optimization support vector machine svm)
    2020-11-17 14:39:39下载
    积分:1
  • STATCOM_Ud
    STATCOM 为一种快速可控的动态无功补偿设备(STATCOM is a fast and controllable dynamic reactive power compensation device)
    2018-04-25 22:11:35下载
    积分:1
  • 696516资源总数
  • 106914会员总数
  • 0今日下载