Extending reinforcement learning to MDPs with large state and action spaces and high task complexity inevitably runs into the curse of dimensionality, which results in slow convergence and long training times.

 
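  • When traditional reinforcement learning algorithms are applied to Markov decision process problems with large state and action spaces and complex tasks, they suffer from slow convergence and long training times.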