Q-learning is a typical RL method with a slow convergence speed especially as the scales of the state space and the action space increase.

 
  • 利用模糊综合决策方法处理专家经验和环境信息得到Q学习的先验知识,对Q学习的初始状态进行优化。
今日热词
目录 附录 查词历史