The extension of reinforcement learning to MDPs with large state,action space and high complexity has inevitably encountered the problem of the curse of dimensionality,which results in slow convergence and long training time.

英美

释义

- 传统的强化学习算法应用到大状态、动作空间和任务复杂的马尔可夫决策过程问题时；存在收敛速度慢；训练时间长等问题.

把海词放在桌面上，查词最方便

触屏版| 电脑版

©2003 - 2025 海词词典(Dict.cn)

立即下载