海词手机词典
  • Hierarchical reinforcement learning can be used to solve curse of dimensionality problem. MAXQ method decomposes the task into gubtasks in different levels through decomposing value function hierarchically, so it can be realized in low dimension space.

    播放读音 播放读音