Haici Mobile Dictionary
  • A multi-step Q-learning algorithm was employed to overcome the slow convergence of standard Q-learning, and a CMAC neural network was used to generalize over the continuous state space.
