问题描述
每次调用 A3CdiscreteDense().train() 时都会出错;
Exception in thread "Thread-7" java.lang.Arrayindexoutofboundsexception: -1
at java.util.ArrayList.elementData(UnkNown Source)
at java.util.ArrayList.get(UnkNown Source)
at org.deeplearning4j.rl4j.learning.async.a3c.discrete.AdvantageActorCriticUpdatealgorithm.computeGradients(AdvantageActorCriticUpdatealgorithm.java:63)
at org.deeplearning4j.rl4j.learning.async.a3c.discrete.AdvantageActorCriticUpdatealgorithm.computeGradients(AdvantageActorCriticUpdatealgorithm.java:32)
at org.deeplearning4j.rl4j.learning.async.AsyncThreaddiscrete.trainSubEpoch(AsyncThreaddiscrete.java:130)
at org.deeplearning4j.rl4j.learning.async.AsyncThread.handleTraining(AsyncThread.java:192)
at org.deeplearning4j.rl4j.learning.async.AsyncThread.run(AsyncThread.java:168)
但是当我使用 DQN 时,相同的代码可以正常工作。
解决方法
我遇到了同样的问题。似乎默认学习配置不起作用,因为 nStep
值不应为 0。从构建器创建学习配置时只需调用 .nStep(5)
。您可以在此处找到更多信息:https://github.com/eclipse/deeplearning4j-examples/issues/991#issuecomment-823133909