General Suggestions
The environments rely heavily on JIT compilation for speed. In rare situations, however, the JIT-compiled code can return NaN gradients, and training will fail explicitly when this happens. If you run into this, disable JIT compilation by setting PYTORCH_JIT=0.

Horovod caching can be a minor bottleneck. Disable it by setting HOROVOD_CACHE_CAPACITY=0.
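
Both settings are environment variables, so they have to be in place before the training process starts (PYTORCH_JIT in particular is read when PyTorch is imported). The following is a minimal sketch of one way to prepare such an environment from Python; the helper name `debug_env` and its flags are hypothetical and are not part of this project.

```python
import os


def debug_env(base=None, disable_jit=True, disable_horovod_cache=True):
    """Return a copy of an environment mapping with the troubleshooting
    variables applied. `debug_env` is a hypothetical helper used only for
    illustration; it is not part of the project.
    """
    # Copy so that the caller's mapping (or os.environ) is not mutated.
    env = dict(os.environ if base is None else base)
    if disable_jit:
        # Turns off TorchScript compilation in the launched process.
        env["PYTORCH_JIT"] = "0"
    if disable_horovod_cache:
        # A capacity of 0 disables Horovod's response cache.
        env["HOROVOD_CACHE_CAPACITY"] = "0"
    return env


# Example: build an environment that only disables JIT.
jit_only = debug_env({"PATH": "/usr/bin"}, disable_horovod_cache=False)
```

The returned mapping can be passed as the `env` argument of `subprocess.run` when launching a trainer. Setting the variables directly in the shell before the launch command works equally well.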

By default, we simulate the environment on the CPU. For low nagents this is the faster option, because the high kernel launch overhead on the GPU outweighs its benefits. If you want to use our tsim and agents modules to simulate a large number of vehicles, uncomment this line or the corresponding line in other trainers.
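
To illustrate the trade-off described above, here is a rough sketch of a device choice based on the number of agents. It is not how the trainers make this decision (they use the commented-out line mentioned above); the function name `pick_sim_device` and the threshold value are assumptions chosen purely for illustration, and the right crossover point should be found by measuring on your own hardware.

```python
def pick_sim_device(nagents, cuda_available, gpu_threshold=64):
    """Pick a simulation device name from the agent count.

    Hypothetical helper: `gpu_threshold` is an arbitrary placeholder,
    not a value taken from the project.
    """
    if nagents < 1:
        raise ValueError("nagents must be a positive integer")
    # Few agents: per-step GPU kernel launch overhead dominates, so stay on CPU.
    if not cuda_available or nagents < gpu_threshold:
        return "cpu"
    # Many agents: batched GPU simulation amortizes the launch overhead.
    return "cuda"
```

In a PyTorch program, `cuda_available` would typically come from `torch.cuda.is_available()`.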