Experiment with ICM and PPO bunch for environment with sparse reward signal.
The experiment tests the contribution of intrinsic reward to the agent's ability to solve the sparse-reward environment from Unity ML-Agents Toolkit.
| Name | Name | Last commit date | ||
|---|---|---|---|---|
Experiment with ICM and PPO bunch for environment with sparse reward signal.
The experiment tests the contribution of intrinsic reward to the agent's ability to solve the sparse-reward environment from Unity ML-Agents Toolkit.