Meta AI, University of Texas at Austin
ICLR 2024, https://arxiv.org/abs/2305.14550
Download the humanoid offline RL data from here: https://dl.fbaipublicfiles.com/prajj/rl_paradigm/humanoid_offline_rl_data.tar.gz
Each directory contains a script directory within it. The script within script directory was used to run experiments for this study. All the scripts are highly configurable. Change the parameter according to your needs.
$ cd atari
$ sbatch scripts/train_atari.shFor DT and BC, use
$ cd gym
$ sbatch scripts/train_gym.shFor CQL, use
$ cd gym
$ sbatch scripts/train_cql.sh$ cd robomimic
$ sbatch scripts/train_default.shThese scripts will create five directories named 1, 2, etc. depending on the number of seed provided. Each of these directory will contain a result.json file. Each directory is accompanied by read_data.py or read_data_cql.py. After the results are dumped, these scripts can be used to get the average mean and std dev from all the runs.
python3 read_data.py --json_file_path $FILE_TO_RESULT_DIR