Skip to content

Code for training embodied agents using imitation learning at scale in Habitat-Lab

License

Notifications You must be signed in to change notification settings

Ram81/habitat-imitation-baselines

Repository files navigation

Habitat-Web

Code for training imitation learning agents for Objectnav and Pick-and-Place in Habitat. This repo is the official code repository for the paper Habitat-Web: Learning Embodied Object-Search from Human Demonstrations at Scale

Reproducing Results

We provide best checkpoints for agents trained on ObjectNav and Pick-and-Place. You can use the following checkpoints to reproduce results reported in our paper.

Task Split Checkpoint Success Rate SPL
🆕ObjectNav v1 objectnav_semseg.ckpt 27.8 9.9
🆕Pick-and-Place New Initializations pick_place_rgbd_new_inits.ckpt 17.5 9.8
🆕Pick-and-Place New Instructions pick_place_rgbd_new_insts.ckpt 15.1 8.3
🆕Pick-and-Place New Environments pick_place_rgbd_new_envs.ckpt 8.3 4.1

You can find the pretrained RedNet semantic segmentation model weights here and the pretrained depth encoder weights here.

Overview

The primary code contributions from the paper are located in:

  • Imitation Learning Baselines:

    • ObjectNav: habitat_baselines/il/env_based/
    • Pick-and-Place: habitat_baselines/il/disk_based/
  • Experiment Configurations:

    • ObjectNav: habitat_baselines/config/objectnav/*.yaml
    • Pick-and-Place: habitat_baselines/config/pickplace/*.yaml
  • Replay Scripts:

    • ObjectNav: examples/objectnav_replay.py
    • Pick-and-Place: examples/pickplace_replay.py

Installation

  1. Clone the repository and install habitat-web-baselines using the commands below. Note that python=3.6 is required for working with habitat-web-baselines. All the development was done on habitat-lab=0.1.6.

    git clone https://github.com/Ram81/habitat-web-baselines.git
    cd habitat-web-baselines
    
    # We require python>=3.6 and cmake>=3.10
    conda create -n habitat-web python=3.6 cmake=3.14.0
    conda activate habitat-web
    
    pip install -e .
    python setup.py develop --all
  2. Install our custom build of habitat-sim, we highly recommend using the habitat-sim build from source for working with habitat-web-baselines. Use the following commands to set it up:

    git clone [email protected]:Ram81/habitat-sim.git
    cd habitat-sim
  3. Install dependencies

    Common

    pip install -r requirements.txt

    Linux (Tested with Ubuntu 18.04 with gcc 7.4.0)

    sudo apt-get update || true
    # These are fairly ubiquitous packages and your system likely has them already,
    # but if not, let's get the essentials for EGL support:
    sudo apt-get install -y --no-install-recommends \
         libjpeg-dev libglm-dev libgl1-mesa-glx libegl1-mesa-dev mesa-utils xorg-dev freeglut3-dev

    See this configuration for a full list of dependencies that our CI installs on a clean Ubuntu VM. If you run into build errors later, this is a good place to check if all dependencies are installed.

  4. Build Habitat-Sim

    Default build with bullet (for machines with a display attached)

    # Assuming we're still within habitat conda environment
    ./build.sh --bullet

    For headless systems (i.e. without an attached display, e.g. in a cluster) and multiple GPU systems

    ./build.sh --headless --bullet
  5. For use with habitat-web-baselines and your own python code, add habitat-sim to your PYTHONPATH. For example modify your .bashrc (or .bash_profile in Mac OS X) file by adding the line:

    export PYTHONPATH=$PYTHONPATH:/path/to/habitat-sim/

Data

Downloading MP3D Scene Dataset

Downloading Object Assets

  • Download the object assets used for Pick-and-Place task and THDA ObjectNav episodes from here.

  • Unzip the object assets and verify they are stored at data/test_assets/objects/.

Downloading Human Demonstrations Dataset

You can use the following datasets to reproduce results reported in our paper.

Dataset Scene dataset Split Link Extract path
ObjectNav-HD MP3D 70k objectnav_mp3d_70k.json.gz data/datasets/objectnav/objectnav_mp3d_70k/
ObjectNav-HD MP3D+Gibson Full objectnav_mp3d_gibson_80k.json.gz data/datasets/objectnav/objectnav_mp3d_gibson_80k/
Pick-and-Place-HD MP3D Full pick_place_12k.json.gz data/datasets/pick_place/pick_place_12k/
Pick-and-Place-HD MP3D New Initializations pick_place_unseen_initializations.json.gz data/datasets/pick_place/unseen_initializations/
Pick-and-Place-HD MP3D New Instructions pick_place_unseen_instructions.json.gz data/datasets/pick_place/unseen_instructions/
Pick-and-Place-HD MP3D New Environments pick_place_unseen_scenes.json.gz data/datasets/pick_place/unseen_scenes/

The demonstration datasets released as part of this project are licensed under a Creative Commons Attribution-NonCommercial 4.0 License.

Dataset Folder Structure

The code requires the datasets in data folder in the following format:

├── habitat-web-baselines/
│  ├── data
│  │  ├── scene_datasets/
│  │  │  ├── mp3d/
│  │  │  │  ├── JeFG25nYj2p.glb
│  │  │  │  └── JeFG25nYj2p.navmesh
│  │  ├── datasets
│  │  │  ├── objectnav/
│  │  │  │  ├── objectnav_mp3d_70k/
│  │  │  │  │  ├── train/
│  │  │  ├── pick_place/
│  │  │  │  ├── pick_place_12k/
│  │  │  │  │  ├── train/
│  │  ├── test_assets/
│  │  │  ├── objects/
│  │  │  │  ├── apple.glb
│  │  │  │  └── plate.glb

Packaging Demonstration Datasets

We also provide an example of packaging your own demonstrations dataset to train imitation learning agents with habitat-imitation-baselines here.

Test Setup

To verify that the data is set up correctly, run:

python examples/objectnav_replay.py --path data/datasets/objectnav/objectnav_mp3d_70k/sample/sample.json.gz

Usage

Training

For training the behavior cloning policy on the ObjectGoal Navigation task using the environment based setup:

  1. Use the following script for multi-node training
sbatch job_scripts/run_objectnav_training.sh habitat_baselines/config/objectnav/il_ddp_objectnav.yaml
  1. To run training on a single node use:
sbatch job_scripts/run_objectnav_training.sh habitat_baselines/config/objectnav/il_objectnav.yaml

For training the behavior cloning policy on the Pick-and-Place task using the disk based setup:

  1. Use the following script for multi-node training
sbatch job_scripts/run_pickplace_training.sh ddp
  1. To run training on a single node use:
sbatch job_scripts/run_pickplace_training.sh single_node

Evaluation

To evaluate pretrained checkpoint on ObjectGoal Navigation, download the objectnav_mp3d_v1 dataset from here.

For evaluating a checkpoint on the ObjectGoal Navigation task using the environment based setup:

  1. Use the following script if trained using distributed setup
sbatch job_scripts/run_objectnav_eval.sh habitat_baselines/config/objectnav/il_ddp_objectnav.yaml data/datasets/objectnav_mp3d_v1 /path/to/checkpoint
  1. Use the following script for evaluating single node checkpoint
sbatch job_scripts/run_objectnav_eval.sh habitat_baselines/config/objectnav/il_objectnav.yaml data/datasets/objectnav_mp3d_v1 /path/to/checkpoint

For evaluating the behavior cloning policy on the Pick-and-Place task using the disk based setup:

  1. Use the following script if trained using dristributed setup
sbatch job_scripts/run_pickplace_eval.sh ddp
  1. Use the following script for evaluating single node checkpoint
sbatch job_scripts/run_pickplace_eval.sh single_node

Citation

If you use this code in your research, please consider citing:

@inproceedings{ramrakhya2022,
      title={Habitat-Web: Learning Embodied Object-Search Strategies from Human Demonstrations at Scale},
      author={Ram Ramrakhya and Eric Undersander and Dhruv Batra and Abhishek Das},
      year={2022},
      booktitle={{Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)},
}

About

Code for training embodied agents using imitation learning at scale in Habitat-Lab

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published