Zerui Chen1 Shizhe Chen1 Etienne Arlaud1 Ivan Laptev2 Cordelia Schmid1
1WILLOW, INRIA Paris, France
2MBZUAI
This is the implementation, in the SAPIEN simulator, of ViViDex, a novel system for learning dexterous manipulation skills from human videos:

git clone https://github.com/zerchen/vividex_sapien.git
conda create -n rl python=3.10
conda activate rl
conda install pytorch==2.4.1 torchvision==0.19.1 torchaudio==2.4.1 pytorch-cuda=12.1 -c pytorch -c nvidia
pip install -r requirements.txt
cd tools
# Train the state-based policy
python train.py env.name=seq_name env.norm_traj=True

Available seq_name values can be found at: norm_trajectories. You can also download trained checkpoints here and check their config files for reference. Once the state-based policies are trained, roll them out with generate_expert_trajs.py, then train the visual policy with imitate_train.py using either BC or the diffusion policy.
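The distillation step above amounts to behavior cloning on the rolled-out expert trajectories: regress actions from observations with a supervised loss. A minimal sketch of that idea in plain NumPy (this is illustrative only, not the repository's imitate_train.py; the linear policy, dimensions, and synthetic "expert" data are all assumptions):

```python
import numpy as np

# Illustrative behavior cloning: fit a linear policy a = W @ s
# to (state, action) pairs, standing in for the trajectories that
# generate_expert_trajs.py would produce. All sizes are made up.
rng = np.random.default_rng(0)
obs_dim, act_dim, n = 16, 6, 512

W_expert = rng.normal(size=(act_dim, obs_dim))   # hypothetical expert policy
states = rng.normal(size=(n, obs_dim))           # rolled-out observations
actions = states @ W_expert.T                    # expert action labels

# Gradient descent on the mean-squared BC loss.
W = np.zeros((act_dim, obs_dim))
lr = 0.05
for step in range(300):
    err = states @ W.T - actions                 # (n, act_dim) residuals
    grad = err.T @ states / n                    # d(MSE)/dW
    W -= lr * grad

mse = float(np.mean((states @ W.T - actions) ** 2))
print(f"final BC loss: {mse:.2e}")
```

In the actual pipeline the policy is a neural network consuming visual observations, and the diffusion-policy variant replaces the direct regression with a learned denoising process over actions, but the supervised imitation objective is the same.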
Please refer to our UR5 ROS code and Allegro hand ROS code as examples for setting up the real-robot experiments.
Parts of the code are based on DexArt, DexPoint and 3D-Diffusion-Policy. We thank the authors for sharing their excellent work!
If you find ViViDex useful for your research, please consider citing our paper:
@inproceedings{chen2025vividex,
title={{ViViDex}: Learning Vision-based Dexterous Manipulation from Human Videos},
author={Chen, Zerui and Chen, Shizhe and Arlaud, Etienne and Laptev, Ivan and Schmid, Cordelia},
booktitle={ICRA},
year={2025}
}