Author: Shang Ni
Supervisor: Hammadi Nait-Charif

Music-driven motion generation has seen growing interest, while conducting motion remains less explored. We study this task in the context of choral repertoire. Our method combines a phase-based beat cue, which locates each frame within the current beat, with a diffusion model conditioned on musical features, promoting timing consistency and natural upper-body motion. Evaluations on held-out pieces indicate clearer beat alignment and more plausible gestures compared with representative baselines.
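As a rough illustration of the phase-based beat cue described above, the sketch below assigns each motion frame a phase in [0, 1) within its enclosing beat interval. This is a minimal reconstruction of the idea, not the repository's actual implementation; the function and variable names are illustrative only.

```python
import numpy as np

def beat_phase(frame_times, beat_times):
    """For each frame timestamp, return its phase in [0, 1) within the
    current beat, i.e. how far the frame sits between two beat onsets.

    frame_times: (F,) frame timestamps in seconds
    beat_times:  (B,) sorted beat onset times in seconds (B >= 2)
    """
    frame_times = np.asarray(frame_times, dtype=float)
    beat_times = np.asarray(beat_times, dtype=float)
    # Index of the beat onset at or before each frame.
    idx = np.searchsorted(beat_times, frame_times, side="right") - 1
    # Clamp so frames before the first / after the last beat stay valid.
    idx = np.clip(idx, 0, len(beat_times) - 2)
    start = beat_times[idx]
    length = beat_times[idx + 1] - beat_times[idx]
    phase = (frame_times - start) / length
    return np.clip(phase, 0.0, 1.0 - 1e-9)
```

A cue like this can be concatenated with per-frame audio features so the model always knows where each frame falls within the beat cycle.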
We introduce a publicly available corpus for music-driven conducting motion with a focus on choral conducting. The dataset comprises approximately 21.9 hours of professionally recorded conductor performances spanning 663 distinct pieces.
1. Download all `.npy` files and place them into the `demo/` folder:
   🔗 Google Drive Link
2. Download the pretrained model (`.pth`) and put it in the `weight/` folder:
   🔗 Google Drive Link
- SMPL-H (male) → place in `body_models/smplh/`
  🔗 Download Link
- SMPL (neutral) → place in `body_models/smpl/`
  🔗 Download Link
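To catch setup mistakes early, a small check like the one below can verify the asset layout described above before running the demo. The directory names come from this README; the helper itself (`missing_assets`) is hypothetical and not part of the repository.

```python
import os

# Expected asset directories from the setup steps above.
REQUIRED_DIRS = [
    "demo",                 # .npy demo inputs
    "weight",               # pretrained .pth model
    "body_models/smplh",    # SMPL-H (male)
    "body_models/smpl",     # SMPL (neutral)
]

def missing_assets(root="."):
    """Return the required directories that are absent under `root`."""
    return [d for d in REQUIRED_DIRS
            if not os.path.isdir(os.path.join(root, d))]
```

Running `missing_assets()` from the project root should return an empty list once all downloads are in place.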
We evaluate our method on the test set against two baselines (M²S-GAN and Zhao et al.).
Metrics include MSE (lower is better), FGD (lower is better), BC (higher is better), and Diversity (higher is better).
Values are reported as mean ± 95% confidence interval.
| Methods | MSE ↓ | FGD ↓ | BC ↑ | Diversity ↑ |
|---|---|---|---|---|
| Real | 0.000 ± 0.000 | 0.000 ± 0.000 | 0.842 ± 0.018 | 1.210 ± 0.036 |
| M²S-GAN | 1.432 ± 0.095 | 0.921 ± 0.068 | 0.482 ± 0.030 | 1.083 ± 0.041 |
| Zhao et al. | 0.812 ± 0.052 | 0.643 ± 0.059 | 0.553 ± 0.027 | 0.963 ± 0.048 |
| Ours | 0.588 ± 0.040 | 0.587 ± 0.051 | 0.616 ± 0.022 | 1.043 ± 0.044 |
We provide demo videos showcasing music-driven conducting motion generated by our method:
👉 Watch Demo Video
