fp16 -> whether to train in 16-bit floating-point (mixed) precision
wandb_project -> name of the Weights & Biases (wandb) project to log to; useful for monitoring training metrics
num_workers -> number of worker processes the dataloader uses to load data in parallel
max_tokens -> the maximum number of tokens per batch that fits on the GPU. For raw audio, one token is one sample, so at a 16 kHz sampling rate 3200000 tokens / 16000 samples per second = 200 seconds (~3.3 minutes) of audio per batch
max_updates -> maximum number of optimizer updates (steps) to train for
lr -> the learning rate to use during training
update_freq -> number of batches over which gradients are accumulated before each optimizer step. In a distributed environment, set it to (number of GPUs you want to simulate) / (number of GPUs you actually have); for example, update_freq = 4 on 2 GPUs simulates training on 8 GPUs
optimizer -> the optimizer to use for training (e.g., adam)
w2v_path -> path to the pretrained wav2vec 2.0 checkpoint to fine-tune
mask_prob -> probability with which time steps of the latent speech representations are masked during fine-tuning
freeze_finetune_updates -> number of initial updates during which the entire network except the final (output) layer is kept frozen
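
For reference, here is a minimal sketch of how these options might be laid out in a fairseq Hydra fine-tuning config. The key names and grouping follow fairseq's wav2vec 2.0 fine-tuning configs (e.g., base_100h.yaml); all values below are illustrative placeholders, not recommendations:

```yaml
# Illustrative fragment only; adapt values to your data and hardware.
common:
  fp16: true                       # 16-bit mixed-precision training
  wandb_project: my-w2v2-finetune  # hypothetical project name

dataset:
  num_workers: 6
  max_tokens: 3200000              # audio samples per batch: 3200000 / 16000 Hz = 200 s

optimization:
  max_update: 80000
  lr: [0.00005]
  update_freq: [4]                 # accumulate 4 batches: 2 GPUs x 4 ~ 8-GPU training

optimizer:
  _name: adam

model:
  w2v_path: /path/to/wav2vec_small.pt  # pretrained checkpoint (placeholder path)
  mask_prob: 0.65
  freeze_finetune_updates: 10000       # train only the output layer for the first 10k updates
```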