Skip to content

Conversation

@samsja
Copy link
Member

@samsja samsja commented Mar 1, 2025

Screenshot from 2025-02-28 17-37-54
Screenshot from 2025-02-28 17-37-45

llama 3 8b

./scripts/simulate_multi_node_diloco.sh 1 8 src/zeroband/train.py @ configs/7B/H100.toml --train.micro_bs 4 --no-train.ac_ckpt --data.seq_length 1024 --data.fake --project sami_debug --type_model llama3 --name-model 8B

samsja added 7 commits March 1, 2025 01:29
Signed-off-by: Sami Jaghouar <[email protected]>
Signed-off-by: Sami Jaghouar <[email protected]>
Signed-off-by: Sami Jaghouar <[email protected]>
Signed-off-by: Sami Jaghouar <[email protected]>
Signed-off-by: Sami Jaghouar <[email protected]>
Signed-off-by: Sami Jaghouar <[email protected]>
Signed-off-by: Sami Jaghouar <[email protected]>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants