Commit

updated ccl perf target for llama all gather async
caixunshiren committed Feb 10, 2025
1 parent 503ca1c commit ccd3f98
Showing 1 changed file with 2 additions and 2 deletions.
@@ -112,7 +112,7 @@ def get_core_range_set(output_core_grid):
}
),
ttnn.TILE_LAYOUT,
- 40,
+ 32,
),
( # AllGather after Binary Mult+Silu
ttnn.TensorMemoryLayout.WIDTH_SHARDED,
@@ -123,7 +123,7 @@ def get_core_range_set(output_core_grid):
(32, 160),
get_core_range_set(PREFETCHER_NOC1_RING),
ttnn.TILE_LAYOUT,
- 32,
+ 25,
),
),
)