-
Notifications
You must be signed in to change notification settings - Fork 104
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
#17972 and #17975 Fixing PCC and Program Cache issues in Repeat and Expand #18002
Conversation
ttnn/cpp/ttnn/operations/data_movement/repeat/device/host/repeat_program_factory.cpp
Outdated
Show resolved
Hide resolved
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I don't think your implementation of program callback is correct. You only need to update the args that actually change, but it seems like you copied most of the program setup itself, which is really expensive.
Another thing is, you can capture a lot more args (eg. cores) from the program setup instead of recalculating everything (unless they are expected to change).
ttnn/cpp/ttnn/operations/data_movement/repeat/device/host/repeat_program_factory.cpp
Outdated
Show resolved
Hide resolved
ttnn/cpp/ttnn/operations/data_movement/repeat/device/host/repeat_program_factory.cpp
Outdated
Show resolved
Hide resolved
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Please add explanation on how does it break repeat calls with the same parameters in a cycle. I currently don't' understand why hash is broken in this case.
ttnn/cpp/ttnn/operations/data_movement/repeat/device/host/repeat_program_factory.cpp
Outdated
Show resolved
Hide resolved
…enstorrent/tt-metal into jvega/fixing_repeat_regression
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM, thanks!
Ticket
#17975
#17972
Problem description
This PR closes two P0 errors by applying bug fixes to the repeat program factory and giving repeat program cache support.
What's changed
Program factory changes to repeat
Adding Program Cache testing to the CI pipelines for repeat
Removed redundant CI tests in Repeat to help improve CI pipeline times
Checklist