Compile pass for hipBLASLt #3594
base: develop
Codecov Report
All modified and coverable lines are covered by tests ✅
Additional details and impacted files
@@ Coverage Diff @@
## develop #3594 +/- ##
========================================
Coverage 92.18% 92.18%
========================================
Files 513 513
Lines 21576 21576
========================================
Hits 19889 19889
Misses 1687 1687
☔ View full report in Codecov by Sentry.
4169e71 to 120705e
The Windows build failure should be fixed by 4ff36ae.
…s in addition to throwing an exception
…owing an exception
…ce size when algo is not supported.
4ff36ae to a7196e4
This needs a test. There are many compiler pass tests in directory |
The current code looks good. As Brian mentions, I would like to see some tests. The main reason for this compiler pass is to do output fusions with hipBLASLt, right?
The main reason is to be able to insert accurate workspace allocations after tuning. We can't insert workspace allocation instructions during lowering because we don't know the layouts until after eliminate_contiguous, and we also need to run tuning.
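To make the ordering constraint concrete, here is a minimal sketch of a check that a workspace-insertion pass runs after the steps it depends on. The pass names in `example_pipeline` (other than `eliminate_contiguous`, which is mentioned above) and the `runs_after` helper are hypothetical and only illustrate the dependency; they are not taken from the MIGraphX pipeline.

```cpp
#include <algorithm>
#include <string>
#include <vector>

// Hypothetical pass names, for illustration only.
// This is not the actual MIGraphX pass list.
inline std::vector<std::string> example_pipeline()
{
    return {"lowering", "eliminate_contiguous", "tune_gemm", "insert_gemm_workspace"};
}

// Returns true when `pass` appears in the pipeline strictly after `prerequisite`.
inline bool runs_after(const std::vector<std::string>& pipeline,
                       const std::string& pass,
                       const std::string& prerequisite)
{
    auto pre = std::find(pipeline.begin(), pipeline.end(), prerequisite);
    if(pre == pipeline.end())
        return false;
    return std::find(pre + 1, pipeline.end(), pass) != pipeline.end();
}
```

With this ordering, layouts are final and an algo has been chosen by the time workspace sizes are computed, which is what a separate pass makes possible.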
…olutions is supported for the algo 2. Use default workspace size when returned workspace size is 0, temporary workaround for memory issues with 0 workspace size.
Chris reached out to Bryant and discussed the issue where none of the algos returned by |
A few comments.
Fix CI; everything else LGTM.
This build is not recommended to merge 🔴
🔴 bert_large_uncased_fp16: FAILED: MIGraphX is not within tolerance - check verbose output
This pass allocates the actual workspace size needed by hipBLASLt algos.
It does so by querying the workspace required by the particular algo and allocating memory accordingly.
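The idea can be sketched in a few lines of C++. This is a toy model under stated assumptions, not the code from this PR: `gemm_instruction`, `workspace_query`, `assign_workspace`, and the 2 MiB value of `default_workspace_size` are all hypothetical, and the query callback stands in for whatever library call reports the workspace an algo needs. The fallback to a default size when the query returns 0 mirrors the temporary workaround mentioned in the commit messages above.

```cpp
#include <cstddef>
#include <functional>
#include <vector>

// Hypothetical stand-in for a GEMM instruction in a lowered program.
struct gemm_instruction
{
    int algo_index;             // algo selected by tuning
    std::size_t workspace_size; // bytes to allocate for this GEMM
};

// Assumed fallback size, used when the query reports 0 bytes.
constexpr std::size_t default_workspace_size = 2 * 1024 * 1024;

// Stand-in for the library query "how many bytes does this algo need?".
using workspace_query = std::function<std::size_t(int algo_index)>;

// Walk the program and size each workspace from the selected algo.
inline void assign_workspace(std::vector<gemm_instruction>& program,
                             const workspace_query& query)
{
    for(auto& ins : program)
    {
        const std::size_t needed = query(ins.algo_index);
        ins.workspace_size = (needed == 0) ? default_workspace_size : needed;
    }
}
```

Passing the query as a callback keeps the sketch independent of any real hipBLASLt call and makes the sizing logic easy to test in isolation.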