-
Notifications
You must be signed in to change notification settings - Fork 60
Add ONNX Sub Functions Export Feature for AutoModelForCausalLM #621
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
92f320c to
acec54f
Compare
|
Let's keep it uniform. Can we rename |
done. |
Please modify the PR commit message and desp. accordingly. thanks |
ochougul
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
review WIP.
Signed-off-by: abhishek-singh591 <[email protected]>
Signed-off-by: abhishek-singh591 <[email protected]>
Signed-off-by: abhishek-singh591 <[email protected]>
Fix for this JIRA from Imagine team Signed-off-by: Ann Kuruvilla <[email protected]> Signed-off-by: abhishek-singh591 <[email protected]>
Signed-off-by: abhishek-singh591 <[email protected]>
65d24bc to
6daa209
Compare
|
@abhishek-singh591 please rebase the PR |
vbaddi
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM, thanks 👍
Signed-off-by: abhishek-singh591 <[email protected]>
Signed-off-by: abhishek-singh591 <[email protected]>
ochougul
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Approving. Add todo for CustomOpTransform and merge once CI is passing.
Signed-off-by: abhishek-singh591 <[email protected]>
Signed-off-by: abhishek-singh591 <[email protected]>
ONNX Functions Export Support
Overview
This PR introduces support for exporting ONNX modules as functions, enabling more efficient model compilation and execution on hardware.
Key Changes
use_onnx_subfunctionsto control ONNX function export behavior.How to Enable ONNX Function Export
Set the flag before running inference (either during export or compile):
Backward Compatibility
This feature is opt-in and requires an explicit environment variable. Existing workflows remain unaffected when the flag is disabled.