Skip to content

Sync quant dialect changes 20x #104

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Closed

Conversation

dbudii
Copy link

@dbudii dbudii commented Mar 7, 2025

Summary

Upon the Sub-channel quantized type Pull Request being blocked, we decided that synchronizing the quant dialect changes from llvm-project upstream can reduce the effort for the upcoming LLVM update to our fork. The changes introduced are cherry-picked from llvm/llvm-project#100667 and as an addition custom npu patches have been applied.
With this early integration, we aim to have that Pull Request unblocked and continue the integration of the new data type to our fork.

JIRA ticket

EISW-159244

Related PR in NPU Compiler and/or OpenVINO repository with sub-module update

  • PR-xxx

Other related tickets

List tickets for additional work, eg, something was found during review but you agreed to address it in another Jira

  • E-xxxxx

@dbudii dbudii requested a review from a team as a code owner March 7, 2025 13:02
@dbudii
Copy link
Author

dbudii commented Mar 7, 2025

@nikita-kud can you please provide feedback on this new change?

Copy link
Contributor

@nikita-kud nikita-kud left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@nikita-kud can you please provide feedback on this new change?

In fact, I've already done that in Teams..

Zoran:

What we propose is for us to firstly bring the bigger quant dialect refactoring https://github.com/llvm/llvm-project/pull/100667/files into our LLVM fork https://jira.devtools.intel.com/browse/EISW-159244
and after that it we'll apply the sub-channel quantization changes as it s done on the public llvm repo.

The problem for me is that this refactoring is not part of LLVM19. So you have to merge it twice, for LLVM18 and then move this patch to LLVM19. Unless we skip version 19 and move to 20 and we discuss this option above:

Gelb:

Well I agree, more commits = more difficult to track down to a specific one that breaks something

Zoran:

This will simplify the llvm version update wrt to the quant dialect;

So, on the one hand, I agree, on the other hand, I believe we should first upgrade LLVM to version 19.

@nikita-kud nikita-kud self-requested a review March 7, 2025 17:38
@dbudii dbudii closed this Mar 28, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants