Collaborator

@martin-frbg martin-frbg commented Aug 18, 2025

eventually fixes #5414

#define C6 x22 //Constant6: N*SVLs
#define C2 x19 //Constant2: N + SVLs
#define C3 x20 //Constant3: K*SVLs + SVLs
#define C4 x21 //Constant4: SVLs-2
Contributor

Changing x20 to x21 requires the following dependent changes:
At line 65: sub w21, w21, #2
At line 202: cmp w13, w21

Collaborator Author

@martin-frbg martin-frbg Aug 20, 2025

Oops, sorry, I had already corrected this locally but pushed the wrong version. Unfortunately, this correction has no effect on the wrong xscblat3 test results seen for odd M. Contrary to my expectations, this PR also does not fix the divergence between SGEMM and SGEMMT seen in test_sgemmt of utest/openblas_utest_ext that was flagged in #5414.
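
For readers unfamiliar with what an SGEMM/SGEMMT divergence means, here is a minimal sketch of such a comparison. It is not the code from utest/openblas_utest_ext, and it does not call OpenBLAS at all. It assumes column-major storage and the upper-triangle case, and every function name in it (ref_sgemm, ref_sgemmt_upper, count_upper_mismatches) is hypothetical. GEMMT computes the same product as GEMM but writes only one triangle of a square C, so the two results should agree on that triangle.

```c
#include <math.h>
#include <stdlib.h>

/* Naive column-major reference: C = alpha*A*B + beta*C,
 * where A is m x k, B is k x n and C is m x n. */
static void ref_sgemm(int m, int n, int k, float alpha,
                      const float *a, const float *b, float beta, float *c)
{
    for (int j = 0; j < n; j++) {
        for (int i = 0; i < m; i++) {
            float acc = 0.0f;
            for (int p = 0; p < k; p++)
                acc += a[i + p * m] * b[p + j * k];
            c[i + j * m] = alpha * acc + beta * c[i + j * m];
        }
    }
}

/* Same product for a square n x n C, but only entries with i <= j
 * (the upper triangle) are written; the rest of C is left untouched. */
static void ref_sgemmt_upper(int n, int k, float alpha,
                             const float *a, const float *b, float beta, float *c)
{
    for (int j = 0; j < n; j++) {
        for (int i = 0; i <= j; i++) {
            float acc = 0.0f;
            for (int p = 0; p < k; p++)
                acc += a[i + p * n] * b[p + j * k];
            c[i + j * n] = alpha * acc + beta * c[i + j * n];
        }
    }
}

/* Hypothetical helper: runs both references on identical inputs and counts
 * upper-triangle entries that differ by more than tol.
 * Returns -1 if allocation fails. */
int count_upper_mismatches(int n, int k, float tol)
{
    size_t na = (size_t)n * (size_t)k;   /* elements in A (n x k) and B (k x n) */
    size_t nc = (size_t)n * (size_t)n;   /* elements in C (n x n) */
    float *a = malloc(na * sizeof *a);
    float *b = malloc(na * sizeof *b);
    float *c_full = malloc(nc * sizeof *c_full);
    float *c_tri = malloc(nc * sizeof *c_tri);
    int bad = -1;

    if (a && b && c_full && c_tri) {
        /* Small integer-valued inputs keep the float arithmetic exact. */
        for (size_t i = 0; i < na; i++) {
            a[i] = (float)((int)(i % 7) - 3);
            b[i] = (float)((int)(i % 5) - 2);
        }
        for (size_t i = 0; i < nc; i++)
            c_full[i] = c_tri[i] = (float)(i % 3);

        ref_sgemm(n, n, k, 2.0f, a, b, 1.0f, c_full);
        ref_sgemmt_upper(n, k, 2.0f, a, b, 1.0f, c_tri);

        bad = 0;
        for (int j = 0; j < n; j++)
            for (int i = 0; i <= j; i++)
                if (fabsf(c_full[i + j * n] - c_tri[i + j * n]) > tol)
                    bad++;
    }
    free(a);
    free(b);
    free(c_full);
    free(c_tri);
    return bad;
}
```

Calling count_upper_mismatches(5, 3, 1e-6f) exercises an odd matrix size. To investigate a real kernel, one of the two reference routines would be swapped for the library call under test.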

Development

Successfully merging this pull request may close these issues.

test_extensions/test_sgemmt.c fails with SME on Apple M4
