Updated SGEMV ramps. #5551

almayne · 2025-11-24T14:56:27Z

Significant performance improvements are gained by the proposed changes on both c7g (NEOVERSEV1) and c8g (NEOVERSEV2) instances. To reproduce these values you need to run with OMP_ADAPTIVE=1. The plots below show the average time taken for 10000 iterations of sgemv operations on increasing square matrix/vector sizes, from 2x2 through to 1024x1024. The x axis reaches 2046 as we first run sgemv without transposition, then with. I've also include plots of the ratio, relative to the original stats (lower is better). These generate the following stats:
Geometric mean for c7g_sgemv.txt: 0.890437968914142
Geometric mean for c8g_sgemv.txt: 0.7884951978206536

aditew01 · 2025-11-25T12:04:49Z

cc: @martin-frbg @Mousius

aditew01 · 2025-11-25T12:15:43Z

interface/gemv.c

          : (MN < 1050625L)   ? MIN(ncpu, 40)
          : ncpu;
  #else
-      return (MN < 25600L)     ? 1


do we want to guard it for NEOVERSEV2 / NEOVERSEN2?

Discussed offline, happy that this is done by the calling function. No changes needed.

Updated SGEMV ramps.

8da0a1f

aditew01 reviewed Nov 25, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Updated SGEMV ramps. #5551

Updated SGEMV ramps. #5551

almayne commented Nov 24, 2025

Uh oh!

aditew01 commented Nov 25, 2025

Uh oh!

aditew01 Nov 25, 2025 •

edited

Loading

Uh oh!

almayne Nov 25, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Updated SGEMV ramps. #5551

Are you sure you want to change the base?

Updated SGEMV ramps. #5551

Conversation

almayne commented Nov 24, 2025

Uh oh!

aditew01 commented Nov 25, 2025

Uh oh!

aditew01 Nov 25, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

almayne Nov 25, 2025

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

aditew01 Nov 25, 2025 •

edited

Loading