Skip to content

Conversation

@vlkale
Copy link
Contributor

@vlkale vlkale commented Jan 29, 2022

This PR contains tile and unroll modifications to the OpenMP target loop on 137-140 in Benchmark_su3.cc.

It contains a version of manual tiling and unrolling and version with using the LLVM/clang OpenMP loop transformation directives 'tile' and 'unroll'.

The unroll factor and the tile size for both versions can be set through a #define at the top of Benchmark_su3.cc.

Note that the OpenMP loop transformations aren't available in other OpenMP C implementations, e.g., Cray-llvm. Also, the loop transformations haven't been added to LLVM/flang OpenMP though there are plans to in the future.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant