Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Cleanup CUDA, Reuse Memory, Add Serial Model, Cleaup Std Parallelism #202

Open
wants to merge 14 commits into
base: develop
Choose a base branch
from
Open
Prev Previous commit
Next Next commit
Arm toolchains prefer -mcpu=native
gonzalobg committed Aug 14, 2024

Unverified

No user is associated with the committer email.
commit 084ef3bf724231e97b5d143d93981441f4c5efef
4 changes: 4 additions & 0 deletions CMakeLists.txt
Original file line number Diff line number Diff line change
@@ -50,7 +50,11 @@ option(BUILD_NATIVE "Builds for the current systems CPU and GPU architecture." O
set(DEFAULT_DEBUG_FLAGS -O2 -fno-omit-frame-pointer)
set(DEFAULT_RELEASE_FLAGS -O3)
if (BUILD_NATIVE)
if(CMAKE_SYSTEM_PROCESSOR STREQUAL aarch64)
set(DEFAULT_RELEASE_FLAGS ${DEFAULT_RELEASE_FLAGS} -mcpu=native)
else()
set(DEFAULT_RELEASE_FLAGS ${DEFAULT_RELEASE_FLAGS} -march=native)
endif()
endif()

macro(hint_flag FLAG DESCRIPTION)