Skip to content
Snippets Groups Projects
Verified Commit 2cbccd73 authored by Daniel Reinert's avatar Daniel Reinert :grimacing: Committed by Yen-Chen Chen
Browse files

Performance optimization for NEC SX AURORA (!97)


The subroutines `init_zero_4d_[dp,sp,i4]` do not vectorize properly on NEC SX AURORA.

Loop collapsing is enforced by a compiler directive for the subroutines `init_zero_4d_[dp,sp,i4]`, in order to ensure proper vectorization on NEC SX AURORA.

Approved-by: default avatarYen-Chen Chen <yen-chen.chen@kit.edu>
Merged-by: default avatarYen-Chen Chen <yen-chen.chen@kit.edu>
Changelog: feature
parent 0c423645
No related branches found
No related tags found
No related merge requests found
Loading
0% Loading or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment