forked from madgraph5/madgraph4gpu
-
Notifications
You must be signed in to change notification settings - Fork 0
Test ARM patches in the CI #1
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Open
valassi
wants to merge
27
commits into
master
Choose a base branch
from
armdmav
base: master
Could not load branches
Branch not found: {{ refName }}
Loading
Could not load tags
Nothing to show
Loading
Are you sure you want to change the base?
Some commits from the old base branch may be removed from the timeline,
and old review comments may become outdated.
Open
Conversation
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
…where (with Andrea Valassi)
…ests on aarch64 (with DanieleM) This fixes a hang in the testMisc tests on aarch64 in sqrtNewtonRaphson (madgraph5#1064) (testMisc -> constexpr_tan -> constexpr_tan_quad -> constexpr_cos_quad -> constexpr_sqrt -> sqrtNewtonRaphson) It uses the same workaround previously adopted for avoiding testMisc hangs when running valgrind (madgraph5#906)
…ts on aarch64 (with DanieleM) This fixes a hang in the testMisc tests on aarch64 in sqrtNewtonRaphson (madgraph5#1064) (testMisc -> constexpr_tan -> constexpr_tan_quad -> constexpr_cos_quad -> constexpr_sqrt -> sqrtNewtonRaphson) It uses the same workaround previously adopted for avoiding testMisc hangs when running valgrind (madgraph5#906)
…(with DanieleM)
Remove the custom __ARM_NEON__ with two extra underscores
Use 'g++ -march=armv8.2-a+simd -E -dM - < /dev/null | grep ARM' to check
Results on lxplus-arm:
for avx in none sse4; do ./build.${avx}_m_inl0_hrd0/check_cpp.exe -p 1024 256 1 \
| \egrep '(EvtsPerSec\[MECalcOnly\]|MeanMatrixElemValue|fptype_sv)'; done
Internal loops fptype_sv = SCALAR ('none': ~vector[1], no SIMD)
EvtsPerSec[MECalcOnly] (3a) = ( 2.425635e+05 ) sec^-1
MeanMatrixElemValue = ( 2.080788e+00 +- 6.803789e-03 ) GeV^0
Internal loops fptype_sv = VECTOR[2] ('sse4': ARM NEON, 128bit) [cxtype_ref=YES]
EvtsPerSec[MECalcOnly] (3a) = ( 3.261666e+05 ) sec^-1
MeanMatrixElemValue = ( 2.080788e+00 +- 6.803789e-03 ) GeV^0
…ith DanieleM) Remove the custom __ARM_NEON__ with two extra underscores Use 'g++ -march=armv8.2-a+simd -E -dM - < /dev/null | grep ARM' to check
…ple (with DanieleM)
Results on an Apple M1 (thanks Olivier!)
for avx in none sse4; do ./build.${avx}_m_inl0_hrd0/check_cpp.exe -p 1024 256 1
| \egrep '(EvtsPerSec\[MECalcOnly\]|MeanMatrixElemValue|fptype_sv)'; done
Internal loops fptype_sv = SCALAR ('none': ~vector[1], no SIMD)
EvtsPerSec[MECalcOnly] (3a) = ( 5.535410e+05 ) sec^-1
MeanMatrixElemValue = ( 2.080788e+00 +- 6.803789e-03 ) GeV^0
Internal loops fptype_sv = VECTOR[2] ('sse4': ARM NEON, 128bit) [cxtype_ref=NO]
EvtsPerSec[MECalcOnly] (3a) = ( 8.327442e+05 ) sec^-1
MeanMatrixElemValue = ( 2.080788e+00 +- 6.803789e-03 ) GeV^0
…e (with DanieleM)
…ts on aarch64 (with DanieleM) This fixes a hang in the testMisc tests on aarch64 in sqrtNewtonRaphson (madgraph5#1064) (testMisc -> constexpr_tan -> constexpr_tan_quad -> constexpr_cos_quad -> constexpr_sqrt -> sqrtNewtonRaphson) It uses the same workaround previously adopted for avoiding testMisc hangs when running valgrind (madgraph5#906)
…ith DanieleM) Remove the custom __ARM_NEON__ with two extra underscores Use 'g++ -march=armv8.2-a+simd -E -dM - < /dev/null | grep ARM' to check
…e (with DanieleM)
…ODEGEN commits Revert "[fix-arm-support] gg_tt.mad: undefine__ARM_NEON for cppnone on arm/apple (with DanieleM)" This reverts commit d427fcf. Revert "[fix-arm-support] gg_tt.mad: use builtin __ARM_NEON for aarch64 simd (with DanieleM)" This reverts commit 82645d2. Revert "[fix-arm-support] gg_tt.mad: use higher tolerance for constexpr_tan tests on aarch64 (with DanieleM)" This reverts commit 625bca8. Revert "Regenerate processes" This reverts commit 1e1c899.
…ts (#2) on aarch64 (with DanieleM)
…_tan tests (#2) on aarch64 (with DanieleM)
…using constexpr_sqrt on aarch64 (with DanieleM)
…l tests using constexpr_sqrt on aarch64 (with Danie>
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
No description provided.