Modern ARM processors, including Apple silicon, support sdot/udot instructions which could improve many of your NNUE operations, also overriding Simd::neon_m128_add_dpbusd_epi32x2 could help.
Any ideas how to make it work?
Usage of sdot/udot in ARM · Issue #4193 · official-stockfish/Stockfish
Modern ARM processors, including Apple silicon, support sdot/udot instructions which could improve many of your NNUE operations, also overriding Simd::neon_m128_add_dpbusd_epi32x2 could help. Infor...
github.com
Any ideas how to make it work?