
Appletoni

Suspended
Original poster
Mar 26, 2021
443
177
Modern ARM processors, including Apple silicon, support the sdot/udot instructions, which could speed up many of your NNUE operations. Overriding Simd::neon_m128_add_dpbusd_epi32x2 could also help.

Any ideas how to make it work?
 

Nygaard

macrumors member
Dec 7, 2022
47
20
Houston
What do you mean? If the processor supports those instructions, you simply use them. The compiler will automatically use them in obvious situations (assuming it knows about the instructions). Otherwise, you can write assembly/intrinsics and directly use them yourself.
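To make the intrinsics route concrete, here is a minimal sketch in C of an int8 dot product that uses the NEON `vdotq_s32` intrinsic (which maps to sdot) when the compiler reports dot-product support, and falls back to a plain loop otherwise. The function name `dot_i8` and the `HAVE_NEON_DOTPROD` macro are made up for illustration and are not taken from any engine's source. Note that sdot multiplies signed bytes by signed bytes, so this is not a drop-in replacement for an unsigned-by-signed operation like dpbusd.

```c
#include <stddef.h>
#include <stdint.h>

/* The compiler defines __ARM_FEATURE_DOTPROD when sdot/udot are available
   for the selected target. HAVE_NEON_DOTPROD is a hypothetical local macro. */
#if defined(__aarch64__) && defined(__ARM_FEATURE_DOTPROD)
#include <arm_neon.h>
#define HAVE_NEON_DOTPROD 1
#else
#define HAVE_NEON_DOTPROD 0
#endif

/* Hypothetical helper: dot product of two int8 arrays, accumulated in int32. */
int32_t dot_i8(const int8_t *a, const int8_t *b, size_t n)
{
    int32_t sum = 0;
    size_t i = 0;

#if HAVE_NEON_DOTPROD
    /* Four int32 lanes; each lane accumulates the products of 4 byte pairs. */
    int32x4_t acc = vdupq_n_s32(0);
    for (; i + 16 <= n; i += 16) {
        acc = vdotq_s32(acc, vld1q_s8(a + i), vld1q_s8(b + i));
    }
    /* Horizontal add of the four lanes. */
    sum = vaddvq_s32(acc);
#endif

    /* Scalar tail, and the whole loop when dot-product support is absent. */
    for (; i < n; ++i) {
        sum += (int32_t)a[i] * (int32_t)b[i];
    }
    return sum;
}
```

On toolchains where the dot-product extension is not enabled by default, a flag such as `-march=armv8.2-a+dotprod` is typically needed for the NEON path to be compiled in; without it, the code above still builds and simply runs the scalar loop.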
 