-
Quency Lin authored
Use blend instead of set and extract to rearrange, faster Use 256 SIMD for every 2 RB, use 128 SIMD for ARM and last RB
fcb23b44
Use blend instead of set and extract to rearrange, faster Use 256 SIMD for every 2 RB, use 128 SIMD for ARM and last RB