Good chance I will soon :). Likely some ASM because supporting the cross-arch SIMD instructions is a pain in itself. AVX is pretty old now
i'm sure i could do it given a month time window, i have reasonable familiarity with assembler syntax in general, and have done a lot of work with iterative operations just haven't had the time to actually sit down and do it, if you get going on it, i'm keen to follow and if i can, help out i'm already ridiculously curious about what AVX assembler ops look like