A response to the blog post "{n} times faster than C". Our final program achieved a speedup of 128x (36 GiB/s throughput) by reformulating the problem and leveraging SIMD intrinsics.
The title is extremely misleading. They're comparing a Rust program using SIMD intrinsics to a Rust program which doesn't, without even using auto vectorization to let the driver look for SIMD opportunities.
This is a good example of why SIMD is useful and how fast it can be, but it is not an example of how Rust is faster than C or how humans are smarter than compilers. It doesn't even have a C reference source to compare to at all.
This post seems to have taken the title from a previous post that this is built upon. So that is probably why the title gets a bit confusing when viewed standalone.