DeepSeek R1 just got a 2X speed boost, the code for the boost was written by R1 itself!
PR by Xuan-Son Nguyen for `llama.cpp`:

> This PR provides a big jump in speed for WASM by leveraging SIMD instructions for `qX_K_q8_K` and `qX_0_q8_0` dot product functions.
>
> …
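
For readers unfamiliar with what a quantized dot product function does, here is a minimal scalar sketch in C. It is not the code from the PR: the `block_q8` struct, the `dot_q8_q8` function, and the use of a plain `float` scale are simplifying assumptions made for illustration, and the block width of 32 is likewise assumed. The idea is that each block stores small integers plus one scale, so the dot product is an integer multiply-accumulate per block followed by a single floating-point rescale.

```c
#include <stddef.h>
#include <stdint.h>

#define QK 32  /* values per block (assumed for this sketch) */

/* Hypothetical simplified block: one float scale plus QK signed 8-bit values. */
typedef struct {
    float  d;        /* per-block scale */
    int8_t qs[QK];   /* quantized values */
} block_q8;

/* Dot product of two quantized rows, each made of n_blocks blocks. */
static float dot_q8_q8(const block_q8 *x, const block_q8 *y, size_t n_blocks) {
    float sum = 0.0f;
    for (size_t b = 0; b < n_blocks; b++) {
        /* Integer multiply-accumulate over one block. */
        int32_t isum = 0;
        for (int i = 0; i < QK; i++) {
            isum += (int32_t)x[b].qs[i] * (int32_t)y[b].qs[i];
        }
        /* Rescale the integer sum back to floating point once per block. */
        sum += x[b].d * y[b].d * (float)isum;
    }
    return sum;
}
```

The inner integer loop is the natural target for SIMD: a vectorized version would handle several of these 8-bit multiplies per instruction instead of one at a time.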