A 1-bit LLM performs comparably to full-precision Transformer LLMs of the same model size trained on the same number of tokens, while being far more efficient in terms of latency, memory, throughput, and energy consumption.
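For anyone curious what "1-bit" (really 1.58-bit, ternary) weights look like in practice, here's a minimal NumPy sketch of the absmean quantization scheme described in the BitNet b1.58 paper; the function name and usage are mine, not from any released library:

```python
import numpy as np

def absmean_ternary_quantize(w: np.ndarray, eps: float = 1e-8):
    """Quantize a weight tensor to {-1, 0, +1} using its mean absolute value
    as the scale (absmean scheme from the BitNet b1.58 paper)."""
    gamma = np.mean(np.abs(w)) + eps                 # per-tensor scale
    w_ternary = np.clip(np.round(w / gamma), -1, 1)  # round, then clip to {-1, 0, +1}
    return w_ternary.astype(np.int8), gamma          # keep gamma to rescale outputs

# Tiny usage example
rng = np.random.default_rng(0)
w = rng.normal(size=(4, 4)).astype(np.float32)
q, scale = absmean_ternary_quantize(w)
print(q)      # entries are -1, 0, or +1
print(scale)  # single floating-point scale for the whole tensor
```

With weights restricted to -1, 0, and +1, matrix multiplies reduce to additions and subtractions plus one rescale, which is where the latency and energy savings come from.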
Why use lot bit when one bit do trick?
Bits together weak