Currently writing an implementation of the 1-bit quantization linear layer from this ...

2024-02-29 23:02:55

Currently writing an implementation of the 1-bit quantization linear layer from this paper. Fresh trained LLM costs go to ~zero if this works at scale. https://arxiv.org/pdf/2402.17764.pdf

Author Public Key

npub1ys0mgpapdctxw3yw7f6cgvg2kn7smlgepmhm07edjsz5k7pppk4s6mea0t

Show more details

gary on Nostr: Currently writing an implementation of the 1-bit quantization linear layer from this ...