iefan 🕊️ on Nostr: o1 style chain of thought with a local Llama 1B model (aka shrek sampler) is mostly ...
o1 style chain of thought with a local Llama 1B model (aka shrek sampler) is mostly working...👀
quoting nevent1q…ylz8Lol, did the open-source community accidentally just fix the LLM hallucination problem? 😳
Context:
It seems like the Shrek entropy sampler with early exit solves, if not significantly reduces, the hallucination problem with big boy models. Some people are running evaluations now, and so far, it seems promising. 👀