o1 style chain of thought with a local Llama 1B model (aka shrek sampler) is mostly ...

2024-10-07 15:15:14

o1 style chain of thought with a local Llama 1B model (aka shrek sampler) is mostly working...👀

quoting nevent1q…ylz8
Lol, did the open-source community accidentally just fix the LLM hallucination problem? 😳

Context:
It seems like the Shrek entropy sampler with early exit significantly reduces, if not outright solves, the hallucination problem with big boy models. Some people are running evaluations now, and so far, it seems promising. 👀
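
For anyone wondering what "entropy sampler" could mean in practice, here is a rough sketch of the general idea: measure how uncertain the model is about the next token, then pick the token greedily when it is confident and sample when it is not. This is not the actual Shrek sampler code. The function names, the single `threshold` value, and the two-branch rule are all assumptions made for illustration, and the early-exit part is not modeled at all.

```python
import math
import random


def softmax(logits):
    """Turn raw logits into a probability distribution (numerically stable)."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def entropy(probs):
    """Shannon entropy in nats; zero-probability entries contribute nothing."""
    return -sum(p * math.log(p) for p in probs if p > 0.0)


def entropy_gated_sample(logits, threshold=1.0, rng=random):
    """Hypothetical rule: greedy when entropy is low, random sampling otherwise.

    `threshold` is an illustrative value, not one taken from any real sampler.
    Returns (token_index, entropy).
    """
    probs = softmax(logits)
    h = entropy(probs)
    if h < threshold:
        # Model is confident: take the most likely token.
        return max(range(len(probs)), key=probs.__getitem__), h
    # Model is uncertain: sample from the full distribution instead.
    idx = rng.choices(range(len(probs)), weights=probs, k=1)[0]
    return idx, h


if __name__ == "__main__":
    print(entropy_gated_sample([10.0, 0.0, 0.0]))      # peaked logits, greedy branch
    print(entropy_gated_sample([0.0, 0.0, 0.0, 0.0]))  # flat logits, sampling branch
```

A real sampler would presumably do something smarter in the uncertain branch than plain sampling; the point here is only that the decision is driven by the entropy of the next-token distribution.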

Author Public Key

npub1cmmswlckn82se7f2jeftl6ll4szlc6zzh8hrjyyfm9vm3t2afr7svqlr6f

Seen on

wss://nos.lol

iefan 🕊️ on Nostr: o1 style chain of thought with a local Llama 1B model (aka shrek sampler) is mostly ...