What is Nostr?
Greg Clarke 🇨🇦 :mverified: /
npub1p06…x828
2024-02-10 14:31:31

Greg Clarke 🇨🇦 :mverified: on Nostr: Does anyone have a good list of logical questions to judge large language models ...

Does anyone have a good list of logical questions to judge large language models ability to reason?

Questions like "if it takes 3 hours for 3 towels to dry, how long does it take for 9 towels to dry?"

I'm playing around with Mistrals leaked 70b Miqu LLM and want to test it's reasoning skills for a project I'm working on. I've been really impressed so far. It's slower than Mistral & Mixtral but it's been producing the best reasoned answers I've seen from an LLM. And it's running locally!

#LLM #LLMs #Mistral #Miqu #LargeLanguageModels #GPT #ChatGPT
Author Public Key
npub1p06wvmnguqxrtamqg9htprhlfntnt06hcaafeauj5acxyr2f8q9quwx828