Greg Clarke 🇨🇦 :mverified: on Nostr: Does anyone have a good list of logical questions to judge large language models ...
Does anyone have a good list of logical questions to judge large language models ability to reason?
Questions like "if it takes 3 hours for 3 towels to dry, how long does it take for 9 towels to dry?"
I'm playing around with Mistrals leaked 70b Miqu LLM and want to test it's reasoning skills for a project I'm working on. I've been really impressed so far. It's slower than Mistral & Mixtral but it's been producing the best reasoned answers I've seen from an LLM. And it's running locally!
#LLM #LLMs #Mistral #Miqu #LargeLanguageModels #GPT #ChatGPT
Questions like "if it takes 3 hours for 3 towels to dry, how long does it take for 9 towels to dry?"
I'm playing around with Mistrals leaked 70b Miqu LLM and want to test it's reasoning skills for a project I'm working on. I've been really impressed so far. It's slower than Mistral & Mixtral but it's been producing the best reasoned answers I've seen from an LLM. And it's running locally!
#LLM #LLMs #Mistral #Miqu #LargeLanguageModels #GPT #ChatGPT