Mark Pesce /
npub13v8…ac7t
2024-07-24 04:03:46

The new LLaMA 3.1 8B Instruct model just *failed* my basic agent test. Which is unexpected, because the 3.0 model passes with flying colours. So maybe all is not exactly right in Meta-AI-land...

It failed in a way I've seen numerous other models fail - because they can't follow instructions.

Ironic, in an Instruct-tuned model.
Author Public Key
npub13v80f8g3c9rkxnlgrewl23zq0u66cr9qr55gpqk62aksgp8mhk8s07ac7t