Raul007 on Nostr: "MPT-7B looks to be super competitive across the board, even beats 13B models. This ...
Published at
2023-05-06 11:03:09

Event JSON
{
  "id": "ea644d258cf6c6dac3f95f7bc34473fcc809485bc951758d3871f579e9360a6a",
  "pubkey": "fca3f1847bdda5b29e631edfb8ce0991af688041051dfb6d3bf236880afd1678",
  "created_at": 1683370989,
  "kind": 1,
  "tags": [
    [
      "e",
      "b64cc08de2e740c309cf5915c8dfaad0038bb716f53ad764c31b74a97fd04664"
    ],
    [
      "p",
      "fca3f1847bdda5b29e631edfb8ce0991af688041051dfb6d3bf236880afd1678"
    ],
    [
      "r",
      "https://twitter.com/hardmaru/status/1654790008925220866?t=_eXP4ZcjdMd_hpPLMdAVZA\u0026s=19"
    ]
  ],
  "content": "\"MPT-7B looks to be super competitive across the board, even beats 13B models. This LLM is trained on 1T tokens of text and code curated by MosaicML. The model is fine-tuned to also work with a context length of 65k tokens!\"\n\nhttps://twitter.com/hardmaru/status/1654790008925220866?t=_eXP4ZcjdMd_hpPLMdAVZA\u0026s=19",
  "sig": "b59505548da7ae9a70f1ecd7a50b19ccc0b763f985b68a7acf96afc881256647aefb9bd8e8e448e1b144fcbdb1656400aa5008ec2150763e05782e565cac78c6"
}
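
For reference, the "id" field above is not arbitrary: under Nostr's NIP-01, it is the SHA-256 hash of the event serialized as the compact JSON array [0, pubkey, created_at, kind, tags, content]. A minimal Python sketch of that check follows, with field values copied from the event above; it assumes json.dumps compact escaping matches the relay's NIP-01 serialization, which holds for this event since only newlines and double quotes need escaping.

# Minimal sketch: recompute this Nostr event's id per NIP-01.
# Assumes json.dumps escaping matches NIP-01 serialization
# (true here: only \n and \" occur in the content).
import hashlib
import json

pubkey = "fca3f1847bdda5b29e631edfb8ce0991af688041051dfb6d3bf236880afd1678"
created_at = 1683370989
kind = 1
tags = [
    ["e", "b64cc08de2e740c309cf5915c8dfaad0038bb716f53ad764c31b74a97fd04664"],
    ["p", "fca3f1847bdda5b29e631edfb8ce0991af688041051dfb6d3bf236880afd1678"],
    ["r", "https://twitter.com/hardmaru/status/1654790008925220866?t=_eXP4ZcjdMd_hpPLMdAVZA&s=19"],
]
content = (
    '"MPT-7B looks to be super competitive across the board, even beats '
    "13B models. This LLM is trained on 1T tokens of text and code curated "
    "by MosaicML. The model is fine-tuned to also work with a context "
    'length of 65k tokens!"\n\n'
    "https://twitter.com/hardmaru/status/1654790008925220866?t=_eXP4ZcjdMd_hpPLMdAVZA&s=19"
)

# NIP-01: id = sha256 of [0, pubkey, created_at, kind, tags, content]
# serialized as compact UTF-8 JSON (no whitespace after separators).
serialized = json.dumps(
    [0, pubkey, created_at, kind, tags, content],
    separators=(",", ":"),
    ensure_ascii=False,
)
event_id = hashlib.sha256(serialized.encode("utf-8")).hexdigest()
print(event_id)  # should equal the "id" field shown above

Verifying the "sig" field is a separate step: it is a BIP-340 Schnorr signature over that same id, checked against the "pubkey", and requires a secp256k1 library rather than the standard library alone.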