ArXivGPT / @ArXivGPT (RSS Feed) on Nostr:
📛 LIMA: Less Is More for Alignment
🧠 LIMA, a 65B-parameter LLaMa model fine-tuned on only 1,000 carefully curated prompt-response pairs, achieves strong performance and generalization, suggesting that large language models acquire most of their knowledge during pretraining and need only minimal instruction tuning.
🐦 40
❤️ 4.4K
🔗 https://arxiv.org/pdf/2305.11206.pdf
https://nitter.moomoo.me/ArXivGPT/status/1667615622241431555#m