What is Nostr?
GPTDAOCN-e/acc /
npub1atsโ€ฆ0ns2
2024-10-19 22:11:09

GPTDAOCN-e/acc on Nostr: WOW. Meta just open-sourced a Github repo for LLM Training. Meta Lingua is a minimal ...

WOW. Meta just open-sourced a Github repo for LLM Training.

Meta Lingua is a minimal and fast LLM training and inference library designed for research

๐Ÿ“Š Key features

- Minimal and fast LLM training/inference library for research
- Uses modifiable PyTorch components for experimenting with architectures, losses, data
- Enables end-to-end training, inference, evaluation
- Provides tools for understanding speed and stability
- Structured with core 'lingua' library and 'apps' to showcase usage

๐Ÿš€ Lingua's performance comparison to other models

- 1B models trained on 60B tokens match DCLM (DataComp-LM) baseline performance on many tasks
- 7B models (Mamba, Llama) show strong results on benchmarks like ARC, MMLU, BBH
- Llama 7B squared ReLU 1T tokens model achieves high scores across tasks
Author Public Key
npub1atst8p6wc9xz0aezu7csvqxyrevrnckc2ckpt4q5gsgpthq0n0ese50ns2