Vladimir Savić on Nostr:
"We introduce phi-1, a new large language model for code, with significantly smaller size than competing models: phi-1 is a Transformer-based model with 1.3B parameters, trained for 4 days on 8 A100s, using a selection of 'textbook quality' data from the web (6B tokens) and synthetically generated textbooks and exercises with GPT-3.5 (1B tokens)."
Textbooks are all you need
https://arxiv.org/pdf/2306.11644 #AI #GenAI #LLM #compsci
Published at 2024-09-12 11:16:49 (UTC)

Event JSON
{
  "id": "d34c9fe0d6eb9d898696129b7b9a87bddf917ff27980bd1e76615413500cbe06",
  "pubkey": "d21cd1857830821310d566c42ec7f5b7ca641c06828a4d55cf469dc1827b81df",
  "created_at": 1726139809,
  "kind": 1,
  "tags": [
    ["t", "ai"],
    ["t", "genai"],
    ["t", "llm"],
    ["t", "compsci"],
    ["proxy", "https://mastodon.social/users/firusvg/statuses/113124298574517575", "activitypub"]
  ],
  "content": "\"We introduce phi-1, a new large language model for code, with significantly smaller size than competing models: phi-1 is a Transformer-based model with 1.3B parameters, trained for 4 days on 8 A100s, using a selection of 'textbook quality' data from the web (6B tokens) and synthetically generated textbooks and exercises with GPT-3.5 (1B tokens).\" \n\nTextbooks are all you need https://arxiv.org/pdf/2306.11644 #AI #GenAI #LLM #compsci",
  "sig": "952914dcdcb287cf37ece8a5ddd19de5427bbcc1f551a82cf25070e535bfae2d106b0cb537dfa21b386014e6d4328463e8a10384665afc763a240af26750c28b"
}
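For reference, the "id" above is not arbitrary: per Nostr's NIP-01, it is the SHA-256 hash of the UTF-8 JSON serialization of the array [0, pubkey, created_at, kind, tags, content] with no insignificant whitespace, and "sig" is a BIP-340 Schnorr signature over that id, verifiable against "pubkey". A minimal Python sketch of the id derivation, using only the standard library, with the field values copied from the event above:

import hashlib
import json

pubkey = "d21cd1857830821310d566c42ec7f5b7ca641c06828a4d55cf469dc1827b81df"
created_at = 1726139809  # Unix time: 2024-09-12 11:16:49 UTC
kind = 1
tags = [
    ["t", "ai"],
    ["t", "genai"],
    ["t", "llm"],
    ["t", "compsci"],
    ["proxy",
     "https://mastodon.social/users/firusvg/statuses/113124298574517575",
     "activitypub"],
]
content = (
    "\"We introduce phi-1, a new large language model for code, with "
    "significantly smaller size than competing models: phi-1 is a "
    "Transformer-based model with 1.3B parameters, trained for 4 days on "
    "8 A100s, using a selection of 'textbook quality' data from the web "
    "(6B tokens) and synthetically generated textbooks and exercises with "
    "GPT-3.5 (1B tokens).\" \n\n"
    "Textbooks are all you need https://arxiv.org/pdf/2306.11644 "
    "#AI #GenAI #LLM #compsci"
)

# NIP-01: id = sha256 of the compact JSON array
# [0, pubkey, created_at, kind, tags, content], encoded as UTF-8.
# json.dumps with compact separators and ensure_ascii=False matches the
# NIP-01 serialization for content like this (only " and \n need escaping).
serialized = json.dumps(
    [0, pubkey, created_at, kind, tags, content],
    separators=(",", ":"),
    ensure_ascii=False,
)
event_id = hashlib.sha256(serialized.encode("utf-8")).hexdigest()
print(event_id)  # should reproduce the "id" field of the event above

The "proxy" tag records that this note was bridged from another protocol; here it points at the original ActivityPub status on mastodon.social.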