Vladimir Savić /
npub16gw…5mgn
2024-09-12 11:16:49

"We introduce phi-1, a new large language model for code, with significantly smaller size than competing models: phi-1 is a Transformer-based model with 1.3B parameters, trained for 4 days on 8 A100s, using a selection of 'textbook quality' data from the web (6B tokens) and synthetically generated textbooks and exercises with GPT-3.5 (1B tokens)."

Textbooks Are All You Need: https://arxiv.org/pdf/2306.11644 #AI #GenAI #LLM #compsci
Author Public Key
npub16gwdrptcxzppxyx4vmzza3l4kl9xg8qxs29y64w0g6wurqnms80sv45mgn