filip / Filip
npub1sg6…56mg
2025-01-28 12:18:00

Great summary of how DeepSeek's reinforcement learning algorithm accomplishes model training.

It's also easy to see how an open-source model and APIs can be a game changer for building custom agents and workflows, and for the AI startup industry in general:
https://youtu.be/sGUjmyfof4Q?si=nUIFIYpsBRDCprhm
Author Public Key
npub1sg6j4yu5ah628xkf25ar2fgh87p8rzl2ke9jwyx7sxyqh3t9efkqm756mg