What is Nostr?
David Meyer /
npub1eek…k3gv
2024-09-10 12:58:06

David Meyer on Nostr: That you could actually learn through this kind of policy optimization is really an ...

That you could actually learn through this kind of policy optimization is really an amazing result/insight.

A few of my notes on this and related topics are here:
https://davidmeyer.github.io/ml/pg.pdf. As always, questions/comments/corrections/* greatly appreciated.

#policygradients #reinforcementlearning #machinelearning #math #maths

Author Public Key
npub1eek26lgxr8a5e9xytuhsauxn82j6ytkdu27xeuqv4sz4rtvsw62qjlk3gv