That you could actually learn through this kind of policy optimization is really an ...

2024-09-10 12:58:06

That you could actually learn through this kind of policy optimization is really an amazing result/insight.

A few of my notes on this and related topics are here:
https://davidmeyer.github.io/ml/pg.pdf. As always, questions/comments/corrections/* greatly appreciated.

#policygradients #reinforcementlearning #machinelearning #math #maths

Author Public Key

npub1eek26lgxr8a5e9xytuhsauxn82j6ytkdu27xeuqv4sz4rtvsw62qjlk3gv

Show more details

David Meyer on Nostr: That you could actually learn through this kind of policy optimization is really an ...