Brandon Rohrer /
npub1jh4…eag0
2024-06-25 15:16:32
in reply to nevent1q…q7c9

This is the open secret of reinforcement learning. Sure, there are methods that can optimize against arbitrary reward functions, but the process of choosing a reward function to get the behavior you want is the darkest of arts.
Author Public Key
npub1jh4qsxnz0nhyfefjsfvcdmxxvgfe6p5vf0dvh6pq4r6ytwwxcp4sl9eag0