2024-06-25 15:16:32

This is the open secret of reinforcement learning. Sure, there are methods that can optimize against arbitrary reward functions, but the process of choosing a reward function to get the behavior you want is the darkest of arts.

Author Public Key

npub1jh4qsxnz0nhyfefjsfvcdmxxvgfe6p5vf0dvh6pq4r6ytwwxcp4sl9eag0

Seen on

wss://relay.nostr.band

Brandon Rohrer on Nostr