mark tyler /
npub1nwh…409r
2023-04-19 01:58:03
in reply to nevent1q…x8r8

Have you seen the stuff about how RLHF makes it dumber? That’s what makes me suspect that being a generalist is actually the unexpected path toward specialized ability.

Like… what it really needs is to be trained on everything, and then to memorize domain facts by including them in the prompt. For that, they need to release models that can have prompts in the 100k to multi-million token range.

Here’s an interesting exercise: how many characters long would the document be that fully explains your job to a new employee? That’s the prompt.
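
To put rough numbers on that exercise, here is a minimal back-of-the-envelope sketch. It assumes about 4 characters per token, which is only a common rule of thumb for English text and not the output of any real tokenizer. The 200-page document and the function names are hypothetical, made up for illustration.

```python
# Assumption: ~4 characters per token for English text.
# This is a rule of thumb, not a real tokenizer.
CHARS_PER_TOKEN = 4


def estimate_tokens(char_count: int, chars_per_token: float = CHARS_PER_TOKEN) -> int:
    """Estimate how many tokens a document of char_count characters needs."""
    return round(char_count / chars_per_token)


def fits_in_context(char_count: int, context_tokens: int) -> bool:
    """Check whether the estimated token count fits in a given context window."""
    return estimate_tokens(char_count) <= context_tokens


# Hypothetical onboarding document: 200 pages at ~3,000 characters per page.
doc_chars = 200 * 3_000

print(estimate_tokens(doc_chars))              # 150000
print(fits_in_context(doc_chars, 100_000))     # False
print(fits_in_context(doc_chars, 1_000_000))   # True
```

Under those assumptions, even a modest job manual overflows a 100k-token prompt, which is why the multi-million end of the range matters.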
Author Public Key
npub1nwhdqvfh6g2t86pnqkdf86m3ea89c6ejyhlhe5g9wk2lvpsgss6qed409r