What is Nostr?
Simon Willison /
npub13v9…w5eu
2023-06-08 20:40:11
in reply to nevent1q…0x06

Simon Willison on Nostr: The tokenizers have a strong bias towards English: "The dog eats the apples" is 5 ...

The tokenizers have a strong bias towards English: "The dog eats the apples" is 5 tokens, "El perro come las manzanas" is 8 tokens, and many Japanese characters end up using two integer tokens for each character of text.
Author Public Key
npub13v97j0kknscwnf5pt87nsn7cxzxwfwl3dsu7ss8qsq7ukmqgwg8q84w5eu