liminal 🦠 on Nostr:
Think of how you 'hold a memory': no one else can interact with it unless you talk about it or draw it; a memory or idea needs to be conveyed somehow to the outside world. Encoder models that produce embeddings are basically half an LLM: they take in the words/tokens and say "this string is located here", assigning a coordinate, an address for it. That address is the context; anything with a nearby address is more related in ideas. The second part of the LLM is the decoder, which takes that address as a kind of starting point. The decoder uses the context of that coordinate and responds with words that are also in the right context (which is learned by training).
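As a rough illustration of the "encoder half", here's a minimal sketch using the sentence-transformers library (the model name and example sentences are my own choices, not from the post): it turns strings into coordinates and shows that related ideas land at nearby addresses.

```python
# Minimal sketch of the encoder half: text in, coordinate (embedding) out.
# Assumes `pip install sentence-transformers`; the model name is just one common choice.
import numpy as np
from sentence_transformers import SentenceTransformer

model = SentenceTransformer("all-MiniLM-L6-v2")  # encoder-only model that produces embeddings

sentences = [
    "I remembered my childhood home.",
    "A memory of the house I grew up in.",
    "The stock market dropped today.",
]
embeddings = model.encode(sentences)  # each sentence gets an "address" (a vector of numbers)

def cosine(a, b):
    """Similarity of two addresses: closer to 1.0 means nearer in idea-space."""
    return float(np.dot(a, b) / (np.linalg.norm(a) * np.linalg.norm(b)))

print(cosine(embeddings[0], embeddings[1]))  # related ideas -> nearby addresses, higher similarity
print(cosine(embeddings[0], embeddings[2]))  # unrelated ideas -> farther apart, lower similarity
```

A decoder-style model would then start from a point like this in context space and generate tokens that fit that context, which is what it learns during training.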
H.T. to TheGuySwann (npub1h8n…rpev) for his fantastic read of "A Gentle Introduction to Large Language Models"
https://fountain.fm/episode/yCpvsos8iUfXsfLeUPon
https://mark-riedl.medium.com/a-very-gentle-introduction-to-large-language-models-without-the-hype-5f67941fa59e