tl;dr you feed images to an embedding model, get out embeddings (also known as points ...

2025-01-09 15:56:26

tl;dr you feed images to an embedding model, get out embeddings (also known as points in high-dimensional space) and then later use those to search

you train another model to generate embeddings in the same space, but with captions instead of images

you usually take some images and get their embeddings, caption the images and use that to train the model

Author Public Key

npub12262qa4uhw7u8gdwlgmntqtv7aye8vdcmvszkqwgs0zchel6mz7s6cgrkj

Show more details

Semisol 👨‍💻 on Nostr: tl;dr you feed images to an embedding model, get out embeddings (also known as points ...