liminal 🦠on Nostr: If you are grabbing embeddings from other users, there should be a link to the ...
If you are grabbing embeddings from other users, there should be a link to the original model used for the embeddings. There are probably solutions for recovering the original text from the embedding and you can also just reembed the text yourself to compare the resulting vectors. But if you're grabbing enough of them that you're trying to make comparisons, you're giving some level of trust because at that level you might as well do the embeddings yourself. Additionally, if the embeddings are already being used in a recommendation system I'd imagine that they are there because they are useful and help increase the organization of the content - so I'd expect that there is less incentive to find embeddings that are maliciously inserted by a user.
Published at
2024-10-10 17:44:16Event JSON
{
"id": "afd311ac8dc50f20b2583906e7dfe0a365980c2060bb1628814c8077ff6ba7a4",
"pubkey": "dc4cd086cd7ce5b1832adf4fdd1211289880d2c7e295bcb0e684c01acee77c06",
"created_at": 1728582256,
"kind": 1,
"tags": [
[
"p",
"e989aa6e0137d52a410ecd89ae59f7adbfb0bdec9786b9181c3707954b4cfa69"
],
[
"p",
"a9434ee165ed01b286becfc2771ef1705d3537d051b387288898cc00d5c885be"
],
[
"p",
"70122128273bdc07af9be7725fa5c4bc0fc146866bec38d44360dc4bc6cc18b9"
],
[
"p",
"fd208ee8c8f283780a9552896e4823cc9dc6bfd442063889577106940fd927c1"
],
[
"e",
"a89bcb089b90dc33278e32d9e546444d8cf86250879608db30ca03f7e027583a",
"wss://niel.nostr1.com/",
"root"
],
[
"e",
"6a120a3fb58c012e92d6325d9fe1bd93c022df117bdc89d2b5e4f4845d55566b",
"wss://relay.nostr.band/",
"reply"
]
],
"content": "If you are grabbing embeddings from other users, there should be a link to the original model used for the embeddings. There are probably solutions for recovering the original text from the embedding and you can also just reembed the text yourself to compare the resulting vectors. But if you're grabbing enough of them that you're trying to make comparisons, you're giving some level of trust because at that level you might as well do the embeddings yourself. Additionally, if the embeddings are already being used in a recommendation system I'd imagine that they are there because they are useful and help increase the organization of the content - so I'd expect that there is less incentive to find embeddings that are maliciously inserted by a user.",
"sig": "e8924f7208072fb74f1bc88d5b2efebca6d87aca4aace0c7661fc304a21eb7c57a0d945201f459b0230d82a7aab4756e4a22cca9c88ac35cfc1b008327e316e5"
}