JD Long ✅ on Nostr: Much ink has been spilled about keeping your stuff out of LLM training data. What can ...
Much ink has been spilled about keeping your stuff out of LLM training data. What can we do to ensure our text IS included in training data? I’m thinking about open source library managers who want to ensure an LLM knows their library.
Published at
2023-11-15 02:58:55Event JSON
{
"id": "6ecc5144a6ea2d161ec9319413cf1891eb5262cc5b92e5a7a7da4477f80e230a",
"pubkey": "64e223fe7b9fd9f955bdbfcb7a47353b85253544cd7c2a8f2f2878864c8a0d41",
"created_at": 1700017135,
"kind": 1,
"tags": [
[
"proxy",
"https://mastodon.social/users/Cmastication/statuses/111412322981176020",
"activitypub"
]
],
"content": "Much ink has been spilled about keeping your stuff out of LLM training data. What can we do to ensure our text IS included in training data? I’m thinking about open source library managers who want to ensure an LLM knows their library.",
"sig": "f56dd20d7849089badec51f98e98cb6c1c2e4ed59f5da24c606300175487c1e880bbcc38d18077871afc650eaff0ba91417ab2d377495f4b6112631771eb8ff1"
}