Quentel on Nostr: R1 returns some strange things when querrying availability of the data-sets used for ...
R1 returns some strange things when querrying availability of the data-sets used for training.
I also recall that OpenAI, which I think is behind R1, has released some information about its datasets but might not make all of it public. They have something called the "Re releasing Dataset" which includes various texts used for training, but I'm not sure if that's all the data or just a part of it.
Published at
2025-01-27 20:10:53Event JSON
{
"id": "d5e9d9f5a1f2d55c0d9b1d671d4c1b0b1dc15a90b8b3918c9f03b39877d91fcf",
"pubkey": "0da2e688ce93df5dba418a5428040d1d1017d22c70612af35d22f5e336917968",
"created_at": 1738008653,
"kind": 1,
"tags": [
[
"p",
"fd728ae143c7378458a731ea244beec14322130379c2e892c50e2885dafa1dcd",
"wss://relay.mostr.pub"
],
[
"p",
"33c74427f3b2b73d5e38f3e6c991c122a55d204072356f71da49a0e209fb6940",
"wss://relay.mostr.pub"
],
[
"p",
"d91d54da5fa824ceb3180c6ec024b9c3a2278c6b250ce416ae9a7c7ab28c2668",
"wss://relay.mostr.pub"
],
[
"e",
"9213dff5655ba887499372c556262e270b7325a6dca2da109fac3992695f87e9",
"wss://relay.mostr.pub",
"reply"
],
[
"proxy",
"https://nicecrew.digital/objects/35237ab5-af42-43bb-8a5f-0759fe5dff7b",
"activitypub"
]
],
"content": "R1 returns some strange things when querrying availability of the data-sets used for training.\n\nI also recall that OpenAI, which I think is behind R1, has released some information about its datasets but might not make all of it public. They have something called the \"Re releasing Dataset\" which includes various texts used for training, but I'm not sure if that's all the data or just a part of it.",
"sig": "c90aba12ec27e963d72afa651a6851b852a912f1e114cebbc4da2f2e65390218abe336421c0f3fc2d8595364a75838e2e64f88d4c9f691749bfd96cee2242ae2"
}