caleb on Nostr: 🤗 datasets is a great library for easy sharing and access to common data sets, but ...
🤗 datasets is a great library for easy sharing and access to common data sets, but for building anything bigger than a toy data sets, I find it’s far easier and less error prone to build my own loop and write to Parquet files myself.
Published at
2023-09-09 19:35:14Event JSON
{
"id": "9ce8856928db29e40f59a7545fea8c3965bd9a0a61c9e1b5c8ccff2b05f12c49",
"pubkey": "28853cacb62492c970f0d27a76962710c0ad97f56e0163693981ffabc0faec3c",
"created_at": 1694288114,
"kind": 1,
"tags": [],
"content": "🤗 datasets is a great library for easy sharing and access to common data sets, but for building anything bigger than a toy data sets, I find it’s far easier and less error prone to build my own loop and write to Parquet files myself.",
"sig": "2944a4e2ef932bb012c7501ee6364f7f716b5600414a00c06975a5259f9c59c50cdcaedfeb16b415144271a74a67ffd4dcbd8cab9eb5a83621b2d13466a58530"
}