Ecologia Digital on Nostr:
"the rise of AI products like #ChatGPT, and the #LLMs underlying them, have made high-quality training data one of the internet’s most valuable commodities. That has caused internet providers of all sorts to reconsider the value of the data on their servers, and rethink who gets access to what. Being too permissive can bleed your website of all its value; being too restrictive can make you invisible."
#robotstxt
https://www.theverge.com/24067997/robots-txt-ai-text-file-web-crawlers-spiders
Published at 2024-03-31 10:31:46
Event JSON
{
"id": "6da5682a4ed6ba212b8f7992be0b3ede4e96ff355aba2b2ed28c1a721065d3c6",
"pubkey": "ba690b0651faa8aa1dde73f74c502a4c1d0cd81a8aa84303c6fe847e17f85d57",
"created_at": 1711881106,
"kind": 1,
"tags": [
[
"t",
"chatgpt"
],
[
"t",
"llms"
],
[
"t",
"robotstxt"
],
[
"proxy",
"https://mato.social/users/josemurilo/statuses/112189840189072720",
"activitypub"
]
],
"content": "\"the rise of AI products like #ChatGPT, and the #LLMs underlying them, have made high-quality training data one of the internet’s most valuable commodities. That has caused internet providers of all sorts to reconsider the value of the data on their servers, and rethink who gets access to what. Being too permissive can bleed your website of all its value; being too restrictive can make you invisible.\"\n#robotstxt \nhttps://www.theverge.com/24067997/robots-txt-ai-text-file-web-crawlers-spiders",
"sig": "7c516b7dad16185f3cf984ada4ac0379ae5087f39b813cf0395a774f73a1f0fab1087bd3fc40d0abe023bfa8756ae935f4ec97b63be9b0fba5a8fc1431a27637"
}