Cory Doctorow on Nostr
It's pretty easy to have a high-res scanner auto-detect the positions of each page on the fiche and to run the text through OCR, but a human would still need to go through all those pages, marking the first and last page of each journal and identifying the table of contents and indexing it to the scanned pages.
2/
Published at 2024-10-30 12:46:55
Event JSON
{
"id": "883ffd6cac386d1589d1193e71d31ff567437b67ec2b0d3b768cadffd7b652ec",
"pubkey": "21856daf84c2e4e505290eb25e3083b0545b8c03ea97b89831117cff09fadf0d",
"created_at": 1730292415,
"kind": 1,
"tags": [
[
"e",
"feda5108fa64150914e57ecc921ff00c3c629f10ac2960625f94fb7323ce0993",
"wss://relay.mostr.pub",
"reply"
],
[
"content-warning",
"Long thread/3"
],
[
"proxy",
"https://mamot.fr/users/pluralistic/statuses/113396443732305518",
"activitypub"
]
],
"content": "It's pretty easy to have a high-res scanner auto-detect the positions of each page on the fiche and to run the text through OCR, but a human would still need to go through all those pages, marking the first and last page of each journal and identifying the table of contents and indexing it to the scanned pages. \n\n2/",
"sig": "04f0738c1297d3bd351814362a3790cbe4c8bf6b8cd0960e65146c66a47becca18ca48cc45cdbe8dbe7af459ec6cbcfaab45e6d2a02518f97d629ad84d37a80a"
}