In the long run (after launching with something simple) I would use A/B testing to ...

whoever relays stuff 🇵🇸🏴‍☠️🇺🇲

npub1wam…u3l2

2025-02-18 15:47:04

in reply to nevent1q…2n7v

In the long run (after launching with something simple) I would use A/B testing to determine scores based on relative performance.

An example of how it could work (not the only example) -

By default, every other time the user loads a thread, their sorting could switch back and forth between "best comments" and "random." They would still be able to manually override this.

The "random" tab wouldn't actually be completely random, it would always be trying to show 2 different users 2 different sorting arrangements that can be compared to each other - like, users A and B get post X first and post Y second in the thread, but users C and D get post Y first and post X second, seeing which ranking results in more upvotes vs downvotes for all posts overall.

This is useless with too few users, so there's no point launching with it, but with enough users it gets the data needed to fill the "best answers" tab with the actual best answers. It's very resistant to newer answers being drowned out by older answers (which had more time to collect votes). It's very resistant to popularity contest bullshit, because an unpopular answer that gets a lot of discussion will still be recognized for its contribution to the thread, instead of buried in downvotes (i.e. reddit).

Author Public Key

npub1wamvxt2tr50ghu4fdw47ksadnt0p277nv0vfhplmv0n0z3243zyq26u3l2

Seen on

wss://relay.nostr.band

Show more details

Published at

2025-02-18 15:47:04

Kind type

1 Short Text Note

Event JSON

{ "id": "04adc561dcc9b92655b5a79446edc9bc0334326847c3eb0e3e414bfc255af834", "pubkey": "7776c32d4b1d1e8bf2a96babeb43ad9ade157bd363d89b87fb63e6f145558888", "created_at": 1739893624, "kind": 1, "tags": [ [ "e", "11978eb7e2035ebd962e17b97d51bacf6928f722a65387ba095e412f2d36822c", "nostr-idb://cache-relay", "root", "8aa70f4433129dadb71330ac89f62b534caa200a9f3ee349a0f4a5593073d1a6" ], [ "e", "11978eb7e2035ebd962e17b97d51bacf6928f722a65387ba095e412f2d36822c", "nostr-idb://cache-relay", "reply", "8aa70f4433129dadb71330ac89f62b534caa200a9f3ee349a0f4a5593073d1a6" ], [ "p", "8aa70f4433129dadb71330ac89f62b534caa200a9f3ee349a0f4a5593073d1a6" ] ], "content": "In the long run (after launching with something simple) I would use A/B testing to determine scores based on relative performance.\n\nAn example of how it could work (not the only example) -\n\nBy default, every other time the user loads a thread, their sorting could switch back and forth between \"best comments\" and \"random.\" They would still be able to manually override this.\n\nThe \"random\" tab wouldn't actually be completely random, it would always be trying to show 2 different users 2 different sorting arrangements that can be compared to each other - like, users A and B get post X first and post Y second in the thread, but users C and D get post Y first and post X second, seeing which ranking results in more upvotes vs downvotes for all posts overall.\n\nThis is useless with too few users, so there's no point launching with it, but with enough users it gets the data needed to fill the \"best answers\" tab with the actual best answers. It's very resistant to newer answers being drowned out by older answers (which had more time to collect votes). It's very resistant to popularity contest bullshit, because an unpopular answer that gets a lot of discussion will still be recognized for its contribution to the thread, instead of buried in downvotes (i.e. reddit).", "sig": "b8e2ff6b4f5b479e6f5a56dd102cf2b0d39f0725a500b39e157f7e17cebf0643eef23a487aaa25bfd1dceddd5b2579df5954f7544d194d77de38d40b90e2cb28" }