allison on Nostr:
i got so angry after reading this paper on LLMs and African American English that i literally had to stand up and go walk around the block to cool off
https://www.nature.com/articles/s41586-024-07856-5 it's a very compelling paper, with a super clever methodology, and (i'm paraphrasing/extrapolating) shows that "alignment" strategies like RLHF only work to ensure that it never seems like a white person is saying something overtly racist, rather than addressing the actual prejudice baked into the model
Published at
2024-08-30 21:15:58
Event JSON
{
  "id": "2dbc1c9a330433a9d8839956c37c06f1ff967b73fa4105bd1763cdb1790b048e",
  "pubkey": "ceb8b691f6e928b961b45bd08b55c2c996288e57d03e0c4be4b0c8c15dd75c8a",
  "created_at": 1725052558,
  "kind": 1,
  "tags": [
    [
      "proxy",
      "https://friend.camp/users/aparrish/statuses/113053044485254385",
      "activitypub"
    ]
  ],
  "content": "i got so angry after reading this paper on LLMs and African American English that i literally had to stand up and go walk around the block to cool off https://www.nature.com/articles/s41586-024-07856-5 it's a very compelling paper, with a super clever methodology, and (i'm paraphrasing/extrapolating) shows that \"alignment\" strategies like RLHF only work to ensure that it never seems like a white person is saying something overtly racist, rather than addressing the actual prejudice baked into the model",
  "sig": "ce828905d58a314fc2e3c420d2c9991d39ff179ed9ff8eb799defca7bac318498a6309c18e962f8a26838365eb0993ee21879500340cf6de9fd13af7b295bda0"
}