What is Nostr?
Prof. Emily M. Bender(she/her) /
npub1z0k…pp3u
2024-05-20 18:20:40
in reply to nevent1q…zyvn

Prof. Emily M. Bender(she/her) on Nostr: So, even if a test has been established to have construct validity as a test relating ...

So, even if a test has been established to have construct validity as a test relating to human cognition, you can't just throw it at a chatbot and take the results as meaningful.

What would it mean for a language model to have a theory of mind? How does string manipulation relate to that? Without answers to these questions, the tests are meaningless.
Author Public Key
npub1z0kfl4g93gvv6ztazp0adm6rwk0r04v3tvwqrmfk4ncw7k37du4qk0pp3u