geeknik on Nostr: Researchers at UC San Diego conducted an experiment to determine whether OpenAI's ...
Researchers at UC San Diego conducted an experiment to determine whether OpenAI's GPT-4 could pass the Turing Test, in comparison to GPT-3.5, the 1960s program ELIZA, and human participants. GPT-4 achieved a success rate of 41%. The study, released as a preprint on arXiv and not yet peer-reviewed, revealed that variables such as education level and familiarity with AI did not markedly affect the participants' capacity to identify AI presence. The study emphasized the enduring significance of the Turing Test in evaluating the abilities of AI in social interaction and deception.
https://arstechnica.com/information-technology/2023/12/real-humans-appeared-human-63-of-the-time-in-recent-turing-test-ai-study/
Published at
2023-12-03 23:20:23
Event JSON
{
  "id": "7d02ca81a18993a38ba7109d7a5430ba35a4e7f47e1e72cf0fd56a98c1e32403",
  "pubkey": "4d8e327543efbe13ef4f49e43922a40258ac60ededcee062a568f18845a09a04",
  "created_at": 1701645623,
  "kind": 1,
  "tags": [
    [
      "r",
      "https://arstechnica.com/information-technology/2023/12/real-humans-appeared-human-63-of-the-time-in-recent-turing-test-ai-study/"
    ]
  ],
  "content": "Researchers at UC San Diego conducted an experiment to determine whether OpenAI's GPT-4 could pass the Turing Test, in comparison to GPT-3.5, the 1960s program ELIZA, and human participants. GPT-4 achieved a success rate of 41%. The study, released as a preprint on arXiv and not yet peer-reviewed, revealed that variables such as education level and familiarity with AI did not markedly affect the participants' capacity to identify AI presence. The study emphasized the enduring significance of the Turing Test in evaluating the abilities of AI in social interaction and deception.\n\nhttps://arstechnica.com/information-technology/2023/12/real-humans-appeared-human-63-of-the-time-in-recent-turing-test-ai-study/",
  "sig": "d46874f3d9b7cb059338cefc07b80de75c917fab9c89d9f01dcb376e45363133a471d71c8fb1e842c6112d58bb3ad40c82bb906359a2730573cafa5333a72e12"
}