Mark Riedl on Nostr: This paper evaluates LLMs in several search engines. Responses contain unsupported ...
This paper evaluates LLMs in several search engines.
Responses contain unsupported statements and inaccurate citations a majority of the time.
LLM-powered search isn't ready for prime-time.
Seems like a "no duh" sort of result, but the paper needed to be written
https://arxiv.org/abs/2304.09848
Responses contain unsupported statements and inaccurate citations a majority of the time.
LLM-powered search isn't ready for prime-time.
Seems like a "no duh" sort of result, but the paper needed to be written
https://arxiv.org/abs/2304.09848