lhl on Nostr: #[0] There are certain classes of problems that current LLMs just can't do (I tried ...
npub1j46f9va5qdrmamcwmncew7g8mwekgma386dhws6k04fsnlkqpcpsj23gm7 (npub1j46…3gm7) There are certain classes of problems that current LLMs just can't do (I tried your prompt w/ ChatGPT4 (which is smarter than 3.5) - it eventually concludes that it's too hard and gives up (I don't know if it really is impossible or not): https://sharegpt.com/c/rFtJboN - I make a typo btw, but I changed the prompt around a bit to try to get it to do more Chain of Thought reasoning). Here's a good article on some maybe not-so-obvious observations on how LLMs work: https://arxiv.org/pdf/2304.00612.pdf . Currently their reasoning ability is limited, but here are some examples of how I use it: using the browsing module to backfill concepts while reading a paper: https://sharegpt.com/c/JFexqvm , as interactive documentation, especially for devops tasks https://sharegpt.com/c/OYGitBE , it also does a bangup job of converting/manipulating tabular data (to/from Markdown, CSV, adding and merging columns and rows etc) as well.