Greg Wilson /
npub1gu0…g2n6
2024-12-19 11:06:36

Matthews & Nagappan 2024: "Design choices made by LLM-based test generators prevent them from finding bugs" https://arxiv.org/abs/2412.14137 Shows that LLM-generated tests can fail to detect bugs and, more alarmingly, that the generators' design can make matters worse by validating bugs in the generated test suite and rejecting bug-revealing tests. #nwit
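To make the failure mode concrete, here is a minimal sketch in Python. It is not taken from the paper: the function `days_in_february`, the helpers `keep_passing_tests` and `snapshot_tests`, and the sample years are all hypothetical. It assumes a generator that (a) keeps only candidate tests that pass against the current code and (b) records the current output as the expected value.

```python
def days_in_february(year):
    """Deliberately buggy: ignores the century rule, so 1900 counts as a leap year."""
    return 29 if year % 4 == 0 else 28


def keep_passing_tests(func, candidate_tests):
    """Hypothetical filter: keep only candidates that pass against the current code."""
    return [(arg, expected) for arg, expected in candidate_tests if func(arg) == expected]


def snapshot_tests(func, inputs):
    """Hypothetical oracle: record whatever the current code returns as 'expected'."""
    return [(arg, func(arg)) for arg in inputs]


candidates = [
    (2024, 29),  # correct expectation, passes
    (2023, 28),  # correct expectation, passes
    (1900, 28),  # correct expectation, fails on the buggy code, so it reveals the bug
]

# The only bug-revealing test is discarded because it fails on the code under test.
kept = keep_passing_tests(days_in_february, candidates)

# The buggy answer for 1900 is written into the suite as the expected result.
snapshots = snapshot_tests(days_in_february, [1900])

print(kept)
print(snapshots)
```

Under these assumptions the surviving suite is green, yet it has dropped the one test that would have exposed the bug and gained one that asserts the buggy behavior.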
Author Public Key
npub1gu0vskzadshat7g3c75ygjkan9pvfw8rwdjak2suhp79uy7qm78saxg2n6