Search
❯
Mar 02, 20261 min read
A benchmark to measure how models mimic human falsehoods. https://truthful.ai/papers/truthfulqa/