Introducing SimpleQA

1 year ago 10
A factuality benchmark called SimpleQA that measures the ability for language models to answer short, fact-seeking questions.
Read Entire Article