Tagged with

1 article found

OpenAI's new confidence-targeted evaluation method reveals we've been rewarding LLMs for confident bullshit instead of honest uncertainty