Trivia
AF - (Paper) AI Sandbagging: Language Models can Strategically Underperform on Evaluations by Teun van der Weij
The Nonlinear Library
It looks like we don't have any trivia for this title yet. Be the first to contribute.