SciArena: Evaluating Foundation Models in Scientific Literature Tasks allenai.org 3 points by maxloh 11 hours ago