ICLR 2026 - Submissions

Submissions

Summary Statistics

Quantity AI Content    Count       Avg Rating
0-10%                  0 (0%)      N/A
10-30%                 1 (100%)    6.67
30-50%                 0 (0%)      N/A
50-70%                 0 (0%)      N/A
70-90%                 0 (0%)      N/A
90-100%                0 (0%)      N/A
Total                  1 (100%)    6.67

Title: CounselBench: A Large-Scale Expert Evaluation and Adversarial Benchmarking of Large Language Models in Mental Health Question Answering
Abstract: Medical question answering (QA) benchmarks often focus on multiple-choice or fact-based tasks, leaving open-ended answers to real patient questions underexplored. This gap is particularly critical in ...
Avg Rating: 6.67
Quantity AI Content: 14%