ICLR 2026 - Submissions

SubmissionsReviews

Submissions

Summary Statistics

Quantity AI Content Count Avg Rating
0-10% 1 (100%) 4.00
10-30% 0 (0%) N/A
30-50% 0 (0%) N/A
50-70% 0 (0%) N/A
70-90% 0 (0%) N/A
90-100% 0 (0%) N/A
Total 1 (100%) 4.00
Title Abstract Avg Rating Quantity AI Content Reviews Pangram Dashboard
Imitation Learning for Multi-turn LM Agents via On-policy Expert Corrections A popular paradigm for training LM agents relies on *imitation learning*, fine-tuning on expert trajectories. However, we show that the off-policy nature of imitation learning for multi-turn LM agents... 4.00 0% See Reviews View AI Dashboard
PreviousPage 1 of 1 (1 total rows)Next