ICLR 2026 - Submissions
Submissions
Summary Statistics
| Quantity AI Content | Count | Avg Rating |
|---|---|---|
| 0-10% | 0 (0%) | N/A |
| 10-30% | 1 (100%) | 4.00 |
| 30-50% | 0 (0%) | N/A |
| 50-70% | 0 (0%) | N/A |
| 70-90% | 0 (0%) | N/A |
| 90-100% | 0 (0%) | N/A |
| Total | 1 (100%) | 4.00 |
| Title | Abstract | Avg Rating | Quantity AI Content | Reviews | Pangram Dashboard |
|---|---|---|---|---|---|
| Toward Evaluative Thinking: Meta Policy Optimization with Evolving Reward Models | Reward-based alignment methods for large language models (LLMs) face two key limitations: vulnerability to reward hacking, where models exploit flaws in the reward signal; and reliance on brittle, lab... | 4.00 | 12% | See Reviews | View AI Dashboard |