ICLR 2026 - Submissions
Submissions
Summary Statistics
| Quantity AI Content | Count | Avg Rating |
|---|---|---|
| 0-10% | 0 (0%) | N/A |
| 10-30% | 1 (100%) | 4.00 |
| 30-50% | 0 (0%) | N/A |
| 50-70% | 0 (0%) | N/A |
| 70-90% | 0 (0%) | N/A |
| 90-100% | 0 (0%) | N/A |
| Total | 1 (100%) | 4.00 |
| Title | Abstract | Avg Rating | Quantity AI Content | Reviews | Pangram Dashboard |
|---|---|---|---|---|---|
| Reward Shaping Control Variates for Off-Policy Evaluation Under Sparse Rewards | Off-policy evaluation (OPE) is essential for deploying reinforcement learning in safety-critical settings, yet existing estimators such as importance sampling and doubly robust (DR) often exhibit proh... | 4.00 | 26% | See Reviews | View AI Dashboard |