ICLR 2026 - Submissions

Submissions

Quantity AI Content: 0-10%10-30%30-50%50-70%70-90%90-100%All

Avg Rating: 0-1 1-2 2-3 3-4 4-5 5-6 6-7 7-8 8-9 9-10 All

Summary Statistics

Quantity AI Content	Count	Avg Rating
0-10%	11864 (61%)	4.36
10-30%	3952 (20%)	4.14
30-50%	1846 (9%)	3.93
50-70%	1026 (5%)	3.75
70-90%	494 (3%)	3.39
90-100%	199 (1%)	2.90
Total	19490 (100%)	4.20

Title	Abstract	Avg Rating	Quantity AI Content	Reviews	Pangram Dashboard
Exploring Cross-Modal Flows for Few-Shot Learning	Aligning features from different modalities is one of the most fundamental challenges for cross-modal tasks. Although pre-trained vision-language models can achieve a general alignment between image a...	4.50	0%	See Reviews	View AI Dashboard
Sequential Diffusion Language Models	Diffusion language models (DLMs) have strong theoretical efficiency but are limited by fixed-length decoding and incompatibility with key-value (KV) caches. Block diffusion mitigates these issues, yet...	4.00	0%	See Reviews	View AI Dashboard
Semi-Supervised Dataset Condensation with Dual Consistency Trajectory Matching	Dataset condensation synthesizes a small dataset that preserves the performance of training on the original, large-scale data. However, existing methods rely on fully labeled data, which limits their ...	2.00	13%	See Reviews	View AI Dashboard
Human or Machine? A Preliminary Turing Test for Speech-to-Speech Interaction	The pursuit of human-like conversational agents has long been guided by the Turing test. For modern speech-to-speech (S2S) systems, a critical yet unanswered question is whether they can converse like...	5.50	0%	See Reviews	View AI Dashboard
MODE: Learning compositional representations of complex systems with Mixtures Of Dynamical Experts	Dynamical systems in the life sciences are often composed of complex mixtures of overlapping behavioral regimes. For example, cellular subpopulations may shift from cycling to equilibrium dynamics or ...	4.40	15%	See Reviews	View AI Dashboard
Map as a Prompt: Learning Multi-Modal Spatial-Signal Foundation Models for Cross-scenario Wireless Localization	Accurate and robust wireless localization is a critical enabler for emerging 5G/6G applications, including autonomous driving, extended reality, and smart manufacturing. Despite its importance, achiev...	5.33	95%	See Reviews	View AI Dashboard
Offline Reinforcement Learning of High-Quality Behaviors Under Robust Style Alignment	We study offline reinforcement learning of style-conditioned policies using explicit style supervision via subtrajectory labeling functions. In this setting, aligning style with high task performance ...	4.67	0%	See Reviews	View AI Dashboard
Out-of-Distribution Robust Explainer for Graph Neural Networks	Graph Neural Networks (GNNs) are powerful tools for analyzing graph-structured data; however, their interpretability remains a challenge, leading to the growing use of eXplainable AI (XAI) methods. Mo...	4.50	15%	See Reviews	View AI Dashboard
IPGO: Indirect Prompt Gradient Optimization for Text-to-Image Model Prompt Finetuning	Text-to-Image (T2I) Diffusion models have become the state-of-the-art for image generation, yet they often fail to align with specific reward criteria such as aesthetics or human preference. We propo...	4.00	0%	See Reviews	View AI Dashboard
Color3D: Controllable and Consistent 3D Colorization with Personalized Colorizer	In this work, we present Color3D, a highly adaptable framework for colorizing both static and dynamic 3D scenes from monochromatic inputs, delivering visually diverse and chromatically vibrant reconst...	5.50	8%	See Reviews	View AI Dashboard
MedInsightBench: Evaluating Medical Analytics Agents Through Multi-Step Insight Discovery in Multimodal Medical Data	In medical data analysis, extracting deep insights from complex, multi-modal datasets is essential for improving patient care, increasing diagnostic accuracy, and optimizing healthcare operations. How...	4.00	4%	See Reviews	View AI Dashboard
Learning When to Be Uncertain: A Post-Hoc Meta-Model for Guided Uncertainty Learning	Reliable uncertainty quantification remains a major bottleneck in deploying deep learning models under distribution shift. Existing methods that retrofit pretrained models either inherit misplaced con...	3.50	19%	See Reviews	View AI Dashboard
Property-Driven Protein Inverse Folding with Multi-Objective Preference Alignment	Protein sequence design must balance designability, defined as the ability to recover a target backbone, with multiple, often competing, developability properties such as solubility, thermostability, ...	6.00	4%	See Reviews	View AI Dashboard
Semantic Robustness of Deep Neural Networks in Ophthalmology: A Case Study with Colour Fundus Imaging	Despite the success of Deep Neural Networks (DNNs) in ophthalmic tasks, their robustness in real-world clinical settings remains uncertain. This paper presents a case study on the semantic robustness ...	2.67	0%	See Reviews	View AI Dashboard
When Ethics and Payoffs Diverge: LLM Agents in Morally Charged Social Dilemmas	Recent advances in large language models (LLMs) have enabled their use in complex agentic roles, involving decision-making with humans or other agents, making ethical alignment a key AI safety concern...	4.50	12%	See Reviews	View AI Dashboard
MoRE: Batch-Robust Multi-Omics Representations from Frozen Language Models	Representation learning on multi-omics data is challenging due to extreme dimensionality, modality heterogeneity, and cohort-specific batch effects. While transformer-based large language models (LLMs...	1.50	100%	See Reviews	View AI Dashboard
Beginning with You: Perceptual-Initialization Improves Vision-Language Representation and Alignment	We introduce Perceptual-Initialization (PI), a paradigm shift in visual representation learning that incorporates human perceptual structure during the initialization phase rather than as a downstream...	6.67	29%	See Reviews	View AI Dashboard
ProPerSim: Developing Proactive and Personalized AI Assistants through User-Assistant Simulation	As large language models (LLMs) become increasingly integrated into daily life, there is growing demand for AI assistants that are not only reactive but also proactive and personalized. While recent a...	6.00	11%	See Reviews	View AI Dashboard
Variance Reduced Distributed Non-Convex Optimization Using Matrix Stepsizes	Matrix-stepsized gradient descent algorithms have been shown to have superior performance in non-convex optimization problems compared to their scalar counterparts. The det-CGD algorithm, as introduce...	4.67	0%	See Reviews	View AI Dashboard
Riemannian Stochastic Interpolants for Amorphous Particle Systems	Modern generative models hold great promise for accelerating diverse tasks involving the simulation of physical systems, but they must be adapted to the specific constraints of each domain. Significan...	3.50	0%	See Reviews	View AI Dashboard
Continuous Online Action Detection from Egocentric Videos	Online Action Detection (OAD) tackles the challenge of recognizing actions as they unfold, relying solely on current and past frames. However, most OAD models are trained offline and assume static env...	3.00	16%	See Reviews	View AI Dashboard
Task-Aware Data Selection via Proxy-Label Enhanced Distribution Matching for LLM Finetuning	Task-specific fine-tuning of foundation models is critically dependent on the quality and relevance of the instruction data. While prevailing data selection methods rely exclusively on instruction ins...	4.80	20%	See Reviews	View AI Dashboard
Partner-Aware Hierarchical Skill Discovery for Robust Human-AI Collaboration	Multi-agent collaboration, especially in human-AI teaming, requires agents that can adapt to novel partners with diverse and dynamic behaviors. Conventional Deep Hierarchical Reinforcement Learning (D...	3.50	32%	See Reviews	View AI Dashboard
WAFER-QA: Evaluating Vulnerabilities of Agentic Workflows with Agent-as-Judge	Agentic workflows—where multiple large language model (LLM) instances interact to solve tasks—are increasingly built on feedback mechanisms, where one model evaluates and critiques another. Despite th...	4.00	7%	See Reviews	View AI Dashboard
Codified Finite-state Machines for Role-playing	Modeling latent character states is crucial for consistent and engaging role-playing (RP) with large language models (LLMs). Yet, existing prompting-based approaches mainly capture surface actions, of...	5.33	38%	See Reviews	View AI Dashboard
SciPro Arena: a Case Study of AI Agent Capabilities in Scientific Analysis Tasks	We introduce SciPro (Scientific Process) Arena, a benchmark that measures how well frontier agentic AI systems analyze scientific data. While current benchmarks test knowledge recall and reasoning, fe...	3.50	0%	See Reviews	View AI Dashboard
Frequency-Balanced Retinal Representation Learning with Mutual Information Regularization	We propose a frequency-oriented perspective on retinal representation learning by analyzing masked autoencoders (MAE) through the lens of spatial frequency. Our analysis shows that MAE favors low-freq...	4.00	21%	See Reviews	View AI Dashboard
InstructLR: A Scalable Approach to Create Instruction Dataset for Under-Resourced Languages	Effective text generation and chat interfaces for low-resource languages (LRLs) remain a challenge for state-of-the-art large language models (LLMs) to support. This is mainly due to the difficulty of...	4.00	23%	See Reviews	View AI Dashboard
AlphaSteer: Learning Refusal Steering with Principled Null-Space Constraint	As LLMs are increasingly deployed in real-world applications, ensuring their ability to refuse malicious prompts, especially jailbreak attacks, is essential for safe and reliable use. Recently, activa...	7.00	0%	See Reviews	View AI Dashboard
Ultra-Fast Inverse Tone Mapping via Gain Map-based LUT	We aim to introduce Look-Up Tables (LUTs), a highly efficient approach, for ultra-fast inverse tone mapping (ITM). However, as LUT size scales exponentially with increasing bit-depth, it remains chall...	4.40	8%	See Reviews	View AI Dashboard
Amodal SAM: Open-World Amodal Segmentation	Amodal segmentation, which aims to predict complete object shapes including occluded regions, remains challenging in open-world scenarios where models must generalize to novel objects and contexts. Wh...	3.50	0%	See Reviews	View AI Dashboard
VIRTUAL CELLS AS CAUSAL WORLD MODELS: A PERSPECTIVE ON EVALUATION	This perspective argues that evaluating AI virtual cells requires moving beyond predictive accuracy toward assessing their ability to function as causal world models of biology. Existing benchmarks em...	4.67	83%	See Reviews	View AI Dashboard
Q-Learning with Fine-Grained Gap-Dependent Regret	We study fine-grained gap-dependent regret bounds for model-free reinforcement learning in episodic tabular Markov Decision Processes. Existing model-free algorithms achieve minimax worst-case regret,...	6.50	0%	See Reviews	View AI Dashboard
FM-IRL: Flow-Matching for Reward Modeling and Policy Regularization in Reinforcement Learning	Flow Matching (FM) has shown remarkable ability in modeling complex distributions and achieves strong performance in offline imitation learning for cloning expert behaviors. However, despite its behav...	4.00	0%	See Reviews	View AI Dashboard
Shift-and-Sum Quantization for Visual Autoregressive Models	Post-training quantization (PTQ) enables efficient deployment of deep networks using a small set of data. Its application to visual autoregressive models (VAR), however, remains relatively unexplored....	5.50	N/A	See Reviews
FlowOpt: Fast Optimization Through Whole Flow Processes for Training-Free Editing	The remarkable success of diffusion and flow-matching models has ignited a surge of works on adapting them at test time for controlled generation tasks. Examples range from image editing to restoratio...	3.33	0%	See Reviews	View AI Dashboard
LoFT: Low-Rank Adaptation That Behaves Like Full Fine-Tuning	Large pre-trained models are commonly adapted to downstream tasks using parameter-efficient fine-tuning methods such as Low-Rank Adaptation (LoRA), which injects small trainable low-rank matrices inst...	6.00	23%	See Reviews	View AI Dashboard
CLIP-TTA: Robust Test-Time Adaptation via Dual Regularization Beyond Optimal Transport	Despite the remarkable zero-shot performance of vision-language models, such as Contrastive Language-Image Pretraining (CLIP), on many downstream tasks, their potential may be degraded under distribut...	4.67	0%	See Reviews	View AI Dashboard
HyperBatch: Scaling Contrastive Learning Batch Sizes by Two Orders of Magnitude	Contrastive learning has emerged as a powerful method for learning unsupervised representations of data that maximize similarity between "related" pairs of data and minimize similarity between unrelat...	2.50	9%	See Reviews	View AI Dashboard
GUI Knowledge Bench: Revealing the Knowledge Gap Behind VLM Failures in GUI Tasks	Large vision–language models (VLMs) have advanced graphical user interface (GUI) task automation but still lag behind humans. We hypothesize this gap stems from missing core GUI knowledge, which exist...	4.50	13%	See Reviews	View AI Dashboard
Computational Bottlenecks for Denoising Diffusions	Denoising diffusions sample from a probability distribution $\mu$ in $\mathbb{R}^d$ by constructing a stochastic process $(\hat{\mathbf{x}}_t:t\ge 0)$ in $\mathbb{R}^d$ such that $\hat{\mathbf{x}}_0$...	7.33	0%	See Reviews	View AI Dashboard
Knowledge Reasoning Language Model: Unifying Knowledge and Language for Inductive Knowledge Graph Reasoning	Inductive Knowledge Graph Reasoning (KGR) aims to discover facts in open-domain KGs containing unknown entities and relations, which poses a challenge for KGR models in comprehending uncertain KG comp...	5.00	0%	See Reviews	View AI Dashboard
ProfBench: Multi-Domain Rubrics requiring Professional Knowledge to Answer and Judge	Evaluating progress in large language models (LLMs) is often constrained by the challenge of verifying responses, limiting assessments to tasks like mathematics, programming, and short-form question-a...	6.50	0%	See Reviews	View AI Dashboard
StylOS: Multi-View 3D Stylization with Single-Forward Gaussian Splatting	We present Stylos, a single-forward 3D Gaussian framework for 3D style transfer that operates on unposed content, from a single image to a multi- view collection, conditioned on a separate reference s...	7.00	18%	See Reviews	View AI Dashboard
SFdiff : Diffusion Model with Self-Generation for Probabilistic Forecasting	Diffusion models have emerged as an effective approach for time-series probabilistic forecasting, aiming to generate future observations based on historical data through a denoising process. In this p...	3.50	3%	See Reviews	View AI Dashboard
Provably Efficient Policy-Reward Co-Pretraining for Adversarial Imitation Learning	Adversarial imitation learning (AIL) achieves superior expert sample efficiency compared to behavioral cloning (BC) but requires extensive online environment interactions. Recent empirical works have ...	4.00	3%	See Reviews	View AI Dashboard
Diffusion Bridge Variational Inference for Deep Gaussian Processes	Deep Gaussian processes (DGPs) enable expressive hierarchical Bayesian modeling but pose substantial challenges for posterior inference, especially over inducing variables. Denoising diffusion variati...	6.00	60%	See Reviews	View AI Dashboard
Endogenous Communication in Repeated Games with Learning Agents	Communication among learning agents often emerges without explicit supervision. We study endogenous protocol formation in infinitely repeated stage games with a costless pre-play channel. Each agent h...	2.00	43%	See Reviews	View AI Dashboard
Provable Guarantees for Automated Circuit Discovery in Mechanistic Interpretability	Automated circuit discovery is a central tool in mechanistic interpretability for identifying the internal components of neural networks responsible for specific behaviors. While prior methods have ...	7.33	0%	See Reviews	View AI Dashboard
From Curiosity to Caution: Mitigating Reward Hacking for Best-of-$N$ with Pessimism	Inference-time compute scaling has emerged as a powerful paradigm for improving language model performance on a wide range of tasks, but the question of how best to use the additional compute remains ...	6.00	0%	See Reviews	View AI Dashboard

PreviousPage 10 of 390 (19490 total rows)Next