ICLR 2026 - Submissions

SubmissionsReviews

Submissions

Summary Statistics

Quantity AI Content Count Avg Rating
0-10% 11864 (61%) 4.36
10-30% 3952 (20%) 4.14
30-50% 1846 (9%) 3.93
50-70% 1026 (5%) 3.75
70-90% 494 (3%) 3.39
90-100% 199 (1%) 2.90
Total 19490 (100%) 4.20
Title Abstract Avg Rating Quantity AI Content Reviews Pangram Dashboard
Exploring Cross-Modal Flows for Few-Shot Learning Aligning features from different modalities is one of the most fundamental challenges for cross-modal tasks. Although pre-trained vision-language models can achieve a general alignment between image a... 4.50 0% See Reviews View AI Dashboard
Sequential Diffusion Language Models Diffusion language models (DLMs) have strong theoretical efficiency but are limited by fixed-length decoding and incompatibility with key-value (KV) caches. Block diffusion mitigates these issues, yet... 4.00 0% See Reviews View AI Dashboard
Semi-Supervised Dataset Condensation with Dual Consistency Trajectory Matching Dataset condensation synthesizes a small dataset that preserves the performance of training on the original, large-scale data. However, existing methods rely on fully labeled data, which limits their ... 2.00 13% See Reviews View AI Dashboard
Human or Machine? A Preliminary Turing Test for Speech-to-Speech Interaction The pursuit of human-like conversational agents has long been guided by the Turing test. For modern speech-to-speech (S2S) systems, a critical yet unanswered question is whether they can converse like... 5.50 0% See Reviews View AI Dashboard
MODE: Learning compositional representations of complex systems with Mixtures Of Dynamical Experts Dynamical systems in the life sciences are often composed of complex mixtures of overlapping behavioral regimes. For example, cellular subpopulations may shift from cycling to equilibrium dynamics or ... 4.40 15% See Reviews View AI Dashboard
Map as a Prompt: Learning Multi-Modal Spatial-Signal Foundation Models for Cross-scenario Wireless Localization Accurate and robust wireless localization is a critical enabler for emerging 5G/6G applications, including autonomous driving, extended reality, and smart manufacturing. Despite its importance, achiev... 5.33 95% See Reviews View AI Dashboard
Offline Reinforcement Learning of High-Quality Behaviors Under Robust Style Alignment We study offline reinforcement learning of style-conditioned policies using explicit style supervision via subtrajectory labeling functions. In this setting, aligning style with high task performance ... 4.67 0% See Reviews View AI Dashboard
Out-of-Distribution Robust Explainer for Graph Neural Networks Graph Neural Networks (GNNs) are powerful tools for analyzing graph-structured data; however, their interpretability remains a challenge, leading to the growing use of eXplainable AI (XAI) methods. Mo... 4.50 15% See Reviews View AI Dashboard
IPGO: Indirect Prompt Gradient Optimization for Text-to-Image Model Prompt Finetuning Text-to-Image (T2I) Diffusion models have become the state-of-the-art for image generation, yet they often fail to align with specific reward criteria such as aesthetics or human preference. We propo... 4.00 0% See Reviews View AI Dashboard
Color3D: Controllable and Consistent 3D Colorization with Personalized Colorizer In this work, we present Color3D, a highly adaptable framework for colorizing both static and dynamic 3D scenes from monochromatic inputs, delivering visually diverse and chromatically vibrant reconst... 5.50 8% See Reviews View AI Dashboard
MedInsightBench: Evaluating Medical Analytics Agents Through Multi-Step Insight Discovery in Multimodal Medical Data In medical data analysis, extracting deep insights from complex, multi-modal datasets is essential for improving patient care, increasing diagnostic accuracy, and optimizing healthcare operations. How... 4.00 4% See Reviews View AI Dashboard
Learning When to Be Uncertain: A Post-Hoc Meta-Model for Guided Uncertainty Learning Reliable uncertainty quantification remains a major bottleneck in deploying deep learning models under distribution shift. Existing methods that retrofit pretrained models either inherit misplaced con... 3.50 19% See Reviews View AI Dashboard
Property-Driven Protein Inverse Folding with Multi-Objective Preference Alignment Protein sequence design must balance designability, defined as the ability to recover a target backbone, with multiple, often competing, developability properties such as solubility, thermostability, ... 6.00 4% See Reviews View AI Dashboard
Semantic Robustness of Deep Neural Networks in Ophthalmology: A Case Study with Colour Fundus Imaging Despite the success of Deep Neural Networks (DNNs) in ophthalmic tasks, their robustness in real-world clinical settings remains uncertain. This paper presents a case study on the semantic robustness ... 2.67 0% See Reviews View AI Dashboard
When Ethics and Payoffs Diverge: LLM Agents in Morally Charged Social Dilemmas Recent advances in large language models (LLMs) have enabled their use in complex agentic roles, involving decision-making with humans or other agents, making ethical alignment a key AI safety concern... 4.50 12% See Reviews View AI Dashboard
MoRE: Batch-Robust Multi-Omics Representations from Frozen Language Models Representation learning on multi-omics data is challenging due to extreme dimensionality, modality heterogeneity, and cohort-specific batch effects. While transformer-based large language models (LLMs... 1.50 100% See Reviews View AI Dashboard
Beginning with You: Perceptual-Initialization Improves Vision-Language Representation and Alignment We introduce Perceptual-Initialization (PI), a paradigm shift in visual representation learning that incorporates human perceptual structure during the initialization phase rather than as a downstream... 6.67 29% See Reviews View AI Dashboard
ProPerSim: Developing Proactive and Personalized AI Assistants through User-Assistant Simulation As large language models (LLMs) become increasingly integrated into daily life, there is growing demand for AI assistants that are not only reactive but also proactive and personalized. While recent a... 6.00 11% See Reviews View AI Dashboard
Variance Reduced Distributed Non-Convex Optimization Using Matrix Stepsizes Matrix-stepsized gradient descent algorithms have been shown to have superior performance in non-convex optimization problems compared to their scalar counterparts. The det-CGD algorithm, as introduce... 4.67 0% See Reviews View AI Dashboard
Riemannian Stochastic Interpolants for Amorphous Particle Systems Modern generative models hold great promise for accelerating diverse tasks involving the simulation of physical systems, but they must be adapted to the specific constraints of each domain. Significan... 3.50 0% See Reviews View AI Dashboard
Continuous Online Action Detection from Egocentric Videos Online Action Detection (OAD) tackles the challenge of recognizing actions as they unfold, relying solely on current and past frames. However, most OAD models are trained offline and assume static env... 3.00 16% See Reviews View AI Dashboard
Task-Aware Data Selection via Proxy-Label Enhanced Distribution Matching for LLM Finetuning Task-specific fine-tuning of foundation models is critically dependent on the quality and relevance of the instruction data. While prevailing data selection methods rely exclusively on instruction ins... 4.80 20% See Reviews View AI Dashboard
Partner-Aware Hierarchical Skill Discovery for Robust Human-AI Collaboration Multi-agent collaboration, especially in human-AI teaming, requires agents that can adapt to novel partners with diverse and dynamic behaviors. Conventional Deep Hierarchical Reinforcement Learning (D... 3.50 32% See Reviews View AI Dashboard
WAFER-QA: Evaluating Vulnerabilities of Agentic Workflows with Agent-as-Judge Agentic workflows—where multiple large language model (LLM) instances interact to solve tasks—are increasingly built on feedback mechanisms, where one model evaluates and critiques another. Despite th... 4.00 7% See Reviews View AI Dashboard
Codified Finite-state Machines for Role-playing Modeling latent character states is crucial for consistent and engaging role-playing (RP) with large language models (LLMs). Yet, existing prompting-based approaches mainly capture surface actions, of... 5.33 38% See Reviews View AI Dashboard
SciPro Arena: a Case Study of AI Agent Capabilities in Scientific Analysis Tasks We introduce SciPro (Scientific Process) Arena, a benchmark that measures how well frontier agentic AI systems analyze scientific data. While current benchmarks test knowledge recall and reasoning, fe... 3.50 0% See Reviews View AI Dashboard
Frequency-Balanced Retinal Representation Learning with Mutual Information Regularization We propose a frequency-oriented perspective on retinal representation learning by analyzing masked autoencoders (MAE) through the lens of spatial frequency. Our analysis shows that MAE favors low-freq... 4.00 21% See Reviews View AI Dashboard
InstructLR: A Scalable Approach to Create Instruction Dataset for Under-Resourced Languages Effective text generation and chat interfaces for low-resource languages (LRLs) remain a challenge for state-of-the-art large language models (LLMs) to support. This is mainly due to the difficulty of... 4.00 23% See Reviews View AI Dashboard
AlphaSteer: Learning Refusal Steering with Principled Null-Space Constraint As LLMs are increasingly deployed in real-world applications, ensuring their ability to refuse malicious prompts, especially jailbreak attacks, is essential for safe and reliable use. Recently, activa... 7.00 0% See Reviews View AI Dashboard
Ultra-Fast Inverse Tone Mapping via Gain Map-based LUT We aim to introduce Look-Up Tables (LUTs), a highly efficient approach, for ultra-fast inverse tone mapping (ITM). However, as LUT size scales exponentially with increasing bit-depth, it remains chall... 4.40 8% See Reviews View AI Dashboard
Amodal SAM: Open-World Amodal Segmentation Amodal segmentation, which aims to predict complete object shapes including occluded regions, remains challenging in open-world scenarios where models must generalize to novel objects and contexts. Wh... 3.50 0% See Reviews View AI Dashboard
VIRTUAL CELLS AS CAUSAL WORLD MODELS: A PERSPECTIVE ON EVALUATION This perspective argues that evaluating AI virtual cells requires moving beyond predictive accuracy toward assessing their ability to function as causal world models of biology. Existing benchmarks em... 4.67 83% See Reviews View AI Dashboard
Q-Learning with Fine-Grained Gap-Dependent Regret We study fine-grained gap-dependent regret bounds for model-free reinforcement learning in episodic tabular Markov Decision Processes. Existing model-free algorithms achieve minimax worst-case regret,... 6.50 0% See Reviews View AI Dashboard
FM-IRL: Flow-Matching for Reward Modeling and Policy Regularization in Reinforcement Learning Flow Matching (FM) has shown remarkable ability in modeling complex distributions and achieves strong performance in offline imitation learning for cloning expert behaviors. However, despite its behav... 4.00 0% See Reviews View AI Dashboard
Shift-and-Sum Quantization for Visual Autoregressive Models Post-training quantization (PTQ) enables efficient deployment of deep networks using a small set of data. Its application to visual autoregressive models (VAR), however, remains relatively unexplored.... 5.50 N/A See Reviews
FlowOpt: Fast Optimization Through Whole Flow Processes for Training-Free Editing The remarkable success of diffusion and flow-matching models has ignited a surge of works on adapting them at test time for controlled generation tasks. Examples range from image editing to restoratio... 3.33 0% See Reviews View AI Dashboard
LoFT: Low-Rank Adaptation That Behaves Like Full Fine-Tuning Large pre-trained models are commonly adapted to downstream tasks using parameter-efficient fine-tuning methods such as Low-Rank Adaptation (LoRA), which injects small trainable low-rank matrices inst... 6.00 23% See Reviews View AI Dashboard
CLIP-TTA: Robust Test-Time Adaptation via Dual Regularization Beyond Optimal Transport Despite the remarkable zero-shot performance of vision-language models, such as Contrastive Language-Image Pretraining (CLIP), on many downstream tasks, their potential may be degraded under distribut... 4.67 0% See Reviews View AI Dashboard
HyperBatch: Scaling Contrastive Learning Batch Sizes by Two Orders of Magnitude Contrastive learning has emerged as a powerful method for learning unsupervised representations of data that maximize similarity between "related" pairs of data and minimize similarity between unrelat... 2.50 9% See Reviews View AI Dashboard
GUI Knowledge Bench: Revealing the Knowledge Gap Behind VLM Failures in GUI Tasks Large vision–language models (VLMs) have advanced graphical user interface (GUI) task automation but still lag behind humans. We hypothesize this gap stems from missing core GUI knowledge, which exist... 4.50 13% See Reviews View AI Dashboard
Computational Bottlenecks for Denoising Diffusions Denoising diffusions sample from a probability distribution $\mu$ in $\mathbb{R}^d$ by constructing a stochastic process $(\hat{\mathbf{x}}_t:t\ge 0)$ in $\mathbb{R}^d$ such that $\hat{\mathbf{x}}_0$... 7.33 0% See Reviews View AI Dashboard
Knowledge Reasoning Language Model: Unifying Knowledge and Language for Inductive Knowledge Graph Reasoning Inductive Knowledge Graph Reasoning (KGR) aims to discover facts in open-domain KGs containing unknown entities and relations, which poses a challenge for KGR models in comprehending uncertain KG comp... 5.00 0% See Reviews View AI Dashboard
ProfBench: Multi-Domain Rubrics requiring Professional Knowledge to Answer and Judge Evaluating progress in large language models (LLMs) is often constrained by the challenge of verifying responses, limiting assessments to tasks like mathematics, programming, and short-form question-a... 6.50 0% See Reviews View AI Dashboard
StylOS: Multi-View 3D Stylization with Single-Forward Gaussian Splatting We present Stylos, a single-forward 3D Gaussian framework for 3D style transfer that operates on unposed content, from a single image to a multi- view collection, conditioned on a separate reference s... 7.00 18% See Reviews View AI Dashboard
SFdiff : Diffusion Model with Self-Generation for Probabilistic Forecasting Diffusion models have emerged as an effective approach for time-series probabilistic forecasting, aiming to generate future observations based on historical data through a denoising process. In this p... 3.50 3% See Reviews View AI Dashboard
Provably Efficient Policy-Reward Co-Pretraining for Adversarial Imitation Learning Adversarial imitation learning (AIL) achieves superior expert sample efficiency compared to behavioral cloning (BC) but requires extensive online environment interactions. Recent empirical works have ... 4.00 3% See Reviews View AI Dashboard
Diffusion Bridge Variational Inference for Deep Gaussian Processes Deep Gaussian processes (DGPs) enable expressive hierarchical Bayesian modeling but pose substantial challenges for posterior inference, especially over inducing variables. Denoising diffusion variati... 6.00 60% See Reviews View AI Dashboard
Endogenous Communication in Repeated Games with Learning Agents Communication among learning agents often emerges without explicit supervision. We study endogenous protocol formation in infinitely repeated stage games with a costless pre-play channel. Each agent h... 2.00 43% See Reviews View AI Dashboard
Provable Guarantees for Automated Circuit Discovery in Mechanistic Interpretability *Automated circuit discovery* is a central tool in mechanistic interpretability for identifying the internal components of neural networks responsible for specific behaviors. While prior methods have ... 7.33 0% See Reviews View AI Dashboard
From Curiosity to Caution: Mitigating Reward Hacking for Best-of-$N$ with Pessimism Inference-time compute scaling has emerged as a powerful paradigm for improving language model performance on a wide range of tasks, but the question of how best to use the additional compute remains ... 6.00 0% See Reviews View AI Dashboard
PreviousPage 10 of 390 (19490 total rows)Next