|
scDFM: Distributional Flow Matching Model for Robust Single-Cell Perturbation Prediction |
A central goal in systems biology and drug discovery is to predict the transcriptional response of cells to perturbations. This task is challenging due to the noisy, sparse nature of single-cell measu... |
4.00 |
52% |
See Reviews |
View AI Dashboard |
|
Rectifying Adaptive Learning Rate Variance via Confidence Estimation |
Recent advances in training physics-informed neural networks (PINNs) highlight the effectiveness of second-order optimization methods. Adaptive variants such as AdaHessian, Sophia, and SOAP leverage a... |
4.00 |
36% |
See Reviews |
View AI Dashboard |
|
Beyond Neural Incompatibility: Easing Cross-Scale Knowledge Transfer in Large Language Models through Latent Semantic Alignment |
Large Language Models (LLMs) encode vast amounts of knowledge in their massive parameters, which is accessible to locate, trace, and analyze.
Despite advances in neural interpretability, it is still n... |
3.33 |
5% |
See Reviews |
View AI Dashboard |
|
When Empowerment Disempowers |
Empowerment, a measure of an agent's ability to control its environment, has been proposed as a universal goal-agnostic objective for motivating assistive behavior in AI agents. While multi-human sett... |
3.00 |
0% |
See Reviews |
View AI Dashboard |
|
Revisit What You See: Disclose Language Prior in Vision Tokens for LVLM Decoding |
Large Vision–Language Models (LVLMs) achieve strong performance across multimodal tasks by integrating visual perception with language understanding.
However, how vision information contributes to th... |
2.67 |
9% |
See Reviews |
View AI Dashboard |
|
PCPO: Proportionate Credit Policy Optimization for Preference Alignment of Image Generation Models |
While reinforcement learning has advanced the alignment of text-to-image (T2I) models, state-of-the-art policy gradient methods are still hampered by training instability and high variance, hindering ... |
5.50 |
6% |
See Reviews |
View AI Dashboard |
|
Modular Multimodal Alignment using Time-Series EHR Data for Enhancing Medical Image Classification |
State-of-the-Art (SOTA) medical image classification models are generally pre-trained with large-scale data via self-supervised learning frameworks, to obtain high-quality image representations for ra... |
1.50 |
41% |
See Reviews |
View AI Dashboard |
|
SparCas: A Dimension-First Cascade for Efficient Long-Context LLM Inference |
Large language models (LLMs) have demonstrated strong capability in handling long-context sequences, but inference efficiency is bottlenecked by the continuously growing KV cache. KV cache selection m... |
3.50 |
55% |
See Reviews |
View AI Dashboard |
|
Randomization Boosts KV Caching, Learning Balances Query Load: A Joint Perspective |
KV caching is a fundamental technique for accelerating Large Language Model (LLM) inference by reusing key-value (KV) pairs from previous queries, but its effectiveness under limited memory is highly ... |
6.50 |
0% |
See Reviews |
View AI Dashboard |
|
Dreamland: Hybrid World Creation with Simulator and Generative Models |
Large-scale video generative models can synthesize diverse and realistic visual content for dynamic world creation, but they often lack element-wise controllability, hindering their use in editing sce... |
5.00 |
0% |
See Reviews |
View AI Dashboard |
|
OAR: Training Quantization-Friendly Object Detectors via Outlier-Aware Restriction |
Model quantization is widely employed to reduce computational resource usage during inference, often in conjunction with specialized hardware system for acceleration. While modern object detectors per... |
2.00 |
0% |
See Reviews |
View AI Dashboard |
|
COIG-Writer: A High-Quality Chinese Creative Writing with Thought Process Dataset |
Large language models exhibit systematic deficiencies in creative writing, particularly in non-English contexts where training data is scarce and lacks process-level supervision. We present COIG-Write... |
3.00 |
95% |
See Reviews |
View AI Dashboard |
|
Prior-based 4D Human-Scene Reconstruction from Monocular Videos |
Accurately capturing dynamic humans as they interact with their 3D environment from a single camera is a pivotal goal for applications spanning from assistive robotics to AR. However, current monocula... |
3.50 |
13% |
See Reviews |
View AI Dashboard |
|
A solvable model of inference-time scaling |
Recent developments in large language models have shown advantages in reallocating a notable share of computational resource from training time to inference time. However, the principles behind infere... |
3.00 |
0% |
See Reviews |
View AI Dashboard |
|
On Universality of Deep Equivariant Networks |
Universality results for equivariant neural networks remain rare. Those that do exist typically hold only in restrictive settings: either they rely on regular or higher-order tensor representations, l... |
5.00 |
1% |
See Reviews |
View AI Dashboard |
|
Proximal Diffusion Neural Sampler |
The task of learning a diffusion-based neural sampler for drawing samples from an unnormalized target distribution can be viewed as a stochastic optimal control problem on path measures. However, the ... |
6.50 |
0% |
See Reviews |
View AI Dashboard |
|
GRID: Scalable Task-Agnostic Prompt-Based Continual Learning for Language Models |
Prompt-based continual learning (CL) provides a parameter-efficient approach for adapting large language models (LLMs) across task sequences. However, most existing methods rely on task-aware inferenc... |
3.00 |
27% |
See Reviews |
View AI Dashboard |
|
Learning Mutation-Aware Visual Context for Antibody-Antigen Affinity Maturation Prediction |
Modeling the impact of amino acid mutations on antibody–antigen binding affinity is critical for therapeutic antibody design. Existing structure-based deep learning approaches can capture structural d... |
3.33 |
54% |
See Reviews |
View AI Dashboard |
|
DAMR: Efficient and Adaptive Context-Aware Knowledge Graph Question Answering with LLM-Guided MCTS |
Knowledge Graph Question Answering (KGQA) aims to interpret natural language queries and perform structured reasoning over knowledge graphs by leveraging their relational and semantic structures to re... |
6.00 |
61% |
See Reviews |
View AI Dashboard |
|
Lifelong Unlearning for Multimodal Large Language Models |
Multimodal large language models (MLLMs) are trained on massive multimodal data, making data unlearning increasingly important as data owners may request the removal of specific content. In practice, ... |
4.00 |
0% |
See Reviews |
View AI Dashboard |
|
When Small Models Team Up: A Weak‑Expert Ensemble Surpassing LLMs for Automated Intellectual‑Property Audits |
Intellectual Property Rights (IPR) enforcement on e-commerce platforms is increasingly challenged by the scale and complexity of modern online marketplaces, where counterfeit goods and brand infringem... |
3.00 |
50% |
See Reviews |
View AI Dashboard |
|
Beyond 'Templates': Category-Agnostic Object Pose, Size, and Shape Estimation from a Single View |
Estimating an object’s 6D pose, size, and shape from visual input is a fundamental problem in computer vision, with critical applications in robotic grasping and manipulation. Existing methods either ... |
5.00 |
27% |
See Reviews |
View AI Dashboard |
|
MedBLINK: Probing Basic Perception in Multimodal Language Models for Medicine |
Multimodal language models (MLMs) show promise for clinical decision support and diagnostic reasoning, raising the prospect of end-to-end automated medical image interpretation. However, clinicians ar... |
3.20 |
11% |
See Reviews |
View AI Dashboard |
|
SWE-Perf: Can Language Models Optimize Code Performance on Real-World Repositories? |
Code performance optimization is paramount in real-world software engineering and critical for production-level systems. While Large Language Models (LLMs) have demonstrated impressive capabilities in... |
4.00 |
1% |
See Reviews |
View AI Dashboard |
|
MeInTime: Bridging Age Gap in Identity-Preserving Face Restoration |
To better preserve an individual's identity, face restoration has evolved from reference-free to reference-based approaches, which leverage high-quality reference images of the same identity to enhanc... |
5.00 |
0% |
See Reviews |
View AI Dashboard |
|
From Bits to Chips: An LLM-based Hardware-Aware Quantization Agent for Streamlined Deployment of LLMs |
Deploying models, especially large language models (LLMs), is becoming increasingly attractive to a broader user base, including those without specialized expertise. However, due to the resource const... |
4.00 |
24% |
See Reviews |
View AI Dashboard |
|
Expressive Value Learning for Scalable Offline Reinforcement Learning |
Reinforcement learning (RL) is a powerful paradigm for learning to make sequences of decisions. However, RL has yet to be fully leveraged in robotics, principally due to its lack of scalability. Offli... |
5.00 |
0% |
See Reviews |
View AI Dashboard |
|
BIRD: Behavior Induction via Representation-structure Distillation |
Human-aligned deep learning models exhibit behaviors consistent with human values, such as robustness, safety, and fairness. Transferring these behavioral properties to models trained on different ta... |
5.00 |
27% |
See Reviews |
View AI Dashboard |
|
Dchi-Stencil: A Differential Privacy Mechanism for Interacting with LLMs |
The use of language models as remote services requires transmitting private information to external providers, raising significant privacy concerns.
This process not only risks exposing sensitive dat... |
4.00 |
6% |
See Reviews |
View AI Dashboard |
|
Cooperative TD Learning in Heterogeneous Environments via Joint Linear Approximation |
We study cooperative temporal-difference (TD) learning with heterogeneous agents, where a collection of agents interacts with heterogeneous environments, yet tries to accelerate learning via seeking c... |
3.50 |
0% |
See Reviews |
View AI Dashboard |
|
Pseudo Multi-Source Domain Generalization: Bridging the Gap Between Single and Multi-Source Domain Generalization |
Deep learning models often struggle to maintain performance when deployed on data distributions different from their training data, particularly in real-world applications where environmental conditio... |
2.00 |
37% |
See Reviews |
View AI Dashboard |
|
Sparse-Compression Diffusion Models |
Diffusion models have demonstrated capabilities in successful generating synthetic data, especially the pictural ones. However, the conventional reverse diffusion process merely overfits in data denoi... |
3.00 |
48% |
See Reviews |
View AI Dashboard |
|
PriorGuide: Test-Time Prior Adaptation for Simulation-Based Inference |
Amortized simulator-based inference offers a powerful framework for tackling Bayesian inference in computational fields such as engineering or neuroscience, increasingly leveraging modern generative m... |
6.80 |
11% |
See Reviews |
View AI Dashboard |
|
A Comprehensive Benchmark for RNA 3D Structure-Function Modeling |
The relationship between RNA structure and function has recently attracted interest within the deep learning community, a trend expected to intensify as nucleic acid structure models advance.
Despite ... |
2.50 |
0% |
See Reviews |
View AI Dashboard |
|
UniRestorer: Universal Image Restoration via Adaptively Estimating Image Degradation at Proper Granularity |
Recently, considerable progress has been made in all-in-one image restoration. Generally, existing methods can be degradation-agnostic or degradation-aware. However, the former are limited in leveragi... |
6.00 |
0% |
See Reviews |
View AI Dashboard |
|
Test-time scaling of diffusions with flow maps |
A common recipe to improve diffusion models at test-time so that samples score highly against a user-specified reward is to introduce the gradient of the reward into the dynamics of the diffusion itse... |
5.00 |
0% |
See Reviews |
View AI Dashboard |
|
Background Blurring Matters: Improving Visual Grounding by Merging Text-Irrelevant Tokens |
Visual grounding (VG) aims to precisely localize the object in input images based on its natural language descriptions. Most recently proposed methods deal with this task with transformer-based archit... |
4.67 |
3% |
See Reviews |
View AI Dashboard |
|
Beyond Identity: High-Fidelity Face Swapping by Preserving Source Video Attributes |
Video face swapping is crucial in film and entertainment production, where achieving high fidelity and temporal consistency over long and complex video sequences remains a significant challenge.
... |
3.50 |
24% |
See Reviews |
View AI Dashboard |
|
A Dual-Branch Feature Fusion Framework based on GenAI for Robust Detection of Medical CT Image Tampering |
CT images are a critical diagnostic tool in modern medicine, yet they face risks to image authenticity posed by diverse image tampering techniques, which could disrupt the normal medical order and the... |
2.50 |
77% |
See Reviews |
View AI Dashboard |
|
Neural Theorem Proving for Verification Conditions: A Real-World Benchmark |
Theorem proving is fundamental to program verification, where the automated proof of Verification Conditions (VCs) remains a primary bottleneck. Real-world program verification frequently encounters h... |
5.50 |
5% |
See Reviews |
View AI Dashboard |
|
The Three Regimes of Offline-to-Online Reinforcement Learning |
Offline-to-online reinforcement learning (RL) has emerged as a practical paradigm that leverages offline datasets for pretraining and online interactions for fine-tuning. However, its empirical behavi... |
5.00 |
12% |
See Reviews |
View AI Dashboard |
|
DVLA-RL: Dual-Level Vision–Language Alignment with Reinforcement Learning Gating for Few-Shot Learning |
Few-shot learning (FSL) aims to generalize to novel categories with only a few samples. Recent approaches incorporate large language models (LLMs) to enrich visual representations with semantic embedd... |
4.67 |
18% |
See Reviews |
View AI Dashboard |
|
AlphaBench: Benchmarking Large Language Models in Formulaic Alpha Factor Mining |
Formulaic alpha factor mining (FAFM) is a central problem in quantitative investment, where interpretable formulas are designed to extract predictive signals from historical financial series. With the... |
6.00 |
1% |
See Reviews |
View AI Dashboard |
|
Agent-Chained Policy Optimization |
We study Cooperative Multi-Agent Reinforcement Learning (MARL), where the aim is to train decentralized policies that maximize a shared return. Existing methods typically employ either iterative best-... |
3.00 |
2% |
See Reviews |
View AI Dashboard |
|
Implicit Reconstruct Spatiotemporal Super-Resolution Microscopy in Arbitrary Dimension |
High-resolution 4D fluorescence microscopy imaging, essential for deciphering dynamic biological processes, is typically challenged by insufficient spatiotemporal resolutions. Phototoxicity, photoblea... |
5.00 |
6% |
See Reviews |
View AI Dashboard |
|
DiaBlo: Diagonal Blocks Are Sufficient For Finetuning |
Fine-tuning is a critical step for adapting large language models (LLMs) to domain-specific downstream tasks. To mitigate the substantial computational and memory costs of full-model fine-tuning, Para... |
5.50 |
24% |
See Reviews |
View AI Dashboard |
|
SimPlex-GT: A Simple Node-to-Cluster Graph Transformer for synergizing homophily and heterophily in Complex Graphs |
Graph neural networks (GNNs) have proven effective on homophilic graphs, where connected nodes share similar features. However, real-world graphs often exhibit mixed patterns including heterophily, wh... |
4.00 |
8% |
See Reviews |
View AI Dashboard |
|
UTFC-DiffTracker: Short- and Long-Range Temporal Feature Consistency Diffusion for Underwater Object Tracking |
Underwater object tracking (UOT) plays a significant role in marine animal protection, underwater search and rescue, and maritime security, yet faces distinctive challenges including color distortion,... |
4.00 |
64% |
See Reviews |
View AI Dashboard |
|
Adaptive Multi-Prototype Grouping Alignment for Domain Adaptive 3D Detection |
3D object detection is crucial for autonomous driving and robotics, but models often perform poorly when deployed in new environments due to domain shifts. While 3D unsupervised domain adaptation meth... |
3.33 |
0% |
See Reviews |
View AI Dashboard |
|
From Solo to Symphony: Orchestrating Multi-Agent Collaboration with Single-Agent Demos |
Training a team of agents from scratch in multi-agent reinforcement learning (MARL) is highly inefficient, much like asking beginners to play a symphony together without first practicing solo. Existin... |
3.20 |
14% |
See Reviews |
View AI Dashboard |