ICLR 2026 - Reviews

SubmissionsReviews

Reviews

Summary Statistics

EditLens Prediction Count Avg Rating Avg Confidence Avg Length (chars)
Fully AI-generated 2 (50%) 7.00 3.50 3165
Heavily AI-edited 0 (0%) N/A N/A N/A
Moderately AI-edited 1 (25%) 6.00 4.00 2301
Lightly AI-edited 0 (0%) N/A N/A N/A
Fully human-written 1 (25%) 4.00 5.00 2140
Total 4 (100%) 6.00 4.00 2693
Title Ratings Review Text EditLens Prediction
STDDN: A Physics-Guided Deep Learning Framework for Crowd Simulation Soundness: 2: fair Presentation: 2: fair Contribution: 2: fair Rating: 4: marginally below the acceptance threshold Confidence: 5: You are absolutely certain about your assessment. You are very familiar with the related work and checked the math/other details carefully. This work proposes a physics-guided deep learning framework for crowd simulation (STDNN). The authors introduce the continuity equation from fluid dynamics as a strong physical constraint and design a density-velocity coupled dynamic graph learning module. They show that STDNN is significantly superior to simulation performance compared to SOTA methods. 1. The authors propose a network of time-space decoupled differential equations combined with the continuity equation, which is helpful for predicting the physical laws of trajectories in the macroscopic world. 2. In experiments, Tables 1 and 2 clearly illustrate the trajectories and verifies main results from the paper. 1. The proposed method uses Neural ODEs to solve $\rho$, but there are many similar ideas, and the use of Neural ODEs in trajectory prediction is also a very common approach. 2. The proposed method utilizes constraints based on continuity equations, but the specific implementation of this constraint in the Neural ODE framework requires more detailed explanation. 3. The authors conducted many experiments, but it seems that it is necessary to split each subset of the dataset and compare with newer baselines and methods. The current baseline only reaches the year 2024. Based on the trajectory dataset used by the authors, it seems that there are a large number of sota in pedestrian trajectory prediction that have not been compared. 1. How is the continuity equation incorporated into the Neural ODE solution process? It requires a more detailed explanation. 2. The detailed parameters used when solving the Neural ODE in torchdiffeq are not disclosed. 3. Figure 1 contains many typo errors. ⦁ For example, $Gin$($Gout$) should actually be $G_{in}$($G_{out}$). ⦁ The input to Microscopic seems to be $\pho^0$. ⦁ The DDM and CGD in the figure are also too simple. 4. Should the use of the loss function in Eq 10 be more explicit? Eq 8 does not seem to be included in it. 5. Are there more granular comparative tests, such as what the results were for ETH/HOTEL/ZARA1/ZARA2/UNIV, respectively? 6. Should ADE and FDE also be reported for general trajectories? Fully human-written
STDDN: A Physics-Guided Deep Learning Framework for Crowd Simulation Soundness: 3: good Presentation: 4: excellent Contribution: 3: good Rating: 6: marginally above the acceptance threshold Confidence: 4: You are confident in your assessment, but not absolutely certain. It is unlikely, but not impossible, that you did not understand some parts of the submission or that you are unfamiliar with some pieces of related work. This paper proposes STDDN, a novel framework for crowd simulation that addresses the common issues of error accumulation and physical inconsistency in long-term predictions. Its core contribution is the unique integration of a macroscopic physical law—the continuity equation from fluid dynamics—with a microscopic deep learning model for trajectory prediction. By using a Neural ODE to model crowd density evolution, STDDN enforces a strong physical constraint during training. Experiments show that STDDN not only achieves state-of-the-art accuracy but also significantly reduces inference latency compared to leading methods. The paper's primary strength is its originality in creating a macro-micro coupled framework. Using the continuity equation to regularize trajectory prediction is a conceptually novel and powerful idea for this field. The quality of the work is good, supported by rigorous and comprehensive experiments that convincingly demonstrate superior performance in both accuracy and efficiency over strong baselines. The paper is also written with exceptional clarity. - **Lack of Direct Physical Metrics**: The paper claims to improve physical realism by avoiding issues like congestion and collisions, but it fails to provide direct quantitative evidence. The evaluation relies on general error metrics (MAE/OT), which are insufficient proxies. The work would be much stronger if it included systematic measurements and comparisons of collision rates, obstacle penetration rates or density extremum analysis to directly support its core claims. - **Training Cost**: While the paper rightly emphasizes its fast inference speed, it completely neglects to discuss the training cost. The use of a Neural ODE likely makes the training process computationally expensive and slow. An additional analysis should be included in the paper. - **A minor issue**: the table in page 8 has a wrong caption: "**Figure** 4". - The fluid dynamics assumption is a strong prior. Could you clarify the intended scope of your method? In which crowd scenarios (e.g., panic, counterflow) might this assumption become a limitation? - Given the model's sensitivity to grid size, can you offer any practical guidelines or a more principled approach for selecting this crucial hyperparameter for new scenes? Moderately AI-edited
STDDN: A Physics-Guided Deep Learning Framework for Crowd Simulation Soundness: 4: excellent Presentation: 3: good Contribution: 4: excellent Rating: 8: accept, good paper Confidence: 4: You are confident in your assessment, but not absolutely certain. It is unlikely, but not impossible, that you did not understand some parts of the submission or that you are unfamiliar with some pieces of related work. This paper proposes STDDN (Spatio-Temporal Decoupled Differential Equation Network), a novel physics-guided deep learning framework for crowd simulation. STDDN explicitly combines microscopic trajectory prediction with macroscopic density evolution by embedding the continuity equation from fluid dynamics into a Neural ODE structure. The model separates local trajectory dynamics from global density fields, enabling physical consistency and stable long-term simulations. Experiments on four real-world crowd datasets (GC, UCY, ETH, HOTEL) show that STDDN outperforms prior physics-guided baselines such as SPDiff and PCS in both accuracy and inference speed. 1. Good motivation on coupling of micro- and macro-level dynamics. The paper’s main contribution is conceptually sound. By using the continuity equation as a bridge between trajectory prediction and density evolution, STDDN unifies local motion modeling with global flow consistency. 2. Physically meaningful ODE formulation. The introduction of a Neural ODE to simulate density evolution is well justified. It provides continuous-time reasoning while enforcing conservation principles, addressing a key limitation of purely data-driven models that tend to accumulate errors over time. 3. Strong empirical performance. Across four datasets, STDDN shows consistent gains over all baselines, including both physics-based and deep learning methods. The improvements in both accuracy and latency demonstrate that the proposed framework is practically beneficial. 4. Interpretability and efficiency. The method retains interpretability through its physically grounded formulation while remaining computationally tractable, which is uncommon in physics-guided models. 1. Limited experimental diversity. Although the method is tested on multiple datasets, all belong to similar crowd domains. It would strengthen the generality claim to include different physical systems, such as vehicle or swarm simulation. 2. Ablation breadth. The ablation study is informative but it would be useful to show how performance changes under different ODE solvers or with alternative coupling strengths between density and trajectory modules. 3. Minor missing citations for ODE-based trajectory forecasting. The paper would benefit from acknowledging prior studies that have already explored ODE formulations for trajectory or crowd prediction, such as Social ODE: Multi-agent Trajectory Forecasting with Neural Ordinary Differential Equations (ECCV 2022) and Improving Transferability for Cross-Domain Trajectory Prediction via Neural Stochastic Differential Equation (AAAI 2024). These works share conceptual overlap in embedding physical dynamics into continuous differential frameworks. Please see the weakness section Fully AI-generated
STDDN: A Physics-Guided Deep Learning Framework for Crowd Simulation Soundness: 3: good Presentation: 3: good Contribution: 3: good Rating: 6: marginally above the acceptance threshold Confidence: 3: You are fairly confident in your assessment. It is possible that you did not understand some parts of the submission or that you are unfamiliar with some pieces of related work. Math/other details were not carefully checked. This paper proposes STDDN (Spatio-Temporal Decoupled Differential Equation Network), a novel physics-guided deep learning framework for crowd simulation. Unlike prior microscopic or purely data-driven approaches, STDDN introduces a Neural ODE formulation guided by the continuity equation from fluid dynamics, thereby coupling macroscopic density evolution with microscopic trajectory prediction. The model integrates three modules — Differentiable Density Mapping (DDM), Continuous Cross-Grid Detection (CGD), and Node Embedding (NE) — to ensure differentiability and physical consistency. Experiments on four real-world datasets (GC, UCY, ETH, HOTEL) show that STDDN significantly improves both simulation accuracy and inference speed compared with state-of-the-art baselines such as SPDiff and PCS. 1、Novel Integration of Physics and Deep Learning: The paper introduces a principled way to integrate the continuity equation into deep models for crowd simulation. This macro–micro coupling via Neural ODE is both original and physically interpretable. 2、Methodological Sophistication: The DVCG module cleverly connects density and velocity fields through a graph structure, while the DDM and CGD modules effectively address gradient discontinuity and cross-grid flux detection. These designs are mathematically sound and technically detailed. 3、Interpretability and Physical Consistency: The approach offers clear interpretability grounded in physics, addressing a key limitation of previous purely data-driven models that violate conservation laws 1、The proposed model enforces strict mass conservation through the continuity equation, implying that the total population density within the target spatial domain remains constant over time. However, in realistic datasets and surveillance scenarios, the number of pedestrians in view is not fixed — new individuals may enter the scene, and others may leave. Such open-world dynamics inherently violate the closed-system assumption of the continuity equation. Without explicit treatment of source or sink terms (i.e., inflow/outflow of mass) or adaptive boundary conditions, the model may experience cumulative density drift or numerical instability, particularly when crowd density fluctuates significantly. The authors are encouraged to clarify whether boundary inflows are modeled, or to discuss potential modifications to better handle non-conserved population scenarios. 2、The ablation study provides useful insights, particularly regarding the contributions of the ODE solver and the mass constraint loss. Both components appear meaningful; however, the current experimental setup only uses discrete outputs in the loss computation. As a result, the experiments do not adequately demonstrate the benefit of continuous-time modeling enabled by the ODE formulation. To strengthen this section, I suggest decomposing the “w/o ODE” setting into two variants: (1)Purely autoregressive training, as mentioned in the paper (“trained using purely autoregressive methods”). (2)Discrete neural network replacement for ODE, where the ODE solver is replaced with a discrete neural module that still leverages the combined loss function including the mass constraint term. Such a refinement would better isolate the contribution of the continuous-time ODE formulation from the general modeling capacity and loss design, making the ablation analysis more convincing. Can the authors explain why the fluid physics improves results in low-density datasets like ETH and HOTEL, where the fluid assumption barely holds? Fully AI-generated
PreviousPage 1 of 1 (4 total rows)Next