DyCodeExplainer: Explainable Dynamic Graph Attention for Multi-Agent Reinforcement Learning in Collaborative Coding
Soundness: 2: fair
Presentation: 3: good
Contribution: 2: fair
Rating: 2: reject
Confidence: 3: You are fairly confident in your assessment. It is possible that you did not understand some parts of the submission or that you are unfamiliar with some pieces of related work. Math/other details were not carefully checked.
This paper proposes **DyCodeExplainer**, a framework designed to enhance both the performance and interpretability of multi-agent reinforcement learning systems for collaborative programming. The model uses a **dynamic graph attention network** to model agent interactions, where edges are gated by a sparsity-inducing hard threshold. It also introduces a **hybrid explainability module** that combines gradient-based attribution with symbolic rule extraction, aiming to produce interpretable reasoning traces for each coding action. The experiments demonstrate improved correctness and communication efficiency on two collaborative coding benchmarks.
* Tackles an emerging and relevant problem—explainable multi-agent collaboration for code generation.
* The dynamic attention mechanism is conceptually straightforward and may encourage efficient communication.
* The combination of symbolic rules and neural explanations is an interesting hybrid approach that could inspire follow-up work.
* The paper is clearly structured and includes good qualitative visualizations of attention maps.
* The hard gating function is non-differentiable, yet the paper does not describe how gradients are approximated (e.g., via a straight-through estimator or a Gumbel-softmax relaxation; see the sketch after this list).
* The rule learning process is underdefined—there is no description of the search space or optimization mechanism for symbolic rules.
* The human evaluation used to justify the explainability claims lacks methodological rigor (no inter-rater reliability, sample size, or blinding).
* Experimental validation is narrow (two datasets) and lacks cross-language or cross-agent generalization tests.
* The proposed approach, while creative, seems only partially implemented and evaluated.
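For concreteness, here is a minimal sketch of the kind of straight-through relaxation the authors would need to specify; this is my own illustrative code, not taken from the paper, and `scores` and `tau` are placeholder names for the attention logits and the gate threshold.

```python
# Hypothetical sketch of a hard edge gate trained with a straight-through
# estimator (illustrative only, not from the paper).
import torch

def hard_gate_ste(scores: torch.Tensor, tau: torch.Tensor) -> torch.Tensor:
    soft = torch.sigmoid(scores - tau)   # differentiable surrogate of the gate
    hard = (soft > 0.5).float()          # non-differentiable 0/1 edge mask
    # Forward pass returns the hard mask; the backward pass routes gradients
    # through the sigmoid surrogate (and hence to both `scores` and `tau`).
    return hard + soft - soft.detach()
```

Whether the paper uses something like this, a Gumbel-softmax sample, or simply stops gradients at the gate directly affects training stability and the learned sparsity, so it should be stated explicitly.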
1. How is the threshold parameter for the hard gate trained or tuned?
2. How are the symbolic rules parameterized and optimized alongside the neural model?
3. Can you provide more methodological details on your human evaluation protocol?
4. Have you tried the method on different coding environments or programming languages?
Fully AI-generated

---

DyCodeExplainer: Explainable Dynamic Graph Attention for Multi-Agent Reinforcement Learning in Collaborative Coding
Soundness: 2: fair
Presentation: 1: poor
Contribution: 1: poor
Rating: 2: reject
Confidence: 4: You are confident in your assessment, but not absolutely certain. It is unlikely, but not impossible, that you did not understand some parts of the submission or that you are unfamiliar with some pieces of related work.
This paper proposes a computational framework that combines dynamic graph attention and explainability to improve multi-agent performance on a collaborative coding task. The authors compare the proposed method against baseline MARL methods, and the results indicate that DyCodeExplainer achieves better performance in terms of correctness and efficiency.
The idea of combining dynamic graph attention and explainability in MARL is relatively novel.
The literature review section misses important work in the field when identifying the gap. In Section 2.3 the references are survey papers; I would recommend citing the specific empirical works. Listed below are a few references to start with. There is prior work using an attention mechanism [6] or a gating function [1, 2] to communicate selectively given the task context. In the emergent communication community, researchers have also worked on improving communication interpretability by rewarding agents for communicating in a semantically meaningful space [5] or by directly aligning the agent communication space with the human natural language space [3, 4].
[1] Learning when to Communicate at Scale in Multiagent Cooperative and Competitive Tasks
[2] Interpretable learned emergent communication for human-agent teams
[3] Multi-agent cooperation and the emergence of (natural) language
[4] Language grounded multi-agent reinforcement learning with human-interpretable communication
[5] Emergent discrete communication in semantic spaces
[6] Multi-agent graph-attention communication and teaming
The target task “collaborative coding” is not explained before being used. I would recommend properly defining the task space (number of agents, form of communication, observation and action space) and the RL learning objective in a separate problem formulation section.
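For example (my own notation, not taken from the paper), even a compact Dec-POMDP-style statement would help: a tuple $\langle \mathcal{N}, \mathcal{S}, \{\mathcal{A}_i\}, \{\Omega_i\}, P, O, R, \gamma \rangle$, where $\mathcal{N}$ is the set of coding agents, $\Omega_i$ is agent $i$'s observation space (e.g., the shared code state plus received messages), $\mathcal{A}_i$ is its action space (code edits and/or messages), and the shared objective is $J(\pi) = \mathbb{E}\big[\sum_t \gamma^t R(s_t, \mathbf{a}_t)\big]$ with $R$ reflecting test-case correctness and communication cost. Making these elements explicit would also clarify how the gating and explainability modules enter the learning problem.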
The proposed framework is not clearly explained in the method section, and details needed to reproduce the work are missing. Figure 1 should be extended to include the interaction with the external task environment, and it would be helpful to provide a concrete example of an interaction between agents in the target task context.
Human evaluation details are missing. For example, how many annotators were recruited? What instructions and explanation materials were shown to participants for the evaluation? Are the reported differences statistically significant?
It's not clear to me why adding the explainability objective improves performance in the ablation study. My understanding is that explanations are generated for humans and are basically an auxiliary task that does not directly contribute to the main task objective. Could the author elaborate more on this?
Fully human-written

---

DyCodeExplainer: Explainable Dynamic Graph Attention for Multi-Agent Reinforcement Learning in Collaborative Coding
Soundness: 1: poor
Presentation: 1: poor
Contribution: 1: poor
Rating: 0
Confidence: 5: You are absolutely certain about your assessment. You are very familiar with the related work and checked the math/other details carefully.
The paper presents DyCodeExplainer, a framework that uses dynamic graph attention and explainability techniques to improve multi-agent reinforcement learning for collaborative coding.
This paper appears to be AI-generated and lacks genuine research depth. It loosely applies MARL concepts to LLM-based agent collaboration without a clear problem definition or credible technical grounding. The proposed framework mixes unrelated components like Transformer-XL, GNN, and explainability modules in a way that feels incoherent and artificial. There are essentially **no meaningful strengths**, as the work lacks novelty, theoretical justification, and real experimental validity.
1. The paper mixes multi-agent reinforcement learning, dynamic graph attention, and explainability mechanisms without establishing any clear theoretical connection, making it appear as an arbitrary combination of unrelated concepts.
2. The collaborative coding task is not naturally suited for MARL modeling and aligns more closely with LLM-based agent systems. The use of a reinforcement learning framework feels forced and unjustified, giving the impression that the work was written by someone unfamiliar with both fields who attempted to merge them superficially.
3. Although the experimental data and results appear complete, they lack real significance and reproducibility. No implementation details or accessible code are provided, making the reported findings highly questionable.
4. The writing is excessively templated, repeatedly using terms such as “dynamic graph attention” and “explainability framework,” and lacks the logical flow expected in genuine academic writing.
5. The proposed joint optimization objective and explainability modules have no theoretical grounding or derivation; the equations are merely formal decorations without substance.
6. The so-called “collaborative coding” task is poorly defined, with no clear description of the environment or the concrete interaction mechanisms among agents.
Overall, this paper appears to be an AI-generated pseudo-academic text that is formally structured but substantively empty, lacking genuine research foundation, methodological rigor, and technical credibility.
This paper is clearly an AI-generated and fabricated manuscript with empty content, incoherent logic, and no genuine experiments or theoretical grounding. The text shows extensive signs of patching and template-based generation, with irrelevant citations, mismatched methods and tasks, and even indications of fabricated experimental data and results. It wastes the reviewers' time and undermines the seriousness and integrity of the academic review process. It is strongly recommended that the conference committee verify the authors' identities and the source of the submission and hold them accountable for this misconduct.
Fully AI-generated

---

DyCodeExplainer: Explainable Dynamic Graph Attention for Multi-Agent Reinforcement Learning in Collaborative Coding
Soundness: 2: fair
Presentation: 3: good
Contribution: 2: fair
Rating: 6: marginally above the acceptance threshold
Confidence: 2: You are willing to defend your assessment, but it is quite likely that you did not understand the central parts of the submission or that you are unfamiliar with some pieces of related work. Math/other details were not carefully checked.
This paper presents DyCodeExplainer, a novel multi-agent reinforcement learning (MARL) framework that integrates dynamic graph attention with explainability techniques to enhance collaborative coding. The idea is innovative and addresses critical challenges in message prioritization and decision-making transparency within collaborative coding environments.
The integration of dynamic graph attention networks (DGAT) with hybrid explainability techniques is a novel approach that effectively captures the evolving nature of agent interactions in collaborative coding tasks.
The combination of gradient-based attribution and rule-based post-hoc explanations provides both precise importance scoring and intuitive rationale generation, addressing the semantic gap between low-level attention weights and high-level coding logic.
The dependency on predefined rule templates for post-hoc explanations requires manual engineering effort, which may limit scalability and adaptability to new coding domains or languages.
No
Fully AI-generated |