📝 Publications

(* Equal contribution, † Corresponding author)

🤔 LLM Reasoning, Decoding & Code Intelligence

ICML-2026

Pull Requests as a Training Signal for Repo-Level Code Editing
Qinglin Zhu, Tianyu Chen, Shuai Lu, Lei Ji, Runcong Zhao, Murong Ma, Xiangxiang Dai, Yulan He, Lin Gui, Peng Cheng, Yeyun Gong.

Introduces Clean-PR, a pipeline that converts noisy PR diffs into structured edit blocks, yielding 2M training samples across 12 languages.
Achieves +13.6% on SWE-bench Lite and +12.3% on SWE-bench Verified, demonstrating the value of real-world PRs for repo-level code editing.

ICML-2025 Spotlight

Proposes an embedding-based search framework that guides LLM generation by optimising the first token’s embedding.
Combines embedding perturbation for controlled exploration with Bayesian optimisation via a verifier-guided objective, balancing exploration and exploitation.

ACL-2024

Proposes Mirror, enabling LLMs to reflect from multiple perspectives via Navigator–Reasoner cooperation.
Encourages both diverse and consistent reasoning to overcome self-reflection traps.

Preprint

Latent Refinement Decoding: Enhancing Diffusion-Based Language Models by Refining Belief States
Qinglin Zhu, Yizhen Yao, Runcong Zhao, Yanzheng Xiang, Amrutha Saseendran, Chen Jin, Philip Alexander Teare, Bin Liang, Yulan He, Lin Gui.

Proposes Latent Refinement Decoding (LRD), a two-stage framework that tackles information loss and premature commitment in diffusion-based language models via latent refinement and predictive feedback.
Enables faster, globally consistent parallel generation as a principled alternative to autoregressive decoding.

ACL-2026 Demo

Proposes SymbolicThought, a human-in-the-loop system combining LLM extraction and symbolic reasoning for character relationship understanding.
Supports editable relationship graphs, logical constraints, and interactive conflict resolution.

ACL-2024 Findings

Existing datasets for narrative understanding often fail to represent the complexity and uncertainty of relationships in real-life social scenarios.
To address this gap, we introduce a new benchmark, Conan, designed for extracting and analysing intricate character relation graphs from detective narratives.

Preprint

We propose PLAYER*, a novel framework for Murder Mystery Games (剧本杀) using an anytime sampling-based planner and a questioning-driven search framework.

AAAI-2026 Oral

Proposes SPS, a supervision-free metric to assess semantic alignment between retrieved summaries and LLM representations.
Introduces xCompress, an inference-time controller that ranks and compresses retrievals to improve generation and clarify retrieval–generation interaction.

ACL-2025

Proposes EmbQA, an embedding-level framework for open-domain QA that optimizes retrieval with unsupervised contrastive learning and improves answer diversity via exploratory embeddings.

Preprint

Proposes xMemory, which decouples agent memories into semantic components and organises them hierarchically.
Retrieves via top-down aggregation to capture diverse themes, outperforming standard RAG on long-horizon agent tasks.

ACL-2022 JointCL: A Joint Contrastive Learning Framework for Zero-Shot Stance Detection
Bin Liang*, Qinglin Zhu* , Xiang Li, Min Yang, Lin Gui, Yulan He, and Ruifeng Xu.
SIGIR-2022 Enhancing Zero-Shot Stance Detection via Targeted Background Knowledge
Qinglin Zhu, Bin Liang, Jingyi Sun, Jiachen Du, Lanjun Zhou, and Ruifeng Xu.
CCL-2020 Attention-based Recurrent Network Combined with Financial Lexicon for Aspect-level Sentiment Classification
Qinglin Zhu, Bin Liang, Liuyu Han, Yi Chen, Ruifeng Xu, and Ruibin Mao.
LREC-2020 Target-based sentiment annotation in Chinese financial news
Chaofa Yuan, Yuhan Liu, Rongdi Yin, Jun Zhang, Qinglin Zhu, Ruibin Mao, Ruifeng Xu.

ACL-2022 Have my arguments been replied to? Argument Pair Extraction as Machine Reading Comprehension
Jianzhu Bao, Jingyi Sun, Qinglin Zhu, and Ruifeng Xu.
SemEval-2021 HITSZ-HLT at SemEval-2021 Task 5: Ensemble Sequence Labeling and Span Boundary Detection for Toxic Span Detection
Qinglin Zhu, Zijie Lin, Yice Zhang, Jingyi Sun, Xiang Li, Qihui Lin, Yixue Dang, and Ruifeng Xu.
NLPCC-2021 A Hierarchical Sequence Labeling Model for Argument Pair Extraction
Jingyi Sun, Qinglin Zhu, Jianzhu Bao, Jipeng Wu, Caihua Yang, Rui Wang, and Ruifeng Xu.