Skip to main content

Showing 1–50 of 188 results for author: Ding, N

Searching in archive cs. Search in all archives.
.
  1. Semantic-enhanced Modality-asymmetric Retrieval for Online E-commerce Search

    Authors: Zhigong Zhou, Ning Ding, Xiaochuan Fan, Yue Shang, Yiming Qiu, Jingwei Zhuo, Zhiwei Ge, Songlin Wang, Lin Liu, Sulong Xu, Han Zhang

    Abstract: Semantic retrieval, which retrieves semantically matched items given a textual query, has been an essential component to enhance system effectiveness in e-commerce search. In this paper, we study the multimodal retrieval problem, where the visual information (e.g, image) of item is leveraged as supplementary of textual information to enrich item representation and further improve retrieval perform… ▽ More

    Submitted 25 June, 2025; originally announced June 2025.

    Comments: published in sigir2023

  2. arXiv:2506.18254  [pdf, ps, other

    cs.LG cs.AI cs.CL

    RLPR: Extrapolating RLVR to General Domains without Verifiers

    Authors: Tianyu Yu, Bo Ji, Shouli Wang, Shu Yao, Zefan Wang, Ganqu Cui, Lifan Yuan, Ning Ding, Yuan Yao, Zhiyuan Liu, Maosong Sun, Tat-Seng Chua

    Abstract: Reinforcement Learning with Verifiable Rewards (RLVR) demonstrates promising potential in advancing the reasoning capabilities of LLMs. However, its success remains largely confined to mathematical and code domains. This primary limitation stems from the heavy reliance on domain-specific verifiers, which results in prohibitive complexity and limited scalability. To address the challenge, our key o… ▽ More

    Submitted 22 June, 2025; originally announced June 2025.

    Comments: Project Website: https://github.com/openbmb/RLPR

  3. arXiv:2506.13329  [pdf, ps, other

    cs.CL

    EAQuant: Enhancing Post-Training Quantization for MoE Models via Expert-Aware Optimization

    Authors: Zhongqian Fu, Ning Ding, Kai Han, Xianzhi Yu, Xiaosong Li, Xinghao Chen, Yehui Tang, Yunhe Wang

    Abstract: Mixture-of-Experts (MoE) models have emerged as a cornerstone of large-scale deep learning by efficiently distributing computation and enhancing performance. However, their unique architecture-characterized by sparse expert activation and dynamic routing mechanisms-introduces inherent complexities that challenge conventional quantization techniques. Existing post-training quantization (PTQ) method… ▽ More

    Submitted 16 June, 2025; originally announced June 2025.

  4. arXiv:2506.10972  [pdf, ps, other

    cs.LG cs.AI

    Farseer: A Refined Scaling Law in Large Language Models

    Authors: Houyi Li, Wenzhen Zheng, Qiufeng Wang, Zhenyu Ding, Haoying Wang, Zili Wang, Shijie Xuyang, Ning Ding, Shuigeng Zhou, Xiangyu Zhang, Daxin Jiang

    Abstract: Training Large Language Models (LLMs) is prohibitively expensive, creating a critical scaling gap where insights from small-scale experiments often fail to transfer to resource-intensive production systems, thereby hindering efficient innovation. To bridge this, we introduce Farseer, a novel and refined scaling law offering enhanced predictive accuracy across scales. By systematically constructing… ▽ More

    Submitted 14 June, 2025; v1 submitted 12 June, 2025; originally announced June 2025.

    Comments: 34

    ACM Class: I.2

  5. arXiv:2506.07900  [pdf, ps, other

    cs.CL cs.AI

    MiniCPM4: Ultra-Efficient LLMs on End Devices

    Authors: MiniCPM Team, Chaojun Xiao, Yuxuan Li, Xu Han, Yuzhuo Bai, Jie Cai, Haotian Chen, Wentong Chen, Xin Cong, Ganqu Cui, Ning Ding, Shengdan Fan, Yewei Fang, Zixuan Fu, Wenyu Guan, Yitong Guan, Junshao Guo, Yufeng Han, Bingxiang He, Yuxiang Huang, Cunliang Kong, Qiuzuo Li, Siyuan Li, Wenhao Li, Yanghao Li , et al. (50 additional authors not shown)

    Abstract: This paper introduces MiniCPM4, a highly efficient large language model (LLM) designed explicitly for end-side devices. We achieve this efficiency through systematic innovation in four key dimensions: model architecture, training data, training algorithms, and inference systems. Specifically, in terms of model architecture, we propose InfLLM v2, a trainable sparse attention mechanism that accelera… ▽ More

    Submitted 9 June, 2025; originally announced June 2025.

    Comments: MiniCPM4 Technical Report

  6. arXiv:2506.07851  [pdf, ps, other

    cs.CL

    Learning to Focus: Causal Attention Distillation via Gradient-Guided Token Pruning

    Authors: Yiju Guo, Wenkai Yang, Zexu Sun, Ning Ding, Zhiyuan Liu, Yankai Lin

    Abstract: Large language models (LLMs) have demonstrated significant improvements in contextual understanding. However, their ability to attend to truly critical information during long-context reasoning and generation still falls behind the pace. Specifically, our preliminary experiments reveal that certain distracting patterns can misdirect the model's attention during inference, and removing these patter… ▽ More

    Submitted 9 June, 2025; originally announced June 2025.

  7. arXiv:2506.07591  [pdf, ps, other

    cs.AI q-bio.QM

    Automating Exploratory Multiomics Research via Language Models

    Authors: Shang Qu, Ning Ding, Linhai Xie, Yifei Li, Zaoqu Liu, Kaiyan Zhang, Yibai Xiong, Yuxin Zuo, Zhangren Chen, Ermo Hua, Xingtai Lv, Youbang Sun, Yang Li, Dong Li, Fuchu He, Bowen Zhou

    Abstract: This paper introduces PROTEUS, a fully automated system that produces data-driven hypotheses from raw data files. We apply PROTEUS to clinical proteogenomics, a field where effective downstream data analysis and hypothesis proposal is crucial for producing novel discoveries. PROTEUS uses separate modules to simulate different stages of the scientific process, from open-ended data exploration to sp… ▽ More

    Submitted 9 June, 2025; originally announced June 2025.

  8. arXiv:2506.07188  [pdf, ps, other

    cs.CV

    Hierarchical Feature-level Reverse Propagation for Post-Training Neural Networks

    Authors: Ni Ding, Lei He, Shengbo Eben Li, Keqiang Li

    Abstract: End-to-end autonomous driving has emerged as a dominant paradigm, yet its highly entangled black-box models pose significant challenges in terms of interpretability and safety assurance. To improve model transparency and training flexibility, this paper proposes a hierarchical and decoupled post-training framework tailored for pretrained neural networks. By reconstructing intermediate feature maps… ▽ More

    Submitted 8 June, 2025; originally announced June 2025.

    Comments: 13 pages, 7 figures,

    ACM Class: I.2.10

  9. arXiv:2506.00022  [pdf, ps, other

    cs.CL cs.LG physics.ed-ph

    Scaling Physical Reasoning with the PHYSICS Dataset

    Authors: Shenghe Zheng, Qianjia Cheng, Junchi Yao, Mengsong Wu, Haonan He, Ning Ding, Yu Cheng, Shuyue Hu, Lei Bai, Dongzhan Zhou, Ganqu Cui, Peng Ye

    Abstract: Large Language Models (LLMs) have achieved remarkable progress on advanced reasoning tasks such as mathematics and coding competitions. Meanwhile, physics, despite being both reasoning-intensive and essential to real-world understanding, received limited academic and industrial attention. This paper introduces PHYSICS, a dataset containing 16,568 high-quality physics problems spanning subjects and… ▽ More

    Submitted 2 June, 2025; v1 submitted 21 May, 2025; originally announced June 2025.

    Comments: Work on physical datasets

  10. arXiv:2505.22617  [pdf, other

    cs.LG cs.AI cs.CL

    The Entropy Mechanism of Reinforcement Learning for Reasoning Language Models

    Authors: Ganqu Cui, Yuchen Zhang, Jiacheng Chen, Lifan Yuan, Zhi Wang, Yuxin Zuo, Haozhan Li, Yuchen Fan, Huayu Chen, Weize Chen, Zhiyuan Liu, Hao Peng, Lei Bai, Wanli Ouyang, Yu Cheng, Bowen Zhou, Ning Ding

    Abstract: This paper aims to overcome a major obstacle in scaling RL for reasoning with LLMs, namely the collapse of policy entropy. Such phenomenon is consistently observed across vast RL runs without entropy intervention, where the policy entropy dropped sharply at the early training stage, this diminished exploratory ability is always accompanied with the saturation of policy performance. In practice, we… ▽ More

    Submitted 28 May, 2025; originally announced May 2025.

  11. arXiv:2505.22407  [pdf, ps, other

    cs.CV

    Self-Reflective Reinforcement Learning for Diffusion-based Image Reasoning Generation

    Authors: Jiadong Pan, Zhiyuan Ma, Kaiyan Zhang, Ning Ding, Bowen Zhou

    Abstract: Diffusion models have recently demonstrated exceptional performance in image generation task. However, existing image generation methods still significantly suffer from the dilemma of image reasoning, especially in logic-centered image generation tasks. Inspired by the success of Chain of Thought (CoT) and Reinforcement Learning (RL) in LLMs, we propose SRRL, a self-reflective RL algorithm for dif… ▽ More

    Submitted 28 May, 2025; originally announced May 2025.

  12. arXiv:2505.19217  [pdf, other

    cs.CL

    The Overthinker's DIET: Cutting Token Calories with DIfficulty-AwarE Training

    Authors: Weize Chen, Jiarui Yuan, Tailin Jin, Ning Ding, Huimin Chen, Zhiyuan Liu, Maosong Sun

    Abstract: Recent large language models (LLMs) exhibit impressive reasoning but often over-think, generating excessively long responses that hinder efficiency. We introduce DIET ( DIfficulty-AwarE Training), a framework that systematically cuts these "token calories" by integrating on-the-fly problem difficulty into the reinforcement learning (RL) process. DIET dynamically adapts token compression strategies… ▽ More

    Submitted 25 May, 2025; originally announced May 2025.

    Comments: under review

  13. arXiv:2505.18783  [pdf, ps, other

    cs.LG cs.AI

    Soft Weighted Machine Unlearning

    Authors: Xinbao Qiao, Ningning Ding, Yushi Cheng, Meng Zhang

    Abstract: Machine unlearning, as a post-hoc processing technique, has gained widespread adoption in addressing challenges like bias mitigation and robustness enhancement, colloquially, machine unlearning for fairness and robustness. However, existing non-privacy unlearning-based solutions persist in using binary data removal framework designed for privacy-driven motivation, leading to significant informatio… ▽ More

    Submitted 24 May, 2025; originally announced May 2025.

    Comments: 24 pages,22 figures

  14. arXiv:2505.14190  [pdf, ps, other

    cs.LG cs.AI

    $α$-GAN by Rényi Cross Entropy

    Authors: Ni Ding, Miao Qiao, Jiaxing Xu, Yiping Ke, Xiaoyu Zhang

    Abstract: This paper proposes $α$-GAN, a generative adversarial network using Rényi measures. The value function is formulated, by Rényi cross entropy, as an expected certainty measure incurred by the discriminator's soft decision as to where the sample is from, true population or the generator. The discriminator tries to maximize the Rényi certainty about sample source, while the generator wants to reduce… ▽ More

    Submitted 20 May, 2025; originally announced May 2025.

  15. arXiv:2505.05108  [pdf, ps, other

    cs.AI cs.MA

    Multi-agent Embodied AI: Advances and Future Directions

    Authors: Zhaohan Feng, Ruiqi Xue, Lei Yuan, Yang Yu, Ning Ding, Meiqin Liu, Bingzhao Gao, Jian Sun, Xinhu Zheng, Gang Wang

    Abstract: Embodied artificial intelligence (Embodied AI) plays a pivotal role in the application of advanced technologies in the intelligent era, where AI systems are integrated with physical bodies that enable them to perceive, reason, and interact with their environments. Through the use of sensors for input and actuators for action, these systems can learn and adapt based on real-world feedback, allowing… ▽ More

    Submitted 21 June, 2025; v1 submitted 8 May, 2025; originally announced May 2025.

  16. arXiv:2504.16084  [pdf, ps, other

    cs.CL cs.LG

    TTRL: Test-Time Reinforcement Learning

    Authors: Yuxin Zuo, Kaiyan Zhang, Li Sheng, Shang Qu, Ganqu Cui, Xuekai Zhu, Haozhan Li, Yuchen Zhang, Xinwei Long, Ermo Hua, Biqing Qi, Youbang Sun, Zhiyuan Ma, Lifan Yuan, Ning Ding, Bowen Zhou

    Abstract: This paper investigates Reinforcement Learning (RL) on data without explicit labels for reasoning tasks in Large Language Models (LLMs). The core challenge of the problem is reward estimation during inference while not having access to ground-truth information. While this setting appears elusive, we find that common practices in Test-Time Scaling (TTS), such as majority voting, yield surprisingly… ▽ More

    Submitted 30 June, 2025; v1 submitted 22 April, 2025; originally announced April 2025.

  17. arXiv:2504.03612  [pdf, other

    cs.CL

    AIR: A Systematic Analysis of Annotations, Instructions, and Response Pairs in Preference Dataset

    Authors: Bingxiang He, Wenbin Zhang, Jiaxi Song, Cheng Qian, Zixuan Fu, Bowen Sun, Ning Ding, Haiwen Hong, Longtao Huang, Hui Xue, Ganqu Cui, Wanxiang Che, Zhiyuan Liu, Maosong Sun

    Abstract: Preference learning is critical for aligning large language models (LLMs) with human values, yet its success hinges on high-quality datasets comprising three core components: Preference \textbf{A}nnotations, \textbf{I}nstructions, and \textbf{R}esponse Pairs. Current approaches conflate these components, obscuring their individual impacts and hindering systematic optimization. In this work, we pro… ▽ More

    Submitted 4 April, 2025; originally announced April 2025.

    Comments: 29 pages, 11 figures

  18. arXiv:2503.18282  [pdf, other

    cs.CV

    TrackID3x3: A Dataset and Algorithm for Multi-Player Tracking with Identification and Pose Estimation in 3x3 Basketball Full-court Videos

    Authors: Kazuhiro Yamada, Li Yin, Qingrui Hu, Ning Ding, Shunsuke Iwashita, Jun Ichikawa, Kiwamu Kotani, Calvin Yeung, Keisuke Fujii

    Abstract: Multi-object tracking, player identification, and pose estimation are fundamental components of sports analytics, essential for analyzing player movements, performance, and tactical strategies. However, existing datasets and methodologies primarily target mainstream team sports such as soccer and conventional 5-on-5 basketball, often overlooking scenarios involving fixed-camera setups commonly use… ▽ More

    Submitted 23 March, 2025; originally announced March 2025.

  19. arXiv:2503.11224  [pdf, other

    cs.LG cs.AI cs.CL

    Technologies on Effectiveness and Efficiency: A Survey of State Spaces Models

    Authors: Xingtai Lv, Youbang Sun, Kaiyan Zhang, Shang Qu, Xuekai Zhu, Yuchen Fan, Yi Wu, Ermo Hua, Xinwei Long, Ning Ding, Bowen Zhou

    Abstract: State Space Models (SSMs) have emerged as a promising alternative to the popular transformer-based models and have been increasingly gaining attention. Compared to transformers, SSMs excel at tasks with sequential data or longer contexts, demonstrating comparable performances with significant efficiency gains. In this survey, we provide a coherent and systematic overview for SSMs, including their… ▽ More

    Submitted 14 March, 2025; originally announced March 2025.

  20. arXiv:2503.06930  [pdf, other

    cs.CV

    Post-Training Quantization for Diffusion Transformer via Hierarchical Timestep Grouping

    Authors: Ning Ding, Jing Han, Yuchuan Tian, Chao Xu, Kai Han, Yehui Tang

    Abstract: Diffusion Transformer (DiT) has now become the preferred choice for building image generation models due to its great generation capability. Unlike previous convolution-based UNet models, DiT is purely composed of a stack of transformer blocks, which renders DiT excellent in scalability like large language models. However, the growing model size and multi-step sampling paradigm bring about conside… ▽ More

    Submitted 29 March, 2025; v1 submitted 10 March, 2025; originally announced March 2025.

  21. arXiv:2502.20805  [pdf, other

    cs.RO cs.CV

    Towards Semantic 3D Hand-Object Interaction Generation via Functional Text Guidance

    Authors: Yongqi Tian, Xueyu Sun, Haoyuan He, Linji Hao, Ning Ding, Caigui Jiang

    Abstract: Hand-object interaction(HOI) is the fundamental link between human and environment, yet its dexterous and complex pose significantly challenges for gesture control. Despite significant advances in AI and robotics, enabling machines to understand and simulate hand-object interactions, capturing the semantics of functional grasping tasks remains a considerable challenge. While previous work can gene… ▽ More

    Submitted 28 February, 2025; originally announced February 2025.

  22. arXiv:2502.10961  [pdf, other

    cs.LG cs.AI

    Graders should cheat: privileged information enables expert-level automated evaluations

    Authors: Jin Peng Zhou, Sébastien M. R. Arnold, Nan Ding, Kilian Q. Weinberger, Nan Hua, Fei Sha

    Abstract: Auto-evaluating language models (LMs), i.e., using a grader LM to evaluate the candidate LM, is an appealing way to accelerate the evaluation process and the cost associated with it. But this presents a paradox: how can we trust the grader LM, which is presumably weaker than the candidate LM, to assess problems that are beyond the frontier of the capabilities of either model or both? For instance,… ▽ More

    Submitted 15 February, 2025; originally announced February 2025.

  23. arXiv:2502.07358  [pdf, other

    cs.RO

    SymbioSim: Human-in-the-loop Simulation Platform for Bidirectional Continuing Learning in Human-Robot Interaction

    Authors: Haoran Chen, Yiteng Xu, Yiming Ren, Yaoqin Ye, Xinran Li, Ning Ding, Peishan Cong, Ziyi Wang, Bushi Liu, Yuhan Chen, Zhiyang Dou, Xiaokun Leng, Manyi Li, Yuexin Ma, Changhe Tu

    Abstract: The development of intelligent robots seeks to seamlessly integrate them into the human world, providing assistance and companionship in daily life and work, with the ultimate goal of achieving human-robot symbiosis. To realize this vision, robots must continuously learn and evolve through consistent interaction and collaboration with humans, while humans need to gradually develop an understanding… ▽ More

    Submitted 11 February, 2025; originally announced February 2025.

  24. arXiv:2502.04153  [pdf, other

    cs.CL cs.AI

    UltraIF: Advancing Instruction Following from the Wild

    Authors: Kaikai An, Li Sheng, Ganqu Cui, Shuzheng Si, Ning Ding, Yu Cheng, Baobao Chang

    Abstract: Instruction-following made modern large language models (LLMs) helpful assistants. However, the key to taming LLMs on complex instructions remains mysterious, for that there are huge gaps between models trained by open-source community and those trained by leading companies. To bridge the gap, we propose a simple and scalable approach UltraIF for building LLMs that can follow complex instructions… ▽ More

    Submitted 6 February, 2025; originally announced February 2025.

  25. arXiv:2502.02869  [pdf, ps, other

    cs.LG cs.AI

    Towards Large-Scale In-Context Reinforcement Learning by Meta-Training in Randomized Worlds

    Authors: Fan Wang, Pengtao Shao, Yiming Zhang, Bo Yu, Shaoshan Liu, Ning Ding, Yang Cao, Yu Kang, Haifeng Wang

    Abstract: In-Context Reinforcement Learning (ICRL) enables agents to learn automatically and on-the-fly from their interactive experiences. However, a major challenge in scaling up ICRL is the lack of scalable task collections. To address this, we propose the procedurally generated tabular Markov Decision Processes, named AnyMDP. Through a carefully designed randomization process, AnyMDP is capable of gener… ▽ More

    Submitted 30 June, 2025; v1 submitted 4 February, 2025; originally announced February 2025.

    Comments: Preprint

  26. arXiv:2502.01456  [pdf, other

    cs.LG cs.AI cs.CL

    Process Reinforcement through Implicit Rewards

    Authors: Ganqu Cui, Lifan Yuan, Zefan Wang, Hanbin Wang, Wendi Li, Bingxiang He, Yuchen Fan, Tianyu Yu, Qixin Xu, Weize Chen, Jiarui Yuan, Huayu Chen, Kaiyan Zhang, Xingtai Lv, Shuo Wang, Yuan Yao, Xu Han, Hao Peng, Yu Cheng, Zhiyuan Liu, Maosong Sun, Bowen Zhou, Ning Ding

    Abstract: Dense process rewards have proven a more effective alternative to the sparse outcome-level rewards in the inference-time scaling of large language models (LLMs), particularly in tasks requiring complex multi-step reasoning. While dense rewards also offer an appealing choice for the reinforcement learning (RL) of LLMs since their fine-grained rewards have the potential to address some inherent issu… ▽ More

    Submitted 3 February, 2025; originally announced February 2025.

    Comments: 20 pages. Model&Code&Data available at https://github.com/PRIME-RL/PRIME

  27. arXiv:2501.18362  [pdf, ps, other

    cs.AI cs.CL cs.CV cs.LG

    MedXpertQA: Benchmarking Expert-Level Medical Reasoning and Understanding

    Authors: Yuxin Zuo, Shang Qu, Yifei Li, Zhangren Chen, Xuekai Zhu, Ermo Hua, Kaiyan Zhang, Ning Ding, Bowen Zhou

    Abstract: We introduce MedXpertQA, a highly challenging and comprehensive benchmark to evaluate expert-level medical knowledge and advanced reasoning. MedXpertQA includes 4,460 questions spanning 17 specialties and 11 body systems. It includes two subsets, Text for text evaluation and MM for multimodal evaluation. Notably, MM introduces expert-level exam questions with diverse images and rich clinical infor… ▽ More

    Submitted 6 June, 2025; v1 submitted 30 January, 2025; originally announced January 2025.

    Comments: ICML 2025

  28. arXiv:2501.08540  [pdf, other

    cs.CL cs.AI cs.DB

    Knowledge prompt chaining for semantic modeling

    Authors: Ning Pei Ding, Jingge Du, Zaiwen Feng

    Abstract: The task of building semantics for structured data such as CSV, JSON, and XML files is highly relevant in the knowledge representation field. Even though we have a vast of structured data on the internet, mapping them to domain ontologies to build semantics for them is still very challenging as it requires the construction model to understand and learn graph-structured knowledge. Otherwise, the ta… ▽ More

    Submitted 14 January, 2025; originally announced January 2025.

  29. arXiv:2412.17739  [pdf, other

    cs.AI cs.CL

    Fourier Position Embedding: Enhancing Attention's Periodic Extension for Length Generalization

    Authors: Ermo Hua, Che Jiang, Xingtai Lv, Kaiyan Zhang, Ning Ding, Youbang Sun, Biqing Qi, Yuchen Fan, Xuekai Zhu, Bowen Zhou

    Abstract: Extending the context length of Language Models (LMs) by improving Rotary Position Embedding (RoPE) has become a trend. While existing works mainly address RoPE's limitations within attention mechanism, this paper provides an analysis across nearly all parts of LMs, uncovering their adverse effects on length generalization for RoPE-based attention. Using Discrete Signal Processing theory, we show… ▽ More

    Submitted 6 May, 2025; v1 submitted 23 December, 2024; originally announced December 2024.

    Comments: Accepted to ICML 2025

  30. arXiv:2412.14689  [pdf, other

    cs.CL cs.AI cs.LG

    How to Synthesize Text Data without Model Collapse?

    Authors: Xuekai Zhu, Daixuan Cheng, Hengli Li, Kaiyan Zhang, Ermo Hua, Xingtai Lv, Ning Ding, Zhouhan Lin, Zilong Zheng, Bowen Zhou

    Abstract: Model collapse in synthetic data indicates that iterative training on self-generated data leads to a gradual decline in performance. With the proliferation of AI models, synthetic data will fundamentally reshape the web data ecosystem. Future GPT-$\{n\}$ models will inevitably be trained on a blend of synthetic and human-produced data. In this paper, we focus on two questions: what is the impact o… ▽ More

    Submitted 28 May, 2025; v1 submitted 19 December, 2024; originally announced December 2024.

    Comments: Accepted at ICML 2025

  31. arXiv:2412.01981  [pdf, other

    cs.LG cs.CL

    Free Process Rewards without Process Labels

    Authors: Lifan Yuan, Wendi Li, Huayu Chen, Ganqu Cui, Ning Ding, Kaiyan Zhang, Bowen Zhou, Zhiyuan Liu, Hao Peng

    Abstract: Different from its counterpart outcome reward models (ORMs), which evaluate the entire responses, a process reward model (PRM) scores a reasoning trajectory step by step, providing denser and more fine grained rewards. However, training a PRM requires labels annotated at every intermediate step, presenting significant challenges for both manual and automatic data collection. This paper aims to add… ▽ More

    Submitted 2 December, 2024; originally announced December 2024.

    Comments: Models and data are available at: https://github.com/lifan-yuan/ImplicitPRM

  32. arXiv:2411.12992  [pdf, other

    cs.CL

    MemoryFormer: Minimize Transformer Computation by Removing Fully-Connected Layers

    Authors: Ning Ding, Yehui Tang, Haochen Qin, Zhenli Zhou, Chao Xu, Lin Li, Kai Han, Heng Liao, Yunhe Wang

    Abstract: In order to reduce the computational complexity of large language models, great efforts have been made to to improve the efficiency of transformer models such as linear attention and flash-attention. However, the model size and corresponding computational complexity are constantly scaled up in pursuit of higher performance. In this work, we present MemoryFormer, a novel transformer architecture wh… ▽ More

    Submitted 19 November, 2024; originally announced November 2024.

    Comments: NeurIPS2024

  33. arXiv:2411.03743  [pdf, other

    cs.AI q-bio.QM

    Automating Exploratory Proteomics Research via Language Models

    Authors: Ning Ding, Shang Qu, Linhai Xie, Yifei Li, Zaoqu Liu, Kaiyan Zhang, Yibai Xiong, Yuxin Zuo, Zhangren Chen, Ermo Hua, Xingtai Lv, Youbang Sun, Yang Li, Dong Li, Fuchu He, Bowen Zhou

    Abstract: With the development of artificial intelligence, its contribution to science is evolving from simulating a complex problem to automating entire research processes and producing novel discoveries. Achieving this advancement requires both specialized general models grounded in real-world scientific data and iterative, exploratory frameworks that mirror human scientific methodologies. In this paper,… ▽ More

    Submitted 6 November, 2024; originally announced November 2024.

  34. arXiv:2411.02063  [pdf, other

    cs.CL cs.AI cs.LG

    Scalable Efficient Training of Large Language Models with Low-dimensional Projected Attention

    Authors: Xingtai Lv, Ning Ding, Kaiyan Zhang, Ermo Hua, Ganqu Cui, Bowen Zhou

    Abstract: Improving the effectiveness and efficiency of large language models (LLMs) simultaneously is a critical yet challenging research goal. In this paper, we find that low-rank pre-training, normally considered as efficient methods that will compromise performance, can be scalably effective when reduced parameters are precisely targeted. Specifically, applying the low-dimensional module only to the att… ▽ More

    Submitted 4 November, 2024; originally announced November 2024.

    Comments: Accepted to EMNLP 2024 (Main Conference)

  35. arXiv:2410.01945  [pdf, other

    cs.CL

    CALF: Benchmarking Evaluation of LFQA Using Chinese Examinations

    Authors: Yuchen Fan, Xin Zhong, Heng Zhou, Yuchen Zhang, Mingyu Liang, Chengxing Xie, Ermo Hua, Ning Ding, Bowen Zhou

    Abstract: Long-Form Question Answering (LFQA) refers to generating in-depth, paragraph-level responses to open-ended questions. Although lots of LFQA methods are developed, evaluating LFQA effectively and efficiently remains challenging due to its high complexity and cost. Therefore, there is no standard benchmark for LFQA evaluation till now. To address this gap, we make the first attempt by proposing a we… ▽ More

    Submitted 2 October, 2024; originally announced October 2024.

  36. arXiv:2409.14588  [pdf, other

    cs.CV

    Space evaluation based on pitch control using drone video in Ultimate

    Authors: Shunsuke Iwashita, Atom Scott, Rikuhei Umemoto, Ning Ding, Keisuke Fujii

    Abstract: Ultimate is a sport in which teams of seven players compete for points by passing a disc into the end zone. A distinctive aspect of Ultimate is that the player holding the disc is unable to move, underscoring the significance of creating space to receive passes. Despite extensive research into space evaluation in sports such as football and basketball, there is a paucity of information available f… ▽ More

    Submitted 2 September, 2024; originally announced September 2024.

    Comments: 2 pages, 1 figure. Presented at Cascadia Symposium on Statistics in Sport (CASSIS) 2024

  37. arXiv:2407.05666  [pdf, other

    cs.CV

    Enhancing Neural Radiance Fields with Depth and Normal Completion Priors from Sparse Views

    Authors: Jiawei Guo, HungChyun Chou, Ning Ding

    Abstract: Neural Radiance Fields (NeRF) are an advanced technology that creates highly realistic images by learning about scenes through a neural network model. However, NeRF often encounters issues when there are not enough images to work with, leading to problems in accurately rendering views. The main issue is that NeRF lacks sufficient structural details to guide the rendering process accurately. To add… ▽ More

    Submitted 8 July, 2024; originally announced July 2024.

  38. arXiv:2407.04969  [pdf, other

    cs.CL

    EVA-Score: Evaluating Abstractive Long-form Summarization on Informativeness through Extraction and Validation

    Authors: Yuchen Fan, Xin Zhong, Yazhe Wan, Chengsi Wang, Haonan Cheng, Gaoche Wu, Ning Ding, Bowen Zhou

    Abstract: Since LLMs emerged, more attention has been paid to abstractive long-form summarization, where longer input sequences indicate more information contained. Nevertheless, the automatic evaluation of such summaries remains underexplored. The current evaluation metrics for long-form summarization either use similarity-based metrics like ROUGE and BERTScore or LLM-based metrics using appropriate prompt… ▽ More

    Submitted 15 October, 2024; v1 submitted 6 July, 2024; originally announced July 2024.

    Comments: 20 pages

  39. arXiv:2407.00676  [pdf, other

    cs.CV

    Instruct-IPT: All-in-One Image Processing Transformer via Weight Modulation

    Authors: Yuchuan Tian, Jianhong Han, Hanting Chen, Yuanyuan Xi, Ning Ding, Jie Hu, Chao Xu, Yunhe Wang

    Abstract: Due to the unaffordable size and intensive computation costs of low-level vision models, All-in-One models that are designed to address a handful of low-level vision tasks simultaneously have been popular. However, existing All-in-One models are limited in terms of the range of tasks and performance. To overcome these limitations, we propose Instruct-IPT -- an All-in-One Image Processing Transform… ▽ More

    Submitted 16 December, 2024; v1 submitted 30 June, 2024; originally announced July 2024.

    Comments: 14 pages, 5 figures

  40. arXiv:2406.12295  [pdf, other

    cs.CL

    Fast and Slow Generating: An Empirical Study on Large and Small Language Models Collaborative Decoding

    Authors: Kaiyan Zhang, Jianyu Wang, Ning Ding, Biqing Qi, Ermo Hua, Xingtai Lv, Bowen Zhou

    Abstract: Large Language Models (LLMs) exhibit impressive capabilities across various applications but encounter substantial challenges such as high inference latency, considerable training costs, and the generation of hallucinations. Collaborative decoding between large and small language models (SLMs) presents a promising strategy to mitigate these issues through methods including speculative decoding, co… ▽ More

    Submitted 23 October, 2024; v1 submitted 18 June, 2024; originally announced June 2024.

    Comments: update figures and results on Pythia Series

  41. arXiv:2406.11721  [pdf, other

    cs.CL cs.AI cs.LG

    The Right Time Matters: Data Arrangement Affects Zero-Shot Generalization in Instruction Tuning

    Authors: Bingxiang He, Ning Ding, Cheng Qian, Jia Deng, Ganqu Cui, Lifan Yuan, Haiwen Hong, Huan-ang Gao, Longtao Huang, Hui Xue, Huimin Chen, Zhiyuan Liu, Maosong Sun

    Abstract: Understanding alignment techniques begins with comprehending zero-shot generalization brought by instruction tuning, but little of the mechanism has been understood. Existing work has largely been confined to the task level, without considering that tasks are artificially defined and, to LLMs, merely consist of tokens and representations. To bridge this gap, we investigate zero-shot generalization… ▽ More

    Submitted 7 April, 2025; v1 submitted 17 June, 2024; originally announced June 2024.

    Comments: 22 pages, 16 figures

  42. arXiv:2406.03949  [pdf, other

    cs.CL

    UltraMedical: Building Specialized Generalists in Biomedicine

    Authors: Kaiyan Zhang, Sihang Zeng, Ermo Hua, Ning Ding, Zhang-Ren Chen, Zhiyuan Ma, Haoxin Li, Ganqu Cui, Biqing Qi, Xuekai Zhu, Xingtai Lv, Hu Jinfang, Zhiyuan Liu, Bowen Zhou

    Abstract: Large Language Models (LLMs) have demonstrated remarkable capabilities across various domains and are moving towards more specialized areas. Recent advanced proprietary models such as GPT-4 and Gemini have achieved significant advancements in biomedicine, which have also raised privacy and security challenges. The construction of specialized generalists hinges largely on high-quality datasets, enh… ▽ More

    Submitted 29 October, 2024; v1 submitted 6 June, 2024; originally announced June 2024.

    Comments: Camera ready version for NeurIPS 2024 D&B Track

  43. arXiv:2405.18241  [pdf

    cs.CL cs.AI

    Active Use of Latent Constituency Representation in both Humans and Large Language Models

    Authors: Wei Liu, Ming Xiang, Nai Ding

    Abstract: Understanding how sentences are internally represented in the human brain, as well as in large language models (LLMs) such as ChatGPT, is a major challenge for cognitive science. Classic linguistic theories propose that the brain represents a sentence by parsing it into hierarchically organized constituents. In contrast, LLMs do not explicitly parse linguistic constituents and their latent represe… ▽ More

    Submitted 28 May, 2024; originally announced May 2024.

    Comments: 62 pages, 5 figures. Under review

  44. arXiv:2405.11870  [pdf, other

    cs.CL cs.AI

    Intuitive Fine-Tuning: Towards Simplifying Alignment into a Single Process

    Authors: Ermo Hua, Biqing Qi, Kaiyan Zhang, Yue Yu, Ning Ding, Xingtai Lv, Kai Tian, Bowen Zhou

    Abstract: Supervised Fine-Tuning (SFT) and Preference Optimization (PO) are two fundamental processes for enhancing the capabilities of Language Models (LMs) post pre-training, aligning them better with human preferences. Although SFT advances in training efficiency, PO delivers better alignment, thus they are often combined. However, common practices simply apply them sequentially without integrating their… ▽ More

    Submitted 28 May, 2024; v1 submitted 20 May, 2024; originally announced May 2024.

  45. arXiv:2405.05615  [pdf, other

    cs.CV cs.CL cs.LG

    Memory-Space Visual Prompting for Efficient Vision-Language Fine-Tuning

    Authors: Shibo Jie, Yehui Tang, Ning Ding, Zhi-Hong Deng, Kai Han, Yunhe Wang

    Abstract: Current solutions for efficiently constructing large vision-language (VL) models follow a two-step paradigm: projecting the output of pre-trained vision encoders to the input space of pre-trained language models as visual prompts; and then transferring the models to downstream VL tasks via end-to-end parameter-efficient fine-tuning (PEFT). However, this paradigm still exhibits inefficiency since i… ▽ More

    Submitted 9 May, 2024; originally announced May 2024.

    Comments: Accepted to ICML2024

  46. arXiv:2405.00423  [pdf, ps, other

    cs.IT

    $α$-leakage by Rényi Divergence and Sibson Mutual Information

    Authors: Ni Ding, Mohammad Amin Zarrabian, Parastoo Sadeghi

    Abstract: For $\tilde{f}(t) = \exp(\frac{α-1}αt)$, this paper proposes a $\tilde{f}$-mean information gain measure. Rényi divergence is shown to be the maximum $\tilde{f}$-mean information gain incurred at each elementary event $y$ of channel output $Y$ and Sibson mutual information is the $\tilde{f}$-mean of this $Y$-elementary information gain. Both are proposed as $α$-leakage measures, indicating the mos… ▽ More

    Submitted 2 July, 2024; v1 submitted 1 May, 2024; originally announced May 2024.

    Comments: authorship dispute

  47. arXiv:2404.13868  [pdf, other

    cs.CV

    TeamTrack: A Dataset for Multi-Sport Multi-Object Tracking in Full-pitch Videos

    Authors: Atom Scott, Ikuma Uchida, Ning Ding, Rikuhei Umemoto, Rory Bunker, Ren Kobayashi, Takeshi Koyama, Masaki Onishi, Yoshinari Kameda, Keisuke Fujii

    Abstract: Multi-object tracking (MOT) is a critical and challenging task in computer vision, particularly in situations involving objects with similar appearances but diverse movements, as seen in team sports. Current methods, largely reliant on object detection and appearance, often fail to track targets in such complex scenarios accurately. This limitation is further exacerbated by the lack of comprehensi… ▽ More

    Submitted 22 April, 2024; originally announced April 2024.

  48. arXiv:2404.06395  [pdf, other

    cs.CL cs.LG

    MiniCPM: Unveiling the Potential of Small Language Models with Scalable Training Strategies

    Authors: Shengding Hu, Yuge Tu, Xu Han, Chaoqun He, Ganqu Cui, Xiang Long, Zhi Zheng, Yewei Fang, Yuxiang Huang, Weilin Zhao, Xinrong Zhang, Zheng Leng Thai, Kaihuo Zhang, Chongyi Wang, Yuan Yao, Chenyang Zhao, Jie Zhou, Jie Cai, Zhongwu Zhai, Ning Ding, Chao Jia, Guoyang Zeng, Dahai Li, Zhiyuan Liu, Maosong Sun

    Abstract: The burgeoning interest in developing Large Language Models (LLMs) with up to trillion parameters has been met with concerns regarding resource efficiency and practical expense, particularly given the immense cost of experimentation. This scenario underscores the importance of exploring the potential of Small Language Models (SLMs) as a resource-efficient alternative. In this context, we introduce… ▽ More

    Submitted 3 June, 2024; v1 submitted 9 April, 2024; originally announced April 2024.

    Comments: revise according to peer review

  49. arXiv:2404.02078  [pdf, other

    cs.AI cs.CL cs.LG

    Advancing LLM Reasoning Generalists with Preference Trees

    Authors: Lifan Yuan, Ganqu Cui, Hanbin Wang, Ning Ding, Xingyao Wang, Jia Deng, Boji Shan, Huimin Chen, Ruobing Xie, Yankai Lin, Zhenghao Liu, Bowen Zhou, Hao Peng, Zhiyuan Liu, Maosong Sun

    Abstract: We introduce Eurus, a suite of large language models (LLMs) optimized for reasoning. Finetuned from Mistral-7B and CodeLlama-70B, Eurus models achieve state-of-the-art results among open-source models on a diverse set of benchmarks covering mathematics, code generation, and logical reasoning problems. Notably, Eurus-70B beats GPT-3.5 Turbo in reasoning through a comprehensive benchmarking across 1… ▽ More

    Submitted 2 April, 2024; originally announced April 2024.

    Comments: Models and data are available at https://github.com/OpenBMB/Eurus

  50. arXiv:2403.08281  [pdf, other

    cs.CL cs.AI

    Mastering Text, Code and Math Simultaneously via Fusing Highly Specialized Language Models

    Authors: Ning Ding, Yulin Chen, Ganqu Cui, Xingtai Lv, Weilin Zhao, Ruobing Xie, Bowen Zhou, Zhiyuan Liu, Maosong Sun

    Abstract: Underlying data distributions of natural language, programming code, and mathematical symbols vary vastly, presenting a complex challenge for large language models (LLMs) that strive to achieve high performance across all three domains simultaneously. Achieving a very high level of proficiency for an LLM within a specific domain often requires extensive training with relevant corpora, which is typ… ▽ More

    Submitted 26 March, 2024; v1 submitted 13 March, 2024; originally announced March 2024.