Skip to main content

Showing 1–50 of 145 results for author: Ding, N

Searching in archive cs. Search in all archives.
.
  1. arXiv:2405.18241  [pdf

    cs.CL cs.AI

    Active Use of Latent Constituency Representation in both Humans and Large Language Models

    Authors: Wei Liu, Ming Xiang, Nai Ding

    Abstract: Understanding how sentences are internally represented in the human brain, as well as in large language models (LLMs) such as ChatGPT, is a major challenge for cognitive science. Classic linguistic theories propose that the brain represents a sentence by parsing it into hierarchically organized constituents. In contrast, LLMs do not explicitly parse linguistic constituents and their latent represe… ▽ More

    Submitted 28 May, 2024; originally announced May 2024.

    Comments: 62 pages, 5 figures. Under review

  2. arXiv:2405.11870  [pdf, other

    cs.CL cs.AI

    Intuitive Fine-Tuning: Towards Simplifying Alignment into a Single Process

    Authors: Ermo Hua, Biqing Qi, Kaiyan Zhang, Yue Yu, Ning Ding, Xingtai Lv, Kai Tian, Bowen Zhou

    Abstract: Supervised Fine-Tuning (SFT) and Preference Optimization (PO) are two fundamental processes for enhancing the capabilities of Language Models (LMs) post pre-training, aligning them better with human preferences. Although SFT advances in training efficiency, PO delivers better alignment, thus they are often combined. However, common practices simply apply them sequentially without integrating their… ▽ More

    Submitted 28 May, 2024; v1 submitted 20 May, 2024; originally announced May 2024.

  3. arXiv:2405.05615  [pdf, other

    cs.CV cs.CL cs.LG

    Memory-Space Visual Prompting for Efficient Vision-Language Fine-Tuning

    Authors: Shibo Jie, Yehui Tang, Ning Ding, Zhi-Hong Deng, Kai Han, Yunhe Wang

    Abstract: Current solutions for efficiently constructing large vision-language (VL) models follow a two-step paradigm: projecting the output of pre-trained vision encoders to the input space of pre-trained language models as visual prompts; and then transferring the models to downstream VL tasks via end-to-end parameter-efficient fine-tuning (PEFT). However, this paradigm still exhibits inefficiency since i… ▽ More

    Submitted 9 May, 2024; originally announced May 2024.

    Comments: Accepted to ICML2024

  4. arXiv:2405.00423  [pdf, ps, other

    cs.IT

    $α$-leakage by Rényi Divergence and Sibson Mutual Information

    Authors: Ni Ding, Mohammad Amin Zarrabian, Parastoo Sadeghi

    Abstract: For $\tilde{f}(t) = \exp(\frac{α-1}αt)$, this paper proposes a $\tilde{f}$-mean information gain measure. Rényi divergence is shown to be the maximum $\tilde{f}$-mean information gain incurred at each elementary event $y$ of channel output $Y$ and Sibson mutual information is the $\tilde{f}$-mean of this $Y$-elementary information gain. Both are proposed as $α$-leakage measures, indicating the mos… ▽ More

    Submitted 13 May, 2024; v1 submitted 1 May, 2024; originally announced May 2024.

    Comments: This paper was adapted from the original sole author version

  5. arXiv:2404.13868  [pdf, other

    cs.CV

    TeamTrack: A Dataset for Multi-Sport Multi-Object Tracking in Full-pitch Videos

    Authors: Atom Scott, Ikuma Uchida, Ning Ding, Rikuhei Umemoto, Rory Bunker, Ren Kobayashi, Takeshi Koyama, Masaki Onishi, Yoshinari Kameda, Keisuke Fujii

    Abstract: Multi-object tracking (MOT) is a critical and challenging task in computer vision, particularly in situations involving objects with similar appearances but diverse movements, as seen in team sports. Current methods, largely reliant on object detection and appearance, often fail to track targets in such complex scenarios accurately. This limitation is further exacerbated by the lack of comprehensi… ▽ More

    Submitted 22 April, 2024; originally announced April 2024.

  6. arXiv:2404.06395  [pdf, other

    cs.CL cs.LG

    MiniCPM: Unveiling the Potential of Small Language Models with Scalable Training Strategies

    Authors: Shengding Hu, Yuge Tu, Xu Han, Chaoqun He, Ganqu Cui, Xiang Long, Zhi Zheng, Yewei Fang, Yuxiang Huang, Weilin Zhao, Xinrong Zhang, Zheng Leng Thai, Kaihuo Zhang, Chongyi Wang, Yuan Yao, Chenyang Zhao, Jie Zhou, Jie Cai, Zhongwu Zhai, Ning Ding, Chao Jia, Guoyang Zeng, Dahai Li, Zhiyuan Liu, Maosong Sun

    Abstract: The burgeoning interest in developing Large Language Models (LLMs) with up to trillion parameters has been met with concerns regarding resource efficiency and practical expense, particularly given the immense cost of experimentation. This scenario underscores the importance of exploring the potential of Small Language Models (SLMs) as a resource-efficient alternative. In this context, we introduce… ▽ More

    Submitted 22 April, 2024; v1 submitted 9 April, 2024; originally announced April 2024.

    Comments: Enlarge the font size in several figures

  7. arXiv:2404.02078  [pdf, other

    cs.AI cs.CL cs.LG

    Advancing LLM Reasoning Generalists with Preference Trees

    Authors: Lifan Yuan, Ganqu Cui, Hanbin Wang, Ning Ding, Xingyao Wang, Jia Deng, Boji Shan, Huimin Chen, Ruobing Xie, Yankai Lin, Zhenghao Liu, Bowen Zhou, Hao Peng, Zhiyuan Liu, Maosong Sun

    Abstract: We introduce Eurus, a suite of large language models (LLMs) optimized for reasoning. Finetuned from Mistral-7B and CodeLlama-70B, Eurus models achieve state-of-the-art results among open-source models on a diverse set of benchmarks covering mathematics, code generation, and logical reasoning problems. Notably, Eurus-70B beats GPT-3.5 Turbo in reasoning through a comprehensive benchmarking across 1… ▽ More

    Submitted 2 April, 2024; originally announced April 2024.

    Comments: Models and data are available at https://github.com/OpenBMB/Eurus

  8. arXiv:2403.08281  [pdf, other

    cs.CL cs.AI

    Mastering Text, Code and Math Simultaneously via Fusing Highly Specialized Language Models

    Authors: Ning Ding, Yulin Chen, Ganqu Cui, Xingtai Lv, Weilin Zhao, Ruobing Xie, Bowen Zhou, Zhiyuan Liu, Maosong Sun

    Abstract: Underlying data distributions of natural language, programming code, and mathematical symbols vary vastly, presenting a complex challenge for large language models (LLMs) that strive to achieve high performance across all three domains simultaneously. Achieving a very high level of proficiency for an LLM within a specific domain often requires extensive training with relevant corpora, which is typ… ▽ More

    Submitted 26 March, 2024; v1 submitted 13 March, 2024; originally announced March 2024.

  9. arXiv:2403.03129  [pdf, other

    cs.CL

    CoGenesis: A Framework Collaborating Large and Small Language Models for Secure Context-Aware Instruction Following

    Authors: Kaiyan Zhang, Jianyu Wang, Ermo Hua, Biqing Qi, Ning Ding, Bowen Zhou

    Abstract: With the advancement of language models (LMs), their exposure to private data is increasingly inevitable, and their deployment (especially for smaller ones) on personal devices, such as PCs and smartphones, has become a prevailing trend. In contexts laden with user information, enabling models to both safeguard user privacy and execute commands efficiently emerges as an essential research imperati… ▽ More

    Submitted 5 March, 2024; originally announced March 2024.

  10. arXiv:2403.01414  [pdf, other

    cs.CV

    Unsigned Orthogonal Distance Fields: An Accurate Neural Implicit Representation for Diverse 3D Shapes

    Authors: Yujie Lu, Long Wan, Nayu Ding, Yulong Wang, Shuhan Shen, Shen Cai, Lin Gao

    Abstract: Neural implicit representation of geometric shapes has witnessed considerable advancements in recent years. However, common distance field based implicit representations, specifically signed distance field (SDF) for watertight shapes or unsigned distance field (UDF) for arbitrary shapes, routinely suffer from degradation of reconstruction accuracy when converting to explicit surface points and mes… ▽ More

    Submitted 1 April, 2024; v1 submitted 3 March, 2024; originally announced March 2024.

    Comments: accepted by CVPR 2024

  11. arXiv:2402.19085  [pdf, other

    cs.CL cs.AI eess.SY

    Controllable Preference Optimization: Toward Controllable Multi-Objective Alignment

    Authors: Yiju Guo, Ganqu Cui, Lifan Yuan, Ning Ding, Jiexin Wang, Huimin Chen, Bowen Sun, Ruobing Xie, Jie Zhou, Yankai Lin, Zhiyuan Liu, Maosong Sun

    Abstract: Alignment in artificial intelligence pursues the consistency between model responses and human preferences as well as values. In practice, the multifaceted nature of human preferences inadvertently introduces what is known as the "alignment tax" -a compromise where enhancements in alignment within one objective (e.g.,harmlessness) can diminish performance in others (e.g.,helpfulness). However, exi… ▽ More

    Submitted 29 February, 2024; originally announced February 2024.

  12. arXiv:2402.04588  [pdf, other

    cs.CL

    UltraLink: An Open-Source Knowledge-Enhanced Multilingual Supervised Fine-tuning Dataset

    Authors: Haoyu Wang, Shuo Wang, Yukun Yan, Xujia Wang, Zhiyu Yang, Yuzhuang Xu, Zhenghao Liu, Liner Yang, Ning Ding, Xu Han, Zhiyuan Liu, Maosong Sun

    Abstract: Open-source large language models (LLMs) have gained significant strength across diverse fields. Nevertheless, the majority of studies primarily concentrate on English, with only limited exploration into the realm of multilingual abilities. In this work, we therefore construct an open-source multilingual supervised fine-tuning dataset. Different from previous works that simply translate English in… ▽ More

    Submitted 17 February, 2024; v1 submitted 7 February, 2024; originally announced February 2024.

    Comments: Work in Progress

  13. arXiv:2401.15202  [pdf, ps, other

    cs.IT

    A Cross Entropy Interpretation of R{é}nyi Entropy for $α$-leakage

    Authors: Ni Ding, Mohammad Amin Zarrabian, Parastoo Sadeghi

    Abstract: This paper proposes an $α$-leakage measure for $α\in[0,\infty)$ by a cross entropy interpretation of R{é}nyi entropy. While Rényi entropy was originally defined as an $f$-mean for $f(t) = \exp((1-α)t)$, we reveal that it is also a $\tilde{f}$-mean cross entropy measure for $\tilde{f}(t) = \exp(\frac{1-α}αt)$. Minimizing this Rényi cross-entropy gives Rényi entropy, by which the prior and posterior… ▽ More

    Submitted 26 January, 2024; originally announced January 2024.

    Comments: 7 pages; 1 figure

  14. arXiv:2401.12391  [pdf, other

    cs.IT cs.CR

    Approximation of Pufferfish Privacy for Gaussian Priors

    Authors: Ni Ding

    Abstract: This paper studies how to approximate pufferfish privacy when the adversary's prior belief of the published data is Gaussian distributed. Using Monge's optimal transport plan, we show that $(ε, δ)$-pufferfish privacy is attained if the additive Laplace noise is calibrated to the differences in mean and variance of the Gaussian distributions conditioned on every discriminative secret pair. A typica… ▽ More

    Submitted 6 May, 2024; v1 submitted 22 January, 2024; originally announced January 2024.

    Comments: 11 pages, 5 figures, accepted journal version

  15. arXiv:2312.01235  [pdf, ps, other

    cs.GT

    Strategic Data Revocation in Federated Unlearning

    Authors: Ningning Ding, Ermin Wei, Randall Berry

    Abstract: By allowing users to erase their data's impact on federated learning models, federated unlearning protects users' right to be forgotten and data privacy. Despite a burgeoning body of research on federated unlearning's technical feasibility, there is a paucity of literature investigating the considerations behind users' requests for data revocation. This paper proposes a non-cooperative game framew… ▽ More

    Submitted 6 December, 2023; v1 submitted 2 December, 2023; originally announced December 2023.

    Comments: Accepted by IEEE International Conference on Computer Communications (INFOCOM), 2024

  16. arXiv:2311.11696  [pdf, other

    cs.CL cs.AI cs.LG

    Sparse Low-rank Adaptation of Pre-trained Language Models

    Authors: Ning Ding, Xingtai Lv, Qiaosen Wang, Yulin Chen, Bowen Zhou, Zhiyuan Liu, Maosong Sun

    Abstract: Fine-tuning pre-trained large language models in a parameter-efficient manner is widely studied for its effectiveness and efficiency. The popular method of low-rank adaptation (LoRA) offers a notable approach, hypothesizing that the adaptation process is intrinsically low-dimensional. Although LoRA has demonstrated commendable performance, it is implemented with a fixed and unalterable intrinsic r… ▽ More

    Submitted 20 November, 2023; originally announced November 2023.

    Comments: Accepted to EMNLP 2023 (Main Conference)

  17. arXiv:2311.09868  [pdf, other

    cs.SE cs.AI

    INTERVENOR: Prompting the Coding Ability of Large Language Models with the Interactive Chain of Repair

    Authors: Hanbin Wang, Zhenghao Liu, Shuo Wang, Ganqu Cui, Ning Ding, Zhiyuan Liu, Ge Yu

    Abstract: This paper introduces INTERVENOR (INTERactiVE chaiN Of Repair), a system designed to emulate the interactive code repair processes observed in humans, encompassing both code diagnosis and code repair. INTERVENOR prompts Large Language Models (LLMs) to play distinct roles during the code repair process, functioning as both a Code Learner and a Code Teacher. Specifically, the Code Learner is tasked… ▽ More

    Submitted 20 February, 2024; v1 submitted 16 November, 2023; originally announced November 2023.

    Comments: 26 pages, 19 figures, 8 tables

  18. arXiv:2310.15477  [pdf, other

    cs.CL

    CRaSh: Clustering, Removing, and Sharing Enhance Fine-tuning without Full Large Language Model

    Authors: Kaiyan Zhang, Ning Ding, Biqing Qi, Xuekai Zhu, Xinwei Long, Bowen Zhou

    Abstract: Instruction tuning has recently been recognized as an effective way of aligning Large Language Models (LLMs) to enhance their generalization ability across various tasks. However, when tuning publicly accessible, centralized LLMs with private instruction data, privacy concerns are inevitable. While direct transfer of parameterized modules between models is a plausible approach to address this, its… ▽ More

    Submitted 23 October, 2023; originally announced October 2023.

    Comments: Accepted to EMNLP 2023 (Main Conference)

  19. arXiv:2310.11158  [pdf, other

    cs.CL cs.LG

    Probing the Creativity of Large Language Models: Can models produce divergent semantic association?

    Authors: Honghua Chen, Nai Ding

    Abstract: Large language models possess remarkable capacity for processing language, but it remains unclear whether these models can further generate creative content. The present study aims to investigate the creative thinking of large language models through a cognitive perspective. We utilize the divergent association task (DAT), an objective measurement of creativity that asks models to generate unrelat… ▽ More

    Submitted 17 October, 2023; originally announced October 2023.

    Comments: Accepted for publication in Findings of EMNLP 2023

  20. arXiv:2310.03750  [pdf

    eess.SP cond-mat.mtrl-sci cs.LG physics.app-ph

    Health diagnosis and recuperation of aged Li-ion batteries with data analytics and equivalent circuit modeling

    Authors: Riko I Made, Jing Lin, Jintao Zhang, Yu Zhang, Lionel C. H. Moh, Zhaolin Liu, Ning Ding, Sing Yang Chiam, Edwin Khoo, Xuesong Yin, Guangyuan Wesley Zheng

    Abstract: Battery health assessment and recuperation play a crucial role in the utilization of second-life Li-ion batteries. However, due to ambiguous aging mechanisms and lack of correlations between the recovery effects and operational states, it is challenging to accurately estimate battery health and devise a clear strategy for cell rejuvenation. This paper presents aging and reconditioning experiments… ▽ More

    Submitted 21 September, 2023; originally announced October 2023.

    Comments: 20 pages, 5 figures, 1 table

    Journal ref: iScience (2024)

  21. arXiv:2310.03262  [pdf, other

    cs.CL

    Predicting Emergent Abilities with Infinite Resolution Evaluation

    Authors: Shengding Hu, Xin Liu, Xu Han, Xinrong Zhang, Chaoqun He, Weilin Zhao, Yankai Lin, Ning Ding, Zebin Ou, Guoyang Zeng, Zhiyuan Liu, Maosong Sun

    Abstract: The scientific scale-up of large language models (LLMs) necessitates a comprehensive understanding of their scaling properties. However, the existing literature on the scaling properties only yields an incomplete answer: optimization loss decreases predictably as the model size increases, in line with established scaling law; yet no scaling law for task has been established and the task performanc… ▽ More

    Submitted 17 April, 2024; v1 submitted 4 October, 2023; originally announced October 2023.

    Comments: After revision

  22. arXiv:2310.01377  [pdf, other

    cs.CL cs.AI cs.LG

    UltraFeedback: Boosting Language Models with High-quality Feedback

    Authors: Ganqu Cui, Lifan Yuan, Ning Ding, Guanming Yao, Wei Zhu, Yuan Ni, Guotong Xie, Zhiyuan Liu, Maosong Sun

    Abstract: Reinforcement learning from human feedback (RLHF) has become a pivot technique in aligning large language models (LLMs) with human preferences. In RLHF practice, preference data plays a crucial role in bridging human proclivity and LLMs. However, the scarcity of diverse, naturalistic datasets of human preferences on LLM outputs at scale poses a great challenge to RLHF as well as feedback learning… ▽ More

    Submitted 2 October, 2023; originally announced October 2023.

  23. arXiv:2309.16712  [pdf, other

    cs.NI cs.GT

    Joint Participation Incentive and Network Pricing Design for Federated Learning

    Authors: Ningning Ding, Lin Gao, Jianwei Huang

    Abstract: Federated learning protects users' data privacy through sharing users' local model parameters (instead of raw data) with a server. However, when massive users train a large machine learning model through federated learning, the dynamically varying and often heavy communication overhead can put significant pressure on the network operator. The operator may choose to dynamically change the network p… ▽ More

    Submitted 17 August, 2023; originally announced September 2023.

    Journal ref: IEEE International Conference on Computer Communications (INFOCOM), 2023

  24. arXiv:2309.08564  [pdf

    cs.CV

    The Impact of Different Backbone Architecture on Autonomous Vehicle Dataset

    Authors: Ning Ding, Azim Eskandarian

    Abstract: Object detection is a crucial component of autonomous driving, and many detection applications have been developed to address this task. These applications often rely on backbone architectures, which extract representation features from inputs to perform the object detection task. The quality of the features extracted by the backbone architecture can have a significant impact on the overall detect… ▽ More

    Submitted 15 September, 2023; originally announced September 2023.

    Comments: This paper has been accepted by IMECE2023

  25. arXiv:2309.08112  [pdf, other

    cs.HC cs.AI cs.CL

    Empowering Private Tutoring by Chaining Large Language Models

    Authors: Yulin Chen, Ning Ding, Hai-Tao Zheng, Zhiyuan Liu, Maosong Sun, Bowen Zhou

    Abstract: Artificial intelligence has been applied in various aspects of online education to facilitate teaching and learning. However, few approaches has been made toward a complete AI-powered tutoring system. In this work, we explore the development of a full-fledged intelligent tutoring system powered by state-of-the-art large language models (LLMs), covering automatic course planning and adjusting, tail… ▽ More

    Submitted 14 September, 2023; originally announced September 2023.

  26. arXiv:2309.01183  [pdf, other

    cs.CV

    Holistic Dynamic Frequency Transformer for Image Fusion and Exposure Correction

    Authors: Xiaoke Shang, Gehui Li, Zhiying Jiang, Shaomin Zhang, Nai Ding, Jinyuan Liu

    Abstract: The correction of exposure-related issues is a pivotal component in enhancing the quality of images, offering substantial implications for various computer vision tasks. Historically, most methodologies have predominantly utilized spatial domain recovery, offering limited consideration to the potentialities of the frequency domain. Additionally, there has been a lack of a unified perspective towar… ▽ More

    Submitted 3 September, 2023; originally announced September 2023.

  27. arXiv:2308.16320  [pdf, other

    cs.GT

    Information Disclosure under Competition in Sharing Systems

    Authors: Ningning Ding, Zhixuan Fang, Jianwei Huang

    Abstract: Sharing systems have facilitated the redistribution of underused resources by providing convenient online marketplaces for individual sellers and buyers. However, sellers in these systems may not fully disclose the information of their shared commodities, due to strategic behaviors or privacy concerns. Sellers' strategic information disclosure significantly affects buyers' user experiences and sys… ▽ More

    Submitted 30 August, 2023; originally announced August 2023.

  28. arXiv:2308.12502  [pdf, other

    cs.GT

    Incentivized Federated Learning and Unlearning

    Authors: Ningning Ding, Zhenyu Sun, Ermin Wei, Randall Berry

    Abstract: To protect users' right to be forgotten in federated learning, federated unlearning aims at eliminating the impact of leaving users' data on the global learned model. The current research in federated unlearning mainly concentrated on developing effective and efficient unlearning techniques. However, the issue of incentivizing valuable users to remain engaged and preventing their data from being u… ▽ More

    Submitted 1 December, 2023; v1 submitted 23 August, 2023; originally announced August 2023.

  29. arXiv:2308.09735  [pdf, other

    cs.LG

    CTP:A Causal Interpretable Model for Non-Communicable Disease Progression Prediction

    Authors: Zhoujian Sun, Wenzhuo Zhang, Zhengxing Huang, Nai Ding, Cheng Luo

    Abstract: Non-communicable disease is the leading cause of death, emphasizing the need for accurate prediction of disease progression and informed clinical decision-making. Machine learning (ML) models have shown promise in this domain by capturing non-linear patterns within patient features. However, existing ML-based models cannot provide causal interpretable predictions and estimate treatment effects, li… ▽ More

    Submitted 22 September, 2023; v1 submitted 18 August, 2023; originally announced August 2023.

    Comments: 25 pages, 5 figures, 12 tables

  30. arXiv:2308.08488  [pdf, other

    cs.CL cs.AI cs.SD eess.AS

    Improving Audio-Visual Speech Recognition by Lip-Subword Correlation Based Visual Pre-training and Cross-Modal Fusion Encoder

    Authors: Yusheng Dai, Hang Chen, Jun Du, Xiaofei Ding, Ning Ding, Feijun Jiang, Chin-Hui Lee

    Abstract: In recent research, slight performance improvement is observed from automatic speech recognition systems to audio-visual speech recognition systems in the end-to-end framework with low-quality videos. Unmatching convergence rates and specialized input representations between audio and visual modalities are considered to cause the problem. In this paper, we propose two novel techniques to improve a… ▽ More

    Submitted 8 March, 2024; v1 submitted 14 August, 2023; originally announced August 2023.

    Comments: 6 pages, 2 figures, published in ICME2023

  31. arXiv:2308.06912  [pdf, other

    cs.LG cs.CL

    CausalLM is not optimal for in-context learning

    Authors: Nan Ding, Tomer Levinboim, Jialin Wu, Sebastian Goodman, Radu Soricut

    Abstract: Recent empirical evidence indicates that transformer based in-context learning performs better when using a prefix language model (prefixLM), in which in-context samples can all attend to each other, compared to causal language models (causalLM), which use auto-regressive attention that prohibits in-context samples to attend to future samples. While this result is intuitive, it is not understood f… ▽ More

    Submitted 20 February, 2024; v1 submitted 13 August, 2023; originally announced August 2023.

    Comments: ICLR 2024 conference paper. Code available at: https://github.com/google-research/causallm_icl

  32. arXiv:2307.03084  [pdf, other

    cs.LG cs.AI cs.CL

    OpenDelta: A Plug-and-play Library for Parameter-efficient Adaptation of Pre-trained Models

    Authors: Shengding Hu, Ning Ding, Weilin Zhao, Xingtai Lv, Zhen Zhang, Zhiyuan Liu, Maosong Sun

    Abstract: The scale of large pre-trained models (PTMs) poses significant challenges in adapting to downstream tasks due to the high optimization overhead and storage costs associated with full-parameter fine-tuning. To address this, many studies explore parameter-efficient tuning methods, also framed as "delta tuning", which updates only a small subset of parameters, known as "delta modules", while keeping… ▽ More

    Submitted 5 July, 2023; originally announced July 2023.

    Comments: Accepted to ACL 2023 Demo track

  33. arXiv:2306.09296  [pdf, other

    cs.CL

    KoLA: Carefully Benchmarking World Knowledge of Large Language Models

    Authors: Jifan Yu, Xiaozhi Wang, Shangqing Tu, Shulin Cao, Daniel Zhang-Li, Xin Lv, Hao Peng, Zijun Yao, Xiaohan Zhang, Hanming Li, Chunyang Li, Zheyuan Zhang, Yushi Bai, Yantao Liu, Amy Xin, Nianyi Lin, Kaifeng Yun, Linlu Gong, Jianhui Chen, Zhili Wu, Yunjia Qi, Weikai Li, Yong Guan, Kaisheng Zeng, Ji Qi , et al. (10 additional authors not shown)

    Abstract: The unprecedented performance of large language models (LLMs) necessitates improvements in evaluations. Rather than merely exploring the breadth of LLM abilities, we believe meticulous and thoughtful designs are essential to thorough, unbiased, and applicable evaluations. Given the importance of world knowledge to LLMs, we construct a Knowledge-oriented LLM Assessment benchmark (KoLA), in which we… ▽ More

    Submitted 6 July, 2023; v1 submitted 15 June, 2023; originally announced June 2023.

  34. arXiv:2306.04014  [pdf, other

    cs.DC

    Evaluating the Potential of Disaggregated Memory Systems for HPC applications

    Authors: Nan Ding, Pieter Maris, Hai Ah Nam, Taylor Groves, Muaaz Gul Awan, LeAnn Lindsey, Christopher Daley, Oguz Selvitopi, Leonid Oliker, Nicholas Wright, Samuel Williams

    Abstract: Disaggregated memory is a promising approach that addresses the limitations of traditional memory architectures by enabling memory to be decoupled from compute nodes and shared across a data center. Cloud platforms have deployed such systems to improve overall system memory utilization, but performance can vary across workloads. High-performance computing (HPC) is crucial in scientific and enginee… ▽ More

    Submitted 16 June, 2023; v1 submitted 6 June, 2023; originally announced June 2023.

    Comments: The submission builds on the following conference paper: N. Ding, S. Williams, H.A. Nam, et al. Methodology for Evaluating the Potential of Disaggregated Memory Systems,2nd International Workshop on RESource DISaggregation in High-Performance Computing (RESDIS), November 18, 2022. It is now submitted to the CCPE journal for review

  35. Exploring the Impact of Model Scaling on Parameter-Efficient Tuning

    Authors: Yusheng Su, Chi-Min Chan, Jiali Cheng, Yujia Qin, Yankai Lin, Shengding Hu, Zonghan Yang, Ning Ding, Xingzhi Sun, Guotong Xie, Zhiyuan Liu, Maosong Sun

    Abstract: Parameter-efficient tuning (PET) methods can effectively drive extremely large pre-trained language models (PLMs) by training only minimal parameters. Different PET methods utilize different manually designed tunable modules. In small PLMs, there are usually noticeable performance differences among PET methods. Nevertheless, as the model scale increases, the performance differences become marginal… ▽ More

    Submitted 10 December, 2023; v1 submitted 4 June, 2023; originally announced June 2023.

  36. arXiv:2306.00693  [pdf, other

    cs.CV

    GPT4Image: Can Large Pre-trained Models Help Vision Models on Perception Tasks?

    Authors: Ning Ding, Yehui Tang, Zhongqian Fu, Chao Xu, Kai Han, Yunhe Wang

    Abstract: The recent upsurge in pre-trained large models (e.g. GPT-4) has swept across the entire deep learning community. Such powerful large language models (LLMs) demonstrate advanced generative ability and multimodal understanding capability, which quickly achieve new state-of-the-art performances on a variety of benchmarks. The pre-trained LLM usually plays the role as a universal AI model that can con… ▽ More

    Submitted 7 June, 2023; v1 submitted 1 June, 2023; originally announced June 2023.

    Comments: GitHub: https://github.com/huawei-noah/Efficient-Computing/tree/master/GPT4Image/

  37. arXiv:2305.19500  [pdf, other

    cs.CL

    Exploring Lottery Prompts for Pre-trained Language Models

    Authors: Yulin Chen, Ning Ding, Xiaobin Wang, Shengding Hu, Hai-Tao Zheng, Zhiyuan Liu, Pengjun Xie

    Abstract: Consistently scaling pre-trained language models (PLMs) imposes substantial burdens on model adaptation, necessitating more efficient alternatives to conventional fine-tuning. Given the advantage of prompting in the zero-shot setting and the observed performance fluctuation among different prompts, we explore the instance-level prompt and their generalizability. By searching through the prompt spa… ▽ More

    Submitted 30 May, 2023; originally announced May 2023.

    Comments: Accepted to ACL 2023

  38. arXiv:2305.14233  [pdf, other

    cs.CL cs.AI

    Enhancing Chat Language Models by Scaling High-quality Instructional Conversations

    Authors: Ning Ding, Yulin Chen, Bokai Xu, Yujia Qin, Zhi Zheng, Shengding Hu, Zhiyuan Liu, Maosong Sun, Bowen Zhou

    Abstract: Fine-tuning on instruction data has been widely validated as an effective practice for implementing chat language models like ChatGPT. Scaling the diversity and quality of such data, although straightforward, stands a great chance of leading to improved performance. This paper aims to improve the upper bound of open-source models further. We first provide a systematically designed, diverse, inform… ▽ More

    Submitted 23 May, 2023; originally announced May 2023.

  39. arXiv:2305.07152  [pdf, other

    cs.CV

    Surgical tool classification and localization: results and methods from the MICCAI 2022 SurgToolLoc challenge

    Authors: Aneeq Zia, Kiran Bhattacharyya, Xi Liu, Max Berniker, Ziheng Wang, Rogerio Nespolo, Satoshi Kondo, Satoshi Kasai, Kousuke Hirasawa, Bo Liu, David Austin, Yiheng Wang, Michal Futrega, Jean-Francois Puget, Zhenqiang Li, Yoichi Sato, Ryo Fujii, Ryo Hachiuma, Mana Masuda, Hideo Saito, An Wang, Mengya Xu, Mobarakol Islam, Long Bai, Winnie Pang , et al. (46 additional authors not shown)

    Abstract: The ability to automatically detect and track surgical instruments in endoscopic videos can enable transformational interventions. Assessing surgical performance and efficiency, identifying skilled tool use and choreography, and planning operational and logistical aspects of OR resources are just a few of the applications that could benefit. Unfortunately, obtaining the annotations needed to train… ▽ More

    Submitted 31 May, 2023; v1 submitted 11 May, 2023; originally announced May 2023.

  40. arXiv:2305.06940  [pdf, other

    cs.CV

    SalienDet: A Saliency-based Feature Enhancement Algorithm for Object Detection for Autonomous Driving

    Authors: Ning Ding, Ce Zhang, Azim Eskandarian

    Abstract: Object detection (OD) is crucial to autonomous driving. On the other hand, unknown objects, which have not been seen in training sample set, are one of the reasons that hinder autonomous vehicles from driving beyond the operational domain. To addresss this issue, we propose a saliency-based OD algorithm (SalienDet) to detect unknown objects. Our SalienDet utilizes a saliency-based algorithm to enh… ▽ More

    Submitted 15 June, 2023; v1 submitted 11 May, 2023; originally announced May 2023.

    Comments: This paper is accepted and being published at IEEE Transactions on Intelligent Vehicles

  41. arXiv:2305.06849  [pdf, other

    cs.CL cs.AI cs.IR

    WebCPM: Interactive Web Search for Chinese Long-form Question Answering

    Authors: Yujia Qin, Zihan Cai, Dian Jin, Lan Yan, Shihao Liang, Kunlun Zhu, Yankai Lin, Xu Han, Ning Ding, Huadong Wang, Ruobing Xie, Fanchao Qi, Zhiyuan Liu, Maosong Sun, Jie Zhou

    Abstract: Long-form question answering (LFQA) aims at answering complex, open-ended questions with detailed, paragraph-length responses. The de facto paradigm of LFQA necessitates two procedures: information retrieval, which searches for relevant supporting facts, and information synthesis, which integrates these facts into a coherent answer. In this paper, we introduce WebCPM, the first Chinese LFQA datase… ▽ More

    Submitted 23 May, 2023; v1 submitted 11 May, 2023; originally announced May 2023.

    Comments: ACL 2023, main conference

  42. Estimation of control area in badminton doubles with pose information from top and back view drone videos

    Authors: Ning Ding, Kazuya Takeda, Wenhui Jin, Yingjiu Bei, Keisuke Fujii

    Abstract: The application of visual tracking to the performance analysis of sports players in dynamic competitions is vital for effective coaching. In doubles matches, coordinated positioning is crucial for maintaining control of the court and minimizing opponents' scoring opportunities. The analysis of such teamwork plays a vital role in understanding the dynamics of the game. However, previous studies hav… ▽ More

    Submitted 26 October, 2023; v1 submitted 7 May, 2023; originally announced May 2023.

    Comments: 15 pages, 10 figures, to appear in Multimedia Tools and Applications

    Journal ref: Multimedia Tools and Applications (2023)

  43. arXiv:2305.03987  [pdf, other

    cs.CL cs.AI

    Replicating Complex Dialogue Policy of Humans via Offline Imitation Learning with Supervised Regularization

    Authors: Zhoujian Sun, Chenyang Zhao, Zhengxing Huang, Nai Ding

    Abstract: Policy learning (PL) is a module of a task-oriented dialogue system that trains an agent to make actions in each dialogue turn. Imitating human action is a fundamental problem of PL. However, both supervised learning (SL) and reinforcement learning (RL) frameworks cannot imitate humans well. Training RL models require online interactions with user simulators, while simulating complex human policy… ▽ More

    Submitted 6 May, 2023; originally announced May 2023.

  44. arXiv:2304.08354  [pdf, other

    cs.CL cs.AI cs.LG

    Tool Learning with Foundation Models

    Authors: Yujia Qin, Shengding Hu, Yankai Lin, Weize Chen, Ning Ding, Ganqu Cui, Zheni Zeng, Yufei Huang, Chaojun Xiao, Chi Han, Yi Ren Fung, Yusheng Su, Huadong Wang, Cheng Qian, Runchu Tian, Kunlun Zhu, Shihao Liang, Xingyu Shen, Bokai Xu, Zhen Zhang, Yining Ye, Bowen Li, Ziwei Tang, Jing Yi, Yuzhang Zhu , et al. (16 additional authors not shown)

    Abstract: Humans possess an extraordinary ability to create and utilize tools, allowing them to overcome physical limitations and explore new frontiers. With the advent of foundation models, AI systems have the potential to be equally adept in tool use as humans. This paradigm, i.e., tool learning with foundation models, combines the strengths of specialized tools and foundation models to achieve enhanced a… ▽ More

    Submitted 15 June, 2023; v1 submitted 17 April, 2023; originally announced April 2023.

  45. arXiv:2303.15840  [pdf

    cs.CV

    Sparse Depth-Guided Attention for Accurate Depth Completion: A Stereo-Assisted Monitored Distillation Approach

    Authors: Jia-Wei Guo, Hung-Chyun Chou, Sen-Hua Zhu, Chang-Zheng Zhang, Ming Ouyang, Ning Ding

    Abstract: This paper proposes a novel method for depth completion, which leverages multi-view improved monitored distillation to generate more precise depth maps. Our approach builds upon the state-of-the-art ensemble distillation method, in which we introduce a stereo-based model as a teacher model to improve the accuracy of the student model for depth completion. By minimizing the reconstruction error of… ▽ More

    Submitted 3 September, 2023; v1 submitted 28 March, 2023; originally announced March 2023.

    Comments: 7 pages, 8 figures, references added

  46. arXiv:2303.08322  [pdf, other

    cs.LG cs.AI cs.DC cs.GT cs.NI

    Optimization Design for Federated Learning in Heterogeneous 6G Networks

    Authors: Bing Luo, Xiaomin Ouyang, Peng Sun, Pengchao Han, Ningning Ding, Jianwei Huang

    Abstract: With the rapid advancement of 5G networks, billions of smart Internet of Things (IoT) devices along with an enormous amount of data are generated at the network edge. While still at an early age, it is expected that the evolving 6G network will adopt advanced artificial intelligence (AI) technologies to collect, transmit, and learn this valuable data for innovative applications and intelligent ser… ▽ More

    Submitted 14 March, 2023; originally announced March 2023.

    Comments: Accepted in IEEE Nework

  47. On the Lift, Related Privacy Measures, and Applications to Privacy-Utility Tradeoffs

    Authors: Mohammad Amin Zarrabian, Ni Ding, Parastoo Sadeghi

    Abstract: This paper investigates lift, the likelihood ratio between the posterior and prior belief about sensitive features in a dataset. Maximum and minimum lifts over sensitive features quantify the adversary's knowledge gain and should be bounded to protect privacy. We demonstrate that max and min lifts have a distinct range of values and probability of appearance in the dataset, referred to as \emph{li… ▽ More

    Submitted 2 March, 2023; originally announced March 2023.

  48. arXiv:2302.01034  [pdf, other

    cs.CV

    An Efficient Convex Hull-based Vehicle Pose Estimation Method for 3D LiDAR

    Authors: Ningning Ding

    Abstract: Vehicle pose estimation with LiDAR is essential in the perception technology of autonomous driving. However, due to incomplete observation measurements and sparsity of the LiDAR point cloud, it is challenging to achieve satisfactory pose extraction based on 3D LiDAR with the existing pose estimation methods. In addition, the demand for real-time performance further increases the difficulty of the… ▽ More

    Submitted 6 February, 2024; v1 submitted 2 February, 2023; originally announced February 2023.

  49. arXiv:2301.12739  [pdf, other

    cs.CV

    FractalAD: A simple industrial anomaly detection method using fractal anomaly generation and backbone knowledge distillation

    Authors: Xuan Xia, Weijie Lv, Xing He, Nan Li, Chuanqi Liu, Ning Ding

    Abstract: Although industrial anomaly detection (AD) technology has made significant progress in recent years, generating realistic anomalies and learning priors of normal remain challenging tasks. In this study, we propose an end-to-end industrial anomaly detection method called FractalAD. Training samples are obtained by synthesizing fractal images and patches from normal samples. This fractal anomaly gen… ▽ More

    Submitted 18 September, 2023; v1 submitted 30 January, 2023; originally announced January 2023.

    Comments: 12 pages, 5 figures

  50. arXiv:2301.05898  [pdf

    cs.SD eess.AS q-bio.NC

    Acoustic correlates of the syllabic rhythm of speech: Modulation spectrum or local features of the temporal envelope

    Authors: Yuran Zhang, Jiajie Zou, Nai Ding

    Abstract: The syllable is a perceptually salient unit in speech. Since both the syllable and its acoustic correlate, i.e., the speech envelope, have a preferred range of rhythmicity between 4 and 8 Hz, it is hypothesized that theta-band neural oscillations play a major role in extracting syllables based on the envelope. A literature survey, however, reveals inconsistent evidence about the relationship betwe… ▽ More

    Submitted 14 January, 2023; originally announced January 2023.