Skip to main content

Showing 1–50 of 272 results for author: Shen, W

Searching in archive cs. Search in all archives.
.
  1. arXiv:2404.12152  [pdf, other

    cs.CL

    FecTek: Enhancing Term Weight in Lexicon-Based Retrieval with Feature Context and Term-level Knowledge

    Authors: Zunran Wang, Zhonghua Li, Wei Shen, Qi Ye, Liqiang Nie

    Abstract: Lexicon-based retrieval has gained siginificant popularity in text retrieval due to its efficient and robust performance. To further enhance performance of lexicon-based retrieval, researchers have been diligently incorporating state-of-the-art methodologies like Neural retrieval and text-level contrastive learning approaches. Nonetheless, despite the promising outcomes, current lexicon-based retr… ▽ More

    Submitted 18 April, 2024; originally announced April 2024.

  2. arXiv:2404.11981  [pdf, other

    cs.CV

    Tendency-driven Mutual Exclusivity for Weakly Supervised Incremental Semantic Segmentation

    Authors: Chongjie Si, Xuehui Wang, Xiaokang Yang, Wei Shen

    Abstract: Weakly Incremental Learning for Semantic Segmentation (WILSS) leverages a pre-trained segmentation model to segment new classes using cost-effective and readily available image-level labels. A prevailing way to solve WILSS is the generation of seed areas for each new class, serving as a form of pixel-level supervision. However, a scenario usually arises where a pixel is concurrently predicted as a… ▽ More

    Submitted 19 April, 2024; v1 submitted 18 April, 2024; originally announced April 2024.

  3. arXiv:2404.11355  [pdf, other

    cs.CV

    Consisaug: A Consistency-based Augmentation for Polyp Detection in Endoscopy Image Analysis

    Authors: Ziyu Zhou, Wenyuan Shen, Chang Liu

    Abstract: Colorectal cancer (CRC), which frequently originates from initially benign polyps, remains a significant contributor to global cancer-related mortality. Early and accurate detection of these polyps via colonoscopy is crucial for CRC prevention. However, traditional colonoscopy methods depend heavily on the operator's experience, leading to suboptimal polyp detection rates. Besides, the public data… ▽ More

    Submitted 17 April, 2024; originally announced April 2024.

    Comments: MLMI 2023

  4. arXiv:2404.10573  [pdf, other

    cs.AI cs.CE q-bio.BM

    AAVDiff: Experimental Validation of Enhanced Viability and Diversity in Recombinant Adeno-Associated Virus (AAV) Capsids through Diffusion Generation

    Authors: Lijun Liu, Jiali Yang, Jianfei Song, Xinglin Yang, Lele Niu, Zeqi Cai, Hui Shi, Tingjun Hou, Chang-yu Hsieh, Weiran Shen, Yafeng Deng

    Abstract: Recombinant adeno-associated virus (rAAV) vectors have revolutionized gene therapy, but their broad tropism and suboptimal transduction efficiency limit their clinical applications. To overcome these limitations, researchers have focused on designing and screening capsid libraries to identify improved vectors. However, the large sequence space and limited resources present challenges in identifyin… ▽ More

    Submitted 17 April, 2024; v1 submitted 16 April, 2024; originally announced April 2024.

  5. arXiv:2404.07443  [pdf

    physics.optics cs.ET cs.LG

    1-bit Quantized On-chip Hybrid Diffraction Neural Network Enabled by Authentic All-optical Fully-connected Architecture

    Authors: Yu Shao, Haiqi Gao, Yipeng Chen, Yujie liu, Junren Wen, Haidong He, Yuchuan Shao, Yueguang Zhang, Weidong Shen, Chenying Yang

    Abstract: Optical Diffraction Neural Networks (DNNs), a subset of Optical Neural Networks (ONNs), show promise in mirroring the prowess of electronic networks. This study introduces the Hybrid Diffraction Neural Network (HDNN), a novel architecture that incorporates matrix multiplication into DNNs, synergizing the benefits of conventional ONNs with those of DNNs to surmount the modulation limitations inhere… ▽ More

    Submitted 10 April, 2024; originally announced April 2024.

  6. arXiv:2403.18072  [pdf, other

    stat.CO cs.LG stat.ME stat.ML

    Goal-Oriented Bayesian Optimal Experimental Design for Nonlinear Models using Markov Chain Monte Carlo

    Authors: Shijie Zhong, Wanggang Shen, Tommie Catanach, Xun Huan

    Abstract: Optimal experimental design (OED) provides a systematic approach to quantify and maximize the value of experimental data. Under a Bayesian approach, conventional OED maximizes the expected information gain (EIG) on model parameters. However, we are often interested in not the parameters themselves, but predictive quantities of interest (QoIs) that depend on the parameters in a nonlinear manner. We… ▽ More

    Submitted 26 March, 2024; originally announced March 2024.

    MSC Class: 62K05; 62F15; 62B15

  7. arXiv:2403.16075  [pdf, other

    cs.LG

    IBCB: Efficient Inverse Batched Contextual Bandit for Behavioral Evolution History

    Authors: Yi Xu, Weiran Shen, Xiao Zhang, Jun Xu

    Abstract: Traditional imitation learning focuses on modeling the behavioral mechanisms of experts, which requires a large amount of interaction history generated by some fixed expert. However, in many streaming applications, such as streaming recommender systems, online decision-makers typically engage in online learning during the decision-making process, meaning that the interaction history generated by o… ▽ More

    Submitted 24 March, 2024; originally announced March 2024.

  8. arXiv:2403.15124  [pdf, other

    cs.CV

    EndoGSLAM: Real-Time Dense Reconstruction and Tracking in Endoscopic Surgeries using Gaussian Splatting

    Authors: Kailing Wang, Chen Yang, Yuehao Wang, Sikuang Li, Yan Wang, Qi Dou, Xiaokang Yang, Wei Shen

    Abstract: Precise camera tracking, high-fidelity 3D tissue reconstruction, and real-time online visualization are critical for intrabody medical imaging devices such as endoscopes and capsule robots. However, existing SLAM (Simultaneous Localization and Mapping) methods often struggle to achieve both complete high-quality surgical field reconstruction and efficient computation, restricting their intraoperat… ▽ More

    Submitted 22 March, 2024; originally announced March 2024.

  9. arXiv:2403.13679  [pdf, other

    cs.CL

    RoleInteract: Evaluating the Social Interaction of Role-Playing Agents

    Authors: Hongzhan Chen, Hehong Chen, Ming Yan, Wenshen Xu, Xing Gao, Weizhou Shen, Xiaojun Quan, Chenliang Li, Ji Zhang, Fei Huang, Jingren Zhou

    Abstract: Large language models (LLMs) have advanced the development of various AI conversational agents, including role-playing conversational agents that mimic diverse characters and human behaviors. While prior research has predominantly focused on enhancing the conversational capability, role-specific knowledge, and stylistic attributes of these agents, there has been a noticeable gap in assessing their… ▽ More

    Submitted 21 March, 2024; v1 submitted 20 March, 2024; originally announced March 2024.

  10. arXiv:2403.11083  [pdf, other

    cs.CV cs.CL

    Customizing Visual-Language Foundation Models for Multi-modal Anomaly Detection and Reasoning

    Authors: Xiaohao Xu, Yunkang Cao, Yongqi Chen, Weiming Shen, Xiaonan Huang

    Abstract: Anomaly detection is vital in various industrial scenarios, including the identification of unusual patterns in production lines and the detection of manufacturing defects for quality control. Existing techniques tend to be specialized in individual scenarios and lack generalization capacities. In this study, we aim to develop a generic anomaly detection model applicable across multiple scenarios.… ▽ More

    Submitted 17 March, 2024; originally announced March 2024.

  11. arXiv:2403.07708  [pdf, other

    cs.CL cs.AI

    Improving Reinforcement Learning from Human Feedback Using Contrastive Rewards

    Authors: Wei Shen, Xiaoying Zhang, Yuanshun Yao, Rui Zheng, Hongyi Guo, Yang Liu

    Abstract: Reinforcement learning from human feedback (RLHF) is the mainstream paradigm used to align large language models (LLMs) with human preferences. Yet existing RLHF heavily relies on accurate and informative reward models, which are vulnerable and sensitive to noise from various sources, e.g. human labeling errors, making the pipeline fragile. In this work, we improve the effectiveness of the reward… ▽ More

    Submitted 13 March, 2024; v1 submitted 12 March, 2024; originally announced March 2024.

  12. arXiv:2403.05402  [pdf, other

    cs.CV

    DualBEV: CNN is All You Need in View Transformation

    Authors: Peidong Li, Wancheng Shen, Qihao Huang, Dixiao Cui

    Abstract: Camera-based Bird's-Eye-View (BEV) perception often struggles between adopting 3D-to-2D or 2D-to-3D view transformation (VT). The 3D-to-2D VT typically employs resource intensive Transformer to establish robust correspondences between 3D and 2D feature, while the 2D-to-3D VT utilizes the Lift-Splat-Shoot (LSS) pipeline for real-time application, potentially missing distant information. To address… ▽ More

    Submitted 8 March, 2024; originally announced March 2024.

    Comments: 16 pages, 6 figures, Tech Report

  13. arXiv:2403.05171  [pdf, other

    cs.LG cs.AI

    Overcoming Reward Overoptimization via Adversarial Policy Optimization with Lightweight Uncertainty Estimation

    Authors: Xiaoying Zhang, Jean-Francois Ton, Wei Shen, Hongning Wang, Yang Liu

    Abstract: We introduce Adversarial Policy Optimization (AdvPO), a novel solution to the pervasive issue of reward over-optimization in Reinforcement Learning from Human Feedback (RLHF) for Large Language Models (LLMs). Over-optimization occurs when a reward model serves as an imperfect proxy for human preference, and RL-driven policy optimization erroneously exploits reward inaccuracies. In this paper, we b… ▽ More

    Submitted 8 March, 2024; originally announced March 2024.

  14. arXiv:2402.15862  [pdf, other

    cs.CL

    SportQA: A Benchmark for Sports Understanding in Large Language Models

    Authors: Haotian Xia, Zhengbang Yang, Yuqing Wang, Rhys Tracy, Yun Zhao, Dongdong Huang, Zezhi Chen, Yan Zhu, Yuan-fang Wang, Weining Shen

    Abstract: A deep understanding of sports, a field rich in strategic and dynamic content, is crucial for advancing Natural Language Processing (NLP). This holds particular significance in the context of evaluating and advancing Large Language Models (LLMs), given the existing gap in specialized benchmarks. To bridge this gap, we introduce SportQA, a novel benchmark specifically designed for evaluating LLMs i… ▽ More

    Submitted 24 February, 2024; originally announced February 2024.

  15. arXiv:2402.10259  [pdf, other

    cs.CV cs.GR

    GaussianObject: Just Taking Four Images to Get A High-Quality 3D Object with Gaussian Splatting

    Authors: Chen Yang, Sikuang Li, Jiemin Fang, Ruofan Liang, Lingxi Xie, Xiaopeng Zhang, Wei Shen, Qi Tian

    Abstract: Reconstructing and rendering 3D objects from highly sparse views is of critical importance for promoting applications of 3D vision techniques and improving user experience. However, images from sparse views only contain very limited 3D information, leading to two significant challenges: 1) Difficulty in building multi-view consistency as images for matching are too few; 2) Partially omitted or hig… ▽ More

    Submitted 20 February, 2024; v1 submitted 15 February, 2024; originally announced February 2024.

    Comments: Project page: https://gaussianobject.github.io/

  16. arXiv:2402.05808  [pdf, other

    cs.AI cs.CL cs.LG

    Training Large Language Models for Reasoning through Reverse Curriculum Reinforcement Learning

    Authors: Zhiheng Xi, Wenxiang Chen, Boyang Hong, Senjie Jin, Rui Zheng, Wei He, Yiwen Ding, Shichun Liu, Xin Guo, Junzhe Wang, Honglin Guo, Wei Shen, Xiaoran Fan, Yuhao Zhou, Shihan Dou, Xiao Wang, Xinbo Zhang, Peng Sun, Tao Gui, Qi Zhang, Xuanjing Huang

    Abstract: In this paper, we propose R$^3$: Learning Reasoning through Reverse Curriculum Reinforcement Learning (RL), a novel method that employs only outcome supervision to achieve the benefits of process supervision for large language models. The core challenge in applying RL to complex reasoning is to identify a sequence of actions that result in positive rewards and provide appropriate supervision for o… ▽ More

    Submitted 17 March, 2024; v1 submitted 8 February, 2024; originally announced February 2024.

    Comments: Preprint. Codes released: https://github.com/WooooDyy/LLM-Reverse-Curriculum-RL

  17. arXiv:2402.01391  [pdf, other

    cs.SE cs.CL

    StepCoder: Improve Code Generation with Reinforcement Learning from Compiler Feedback

    Authors: Shihan Dou, Yan Liu, Haoxiang Jia, Limao Xiong, Enyu Zhou, Wei Shen, Junjie Shan, Caishuang Huang, Xiao Wang, Xiaoran Fan, Zhiheng Xi, Yuhao Zhou, Tao Ji, Rui Zheng, Qi Zhang, Xuanjing Huang, Tao Gui

    Abstract: The advancement of large language models (LLMs) has significantly propelled the field of code generation. Previous work integrated reinforcement learning (RL) with compiler feedback for exploring the output space of LLMs to enhance code generation quality. However, the lengthy code generated by LLMs in response to complex human requirements makes RL exploration a challenge. Also, since the unit te… ▽ More

    Submitted 5 February, 2024; v1 submitted 2 February, 2024; originally announced February 2024.

    Comments: 13 pages, 5 figures

  18. arXiv:2401.17618  [pdf, other

    cs.CR cs.OS

    Beyond Control: Exploring Novel File System Objects for Data-Only Attacks on Linux Systems

    Authors: Jinmeng Zhou, Jiayi Hu, Ziyue Pan, Jiaxun Zhu, Guoren Li, Wenbo Shen, Yulei Sui, Zhiyun Qian

    Abstract: The widespread deployment of control-flow integrity has propelled non-control data attacks into the mainstream. In the domain of OS kernel exploits, by corrupting critical non-control data, local attackers can directly gain root access or privilege escalation without hijacking the control flow. As a result, OS kernels have been restricting the availability of such non-control data. This forces att… ▽ More

    Submitted 31 January, 2024; originally announced January 2024.

    Comments: 14 pages, in submission of the 31th ACM Conference on Computer and Communications Security (CCS), 2024

  19. Ambush from All Sides: Understanding Security Threats in Open-Source Software CI/CD Pipelines

    Authors: Ziyue Pan, Wenbo Shen, Xingkai Wang, Yutian Yang, Rui Chang, Yao Liu, Chengwei Liu, Yang Liu, Kui Ren

    Abstract: The continuous integration and continuous deployment (CI/CD) pipelines are widely adopted on Internet hosting platforms, such as GitHub. With the popularity, the CI/CD pipeline faces various security threats. However, current CI/CD pipelines suffer from malicious code and severe vulnerabilities. Even worse, people have not been fully aware of its attack surfaces and the corresponding impacts. Th… ▽ More

    Submitted 31 January, 2024; originally announced January 2024.

    Journal ref: IEEE Transactions on Dependable and Secure Computing (Volume: 21, Issue: 1, Jan.-Feb. 2024)

  20. arXiv:2401.17050  [pdf, other

    cs.CV cs.AI

    ViTree: Single-path Neural Tree for Step-wise Interpretable Fine-grained Visual Categorization

    Authors: Danning Lao, Qi Liu, Jiazi Bu, Junchi Yan, Wei Shen

    Abstract: As computer vision continues to advance and finds widespread applications across various domains, the need for interpretability in deep learning models becomes paramount. Existing methods often resort to post-hoc techniques or prototypes to explain the decision-making process, which can be indirect and lack intrinsic illustration. In this research, we introduce ViTree, a novel approach for fine-gr… ▽ More

    Submitted 30 January, 2024; originally announced January 2024.

  21. arXiv:2401.16402  [pdf, other

    cs.CV cs.AI

    A Survey on Visual Anomaly Detection: Challenge, Approach, and Prospect

    Authors: Yunkang Cao, Xiaohao Xu, Jiangning Zhang, Yuqi Cheng, Xiaonan Huang, Guansong Pang, Weiming Shen

    Abstract: Visual Anomaly Detection (VAD) endeavors to pinpoint deviations from the concept of normality in visual data, widely applied across diverse domains, e.g., industrial defect inspection, and medical lesion detection. This survey comprehensively examines recent advancements in VAD by identifying three primary challenges: 1) scarcity of training data, 2) diversity of visual modalities, and 3) complexi… ▽ More

    Submitted 29 January, 2024; originally announced January 2024.

    Comments: Work in progress. Yunkang Cao, Xiaohao Xu, and Jiangning Zhang contribute equally to this work

  22. arXiv:2401.16158  [pdf, other

    cs.CL cs.CV

    Mobile-Agent: Autonomous Multi-Modal Mobile Device Agent with Visual Perception

    Authors: Junyang Wang, Haiyang Xu, Jiabo Ye, Ming Yan, Weizhou Shen, Ji Zhang, Fei Huang, Jitao Sang

    Abstract: Mobile device agent based on Multimodal Large Language Models (MLLM) is becoming a popular application. In this paper, we introduce Mobile-Agent, an autonomous multi-modal mobile device agent. Mobile-Agent first leverages visual perception tools to accurately identify and locate both the visual and textual elements within the app's front-end interface. Based on the perceived vision context, it the… ▽ More

    Submitted 18 April, 2024; v1 submitted 29 January, 2024; originally announced January 2024.

    Comments: Accepted by ICLR 2024 Workshop in Large Language Model (LLM) Agents

  23. arXiv:2401.14613  [pdf, ps, other

    cs.GT

    Multiplayer General Lotto game

    Authors: Bonan Ni, Yan Liu, Weiran Shen, Zihe Wang

    Abstract: In this paper, we explore the multiplayer General Lotto Blotto game over a single battlefield, a notable variant of the Colonel Blotto game. In this version, each player employs a probability distribution for resource allocation, ensuring that the expected expenditure does not surpass their budget. We first establish the existence of a Nash equilibrium for a modified version of this game, in which… ▽ More

    Submitted 31 January, 2024; v1 submitted 25 January, 2024; originally announced January 2024.

  24. arXiv:2401.11458  [pdf, other

    cs.CL

    Linear Alignment: A Closed-form Solution for Aligning Human Preferences without Tuning and Feedback

    Authors: Songyang Gao, Qiming Ge, Wei Shen, Shihan Dou, Junjie Ye, Xiao Wang, Rui Zheng, Yicheng Zou, Zhi Chen, Hang Yan, Qi Zhang, Dahua Lin

    Abstract: The success of AI assistants based on Language Models (LLMs) hinges on Reinforcement Learning from Human Feedback (RLHF) to comprehend and align with user intentions. However, traditional alignment algorithms, such as PPO, are hampered by complex annotation and training requirements. This reliance limits the applicability of RLHF and hinders the development of professional assistants tailored to d… ▽ More

    Submitted 6 May, 2024; v1 submitted 21 January, 2024; originally announced January 2024.

    Comments: Accepted by ICML2024

  25. arXiv:2401.08332  [pdf, other

    cs.CV

    Generative Denoise Distillation: Simple Stochastic Noises Induce Efficient Knowledge Transfer for Dense Prediction

    Authors: Zhaoge Liu, Xiaohao Xu, Yunkang Cao, Weiming Shen

    Abstract: Knowledge distillation is the process of transferring knowledge from a more powerful large model (teacher) to a simpler counterpart (student). Numerous current approaches involve the student imitating the knowledge of the teacher directly. However, redundancy still exists in the learned representations through these prevalent methods, which tend to learn each spatial location's features indiscrimi… ▽ More

    Submitted 17 January, 2024; v1 submitted 16 January, 2024; originally announced January 2024.

  26. arXiv:2401.07324  [pdf, other

    cs.AI cs.CL

    Small LLMs Are Weak Tool Learners: A Multi-LLM Agent

    Authors: Weizhou Shen, Chenliang Li, Hongzhan Chen, Ming Yan, Xiaojun Quan, Hehong Chen, Ji Zhang, Fei Huang

    Abstract: Large Language Model (LLM) agents significantly extend the capabilities of standalone LLMs, empowering them to interact with external tools (e.g., APIs, functions) and complete various tasks in a self-directed fashion. The challenge of tool use demands that LLMs not only understand user queries and generate answers accurately but also excel in task planning, tool invocation, and result summarizati… ▽ More

    Submitted 16 February, 2024; v1 submitted 14 January, 2024; originally announced January 2024.

    Comments: On progress, github repo: https://github.com/X-PLUG/Multi-LLM-Agent

  27. arXiv:2401.06785  [pdf, other

    cs.CL cs.AI

    Human-Instruction-Free LLM Self-Alignment with Limited Samples

    Authors: Hongyi Guo, Yuanshun Yao, Wei Shen, Jiaheng Wei, Xiaoying Zhang, Zhaoran Wang, Yang Liu

    Abstract: Aligning large language models (LLMs) with human values is a vital task for LLM practitioners. Current alignment techniques have several limitations: (1) requiring a large amount of annotated data; (2) demanding heavy human involvement; (3) lacking a systematic mechanism to continuously improve. In this work, we study aligning LLMs to a new domain with limited samples (e.g. < 100). We propose an a… ▽ More

    Submitted 6 January, 2024; originally announced January 2024.

  28. arXiv:2401.06080  [pdf, other

    cs.AI

    Secrets of RLHF in Large Language Models Part II: Reward Modeling

    Authors: Binghai Wang, Rui Zheng, Lu Chen, Yan Liu, Shihan Dou, Caishuang Huang, Wei Shen, Senjie Jin, Enyu Zhou, Chenyu Shi, Songyang Gao, Nuo Xu, Yuhao Zhou, Xiaoran Fan, Zhiheng Xi, Jun Zhao, Xiao Wang, Tao Ji, Hang Yan, Lixing Shen, Zhan Chen, Tao Gui, Qi Zhang, Xipeng Qiu, Xuanjing Huang , et al. (2 additional authors not shown)

    Abstract: Reinforcement Learning from Human Feedback (RLHF) has become a crucial technology for aligning language models with human values and intentions, enabling models to produce more helpful and harmless responses. Reward models are trained as proxies for human preferences to drive reinforcement learning optimization. While reward models are often considered central to achieving high performance, they f… ▽ More

    Submitted 12 January, 2024; v1 submitted 11 January, 2024; originally announced January 2024.

  29. ModuleGuard:Understanding and Detecting Module Conflicts in Python Ecosystem

    Authors: Ruofan Zhu, Xingyu Wang, Chengwei Liu, Zhengzi Xu, Wenbo Shen, Rui Chang, Yang Liu

    Abstract: Python has become one of the most popular programming languages for software development due to its simplicity, readability, and versatility. As the Python ecosystem grows, developers face increasing challenges in avoiding module conflicts, which occur when different packages have the same namespace modules. Unfortunately, existing work has neither investigated the module conflict comprehensively… ▽ More

    Submitted 4 January, 2024; originally announced January 2024.

    Comments: The paper was accepted by ICSE24

    MSC Class: 65-04 ACM Class: D.2; K.6.3

  30. arXiv:2312.15253  [pdf, other

    cs.CV cs.AI

    Efficient Deformable Tissue Reconstruction via Orthogonal Neural Plane

    Authors: Chen Yang, Kailing Wang, Yuehao Wang, Qi Dou, Xiaokang Yang, Wei Shen

    Abstract: Intraoperative imaging techniques for reconstructing deformable tissues in vivo are pivotal for advanced surgical systems. Existing methods either compromise on rendering quality or are excessively computationally intensive, often demanding dozens of hours to perform, which significantly hinders their practical application. In this paper, we introduce Fast Orthogonal Plane (Forplane), a novel, eff… ▽ More

    Submitted 23 December, 2023; originally announced December 2023.

  31. arXiv:2312.11034  [pdf, other

    cs.LG

    Appeal: Allow Mislabeled Samples the Chance to be Rectified in Partial Label Learning

    Authors: Chongjie Si, Xuehui Wang, Yan Wang, Xiaokang Yang, Wei Shen

    Abstract: In partial label learning (PLL), each instance is associated with a set of candidate labels among which only one is ground-truth. The majority of the existing works focuses on constructing robust classifiers to estimate the labeling confidence of candidate labels in order to identify the correct one. However, these methods usually struggle to identify and rectify mislabeled samples. To help these… ▽ More

    Submitted 28 March, 2024; v1 submitted 18 December, 2023; originally announced December 2023.

    Comments: Under review. An extended version of 2024 AAAI oral paper "Partial Label Learning with a Partner"

  32. arXiv:2312.09979  [pdf, other

    cs.CL

    LoRAMoE: Alleviate World Knowledge Forgetting in Large Language Models via MoE-Style Plugin

    Authors: Shihan Dou, Enyu Zhou, Yan Liu, Songyang Gao, Jun Zhao, Wei Shen, Yuhao Zhou, Zhiheng Xi, Xiao Wang, Xiaoran Fan, Shiliang Pu, Jiang Zhu, Rui Zheng, Tao Gui, Qi Zhang, Xuanjing Huang

    Abstract: Supervised fine-tuning (SFT) is a crucial step for large language models (LLMs), enabling them to align with human instructions and enhance their capabilities in downstream tasks. Increasing instruction data substantially is a direct solution to align the model with a broader range of downstream tasks or notably improve its performance on a specific task. However, we find that large-scale increase… ▽ More

    Submitted 8 March, 2024; v1 submitted 15 December, 2023; originally announced December 2023.

    Comments: 14 pages, 7 figures

  33. arXiv:2312.05286  [pdf, other

    cs.CV

    Bridging Synthetic and Real Worlds for Pre-training Scene Text Detectors

    Authors: Tongkun Guan, Wei Shen, Xue Yang, Xuehui Wang, Xiaokang Yang

    Abstract: Existing scene text detection methods typically rely on extensive real data for training. Due to the lack of annotated real images, recent works have attempted to exploit large-scale labeled synthetic data (LSD) for pre-training text detectors. However, a synth-to-real domain gap emerges, further limiting the performance of text detectors. Differently, in this work, we propose FreeReal, a real-dom… ▽ More

    Submitted 9 March, 2024; v1 submitted 8 December, 2023; originally announced December 2023.

  34. arXiv:2312.00860  [pdf, other

    cs.CV

    Segment Any 3D Gaussians

    Authors: Jiazhong Cen, Jiemin Fang, Chen Yang, Lingxi Xie, Xiaopeng Zhang, Wei Shen, Qi Tian

    Abstract: Interactive 3D segmentation in radiance fields is an appealing task since its importance in 3D scene understanding and manipulation. However, existing methods face challenges in either achieving fine-grained, multi-granularity segmentation or contending with substantial computational overhead, inhibiting real-time interaction. In this paper, we introduce Segment Any 3D GAussians (SAGA), a novel 3D… ▽ More

    Submitted 1 December, 2023; originally announced December 2023.

    Comments: Work in progress. Project page: https://jumpat.github.io/SAGA

  35. arXiv:2311.17112  [pdf, other

    cs.CV

    Parameter Efficient Fine-tuning via Cross Block Orchestration for Segment Anything Model

    Authors: Zelin Peng, Zhengqin Xu, Zhilin Zeng, Lingxi Xie, Qi Tian, Wei Shen

    Abstract: Parameter-efficient fine-tuning (PEFT) is an effective methodology to unleash the potential of large foundation models in novel scenarios with limited training data. In the computer vision community, PEFT has shown effectiveness in image classification, but little research has studied its ability for image segmentation. Fine-tuning segmentation models usually require a heavier adjustment of parame… ▽ More

    Submitted 28 March, 2024; v1 submitted 28 November, 2023; originally announced November 2023.

    Comments: Accepted by CVPR2024

  36. arXiv:2311.13231  [pdf, other

    cs.LG cs.AI cs.CV

    Using Human Feedback to Fine-tune Diffusion Models without Any Reward Model

    Authors: Kai Yang, Jian Tao, Jiafei Lyu, Chunjiang Ge, Jiaxin Chen, Qimai Li, Weihan Shen, Xiaolong Zhu, Xiu Li

    Abstract: Using reinforcement learning with human feedback (RLHF) has shown significant promise in fine-tuning diffusion models. Previous methods start by training a reward model that aligns with human preferences, then leverage RL techniques to fine-tune the underlying models. However, crafting an efficient reward model demands extensive datasets, optimal architecture, and manual hyperparameter tuning, mak… ▽ More

    Submitted 23 March, 2024; v1 submitted 22 November, 2023; originally announced November 2023.

    Comments: CVPR 2024 accepted; huggingface daily paper

  37. arXiv:2311.08189  [pdf, other

    cs.CL

    All Data on the Table: Novel Dataset and Benchmark for Cross-Modality Scientific Information Extraction

    Authors: Yuhan Li, Jian Wu, Zhiwei Yu, Börje F. Karlsson, Wei Shen, Manabu Okumura, Chin-Yew Lin

    Abstract: Extracting key information from scientific papers has the potential to help researchers work more efficiently and accelerate the pace of scientific progress. Over the last few years, research on Scientific Information Extraction (SciIE) witnessed the release of several new systems and benchmarks. However, existing paper-focused datasets mostly focus only on specific parts of a manuscript (e.g., ab… ▽ More

    Submitted 17 December, 2023; v1 submitted 14 November, 2023; originally announced November 2023.

    Comments: Work in progress; 17 pages, 6 figures, 11 tables

  38. arXiv:2311.02782  [pdf, other

    cs.CV cs.AI

    Towards Generic Anomaly Detection and Understanding: Large-scale Visual-linguistic Model (GPT-4V) Takes the Lead

    Authors: Yunkang Cao, Xiaohao Xu, Chen Sun, Xiaonan Huang, Weiming Shen

    Abstract: Anomaly detection is a crucial task across different domains and data types. However, existing anomaly detection models are often designed for specific domains and modalities. This study explores the use of GPT-4V(ision), a powerful visual-linguistic model, to address anomaly detection tasks in a generic manner. We investigate the application of GPT-4V in multi-modality, multi-domain anomaly detec… ▽ More

    Submitted 16 November, 2023; v1 submitted 5 November, 2023; originally announced November 2023.

    Comments: Work in progress. Evaluated GPT-4V on 4 modalities, 9 tasks, and 15 datasets. The first three authors contribute equally

  39. arXiv:2311.00944  [pdf, other

    stat.ML cs.IT cs.LG math.OC

    Stochastic Smoothed Gradient Descent Ascent for Federated Minimax Optimization

    Authors: Wei Shen, Minhui Huang, Jiawei Zhang, Cong Shen

    Abstract: In recent years, federated minimax optimization has attracted growing interest due to its extensive applications in various machine learning tasks. While Smoothed Alternative Gradient Descent Ascent (Smoothed-AGDA) has proved its success in centralized nonconvex minimax optimization, how and whether smoothing technique could be helpful in federated setting remains unexplored. In this paper, we pro… ▽ More

    Submitted 18 April, 2024; v1 submitted 1 November, 2023; originally announced November 2023.

  40. arXiv:2310.17186  [pdf, other

    cs.SE

    Demystifying Compiler Unstable Feature Usage and Impacts in the Rust Ecosystem

    Authors: Chenghao Li, Yifei Wu, Wenbo Shen, Zichen Zhao, Rui Chang, Chengwei Liu, Yang Liu, Kui Ren

    Abstract: Rust programming language is gaining popularity rapidly in building reliable and secure systems due to its security guarantees and outstanding performance. To provide extra functionalities, the Rust compiler introduces Rust unstable features (RUF) to extend compiler functionality, syntax, and standard library support. However, these features are unstable and may get removed, introducing compilatio… ▽ More

    Submitted 26 October, 2023; originally announced October 2023.

    Comments: Published in ICSE'2024 Conference: https://conf.researchr.org/details/icse-2024/icse-2024-research-track/6/Demystifying-Compiler-Unstable-Feature-Usage-and-Impacts-in-the-Rust-Ecosystem. Project webiste: https://sites.google.com/view/ruf-study/home. Released Source Code Zonodo: https://zenodo.org/records/8289375

  41. MUSER: A Multi-View Similar Case Retrieval Dataset

    Authors: Qingquan Li, Yiran Hu, Feng Yao, Chaojun Xiao, Zhiyuan Liu, Maosong Sun, Weixing Shen

    Abstract: Similar case retrieval (SCR) is a representative legal AI application that plays a pivotal role in promoting judicial fairness. However, existing SCR datasets only focus on the fact description section when judging the similarity between cases, ignoring other valuable sections (e.g., the court's opinion) that can provide insightful reasoning process behind. Furthermore, the case similarities are t… ▽ More

    Submitted 24 October, 2023; originally announced October 2023.

    Comments: Accepted by CIKM 2023 Resource Track

    Journal ref: CIKM 2023

  42. arXiv:2310.14790  [pdf, other

    cs.LG cs.AI

    Weighted Joint Maximum Mean Discrepancy Enabled Multi-Source-Multi-Target Unsupervised Domain Adaptation Fault Diagnosis

    Authors: Zixuan Wang, Haoran Tang, Haibo Wang, Bo Qin, Mark D. Butala, Weiming Shen, Hongwei Wang

    Abstract: Despite the remarkable results that can be achieved by data-driven intelligent fault diagnosis techniques, they presuppose the same distribution of training and test data as well as sufficient labeled data. Various operating states often exist in practical scenarios, leading to the problem of domain shift that hinders the effectiveness of fault diagnosis. While recent unsupervised domain adaptatio… ▽ More

    Submitted 23 November, 2023; v1 submitted 20 October, 2023; originally announced October 2023.

  43. arXiv:2310.11971  [pdf, other

    cs.LG cs.AI

    Improving Generalization of Alignment with Human Preferences through Group Invariant Learning

    Authors: Rui Zheng, Wei Shen, Yuan Hua, Wenbin Lai, Shihan Dou, Yuhao Zhou, Zhiheng Xi, Xiao Wang, Haoran Huang, Tao Gui, Qi Zhang, Xuanjing Huang

    Abstract: The success of AI assistants based on language models (LLMs) hinges crucially on Reinforcement Learning from Human Feedback (RLHF), which enables the generation of responses more aligned with human preferences. As universal AI assistants, there's a growing expectation for them to perform consistently across various domains. However, previous work shows that Reinforcement Learning (RL) often exploi… ▽ More

    Submitted 25 December, 2023; v1 submitted 18 October, 2023; originally announced October 2023.

  44. arXiv:2310.08877  [pdf, other

    cs.CL

    Retrieval-Generation Alignment for End-to-End Task-Oriented Dialogue System

    Authors: Weizhou Shen, Yingqi Gao, Canbin Huang, Fanqi Wan, Xiaojun Quan, Wei Bi

    Abstract: Developing an efficient retriever to retrieve knowledge from a large-scale knowledge base (KB) is critical for task-oriented dialogue systems to effectively handle localized and specialized tasks. However, widely used generative models such as T5 and ChatGPT often struggle to differentiate subtle differences among the retrieved KB records when generating responses, resulting in suboptimal quality… ▽ More

    Submitted 20 October, 2023; v1 submitted 13 October, 2023; originally announced October 2023.

    Comments: Accepted to EMNLP 2023 Main Conference

  45. arXiv:2310.05199  [pdf, other

    cs.CL

    Loose lips sink ships: Mitigating Length Bias in Reinforcement Learning from Human Feedback

    Authors: Wei Shen, Rui Zheng, Wenyu Zhan, Jun Zhao, Shihan Dou, Tao Gui, Qi Zhang, Xuanjing Huang

    Abstract: Reinforcement learning from human feedback serves as a crucial bridge, aligning large language models with human and societal values. This alignment requires a vast corpus of human feedback to learn a reward model, which is subsequently used to finetune language models. However, we have identified that the reward model often finds shortcuts to bypass its intended objectives, misleadingly assuming… ▽ More

    Submitted 29 November, 2023; v1 submitted 8 October, 2023; originally announced October 2023.

    Comments: EMNLP 2023 findings, Length Bias in RLHF, Mitigate bias in reward modeling

  46. arXiv:2310.04288  [pdf, other

    eess.SY cs.AI cs.FL

    Searching for Optimal Runtime Assurance via Reachability and Reinforcement Learning

    Authors: Kristina Miller, Christopher K. Zeitler, William Shen, Kerianne Hobbs, Sayan Mitra, John Schierman, Mahesh Viswanathan

    Abstract: A runtime assurance system (RTA) for a given plant enables the exercise of an untrusted or experimental controller while assuring safety with a backup (or safety) controller. The relevant computational design problem is to create a logic that assures safety by switching to the safety controller as needed, while maximizing some performance criteria, such as the utilization of the untrusted controll… ▽ More

    Submitted 6 October, 2023; originally announced October 2023.

  47. arXiv:2309.16706  [pdf, other

    cs.CR cs.AI cs.LG

    AIR: Threats of Adversarial Attacks on Deep Learning-Based Information Recovery

    Authors: Jinyin Chen, Jie Ge, Shilian Zheng, Linhui Ye, Haibin Zheng, Weiguo Shen, Keqiang Yue, Xiaoniu Yang

    Abstract: A wireless communications system usually consists of a transmitter which transmits the information and a receiver which recovers the original information from the received distorted signal. Deep learning (DL) has been used to improve the performance of the receiver in complicated channel environments and state-of-the-art (SOTA) performance has been achieved. However, its robustness has not been in… ▽ More

    Submitted 17 August, 2023; originally announced September 2023.

  48. arXiv:2309.15252  [pdf, other

    cs.RO cs.LG

    V2X-Lead: LiDAR-based End-to-End Autonomous Driving with Vehicle-to-Everything Communication Integration

    Authors: Zhiyun Deng, Yanjun Shi, Weiming Shen

    Abstract: This paper presents a LiDAR-based end-to-end autonomous driving method with Vehicle-to-Everything (V2X) communication integration, termed V2X-Lead, to address the challenges of navigating unregulated urban scenarios under mixed-autonomy traffic conditions. The proposed method aims to handle imperfect partial observations by fusing the onboard LiDAR sensor and V2X communication data. A model-free a… ▽ More

    Submitted 26 September, 2023; originally announced September 2023.

    Comments: To be published in IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 2023

  49. arXiv:2309.14753  [pdf, other

    cs.CV

    Advanced Volleyball Stats for All Levels: Automatic Setting Tactic Detection and Classification with a Single Camera

    Authors: Haotian Xia, Rhys Tracy, Yun Zhao, Yuqing Wang, Yuan-Fang Wang, Weining Shen

    Abstract: This paper presents PathFinder and PathFinderPlus, two novel end-to-end computer vision frameworks designed specifically for advanced setting strategy classification in volleyball matches from a single camera view. Our frameworks combine setting ball trajectory recognition with a novel set trajectory classifier to generate comprehensive and advanced statistical data. This approach offers a fresh p… ▽ More

    Submitted 26 September, 2023; originally announced September 2023.

    Comments: ICDM workshop 2023

  50. arXiv:2309.08864  [pdf, other

    cs.DC

    A Synergy between On- and Off-Chip Data Reuse for GPU-based Out-of-Core Stencil Computation

    Authors: Jingcheng Shen, Linbo Long, Jun Zhang, Weiqi Shen, Masao Okita, Fumihiko Ino

    Abstract: Stencil computation is an extensively-utilized class of scientific-computing applications that can be efficiently accelerated by graphics processing units (GPUs). Out-of-core approaches enable a GPU to handle large stencil codes whose data size is beyond the memory capacity of the GPU. However, current research on out-of-core stencil computation primarily focus on minimizing the amount of data tra… ▽ More

    Submitted 15 September, 2023; originally announced September 2023.

    Comments: 9 pages, 10 figures