Skip to main content

Showing 1–50 of 200 results for author: Lou, J

Searching in archive cs. Search in all archives.
.
  1. arXiv:2507.00586  [pdf, ps, other

    cs.CV

    Context-Aware Academic Emotion Dataset and Benchmark

    Authors: Luming Zhao, Jingwen Xuan, Jiamin Lou, Yonghui Yu, Wenwu Yang

    Abstract: Academic emotion analysis plays a crucial role in evaluating students' engagement and cognitive states during the learning process. This paper addresses the challenge of automatically recognizing academic emotions through facial expressions in real-world learning environments. While significant progress has been made in facial expression recognition for basic emotions, academic emotion recognition… ▽ More

    Submitted 1 July, 2025; originally announced July 2025.

    Comments: Accepted to ICCV 2025

  2. arXiv:2506.21033  [pdf, ps, other

    cs.DC

    BLOCKS: Blockchain-supported Cross-Silo Knowledge Sharing for Efficient LLM Services

    Authors: Zhaojiacheng Zhou, Hongze Liu, Shijing Yuan, Hanning Zhang, Jiong Lou, Chentao Wu, Jie Li

    Abstract: The hallucination problem of Large Language Models (LLMs) has increasingly drawn attention. Augmenting LLMs with external knowledge is a promising solution to address this issue. However, due to privacy and security concerns, a vast amount of downstream task-related knowledge remains dispersed and isolated across various "silos," making it difficult to access. To bridge this knowledge gap, we prop… ▽ More

    Submitted 26 June, 2025; originally announced June 2025.

  3. arXiv:2506.20251  [pdf, ps, other

    cs.LG cs.AI

    Q-resafe: Assessing Safety Risks and Quantization-aware Safety Patching for Quantized Large Language Models

    Authors: Kejia Chen, Jiawen Zhang, Jiacong Hu, Yu Wang, Jian Lou, Zunlei Feng, Mingli Song

    Abstract: Quantized large language models (LLMs) have gained increasing attention and significance for enabling deployment in resource-constrained environments. However, emerging studies on a few calibration dataset-free quantization methods suggest that quantization may compromise the safety capabilities of LLMs, underscoring the urgent need for systematic safety evaluations and effective mitigation strate… ▽ More

    Submitted 25 June, 2025; originally announced June 2025.

    Comments: ICML 2025

  4. arXiv:2506.17250  [pdf, ps, other

    cs.LG cs.AI

    Towards Interpretable Adversarial Examples via Sparse Adversarial Attack

    Authors: Fudong Lin, Jiadong Lou, Hao Wang, Brian Jalaian, Xu Yuan

    Abstract: Sparse attacks are to optimize the magnitude of adversarial perturbations for fooling deep neural networks (DNNs) involving only a few perturbed pixels (i.e., under the l0 constraint), suitable for interpreting the vulnerability of DNNs. However, existing solutions fail to yield interpretable adversarial examples due to their poor sparsity. Worse still, they often struggle with heavy computational… ▽ More

    Submitted 8 June, 2025; originally announced June 2025.

    Journal ref: European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases (ECMLPKDD 2025)

  5. arXiv:2506.13651  [pdf, ps, other

    cs.LG

    xbench: Tracking Agents Productivity Scaling with Profession-Aligned Real-World Evaluations

    Authors: Kaiyuan Chen, Yixin Ren, Yang Liu, Xiaobo Hu, Haotong Tian, Tianbao Xie, Fangfu Liu, Haoye Zhang, Hongzhang Liu, Yuan Gong, Chen Sun, Han Hou, Hui Yang, James Pan, Jianan Lou, Jiayi Mao, Jizheng Liu, Jinpeng Li, Kangyi Liu, Kenkun Liu, Rui Wang, Run Li, Tong Niu, Wenlong Zhang, Wenqi Yan , et al. (8 additional authors not shown)

    Abstract: We introduce xbench, a dynamic, profession-aligned evaluation suite designed to bridge the gap between AI agent capabilities and real-world productivity. While existing benchmarks often focus on isolated technical skills, they may not accurately reflect the economic value agents deliver in professional settings. To address this, xbench targets commercially significant domains with evaluation tasks… ▽ More

    Submitted 16 June, 2025; originally announced June 2025.

    Comments: Project page: https://xbench.org

  6. arXiv:2506.11329  [pdf, ps, other

    cs.AR

    A4: Microarchitecture-Aware LLC Management for Datacenter Servers with Emerging I/O Devices

    Authors: Haneul Park, Jiaqi Lou, Sangjin Lee, Yifan Yuan, Kyoung Soo Park, Yongseok Son, Ipoom Jeong, Nam Sung Kim

    Abstract: In modern server CPUs, the Last-Level Cache (LLC) serves not only as a victim cache for higher-level private caches but also as a buffer for low-latency DMA transfers between CPU cores and I/O devices through Direct Cache Access (DCA). However, prior work has shown that high-bandwidth network-I/O devices can rapidly flood the LLC with packets, often causing significant contention with co-running w… ▽ More

    Submitted 12 June, 2025; originally announced June 2025.

  7. arXiv:2506.10030  [pdf, ps, other

    cs.CR cs.AI

    Safeguarding Multimodal Knowledge Copyright in the RAG-as-a-Service Environment

    Authors: Tianyu Chen, Jian Lou, Wenjie Wang

    Abstract: As Retrieval-Augmented Generation (RAG) evolves into service-oriented platforms (Rag-as-a-Service) with shared knowledge bases, protecting the copyright of contributed data becomes essential. Existing watermarking methods in RAG focus solely on textual knowledge, leaving image knowledge unprotected. In this work, we propose AQUA, the first watermark framework for image knowledge protection in Mult… ▽ More

    Submitted 10 June, 2025; originally announced June 2025.

  8. arXiv:2506.05767  [pdf, ps, other

    cs.CL cs.AI

    dots.llm1 Technical Report

    Authors: Bi Huo, Bin Tu, Cheng Qin, Da Zheng, Debing Zhang, Dongjie Zhang, En Li, Fu Guo, Jian Yao, Jie Lou, Junfeng Tian, Li Hu, Ran Zhu, Shengdong Chen, Shuo Liu, Su Guang, Te Wo, Weijun Zhang, Xiaoming Shi, Xinxin Peng, Xing Wu, Yawen Liu, Yuqiu Ji, Ze Wen, Zhenhai Liu , et al. (2 additional authors not shown)

    Abstract: Mixture of Experts (MoE) models have emerged as a promising paradigm for scaling language models efficiently by activating only a subset of parameters for each input token. In this report, we present dots.llm1, a large-scale MoE model that activates 14B parameters out of a total of 142B parameters, delivering performance on par with state-of-the-art models while reducing training and inference cos… ▽ More

    Submitted 6 June, 2025; originally announced June 2025.

  9. LRScheduler: A Layer-aware and Resource-adaptive Container Scheduler in Edge Computing

    Authors: Zhiqing Tang, Wentao Peng, Jianxiong Guo, Jiong Lou, Hanshuai Cui, Tian Wang, Yuan Wu, Weijia Jia

    Abstract: Lightweight containers provide an efficient approach for deploying computation-intensive applications in network edge. The layered storage structure of container images can further reduce the deployment cost and container startup time. Existing researches discuss layer sharing scheduling theoretically but with little attention paid to the practical implementation. To fill in this gap, we propose a… ▽ More

    Submitted 4 June, 2025; originally announced June 2025.

    Comments: 9 pages, 10 figures, The 20th International Conference on Mobility, Sensing and Networking (MSN 2024)

    Journal ref: IEEE, "LRScheduler: A Layer-aware and Resource-adaptive Container Scheduler in Edge Computing," 20th International Conference on Mobility, Sensing and Networking (MSN 2024), 2024

  10. arXiv:2506.01290  [pdf, other

    cs.LG cs.AI

    TSRating: Rating Quality of Diverse Time Series Data by Meta-learning from LLM Judgment

    Authors: Shunyu Wu, Dan Li, Haozheng Ye, Zhuomin Chen, Jiahui Zhou, Jian Lou, Zibin Zheng, See-Kiong Ng

    Abstract: High-quality time series (TS) data are essential for ensuring TS model performance, rendering research on rating TS data quality indispensable. Existing methods have shown promising rating accuracy within individual domains, primarily by extending data quality rating techniques such as influence functions and Shapley values to account for temporal characteristics. However, they neglect the fact th… ▽ More

    Submitted 1 June, 2025; originally announced June 2025.

  11. arXiv:2506.00807  [pdf, other

    cs.AI

    Enhancing LLM Reasoning for Time Series Classification by Tailored Thinking and Fused Decision

    Authors: Jiahui Zhou, Dan Li, Lin Li, Zhuomin Chen, Shunyu Wu, Haozheng Ye, Jian Lou, Costas J. Spanos

    Abstract: The reasoning capabilities of large language models (LLMs) have significantly advanced their performance by enabling in-depth understanding of diverse tasks. With growing interest in applying LLMs to the time series domain, this has proven nontrivial, as evidenced by the limited efficacy of straightforwardly adapting text-domain reasoning techniques. Although recent work has shown promise in sever… ▽ More

    Submitted 31 May, 2025; originally announced June 2025.

  12. arXiv:2505.22633  [pdf, other

    cs.CL cs.AI cs.CV cs.LG cs.MM

    Spatial Knowledge Graph-Guided Multimodal Synthesis

    Authors: Yida Xue, Zhen Bi, Jinnan Yang, Jungang Lou, Huajun Chen, Ningyu Zhang

    Abstract: Recent advances in multimodal large language models (MLLMs) have significantly enhanced their capabilities; however, their spatial perception abilities remain a notable limitation. To address this challenge, multimodal data synthesis offers a promising solution. Yet, ensuring that synthesized data adhere to spatial common sense is a non-trivial task. In this work, we introduce SKG2Data, a novel mu… ▽ More

    Submitted 28 May, 2025; originally announced May 2025.

    Comments: Ongoing work

  13. arXiv:2505.19560  [pdf, ps, other

    cs.RO

    LF-GNSS: Towards More Robust Satellite Positioning with a Hard Example Mining Enhanced Learning-Filtering Deep Fusion Framework

    Authors: Jianan Lou, Rong Zhang

    Abstract: Global Navigation Satellite System (GNSS) is essential for autonomous driving systems, unmanned vehicles, and various location-based technologies, as it provides the precise geospatial information necessary for navigation and situational awareness. However, its performance is often degraded by Non-Line-Of-Sight (NLOS) and multipath effects, especially in urban environments. Recently, Artificial In… ▽ More

    Submitted 26 May, 2025; originally announced May 2025.

  14. arXiv:2505.19065  [pdf, ps, other

    cs.CV

    MMP-2K: A Benchmark Multi-Labeled Macro Photography Image Quality Assessment Database

    Authors: Jiashuo Chang, Zhengyi Li, Jianxun Lou, Zhen Qiu, Hanhe Lin

    Abstract: Macro photography (MP) is a specialized field of photography that captures objects at an extremely close range, revealing tiny details. Although an accurate macro photography image quality assessment (MPIQA) metric can benefit macro photograph capturing, which is vital in some domains such as scientific research and medical applications, the lack of MPIQA data limits the development of MPIQA metri… ▽ More

    Submitted 25 May, 2025; originally announced May 2025.

    Comments: Accepted to the IEEE International Conference on Image Processing, IEEE ICIP 2025

  15. arXiv:2505.10541  [pdf, ps, other

    cs.CV

    Exploring Implicit Visual Misunderstandings in Multimodal Large Language Models through Attention Analysis

    Authors: Pengfei Wang, Guohai Xu, Weinong Wang, Junjie Yang, Jie Lou, Yunhua Xue

    Abstract: Recent advancements have enhanced the capability of Multimodal Large Language Models (MLLMs) to comprehend multi-image information. However, existing benchmarks primarily evaluate answer correctness, overlooking whether models genuinely comprehend the visual input. To address this, we define implicit visual misunderstanding (IVM), where MLLMs provide correct answers without fully comprehending the… ▽ More

    Submitted 23 May, 2025; v1 submitted 15 May, 2025; originally announced May 2025.

  16. arXiv:2505.02134  [pdf, other

    cs.CV

    HiLLIE: Human-in-the-Loop Training for Low-Light Image Enhancement

    Authors: Xiaorui Zhao, Xinyue Zhou, Peibei Cao, Junyu Lou, Shuhang Gu

    Abstract: Developing effective approaches to generate enhanced results that align well with human visual preferences for high-quality well-lit images remains a challenge in low-light image enhancement (LLIE). In this paper, we propose a human-in-the-loop LLIE training framework that improves the visual quality of unsupervised LLIE model outputs through iterative training stages, named HiLLIE. At each stage,… ▽ More

    Submitted 4 May, 2025; originally announced May 2025.

  17. arXiv:2504.14603  [pdf, other

    cs.AI cs.HC cs.OS

    UFO2: The Desktop AgentOS

    Authors: Chaoyun Zhang, He Huang, Chiming Ni, Jian Mu, Si Qin, Shilin He, Lu Wang, Fangkai Yang, Pu Zhao, Chao Du, Liqun Li, Yu Kang, Zhao Jiang, Suzhen Zheng, Rujia Wang, Jiaxu Qian, Minghua Ma, Jian-Guang Lou, Qingwei Lin, Saravan Rajmohan, Dongmei Zhang

    Abstract: Recent Computer-Using Agents (CUAs), powered by multimodal large language models (LLMs), offer a promising direction for automating complex desktop workflows through natural language. However, most existing CUAs remain conceptual prototypes, hindered by shallow OS integration, fragile screenshot-based interaction, and disruptive execution. We present UFO2, a multiagent AgentOS for Windows deskto… ▽ More

    Submitted 25 April, 2025; v1 submitted 20 April, 2025; originally announced April 2025.

    Comments: The source code of UFO2 is publicly available at https://github.com/microsoft/UFO/, with comprehensive documentation provided at https://microsoft.github.io/UFO/

  18. arXiv:2504.13476  [pdf, other

    cs.LG cs.CV eess.IV

    Variational Autoencoder Framework for Hyperspectral Retrievals (Hyper-VAE) of Phytoplankton Absorption and Chlorophyll a in Coastal Waters for NASA's EMIT and PACE Missions

    Authors: Jiadong Lou, Bingqing Liu, Yuanheng Xiong, Xiaodong Zhang, Xu Yuan

    Abstract: Phytoplankton absorb and scatter light in unique ways, subtly altering the color of water, changes that are often minor for human eyes to detect but can be captured by sensitive ocean color instruments onboard satellites from space. Hyperspectral sensors, paired with advanced algorithms, are expected to significantly enhance the characterization of phytoplankton community composition, especially i… ▽ More

    Submitted 18 April, 2025; originally announced April 2025.

  19. arXiv:2504.03513  [pdf, ps, other

    cs.DS

    Local Search for Clustering in Almost-linear Time

    Authors: Shaofeng H. -C. Jiang, Yaonan Jin, Jianing Lou, Pinyan Lu

    Abstract: We propose the first \emph{local search} algorithm for Euclidean clustering that attains an $O(1)$-approximation in almost-linear time. Specifically, for Euclidean $k$-Means, our algorithm achieves an $O(c)$-approximation in $\tilde{O}(n^{1 + 1 / c})$ time, for any constant $c \ge 1$, maintaining the same running time as the previous (non-local-search-based) approach [la Tour and Saulpic, arXiv'24… ▽ More

    Submitted 4 April, 2025; originally announced April 2025.

  20. arXiv:2503.22458  [pdf, other

    cs.CL cs.AI

    Evaluating LLM-based Agents for Multi-Turn Conversations: A Survey

    Authors: Shengyue Guan, Haoyi Xiong, Jindong Wang, Jiang Bian, Bin Zhu, Jian-guang Lou

    Abstract: This survey examines evaluation methods for large language model (LLM)-based agents in multi-turn conversational settings. Using a PRISMA-inspired framework, we systematically reviewed nearly 250 scholarly sources, capturing the state of the art from various venues of publication, and establishing a solid foundation for our analysis. Our study offers a structured approach by developing two interre… ▽ More

    Submitted 28 March, 2025; originally announced March 2025.

  21. arXiv:2503.20502  [pdf, other

    cs.CV

    MLLM-Selector: Necessity and Diversity-driven High-Value Data Selection for Enhanced Visual Instruction Tuning

    Authors: Yiwei Ma, Guohai Xu, Xiaoshuai Sun, Jiayi Ji, Jie Lou, Debing Zhang, Rongrong Ji

    Abstract: Visual instruction tuning (VIT) has emerged as a crucial technique for enabling multi-modal large language models (MLLMs) to follow user instructions adeptly. Yet, a significant gap persists in understanding the attributes of high-quality instruction tuning data and frameworks for its automated selection. To address this, we introduce MLLM-Selector, an automated approach that identifies valuable d… ▽ More

    Submitted 29 March, 2025; v1 submitted 26 March, 2025; originally announced March 2025.

    Comments: Tech Report

  22. EDEA: Efficient Dual-Engine Accelerator for Depthwise Separable Convolution with Direct Data Transfer

    Authors: Yi Chen, Jie Lou, Malte Wabnitz, Johnson Loh, Tobias Gemmeke

    Abstract: Depthwise separable convolution (DSC) has emerged as a crucial technique, especially for resource-constrained devices. In this paper, we propose a dual-engine for the DSC hardware accelerator, which enables the full utilization of depthwise convolution (DWC) and pointwise convolution (PWC) processing elements (PEs) in all DSC layers. To determine the optimal dataflow, data reuse, and configuration… ▽ More

    Submitted 12 March, 2025; originally announced March 2025.

  23. arXiv:2503.04858  [pdf, other

    cs.CV cs.AI

    SHAPE : Self-Improved Visual Preference Alignment by Iteratively Generating Holistic Winner

    Authors: Kejia Chen, Jiawen Zhang, Jiacong Hu, Jiazhen Yang, Jian Lou, Zunlei Feng, Mingli Song

    Abstract: Large Visual Language Models (LVLMs) increasingly rely on preference alignment to ensure reliability, which steers the model behavior via preference fine-tuning on preference data structured as ``image - winner text - loser text'' triplets. However, existing approaches often suffer from limited diversity and high costs associated with human-annotated preference data, hindering LVLMs from fully ach… ▽ More

    Submitted 6 March, 2025; originally announced March 2025.

  24. arXiv:2503.03122  [pdf, other

    cs.CL cs.AI

    The Devil Is in the Details: Tackling Unimodal Spurious Correlations for Generalizable Multimodal Reward Models

    Authors: Zichao Li, Xueru Wen, Jie Lou, Yuqiu Ji, Yaojie Lu, Xianpei Han, Debing Zhang, Le Sun

    Abstract: Multimodal Reward Models (MM-RMs) are crucial for aligning Large Language Models (LLMs) with human preferences, particularly as LLMs increasingly interact with multimodal data. However, we find that MM-RMs trained on existing datasets often struggle to generalize to out-of-distribution data due to their reliance on unimodal spurious correlations, primarily text-only shortcuts within the training d… ▽ More

    Submitted 21 May, 2025; v1 submitted 4 March, 2025; originally announced March 2025.

    Comments: ICML 2025

  25. arXiv:2503.02424  [pdf, ps, other

    cs.CV

    Exploring Intrinsic Normal Prototypes within a Single Image for Universal Anomaly Detection

    Authors: Wei Luo, Yunkang Cao, Haiming Yao, Xiaotian Zhang, Jianan Lou, Yuqi Cheng, Weiming Shen, Wenyong Yu

    Abstract: Anomaly detection (AD) is essential for industrial inspection, yet existing methods typically rely on ``comparing'' test images to normal references from a training set. However, variations in appearance and positioning often complicate the alignment of these references with the test image, limiting detection accuracy. We observe that most anomalies manifest as local variations, meaning that even… ▽ More

    Submitted 1 July, 2025; v1 submitted 4 March, 2025; originally announced March 2025.

    Comments: Accepted by CVPR2025

  26. arXiv:2502.17173  [pdf, other

    cs.CL cs.AI

    Cheems: A Practical Guidance for Building and Evaluating Chinese Reward Models from Scratch

    Authors: Xueru Wen, Jie Lou, Zichao Li, Yaojie Lu, Xing Yu, Yuqiu Ji, Guohai Xu, Hongyu Lin, Ben He, Xianpei Han, Le Sun, Debing Zhang

    Abstract: Reward models (RMs) are crucial for aligning large language models (LLMs) with human preferences. However, most RM research is centered on English and relies heavily on synthetic resources, which leads to limited and less reliable datasets and benchmarks for Chinese. To address this gap, we introduce CheemsBench, a fully human-annotated RM evaluation benchmark within Chinese contexts, and CheemsPr… ▽ More

    Submitted 26 May, 2025; v1 submitted 24 February, 2025; originally announced February 2025.

    Comments: Accepted to ACL 2025

  27. arXiv:2502.11455  [pdf, other

    cs.CR

    Adversary-Aware DPO: Enhancing Safety Alignment in Vision Language Models via Adversarial Training

    Authors: Fenghua Weng, Jian Lou, Jun Feng, Minlie Huang, Wenjie Wang

    Abstract: Safety alignment is critical in pre-training large language models (LLMs) to generate responses aligned with human values and refuse harmful queries. Unlike LLM, the current safety alignment of VLMs is often achieved with post-hoc safety fine-tuning. However, these methods are less effective to white-box attacks. To address this, we propose $\textit{Adversary-aware DPO (ADPO)}$, a novel training f… ▽ More

    Submitted 17 February, 2025; originally announced February 2025.

  28. arXiv:2502.07669  [pdf, other

    cs.DS

    Coresets for Robust Clustering via Black-box Reductions to Vanilla Case

    Authors: Shaofeng H. -C. Jiang, Jianing Lou

    Abstract: We devise $ε$-coresets for robust $(k,z)$-Clustering with $m$ outliers through black-box reductions to vanilla case. Given an $ε$-coreset construction for vanilla clustering with size $N$, we construct coresets of size $N\cdot \mathrm{poly}\log(kmε^{-1}) + O_z\left(\min\{kmε^{-1}, mε^{-2z}\log^z(kmε^{-1}) \}\right)$ for various metric spaces, where $O_z$ hides $2^{O(z\log z)}$ factors. This increa… ▽ More

    Submitted 11 February, 2025; originally announced February 2025.

    Comments: The abstract is shortened to meet the length constraint of arXiv

  29. arXiv:2502.04675  [pdf, ps, other

    cs.AI cs.CL cs.LG

    Scalable Oversight for Superhuman AI via Recursive Self-Critiquing

    Authors: Xueru Wen, Jie Lou, Xinyu Lu, Junjie Yang, Yanjiang Liu, Yaojie Lu, Debing Zhang, Xing Yu

    Abstract: As AI capabilities increasingly surpass human proficiency in complex tasks, current alignment techniques including SFT and RLHF face fundamental challenges in ensuring reliable oversight. These methods rely on direct human assessment and become untenable when AI outputs exceed human cognitive thresholds. In response to this challenge, we explore two hypotheses: (1) \textit{Critique of critique can… ▽ More

    Submitted 30 May, 2025; v1 submitted 7 February, 2025; originally announced February 2025.

  30. arXiv:2502.00847  [pdf, other

    cs.CR cs.AI

    SecPE: Secure Prompt Ensembling for Private and Robust Large Language Models

    Authors: Jiawen Zhang, Kejia Chen, Zunlei Feng, Jian Lou, Mingli Song, Jian Liu, Xiaohu Yang

    Abstract: With the growing popularity of LLMs among the general public users, privacy-preserving and adversarial robustness have become two pressing demands for LLM-based services, which have largely been pursued separately but rarely jointly. In this paper, to the best of our knowledge, we are among the first attempts towards robust and private LLM inference by tightly integrating two disconnected fields:… ▽ More

    Submitted 2 February, 2025; originally announced February 2025.

  31. arXiv:2502.00840  [pdf, ps, other

    cs.CR cs.AI

    Activation Approximations Can Incur Safety Vulnerabilities Even in Aligned LLMs: Comprehensive Analysis and Defense

    Authors: Jiawen Zhang, Kejia Chen, Lipeng He, Jian Lou, Dan Li, Zunlei Feng, Mingli Song, Jian Liu, Kui Ren, Xiaohu Yang

    Abstract: Large Language Models (LLMs) have showcased remarkable capabilities across various domains. Accompanying the evolving capabilities and expanding deployment scenarios of LLMs, their deployment challenges escalate due to their sheer scale and the advanced yet complex activation designs prevalent in notable model series, such as Llama, Gemma, Mistral. These challenges have become particularly pronoun… ▽ More

    Submitted 10 June, 2025; v1 submitted 2 February, 2025; originally announced February 2025.

    Comments: 20 pages

  32. arXiv:2501.10985  [pdf, other

    cs.LG cs.CR

    GRID: Protecting Training Graph from Link Stealing Attacks on GNN Models

    Authors: Jiadong Lou, Xu Yuan, Rui Zhang, Xingliang Yuan, Neil Gong, Nian-Feng Tzeng

    Abstract: Graph neural networks (GNNs) have exhibited superior performance in various classification tasks on graph-structured data. However, they encounter the potential vulnerability from the link stealing attacks, which can infer the presence of a link between two nodes via measuring the similarity of its incident nodes' prediction vectors produced by a GNN model. Such attacks pose severe security and pr… ▽ More

    Submitted 13 May, 2025; v1 submitted 19 January, 2025; originally announced January 2025.

  33. arXiv:2412.14905  [pdf, other

    cs.CL cs.AI

    Dehallucinating Parallel Context Extension for Retrieval-Augmented Generation

    Authors: Zexiong Ma, Shengnan An, Zeqi Lin, Yanzhen Zou, Jian-Guang Lou, Bing Xie

    Abstract: Large language models (LLMs) are susceptible to generating hallucinated information, despite the integration of retrieval-augmented generation (RAG). Parallel context extension (PCE) is a line of research attempting to effectively integrating parallel (unordered) contexts, while it still suffers from hallucinations when adapted to RAG scenarios. In this paper, we propose DePaC (Dehallucinating Par… ▽ More

    Submitted 19 December, 2024; originally announced December 2024.

  34. arXiv:2412.12492  [pdf, other

    cs.CV

    DuSSS: Dual Semantic Similarity-Supervised Vision-Language Model for Semi-Supervised Medical Image Segmentation

    Authors: Qingtao Pan, Wenhao Qiao, Jingjiao Lou, Bing Ji, Shuo Li

    Abstract: Semi-supervised medical image segmentation (SSMIS) uses consistency learning to regularize model training, which alleviates the burden of pixel-wise manual annotations. However, it often suffers from error supervision from low-quality pseudo labels. Vision-Language Model (VLM) has great potential to enhance pseudo labels by introducing text prompt guided multimodal supervision information. It neve… ▽ More

    Submitted 16 December, 2024; originally announced December 2024.

  35. arXiv:2412.10047  [pdf, other

    cs.AI

    Large Action Models: From Inception to Implementation

    Authors: Lu Wang, Fangkai Yang, Chaoyun Zhang, Junting Lu, Jiaxu Qian, Shilin He, Pu Zhao, Bo Qiao, Ray Huang, Si Qin, Qisheng Su, Jiayi Ye, Yudi Zhang, Jian-Guang Lou, Qingwei Lin, Saravan Rajmohan, Dongmei Zhang, Qi Zhang

    Abstract: As AI continues to advance, there is a growing demand for systems that go beyond language-based assistance and move toward intelligent agents capable of performing real-world actions. This evolution requires the transition from traditional Large Language Models (LLMs), which excel at generating textual responses, to Large Action Models (LAMs), designed for action generation and execution within dy… ▽ More

    Submitted 13 January, 2025; v1 submitted 13 December, 2024; originally announced December 2024.

    Comments: 25pages,12 figures

  36. arXiv:2412.07261  [pdf, other

    cs.CR cs.LG

    MemHunter: Automated and Verifiable Memorization Detection at Dataset-scale in LLMs

    Authors: Zhenpeng Wu, Jian Lou, Zibin Zheng, Chuan Chen

    Abstract: Large language models (LLMs) have been shown to memorize and reproduce content from their training data, raising significant privacy concerns, especially with web-scale datasets. Existing methods for detecting memorization are primarily sample-specific, relying on manually crafted or discretely optimized memory-inducing prompts generated on a per-sample basis, which become impractical for dataset-… ▽ More

    Submitted 16 February, 2025; v1 submitted 10 December, 2024; originally announced December 2024.

  37. arXiv:2412.06593  [pdf, ps, other

    cs.CL

    Anchoring Bias in Large Language Models: An Experimental Study

    Authors: Jiaxu Lou, Yifan Sun

    Abstract: Large Language Models (LLMs) like GPT-4 and Gemini have significantly advanced artificial intelligence by enabling machines to generate and comprehend human-like text. Despite their impressive capabilities, LLMs are not immune to limitations, including various biases. While much research has explored demographic biases, the cognitive biases in LLMs have not been equally scrutinized. This study del… ▽ More

    Submitted 18 December, 2024; v1 submitted 9 December, 2024; originally announced December 2024.

  38. arXiv:2411.11504  [pdf, other

    cs.AI cs.CL stat.ML

    Search, Verify and Feedback: Towards Next Generation Post-training Paradigm of Foundation Models via Verifier Engineering

    Authors: Xinyan Guan, Yanjiang Liu, Xinyu Lu, Boxi Cao, Ben He, Xianpei Han, Le Sun, Jie Lou, Bowen Yu, Yaojie Lu, Hongyu Lin

    Abstract: The evolution of machine learning has increasingly prioritized the development of powerful models and more scalable supervision signals. However, the emergence of foundation models presents significant challenges in providing effective supervision signals necessary for further enhancing their capabilities. Consequently, there is an urgent need to explore novel supervision signals and technical app… ▽ More

    Submitted 18 November, 2024; originally announced November 2024.

  39. arXiv:2410.08696  [pdf, other

    cs.CL

    AMPO: Automatic Multi-Branched Prompt Optimization

    Authors: Sheng Yang, Yurong Wu, Yan Gao, Zineng Zhou, Bin Benjamin Zhu, Xiaodi Sun, Jian-Guang Lou, Zhiming Ding, Anbang Hu, Yuan Fang, Yunsong Li, Junyan Chen, Linjun Yang

    Abstract: Prompt engineering is very important to enhance the performance of large language models (LLMs). When dealing with complex issues, prompt engineers tend to distill multiple patterns from examples and inject relevant solutions to optimize the prompts, achieving satisfying results. However, existing automatic prompt optimization techniques are only limited to producing single flow instructions, stru… ▽ More

    Submitted 11 October, 2024; originally announced October 2024.

    Comments: 13 pages, 7 figures, 6 tables

  40. arXiv:2410.08601  [pdf, other

    cs.CL

    StraGo: Harnessing Strategic Guidance for Prompt Optimization

    Authors: Yurong Wu, Yan Gao, Bin Benjamin Zhu, Zineng Zhou, Xiaodi Sun, Sheng Yang, Jian-Guang Lou, Zhiming Ding, Linjun Yang

    Abstract: Prompt engineering is pivotal for harnessing the capabilities of large language models (LLMs) across diverse applications. While existing prompt optimization methods improve prompt effectiveness, they often lead to prompt drifting, where newly generated prompts can adversely impact previously successful cases while addressing failures. Furthermore, these methods tend to rely heavily on LLMs' intri… ▽ More

    Submitted 11 October, 2024; originally announced October 2024.

    Comments: 19 pages, 3 figures, 20 tables

  41. arXiv:2410.06446   

    cs.LG cs.CV

    Machine Unlearning in Forgettability Sequence

    Authors: Junjie Chen, Qian Chen, Jian Lou, Xiaoyu Zhang, Kai Wu, Zilong Wang

    Abstract: Machine unlearning (MU) is becoming a promising paradigm to achieve the "right to be forgotten", where the training trace of any chosen data points could be eliminated, while maintaining the model utility on general testing samples after unlearning. With the advancement of forgetting research, many fundamental open questions remain unanswered: do different samples exhibit varying levels of difficu… ▽ More

    Submitted 21 October, 2024; v1 submitted 8 October, 2024; originally announced October 2024.

    Comments: The senior authors of the draft are not fully convinced that the novelty is significant enough for this submission compared to the latest research progress in this area. Additionally, the senior authors have identified writing issues. Based on these two reasons, we have decided to withdraw the draft from arXiv

  42. arXiv:2410.05584  [pdf, other

    cs.LG cs.AI cs.CL

    Rethinking Reward Model Evaluation: Are We Barking up the Wrong Tree?

    Authors: Xueru Wen, Jie Lou, Yaojie Lu, Hongyu Lin, Xing Yu, Xinyu Lu, Ben He, Xianpei Han, Debing Zhang, Le Sun

    Abstract: Reward Models (RMs) are crucial for aligning language models with human preferences. Currently, the evaluation of RMs depends on measuring accuracy against a validation set of manually annotated preference data. Although this method is straightforward and widely adopted, the relationship between RM accuracy and downstream policy performance remains under-explored. In this work, we conduct experime… ▽ More

    Submitted 13 February, 2025; v1 submitted 7 October, 2024; originally announced October 2024.

    Comments: Accepted at ICLR2025 Spotlight

  43. arXiv:2409.18523  [pdf, other

    cs.LG cs.CV

    Token Caching for Diffusion Transformer Acceleration

    Authors: Jinming Lou, Wenyang Luo, Yufan Liu, Bing Li, Xinmiao Ding, Weiming Hu, Jiajiong Cao, Yuming Li, Chenguang Ma

    Abstract: Diffusion transformers have gained substantial interest in diffusion generative modeling due to their outstanding performance. However, their high computational cost, arising from the quadratic computational complexity of attention mechanisms and multi-step inference, presents a significant bottleneck. To address this challenge, we propose TokenCache, a novel post-training acceleration method that… ▽ More

    Submitted 27 September, 2024; originally announced September 2024.

  44. arXiv:2409.15985  [pdf, other

    cs.AI

    DataGpt-SQL-7B: An Open-Source Language Model for Text-to-SQL

    Authors: Lixia Wu, Peng Li, Junhong Lou, Lei Fu

    Abstract: In addressing the pivotal role of translating natural language queries into SQL commands, we propose a suite of compact, fine-tuned models and self-refine mechanisms to democratize data access and analysis for non-expert users, mitigating risks associated with closed-source Large Language Models. Specifically, we constructed a dataset of over 20K sample for Text-to-SQL as well as the preference da… ▽ More

    Submitted 24 September, 2024; originally announced September 2024.

  45. arXiv:2408.16326  [pdf, ps, other

    cs.CL

    Critic-CoT: Boosting the reasoning abilities of large language model via Chain-of-thoughts Critic

    Authors: Xin Zheng, Jie Lou, Boxi Cao, Xueru Wen, Yuqiu Ji, Hongyu Lin, Yaojie Lu, Xianpei Han, Debing Zhang, Le Sun

    Abstract: Self-critic has become a crucial mechanism for enhancing the reasoning performance of LLMs. However, current approaches mainly involve basic prompts for intuitive instance-level feedback, which resembles System-1 processes and limits the reasoning capabilities. Moreover, there is a lack of in-depth investigations into the relationship between LLM's ability to criticize and its task-solving perform… ▽ More

    Submitted 10 June, 2025; v1 submitted 29 August, 2024; originally announced August 2024.

    Comments: Accepted at ACL 2025 Findings

  46. arXiv:2408.00764  [pdf, other

    cs.CL cs.AI cs.LG

    AgentGen: Enhancing Planning Abilities for Large Language Model based Agent via Environment and Task Generation

    Authors: Mengkang Hu, Pu Zhao, Can Xu, Qingfeng Sun, Jianguang Lou, Qingwei Lin, Ping Luo, Saravan Rajmohan

    Abstract: Large Language Model-based agents have garnered significant attention and are becoming increasingly popular. Furthermore, planning ability is a crucial component of an LLM-based agent, which generally entails achieving a desired goal from an initial state. This paper investigates enhancing the planning abilities of LLMs through instruction tuning, referred to as agent training. Recent studies have… ▽ More

    Submitted 6 February, 2025; v1 submitted 1 August, 2024; originally announced August 2024.

    Comments: Accepted by KDD 2025 (Research Track). Project page: https://agent-gen.github.io/

  47. Towards Robust Vision Transformer via Masked Adaptive Ensemble

    Authors: Fudong Lin, Jiadong Lou, Xu Yuan, Nian-Feng Tzeng

    Abstract: Adversarial training (AT) can help improve the robustness of Vision Transformers (ViT) against adversarial attacks by intentionally injecting adversarial examples into the training data. However, this way of adversarial injection inevitably incurs standard accuracy degradation to some extent, thereby calling for a trade-off between standard accuracy and robustness. Besides, the prominent AT soluti… ▽ More

    Submitted 22 July, 2024; originally announced July 2024.

    Comments: 9 pages

    Journal ref: 2024 ACM International Conference on Information & Knowledge Management (CIKM)

  48. arXiv:2407.11033  [pdf, other

    cs.LG cs.CL

    Hadamard Adapter: An Extreme Parameter-Efficient Adapter Tuning Method for Pre-trained Language Models

    Authors: Yuyan Chen, Qiang Fu, Ge Fan, Lun Du, Jian-Guang Lou, Shi Han, Dongmei Zhang, Zhixu Li, Yanghua Xiao

    Abstract: Recent years, Pre-trained Language models (PLMs) have swept into various fields of artificial intelligence and achieved great success. However, most PLMs, such as T5 and GPT3, have a huge amount of parameters, fine-tuning them is often expensive and time consuming, and storing them takes up a lot of space. Therefore, it is necessary to adopt a parameter-efficient approach to reduce parameters of P… ▽ More

    Submitted 4 July, 2024; originally announced July 2024.

    Comments: Accepted to CIKM 2023 (Long Paper)

  49. arXiv:2407.10627  [pdf, other

    cs.CL cs.AI cs.LG

    Arena Learning: Build Data Flywheel for LLMs Post-training via Simulated Chatbot Arena

    Authors: Haipeng Luo, Qingfeng Sun, Can Xu, Pu Zhao, Qingwei Lin, Jianguang Lou, Shifeng Chen, Yansong Tang, Weizhu Chen

    Abstract: Assessing the effectiveness of large language models (LLMs) presents substantial challenges. The method of conducting human-annotated battles in an online Chatbot Arena is a highly effective evaluative technique. However, this approach is limited by the costs and time required for human annotation. In this paper, we introduce Arena Learning, an innovative offline strategy designed to simulate thes… ▽ More

    Submitted 15 July, 2024; originally announced July 2024.

  50. arXiv:2407.06915  [pdf, ps, other

    cs.RO

    FE-GUT: Factor Graph Optimization hybrid with Extended Kalman Filter for tightly coupled GNSS/UWB Integration

    Authors: Qijia Zhao, Shaolin Lü, Jianan Lou, Rong Zhang

    Abstract: Precise positioning and navigation information has been increasingly important with the development of the consumer electronics market. Due to some deficits of Global Navigation Satellite System (GNSS), such as susceptible to interferences, integrating of GNSS with additional alternative sensors is a promising approach to overcome the performance limitations of GNSS-based localization systems. Ult… ▽ More

    Submitted 9 July, 2024; originally announced July 2024.