
Showing 1–50 of 3,575 results for author: Chen, X

Searching in archive cs.
  1. arXiv:2405.04955  [pdf, other]

    cs.CL cs.AI

    Improving Long Text Understanding with Knowledge Distilled from Summarization Model

    Authors: Yan Liu, Yazheng Yang, Xiaokang Chen

    Abstract: Long text understanding is important yet challenging for natural language processing. A long article or document usually contains many redundant words that are not pertinent to its gist and sometimes can be regarded as noise. With recent advances of abstractive summarization, we propose our \emph{Gist Detector} to leverage the gist detection ability of a summarization model and integrate the extra…

    Submitted 8 May, 2024; originally announced May 2024.

    Comments: arXiv admin note: text overlap with arXiv:2110.04741

  2. arXiv:2405.04757  [pdf, other]

    eess.SY cs.GT

    Communication-efficient and Differentially-private Distributed Nash Equilibrium Seeking with Linear Convergence

    Authors: Xiaomeng Chen, Wei Huo, Kemi Ding, Subhrakanti Dey, Ling Shi

    Abstract: The distributed computation of a Nash equilibrium (NE) for non-cooperative games is gaining increased attention recently. Due to the nature of distributed systems, privacy and communication efficiency are two critical concerns. Traditional approaches often address these critical concerns in isolation. This work introduces a unified framework, named CDP-NES, designed to improve communication effici…

    Submitted 7 May, 2024; originally announced May 2024.

  3. arXiv:2405.04687  [pdf]

    cs.HC cs.AI

    Towards Human-AI Mutual Learning: A New Research Paradigm

    Authors: Xiaomei Wang, Xiaoyu Chen

    Abstract: This paper describes a new research paradigm for studying human-AI collaboration, named "human-AI mutual learning", defined as the process where humans and AI agents preserve, exchange, and improve knowledge during human-AI collaboration. We describe relevant methodologies, motivations, domain examples, benefits, challenges, and future research agenda under this paradigm.

    Submitted 7 May, 2024; originally announced May 2024.

  4. arXiv:2405.04370  [pdf, other]

    cs.CV

    Diff-IP2D: Diffusion-Based Hand-Object Interaction Prediction on Egocentric Videos

    Authors: Junyi Ma, Jingyi Xu, Xieyuanli Chen, Hesheng Wang

    Abstract: Understanding how humans would behave during hand-object interaction is vital for applications in service robot manipulation and extended reality. To achieve this, some recent works have been proposed to simultaneously predict hand trajectories and object affordances on human egocentric videos. They are regarded as the representation of future hand-object interactions, indicating potential human m…

    Submitted 7 May, 2024; originally announced May 2024.

  5. arXiv:2405.04305  [pdf, other]

    cs.CV cs.AI

    A New Dataset and Comparative Study for Aphid Cluster Detection and Segmentation in Sorghum Fields

    Authors: Raiyan Rahman, Christopher Indris, Goetz Bramesfeld, Tianxiao Zhang, Kaidong Li, Xiangyu Chen, Ivan Grijalva, Brian McCornack, Daniel Flippo, Ajay Sharda, Guanghui Wang

    Abstract: Aphid infestations are one of the primary causes of extensive damage to wheat and sorghum fields and are one of the most common vectors for plant viruses, resulting in significant agricultural yield losses. To address this problem, farmers often employ the inefficient use of harmful chemical pesticides that have negative health and environmental impacts. As a result, a large amount of pesticide is…

    Submitted 7 May, 2024; originally announced May 2024.

  6. arXiv:2405.03990  [pdf, other]

    cs.NI cs.AI

    TrimCaching: Parameter-sharing AI Model Caching in Wireless Edge Networks

    Authors: Guanqiao Qu, Zheng Lin, Fangming Liu, Xianhao Chen, Kaibin Huang

    Abstract: Next-generation mobile networks are expected to facilitate fast AI model downloading to end users. By caching models on edge servers, mobile networks can deliver models to end users with low latency, resulting in a paradigm called edge model caching. In this paper, we develop a novel model placement scheme, called parameter-sharing model caching (TrimCaching). TrimCaching exploits the key observat…

    Submitted 7 May, 2024; originally announced May 2024.

    Comments: 11 pages, 7 figures. This paper has been accepted by ICDCS 2024. arXiv admin note: substantial text overlap with arXiv:2404.14204

  7. arXiv:2405.03718  [pdf, other]

    cs.LG cs.AI cs.GT cs.MA

    A Single Online Agent Can Efficiently Learn Mean Field Games

    Authors: Chenyu Zhang, Xu Chen, Xuan Di

    Abstract: Mean field games (MFGs) are a promising framework for modeling the behavior of large-population systems. However, solving MFGs can be challenging due to the coupling of forward population evolution and backward agent dynamics. Typically, obtaining mean field Nash equilibria (MFNE) involves an iterative approach where the forward and backward processes are solved alternately, known as fixed-point i…

    Submitted 5 May, 2024; originally announced May 2024.

  8. arXiv:2405.03638  [pdf, other]

    cs.IT

    Cosine Annealing Optimized Denoising Diffusion Error Correction Codes

    Authors: Congyang Ou, Xiaojing Chen, Wan Jiang

    Abstract: To address the issue of increased bit error rates during the later stages of linear search in denoising diffusion error correction codes, we propose a novel method that optimizes denoising diffusion error correction codes (ECC) using cosine annealing. In response to the challenge of decoding long codewords, the proposed method employs a variance adjustment strategy during the reverse diffusion pro…

    Submitted 6 May, 2024; originally announced May 2024.

  9. arXiv:2405.03488  [pdf, other]

    cs.PF cs.DS

    Accurate and Fast Approximate Graph Pattern Mining at Scale

    Authors: Anna Arpaci-Dusseau, Zixiang Zhou, Xuhao Chen

    Abstract: Approximate graph pattern mining (A-GPM) is an important data analysis tool for many graph-based applications. There exist sampling-based A-GPM systems to provide automation and generalization over a wide variety of use cases. However, there are two major obstacles that prevent existing A-GPM systems being adopted in practice. First, the termination mechanism that decides when to end sampling lack…

    Submitted 6 May, 2024; originally announced May 2024.

    Comments: 15 pages, 12 figures

  10. arXiv:2405.03327  [pdf, other]

    cs.LG

    Clustering of Disease Trajectories with Explainable Machine Learning: A Case Study on Postoperative Delirium Phenotypes

    Authors: Xiaochen Zheng, Manuel Schürch, Xingyu Chen, Maria Angeliki Komninou, Reto Schüpbach, Ahmed Allam, Jan Bartussek, Michael Krauthammer

    Abstract: The identification of phenotypes within complex diseases or syndromes is a fundamental component of precision medicine, which aims to adapt healthcare to individual patient characteristics. Postoperative delirium (POD) is a complex neuropsychiatric condition with significant heterogeneity in its clinical manifestations and underlying pathophysiology. We hypothesize that POD comprises several disti…

    Submitted 6 May, 2024; originally announced May 2024.

  11. arXiv:2405.03256  [pdf, other]

    cs.SE

    MARE: Multi-Agents Collaboration Framework for Requirements Engineering

    Authors: Dongming Jin, Zhi Jin, Xiaohong Chen, Chunhui Wang

    Abstract: Requirements Engineering (RE) is a critical phase in the software development process that generates requirements specifications from stakeholders' needs. Recently, deep learning techniques have been successful in several RE tasks. However, obtaining high-quality requirements specifications requires collaboration across multiple tasks and roles. In this paper, we propose an innovative framework ca…

    Submitted 6 May, 2024; originally announced May 2024.

  12. arXiv:2405.03196  [pdf, ps, other]

    cs.IT

    Design and Analysis of Massive Uncoupled Unsourced Random Access with Bayesian Joint Decoding

    Authors: Feiyan Tian, Xiaoming Chen, Yong Liang Guan, Chau Yuen

    Abstract: In this paper, we investigate unsourced random access for massive machine-type communications (mMTC) in the sixth-generation (6G) wireless networks. Firstly, we establish a high-efficiency uncoupled framework for massive unsourced random access without extra parity check bits. Then, we design a low-complexity Bayesian joint decoding algorithm, including codeword detection and stitching. In particu…

    Submitted 6 May, 2024; originally announced May 2024.

  13. arXiv:2405.03192  [pdf, other]

    cs.LG cs.AI

    QuadraNet V2: Efficient and Sustainable Training of High-Order Neural Networks with Quadratic Adaptation

    Authors: Chenhui Xu, Xinyao Wang, Fuxun Yu, Jinjun Xiong, Xiang Chen

    Abstract: Machine learning is evolving towards high-order models that necessitate pre-training on extensive datasets, a process associated with significant overheads. Traditional models, despite having pre-trained weights, are becoming obsolete due to architectural differences that obstruct the effective transfer and initialization of these weights. To address these challenges, we introduce a novel framewor…

    Submitted 8 May, 2024; v1 submitted 6 May, 2024; originally announced May 2024.

  14. arXiv:2405.03191  [pdf, ps, other]

    cs.IT

    Exploiting Matrix Information Geometry for Integrated Decoding of Massive Uncoupled Unsourced Random Access

    Authors: Feiyan Tian, Xiaoming Chen, Chongwen Huang, Zhaoyang Zhang

    Abstract: In this paper, we explore an efficient uncoupled unsourced random access (UURA) scheme for 6G massive communication. UURA is a typical framework of unsourced random access that addresses the problems of codeword detection and message stitching, without the use of check bits. Firstly, we establish a framework for UURA, allowing for immediate decoding of sub-messages upon arrival. Thus, the processi…

    Submitted 6 May, 2024; originally announced May 2024.

  15. arXiv:2405.03140  [pdf, other]

    cs.LG

    TimeMIL: Advancing Multivariate Time Series Classification via a Time-aware Multiple Instance Learning

    Authors: Xiwen Chen, Peijie Qiu, Wenhui Zhu, Huayu Li, Hao Wang, Aristeidis Sotiras, Yalin Wang, Abolfazl Razi

    Abstract: Deep neural networks, including transformers and convolutional neural networks, have significantly improved multivariate time series classification (MTSC). However, these methods often rely on supervised learning, which does not fully account for the sparsity and locality of patterns in time series data (e.g., diseases-related anomalous points in ECG). To address this challenge, we formally reform…

    Submitted 5 May, 2024; originally announced May 2024.

    Comments: Accepted by ICML2024

  16. arXiv:2405.03136  [pdf, other]

    cs.CR

    FOBNN: Fast Oblivious Binarized Neural Network Inference

    Authors: Xin Chen, Zhili Chen, Benchang Dong, Shiwen Wei, Lin Chen, Daojing He

    Abstract: The superior performance of deep learning has propelled the rise of Deep Learning as a Service, enabling users to transmit their private data to service providers for model execution and inference retrieval. Nevertheless, the primary concern remains safeguarding the confidentiality of sensitive user data while optimizing the efficiency of secure protocols. To address this, we develop a fast oblivi…

    Submitted 5 May, 2024; originally announced May 2024.

  17. arXiv:2405.03121  [pdf, other]

    cs.CV cs.AI

    AniTalker: Animate Vivid and Diverse Talking Faces through Identity-Decoupled Facial Motion Encoding

    Authors: Tao Liu, Feilong Chen, Shuai Fan, Chenpeng Du, Qi Chen, Xie Chen, Kai Yu

    Abstract: The paper introduces AniTalker, an innovative framework designed to generate lifelike talking faces from a single portrait. Unlike existing models that primarily focus on verbal cues such as lip synchronization and fail to capture the complex dynamics of facial expressions and nonverbal cues, AniTalker employs a universal motion representation. This innovative representation effectively captures a…

    Submitted 5 May, 2024; originally announced May 2024.

    Comments: 14 pages, 7 figures

  18. arXiv:2405.03106  [pdf, other]

    eess.SY cs.GT

    Compression-based Privacy Preservation for Distributed Nash Equilibrium Seeking in Aggregative Games

    Authors: Wei Huo, Xiaomeng Chen, Kemi Ding, Subhrakanti Dey, Ling Shi

    Abstract: This paper explores distributed aggregative games in multi-agent systems. Current methods for finding distributed Nash equilibrium require players to send original messages to their neighbors, leading to communication burden and privacy issues. To jointly address these issues, we propose an algorithm that uses stochastic compression to save communication resources and conceal information through r…

    Submitted 5 May, 2024; originally announced May 2024.

  19. arXiv:2405.02944  [pdf, other]

    cs.CV

    Imaging Signal Recovery Using Neural Network Priors Under Uncertain Forward Model Parameters

    Authors: Xiwen Chen, Wenhui Zhu, Peijie Qiu, Abolfazl Razi

    Abstract: Inverse imaging problems (IIPs) arise in various applications, with the main objective of reconstructing an image from its compressed measurements. This problem is often ill-posed for being under-determined with multiple interchangeably consistent solutions. The best solution inherently depends on prior knowledge or assumptions, such as the sparsity of the image. Furthermore, the reconstruction pr…

    Submitted 5 May, 2024; originally announced May 2024.

    Comments: Accepted by PBDL-CVPR 2024

  20. arXiv:2405.02924  [pdf, other]

    cs.IT

    Optimal Sampling for Uncertainty-of-Information Minimization in a Remote Monitoring System

    Authors: Xiaomeng Chen, Aimin Li, Shaohua Wu

    Abstract: In this paper, we study a remote monitoring system where a receiver observes a remote binary Markov source and decides whether to sample and fetch the source's state over a randomly delayed channel. Due to transmission delay, the observation of the source is imperfect, resulting in the uncertainty of the source's state at the receiver. We thus use uncertainty of information as the metric to charac…

    Submitted 5 May, 2024; originally announced May 2024.

  21. arXiv:2405.02818  [pdf, other]

    cs.IT

    Site-Specific Deployment Optimization of Intelligent Reflecting Surface for Coverage Enhancement

    Authors: Dongsheng Fu, Xintong Chen, Jiangbin Lyu, Liqun Fu

    Abstract: Intelligent Reflecting Surface (IRS) is a promising technology for next generation wireless networks. Despite substantial research in IRS-aided communications, the assumed antenna and channel models are typically simplified without considering site-specific characteristics, which in turn critically affect the IRS deployment and performance in a given environment. In this paper, we first investigat…

    Submitted 5 May, 2024; originally announced May 2024.

    Comments: 7 pages, 7 figures. To appear in VTC2024-Spring

  22. arXiv:2405.02801  [pdf, other]

    cs.SD cs.AI eess.AS

    Mozart's Touch: A Lightweight Multi-modal Music Generation Framework Based on Pre-Trained Large Models

    Authors: Tianze Xu, Jiajun Li, Xuesong Chen, Xinrui Yao, Shuchang Liu

    Abstract: In recent years, AI-Generated Content (AIGC) has witnessed rapid advancements, facilitating the generation of music, images, and other forms of artistic expression across various industries. However, research on general multi-modal music generation models remains scarce. To fill this gap, we propose a multi-modal music generation framework, Mozart's Touch. It could generate aligned music with the c…

    Submitted 7 May, 2024; v1 submitted 4 May, 2024; originally announced May 2024.

    Comments: 7 pages, 2 figures, submitted to ACM MM 2024

  23. arXiv:2405.02604  [pdf, ps, other]

    cs.IT eess.SP

    Interleave Frequency Division Multiplexing

    Authors: Yuhao Chi, Lei Liu, Yao Ge, Xuehui Chen, Ying Li, Zhaoyang Zhang

    Abstract: In this letter, we study interleave frequency division multiplexing (IFDM) for multicarrier modulation in static multipath and mobile time-varying channels, which outperforms orthogonal frequency division multiplexing (OFDM), orthogonal time frequency space (OTFS), and affine frequency division multiplexing (AFDM) by considering practical advanced detectors. The fundamental principle underlying ex…

    Submitted 4 May, 2024; originally announced May 2024.

    Comments: Accepted by IEEE Wireless Communications Letters

  24. Rip-NeRF: Anti-aliasing Radiance Fields with Ripmap-Encoded Platonic Solids

    Authors: Junchen Liu, Wenbo Hu, Zhuo Yang, Jianteng Chen, Guoliang Wang, Xiaoxue Chen, Yantong Cai, Huan-ang Gao, Hao Zhao

    Abstract: Despite significant advancements in Neural Radiance Fields (NeRFs), the renderings may still suffer from aliasing and blurring artifacts, since it remains a fundamental challenge to effectively and efficiently characterize anisotropic areas induced by the cone-casting procedure. This paper introduces a Ripmap-Encoded Platonic Solid representation to precisely and efficiently featurize 3D anisotrop…

    Submitted 3 May, 2024; originally announced May 2024.

    Comments: SIGGRAPH 2024, Project page: https://junchenliu77.github.io/Rip-NeRF , Code: https://github.com/JunchenLiu77/Rip-NeRF

  25. arXiv:2405.02301  [pdf, other]

    cs.CV

    TFCounter: Polishing Gems for Training-Free Object Counting

    Authors: Pan Ting, Jianfeng Lin, Wenhao Yu, Wenlong Zhang, Xiaoying Chen, Jinlu Zhang, Binqiang Huang

    Abstract: Object counting is a challenging task with broad application prospects in security surveillance, traffic management, and disease diagnosis. Existing object counting methods face a tri-fold challenge: achieving superior performance, maintaining high generalizability, and minimizing annotation costs. We develop a novel training-free class-agnostic object counter, TFCounter, which is prompt-context-a…

    Submitted 12 March, 2024; originally announced May 2024.

    Comments: 14 pages, 11 figures

    MSC Class: 68

  26. arXiv:2405.01668  [pdf, other]

    cs.CR cs.SE

    WitheredLeaf: Finding Entity-Inconsistency Bugs with LLMs

    Authors: Hongbo Chen, Yifan Zhang, Xing Han, Huanyao Rong, Yuheng Zhang, Tianhao Mao, Hang Zhang, XiaoFeng Wang, Luyi Xing, Xun Chen

    Abstract: Originating from semantic bugs, Entity-Inconsistency Bugs (EIBs) involve misuse of syntactically valid yet incorrect program entities, such as variable identifiers and function names, which often have security implications. Unlike straightforward syntactic vulnerabilities, EIBs are subtle and can remain undetected for years. Traditional detection methods, such as static analysis and dynamic testin…

    Submitted 2 May, 2024; originally announced May 2024.

  27. arXiv:2405.01525  [pdf, other]

    cs.CL cs.AI

    FLAME: Factuality-Aware Alignment for Large Language Models

    Authors: Sheng-Chieh Lin, Luyu Gao, Barlas Oguz, Wenhan Xiong, Jimmy Lin, Wen-tau Yih, Xilun Chen

    Abstract: Alignment is a standard procedure to fine-tune pre-trained large language models (LLMs) to follow natural language instructions and serve as helpful AI assistants. We have observed, however, that the conventional alignment process fails to enhance the factual accuracy of LLMs, and often leads to the generation of more false facts (i.e. hallucination). In this paper, we study how to make the LLM al…

    Submitted 2 May, 2024; originally announced May 2024.

  28. arXiv:2405.01251  [pdf, other]

    cs.LG stat.ML

    Revisiting semi-supervised training objectives for differentiable particle filters

    Authors: Jiaxi Li, John-Joseph Brady, Xiongjie Chen, Yunpeng Li

    Abstract: Differentiable particle filters combine the flexibility of neural networks with the probabilistic nature of sequential Monte Carlo methods. However, traditional approaches rely on the availability of labelled data, i.e., the ground truth latent state information, which is often difficult to obtain in real-world applications. This paper compares the effectiveness of two semi-supervised training obj…

    Submitted 2 May, 2024; originally announced May 2024.

    Comments: 5 pages, 2 figures

    MSC Class: 65C05; 62M20; 62M45; 62M05

  29. arXiv:2405.00697  [pdf, other]

    q-fin.CP cs.LG q-fin.PR stat.AP

    Pricing Catastrophe Bonds -- A Probabilistic Machine Learning Approach

    Authors: Xiaowei Chen, Hong Li, Yufan Lu, Rui Zhou

    Abstract: This paper proposes a probabilistic machine learning method to price catastrophe (CAT) bonds in the primary market. The proposed method combines machine-learning-based predictive models with Conformal Prediction, an innovative algorithm that generates distribution-free probabilistic forecasts for CAT bond prices. Using primary market CAT bond transaction records between January 1999 and March 2021…

    Submitted 10 April, 2024; originally announced May 2024.

  30. arXiv:2405.00676  [pdf, other]

    cs.CV

    Spectrally Pruned Gaussian Fields with Neural Compensation

    Authors: Runyi Yang, Zhenxin Zhu, Zhou Jiang, Baijun Ye, Xiaoxue Chen, Yifei Zhang, Yuantao Chen, Jian Zhao, Hao Zhao

    Abstract: Recently, 3D Gaussian Splatting, as a novel 3D representation, has garnered attention for its fast rendering speed and high rendering quality. However, this comes with high memory consumption, e.g., a well-trained Gaussian field may utilize three million Gaussian primitives and over 700 MB of memory. We credit this high memory footprint to the lack of consideration for the relationship between pri…

    Submitted 1 May, 2024; originally announced May 2024.

    Comments: Code: https://github.com/RunyiYang/SUNDAE Project page: https://runyiyang.github.io/projects/SUNDAE/

  31. arXiv:2405.00280  [pdf, other]

    cs.SI cs.CY cs.IR

    Global News Synchrony and Diversity During the Start of the COVID-19 Pandemic

    Authors: Xi Chen, Scott A. Hale, David Jurgens, Mattia Samory, Ethan Zuckerman, Przemyslaw A. Grabowicz

    Abstract: News coverage profoundly affects how countries and individuals behave in international relations. Yet, we have little empirical evidence of how news coverage varies across countries. To enable studies of global news coverage, we develop an efficient computational methodology that comprises three components: (i) a transformer model to estimate multilingual news similarity; (ii) a global event ident…

    Submitted 30 April, 2024; originally announced May 2024.

  32. arXiv:2405.00236  [pdf, other]

    cs.RO cs.AI cs.CV cs.LG

    STT: Stateful Tracking with Transformers for Autonomous Driving

    Authors: Longlong Jing, Ruichi Yu, Xu Chen, Zhengli Zhao, Shiwei Sheng, Colin Graber, Qi Chen, Qinru Li, Shangxuan Wu, Han Deng, Sangjin Lee, Chris Sweeney, Qiurui He, Wei-Chih Hung, Tong He, Xingyi Zhou, Farshid Moussavi, Zijian Guo, Yin Zhou, Mingxing Tan, Weilong Yang, Congcong Li

    Abstract: Tracking objects in three-dimensional space is critical for autonomous driving. To ensure safety while driving, the tracker must be able to reliably track objects across frames and accurately estimate their states such as velocity and acceleration in the present. Existing works frequently focus on the association task while either neglecting the model performance on state estimation or deploying c…

    Submitted 30 April, 2024; originally announced May 2024.

    Comments: ICRA 2024

  33. arXiv:2404.19723  [pdf, other]

    eess.AS cs.SD

    Attention-Constrained Inference for Robust Decoder-Only Text-to-Speech

    Authors: Hankun Wang, Chenpeng Du, Yiwei Guo, Shuai Wang, Xie Chen, Kai Yu

    Abstract: Recent popular decoder-only text-to-speech models are known for their ability of generating natural-sounding speech. However, such models sometimes suffer from word skipping and repeating due to the lack of explicit monotonic alignment constraints. In this paper, we notice from the attention maps that some particular attention heads of the decoder-only model indicate the alignments between speech…

    Submitted 30 April, 2024; originally announced April 2024.

  34. arXiv:2404.19596  [pdf, other]

    cs.IR cs.LG

    Debiased Collaborative Filtering with Kernel-Based Causal Balancing

    Authors: Haoxuan Li, Chunyuan Zheng, Yanghao Xiao, Peng Wu, Zhi Geng, Xu Chen, Peng Cui

    Abstract: Debiased collaborative filtering aims to learn an unbiased prediction model by removing different biases in observational datasets. To solve this problem, one of the simple and effective methods is based on the propensity score, which adjusts the observational sample distribution to the target one by reweighting observed instances. Ideally, propensity scores should be learned with causal balancing…

    Submitted 30 April, 2024; originally announced April 2024.

    Comments: ICLR 24 Spotlight

  35. arXiv:2404.19448  [pdf, other]

    cs.RO cs.AR

    Sensorized Soft Skin for Dexterous Robotic Hands

    Authors: Jana Egli, Benedek Forrai, Thomas Buchner, Jiangtao Su, Xiaodong Chen, Robert K. Katzschmann

    Abstract: Conventional industrial robots often use two-fingered grippers or suction cups to manipulate objects or interact with the world. Because of their simplified design, they are unable to reproduce the dexterity of human hands when manipulating a wide range of objects. While the control of humanoid hands evolved greatly, hardware platforms still lack capabilities, particularly in tactile sensing and p…

    Submitted 30 April, 2024; originally announced April 2024.

    Comments: 6 pages, 9 figures ICRA 2024

  36. arXiv:2404.19307  [pdf, other]

    cs.SE cs.CR

    Enhancing GUI Exploration Coverage of Android Apps with Deep Link-Integrated Monkey

    Authors: Han Hu, Han Wang, Ruiqi Dong, Xiao Chen, Chunyang Chen

    Abstract: Mobile apps are ubiquitous in our daily lives for supporting different tasks such as reading and chatting. Despite the availability of many GUI testing tools, app testers still struggle with low testing code coverage due to tools frequently getting stuck in loops or overlooking activities with concealed entries. This results in a significant amount of testing time being spent on redundant and repe…

    Submitted 30 April, 2024; originally announced April 2024.

  37. arXiv:2404.19192  [pdf, other]

    cs.CL cs.AI

    Mix of Experts Language Model for Named Entity Recognition

    Authors: Xinwei Chen, Kun Li, Tianyou Song, Jiangjian Guo

    Abstract: Named Entity Recognition (NER) is an essential steppingstone in the field of natural language processing. Although promising performance has been achieved by various distantly supervised models, we argue that distant supervision inevitably introduces incomplete and noisy annotations, which may mislead the model training process. To address this issue, we propose a robust NER model named BOND-MoE b…

    Submitted 29 April, 2024; originally announced April 2024.

  38. arXiv:2404.19040  [pdf, other]

    cs.CV

    GSTalker: Real-time Audio-Driven Talking Face Generation via Deformable Gaussian Splatting

    Authors: Bo Chen, Shoukang Hu, Qi Chen, Chenpeng Du, Ran Yi, Yanmin Qian, Xie Chen

    Abstract: We present GSTalker, a 3D audio-driven talking face generation model with Gaussian Splatting for both fast training (40 minutes) and real-time rendering (125 FPS) with a 3$\sim$5 minute video for training material, in comparison with previous 2D and 3D NeRF-based modeling frameworks which require hours of training and seconds of rendering per frame. Specifically, GSTalker learns an audio-driven Ga…

    Submitted 29 April, 2024; originally announced April 2024.

  39. arXiv:2404.18612  [pdf]

    cs.RO

    Enhancing Prosthetic Safety and Environmental Adaptability: A Visual-Inertial Prosthesis Motion Estimation Approach on Uneven Terrains

    Authors: Chuheng Chen, Xinxing Chen, Shucong Yin, Yuxuan Wang, Binxin Huang, Yuquan Leng, Chenglong Fu

    Abstract: Environment awareness is crucial for enhancing walking safety and stability of amputee wearing powered prosthesis when crossing uneven terrains such as stairs and obstacles. However, existing environmental perception systems for prosthesis only provide terrain types and corresponding parameters, which fails to prevent potential collisions when crossing uneven terrains and may lead to falls and oth…

    Submitted 29 April, 2024; originally announced April 2024.

  40. arXiv:2404.18560  [pdf, other]

    math.OC cs.RO

    Non-convex Pose Graph Optimization in SLAM via Proximal Linearized Riemannian ADMM

    Authors: Xin Chen, Chunfeng Cui, Deren Han, Liqun Qi

    Abstract: Pose graph optimization (PGO) is a well-known technique for solving the pose-based simultaneous localization and mapping (SLAM) problem. In this paper, we represent the rotation and translation by a unit quaternion and a three-dimensional vector, and propose a new PGO model based on the von Mises-Fisher distribution. The constraints derived from the unit quaternions are spherical manifolds, and th…

    Submitted 29 April, 2024; originally announced April 2024.

  41. arXiv:2404.18440  [pdf, other]

    physics.ao-ph astro-ph.EP cs.LG physics.comp-ph

    Potential Paradigm Shift in Hazard Risk Management: AI-Based Weather Forecast for Tropical Cyclone Hazards

    Authors: Kairui Feng, Dazhi Xi, Wei Ma, Cao Wang, Yuanlong Li, Xuanhong Chen

    Abstract: The advent of Artificial Intelligence (AI)-driven models marks a paradigm shift in risk management strategies for meteorological hazards. This study specifically employs tropical cyclones (TCs) as a focal example. We engineer a perturbation-based method to produce ensemble forecasts using the advanced Pangu AI weather model. Unlike traditional approaches that often generate fewer than 20 scenario…

    Submitted 29 April, 2024; originally announced April 2024.

  42. arXiv:2404.18192  [pdf, other]

    cs.RO

    Block-Map-Based Localization in Large-Scale Environment

    Authors: Yixiao Feng, Zhou Jiang, Yongliang Shi, Yunlong Feng, Xiangyu Chen, Hao Zhao, Guyue Zhou

    Abstract: Accurate localization is an essential technology for the flexible navigation of robots in large-scale environments. Both SLAM-based and map-based localization will increase the computing load due to the increase in map size, which will affect downstream tasks such as robot navigation and services. To this end, we propose a localization system based on Block Maps (BMs) to reduce the computational l…

    Submitted 28 April, 2024; originally announced April 2024.

    Comments: 7 pages, 4 figures, 4 tables, published to ICRA 2024

  43. arXiv:2404.18096  [pdf, other]

    eess.IV cs.CV

    Snake with Shifted Window: Learning to Adapt Vessel Pattern for OCTA Segmentation

    Authors: Xinrun Chen, Mei Shen, Haojian Ning, Mengzhan Zhang, Chengliang Wang, Shiying Li

    Abstract: Segmenting specific targets or structures in optical coherence tomography angiography (OCTA) images is fundamental for conducting further pathological studies. The retinal vascular layers are rich and intricate, and such vascular with complex shapes can be captured by the widely-studied OCTA images. In this paper, we thus study how to use OCTA images with projection vascular layers to segment reti…

    Submitted 28 April, 2024; originally announced April 2024.

  44. arXiv:2404.18047  [pdf, other]

    cs.RO

    LIKO: LiDAR, Inertial, and Kinematic Odometry for Bipedal Robots

    Authors: Qingrui Zhao, Mingyuan Li, Yongliang Shi, Xuechao Chen, Zhangguo Yu, Lianqiang Han, Zhenyuan Fu, Jintao Zhang, Chao Li, Yuanxi Zhang, Qiang Huang

    Abstract: High-frequency and accurate state estimation is crucial for biped robots. This paper presents a tightly-coupled LiDAR-Inertial-Kinematic Odometry (LIKO) for biped robot state estimation based on an iterated extended Kalman filter. Beyond state estimation, the foot contact position is also modeled and estimated. This allows for both position and velocity updates from kinematic measurement. Addition…

    Submitted 27 April, 2024; originally announced April 2024.

  45. arXiv:2404.18017  [pdf]

    q-fin.PM cs.LG q-fin.CP

    Application of Deep Learning for Factor Timing in Asset Management

    Authors: Prabhu Prasad Panda, Maysam Khodayari Gharanchaei, Xilin Chen, Haoshu Lyu

    Abstract: The paper examines the performance of regression models (OLS linear regression, Ridge regression, Random Forest, and Fully-connected Neural Network) on the prediction of CMA (Conservative Minus Aggressive) factor premium and the performance of factor timing investment with them. Out-of-sample R-squared shows that more flexible models have better performance in explaining the variance in factor pre…

    Submitted 27 April, 2024; originally announced April 2024.

  46. arXiv:2404.17916  [pdf, other]

    cs.LG cs.AI

    FedCRL: Personalized Federated Learning with Contrastive Shared Representations for Label Heterogeneity in Non-IID Data

    Authors: Chenghao Huang, Xiaolu Chen, Yanru Zhang, Hao Wang

    Abstract: To deal with heterogeneity resulting from label distribution skew and data scarcity in distributed machine learning scenarios, this paper proposes a novel Personalized Federated Learning (PFL) algorithm, named Federated Contrastive Representation Learning (FedCRL). FedCRL introduces contrastive representation learning (CRL) on shared representations to facilitate knowledge acquisition of clients.…

    Submitted 27 April, 2024; originally announced April 2024.

  47. arXiv:2404.17785  [pdf, other]

    cs.CL

    Temporal Scaling Law for Large Language Models

    Authors: Yizhe Xiong, Xiansheng Chen, Xin Ye, Hui Chen, Zijia Lin, Haoran Lian, Jianwei Niu, Guiguang Ding

    Abstract: Recently, Large Language Models (LLMs) have been widely adopted across a wide range of tasks, leading to increasing attention towards research on how scaling LLMs affects their performance. Existing works, termed Scaling Laws, have discovered that the loss of LLMs scales as power laws with model size, computational budget, and dataset size. However, the performance of LLMs throughout the training pro…

    Submitted 27 April, 2024; originally announced April 2024.

    Comments: Work in progress

  48. arXiv:2404.17766  [pdf, other]

    cs.LG cs.AI cs.DC cs.NI

    Implementation of Big AI Models for Wireless Networks with Collaborative Edge Computing

    Authors: Liekang Zeng, Shengyuan Ye, Xu Chen, Yang Yang

    Abstract: Big Artificial Intelligence (AI) models have emerged as a crucial element in various intelligent applications at the edge, such as voice assistants in smart homes and autonomous robotics in smart factories. Training big AI models, e.g., for personalized fine-tuning and continual model refinement, poses significant challenges to edge devices due to the inherent conflict between limited computing re…

    Submitted 26 April, 2024; originally announced April 2024.

  49. arXiv:2404.17513  [pdf, other]

    cs.CL cs.AI

    A Comprehensive Evaluation on Event Reasoning of Large Language Models

    Authors: Zhengwei Tao, Zhi Jin, Yifan Zhang, Xiancai Chen, Xiaoying Bai, Yue Fang, Haiyan Zhao, Jia Li, Chongyang Tao

    Abstract: Event reasoning is a fundamental ability that underlies many applications. It requires event schema knowledge to perform global reasoning and needs to deal with the diversity of the inter-event relations and the reasoning paradigms. How well LLMs accomplish event reasoning on various relations and reasoning paradigms remains unknown. To mitigate this disparity, we comprehensively evaluate the abil…

    Submitted 26 April, 2024; originally announced April 2024.

  50. Beyond Imitation: A Life-long Policy Learning Framework for Path Tracking Control of Autonomous Driving

    Authors: C. Gong, C. Lu, Z. Li, Z. Liu, J. Gong, X. Chen

    Abstract: Model-free learning-based control methods have recently shown significant advantages over traditional control methods in avoiding complex vehicle characteristic estimation and parameter tuning. As a primary policy learning method, imitation learning (IL) is capable of learning control policies directly from expert demonstrations. However, the performance of IL policies is highly dependent on the d…

    Submitted 26 April, 2024; originally announced April 2024.

    Journal ref: IEEE Transactions on Vehicular Technology 2024 Pages 1-14