Skip to main content

Showing 1–50 of 5,199 results for author: Chen, Y

Searching in archive cs. Search in all archives.
.
  1. arXiv:2405.05641  [pdf, other

    eess.SP cs.IT

    Channel Estimation for Holographic MIMO: Wavenumber-Domain Sparsity Inspired Approaches

    Authors: Yuqing Guo, Yuanbin Chen, Ying Wang

    Abstract: This paper investigates the sparse channel estimation for holographic multiple-input multiple-output (HMIMO) systems. Given that the wavenumber-domain representation is based on a series of Fourier harmonics that are in essence a series of orthogonal basis functions, a novel wavenumber-domain sparsifying basis is designed to expose the sparsity inherent in HMIMO channels. Furthermore, by harnessin… ▽ More

    Submitted 9 May, 2024; originally announced May 2024.

    Comments: This paper has been submitted to IEEE WCL, Major Revision

  2. arXiv:2405.05500  [pdf

    cs.RO eess.SY

    Research on the Tender Leaf Identification and Mechanically Perceptible Plucking Finger for High-quality Green Tea

    Authors: Wei Zhang, Yong Chen, Qianqian Wang, Jun Chen

    Abstract: BACKGROUND: Intelligent identification and precise plucking are the keys to intelligent tea harvesting robots, which are of increasing significance nowadays. Aiming at plucking tender leaves for high-quality green tea producing, in this paper, a tender leaf identification algorithm and a mechanically perceptible plucking finger have been proposed. RESULTS: Based on segmentation algorithm and color… ▽ More

    Submitted 8 May, 2024; originally announced May 2024.

  3. arXiv:2405.05474  [pdf

    cs.HC

    (Dis)placed Contributions: Uncovering Hidden Hurdles to Collaborative Writing Involving Non-Native Speakers, Native Speakers, and AI-Powered Editing Tools

    Authors: Yimin Xiao, Yuewen Chen, Naomi Yamashita, Yuexi Chen, Zhicheng Liu, Ge Gao

    Abstract: Content creation today often takes place via collaborative writing. A longstanding interest of CSCW research lies in understanding and promoting the coordination between co-writers. However, little attention has been paid to individuals who write in their non-native language and to co-writer groups involving them. We present a mixed-method study that fills the above gap. Our participants included… ▽ More

    Submitted 8 May, 2024; originally announced May 2024.

  4. arXiv:2405.05237  [pdf, other

    cs.CV

    EVA-X: A Foundation Model for General Chest X-ray Analysis with Self-supervised Learning

    Authors: Jingfeng Yao, Xinggang Wang, Yuehao Song, Huangxuan Zhao, Jun Ma, Yajie Chen, Wenyu Liu, Bo Wang

    Abstract: The diagnosis and treatment of chest diseases play a crucial role in maintaining human health. X-ray examination has become the most common clinical examination means due to its efficiency and cost-effectiveness. Artificial intelligence analysis methods for chest X-ray images are limited by insufficient annotation data and varying levels of annotation, resulting in weak generalization ability and… ▽ More

    Submitted 8 May, 2024; originally announced May 2024.

    Comments: codes available at: https://github.com/hustvl/EVA-X

  5. arXiv:2405.05202  [pdf, other

    cs.DS cs.DM cs.LG

    Guided Combinatorial Algorithms for Submodular Maximization

    Authors: Yixin Chen, Ankur Nath, Chunli Peng, Alan Kuhnle

    Abstract: For constrained, not necessarily monotone submodular maximization, guiding the measured continuous greedy algorithm with a local search algorithm currently obtains the state-of-the-art approximation factor of 0.401 \citep{buchbinder2023constrained}. These algorithms rely upon the multilinear extension and the Lovasz extension of a submodular set function. However, the state-of-the-art approximatio… ▽ More

    Submitted 8 May, 2024; originally announced May 2024.

  6. arXiv:2405.04985  [pdf

    cs.AI cs.CE

    An Artificial Intelligence Approach for Interpreting Creative Combinational Designs

    Authors: Liuqing Chen, Shuhong Xiao, Yunnong Chen, Linyun Sun, Peter R. N. Childs, Ji Han

    Abstract: Combinational creativity, a form of creativity involving the blending of familiar ideas, is pivotal in design innovation. While most research focuses on how combinational creativity in design is achieved through blending elements, this study focuses on the computational interpretation, specifically identifying the 'base' and 'additive' components that constitute a creative design. To achieve this… ▽ More

    Submitted 8 May, 2024; originally announced May 2024.

  7. arXiv:2405.04975  [pdf, other

    cs.SE

    Prototype2Code: End-to-end Front-end Code Generation from UI Design Prototypes

    Authors: Shuhong Xiao, Yunnong Chen, Jiazhi Li, Liuqing Chen, Lingyun Sun, Tingting Zhou

    Abstract: UI-to-code technology has streamlined the front-end development process, reducing repetitive tasks for engineers. prior research mainly use design prototypes as inputs, with the effectiveness of the generated code heavily dependent on these prototypes' quality, leading to compromised robustness. Moreover, these approaches also exhibit shortcomings in code quality, including issues such as disorgan… ▽ More

    Submitted 8 May, 2024; originally announced May 2024.

    Comments: 11 pages, 6 figures

  8. arXiv:2405.04964  [pdf, other

    cs.CV

    Frequency-Assisted Mamba for Remote Sensing Image Super-Resolution

    Authors: Yi Xiao, Qiangqiang Yuan, Kui Jiang, Yuzeng Chen, Qiang Zhang, Chia-Wen Lin

    Abstract: Recent progress in remote sensing image (RSI) super-resolution (SR) has exhibited remarkable performance using deep neural networks, e.g., Convolutional Neural Networks and Transformers. However, existing SR methods often suffer from either a limited receptive field or quadratic computational overhead, resulting in sub-optimal global representation and unacceptable computational costs in large-sca… ▽ More

    Submitted 8 May, 2024; originally announced May 2024.

    Comments: Frequency-Assisted Mamba for Remote Sensing Image Super-Resolution

  9. arXiv:2405.04902  [pdf, other

    eess.IV cs.CV

    HAGAN: Hybrid Augmented Generative Adversarial Network for Medical Image Synthesis

    Authors: Zhihan Ju, Wanting Zhou, Longteng Kong, Yu Chen, Yi Li, Zhenan Sun, Caifeng Shan

    Abstract: Medical Image Synthesis (MIS) plays an important role in the intelligent medical field, which greatly saves the economic and time costs of medical diagnosis. However, due to the complexity of medical images and similar characteristics of different tissue cells, existing methods face great challenges in meeting their biological consistency. To this end, we propose the Hybrid Augmented Generative Ad… ▽ More

    Submitted 8 May, 2024; originally announced May 2024.

  10. arXiv:2405.04867  [pdf, other

    eess.IV cs.CV

    MIPI 2024 Challenge on Demosaic for HybridEVS Camera: Methods and Results

    Authors: Yaqi Wu, Zhihao Fan, Xiaofeng Chu, Jimmy S. Ren, Xiaoming Li, Zongsheng Yue, Chongyi Li, Shangcheng Zhou, Ruicheng Feng, Yuekun Dai, Peiqing Yang, Chen Change Loy, Senyan Xu, Zhijing Sun, Jiaying Zhu, Yurui Zhu, Xueyang Fu, Zheng-Jun Zha, Jun Cao, Cheng Li, Shu Chen, Liang Ma, Shiyang Zhou, Haijin Zeng, Kai Feng , et al. (24 additional authors not shown)

    Abstract: The increasing demand for computational photography and imaging on mobile platforms has led to the widespread development and integration of advanced image sensors with novel algorithms in camera systems. However, the scarcity of high-quality data for research and the rare opportunity for in-depth exchange of views from industry and academia constrain the development of mobile intelligent photogra… ▽ More

    Submitted 8 May, 2024; originally announced May 2024.

    Comments: MIPI@CVPR2024. Website: https://mipi-challenge.org/MIPI2024/

  11. arXiv:2405.04795  [pdf, other

    cs.LG

    Variational Schrödinger Diffusion Models

    Authors: Wei Deng, Weijian Luo, Yixin Tan, Marin Biloš, Yu Chen, Yuriy Nevmyvaka, Ricky T. Q. Chen

    Abstract: Schrödinger bridge (SB) has emerged as the go-to method for optimizing transportation plans in diffusion models. However, SB requires estimating the intractable forward score functions, inevitably resulting in the costly implicit training loss based on simulated trajectories. To improve the scalability while preserving efficient transportation plans, we leverage variational inference to linearize… ▽ More

    Submitted 8 May, 2024; originally announced May 2024.

    Comments: ICML 2024

  12. arXiv:2405.04494  [pdf, other

    cs.LG

    Representation Learning of Daily Movement Data Using Text Encoders

    Authors: Alexander Capstick, Tianyu Cui, Yu Chen, Payam Barnaghi

    Abstract: Time-series representation learning is a key area of research for remote healthcare monitoring applications. In this work, we focus on a dataset of recordings of in-home activity from people living with Dementia. We design a representation learning method based on converting activity to text strings that can be encoded using a language model fine-tuned to transform data from the same participants… ▽ More

    Submitted 7 May, 2024; originally announced May 2024.

    Comments: Accepted at ICLR 2024 Workshop on Learning from Time Series For Health: https://openreview.net/forum?id=mmxNNwxvWG

    Journal ref: International Conference on Learning Representations 2024 Workshop on Learning from Time Series For Health

  13. arXiv:2405.04128  [pdf, other

    cs.CL cs.SD eess.AS

    Fine-grained Speech Sentiment Analysis in Chinese Psychological Support Hotlines Based on Large-scale Pre-trained Model

    Authors: Zhonglong Chen, Changwei Song, Yining Chen, Jianqiang Li, Guanghui Fu, Yongsheng Tong, Qing Zhao

    Abstract: Suicide and suicidal behaviors remain significant challenges for public policy and healthcare. In response, psychological support hotlines have been established worldwide to provide immediate help to individuals in mental crises. The effectiveness of these hotlines largely depends on accurately identifying callers' emotional states, particularly underlying negative emotions indicative of increased… ▽ More

    Submitted 7 May, 2024; originally announced May 2024.

  14. arXiv:2405.04042  [pdf, other

    cs.CV cs.AI

    Space-time Reinforcement Network for Video Object Segmentation

    Authors: Yadang Chen, Wentao Zhu, Zhi-Xin Yang, Enhua Wu

    Abstract: Recently, video object segmentation (VOS) networks typically use memory-based methods: for each query frame, the mask is predicted by space-time matching to memory frames. Despite these methods having superior performance, they suffer from two issues: 1) Challenging data can destroy the space-time coherence between adjacent video frames. 2) Pixel-level matching will lead to undesired mismatching c… ▽ More

    Submitted 7 May, 2024; originally announced May 2024.

    Comments: Accepted by ICME 2024. 6 pages, 10 figures

  15. arXiv:2405.04009  [pdf, other

    cs.CV cs.AI

    Structured Click Control in Transformer-based Interactive Segmentation

    Authors: Long Xu, Yongquan Chen, Rui Huang, Feng Wu, Shiwu Lai

    Abstract: Click-point-based interactive segmentation has received widespread attention due to its efficiency. However, it's hard for existing algorithms to obtain precise and robust responses after multiple clicks. In this case, the segmentation results tend to have little change or are even worse than before. To improve the robustness of the response, we propose a structured click intent model based on gra… ▽ More

    Submitted 7 May, 2024; originally announced May 2024.

    Comments: 10 pages, 6 figures, submitted to NeurIPS 2024

  16. arXiv:2405.03644  [pdf, other

    cs.CR cs.AI

    When LLMs Meet Cybersecurity: A Systematic Literature Review

    Authors: Jie Zhang, Haoyu Bu, Hui Wen, Yu Chen, Lun Li, Hongsong Zhu

    Abstract: The rapid advancements in large language models (LLMs) have opened new avenues across various fields, including cybersecurity, which faces an ever-evolving threat landscape and need for innovative technologies. Despite initial explorations into the application of LLMs in cybersecurity, there is a lack of a comprehensive overview of this research area. This paper bridge this gap by providing a syst… ▽ More

    Submitted 6 May, 2024; originally announced May 2024.

    Comments: 36 pages, 7 figures

  17. arXiv:2405.03547  [pdf, other

    cs.LG cs.AI cs.NE

    Position: Leverage Foundational Models for Black-Box Optimization

    Authors: Xingyou Song, Yingtao Tian, Robert Tjarko Lange, Chansoo Lee, Yujin Tang, Yutian Chen

    Abstract: Undeniably, Large Language Models (LLMs) have stirred an extraordinary wave of innovation in the machine learning research domain, resulting in substantial impact across diverse fields such as reinforcement learning, robotics, and computer vision. Their incorporation has been rapid and transformative, marking a significant paradigm shift in the field of machine learning research. However, the fiel… ▽ More

    Submitted 9 May, 2024; v1 submitted 6 May, 2024; originally announced May 2024.

    Comments: International Conference on Machine Learning (ICML) 2024

  18. arXiv:2405.03501  [pdf, other

    cs.LG cs.AI cs.CV

    Boosting Single Positive Multi-label Classification with Generalized Robust Loss

    Authors: Yanxi Chen, Chunxiao Li, Xinyang Dai, Jinhuan Li, Weiyu Sun, Yiming Wang, Renyuan Zhang, Tinghe Zhang, Bo Wang

    Abstract: Multi-label learning (MLL) requires comprehensive multi-semantic annotations that is hard to fully obtain, thus often resulting in missing labels scenarios. In this paper, we investigate Single Positive Multi-label Learning (SPML), where each image is associated with merely one positive label. Existing SPML methods only focus on designing losses using mechanisms such as hard pseudo-labeling and ro… ▽ More

    Submitted 6 May, 2024; originally announced May 2024.

    Comments: 14 pages, 5 figures, 6 tables

  19. arXiv:2405.03103  [pdf, other

    cs.LG cs.CV

    Learning from Students: Applying t-Distributions to Explore Accurate and Efficient Formats for LLMs

    Authors: Jordan Dotzel, Yuzong Chen, Bahaa Kotb, Sushma Prasad, Gang Wu, Sheng Li, Mohamed S. Abdelfattah, Zhiru Zhang

    Abstract: Large language models (LLMs) have recently achieved state-of-the-art performance across various tasks, yet due to their large computational requirements, they struggle with strict latency and power demands. Deep neural network (DNN) quantization has traditionally addressed these limitations by converting models to low-precision integer formats. Yet recently alternative formats, such as Normal Floa… ▽ More

    Submitted 5 May, 2024; originally announced May 2024.

    Comments: Accepted to ICML 2024

  20. arXiv:2405.02880  [pdf, other

    cs.CV cs.RO

    Blending Distributed NeRFs with Tri-stage Robust Pose Optimization

    Authors: Baijun Ye, Caiyun Liu, Xiaoyu Ye, Yuantao Chen, Yuhai Wang, Zike Yan, Yongliang Shi, Hao Zhao, Guyue Zhou

    Abstract: Due to the limited model capacity, leveraging distributed Neural Radiance Fields (NeRFs) for modeling extensive urban environments has become a necessity. However, current distributed NeRF registration approaches encounter aliasing artifacts, arising from discrepancies in rendering resolutions and suboptimal pose precision. These factors collectively deteriorate the fidelity of pose estimation wit… ▽ More

    Submitted 5 May, 2024; originally announced May 2024.

  21. arXiv:2405.02826  [pdf, other

    cs.CR

    Nip in the Bud: Forecasting and Interpreting Post-exploitation Attacks in Real-time through Cyber Threat Intelligence Reports

    Authors: Tiantian Zhu, Jie Ying, Tieming Chen, Chunlin Xiong, Wenrui Cheng, Qixuan Yuan, Aohan Zheng, Mingqi Lv, Yan Chen

    Abstract: Advanced Persistent Threat (APT) attacks have caused significant damage worldwide. Various Endpoint Detection and Response (EDR) systems are deployed by enterprises to fight against potential threats. However, EDR suffers from high false positives. In order not to affect normal operations, analysts need to investigate and filter detection results before taking countermeasures, in which heavy manua… ▽ More

    Submitted 5 May, 2024; originally announced May 2024.

  22. arXiv:2405.02629  [pdf, other

    cs.CR

    SPARSE: Semantic Tracking and Path Analysis for Attack Investigation in Real-time

    Authors: Jie Ying, Tiantian Zhu, Wenrui Cheng, Qixuan Yuan, Mingjun Ma, Chunlin Xiong, Tieming Chen, Mingqi Lv, Yan Chen

    Abstract: As the complexity and destructiveness of Advanced Persistent Threat (APT) increase, there is a growing tendency to identify a series of actions undertaken to achieve the attacker's target, called attack investigation. Currently, analysts construct the provenance graph to perform causality analysis on Point-Of-Interest (POI) event for capturing critical events (related to the attack). However, due… ▽ More

    Submitted 4 May, 2024; originally announced May 2024.

  23. arXiv:2405.02023  [pdf, other

    cs.CV

    IFNet: Deep Imaging and Focusing for Handheld SAR with Millimeter-wave Signals

    Authors: Yadong Li, Dongheng Zhang, Ruixu Geng, Jincheng Wu, Yang Hu, Qibin Sun, Yan Chen

    Abstract: Recent advancements have showcased the potential of handheld millimeter-wave (mmWave) imaging, which applies synthetic aperture radar (SAR) principles in portable settings. However, existing studies addressing handheld motion errors either rely on costly tracking devices or employ simplified imaging models, leading to impractical deployment or limited performance. In this paper, we present IFNet,… ▽ More

    Submitted 5 May, 2024; v1 submitted 3 May, 2024; originally announced May 2024.

  24. arXiv:2405.01736  [pdf, other

    cs.AR

    PipeOrgan: Efficient Inter-operation Pipelining with Flexible Spatial Organization and Interconnects

    Authors: Raveesh Garg, Hyoukjun Kwon, Eric Qin, Yu-Hsin Chen, Tushar Krishna, Liangzhen Lai

    Abstract: Because of the recent trends in Deep Neural Networks (DNN) models being memory-bound, inter-operator pipelining for DNN accelerators is emerging as a promising optimization. Inter-operator pipelining reduces costly on-chip global memory and off-chip memory accesses by forwarding the output of a layer as the input of the next layer within the compute array, which is proven to be an effective optimi… ▽ More

    Submitted 2 May, 2024; originally announced May 2024.

  25. arXiv:2405.01723  [pdf, other

    cs.CV cs.AI

    Zero-Shot Monocular Motion Segmentation in the Wild by Combining Deep Learning with Geometric Motion Model Fusion

    Authors: Yuxiang Huang, Yuhao Chen, John Zelek

    Abstract: Detecting and segmenting moving objects from a moving monocular camera is challenging in the presence of unknown camera motion, diverse object motions and complex scene structures. Most existing methods rely on a single motion cue to perform motion segmentation, which is usually insufficient when facing different complex environments. While a few recent deep learning based methods are able to comb… ▽ More

    Submitted 2 May, 2024; originally announced May 2024.

    Comments: Accepted by the 2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW)

  26. arXiv:2405.01502  [pdf, other

    cs.CL cs.AI cs.LG

    Analyzing the Role of Semantic Representations in the Era of Large Language Models

    Authors: Zhijing Jin, Yuen Chen, Fernando Gonzalez, Jiarui Liu, Jiayi Zhang, Julian Michael, Bernhard Schölkopf, Mona Diab

    Abstract: Traditionally, natural language processing (NLP) models often use a rich set of features created by linguistic expertise, such as semantic representations. However, in the era of large language models (LLMs), more and more tasks are turned into generic, end-to-end sequence generation problems. In this paper, we investigate the question: what is the role of semantic representations in the era of LL… ▽ More

    Submitted 2 May, 2024; originally announced May 2024.

    Comments: NAACL 2024

  27. arXiv:2405.01466  [pdf, other

    cs.SE

    A Systematic Literature Review on Large Language Models for Automated Program Repair

    Authors: Quanjun Zhang, Chunrong Fang, Yang Xie, YuXiang Ma, Weisong Sun, Yun Yang Zhenyu Chen

    Abstract: Automated Program Repair (APR) attempts to patch software bugs and reduce manual debugging efforts. Very recently, with the advances in Large Language Models (LLMs), an increasing number of APR techniques have been proposed, facilitating software development and maintenance and demonstrating remarkable performance. However, due to ongoing explorations in the LLM-based APR field, it is challenging… ▽ More

    Submitted 2 May, 2024; originally announced May 2024.

  28. arXiv:2405.00842  [pdf, other

    math.ST cs.IT cs.LG eess.SP math.OC

    Quickest Change Detection with Confusing Change

    Authors: Yu-Zhen Janice Chen, Jinhang Zuo, Venugopal V. Veeravalli, Don Towsley

    Abstract: In the problem of quickest change detection (QCD), a change occurs at some unknown time in the distribution of a sequence of independent observations. This work studies a QCD problem where the change is either a bad change, which we aim to detect, or a confusing change, which is not of our interest. Our objective is to detect a bad change as quickly as possible while avoiding raising a false alarm… ▽ More

    Submitted 1 May, 2024; originally announced May 2024.

  29. arXiv:2405.00711  [pdf, other

    cs.CL cs.AI cs.CY

    Fake Artificial Intelligence Generated Contents (FAIGC): A Survey of Theories, Detection Methods, and Opportunities

    Authors: Xiaomin Yu, Yezhaohui Wang, Yanfang Chen, Zhen Tao, Dinghao Xi, Shichao Song, Simin Niu, Zhiyu Li

    Abstract: In recent years, generative artificial intelligence models, represented by Large Language Models (LLMs) and Diffusion Models (DMs), have revolutionized content production methods. These artificial intelligence-generated content (AIGC) have become deeply embedded in various aspects of daily life and work. However, these technologies have also led to the emergence of Fake Artificial Intelligence Gen… ▽ More

    Submitted 3 May, 2024; v1 submitted 25 April, 2024; originally announced May 2024.

  30. arXiv:2405.00676  [pdf, other

    cs.CV

    Spectrally Pruned Gaussian Fields with Neural Compensation

    Authors: Runyi Yang, Zhenxin Zhu, Zhou Jiang, Baijun Ye, Xiaoxue Chen, Yifei Zhang, Yuantao Chen, Jian Zhao, Hao Zhao

    Abstract: Recently, 3D Gaussian Splatting, as a novel 3D representation, has garnered attention for its fast rendering speed and high rendering quality. However, this comes with high memory consumption, e.g., a well-trained Gaussian field may utilize three million Gaussian primitives and over 700 MB of memory. We credit this high memory footprint to the lack of consideration for the relationship between pri… ▽ More

    Submitted 1 May, 2024; originally announced May 2024.

    Comments: Code: https://github.com/RunyiYang/SUNDAE Project page: https://runyiyang.github.io/projects/SUNDAE/

  31. arXiv:2405.00579  [pdf, other

    cs.GT

    LEAP: Optimization Hierarchical Federated Learning on Non-IID Data with Coalition Formation Game

    Authors: Jianfeng Lu, Yue Chen, Shuqin Cao, Longbiao Chen, Wei Wang, Yun Xin

    Abstract: Although Hierarchical Federated Learning (HFL) utilizes edge servers (ESs) to alleviate communication burdens, its model performance will be degraded by non-IID data and limited communication resources. Current works often assume that data is uniformly distributed, which however contradicts the heterogeneity of IoT. Solutions of additional model training to check the data distribution inevitably i… ▽ More

    Submitted 1 May, 2024; originally announced May 2024.

  32. arXiv:2405.00334  [pdf, other

    cs.LG

    A Survey on Deep Active Learning: Recent Advances and New Frontiers

    Authors: Dongyuan Li, Zhen Wang, Yankai Chen, Renhe Jiang, Weiping Ding, Manabu Okumura

    Abstract: Active learning seeks to achieve strong performance with fewer training samples. It does this by iteratively asking an oracle to label new selected samples in a human-in-the-loop manner. This technique has gained increasing popularity due to its broad applicability, yet its survey papers, especially for deep learning-based active learning (DAL), remain scarce. Therefore, we conduct an advanced and… ▽ More

    Submitted 1 May, 2024; originally announced May 2024.

    Comments: This paper is accepted by IEEE Transactions on Neural Networks and Learning Systems

  33. arXiv:2405.00218  [pdf, other

    cs.CR cs.AI cs.LG cs.SE

    Constrained Decoding for Secure Code Generation

    Authors: Yanjun Fu, Ethan Baker, Yizheng Chen

    Abstract: Code Large Language Models (Code LLMs) have been increasingly used by developers to boost productivity, but they often generate vulnerable code. Thus, there is an urgent need to ensure that code generated by Code LLMs is correct and secure. Previous research has primarily focused on generating secure code, overlooking the fact that secure code also needs to be correct. This oversight can lead to a… ▽ More

    Submitted 30 April, 2024; originally announced May 2024.

    Comments: 17 pages, 8 figures

  34. arXiv:2404.19518  [pdf, other

    cs.MA cs.AI cs.RO

    MGCBS: An Optimal and Efficient Algorithm for Solving Multi-Goal Multi-Agent Path Finding Problem

    Authors: Mingkai Tang, Yuanhang Li, Hongji Liu, Yingbing Chen, Ming Liu, Lujia Wang

    Abstract: With the expansion of the scale of robotics applications, the multi-goal multi-agent pathfinding (MG-MAPF) problem began to gain widespread attention. This problem requires each agent to visit pre-assigned multiple goal points at least once without conflict. Some previous methods have been proposed to solve the MG-MAPF problem based on Decoupling the goal Vertex visiting order search and the Singl… ▽ More

    Submitted 30 April, 2024; originally announced April 2024.

    Comments: to be published in IJCAI2024

  35. arXiv:2404.19220  [pdf, other

    stat.ML cs.LG

    Regression for matrix-valued data via Kronecker products factorization

    Authors: Yin-Jen Chen, Minh Tang

    Abstract: We study the matrix-variate regression problem $Y_i = \sum_{k} β_{1k} X_i β_{2k}^{\top} + E_i$ for $i=1,2\dots,n$ in the high dimensional regime wherein the response $Y_i$ are matrices whose dimensions $p_{1}\times p_{2}$ outgrow both the sample size $n$ and the dimensions $q_{1}\times q_{2}$ of the predictor variables $X_i$ i.e., $q_{1},q_{2} \ll n \ll p_{1},p_{2}$. We propose an estimation algor… ▽ More

    Submitted 29 April, 2024; originally announced April 2024.

  36. arXiv:2404.18961  [pdf, other

    cs.LG cs.AI cs.CV

    Unleashing the Power of Multi-Task Learning: A Comprehensive Survey Spanning Traditional, Deep, and Pretrained Foundation Model Eras

    Authors: Jun Yu, Yutong Dai, Xiaokang Liu, Jin Huang, Yishan Shen, Ke Zhang, Rong Zhou, Eashan Adhikarla, Wenxuan Ye, Yixin Liu, Zhaoming Kong, Kai Zhang, Yilong Yin, Vinod Namboodiri, Brian D. Davison, Jason H. Moore, Yong Chen

    Abstract: MTL is a learning paradigm that effectively leverages both task-specific and shared information to address multiple related tasks simultaneously. In contrast to STL, MTL offers a suite of benefits that enhance both the training process and the inference efficiency. MTL's key advantages encompass streamlined model architecture, performance enhancement, and cross-domain generalizability. Over the pa… ▽ More

    Submitted 29 April, 2024; originally announced April 2024.

    Comments: 60 figures, 116 pages, 500+ references

  37. arXiv:2404.18730  [pdf, other

    cs.LG cs.AI stat.AP

    CVTN: Cross Variable and Temporal Integration for Time Series Forecasting

    Authors: Han Zhou, Yuntian Chen

    Abstract: In multivariate time series forecasting, the Transformer architecture encounters two significant challenges: effectively mining features from historical sequences and avoiding overfitting during the learning of temporal dependencies. To tackle these challenges, this paper deconstructs time series forecasting into the learning of historical sequences and prediction sequences, introducing the Cross-… ▽ More

    Submitted 29 April, 2024; originally announced April 2024.

  38. arXiv:2404.18670  [pdf, other

    cs.LG stat.AP

    Enhancing Uncertain Demand Prediction in Hospitals Using Simple and Advanced Machine Learning

    Authors: Annie Hu, Samuel Stockman, Xun Wu, Richard Wood, Bangdong Zhi, Oliver Y. Chén

    Abstract: Early and timely prediction of patient care demand not only affects effective resource allocation but also influences clinical decision-making as well as patient experience. Accurately predicting patient care demand, however, is a ubiquitous challenge for hospitals across the world due, in part, to the demand's time-varying temporal variability, and, in part, to the difficulty in modelling trends… ▽ More

    Submitted 29 April, 2024; originally announced April 2024.

  39. arXiv:2404.18564  [pdf, other

    cs.CL cs.AI

    Injecting Salesperson's Dialogue Strategies in Large Language Models with Chain-of-Thought Reasoning

    Authors: Wen-Yu Chang, Yun-Nung Chen

    Abstract: Recent research in dialogue systems and corpora has focused on two main categories: task-oriented (TOD) and open-domain (chit-chat) dialogues. TOD systems help users accomplish specific tasks, while open-domain systems aim to create engaging conversations. However, in real-world scenarios, user intents are often revealed during interactions. A recent study introduced SalesBot, which simulates dial… ▽ More

    Submitted 29 April, 2024; originally announced April 2024.

    Comments: arXiv admin note: substantial text overlap with arXiv:2308.14266

  40. arXiv:2404.18527  [pdf

    cs.LG cs.AI cs.CR stat.AP

    Bridging Data Barriers among Participants: Assessing the Potential of Geoenergy through Federated Learning

    Authors: Weike Peng, Jiaxin Gao, Yuntian Chen, Shengwei Wang

    Abstract: Machine learning algorithms emerge as a promising approach in energy fields, but its practical is hindered by data barriers, stemming from high collection costs and privacy concerns. This study introduces a novel federated learning (FL) framework based on XGBoost models, enabling safe collaborative modeling with accessible yet concealed data from multiple parties. Hyperparameter tuning of the mode… ▽ More

    Submitted 29 April, 2024; originally announced April 2024.

  41. arXiv:2404.18401  [pdf

    cs.CV

    Spectral-Spatial Mamba for Hyperspectral Image Classification

    Authors: Lingbo Huang, Yushi Chen, Xin He

    Abstract: Recently, deep learning models have achieved excellent performance in hyperspectral image (HSI) classification. Among the many deep models, Transformer has gradually attracted interest for its excellence in modeling the long-range dependencies of spatial-spectral features in HSI. However, Transformer has the problem of quadratic computational complexity due to the self-attention mechanism, which i… ▽ More

    Submitted 28 April, 2024; originally announced April 2024.

    Comments: 12 pages

  42. arXiv:2404.18279  [pdf, other

    cs.CV

    Out-of-distribution Detection in Medical Image Analysis: A survey

    Authors: Zesheng Hong, Yubiao Yue, Yubin Chen, Huanjie Lin, Yuanmei Luo, Mini Han Wang, Weidong Wang, Jialong Xu, Xiaoqi Yang, Zhenzhang Li, Sihong Xie

    Abstract: Computer-aided diagnostics has benefited from the development of deep learning-based computer vision techniques in these years. Traditional supervised deep learning methods assume that the test sample is drawn from the identical distribution as the training data. However, it is possible to encounter out-of-distribution samples in real-world clinical scenarios, which may cause silent failure in dee… ▽ More

    Submitted 28 April, 2024; originally announced April 2024.

    Comments: 23 pages, 3 figures

  43. arXiv:2404.18267  [pdf, other

    eess.SY cs.LG q-bio.QM

    LINOCS: Lookahead Inference of Networked Operators for Continuous Stability

    Authors: Noga Mudrik, Eva Yezerets, Yenho Chen, Christopher Rozell, Adam Charles

    Abstract: Identifying latent interactions within complex systems is key to unlocking deeper insights into their operational dynamics, including how their elements affect each other and contribute to the overall system behavior. For instance, in neuroscience, discovering neuron-to-neuron interactions is essential for understanding brain function; in ecology, recognizing the interactions among populations is… ▽ More

    Submitted 28 April, 2024; originally announced April 2024.

    Comments: under review

  44. Static Application Security Testing (SAST) Tools for Smart Contracts: How Far Are We?

    Authors: Kaixuan Li, Yue Xue, Sen Chen, Han Liu, Kairan Sun, Ming Hu, Haijun Wang, Yang Liu, Yixiang Chen

    Abstract: In recent years, the importance of smart contract security has been heightened by the increasing number of attacks against them. To address this issue, a multitude of static application security testing (SAST) tools have been proposed for detecting vulnerabilities in smart contracts. However, objectively comparing these tools to determine their effectiveness remains challenging. Existing studies o… ▽ More

    Submitted 28 April, 2024; originally announced April 2024.

    Comments: to appear at FSE 2024

  45. arXiv:2404.18149  [pdf, other

    cs.CV cs.AI cs.MM

    Compressed Deepfake Video Detection Based on 3D Spatiotemporal Trajectories

    Authors: Zongmei Chen, Xin Liao, Xiaoshuai Wu, Yanxiang Chen

    Abstract: The misuse of deepfake technology by malicious actors poses a potential threat to nations, societies, and individuals. However, existing methods for detecting deepfakes primarily focus on uncompressed videos, such as noise characteristics, local textures, or frequency statistics. When applied to compressed videos, these methods experience a decrease in detection performance and are less suitable f… ▽ More

    Submitted 28 April, 2024; originally announced April 2024.

  46. arXiv:2404.17990  [pdf, other

    cs.LG cs.DC

    TabVFL: Improving Latent Representation in Vertical Federated Learning

    Authors: Mohamed Rashad, Zilong Zhao, Jeremie Decouchant, Lydia Y. Chen

    Abstract: Autoencoders are popular neural networks that are able to compress high dimensional data to extract relevant latent information. TabNet is a state-of-the-art neural network model designed for tabular data that utilizes an autoencoder architecture for training. Vertical Federated Learning (VFL) is an emerging distributed machine learning paradigm that allows multiple parties to train a model collab… ▽ More

    Submitted 27 April, 2024; originally announced April 2024.

  47. arXiv:2404.17839  [pdf, other

    cs.CR cs.SE

    Improving Smart Contract Security with Contrastive Learning-based Vulnerability Detection

    Authors: Yizhou Chen, Zeyu Sun, Zhihao Gong, Dan Hao

    Abstract: Currently, smart contract vulnerabilities (SCVs) have emerged as a major factor threatening the transaction security of blockchain. Existing state-of-the-art methods rely on deep learning to mitigate this threat. They treat each input contract as an independent entity and feed it into a deep learning model to learn vulnerability patterns by fitting vulnerability labels. It is a pity that they disr… ▽ More

    Submitted 27 April, 2024; originally announced April 2024.

    Journal ref: 2024 IEEE/ACM 46th International Conference on Software Engineering (ICSE '24)

  48. arXiv:2404.17806  [pdf, other

    cs.SD cs.CL cs.LG eess.AS

    T-CLAP: Temporal-Enhanced Contrastive Language-Audio Pretraining

    Authors: Yi Yuan, Zhuo Chen, Xubo Liu, Haohe Liu, Xuenan Xu, Dongya Jia, Yuanzhe Chen, Mark D. Plumbley, Wenwu Wang

    Abstract: Contrastive language-audio pretraining~(CLAP) has been developed to align the representations of audio and language, achieving remarkable performance in retrieval and classification tasks. However, current CLAP struggles to capture temporal information within audio and text features, presenting substantial limitations for tasks such as audio retrieval and generation. To address this gap, we introd… ▽ More

    Submitted 27 April, 2024; originally announced April 2024.

    Comments: Preprint submitted to IEEE MLSP 2024

  49. arXiv:2404.17685  [pdf

    cs.RO

    Localization Through Particle Filter Powered Neural Network Estimated Monocular Camera Poses

    Authors: Yi Shen, Hao Liu, Xinxin Liu, Wenjing Zhou, Chang Zhou, Yizhou Chen

    Abstract: The reduced cost and computational and calibration requirements of monocular cameras make them ideal positioning sensors for mobile robots, albeit at the expense of any meaningful depth measurement. Solutions proposed by some scholars to this localization problem involve fusing pose estimates from convolutional neural networks (CNNs) with pose estimates from geometric constraints on motion to gene… ▽ More

    Submitted 26 April, 2024; originally announced April 2024.

  50. arXiv:2404.17604  [pdf, other

    cs.HC cs.SE

    Exploring Remote Hands-on Support for Collaborative Embedded Systems Development

    Authors: Yan Chen, Jasmine Jones

    Abstract: Embedded systems development is a complex task that often requires team collaboration. Given the growing market of freelancers and the global shift to remote work, remote collaboration has become a necessity for many developers and clients. While existing communication and coordination tools help users share, discuss, and edit code collaboratively, these tools were specifically designed for softwa… ▽ More

    Submitted 24 April, 2024; originally announced April 2024.