Showing 1–50 of 98 results for author: Qin, S

  1. arXiv:2405.15413  [pdf, other]

    eess.IV cs.CV cs.IT

    MambaVC: Learned Visual Compression with Selective State Spaces

    Authors: Shiyu Qin, Jinpeng Wang, Yimin Zhou, Bin Chen, Tianci Luo, Baoyi An, Tao Dai, Shutao Xia, Yaowei Wang

    Abstract: Learned visual compression is an important and active task in multimedia. Existing approaches have explored various CNN- and Transformer-based designs to model content distribution and eliminate redundancy, where balancing efficacy (i.e., rate-distortion trade-off) and efficiency remains a challenge. Recently, state-space models (SSMs) have shown promise due to their long-range modeling capacity a… ▽ More

    Submitted 28 May, 2024; v1 submitted 24 May, 2024; originally announced May 2024.

    Comments: 17 pages, 15 figures

  2. arXiv:2405.15370  [pdf, other]

    cs.CL

    Large Language Models can Deliver Accurate and Interpretable Time Series Anomaly Detection

    Authors: Jun Liu, Chaoyun Zhang, Jiaxu Qian, Minghua Ma, Si Qin, Chetan Bansal, Qingwei Lin, Saravan Rajmohan, Dongmei Zhang

    Abstract: Time series anomaly detection (TSAD) plays a crucial role in various industries by identifying atypical patterns that deviate from standard trends, thereby maintaining system integrity and enabling prompt response measures. Traditional TSAD models, which often rely on deep learning, require extensive training data and operate as black boxes, lacking interpretability for detected anomalies. To addr… ▽ More

    Submitted 24 May, 2024; originally announced May 2024.

  3. arXiv:2404.17780  [pdf, other]

    cs.MA cs.AI

    Verco: Learning Coordinated Verbal Communication for Multi-agent Reinforcement Learning

    Authors: Dapeng Li, Hang Dong, Lu Wang, Bo Qiao, Si Qin, Qingwei Lin, Dongmei Zhang, Qi Zhang, Zhiwei Xu, Bin Zhang, Guoliang Fan

    Abstract: In recent years, multi-agent reinforcement learning algorithms have made significant advancements in diverse gaming environments, leading to increased interest in the broader application of such techniques. To address the prevalent challenge of partial observability, communication-based algorithms have improved cooperative performance through the sharing of numerical embedding between agents. Howe… ▽ More

    Submitted 27 April, 2024; originally announced April 2024.

    Comments: 12 pages, 6 figures

  4. arXiv:2404.11895  [pdf, other]

    cs.CV

    FreeDiff: Progressive Frequency Truncation for Image Editing with Diffusion Models

    Authors: Wei Wu, Qingnan Fan, Shuai Qin, Hong Gu, Ruoyu Zhao, Antoni B. Chan

    Abstract: Precise image editing with text-to-image models has attracted increasing interest due to their remarkable generative capabilities and user-friendly nature. However, such attempts face the pivotal challenge of misalignment between the intended precise editing target regions and the broader area impacted by the guidance in practice. Despite excellent methods leveraging attention mechanisms that have… ▽ More

    Submitted 18 April, 2024; originally announced April 2024.

  5. arXiv:2404.09729  [pdf]

    eess.SP cs.IT cs.LG stat.ME

    Amplitude-Phase Fusion for Enhanced Electrocardiogram Morphological Analysis

    Authors: Shuaicong Hu, Yanan Wang, Jian Liu, Jingyu Lin, Shengmei Qin, Zhenning Nie, Zhifeng Yao, Wenjie Cai, Cuiwei Yang

    Abstract: Considering the variability of amplitude and phase patterns in electrocardiogram (ECG) signals due to cardiac activity and individual differences, existing entropy-based studies have not fully utilized these two patterns and lack integration. To address this gap, this paper proposes a novel fusion entropy metric, morphological ECG entropy (MEE) for the first time, specifically designed for ECG mor… ▽ More

    Submitted 15 April, 2024; originally announced April 2024.

    Comments: 16 pages, 12 figures

    ACM Class: I.5.2

  6. arXiv:2404.07200  [pdf, other]

    cs.LG

    Toward a Better Understanding of Fourier Neural Operators: Analysis and Improvement from a Spectral Perspective

    Authors: Shaoxiang Qin, Fuyuan Lyu, Wenhui Peng, Dingyang Geng, Ju Wang, Naiping Gao, Xue Liu, Liangzhu Leon Wang

    Abstract: In solving partial differential equations (PDEs), Fourier Neural Operators (FNOs) have exhibited notable effectiveness compared to Convolutional Neural Networks (CNNs). This paper presents clear empirical evidence through spectral analysis to elucidate the superiority of FNO over CNNs: FNO is significantly more capable of learning low-frequencies. This empirical evidence also unveils FNO's distinc… ▽ More

    Submitted 10 April, 2024; originally announced April 2024.

  7. arXiv:2404.06835  [pdf, other]

    cs.CV

    Tuning-Free Adaptive Style Incorporation for Structure-Consistent Text-Driven Style Transfer

    Authors: Yanqi Ge, Jiaqi Liu, Qingnan Fan, Xi Jiang, Ye Huang, Shuai Qin, Hong Gu, Wen Li, Lixin Duan

    Abstract: In this work, we target the task of text-driven style transfer in the context of text-to-image (T2I) diffusion models. The main challenge is consistent structure preservation while enabling effective style transfer effects. The past approaches in this field directly concatenate the content and style prompts for a prompt-level style injection, leading to unavoidable structure distortions. In this w… ▽ More

    Submitted 10 April, 2024; originally announced April 2024.

  8. arXiv:2404.03602  [pdf, other]

    cs.CL

    Evaluating LLMs at Detecting Errors in LLM Responses

    Authors: Ryo Kamoi, Sarkar Snigdha Sarathi Das, Renze Lou, Jihyun Janice Ahn, Yilun Zhao, Xiaoxin Lu, Nan Zhang, Yusen Zhang, Ranran Haoran Zhang, Sujeeth Reddy Vummanthala, Salika Dave, Shaobo Qin, Arman Cohan, Wenpeng Yin, Rui Zhang

    Abstract: With Large Language Models (LLMs) being widely used across various tasks, detecting errors in their responses is increasingly crucial. However, little research has been conducted on error detection of LLM responses. Collecting error annotations on LLM responses is challenging due to the subjective nature of many NLP tasks, and thus previous research focuses on tasks of little practical value (e.g.… ▽ More

    Submitted 4 April, 2024; originally announced April 2024.

    Comments: Benchmark and code: https://github.com/psunlpgroup/ReaLMistake

  9. arXiv:2404.00762  [pdf, other]

    cs.SE

    Enchanting Program Specification Synthesis by Large Language Models using Static Analysis and Program Verification

    Authors: Cheng Wen, Jialun Cao, Jie Su, Zhiwu Xu, Shengchao Qin, Mengda He, Haokun Li, Shing-Chi Cheung, Cong Tian

    Abstract: Formal verification provides a rigorous and systematic approach to ensure the correctness and reliability of software systems. Yet, constructing specifications for the full proof relies on domain expertise and non-trivial manpower. In view of such needs, an automated approach for specification synthesis is desired. While existing automated approaches are limited in their versatility, i.e., they ei… ▽ More

    Submitted 2 April, 2024; v1 submitted 31 March, 2024; originally announced April 2024.

  10. arXiv:2403.18660  [pdf, other]

    cs.GR cs.CV

    InstructBrush: Learning Attention-based Instruction Optimization for Image Editing

    Authors: Ruoyu Zhao, Qingnan Fan, Fei Kou, Shuai Qin, Hong Gu, Wei Wu, Pengcheng Xu, Mingrui Zhu, Nannan Wang, Xinbo Gao

    Abstract: In recent years, instruction-based image editing methods have garnered significant attention in image editing. However, despite encompassing a wide range of editing priors, these methods are helpless when handling editing tasks that are challenging to accurately describe through language. We propose InstructBrush, an inversion method for instruction-based image editing methods to bridge this gap.… ▽ More

    Submitted 27 March, 2024; originally announced March 2024.

    Comments: Project Page: https://royzhao926.github.io/InstructBrush/

  11. arXiv:2403.15157  [pdf, other]

    cs.SE

    AllHands: Ask Me Anything on Large-scale Verbatim Feedback via Large Language Models

    Authors: Chaoyun Zhang, Zicheng Ma, Yuhao Wu, Shilin He, Si Qin, Minghua Ma, Xiaoting Qin, Yu Kang, Yuyi Liang, Xiaoyu Gou, Yajie Xue, Qingwei Lin, Saravan Rajmohan, Dongmei Zhang, Qi Zhang

    Abstract: Verbatim feedback constitutes a valuable repository of user experiences, opinions, and requirements essential for software development. Effectively and efficiently extracting valuable insights from such data poses a challenging task. This paper introduces AllHands, an innovative analytic framework designed for large-scale feedback analysis through a natural language interface, leveraging large la… ▽ More

    Submitted 3 April, 2024; v1 submitted 22 March, 2024; originally announced March 2024.

  12. arXiv:2403.09318  [pdf, other]

    quant-ph cs.CV cs.LG

    A Hierarchical Fused Quantum Fuzzy Neural Network for Image Classification

    Authors: Sheng-Yao Wu, Run-Ze Li, Yan-Qi Song, Su-Juan Qin, Qiao-Yan Wen, Fei Gao

    Abstract: Neural network is a powerful learning paradigm for data feature learning in the era of big data. However, most neural network models are deterministic models that ignore the uncertainty of data. Fuzzy neural networks are proposed to address this problem. FDNN is a hierarchical deep neural network that derives information from both fuzzy and neural representations, the representations are then fuse… ▽ More

    Submitted 14 March, 2024; originally announced March 2024.

  13. arXiv:2402.16291  [pdf]

    cs.CV

    mAPm: multi-scale Attention Pyramid module for Enhanced scale-variation in RLD detection

    Authors: Yunusa Haruna, Shiyin Qin, Abdulrahman Hamman Adama Chukkol, Isah Bello, Adamu Lawan

    Abstract: Detecting objects across various scales remains a significant challenge in computer vision, particularly in tasks such as Rice Leaf Disease (RLD) detection, where objects exhibit considerable scale variations. Traditional object detection methods often struggle to address these variations, resulting in missed detections or reduced accuracy. In this study, we propose the multi-scale Attention Pyram… ▽ More

    Submitted 25 February, 2024; originally announced February 2024.

  14. arXiv:2402.11420  [pdf, other]

    cs.CL

    Rethinking the Roles of Large Language Models in Chinese Grammatical Error Correction

    Authors: Yinghui Li, Shang Qin, Jingheng Ye, Shirong Ma, Yangning Li, Libo Qin, Xuming Hu, Wenhao Jiang, Hai-Tao Zheng, Philip S. Yu

    Abstract: Recently, Large Language Models (LLMs) have been widely studied by researchers for their roles in various downstream NLP tasks. As a fundamental task in the NLP field, Chinese Grammatical Error Correction (CGEC) aims to correct all potential grammatical errors in the input sentences. Previous studies have shown that LLMs' performance as correctors on CGEC remains unsatisfactory due to its challeng… ▽ More

    Submitted 17 February, 2024; originally announced February 2024.

  15. arXiv:2402.07939  [pdf, other]

    cs.HC cs.AI cs.CL

    UFO: A UI-Focused Agent for Windows OS Interaction

    Authors: Chaoyun Zhang, Liqun Li, Shilin He, Xu Zhang, Bo Qiao, Si Qin, Minghua Ma, Yu Kang, Qingwei Lin, Saravan Rajmohan, Dongmei Zhang, Qi Zhang

    Abstract: We introduce UFO, an innovative UI-Focused agent to fulfill user requests tailored to applications on Windows OS, harnessing the capabilities of GPT-Vision. UFO employs a dual-agent framework to meticulously observe and analyze the graphical user interface (GUI) and control information of Windows applications. This enables the agent to seamlessly navigate and operate within individual applications… ▽ More

    Submitted 23 May, 2024; v1 submitted 8 February, 2024; originally announced February 2024.

  16. arXiv:2402.02941  [pdf]

    cs.CV cs.LG

    Exploring the Synergies of Hybrid CNNs and ViTs Architectures for Computer Vision: A survey

    Authors: Haruna Yunusa, Shiyin Qin, Abdulrahman Hamman Adama Chukkol, Abdulganiyu Abdu Yusuf, Isah Bello, Adamu Lawan

    Abstract: The hybrid of Convolutional Neural Network (CNN) and Vision Transformers (ViT) architectures has emerged as a groundbreaking approach, pushing the boundaries of computer vision (CV). This comprehensive review provides a thorough examination of the literature on state-of-the-art hybrid CNN-ViT architectures, exploring the synergies between these two approaches. The main content of this survey inclu… ▽ More

    Submitted 5 February, 2024; originally announced February 2024.

  17. arXiv:2401.07206  [pdf, other]

    stat.ML cs.LG eess.SY

    Probabilistic Reduced-Dimensional Vector Autoregressive Modeling with Oblique Projections

    Authors: Yanfang Mo, S. Joe Qin

    Abstract: In this paper, we propose a probabilistic reduced-dimensional vector autoregressive (PredVAR) model to extract low-dimensional dynamics from high-dimensional noisy data. The model utilizes an oblique projection to partition the measurement space into a subspace that accommodates the reduced-dimensional dynamics and a complementary static subspace. An optimal oblique decomposition is derived for th… ▽ More

    Submitted 14 January, 2024; originally announced January 2024.

    Comments: 16 pages, 5 figures

  18. arXiv:2401.07051  [pdf, other]

    cs.LG cs.AI

    COIN: Chance-Constrained Imitation Learning for Uncertainty-aware Adaptive Resource Oversubscription Policy

    Authors: Lu Wang, Mayukh Das, Fangkai Yang, Chao Duo, Bo Qiao, Hang Dong, Si Qin, Chetan Bansal, Qingwei Lin, Saravan Rajmohan, Dongmei Zhang, Qi Zhang

    Abstract: We address the challenge of learning safe and robust decision policies in presence of uncertainty in context of the real scientific problem of adaptive resource oversubscription to enhance resource efficiency while ensuring safety against resource congestion risk. Traditional supervised prediction or forecasting models are ineffective in learning adaptive policies whereas standard online optimiz… ▽ More

    Submitted 13 January, 2024; originally announced January 2024.

    Comments: 9 pages, 4 figures

  19. arXiv:2401.07033  [pdf, other]

    cs.HC

    Risk-aware Adaptive Virtual CPU Oversubscription in Microsoft Cloud via Prototypical Human-in-the-loop Imitation Learning

    Authors: Lu Wang, Mayukh Das, Fangkai Yang, Junjie Sheng, Bo Qiao, Hang Dong, Si Qin, Victor Rühle, Chetan Bansal, Eli Cortez, Íñigo Goiri, Saravan Rajmohan, Qingwei Lin, Dongmei Zhang

    Abstract: Oversubscription is a prevalent practice in cloud services where the system offers more virtual resources, such as virtual cores in virtual machines, to users or applications than its available physical capacity for reducing revenue loss due to unused/redundant capacity. While oversubscription can potentially lead to significant enhancement in efficient resource utilization, the caveat is that it… ▽ More

    Submitted 13 January, 2024; originally announced January 2024.

    Comments: 9 pages, 3 figures

  20. arXiv:2312.12148  [pdf, other]

    cs.CL

    Parameter-Efficient Fine-Tuning Methods for Pretrained Language Models: A Critical Review and Assessment

    Authors: Lingling Xu, Haoran Xie, Si-Zhao Joe Qin, Xiaohui Tao, Fu Lee Wang

    Abstract: With the continuous growth in the number of parameters of transformer-based pretrained language models (PLMs), particularly the emergence of large language models (LLMs) with billions of parameters, many natural language processing (NLP) tasks have demonstrated remarkable success. However, the enormous size and computational demands of these models pose significant challenges for adapting them to… ▽ More

    Submitted 19 December, 2023; originally announced December 2023.

    Comments: 20 pages, 4 figures

  21. arXiv:2312.11988  [pdf, other]

    cs.SE cs.AI cs.PL

    Xpert: Empowering Incident Management with Query Recommendations via Large Language Models

    Authors: Yuxuan Jiang, Chaoyun Zhang, Shilin He, Zhihao Yang, Minghua Ma, Si Qin, Yu Kang, Yingnong Dang, Saravan Rajmohan, Qingwei Lin, Dongmei Zhang

    Abstract: Large-scale cloud systems play a pivotal role in modern IT infrastructure. However, incidents occurring within these systems can lead to service disruptions and adversely affect user experience. To swiftly resolve such incidents, on-call engineers depend on crafting domain-specific language (DSL) queries to analyze telemetry data. However, writing these queries can be challenging and time-consumin… ▽ More

    Submitted 19 December, 2023; originally announced December 2023.

    Comments: Accepted as a research paper at ICSE 2024

  22. arXiv:2312.02207  [pdf, other]

    cs.CV

    TranSegPGD: Improving Transferability of Adversarial Examples on Semantic Segmentation

    Authors: Xiaojun Jia, Jindong Gu, Yihao Huang, Simeng Qin, Qing Guo, Yang Liu, Xiaochun Cao

    Abstract: Transferability of adversarial examples on image classification has been systematically explored, which generates adversarial examples in black-box mode. However, the transferability of adversarial examples on semantic segmentation has been largely overlooked. In this paper, we propose an effective two-stage adversarial attack strategy to improve the transferability of adversarial examples on sema… ▽ More

    Submitted 2 December, 2023; originally announced December 2023.

  23. arXiv:2311.17541  [pdf, other]

    cs.AI

    TaskWeaver: A Code-First Agent Framework

    Authors: Bo Qiao, Liqun Li, Xu Zhang, Shilin He, Yu Kang, Chaoyun Zhang, Fangkai Yang, Hang Dong, Jue Zhang, Lu Wang, Minghua Ma, Pu Zhao, Si Qin, Xiaoting Qin, Chao Du, Yong Xu, Qingwei Lin, Saravan Rajmohan, Dongmei Zhang

    Abstract: Large Language Models (LLMs) have shown impressive abilities in natural language understanding and generation, leading to their use in applications such as chatbots and virtual assistants. However, existing LLM frameworks face limitations in handling domain-specific data analytics tasks with rich data structures. Moreover, they struggle with flexibility to meet diverse user requirements. To addres… ▽ More

    Submitted 1 December, 2023; v1 submitted 29 November, 2023; originally announced November 2023.

  24. arXiv:2311.13847  [pdf, other]

    cs.CV cs.IT eess.IV

    Perceptual Image Compression with Cooperative Cross-Modal Side Information

    Authors: Shiyu Qin, Bin Chen, Yujun Huang, Baoyi An, Tao Dai, Shu-Tao Xia

    Abstract: The explosion of data has resulted in more and more associated text being transmitted along with images. Inspired by distributed source coding, many works utilize image side information to enhance image compression. However, existing methods generally do not consider using text as side information to enhance perceptual compression of images, even though the benefits of multimodal synergy have… ▽ More

    Submitted 28 November, 2023; v1 submitted 23 November, 2023; originally announced November 2023.

  25. arXiv:2311.13846  [pdf, other]

    cs.CV cs.IT

    Progressive Learning with Visual Prompt Tuning for Variable-Rate Image Compression

    Authors: Shiyu Qin, Yimin Zhou, Jinpeng Wang, Bin Chen, Baoyi An, Tao Dai, Shu-Tao Xia

    Abstract: In this paper, we propose a progressive learning paradigm for transformer-based variable-rate image compression. Our approach covers a wide range of compression rates with the assistance of the Layer-adaptive Prompt Module (LPM). Inspired by visual prompt tuning, we use LPM to extract prompts for input images and hidden features at the encoder side and decoder side, respectively, which are fed as… ▽ More

    Submitted 28 November, 2023; v1 submitted 23 November, 2023; originally announced November 2023.

  26. arXiv:2311.05144  [pdf, other]

    cs.LG cs.SE

    Counter-Empirical Attacking based on Adversarial Reinforcement Learning for Time-Relevant Scoring System

    Authors: Xiangguo Sun, Hong Cheng, Hang Dong, Bo Qiao, Si Qin, Qingwei Lin

    Abstract: Scoring systems are commonly seen for platforms in the era of big data. From credit scoring systems in financial services to membership scores in E-commerce shopping platforms, platform managers use such systems to guide users towards the encouraged activity pattern, and manage resources more effectively and more efficiently thereby. To establish such scoring systems, several "empirical criteria"… ▽ More

    Submitted 19 December, 2023; v1 submitted 8 November, 2023; originally announced November 2023.

    Comments: Accepted by TKDE

  27. arXiv:2311.04254  [pdf, other]

    cs.AI cs.LG

    Everything of Thoughts: Defying the Law of Penrose Triangle for Thought Generation

    Authors: Ruomeng Ding, Chaoyun Zhang, Lu Wang, Yong Xu, Minghua Ma, Wei Zhang, Si Qin, Saravan Rajmohan, Qingwei Lin, Dongmei Zhang

    Abstract: Recent advancements in Large Language Models (LLMs) have revolutionized decision-making by breaking down complex problems into more manageable language sequences referred to as "thoughts". An effective thought design should consider three key perspectives: performance, efficiency, and flexibility. However, existing thought can at most exhibit two of these attributes. To address these limitations,… ▽ More

    Submitted 23 February, 2024; v1 submitted 7 November, 2023; originally announced November 2023.

    Comments: 17 pages, 5 figures

  28. arXiv:2310.17674  [pdf, other]

    cs.CV

    Hierarchical Text Spotter for Joint Text Spotting and Layout Analysis

    Authors: Shangbang Long, Siyang Qin, Yasuhisa Fujii, Alessandro Bissacco, Michalis Raptis

    Abstract: We propose Hierarchical Text Spotter (HTS), a novel method for the joint task of word-level text spotting and geometric layout analysis. HTS can recognize text in an image and identify its 4-level hierarchical structure: characters, words, lines, and paragraphs. The proposed HTS is characterized by two novel components: (1) a Unified-Detector-Polygon (UDP) that produces Bezier Curve polygons of te… ▽ More

    Submitted 25 October, 2023; originally announced October 2023.

    Comments: Accepted to WACV 2024

  29. arXiv:2309.13363  [pdf, other]

    cs.LG cs.AI

    MLPST: MLP is All You Need for Spatio-Temporal Prediction

    Authors: Zijian Zhang, Ze Huang, Zhiwei Hu, Xiangyu Zhao, Wanyu Wang, Zitao Liu, Junbo Zhang, S. Joe Qin, Hongwei Zhao

    Abstract: Traffic prediction is a typical spatio-temporal data mining task and has great significance to the public transportation system. Considering the demand for its grand application, we recognize key factors for an ideal spatio-temporal prediction method: efficient, lightweight, and effective. However, the current deep model-based spatio-temporal prediction solutions generally own intricate architectu… ▽ More

    Submitted 23 September, 2023; originally announced September 2023.

  30. arXiv:2309.05091  [pdf, other]

    cs.HC cs.MM

    SpeechMirror: A Multimodal Visual Analytics System for Personalized Reflection of Online Public Speaking Effectiveness

    Authors: Zeyuan Huang, Qiang He, Kevin Maher, Xiaoming Deng, Yu-Kun Lai, Cuixia Ma, Sheng-feng Qin, Yong-Jin Liu, Hongan Wang

    Abstract: As communications are increasingly taking place virtually, the ability to present well online is becoming an indispensable skill. Online speakers are facing unique challenges in engaging with remote audiences. However, there has been a lack of evidence-based analytical systems for people to comprehensively evaluate online speeches and further discover possibilities for improvement. This paper intr… ▽ More

    Submitted 10 September, 2023; originally announced September 2023.

    Comments: Main paper (11 pages, 6 figures) and Supplemental document (11 pages, 11 figures). Accepted by VIS 2023

  31. arXiv:2308.13988  [pdf, other]

    cs.RO

    A Robot Leg with Compact Variable Stiffness Joint based on Leaf-Spring Mechanism

    Authors: Lei Yu, Haizhou Zhao, Siying Qin, Yuqing Chen

    Abstract: The legged robots with variable stiffness actuators (VSAs) can achieve energy-efficient and versatile locomotion. However, equipping legged robots with VSAs in real-world application is usually restricted by (i) the redundant mechanical structure design, (ii) limited stiffness variation range and speed, and (iii) high energy consumption in stiffness modulation. In this paper, we present a novel Va… ▽ More

    Submitted 26 August, 2023; originally announced August 2023.

  32. arXiv:2308.13205  [pdf, other]

    cs.RO eess.SY

    Design and Control of a Bio-inspired Wheeled Bipedal Robot

    Authors: Haizhou Zhao, Lei Yu, Siying Qin, Yuqing Chen

    Abstract: Wheeled bipedal robots have the capability to execute agile and versatile locomotion tasks in unknown terrains, with balancing being a key criterion in evaluating their dynamic performance. This paper focuses on enhancing the balancing performance of wheeled bipedal robots through innovations in both hardware and software aspects. A bio-inspired mechanical design, inspired by the human barbell squ… ▽ More

    Submitted 15 January, 2024; v1 submitted 25 August, 2023; originally announced August 2023.

  33. arXiv:2308.12575  [pdf, other]

    cs.LG

    Hypergraph Convolutional Networks for Fine-grained ICU Patient Similarity Analysis and Risk Prediction

    Authors: Yuxi Liu, Zhenhao Zhang, Shaowen Qin, Flora D. Salim, Antonio Jimeno Yepes, Jun Shen, Jiang Bian

    Abstract: The Intensive Care Unit (ICU) is one of the most important parts of a hospital, which admits critically ill patients and provides continuous monitoring and treatment. Various patient outcome prediction methods have been attempted to assist healthcare professionals in clinical decision-making. Existing methods focus on measuring the similarity between patients using deep neural networks to capture… ▽ More

    Submitted 21 October, 2023; v1 submitted 24 August, 2023; originally announced August 2023.

    Comments: 16 pages, 2 figures

  34. arXiv:2308.09896  [pdf, other]

    cs.LG

    Contrastive Learning-based Imputation-Prediction Networks for In-hospital Mortality Risk Modeling using EHRs

    Authors: Yuxi Liu, Zhenhao Zhang, Shaowen Qin, Flora D. Salim, Antonio Jimeno Yepes

    Abstract: Predicting the risk of in-hospital mortality from electronic health records (EHRs) has received considerable attention. Such predictions will provide early warning of a patient's health condition to healthcare professionals so that timely interventions can be taken. This prediction task is challenging since EHR data are intrinsically irregular, with not only many missing values but also varying ti… ▽ More

    Submitted 18 August, 2023; originally announced August 2023.

    Comments: 15 pages, 2 figures, accepted at ECML PKDD 2023

  35. arXiv:2308.03985  [pdf, other]

    cs.LG math.NA physics.flu-dyn

    Fourier neural operator for real-time simulation of 3D dynamic urban microclimate

    Authors: Wenhui Peng, Shaoxiang Qin, Senwen Yang, Jianchun Wang, Xue Liu, Liangzhu Leon Wang

    Abstract: Global urbanization has underscored the significance of urban microclimates for human comfort, health, and building/urban energy efficiency. They profoundly influence building design and urban planning as major environmental impacts. Understanding local microclimates is essential for cities to prepare for climate change and effectively implement resilience measures. However, analyzing urban microc… ▽ More

    Submitted 30 September, 2023; v1 submitted 7 August, 2023; originally announced August 2023.

  36. arXiv:2307.10438  [pdf, ps, other]

    cs.LG physics.chem-ph q-bio.BM

    Uncertainty Quantification for Molecular Property Predictions with Graph Neural Architecture Search

    Authors: Shengli Jiang, Shiyi Qin, Reid C. Van Lehn, Prasanna Balaprakash, Victor M. Zavala

    Abstract: Graph Neural Networks (GNNs) have emerged as a prominent class of data-driven methods for molecular property prediction. However, a key limitation of typical GNN models is their inability to quantify uncertainties in the predictions. This capability is crucial for ensuring the trustworthy use and deployment of models in downstream tasks. To that end, we introduce AutoGNNUQ, an automated uncertaint… ▽ More

    Submitted 31 March, 2024; v1 submitted 19 July, 2023; originally announced July 2023.

  37. arXiv:2306.14498  [pdf, other]

    cs.CR cs.LG

    Practical Privacy-Preserving Gaussian Process Regression via Secret Sharing

    Authors: Jinglong Luo, Yehong Zhang, Jiaqi Zhang, Shuang Qin, Hui Wang, Yue Yu, Zenglin Xu

    Abstract: Gaussian process regression (GPR) is a non-parametric model that has been used in many real-world applications that involve sensitive personal data (e.g., healthcare, finance, etc.) from multiple data owners. To fully and securely exploit the value of different data sources, this paper proposes a privacy-preserving GPR method based on secret sharing (SS), a secure multi-party computation (SMPC) te… ▽ More

    Submitted 26 June, 2023; originally announced June 2023.

    Comments: Accepted for the 39th Conference on Uncertainty in Artificial Intelligence (UAI 2023)

  38. arXiv:2305.11598  [pdf, other]

    cs.AI cs.CL

    Introspective Tips: Large Language Model for In-Context Decision Making

    Authors: Liting Chen, Lu Wang, Hang Dong, Yali Du, Jie Yan, Fangkai Yang, Shuang Li, Pu Zhao, Si Qin, Saravan Rajmohan, Qingwei Lin, Dongmei Zhang

    Abstract: The emergence of large language models (LLMs) has substantially influenced natural language processing, demonstrating exceptional results across various tasks. In this study, we employ "Introspective Tips" to facilitate LLMs in self-optimizing their decision-making. By introspectively examining trajectories, LLM refines its policy by generating succinct and valuable tips. Our method enhances the… ▽ More

    Submitted 19 May, 2023; originally announced May 2023.

    Comments: 22 pages, 4 figures

  39. arXiv:2305.09750  [pdf, other]

    cs.CV

    ICDAR 2023 Competition on Hierarchical Text Detection and Recognition

    Authors: Shangbang Long, Siyang Qin, Dmitry Panteleev, Alessandro Bissacco, Yasuhisa Fujii, Michalis Raptis

    Abstract: We organize a competition on hierarchical text detection and recognition. The competition is aimed to promote research into deep learning models and systems that can jointly perform text detection and recognition and geometric layout analysis. We present details of the proposed competition organization, including tasks, datasets, evaluations, and schedule. During the competition period (from Janua… ▽ More

    Submitted 16 May, 2023; originally announced May 2023.

    Comments: ICDAR 2023 competition report by organizers (accepted and to be published officially later)

  40. arXiv:2305.02549  [pdf, other]

    cs.CL cs.CV cs.LG

    FormNetV2: Multimodal Graph Contrastive Learning for Form Document Information Extraction

    Authors: Chen-Yu Lee, Chun-Liang Li, Hao Zhang, Timothy Dozat, Vincent Perot, Guolong Su, Xiang Zhang, Kihyuk Sohn, Nikolai Glushnev, Renshen Wang, Joshua Ainslie, Shangbang Long, Siyang Qin, Yasuhisa Fujii, Nan Hua, Tomas Pfister

    Abstract: The recent advent of self-supervised pre-training techniques has led to a surge in the use of multimodal learning in form document understanding. However, existing approaches that extend the mask language modeling to other modalities require careful multi-task tuning, complex reconstruction target designs, or additional pre-training data. In FormNetV2, we introduce a centralized multimodal graph c… ▽ More

    Submitted 13 June, 2023; v1 submitted 4 May, 2023; originally announced May 2023.

    Comments: Accepted to ACL 2023

  41. arXiv:2211.11759  [pdf, other]

    cs.LG cs.AI cs.MA

    Learning Cooperative Oversubscription for Cloud by Chance-Constrained Multi-Agent Reinforcement Learning

    Authors: Junjie Sheng, Lu Wang, Fangkai Yang, Bo Qiao, Hang Dong, Xiangfeng Wang, Bo Jin, Jun Wang, Si Qin, Saravan Rajmohan, Qingwei Lin, Dongmei Zhang

    Abstract: Oversubscription is a common practice for improving cloud resource utilization. It allows the cloud service provider to sell more resources than the physical limit, assuming not all users would fully utilize the resources simultaneously. However, how to design an oversubscription policy that improves utilization while satisfying some safety constraints remains an open problem. Existing methods…

    Submitted 21 November, 2022; originally announced November 2022.

  42. arXiv:2211.06045  [pdf, other]

    cs.LG

    Integrated Convolutional and Recurrent Neural Networks for Health Risk Prediction using Patient Journey Data with Many Missing Values

    Authors: Yuxi Liu, Shaowen Qin, Antonio Jimeno Yepes, Wei Shao, Zhenhao Zhang, Flora D. Salim

    Abstract: Predicting the health risks of patients using Electronic Health Records (EHR) has attracted considerable attention in recent years, especially with the development of deep learning techniques. Health risk refers to the probability of the occurrence of a specific health outcome for a specific patient. The predicted risks can be used to support decision-making by healthcare professionals. EHRs are s…

    Submitted 13 November, 2022; v1 submitted 11 November, 2022; originally announced November 2022.

    Comments: 6 pages, 2 figures, accepted at IEEE BIBM 2022

  43. arXiv:2210.07848  [pdf, other]

    cs.CV

    Convolutional Neural Networks: Basic Concepts and Applications in Manufacturing

    Authors: Shengli Jiang, Shiyi Qin, Joshua L. Pulsipher, Victor M. Zavala

    Abstract: We discuss basic concepts of convolutional neural networks (CNNs) and outline uses in manufacturing. We begin by discussing how different types of data objects commonly encountered in manufacturing (e.g., time series, images, micrographs, videos, spectra, molecular structures) can be represented in a flexible manner using tensors and graphs. We then discuss how CNNs use convolution operations to e…

    Submitted 14 October, 2022; originally announced October 2022.

  44. arXiv:2210.06293  [pdf, other]

    eess.SP cs.AI cs.CV cs.LG

    Two-stream Network for ECG Signal Classification

    Authors: Xinyao Hou, Shengmei Qin, Jianbo Su

    Abstract: Electrocardiogram (ECG), a technique for medical monitoring of cardiac activity, is an important method for identifying cardiovascular disease. However, analyzing the increasing quantity of ECG data consumes a lot of medical resources. This paper explores an effective algorithm for automatic multi-class classification of heartbeat types based on ECG. Most neural network based methods target…

    Submitted 5 October, 2022; originally announced October 2022.

  45. arXiv:2210.01485  [pdf, other]

    cs.CV

    APAUNet: Axis Projection Attention UNet for Small Target in 3D Medical Segmentation

    Authors: Yuncheng Jiang, Zixun Zhang, Shixi Qin, Yao Guo, Zhen Li, Shuguang Cui

    Abstract: In 3D medical image segmentation, small-target segmentation is crucial for diagnosis but remains challenging. In this paper, we propose the Axis Projection Attention UNet, named APAUNet, for 3D medical image segmentation, especially for small targets. Considering the large proportion of the background in the 3D feature space, we introduce a projection strategy to project the 3D features into t…

    Submitted 4 October, 2022; originally announced October 2022.

    Comments: Accepted by ACCV 2022

  46. arXiv:2209.09327  [pdf, ps, other]

    cs.PL cs.SE

    S2TD: a Separation Logic Verifier that Supports Reasoning of the Absence and Presence of Bugs

    Authors: Quang Loc Le, Jun Sun, Long H. Pham, Shengchao Qin

    Abstract: Heap-manipulating programs are known to be challenging to reason about. We present a novel verifier for heap-manipulating programs called S2TD, which encodes programs systematically in the form of Constrained Horn Clauses (CHC) using a novel extension of separation logic (SL) with recursive predicates and dangling predicates. S2TD actively explores cyclic proofs to address the path explosion probl…

    Submitted 19 September, 2022; originally announced September 2022.

    Comments: 24 pages

    MSC Class: 68N15

  47. arXiv:2209.02514  [pdf, other]

    cs.CV cs.IT

    Learned Distributed Image Compression with Multi-Scale Patch Matching in Feature Domain

    Authors: Yujun Huang, Bin Chen, Shiyu Qin, Jiawei Li, Yaowei Wang, Tao Dai, Shu-Tao Xia

    Abstract: Beyond achieving higher compression efficiency over classical image compression codecs, deep image compression is expected to be improved with additional side information, e.g., another image from a different perspective of the same scene. To better utilize the side information under the distributed compression scenario, the existing method (Ayzik and Avidan 2020) only implements patch matching at…

    Submitted 27 November, 2022; v1 submitted 6 September, 2022; originally announced September 2022.

    Comments: Accepted by AAAI 2023

  48. arXiv:2208.01320  [pdf, other]

    cs.LG

    Compound Density Networks for Risk Prediction using Electronic Health Records

    Authors: Yuxi Liu, Shaowen Qin, Zhenhao Zhang, Wei Shao

    Abstract: Electronic Health Records (EHRs) exhibit a high amount of missing data due to variations of patient conditions and treatment needs. Imputation of missing values has been considered an effective approach to deal with this challenge. Existing work treats the imputation method and the prediction model as two independent parts of an EHR-based machine learning system. We propose an integrated end-to-end app…

    Submitted 24 October, 2022; v1 submitted 2 August, 2022; originally announced August 2022.

    Comments: 8 pages, 6 figures, accepted at IEEE BIBM 2022

  49. Solving the Batch Stochastic Bin Packing Problem in Cloud: A Chance-constrained Optimization Approach

    Authors: Jie Yan, Yunlei Lu, Liting Chen, Si Qin, Yixin Fang, Qingwei Lin, Thomas Moscibroda, Saravan Rajmohan, Dongmei Zhang

    Abstract: This paper investigates a critical resource allocation problem in the first-party cloud: scheduling containers to machines. There are tens of services and each service runs a set of homogeneous containers with dynamic resource usage; containers of a service are scheduled daily in a batch fashion. This problem can be naturally formulated as a Stochastic Bin Packing Problem (SBPP). However, traditiona…

    Submitted 20 July, 2022; originally announced July 2022.

    Comments: To appear in SIGKDD 2022 as Research Track paper

    MSC Class: 90-00; 90C11 ACM Class: G.1.6

  50. arXiv:2205.10230  [pdf, ps, other]

    math.NA cs.NE physics.comp-ph

    RAR-PINN algorithm for the data-driven vector-soliton solutions and parameter discovery of coupled nonlinear equations

    Authors: Shu-Mei Qin, Min Li, Tao Xu, Shao-Qun Dong

    Abstract: This work aims to provide an effective deep learning framework to predict the vector-soliton solutions of the coupled nonlinear equations and their interactions. The method we propose here is a physics-informed neural network (PINN) combined with the residual-based adaptive refinement (RAR-PINN) algorithm. Unlike the traditional PINN algorithm, which takes points randomly, the RAR-PINN alg…

    Submitted 29 April, 2022; originally announced May 2022.