Showing 1–50 of 90 results for author: Gu, B

  1. arXiv:2405.18725  [pdf, other]

    cs.LG cs.MA

    Can We Enhance the Quality of Mobile Crowdsensing Data Without Ground Truth?

    Authors: Jiajie Li, Bo Gu, Shimin Gong, Zhou Su, Mohsen Guizani

    Abstract: Mobile crowdsensing (MCS) has emerged as a prominent trend across various domains. However, ensuring the quality of the sensing data submitted by mobile users (MUs) remains a complex and challenging problem. To address this challenge, an advanced method is required to detect low-quality sensing data and identify malicious MUs that may disrupt the normal operations of an MCS system. Therefore, this…

    Submitted 28 May, 2024; originally announced May 2024.

  2. arXiv:2405.01615  [pdf, other]

    cs.NE cs.LG

    Hard-Thresholding Meets Evolution Strategies in Reinforcement Learning

    Authors: Chengqian Gao, William de Vazelhes, Hualin Zhang, Bin Gu, Zhiqiang Xu

    Abstract: Evolution Strategies (ES) have emerged as a competitive alternative for model-free reinforcement learning, showcasing exemplary performance in tasks like MuJoCo and Atari. Notably, they shine in scenarios with imperfect reward functions, making them invaluable for real-world applications where dense reward signals may be elusive. Yet, an inherent assumption in ES, that all input features are task-…

    Submitted 2 May, 2024; originally announced May 2024.

    Comments: 16 pages, including proofs in the appendix

  3. arXiv:2404.19449  [pdf, other]

    cs.IT

    AoI-aware Sensing Scheduling and Trajectory Optimization for Multi-UAV-assisted Wireless Backscatter Networks

    Authors: Yusi Long, Songhan Zhao, Shimin Gong, Bo Gu, Dusit Niyato, Xuemin Shen

    Abstract: This paper considers multiple unmanned aerial vehicles (UAVs) to assist sensing data transmissions from the ground users (GUs) to a remote base station (BS). Each UAV collects sensing data from the GUs and then forwards the sensing data to the remote BS. The GUs first backscatter their data to the UAVs and then all UAVs forward data to the BS by the nonorthogonal multiple access (NOMA) transmissio…

    Submitted 30 April, 2024; originally announced April 2024.

    Comments: This paper has been accepted by IEEE TVT

  4. arXiv:2404.08885  [pdf, other]

    cs.PL cs.CL cs.LG

    Is Next Token Prediction Sufficient for GPT? Exploration on Code Logic Comprehension

    Authors: Mengnan Qi, Yufan Huang, Yongqiang Yao, Maoquan Wang, Bin Gu, Neel Sundaresan

    Abstract: Large language models (LLMs) have experienced exponential growth and demonstrate remarkable performance across various tasks. Notwithstanding, contemporary research primarily centers on enhancing the size and quality of pretraining data, still utilizing the next token prediction task on autoregressive transformer model structure. The efficacy of this task in truly facilitating the model's compreh…

    Submitted 12 April, 2024; originally announced April 2024.

  5. arXiv:2404.01897  [pdf, other]

    cs.NE cs.AI cs.LG

    Continuous Spiking Graph Neural Networks

    Authors: Nan Yin, Mengzhu Wan, Li Shen, Hitesh Laxmichand Patel, Baopu Li, Bin Gu, Huan Xiong

    Abstract: Continuous graph neural networks (CGNNs) have garnered significant attention due to their ability to generalize existing discrete graph neural networks (GNNs) by introducing continuous dynamics. They typically draw inspiration from diffusion-based methods to introduce a novel propagation scheme, which is analyzed using ordinary differential equations (ODEs). However, the implementation of CGNNs req…

    Submitted 2 April, 2024; originally announced April 2024.

  6. arXiv:2403.18388  [pdf, other]

    cs.AI cs.CV

    FTBC: Forward Temporal Bias Correction for Optimizing ANN-SNN Conversion

    Authors: Xiaofeng Wu, Velibor Bojkovic, Bin Gu, Kun Suo, Kai Zou

    Abstract: Spiking Neural Networks (SNNs) offer a promising avenue for energy-efficient computing compared with Artificial Neural Networks (ANNs), closely mirroring biological neural processes. However, this potential comes with inherent challenges in directly training SNNs through spatio-temporal backpropagation -- stemming from the temporal dynamics of spiking neurons and their discrete signal processing -…

    Submitted 27 March, 2024; originally announced March 2024.

  7. arXiv:2402.13241  [pdf, other]

    cs.LG cs.AI

    Federated Causal Discovery from Heterogeneous Data

    Authors: Loka Li, Ignavier Ng, Gongxu Luo, Biwei Huang, Guangyi Chen, Tongliang Liu, Bin Gu, Kun Zhang

    Abstract: Conventional causal discovery methods rely on centralized data, which is inconsistent with the decentralized nature of data in many real-world situations. This discrepancy has motivated the development of federated causal discovery (FCD) approaches. However, existing FCD methods may be limited by their potentially restrictive assumptions of identifiable functional causal models or homogeneous data…

    Submitted 26 February, 2024; v1 submitted 20 February, 2024; originally announced February 2024.

    Comments: ICLR 2024

  8. arXiv:2402.01146  [pdf, other]

    cs.LG

    Limited Memory Online Gradient Descent for Kernelized Pairwise Learning with Dynamic Averaging

    Authors: Hilal AlQuabeh, William de Vazelhes, Bin Gu

    Abstract: Pairwise learning, an important domain within machine learning, addresses loss functions defined on pairs of training examples, including those in metric learning and AUC maximization. Acknowledging the quadratic growth in computational complexity accompanying pairwise loss as the sample size grows, researchers have turned to online gradient descent (OGD) methods for enhanced scalability. Recently,…

    Submitted 2 February, 2024; originally announced February 2024.

    Comments: Accepted in AAAI 2024

  9. arXiv:2401.12983  [pdf]

    cs.CL cs.AI physics.ed-ph

    Assessing Large Language Models in Mechanical Engineering Education: A Study on Mechanics-Focused Conceptual Understanding

    Authors: Jie Tian, Jixin Hou, Zihao Wu, Peng Shu, Zhengliang Liu, Yujie Xiang, Beikang Gu, Nicholas Filla, Yiwei Li, Ning Liu, Xianyan Chen, Keke Tang, Tianming Liu, Xianqiao Wang

    Abstract: This study is a pioneering endeavor to investigate the capabilities of Large Language Models (LLMs) in addressing conceptual questions within the domain of mechanical engineering with a focus on mechanics. Our examination involves a manually crafted exam encompassing 126 multiple-choice questions, spanning various aspects of mechanics courses, including Fluid Mechanics, Mechanical Vibration, Engin…

    Submitted 13 January, 2024; originally announced January 2024.

    Comments: 30 pages, 7 figures, and 1 table

  10. arXiv:2401.06401   

    cs.SE cs.AI cs.CL

    DevEval: Evaluating Code Generation in Practical Software Projects

    Authors: Jia Li, Ge Li, Yunfei Zhao, Yongmin Li, Zhi Jin, Hao Zhu, Huanyu Liu, Kaibo Liu, Lecheng Wang, Zheng Fang, Lanshen Wang, Jiazheng Ding, Xuanming Zhang, Yihong Dong, Yuqi Zhu, Bin Gu, Mengfei Yang

    Abstract: How to evaluate Large Language Models (LLMs) in code generation is an open question. Many benchmarks have been proposed but are inconsistent with practical software projects, e.g., unreal program distributions, insufficient dependencies, and small-scale project contexts. Thus, the capabilities of LLMs in practical projects are still unclear. In this paper, we propose a new benchmark named DevEval,…

    Submitted 5 March, 2024; v1 submitted 12 January, 2024; originally announced January 2024.

    Comments: We are re-checking this benchmark and repeating related experiments. New versions of DevEval will be released later

  11. arXiv:2401.05394  [pdf, other]

    eess.SP cs.LG math.OC stat.ML

    Iterative Regularization with k-support Norm: An Important Complement to Sparse Recovery

    Authors: William de Vazelhes, Bhaskar Mukhoty, Xiao-Tong Yuan, Bin Gu

    Abstract: Sparse recovery is ubiquitous in machine learning and signal processing. Due to the NP-hard nature of sparse recovery, existing methods are known to suffer either from restrictive (or even unknown) applicability conditions, or high computational cost. Recently, iterative regularization methods have emerged as a promising fast approach because they can achieve sparse recovery in one pass through ea…

    Submitted 19 March, 2024; v1 submitted 19 December, 2023; originally announced January 2024.

    Comments: Accepted at AAAI 2024. Code at https://github.com/wdevazelhes/IRKSN_AAAI2024

  12. arXiv:2401.05373  [pdf, other]

    cs.NE cs.AI cs.LG

    Dynamic Spiking Graph Neural Networks

    Authors: Nan Yin, Mengzhu Wang, Zhenghan Chen, Giulia De Masi, Bin Gu, Huan Xiong

    Abstract: The integration of Spiking Neural Networks (SNNs) and Graph Neural Networks (GNNs) is gradually attracting attention due to the low power consumption and high efficiency in processing the non-Euclidean data represented by graphs. However, as a common problem, dynamic graph representation learning faces challenges such as high complexity and large memory overheads. Current work often uses SNNs inst…

    Submitted 15 December, 2023; originally announced January 2024.

  13. arXiv:2312.11508  [pdf, other]

    cs.CL cs.AI

    Rethinking the Instruction Quality: LIFT is What You Need

    Authors: Yang Xu, Yongqiang Yao, Yufan Huang, Mengnan Qi, Maoquan Wang, Bin Gu, Neel Sundaresan

    Abstract: Instruction tuning, a specialized technique to enhance large language model (LLM) performance via instruction datasets, relies heavily on the quality of employed data. Existing quality improvement methods alter instruction data through dataset expansion or curation. However, the expansion method risks data redundancy, potentially compromising LLM performance, while the curation approach confines t…

    Submitted 27 December, 2023; v1 submitted 11 December, 2023; originally announced December 2023.

  14. arXiv:2311.15368  [pdf, other]

    cs.CV

    Flow-Guided Diffusion for Video Inpainting

    Authors: Bohai Gu, Yongsheng Yu, Heng Fan, Libo Zhang

    Abstract: Video inpainting has been challenged by complex scenarios like large movements and low-light conditions. Current methods, including emerging diffusion models, face limitations in quality and efficiency. This paper introduces the Flow-Guided Diffusion model for Video Inpainting (FGDVI), a novel approach that significantly enhances temporal consistency and inpainting quality via reusing an off-the-s…

    Submitted 26 November, 2023; originally announced November 2023.

  15. arXiv:2311.06816  [pdf, other]

    cs.LG cs.CV

    On original and latent space connectivity in deep neural networks

    Authors: Boyang Gu, Anastasia Borovykh

    Abstract: We study whether inputs from the same class can be connected by a continuous path, in original or latent representation space, such that all points on the path are mapped by the neural network model to the same class. Understanding how the neural network views its own input space and how the latent spaces are structured has value for explainability and robustness. We show that paths, linear or non…

    Submitted 12 November, 2023; originally announced November 2023.

  16. arXiv:2311.05112  [pdf]

    cs.CL cs.AI

    A Survey of Large Language Models in Medicine: Progress, Application, and Challenge

    Authors: Hongjian Zhou, Fenglin Liu, Boyang Gu, Xinyu Zou, Jinfa Huang, Jinge Wu, Yiru Li, Sam S. Chen, Peilin Zhou, Junling Liu, Yining Hua, Chengfeng Mao, Chenyu You, Xian Wu, Yefeng Zheng, Lei Clifton, Zheng Li, Jiebo Luo, David A. Clifton

    Abstract: Large language models (LLMs), such as ChatGPT, have received substantial attention due to their capabilities for understanding and generating human language. While there has been a burgeoning trend in research focusing on the employment of LLMs in supporting different medical tasks (e.g., enhancing clinical diagnostics and providing medical education), a review of these efforts, particularly their…

    Submitted 15 May, 2024; v1 submitted 8 November, 2023; originally announced November 2023.

    Comments: Preprint. Version 5. 6 figures; 14 tables; 41 pages

  17. arXiv:2310.14209  [pdf, other]

    cs.SE cs.LG

    SUT: Active Defects Probing for Transcompiler Models

    Authors: Mengnan Qi, Yufan Huang, Maoquan Wang, Yongqiang Yao, Zihan Liu, Bin Gu, Colin Clement, Neel Sundaresan

    Abstract: Automatic program translation has enormous application value and hence has been attracting significant interest from AI researchers. However, we observe that current program translation models still make elementary syntax errors, particularly when the target language does not have syntax elements in the source language. Metrics like BLEU, CodeBLEU and computation accuracy may not expose these iss…

    Submitted 22 October, 2023; originally announced October 2023.

  18. arXiv:2310.11476  [pdf, other]

    cs.SE cs.LG

    Program Translation via Code Distillation

    Authors: Yufan Huang, Mengnan Qi, Yongqiang Yao, Maoquan Wang, Bin Gu, Colin Clement, Neel Sundaresan

    Abstract: Software version migration and program translation are an important and costly part of the lifecycle of large codebases. Traditional machine translation relies on parallel corpora for supervised translation, which is not feasible for program translation due to a dearth of aligned data. Recent unsupervised neural machine translation techniques have overcome data limitations by including techniques s…

    Submitted 17 October, 2023; originally announced October 2023.

  19. arXiv:2310.06483  [pdf, other]

    cs.LG

    Variance Reduced Online Gradient Descent for Kernelized Pairwise Learning with Limited Memory

    Authors: Hilal AlQuabeh, Bhaskar Mukhoty, Bin Gu

    Abstract: Pairwise learning is essential in machine learning, especially for problems involving loss functions defined on pairs of training examples. Online gradient descent (OGD) algorithms have been proposed to handle online pairwise learning, where data arrives sequentially. However, the pairwise nature of the problem makes scalability challenging, as the gradient computation for a new sample involves al…

    Submitted 10 October, 2023; originally announced October 2023.

    Comments: Accepted in ACML2023

  20. arXiv:2309.08965  [pdf, other]

    cs.AI cs.LG cs.MA

    Multiagent Reinforcement Learning with an Attention Mechanism for Improving Energy Efficiency in LoRa Networks

    Authors: Xu Zhang, Ziqi Lin, Shimin Gong, Bo Gu, Dusit Niyato

    Abstract: Long Range (LoRa) wireless technology, characterized by low power consumption and a long communication range, is regarded as one of the enabling technologies for the Industrial Internet of Things (IIoT). However, as the network scale increases, the energy efficiency (EE) of LoRa networks decreases sharply due to severe packet collisions. To address this issue, it is essential to appropriately assi…

    Submitted 16 September, 2023; originally announced September 2023.

    Comments: 6 pages, 3 figures, This paper has been accepted for publication in IEEE Global Communications Conference (GLOBECOM) 2023

  21. arXiv:2308.16031  [pdf, other]

    cs.IT

    Breaking the Interference and Fading Gridlock in Backscatter Communications: State-of-the-Art, Design Challenges, and Future Directions

    Authors: Bowen Gu, Dong Li, Haiyang Ding, Gongpu Wang, Chintha Tellambura

    Abstract: As the Internet of Things (IoT) advances by leaps and bounds, a multitude of devices are becoming interconnected, marking the onset of an era where all things are connected. While this growth opens up opportunities for novel products and applications, it also leads to increased energy demand and battery reliance for IoT devices, creating a significant bottleneck that hinders sustainable progress.…

    Submitted 9 January, 2024; v1 submitted 30 August, 2023; originally announced August 2023.

  22. arXiv:2306.16077  [pdf, other]

    cs.LG cs.AI cs.DC

    Secure and Fast Asynchronous Vertical Federated Learning via Cascaded Hybrid Optimization

    Authors: Ganyu Wang, Qingsong Zhang, Li Xiang, Boyu Wang, Bin Gu, Charles Ling

    Abstract: Vertical Federated Learning (VFL) attracts increasing attention because it empowers multiple parties to jointly train a privacy-preserving model over vertically partitioned data. Recent research has shown that applying zeroth-order optimization (ZOO) has many advantages in building a practical VFL algorithm. However, a vital problem with the ZOO-based VFL is its slow convergence rate, which limits…

    Submitted 29 June, 2023; v1 submitted 28 June, 2023; originally announced June 2023.

    Comments: Under Review

  23. arXiv:2306.13874  [pdf, other]

    cs.IT eess.SP

    Enhancing Spectrum Sensing via Reconfigurable Intelligent Surfaces: Passive or Active Sensing and How Many Reflecting Elements are Needed?

    Authors: Hao Xie, Dong Li, Bowen Gu

    Abstract: Cognitive radio has been proposed to alleviate the scarcity of available spectrum caused by the significant demand for wideband services and the fragmentation of spectrum resources. However, sensing performance is quite poor due to the low sensing signal-to-noise ratio, especially in complex environments with severe channel fading. Fortunately, reconfigurable intelligent surface (RIS)-aided spectr…

    Submitted 21 October, 2023; v1 submitted 24 June, 2023; originally announced June 2023.

  24. arXiv:2306.05751  [pdf, other]

    cs.LG stat.ME

    Advancing Counterfactual Inference through Nonlinear Quantile Regression

    Authors: Shaoan Xie, Biwei Huang, Bin Gu, Tongliang Liu, Kun Zhang

    Abstract: The capacity to address counterfactual "what if" inquiries is crucial for understanding and making use of causal influences. Traditional counterfactual inference, under Pearl's counterfactual framework, typically depends on having access to or estimating a structural causal model. Yet, in practice, this causal model is often unknown and might be challenging to identify. Hence, this paper aims to p…

    Submitted 27 February, 2024; v1 submitted 9 June, 2023; originally announced June 2023.

  25. arXiv:2306.01260  [pdf, other]

    cs.SE

    FREPA: An Automated and Formal Approach to Requirement Modeling and Analysis in Aircraft Control Domain

    Authors: Jincao Feng, Weikai Miao, Hanyue Zheng, Yihao Huang, Jianwen Li, Zheng Wang, Ting Su, Bin Gu, Geguang Pu, Mengfei Yang, Jifeng He

    Abstract: Formal methods are promising for modeling and analyzing system requirements. However, applying formal methods to large-scale industrial projects remains a challenge. Industrial engineers suffer from a lack of automated engineering methodologies to effectively conduct precise requirement models, and rigorously validate and verify (V&V) the generated models. To tackle this challeng…

    Submitted 2 June, 2023; originally announced June 2023.

    Comments: 12 pages, Published by FSE 2020

  26. arXiv:2305.14689  [pdf, other]

    stat.ML cs.LG math.ST

    Under-Parameterized Double Descent for Ridge Regularized Least Squares Denoising of Data on a Line

    Authors: Rishi Sonthalia, Xinyue Li, Bochao Gu

    Abstract: The relationship between the number of training data points, the number of parameters in a statistical model, and the generalization capabilities of the model has been widely studied. Previous work has shown that double descent can occur in the over-parameterized regime, and it is believed that the standard bias-variance trade-off holds in the under-parameterized regime. In this paper, we present a simpl…

    Submitted 23 May, 2023; originally announced May 2023.

  27. arXiv:2305.09946  [pdf]

    eess.IV cs.CV cs.LG

    AdaMSS: Adaptive Multi-Modality Segmentation-to-Survival Learning for Survival Outcome Prediction from PET/CT Images

    Authors: Mingyuan Meng, Bingxin Gu, Michael Fulham, Shaoli Song, Dagan Feng, Lei Bi, Jinman Kim

    Abstract: Survival prediction is a major concern for cancer management. Deep survival models based on deep learning have been widely adopted to perform end-to-end survival prediction from medical images. Recent deep survival models achieved promising performance by jointly performing tumor segmentation with survival prediction, where the models were guided to extract tumor-related information through Multi-…

    Submitted 19 July, 2023; v1 submitted 17 May, 2023; originally announced May 2023.

    Comments: Under Review

  28. Computation-Efficient Backscatter-Blessed MEC with User Reciprocity

    Authors: Bowen Gu, Hao Xie, Dong Li

    Abstract: This letter proposes a new user cooperative offloading protocol called user reciprocity in backscatter communication (BackCom)-aided mobile edge computing systems with efficient computation, whose quintessence is that each user can switch alternately between the active and the BackCom mode in different slots, and one user works in the active mode and the other user works in the BackCom mode in each…

    Submitted 10 May, 2023; originally announced May 2023.

  29. arXiv:2304.11335  [pdf, other]

    cs.CV

    Two Birds, One Stone: A Unified Framework for Joint Learning of Image and Video Style Transfers

    Authors: Bohai Gu, Heng Fan, Libo Zhang

    Abstract: Current arbitrary style transfer models are limited to either image or video domains. In order to achieve satisfying image and video style transfers, two different models are inevitably required with separate training processes on image and video domains, respectively. In this paper, we show that this can be precluded by introducing UniST, a Unified Style Transfer framework for both images and vid…

    Submitted 1 September, 2023; v1 submitted 22 April, 2023; originally announced April 2023.

    Comments: International Conference on Computer Vision (ICCV 2023)

  30. arXiv:2303.01249  [pdf, ps, other]

    cs.CL cs.SD eess.AS

    Language-Universal Adapter Learning with Knowledge Distillation for End-to-End Multilingual Speech Recognition

    Authors: Zhijie Shen, Wu Guo, Bin Gu

    Abstract: In this paper, we propose a language-universal adapter learning framework based on a pre-trained model for end-to-end multilingual automatic speech recognition (ASR). For acoustic modeling, the wav2vec 2.0 pre-trained model is fine-tuned by inserting language-specific and language-universal adapters. An online knowledge distillation is then used to enable the language-universal adapters to learn b…

    Submitted 28 February, 2023; originally announced March 2023.

  31. arXiv:2302.09967  [pdf, other]

    cs.LG cs.AI

    Stability-based Generalization Analysis for Mixtures of Pointwise and Pairwise Learning

    Authors: Jiahuan Wang, Jun Chen, Hong Chen, Bin Gu, Weifu Li, Xin Tang

    Abstract: Recently, some mixture algorithms of pointwise and pairwise learning (PPL) have been formulated by employing the hybrid error metric of "pointwise loss + pairwise loss" and have shown empirical effectiveness on feature selection, ranking and recommendation tasks. However, to the best of our knowledge, the learning theory foundation of PPL has not been touched in the existing works. In this paper,…

    Submitted 20 February, 2023; originally announced February 2023.

    Comments: 20 pages

  32. arXiv:2302.09815  [pdf, other]

    stat.ML cs.LG

    On the Stability and Generalization of Triplet Learning

    Authors: Jun Chen, Hong Chen, Xue Jiang, Bin Gu, Weifu Li, Tieliang Gong, Feng Zheng

    Abstract: Triplet learning, i.e., learning from triplet data, has attracted much attention in computer vision tasks with an extremely large number of categories, e.g., face recognition and person re-identification. Despite rapid progress in designing and applying triplet learning algorithms, there is a lack of study on the theoretical understanding of their generalization performance. To fill this gap, t…

    Submitted 20 February, 2023; originally announced February 2023.

    Comments: AAAI2023

  33. arXiv:2302.00910  [pdf, other]

    cs.LG cs.AI

    Energy Efficient Training of SNN using Local Zeroth Order Method

    Authors: Bhaskar Mukhoty, Velibor Bojkovic, William de Vazelhes, Giulia De Masi, Huan Xiong, Bin Gu

    Abstract: Spiking neural networks are becoming increasingly popular for their low energy requirement in real-world tasks with accuracy comparable to the traditional ANNs. SNN training algorithms face the loss of gradient information and non-differentiability due to the Heaviside function in minimizing the model loss over model parameters. To circumvent the problem, the surrogate method uses a differentiable appr…

    Submitted 5 February, 2023; v1 submitted 2 February, 2023; originally announced February 2023.

  34. arXiv:2212.13390  [pdf, other]

    eess.SY cs.NI

    Hierarchical Deep Reinforcement Learning for Age-of-Information Minimization in IRS-aided and Wireless-powered Wireless Networks

    Authors: Shimin Gong, Leiyang Cui, Bo Gu, Bin Lyu, Dinh Thai Hoang, Dusit Niyato

    Abstract: In this paper, we focus on a wireless-powered sensor network coordinated by a multi-antenna access point (AP). Each node can generate sensing information and report the latest information to the AP using the energy harvested from the AP's signal beamforming. We aim to minimize the average age-of-information (AoI) by adapting the nodes' transmission scheduling and the transmission control strategie…

    Submitted 27 December, 2022; originally announced December 2022.

    Comments: 31 pages, 6 figures, 2 tables, 3 algorithms

  35. arXiv:2212.08298  [pdf, other]

    cs.IT eess.SP

    Exploring Hybrid Active-Passive RIS-Aided MEC Systems: From the Mode-Switching Perspective

    Authors: Hao Xie, Dong Li, Bowen Gu

    Abstract: Mobile edge computing (MEC) has been regarded as a promising technique to support latency-sensitive and computation-intensive services. However, the low offloading rate caused by the random channel fading characteristic becomes a major bottleneck in restricting the performance of the MEC. Fortunately, reconfigurable intelligent surface (RIS) can alleviate this problem since it can boost both the sp…

    Submitted 21 March, 2024; v1 submitted 16 December, 2022; originally announced December 2022.

  36. arXiv:2211.11751  [pdf, other]

    cs.LG cs.AI

    Denoising Multi-Similarity Formulation: A Self-paced Curriculum-Driven Approach for Robust Metric Learning

    Authors: Chenkang Zhang, Lei Luo, Bin Gu

    Abstract: Deep Metric Learning (DML) is a group of techniques that aim to measure the similarity between objects through the neural network. Although the number of DML methods has rapidly increased in recent years, most previous studies cannot effectively handle noisy data, which commonly exists in practical applications and often leads to serious performance deterioration. To overcome this limitation, in t…

    Submitted 1 December, 2022; v1 submitted 19 November, 2022; originally announced November 2022.

  37. arXiv:2210.05279  [pdf, other]

    cs.LG math.OC

    Zeroth-Order Hard-Thresholding: Gradient Error vs. Expansivity

    Authors: William de Vazelhes, Hualin Zhang, Huimin Wu, Xiao-Tong Yuan, Bin Gu

    Abstract: $\ell_0…

    Submitted 18 March, 2024; v1 submitted 11 October, 2022; originally announced October 2022.

    Comments: Accepted for publication at NeurIPS 2022

  38. arXiv:2210.03674  [pdf, other]

    cs.AI cs.MA eess.SY

    Reinforcement Learning Approach for Multi-Agent Flexible Scheduling Problems

    Authors: Hongjian Zhou, Boyang Gu, Chenghao Jin

    Abstract: Scheduling plays an important role in automated production. Its impact can be found in various fields such as the manufacturing industry, the service industry and the technology industry. A scheduling problem (NP-hard) is a task of finding a sequence of job assignments on a given set of machines with the goal of optimizing the objective defined. Methods such as Operations Research, Dispatching Rule…

    Submitted 7 October, 2022; originally announced October 2022.

  39. arXiv:2210.01496  [pdf, other]

    math.OC cs.LG

    Zeroth-Order Negative Curvature Finding: Escaping Saddle Points without Gradients

    Authors: Hualin Zhang, Huan Xiong, Bin Gu

    Abstract: We consider escaping saddle points of nonconvex problems where only the function evaluations can be accessed. Although a variety of works have been proposed, the majority of them require either second or first-order information, and only a few of them have exploited zeroth-order methods, particularly the technique of negative curvature finding with zeroth-order methods which has been proven to be…

    Submitted 4 October, 2022; originally announced October 2022.

  40. arXiv:2209.13100  [pdf, other]

    cs.IT eess.SP

    Gain without Pain: Recycling Reflected Energy from Wireless Powered RIS-aided Communications

    Authors: Hao Xie, Bowen Gu, Dong Li, Zhi Lin, Yongjun Xu

    Abstract: In this paper, we investigate and analyze energy recycling for a reconfigurable intelligent surface (RIS)-aided wireless-powered communication network. As opposed to the existing works where the energy harvested by Internet of Things (IoT) devices only comes from the power station, IoT devices are also allowed to recycle energy from other IoT devices. In particular, we propose group switching- and…

    Submitted 26 September, 2022; originally announced September 2022.

  41. arXiv:2209.12526  [pdf, other]

    cs.IT

    Exploiting Hybrid Active and Passive Multiple Access via Slotted ALOHA-Driven Backscatter Communications

    Authors: Bowen Gu, Hao Xie, Dong Li, Ye Liu, Yongjun Xu

    Abstract: In conventional backscatter communication (BackCom) systems, time division multiple access (TDMA) and frequency division multiple access (FDMA) are generally adopted for multiuser backscattering due to their simplicity in implementation. However, as the number of backscatter devices (BDs) proliferates, there will be a high overhead under the traditional centralized control techniques, and the inte…

    Submitted 12 May, 2023; v1 submitted 26 September, 2022; originally announced September 2022.

  42. arXiv:2209.07063  [pdf, other]

    cs.LG math.OC

    GAGA: Deciphering Age-path of Generalized Self-paced Regularizer

    Authors: Xingyu Qu, Diyang Li, Xiaohan Zhao, Bin Gu

    Abstract: Nowadays, self-paced learning (SPL) is an important machine learning paradigm that mimics the cognitive process of humans and animals. The SPL regime involves a self-paced regularizer and a gradually increasing age parameter, which plays a key role in SPL, but where to optimally terminate this process is still non-trivial to determine. A natural idea is to compute the solution path w.r.t. age parame…

    Submitted 23 September, 2022; v1 submitted 15 September, 2022; originally announced September 2022.

    Comments: 33 pages. Published as a conference paper at NeurIPS 2022

  43. arXiv:2208.06058  [pdf, other]

    cs.LG stat.ML

    An Accelerated Doubly Stochastic Gradient Method with Faster Explicit Model Identification

    Authors: Runxue Bao, Bin Gu, Heng Huang

    Abstract: Sparsity regularized loss minimization problems play an important role in various fields including machine learning, data mining, and modern statistics. The proximal gradient descent method and the coordinate descent method are the most popular approaches to solving the minimization problem. Although existing methods can achieve implicit model identification, aka support set identification, in a finite nu…

    Submitted 11 August, 2022; originally announced August 2022.

  44. arXiv:2207.04876  [pdf, other]

    cs.NE cs.AI cs.LG

    On the Intrinsic Structures of Spiking Neural Networks

    Authors: Shao-Qun Zhang, Jia-Yi Chen, Jin-Hui Wu, Gao Zhang, Huan Xiong, Bin Gu, Zhi-Hua Zhou

    Abstract: Recent years have seen a surge of interest in SNNs owing to their remarkable potential to handle time-dependent and event-driven data. The performance of SNNs hinges not only on selecting an apposite architecture and fine-tuning connection weights, similar to conventional ANNs, but also on the meticulous configuration of intrinsic structures within spiking computations. However, there has been…

    Submitted 16 November, 2023; v1 submitted 21 June, 2022; originally announced July 2022.

  45. arXiv:2207.03650  [pdf, other]

    cs.LG cs.AI

    Balanced Self-Paced Learning for AUC Maximization

    Authors: Bin Gu, Chenkang Zhang, Huan Xiong, Heng Huang

    Abstract: Learning to improve AUC performance is an important topic in machine learning. However, AUC maximization algorithms may decrease generalization performance due to the noisy data. Self-paced learning is an effective method for handling noisy data. However, existing self-paced learning methods are limited to pointwise learning, while AUC maximization is a pairwise learning problem. To solve this cha…

    Submitted 7 July, 2022; originally announced July 2022.

  46. arXiv:2206.15025  [pdf, other]

    cs.LG

    On the Convergence of Distributed Stochastic Bilevel Optimization Algorithms over a Network

    Authors: Hongchang Gao, Bin Gu, My T. Thai

    Abstract: Bilevel optimization has been applied to a wide variety of machine learning models, and numerous stochastic bilevel optimization algorithms have been developed in recent years. However, most existing algorithms restrict their focus on the single-machine setting so that they are incapable of handling the distributed data. To address this issue, under the setting where all participants compose a net…

    Submitted 27 March, 2023; v1 submitted 30 June, 2022; originally announced June 2022.

  47. arXiv:2206.02507  [pdf, other]

    cs.LG eess.SY

    Learning to Control under Time-Varying Environment

    Authors: Yuzhen Han, Ruben Solozabal, Jing Dong, Xingyu Zhou, Martin Takac, Bin Gu

    Abstract: This paper investigates the problem of regret minimization in linear time-varying (LTV) dynamical systems. Due to the simultaneous presence of uncertainty and non-stationarity, designing online control algorithms for unknown LTV systems remains a challenging task. At a cost of NP-hard offline planning, prior works have introduced online convex optimization algorithms, although they suffer from non…

    Submitted 6 June, 2022; originally announced June 2022.

  48. Exploiting Constructive Interference for Backscatter Communication Systems

    Authors: Bowen Gu, Dong Li, Ye Liu, Yongjun Xu

    Abstract: Backscatter communication (BackCom), one of the core technologies to realize zero-power communication, is expected to be a pivotal paradigm for the next generation of the Internet of Things (IoT). However, the "strong" direct link (DL) interference (DLI) is traditionally assumed to be harmful, and generally drowns out the "weak" backscattered signals accordingly, thus deteriorating the performance…

    Submitted 12 May, 2023; v1 submitted 22 May, 2022; originally announced May 2022.

  49. arXiv:2204.08362  [pdf]

    cs.ET physics.optics

    Hardware-algorithm collaborative computing with photonic spiking neuron chip based on integrated Fabry-Pérot laser with saturable absorber

    Authors: Shuiying Xiang, Yuechun Shi, Xingxing Guo, Yahui Zhang, Hongji Wang, Dianzhuang Zheng, Ziwei Song, Yanan Han, Shuang Gao, Shihao Zhao, Biling Gu, Hailing Wang, Xiaojun Zhu, Lianping Hou, Xiangfei Chen, Wanhua Zheng, Xiaohua Ma, Yue Hao

    Abstract: Photonic neuromorphic computing has emerged as a promising avenue toward building a low-latency and energy-efficient non-von-Neumann computing system. Photonic spiking neural network (PSNN) exploits brain-like spatiotemporal processing to realize high-performance neuromorphic computing. However, the nonlinear computation of PSNN remains a significant challenge. Here, we proposed and fabricated a…

    Submitted 18 April, 2022; originally announced April 2022.

    Comments: 10 pages, 8 figures

  50. arXiv:2203.10329  [pdf, other]

    cs.LG cs.DC math.OC

    Desirable Companion for Vertical Federated Learning: New Zeroth-Order Gradient Based Algorithm

    Authors: Qingsong Zhang, Bin Gu, Zhiyuan Dang, Cheng Deng, Heng Huang

    Abstract: Vertical federated learning (VFL) attracts increasing attention due to the emerging demands of multi-party collaborative modeling and concerns of privacy leakage. A complete list of metrics to evaluate VFL algorithms should include model applicability, privacy security, communication cost, and computation efficiency, where privacy security is especially important to VFL. However, to the best of ou…

    Submitted 19 March, 2022; originally announced March 2022.

    Comments: 23 pages, Accepted by CIKM 2021