Showing 1–50 of 189 results for author: Tian, H

  1. arXiv:2405.00482  [pdf, other]

    cs.CR cs.LG

    PackVFL: Efficient HE Packing for Vertical Federated Learning

    Authors: Liu Yang, Shuowei Cai, Di Chai, Junxue Zhang, Han Tian, Yilun Jin, Kun Guo, Kai Chen, Qiang Yang

    Abstract: As an essential tool of secure distributed machine learning, vertical federated learning (VFL) based on homomorphic encryption (HE) suffers from severe efficiency problems due to data inflation and time-consuming operations. To this core, we propose PackVFL, an efficient VFL framework based on packed HE (PackedHE), to accelerate the existing HE-based VFL algorithms. PackVFL packs multiple cleartex…

    Submitted 1 May, 2024; originally announced May 2024.

    Comments: 12 pages excluding references

  2. arXiv:2404.16821  [pdf, other]

    cs.CV

    How Far Are We to GPT-4V? Closing the Gap to Commercial Multimodal Models with Open-Source Suites

    Authors: Zhe Chen, Weiyun Wang, Hao Tian, Shenglong Ye, Zhangwei Gao, Erfei Cui, Wenwen Tong, Kongzhi Hu, Jiapeng Luo, Zheng Ma, Ji Ma, Jiaqi Wang, Xiaoyi Dong, Hang Yan, Hewei Guo, Conghui He, Botian Shi, Zhenjiang Jin, Chao Xu, Bin Wang, Xingjian Wei, Wei Li, Wenjian Zhang, Bo Zhang, Pinlong Cai, et al. (10 additional authors not shown)

    Abstract: In this report, we introduce InternVL 1.5, an open-source multimodal large language model (MLLM) to bridge the capability gap between open-source and proprietary commercial models in multimodal understanding. We introduce three simple improvements: (1) Strong Vision Encoder: we explored a continuous learning strategy for the large-scale vision foundation model -- InternViT-6B, boosting its visual…

    Submitted 29 April, 2024; v1 submitted 25 April, 2024; originally announced April 2024.

    Comments: Technical report

  3. arXiv:2404.15199  [pdf, other]

    cs.LG

    Reinforcement Learning with Adaptive Control Regularization for Safe Control of Critical Systems

    Authors: Haozhe Tian, Homayoun Hamedmoghadam, Robert Shorten, Pietro Ferraro

    Abstract: Reinforcement Learning (RL) is a powerful method for controlling dynamic systems, but its learning mechanism can lead to unpredictable actions that undermine the safety of critical systems. Here, we propose RL with Adaptive Control Regularization (RL-ACR) that ensures RL safety by combining the RL policy with a control regularizer that hard-codes safety constraints over forecasted system behaviors…

    Submitted 23 April, 2024; originally announced April 2024.

  4. arXiv:2404.12636  [pdf, other]

    cs.SE

    Multi-Objective Fine-Tuning for Enhanced Program Repair with LLMs

    Authors: Boyang Yang, Haoye Tian, Jiadong Ren, Hongyu Zhang, Jacques Klein, Tegawendé F. Bissyandé, Claire Le Goues, Shunfu Jin

    Abstract: Large language models (LLMs) have demonstrated remarkable capabilities on a broad spectrum of downstream tasks. Within the realm of software engineering, specialized tasks on code, such as program repair, present unique challenges, necessitating fine-tuning to unlock state-of-the-art performance. Fine-tuning approaches proposed in the literature for LLMs on program repair tasks are however general…

    Submitted 22 April, 2024; v1 submitted 19 April, 2024; originally announced April 2024.

  5. arXiv:2404.08570  [pdf, other]

    cs.RO cs.AI cs.LG

    Enhancing Autonomous Vehicle Training with Language Model Integration and Critical Scenario Generation

    Authors: Hanlin Tian, Kethan Reddy, Yuxiang Feng, Mohammed Quddus, Yiannis Demiris, Panagiotis Angeloudis

    Abstract: This paper introduces CRITICAL, a novel closed-loop framework for autonomous vehicle (AV) training and testing. CRITICAL stands out for its ability to generate diverse scenarios, focusing on critical driving situations that target specific learning and performance gaps identified in the Reinforcement Learning (RL) agent. The framework achieves this by integrating real-world traffic dynamics, drivi…

    Submitted 12 April, 2024; originally announced April 2024.

    Comments: 7 pages, 5 figures

  6. arXiv:2404.05258  [pdf, other]

    cs.CV

    Unsupervised Band Selection Using Fused HSI and LiDAR Attention Integrating With Autoencoder

    Authors: Judy X Yang, Jun Zhou, Jing Wang, Hui Tian, Alan Wee Chung Liew

    Abstract: Band selection in hyperspectral imaging (HSI) is critical for optimising data processing and enhancing analytical accuracy. Traditional approaches have predominantly concentrated on analysing spectral and pixel characteristics within individual bands independently. These approaches overlook the potential benefits of integrating multiple data sources, such as Light Detection and Ranging (LiDAR), an…

    Submitted 8 April, 2024; originally announced April 2024.

    Comments: 13 pages, 13 figures, 6 tables

    MSC Class: F.2.2; I.2.7

  7. arXiv:2404.03883  [pdf, other]

    eess.IV cs.CV

    LiDAR-Guided Cross-Attention Fusion for Hyperspectral Band Selection and Image Classification

    Authors: Judy X Yang, Jun Zhou, Jing Wang, Hui Tian, Alan Wee-Chung Liew

    Abstract: The fusion of hyperspectral and LiDAR data has been an active research topic. Existing fusion methods have ignored the high-dimensionality and redundancy challenges in hyperspectral images, despite that band selection methods have been intensively studied for hyperspectral image (HSI) processing. This paper addresses this significant gap by introducing a cross-attention mechanism from the transfor…

    Submitted 15 April, 2024; v1 submitted 5 April, 2024; originally announced April 2024.

    Comments: 15 pages, 13 figures

    MSC Class: F.2.2; I.2.7

    Journal ref: IEEE - TGRS-2024-00264.R1 Final Files Received

  8. arXiv:2404.01780  [pdf, other]

    astro-ph.IM astro-ph.GA cs.CV

    CSST Strong Lensing Preparation: a Framework for Detecting Strong Lenses in the Multi-color Imaging Survey by the China Survey Space Telescope (CSST)

    Authors: Xu Li, Ruiqi Sun, Jiameng Lv, Peng Jia, Nan Li, Chengliang Wei, Zou Hu, Xinzhong Er, Yun Chen, Zhang Ban, Yuedong Fang, Qi Guo, Dezi Liu, Guoliang Li, Lin Lin, Ming Li, Ran Li, Xiaobo Li, Yu Luo, Xianmin Meng, Jundan Nie, Zhaoxiang Qi, Yisheng Qiu, Li Shao, Hao Tian, et al. (7 additional authors not shown)

    Abstract: Strong gravitational lensing is a powerful tool for investigating dark matter and dark energy properties. With the advent of large-scale sky surveys, we can discover strong lensing systems on an unprecedented scale, which requires efficient tools to extract them from billions of astronomical objects. The existing mainstream lens-finding tools are based on machine learning algorithms and applied to…

    Submitted 2 April, 2024; originally announced April 2024.

    Comments: The paper is accepted by the AJ. The complete code could be downloaded with DOI of: 10.12149/101393. Comments are welcome

  9. arXiv:2404.00272  [pdf, other]

    cs.CV

    HSIMamba: Hyperpsectral Imaging Efficient Feature Learning with Bidirectional State Space for Classification

    Authors: Judy X Yang, Jun Zhou, Jing Wang, Hui Tian, Alan Wee Chung Liew

    Abstract: Classifying hyperspectral images is a difficult task in remote sensing, due to their complex high-dimensional data. To address this challenge, we propose HSIMamba, a novel framework that uses bidirectional reversed convolutional neural network pathways to extract spectral features more efficiently. Additionally, it incorporates a specialized block for spatial analysis. Our approach combines the op…

    Submitted 30 March, 2024; originally announced April 2024.

    Comments: 11 pages, 2 figures, 8 tables

    ACM Class: F.2.2, I.2.7

  10. arXiv:2403.14085  [pdf, other]

    cs.CV

    Surface Reconstruction from Point Clouds via Grid-based Intersection Prediction

    Authors: Hui Tian, Kai Xu

    Abstract: Surface reconstruction from point clouds is a crucial task in the fields of computer vision and computer graphics. SDF-based methods excel at reconstructing smooth meshes with minimal error and artefacts but struggle with representing open surfaces. On the other hand, UDF-based methods can effectively represent open surfaces but often introduce noise, leading to artefacts in the mesh. In this work…

    Submitted 8 April, 2024; v1 submitted 20 March, 2024; originally announced March 2024.

  11. arXiv:2403.08896  [pdf, ps, other]

    cs.LG cs.DC

    One-Shot Averaging for Distributed TD($λ$) Under Markov Sampling

    Authors: Haoxing Tian, Ioannis Ch. Paschalidis, Alex Olshevsky

    Abstract: We consider a distributed setup for reinforcement learning, where each agent has a copy of the same Markov Decision Process but transitions are sampled from the corresponding Markov chain independently by each agent. We show that in this setting, we can achieve a linear speedup for TD($λ$), a family of popular methods for policy evaluation, in the sense that $N$ agents can evaluate a policy $N$ ti…

    Submitted 13 March, 2024; originally announced March 2024.

  12. arXiv:2403.06838  [pdf, other]

    cs.SE cs.CR

    ACFIX: Guiding LLMs with Mined Common RBAC Practices for Context-Aware Repair of Access Control Vulnerabilities in Smart Contracts

    Authors: Lyuye Zhang, Kaixuan Li, Kairan Sun, Daoyuan Wu, Ye Liu, Haoye Tian, Yang Liu

    Abstract: Smart contracts are susceptible to various security issues, among which access control (AC) vulnerabilities are particularly critical. While existing research has proposed multiple detection tools, the automatic and appropriate repair of AC vulnerabilities in smart contracts remains a challenge. Unlike commonly supported vulnerability types by existing repair tools, such as reentrancy, which are u…

    Submitted 18 March, 2024; v1 submitted 11 March, 2024; originally announced March 2024.

    Comments: This is a technical report from Nanyang Technological University

  13. arXiv:2403.06520  [pdf, other]

    cs.CL cs.AI

    How to Understand Named Entities: Using Common Sense for News Captioning

    Authors: Ning Xu, Yanhui Wang, Tingting Zhang, Hongshuo Tian, Mohan Kankanhalli, An-An Liu

    Abstract: News captioning aims to describe an image with its news article body as input. It greatly relies on a set of detected named entities, including real-world people, organizations, and places. This paper exploits commonsense knowledge to understand named entities for news captioning. By "understand", we mean correlating the news content with common sense in the wild, which helps an agent to 1) dist…

    Submitted 11 March, 2024; originally announced March 2024.

  14. arXiv:2403.05101  [pdf, other]

    cs.CL cs.AI

    Rule-driven News Captioning

    Authors: Ning Xu, Tingting Zhang, Hongshuo Tian, An-An Liu

    Abstract: News captioning task aims to generate sentences by describing named entities or concrete events for an image with its news article. Existing methods have achieved remarkable results by relying on the large-scale pre-trained models, which primarily focus on the correlations between the input news content and the output predictions. However, the news captioning requires adhering to some fundamental…

    Submitted 14 March, 2024; v1 submitted 8 March, 2024; originally announced March 2024.

  15. arXiv:2403.01798  [pdf, other]

    cs.NI cs.LG

    Towards Fair and Efficient Learning-based Congestion Control

    Authors: Xudong Liao, Han Tian, Chaoliang Zeng, Xinchen Wan, Kai Chen

    Abstract: Recent years have witnessed a plethora of learning-based solutions for congestion control (CC) that demonstrate better performance over traditional TCP schemes. However, they fail to provide consistently good convergence properties, including fairness, fast convergence and stability, due to the mismatch between their objective functions and these properties. Despite being intuiti…

    Submitted 4 March, 2024; originally announced March 2024.

  16. arXiv:2402.19414  [pdf, ps, other]

    cs.SI cs.DS

    Higher-Order Networks Representation and Learning: A Survey

    Authors: Hao Tian, Reza Zafarani

    Abstract: Network data has become widespread, larger, and more complex over the years. Traditional network data is dyadic, capturing the relations among pairs of entities. With the need to model interactions among more than two entities, significant research has focused on higher-order networks and ways to represent, analyze, and learn from them. There are two main directions to studying higher-order networ…

    Submitted 9 April, 2024; v1 submitted 29 February, 2024; originally announced February 2024.

    Comments: 25 pages

    MSC Class: 68Q06

    ACM Class: A.1; I.5.1

  17. arXiv:2402.15321  [pdf, other]

    cs.CV cs.AI cs.LG

    OpenSUN3D: 1st Workshop Challenge on Open-Vocabulary 3D Scene Understanding

    Authors: Francis Engelmann, Ayca Takmaz, Jonas Schult, Elisabetta Fedele, Johanna Wald, Songyou Peng, Xi Wang, Or Litany, Siyu Tang, Federico Tombari, Marc Pollefeys, Leonidas Guibas, Hongbo Tian, Chunjie Wang, Xiaosheng Yan, Bingwen Wang, Xuanyang Zhang, Xiao Liu, Phuc Nguyen, Khoi Nguyen, Anh Tran, Cuong Pham, Zhening Huang, Xiaoyang Wu, Xi Chen, et al. (3 additional authors not shown)

    Abstract: This report provides an overview of the challenge hosted at the OpenSUN3D Workshop on Open-Vocabulary 3D Scene Understanding held in conjunction with ICCV 2023. The goal of this workshop series is to provide a platform for exploration and discussion of open-vocabulary 3D scene understanding tasks, including but not limited to segmentation, detection and mapping. We provide an overview of the chall…

    Submitted 17 March, 2024; v1 submitted 23 February, 2024; originally announced February 2024.

    Comments: Our OpenSUN3D workshop website for ICCV 2023: https://opensun3d.github.io/index_iccv23.html

  18. arXiv:2402.02172  [pdf, other]

    cs.SE

    CodeAgent: Collaborative Agents for Software Engineering

    Authors: Daniel Tang, Zhenghan Chen, Kisub Kim, Yewei Song, Haoye Tian, Saad Ezzini, Yongfeng Huang, Jacques Klein, Tegawende F. Bissyande

    Abstract: Code review is a heavily collaborative process, which aims at ensuring the overall quality and reliability of software. While it provides massive benefits, the implementation of code review in an organization faces several challenges that make its automation appealing. Automated code review tools have been around for a while and are now improving thanks to the adoption of novel AI models, which he…

    Submitted 15 February, 2024; v1 submitted 3 February, 2024; originally announced February 2024.

  19. arXiv:2401.16566  [pdf, other]

    cs.RO

    Excitation Trajectory Optimization for Dynamic Parameter Identification Using Virtual Constraints in Hands-on Robotic System

    Authors: Huanyu Tian, Martin Huber, Christopher E. Mower, Zhe Han, Changsheng Li, Xingguang Duan, Christos Bergeles

    Abstract: This paper proposes a novel, more computationally efficient method for optimizing robot excitation trajectories for dynamic parameter identification, emphasizing self-collision avoidance. This addresses the system identification challenges for getting high-quality training data associated with co-manipulated robotic arms that can be equipped with a variety of tools, a common scenario in industrial…

    Submitted 29 January, 2024; originally announced January 2024.

  20. arXiv:2401.07870  [pdf, other]

    cs.CL cs.AI cs.SE

    JumpCoder: Go Beyond Autoregressive Coder via Online Modification

    Authors: Mouxiang Chen, Hao Tian, Zhongxin Liu, Xiaoxue Ren, Jianling Sun

    Abstract: While existing code large language models (code LLMs) exhibit impressive capabilities in code generation, their autoregressive sequential generation inherently lacks reversibility. This limitation hinders them from timely correcting previous missing statements during coding as humans do, often leading to error propagation and suboptimal performance. We introduce JumpCoder, a novel model-agnostic fr…

    Submitted 15 January, 2024; originally announced January 2024.

  21. arXiv:2312.15186  [pdf, other]

    cs.DC cs.AI cs.LG

    Efficient Asynchronous Federated Learning with Sparsification and Quantization

    Authors: Juncheng Jia, Ji Liu, Chendi Zhou, Hao Tian, Mianxiong Dong, Dejing Dou

    Abstract: While data is distributed in multiple edge devices, Federated Learning (FL) is attracting more and more attention to collaboratively train a machine learning model without transferring raw data. FL generally exploits a parameter server and a large number of edge devices during the whole process of the model training, while several devices are selected in each round. However, straggler devices may…

    Submitted 6 January, 2024; v1 submitted 23 December, 2023; originally announced December 2023.

    Comments: To appear in Concurrency and Computation: Practice and Experience (CCPE), 21 pages

  22. arXiv:2312.09245  [pdf, other]

    cs.CV

    DriveMLM: Aligning Multi-Modal Large Language Models with Behavioral Planning States for Autonomous Driving

    Authors: Wenhai Wang, Jiangwei Xie, ChuanYang Hu, Haoming Zou, Jianan Fan, Wenwen Tong, Yang Wen, Silei Wu, Hanming Deng, Zhiqi Li, Hao Tian, Lewei Lu, Xizhou Zhu, Xiaogang Wang, Yu Qiao, Jifeng Dai

    Abstract: Large language models (LLMs) have opened up new possibilities for intelligent agents, endowing them with human-like thinking and cognitive abilities. In this work, we delve into the potential of large language models (LLMs) in autonomous driving (AD). We introduce DriveMLM, an LLM-based AD framework that can perform close-loop autonomous driving in realistic simulators. To this end, (1) we bridge…

    Submitted 25 December, 2023; v1 submitted 14 December, 2023; originally announced December 2023.

    Comments: Technical Report

  23. arXiv:2312.09086  [pdf, other]

    cs.LG cs.NE

    COMBHelper: A Neural Approach to Reduce Search Space for Graph Combinatorial Problems

    Authors: Hao Tian, Sourav Medya, Wei Ye

    Abstract: Combinatorial Optimization (CO) problems over graphs appear routinely in many applications such as in optimizing traffic, viral marketing in social networks, and matching for job allocation. Due to their combinatorial nature, these problems are often NP-hard. Existing approximation algorithms and heuristics rely on the search space to find the solutions and become time-consuming when this space is…

    Submitted 1 January, 2024; v1 submitted 14 December, 2023; originally announced December 2023.

  24. arXiv:2312.05397  [pdf, other]

    cs.LG

    On the Performance of Temporal Difference Learning With Neural Networks

    Authors: Haoxing Tian, Ioannis Ch. Paschalidis, Alex Olshevsky

    Abstract: Neural Temporal Difference (TD) Learning is an approximate temporal difference method for policy evaluation that uses a neural network for function approximation. Analysis of Neural TD Learning has proven to be challenging. In this paper we provide a convergence analysis of Neural TD Learning with a projection onto $B(θ_0, ω)$, a ball of fixed radius $ω$ around the initial point $θ_0$. We show an…

    Submitted 8 December, 2023; originally announced December 2023.

  25. arXiv:2312.02521  [pdf, other]

    cs.CV cs.AI

    Retrieving Conditions from Reference Images for Diffusion Models

    Authors: Haoran Tang, Xin Zhou, Jieren Deng, Zhihong Pan, Hao Tian, Pratik Chaudhari

    Abstract: Newly developed diffusion-based techniques have showcased phenomenal abilities in producing a wide range of high-quality images, sparking considerable interest in various applications. A prevalent scenario is to generate new images based on a subject from reference images. This subject could be face identity for styled avatars, body and clothing for virtual try-on and so on. Satisfying this requir…

    Submitted 15 March, 2024; v1 submitted 5 December, 2023; originally announced December 2023.

  26. arXiv:2312.01241  [pdf, other]

    cs.CR cs.AI

    Just-in-Time Security Patch Detection -- LLM At the Rescue for Data Augmentation

    Authors: Xunzhu Tang, Zhenghan Chen, Kisub Kim, Haoye Tian, Saad Ezzini, Jacques Klein

    Abstract: In the face of growing vulnerabilities found in open-source software, the need to identify discreet security patches has become paramount. The lack of consistency in how software providers handle maintenance often leads to the release of security patches without comprehensive advisories, leaving users vulnerable to unaddressed security risks. To address this pressing issue, we introduce a novel…

    Submitted 12 December, 2023; v1 submitted 2 December, 2023; originally announced December 2023.

  27. arXiv:2311.18835  [pdf, other]

    cs.CV

    InstructSeq: Unifying Vision Tasks with Instruction-conditioned Multi-modal Sequence Generation

    Authors: Rongyao Fang, Shilin Yan, Zhaoyang Huang, Jingqiu Zhou, Hao Tian, Jifeng Dai, Hongsheng Li

    Abstract: Empowering models to dynamically accomplish tasks specified through natural language instructions represents a promising path toward more capable and general artificial intelligence. In this work, we introduce InstructSeq, an instruction-conditioned multi-modal modeling framework that unifies diverse vision tasks through flexible natural language control and handling of both visual and textual dat…

    Submitted 30 November, 2023; originally announced November 2023.

    Comments: 10 pages

  28. arXiv:2311.18405  [pdf, other]

    cs.CV

    CAT-DM: Controllable Accelerated Virtual Try-on with Diffusion Model

    Authors: Jianhao Zeng, Dan Song, Weizhi Nie, Hongshuo Tian, Tongtong Wang, Anan Liu

    Abstract: Generative Adversarial Networks (GANs) dominate the research field in image-based virtual try-on, but have not resolved problems such as unnatural deformation of garments and the blurry generation quality. While the generative quality of diffusion models is impressive, achieving controllability poses a significant challenge when applying it to virtual try-on and multiple denoising iterations limit…

    Submitted 25 April, 2024; v1 submitted 30 November, 2023; originally announced November 2023.

  29. arXiv:2311.12307  [pdf, other]

    cs.AI

    Causality is all you need

    Authors: Ning Xu, Yifei Gao, Hongshuo Tian, Yongdong Zhang, An-An Liu

    Abstract: In the fundamental statistics course, students are taught to remember the well-known saying: "Correlation is not Causation". Till now, statistics (i.e., correlation) have developed various successful frameworks, such as Transformer and Pre-training large-scale models, which have stacked multiple parallel self-attention blocks to imitate a wide range of tasks. However, in the causation community, h…

    Submitted 20 November, 2023; originally announced November 2023.

  30. arXiv:2311.03865  [pdf, other]

    cs.LG cs.AI cs.CR

    When Fairness Meets Privacy: Exploring Privacy Threats in Fair Binary Classifiers through Membership Inference Attacks

    Authors: Huan Tian, Guangsheng Zhang, Bo Liu, Tianqing Zhu, Ming Ding, Wanlei Zhou

    Abstract: Previous studies have developed fairness methods for biased models that exhibit discriminatory behaviors towards specific subgroups. While these models have shown promise in achieving fair predictions, recent research has identified their potential vulnerability to score-based membership inference attacks (MIAs). In these attacks, adversaries can infer whether a particular data sample was used dur…

    Submitted 12 January, 2024; v1 submitted 7 November, 2023; originally announced November 2023.

    Comments: Under review

  31. arXiv:2310.14560  [pdf, other]

    cs.CV

    Polyhedral Surface: Self-supervised Point Cloud Reconstruction Based on Polyhedral Surface

    Authors: Hui Tian, Kai Xu

    Abstract: Point cloud reconstruction from raw point cloud has been an important topic in computer graphics for decades, especially due to its high demand in modeling and rendering applications. An important way to solve this problem is establishing a local geometry to fit the local curve. However, previous methods build either a local plane or polynomial curve. Local plane brings the loss of sharp feature a…

    Submitted 23 October, 2023; originally announced October 2023.

  32. arXiv:2310.12753   

    cs.SE

    Patch-CLIP: A Patch-Text Pre-Trained Model

    Authors: Xunzhu Tang, Zhenghan Chen, Saad Ezzini, Haoye Tian, Jacques Klein, Tegawende F. Bissyande

    Abstract: In recent years, patch representation learning has emerged as a necessary research direction for exploiting the capabilities of machine learning in software generation. These representations have driven significant performance enhancements across a variety of tasks involving code changes. While the progress is undeniable, a common limitation among existing models is their specialization: they pred…

    Submitted 30 March, 2024; v1 submitted 19 October, 2023; originally announced October 2023.

    Comments: The paper is incomplete, causing much confusion for the community

  33. arXiv:2310.02559  [pdf, ps, other]

    cs.IT cs.LG eess.SP

    Semi-Federated Learning: Convergence Analysis and Optimization of A Hybrid Learning Framework

    Authors: Jingheng Zheng, Wanli Ni, Hui Tian, Deniz Gunduz, Tony Q. S. Quek, Zhu Han

    Abstract: Under the organization of the base station (BS), wireless federated learning (FL) enables collaborative model training among multiple devices. However, the BS is merely responsible for aggregating local updates during the training process, which incurs a waste of the computational resource at the BS. To tackle this issue, we propose a semi-federated learning (SemiFL) paradigm to leverage the compu…

    Submitted 3 October, 2023; originally announced October 2023.

    Comments: This paper has been accepted by IEEE Transactions on Wireless Communications

  34. Convergence Analysis and Latency Minimization for Semi-Federated Learning in Massive IoT Networks

    Authors: Jianyang Ren, Wanli Ni, Hui Tian, Gaofeng Nie

    Abstract: As the number of sensors becomes massive in Internet of Things (IoT) networks, the amount of data is humongous. To process data in real-time while protecting user privacy, federated learning (FL) has been regarded as an enabling technique to push edge intelligence into IoT networks with massive devices. However, FL latency increases dramatically due to the increase of the number of parameters in d…

    Submitted 3 October, 2023; originally announced October 2023.

    Comments: This paper has been accepted by IEEE Transactions on Green Communications and Networking

  35. arXiv:2310.01045  [pdf, other]

    cs.CL

    Tool-Augmented Reward Modeling

    Authors: Lei Li, Yekun Chai, Shuohuan Wang, Yu Sun, Hao Tian, Ningyu Zhang, Hua Wu

    Abstract: Reward modeling (a.k.a., preference modeling) is instrumental for aligning large language models with human preferences, particularly within the context of reinforcement learning from human feedback (RLHF). While conventional reward models (RMs) have exhibited remarkable scalability, they oft struggle with fundamental functionality such as arithmetic computation, code execution, and factual lookup…

    Submitted 11 February, 2024; v1 submitted 2 October, 2023; originally announced October 2023.

    Comments: ICLR 2024 Spotlight

  36. arXiv:2309.17334  [pdf, other]

    eess.IV cs.CV

    Multi-Depth Branch Network for Efficient Image Super-Resolution

    Authors: Huiyuan Tian, Li Zhang, Shijian Li, Min Yao, Gang Pan

    Abstract: A longstanding challenge in Super-Resolution (SR) is how to efficiently enhance high-frequency details in Low-Resolution (LR) images while maintaining semantic coherence. This is particularly crucial in practical applications where SR models are often deployed on low-power devices. To address this issue, we propose an innovative asymmetric SR architecture featuring Multi-Depth Branch Module (MDBM)…

    Submitted 15 January, 2024; v1 submitted 29 September, 2023; originally announced September 2023.

  37. arXiv:2309.16205  [pdf, other]

    cs.CV eess.IV

    DiffGAN-F2S: Symmetric and Efficient Denoising Diffusion GANs for Structural Connectivity Prediction from Brain fMRI

    Authors: Qiankun Zuo, Ruiheng Li, Yi Di, Hao Tian, Changhong Jing, Xuhang Chen, Shuqiang Wang

    Abstract: Mapping from functional connectivity (FC) to structural connectivity (SC) can facilitate multimodal brain network fusion and discover potential biomarkers for clinical implications. However, it is challenging to directly bridge the reliable non-linear mapping relations between SC and functional magnetic resonance imaging (fMRI). In this paper, a novel diffusion generative adversarial network-bas…

    Submitted 28 September, 2023; originally announced September 2023.

    Comments: 12 pages

  38. arXiv:2309.15478  [pdf, other]

    cs.CV cs.LG

    The Robust Semantic Segmentation UNCV2023 Challenge Results

    Authors: Xuanlong Yu, Yi Zuo, Zitao Wang, Xiaowen Zhang, Jiaxuan Zhao, Yuting Yang, Licheng Jiao, Rui Peng, Xinyi Wang, Junpei Zhang, Kexin Zhang, Fang Liu, Roberto Alcover-Couso, Juan C. SanMiguel, Marcos Escudero-Viñolo, Hanlin Tian, Kenta Matsui, Tianhao Wang, Fahmy Adan, Zhitong Gao, Xuming He, Quentin Bouniot, Hossein Moghaddam, Shyam Nandan Rai, Fabio Cermelli, et al. (12 additional authors not shown)

    Abstract: This paper outlines the winning solutions employed in addressing the MUAD uncertainty quantification challenge held at ICCV 2023. The challenge was centered around semantic segmentation in urban environments, with a particular focus on natural adversarial scenarios. The report presents the results of 19 submitted entries, with numerous techniques drawing inspiration from cutting-edge uncertainty q…

    Submitted 27 September, 2023; originally announced September 2023.

    Comments: 11 pages, 4 figures, accepted at ICCV 2023 UNCV workshop

  39. arXiv:2309.01371  [pdf, other]

    cs.HC

    A Survey for Graphic Design Intelligence

    Authors: Danqing Huang, Jiaqi Guo, Shizhao Sun, Hanling Tian, Jieru Lin, Zheng Hu, Chin-Yew Lin, Jian-Guang Lou, Dongmei Zhang

    Abstract: Graphic design is an effective language for visual communication. Using complex composition of visual elements (e.g., shape, color, font) guided by design principles and aesthetics, design helps produce more visually-appealing content. The creation of a harmonious design requires carefully selecting and combining different visual elements, which can be challenging and time-consuming. To expedite t…

    Submitted 4 September, 2023; originally announced September 2023.

    Comments: 10 pages, 2 figures

  40. arXiv:2308.16586  [pdf, other]

    cs.SE

    Learning to Represent Patches

    Authors: Xunzhu Tang, Haoye Tian, Zhenghan Chen, Weiguo Pian, Saad Ezzini, Abdoul Kader Kabore, Andrew Habib, Jacques Klein, Tegawende F. Bissyande

    Abstract: Patch representation is crucial in automating various software engineering tasks, like determining patch accuracy or summarizing code changes. While recent research has employed deep learning for patch representation, focusing on token sequences or Abstract Syntax Trees (ASTs), they often miss the change's semantic intent and the context of modified lines. To bridge this gap, we introduce a novel…

    Submitted 3 October, 2023; v1 submitted 31 August, 2023; originally announced August 2023.

  41. arXiv:2308.15234  [pdf, other]

    cs.SE

    Hyperbolic Code Retrieval: A Novel Approach for Efficient Code Search Using Hyperbolic Space Embeddings

    Authors: Xunzhu Tang, Zhenghan Chen, Saad Ezzini, Haoye Tian, Yewei Song, Jacques Klein, Tegawende F. Bissyande

    Abstract: Within the realm of advanced code retrieval, existing methods have primarily relied on intricate matching and attention-based mechanisms. However, these methods often lead to computational and memory inefficiencies, posing a significant challenge to their real-world applicability. To tackle this challenge, we propose a novel approach, the Hyperbolic Code QA Matching (HyCoQA). This approach leverag…

    Submitted 29 August, 2023; originally announced August 2023.

  42. arXiv:2308.15233  [pdf, other]

    cs.SE

    Multilevel Semantic Embedding of Software Patches: A Fine-to-Coarse Grained Approach Towards Security Patch Detection

    Authors: Xunzhu Tang, Zhenghan Chen, Saad Ezzini, Haoye Tian, Yewei Song, Jacques Klein, Tegawende F. Bissyande

    Abstract: The growth of open-source software has increased the risk of hidden vulnerabilities that can affect downstream software applications. This concern is further exacerbated by software vendors' practice of silently releasing security patches without explicit warnings or common vulnerability and exposure (CVE) notifications. This lack of transparency leaves users unaware of potential security threats,…

    Submitted 29 August, 2023; originally announced August 2023.

  43. arXiv:2308.14371  [pdf, other]

    cs.CV

    SuperUDF: Self-supervised UDF Estimation for Surface Reconstruction

    Authors: Hui Tian, Chenyang Zhu, Yifei Shi, Kai Xu

    Abstract: Learning-based surface reconstruction based on unsigned distance functions (UDF) has many advantages such as handling open surfaces. We propose SuperUDF, a self-supervised UDF learning which exploits a learned geometry prior for efficient training and a novel regularization for robustness to sparse sampling. The core idea of SuperUDF draws inspiration from the classical surface approximation opera…

    Submitted 22 October, 2023; v1 submitted 28 August, 2023; originally announced August 2023.

  44. arXiv:2308.10001  [pdf, other]

    cs.CV

    AltNeRF: Learning Robust Neural Radiance Field via Alternating Depth-Pose Optimization

    Authors: Kun Wang, Zhiqiang Yan, Huang Tian, Zhenyu Zhang, Xiang Li, Jun Li, Jian Yang

    Abstract: Neural Radiance Fields (NeRF) have shown promise in generating realistic novel views from sparse scene images. However, existing NeRF approaches often encounter challenges due to the lack of explicit 3D supervision and imprecise camera poses, resulting in suboptimal outcomes. To tackle these issues, we propose AltNeRF -- a novel framework designed to create resilient NeRF representations using sel…

    Submitted 23 February, 2024; v1 submitted 19 August, 2023; originally announced August 2023.

    Comments: Accepted by AAAI-24

  45. arXiv:2307.16144  [pdf, other]

    cs.CV cs.MM

    Video Frame Interpolation with Flow Transformer

    Authors: Pan Gao, Haoyue Tian, Jie Qin

    Abstract: Video frame interpolation has been actively studied with the development of convolutional neural networks. However, due to the intrinsic limitations of kernel weight sharing in convolution, the interpolated frame generated by it may lose details. In contrast, the attention mechanism in Transformer can better distinguish the contribution of each pixel, and it can also capture long-range pixel depen…

    Submitted 30 July, 2023; originally announced July 2023.

    Comments: Accepted to ACM MM23

  46. arXiv:2307.15922  [pdf, other]

    cs.NI

    Distributed Traffic Engineering in Hybrid Software Defined Networks: A Multi-agent Reinforcement Learning Framework

    Authors: Yingya Guo, Qi Tang, Yulong Ma, Han Tian, Kai Chen

    Abstract: Traffic Engineering (TE) is an efficient technique to balance network flows and thus improve the performance of a hybrid Software Defined Network (SDN). Previous TE solutions mainly leverage heuristic algorithms to centrally optimize link weight setting or traffic splitting ratios under the static traffic demand. Note that as the network scale becomes larger and network management gains more comp…

    Submitted 29 July, 2023; originally announced July 2023.

  47. arXiv:2307.11019  [pdf, other]

    cs.CL cs.IR

    Investigating the Factual Knowledge Boundary of Large Language Models with Retrieval Augmentation

    Authors: Ruiyang Ren, Yuhao Wang, Yingqi Qu, Wayne Xin Zhao, Jing Liu, Hao Tian, Hua Wu, Ji-Rong Wen, Haifeng Wang

    Abstract: Knowledge-intensive tasks (e.g., open-domain question answering (QA)) require a substantial amount of factual knowledge and often rely on external information for assistance. Recently, large language models (LLMs) (e.g., ChatGPT) have demonstrated impressive prowess in solving a wide range of tasks with world knowledge, including knowledge-intensive tasks. However, it remains unclear how well LLM…

    Submitted 23 July, 2023; v1 submitted 20 July, 2023; originally announced July 2023.

  48. arXiv:2307.07960  [pdf, other]

    cs.SI cs.HC

    The Roll-Out of Community Notes Did Not Reduce Engagement With Misinformation on Twitter

    Authors: Yuwei Chuai, Haoye Tian, Nicolas Pröllochs, Gabriele Lenzini

    Abstract: Developing interventions that successfully reduce engagement with misinformation on social media is challenging. One intervention that has recently gained great attention is Twitter's Community Notes (previously known as "Birdwatch"). Community Notes is a crowdsourced fact-checking approach that allows users to write textual notes to inform others about potentially misleading posts on Twitter. Yet…

    Submitted 16 July, 2023; originally announced July 2023.

  49. arXiv:2306.15989  [pdf, other]

    cs.GR cs.AI

    Tensorformer: Normalized Matrix Attention Transformer for High-quality Point Cloud Reconstruction

    Authors: Hui Tian, Zheng Qin, Renjiao Yi, Chenyang Zhu, Kai Xu

    Abstract: Surface reconstruction from raw point clouds has been studied for decades in the computer graphics community, which is highly demanded by modeling and rendering applications nowadays. Classic solutions, such as Poisson surface reconstruction, require point normals as extra input to produce reasonable results. Modern transformer-based methods can work without normals, while the results are less fin…

    Submitted 10 October, 2023; v1 submitted 28 June, 2023; originally announced June 2023.

  50. arXiv:2306.09713  [pdf, ps, other]

    cs.NI

    Efficient Coflow Scheduling in Hybrid-Switched Data Center Networks

    Authors: Xin Wang, Hong Shen, Hui Tian

    Abstract: To improve the application-level communication performance, scheduling of coflows, a collection of parallel flows sharing the same objective, is prevalent in modern data center networks (DCNs). Meanwhile, a hybrid-switched DCN design combining optical circuit switches (OCS) and electrical packet switches (EPS) for transmitting high-volume traffic and low-volume traffic separately has received cons…

    Submitted 16 June, 2023; originally announced June 2023.