Skip to main content

Showing 1–50 of 156 results for author: Zhong, H

Searching in archive cs. Search in all archives.
.
  1. arXiv:2405.18327  [pdf

    q-bio.QM cs.AI cs.CV cs.LG

    Histopathology Based AI Model Predicts Anti-Angiogenic Therapy Response in Renal Cancer Clinical Trial

    Authors: Jay Jasti, Hua Zhong, Vandana Panwar, Vipul Jarmale, Jeffrey Miyata, Deyssy Carrillo, Alana Christie, Dinesh Rakheja, Zora Modrusan, Edward Ernest Kadel III, Niha Beig, Mahrukh Huseni, James Brugarolas, Payal Kapur, Satwik Rajaram

    Abstract: Predictive biomarkers of treatment response are lacking for metastatic clear cell renal cell carcinoma (ccRCC), a tumor type that is treated with angiogenesis inhibitors, immune checkpoint inhibitors, mTOR inhibitors and a HIF2 inhibitor. The Angioscore, an RNA-based quantification of angiogenesis, is arguably the best candidate to predict anti-angiogenic (AA) response. However, the clinical adopt… ▽ More

    Submitted 28 May, 2024; originally announced May 2024.

    Comments: 19 pages, 4 Figures

  2. arXiv:2405.14452  [pdf, other

    cs.CV cs.AI

    JointRF: End-to-End Joint Optimization for Dynamic Neural Radiance Field Representation and Compression

    Authors: Zihan Zheng, Houqiang Zhong, Qiang Hu, Xiaoyun Zhang, Li Song, Ya Zhang, Yanfeng Wang

    Abstract: Neural Radiance Field (NeRF) excels in photo-realistically static scenes, inspiring numerous efforts to facilitate volumetric videos. However, rendering dynamic and long-sequence radiance fields remains challenging due to the significant data required to represent volumetric videos. In this paper, we propose a novel end-to-end joint optimization scheme of dynamic NeRF representation and compressio… ▽ More

    Submitted 23 May, 2024; originally announced May 2024.

    Comments: 8 pages, 5 figures

  3. arXiv:2405.11461  [pdf, other

    cs.IR cs.AI cs.CL

    DocReLM: Mastering Document Retrieval with Language Model

    Authors: Gengchen Wei, Xinle Pang, Tianning Zhang, Yu Sun, Xun Qian, Chen Lin, Han-Sen Zhong, Wanli Ouyang

    Abstract: With over 200 million published academic documents and millions of new documents being written each year, academic researchers face the challenge of searching for information within this vast corpus. However, existing retrieval systems struggle to understand the semantics and domain knowledge present in academic papers. In this work, we demonstrate that by utilizing large language models, a docume… ▽ More

    Submitted 19 May, 2024; originally announced May 2024.

  4. arXiv:2404.18922  [pdf, other

    cs.LG cs.AI cs.CL stat.ML

    DPO Meets PPO: Reinforced Token Optimization for RLHF

    Authors: Han Zhong, Guhao Feng, Wei Xiong, Li Zhao, Di He, Jiang Bian, Liwei Wang

    Abstract: In the classical Reinforcement Learning from Human Feedback (RLHF) framework, Proximal Policy Optimization (PPO) is employed to learn from sparse, sentence-level rewards -- a challenging scenario in traditional deep reinforcement learning. Despite the great successes of PPO in the alignment of state-of-the-art closed-source large language models (LLMs), its open-source implementation is still larg… ▽ More

    Submitted 29 April, 2024; originally announced April 2024.

  5. Misaka: Interactive Swarm Testbed for Smart Grid Distributed Algorithm Test and Evaluation

    Authors: Tingliang Zhang, Haiwang Zhong, Zhenfei Tan, Xinfei Yan

    Abstract: In this paper, we present Misaka, a visualized swarm testbed for smart grid algorithm evaluation, also an extendable open-source open-hardware platform for developing tabletop tangible swarm interfaces. The platform consists of a collection of custom-designed 3 omni-directional wheels robots each 10 cm in diameter, high accuracy localization through a microdot pattern overlaid on top of the activi… ▽ More

    Submitted 25 April, 2024; originally announced April 2024.

    Journal ref: 2020 IEEE/IAS Industrial and Commercial Power System Asia (I&CPS Asia)

  6. arXiv:2404.16223  [pdf, other

    cs.CV eess.IV

    Deep RAW Image Super-Resolution. A NTIRE 2024 Challenge Survey

    Authors: Marcos V. Conde, Florin-Alexandru Vasluianu, Radu Timofte, Jianxing Zhang, Jia Li, Fan Wang, Xiaopeng Li, Zikun Liu, Hyunhee Park, Sejun Song, Changho Kim, Zhijuan Huang, Hongyuan Yu, Cheng Wan, Wending Xiang, Jiamin Lin, Hang Zhong, Qiaosong Zhang, Yue Sun, Xuanwu Yin, Kunlong Zuo, Senyan Xu, Siyuan Jiang, Zhijing Sun, Jiaying Zhu , et al. (10 additional authors not shown)

    Abstract: This paper reviews the NTIRE 2024 RAW Image Super-Resolution Challenge, highlighting the proposed solutions and results. New methods for RAW Super-Resolution could be essential in modern Image Signal Processing (ISP) pipelines, however, this problem is not as explored as in the RGB domain. Th goal of this challenge is to upscale RAW Bayer images by 2x, considering unknown degradations such as nois… ▽ More

    Submitted 24 April, 2024; originally announced April 2024.

    Comments: CVPR 2024 - NTIRE Workshop

  7. arXiv:2404.12648  [pdf, ps, other

    cs.LG stat.ML

    Sample-efficient Learning of Infinite-horizon Average-reward MDPs with General Function Approximation

    Authors: Jianliang He, Han Zhong, Zhuoran Yang

    Abstract: We study infinite-horizon average-reward Markov decision processes (AMDPs) in the context of general function approximation. Specifically, we propose a novel algorithmic framework named Local-fitted Optimization with OPtimism (LOOP), which incorporates both model-based and value-based incarnations. In particular, LOOP features a novel construction of confidence sets and a low-switching policy upda… ▽ More

    Submitted 19 April, 2024; originally announced April 2024.

    Comments: ICLR 2024

  8. arXiv:2404.04884  [pdf, other

    cs.CV

    LRNet: Change detection of high-resolution remote sensing imagery via strategy of localization-then-refinement

    Authors: Huan Zhong, Chen Wu, Ziqi Xiao

    Abstract: Change detection, as a research hotspot in the field of remote sensing, has witnessed continuous development and progress. However, the discrimination of boundary details remains a significant bottleneck due to the complexity of surrounding elements between change areas and backgrounds. Discriminating the boundaries of large change areas results in misalignment, while connecting boundaries occurs… ▽ More

    Submitted 7 April, 2024; originally announced April 2024.

    Comments: 18 pages, 11 figures

  9. arXiv:2404.03578  [pdf, ps, other

    cs.LG stat.ML

    Distributionally Robust Reinforcement Learning with Interactive Data Collection: Fundamental Hardness and Near-Optimal Algorithm

    Authors: Miao Lu, Han Zhong, Tong Zhang, Jose Blanchet

    Abstract: The sim-to-real gap, which represents the disparity between training and testing environments, poses a significant challenge in reinforcement learning (RL). A promising approach to addressing this challenge is distributionally robust RL, often framed as a robust Markov decision process (RMDP). In this framework, the objective is to find a robust policy that achieves good performance under the wors… ▽ More

    Submitted 4 April, 2024; originally announced April 2024.

  10. arXiv:2404.02638  [pdf, other

    cs.CV

    SG-BEV: Satellite-Guided BEV Fusion for Cross-View Semantic Segmentation

    Authors: Junyan Ye, Qiyan Luo, Jinhua Yu, Huaping Zhong, Zhimeng Zheng, Conghui He, Weijia Li

    Abstract: This paper aims at achieving fine-grained building attribute segmentation in a cross-view scenario, i.e., using satellite and street-view image pairs. The main challenge lies in overcoming the significant perspective differences between street views and satellite views. In this work, we introduce SG-BEV, a novel approach for satellite-guided BEV fusion for cross-view semantic segmentation. To over… ▽ More

    Submitted 3 April, 2024; originally announced April 2024.

    Comments: accepted by CVPR 2024

  11. arXiv:2403.13027  [pdf, other

    cs.LG cs.CR cs.IT stat.ML

    Towards Better Statistical Understanding of Watermarking LLMs

    Authors: Zhongze Cai, Shang Liu, Hanzhao Wang, Huaiyang Zhong, Xiaocheng Li

    Abstract: In this paper, we study the problem of watermarking large language models (LLMs). We consider the trade-off between model distortion and detection ability and formulate it as a constrained optimization problem based on the green-red algorithm of Kirchenbauer et al. (2023a). We show that the optimal solution to the optimization problem enjoys a nice analytical property which provides a better under… ▽ More

    Submitted 18 March, 2024; originally announced March 2024.

  12. arXiv:2403.09173  [pdf, other

    quant-ph cs.CR

    Bridging Quantum Computing and Differential Privacy: Insights into Quantum Computing Privacy

    Authors: Yusheng Zhao, Hui Zhong, Xinyue Zhang, Yuqing Li, Chi Zhang, Miao Pan

    Abstract: While quantum computing has a strong potential in data-driven fields, the privacy issue of sensitive or valuable information involved in the quantum algorithm should be considered. Differential privacy (DP), which is a fundamental privacy tool widely used in the classical scenario, has been extended to the quantum domain, i.e. quantum differential privacy (QDP). QDP may become one of the most prom… ▽ More

    Submitted 22 April, 2024; v1 submitted 14 March, 2024; originally announced March 2024.

    Comments: 12 pages (10 pages + 2 refs)

  13. arXiv:2403.07350  [pdf, ps, other

    cs.CL cs.AI cs.CV

    KEBench: A Benchmark on Knowledge Editing for Large Vision-Language Models

    Authors: Han Huang, Haitian Zhong, Qiang Liu, Shu Wu, Liang Wang, Tieniu Tan

    Abstract: Currently, little research has been done on knowledge editing for Large Vision-Language Models (LVLMs). Editing LVLMs faces the challenge of effectively integrating diverse modalities (image and text) while ensuring coherent and contextually relevant modifications. An existing benchmark has three metrics (Reliability, Locality and Generality) to measure knowledge editing for LVLMs. However, the be… ▽ More

    Submitted 12 March, 2024; originally announced March 2024.

    Comments: 13 pages

  14. arXiv:2403.05006  [pdf, ps, other

    cs.LG cs.AI stat.ME stat.ML

    Provable Multi-Party Reinforcement Learning with Diverse Human Feedback

    Authors: Huiying Zhong, Zhun Deng, Weijie J. Su, Zhiwei Steven Wu, Linjun Zhang

    Abstract: Reinforcement learning with human feedback (RLHF) is an emerging paradigm to align models with human preferences. Typically, RLHF aggregates preferences from multiple individuals who have diverse viewpoints that may conflict with each other. Our work \textit{initiates} the theoretical study of multi-party RLHF that explicitly models the diverse preferences of multiple individuals. We show how trad… ▽ More

    Submitted 7 March, 2024; originally announced March 2024.

  15. arXiv:2403.02127  [pdf, other

    cs.CV cs.AI cs.CL

    LOCR: Location-Guided Transformer for Optical Character Recognition

    Authors: Yu Sun, Dongzhan Zhou, Chen Lin, Conghui He, Wanli Ouyang, Han-Sen Zhong

    Abstract: Academic documents are packed with texts, equations, tables, and figures, requiring comprehensive understanding for accurate Optical Character Recognition (OCR). While end-to-end OCR methods offer improved accuracy over layout-based approaches, they often grapple with significant repetition issues, especially with complex layouts in Out-Of-Domain (OOD) documents.To tackle this issue, we propose LO… ▽ More

    Submitted 4 March, 2024; originally announced March 2024.

  16. arXiv:2402.11874  [pdf, other

    cs.CV

    Language-guided Image Reflection Separation

    Authors: Haofeng Zhong, Yuchen Hong, Shuchen Weng, Jinxiu Liang, Boxin Shi

    Abstract: This paper studies the problem of language-guided reflection separation, which aims at addressing the ill-posed reflection separation problem by introducing language descriptions to provide layer content. We propose a unified framework to solve this problem, which leverages the cross-attention mechanism with contrastive learning strategies to construct the correspondence between language descripti… ▽ More

    Submitted 15 April, 2024; v1 submitted 19 February, 2024; originally announced February 2024.

  17. arXiv:2402.10207  [pdf, other

    cs.LG cs.AI cs.CL

    Rewards-in-Context: Multi-objective Alignment of Foundation Models with Dynamic Preference Adjustment

    Authors: Rui Yang, Xiaoman Pan, Feng Luo, Shuang Qiu, Han Zhong, Dong Yu, Jianshu Chen

    Abstract: We consider the problem of multi-objective alignment of foundation models with human preferences, which is a critical step towards helpful and harmless AI systems. However, it is generally costly and unstable to fine-tune large foundation models using reinforcement learning (RL), and the multi-dimensionality, heterogeneity, and conflicting nature of human preferences further complicate the alignme… ▽ More

    Submitted 24 May, 2024; v1 submitted 15 February, 2024; originally announced February 2024.

    Comments: Accepted by ICML 2024

  18. arXiv:2402.10186  [pdf, other

    cs.LG physics.chem-ph physics.comp-ph

    Self-consistent Validation for Machine Learning Electronic Structure

    Authors: Gengyuan Hu, Gengchen Wei, Zekun Lou, Philip H. S. Torr, Wanli Ouyang, Han-sen Zhong, Chen Lin

    Abstract: Machine learning has emerged as a significant approach to efficiently tackle electronic structure problems. Despite its potential, there is less guarantee for the model to generalize to unseen data that hinders its application in real-world scenarios. To address this issue, a technique has been proposed to estimate the accuracy of the predictions. This method integrates machine learning with self-… ▽ More

    Submitted 15 February, 2024; originally announced February 2024.

    Comments: 6 pages, 4 figures

  19. arXiv:2402.06852  [pdf

    cs.AI cs.CL

    ChemLLM: A Chemical Large Language Model

    Authors: Di Zhang, Wei Liu, Qian Tan, Jingdan Chen, Hang Yan, Yuliang Yan, Jiatong Li, Weiran Huang, Xiangyu Yue, Wanli Ouyang, Dongzhan Zhou, Shufei Zhang, Mao Su, Han-Sen Zhong, Yuqiang Li

    Abstract: Large language models (LLMs) have made impressive progress in chemistry applications. However, the community lacks an LLM specifically designed for chemistry. The main challenges are two-fold: firstly, most chemical data and scientific knowledge are stored in structured databases, which limits the model's ability to sustain coherent dialogue when used directly. Secondly, there is an absence of obj… ▽ More

    Submitted 25 April, 2024; v1 submitted 9 February, 2024; originally announced February 2024.

    Comments: 9 pages, 5 figures

  20. arXiv:2312.17248  [pdf, other

    cs.LG cs.AI cs.CC cs.DS stat.ML

    Rethinking Model-based, Policy-based, and Value-based Reinforcement Learning via the Lens of Representation Complexity

    Authors: Guhao Feng, Han Zhong

    Abstract: Reinforcement Learning (RL) encompasses diverse paradigms, including model-based RL, policy-based RL, and value-based RL, each tailored to approximate the model, optimal policy, and optimal value function, respectively. This work investigates the potential hierarchy of representation complexity -- the complexity of functions to be represented -- among these RL paradigms. We first demonstrate that,… ▽ More

    Submitted 28 December, 2023; originally announced December 2023.

  21. arXiv:2312.17163  [pdf, other

    cs.CV cs.AI

    FENet: Focusing Enhanced Network for Lane Detection

    Authors: Liman Wang, Hanyang Zhong

    Abstract: Inspired by human driving focus, this research pioneers networks augmented with Focusing Sampling, Partial Field of View Evaluation, Enhanced FPN architecture and Directional IoU Loss - targeted innovations addressing obstacles to precise lane detection for autonomous driving. Experiments demonstrate our Focusing Sampling strategy, emphasizing vital distant details unlike uniform approaches, signi… ▽ More

    Submitted 26 April, 2024; v1 submitted 28 December, 2023; originally announced December 2023.

    Comments: 12 pages including appendix. The Code is available at https://github.com/HanyangZhong/FENet

  22. arXiv:2312.16127  [pdf, other

    cs.AI

    LLM-SAP: Large Language Model Situational Awareness Based Planning

    Authors: Liman Wang, Hanyang Zhong

    Abstract: This work pioneers evaluating emergent planning capabilities based on situational awareness in large language models. We contribute (i) novel benchmarks and metrics for standardized assessment; (ii) a unique dataset to spur progress; and (iii) demonstrations that prompting and multi-agent schemes significantly enhance planning performance in context-sensitive planning tasks. Positioning this withi… ▽ More

    Submitted 4 February, 2024; v1 submitted 26 December, 2023; originally announced December 2023.

    Comments: 18 pages including appendix. Website:https://github.com/HanyangZhong/Situational_Planning_datasets

  23. arXiv:2312.14521  [pdf, other

    quant-ph cs.ET

    Tuning Quantum Computing Privacy through Quantum Error Correction

    Authors: Hui Zhong, Keyi Ju, Manojna Sistla, Xinyue Zhang, Xiaoqi Qin, Xin Fu, Miao Pan

    Abstract: Quantum computing is a promising paradigm for efficiently solving large and high-complexity problems. To protect quantum computing privacy, pioneering research efforts proposed to redefine differential privacy (DP) in quantum computing, i.e., quantum differential privacy (QDP), and harvest inherent noises generated by quantum computing to implement QDP. However, such an implementation approach is… ▽ More

    Submitted 22 December, 2023; originally announced December 2023.

  24. arXiv:2312.11456  [pdf, other

    cs.LG cs.AI stat.ML

    Iterative Preference Learning from Human Feedback: Bridging Theory and Practice for RLHF under KL-Constraint

    Authors: Wei Xiong, Hanze Dong, Chenlu Ye, Ziqi Wang, Han Zhong, Heng Ji, Nan Jiang, Tong Zhang

    Abstract: This paper studies the alignment process of generative models with Reinforcement Learning from Human Feedback (RLHF). We first identify the primary challenges of existing popular methods like offline PPO and offline DPO as lacking in strategical exploration of the environment. Then, to understand the mathematical principle of RLHF, we consider a standard mathematical formulation, the reverse-KL re… ▽ More

    Submitted 1 May, 2024; v1 submitted 18 December, 2023; originally announced December 2023.

    Comments: 53 pages; theoretical study and algorithmic design of iterative RLHF and DPO

  25. arXiv:2312.11126  [pdf, other

    quant-ph cs.CR cs.LG

    Harnessing Inherent Noises for Privacy Preservation in Quantum Machine Learning

    Authors: Keyi Ju, Xiaoqi Qin, Hui Zhong, Xinyue Zhang, Miao Pan, Baoling Liu

    Abstract: Quantum computing revolutionizes the way of solving complex problems and handling vast datasets, which shows great potential to accelerate the machine learning process. However, data leakage in quantum machine learning (QML) may present privacy risks. Although differential privacy (DP), which protects privacy through the injection of artificial noise, is a well-established approach, its applicatio… ▽ More

    Submitted 6 March, 2024; v1 submitted 18 December, 2023; originally announced December 2023.

    Comments: 6 pages, 4 figures

  26. arXiv:2312.04464  [pdf, other

    cs.LG stat.ML

    Horizon-Free and Instance-Dependent Regret Bounds for Reinforcement Learning with General Function Approximation

    Authors: Jiayi Huang, Han Zhong, Liwei Wang, Lin F. Yang

    Abstract: To tackle long planning horizon problems in reinforcement learning with general function approximation, we propose the first algorithm, termed as UCRL-WVTR, that achieves both \emph{horizon-free} and \emph{instance-dependent}, since it eliminates the polynomial dependency on the planning horizon. The derived regret bound is deemed \emph{sharp}, as it matches the minimax lower bound when specialize… ▽ More

    Submitted 7 December, 2023; originally announced December 2023.

  27. arXiv:2311.06231  [pdf, other

    cs.CV

    Learning Human Action Recognition Representations Without Real Humans

    Authors: Howard Zhong, Samarth Mishra, Donghyun Kim, SouYoung Jin, Rameswar Panda, Hilde Kuehne, Leonid Karlinsky, Venkatesh Saligrama, Aude Oliva, Rogerio Feris

    Abstract: Pre-training on massive video datasets has become essential to achieve high action recognition performance on smaller downstream datasets. However, most large-scale video datasets contain images of people and hence are accompanied with issues related to privacy, ethics, and data protection, often preventing them from being publicly shared for reproducible research. Existing work has attempted to a… ▽ More

    Submitted 10 November, 2023; originally announced November 2023.

    Comments: 19 pages, 7 figures, 2023 NeurIPS Datasets and Benchmarks Track

  28. arXiv:2310.19861  [pdf, ps, other

    cs.LG cs.GT stat.ML

    Posterior Sampling for Competitive RL: Function Approximation and Partial Observation

    Authors: Shuang Qiu, Ziyu Dai, Han Zhong, Zhaoran Wang, Zhuoran Yang, Tong Zhang

    Abstract: This paper investigates posterior sampling algorithms for competitive reinforcement learning (RL) in the context of general function approximations. Focusing on zero-sum Markov games (MGs) under two critical settings, namely self-play and adversarial learning, we first propose the self-play and adversarial generalized eluder coefficient (GEC) as complexity measures for function approximation, capt… ▽ More

    Submitted 30 October, 2023; originally announced October 2023.

    Comments: NeurIPS 2023

  29. arXiv:2310.18480  [pdf

    cs.CE

    Capacity, Collision Avoidance and Shopping Rate under a Social Distancing Regime

    Authors: Haitian Zhong, David Sankoff

    Abstract: Capacity restrictions in stores, maintained by mechanisms like spacing customer intake, became familiar features of retailing in the time of the pandemic. Shopping rates in a crowded store under a social distance regime is prone to considerable slowdown. Inspired by the random particle collision concepts of statistical mechanics, we introduce a dynamical model of the evolution of shopping rate as… ▽ More

    Submitted 27 October, 2023; originally announced October 2023.

    Comments: 24 pages, 22 figures

    MSC Class: 91B99

  30. arXiv:2310.12955  [pdf, other

    cs.LG cs.AI

    Towards Robust Offline Reinforcement Learning under Diverse Data Corruption

    Authors: Rui Yang, Han Zhong, Jiawei Xu, Amy Zhang, Chongjie Zhang, Lei Han, Tong Zhang

    Abstract: Offline reinforcement learning (RL) presents a promising approach for learning reinforced policies from offline datasets without the need for costly or unsafe interactions with the environment. However, datasets collected by humans in real-world environments are often noisy and may even be maliciously corrupted, which can significantly degrade the performance of offline RL. In this work, we first… ▽ More

    Submitted 9 March, 2024; v1 submitted 19 October, 2023; originally announced October 2023.

    Comments: Accepted by ICLR 2024

  31. arXiv:2310.11864  [pdf, other

    cs.CV cs.GR cs.LG

    VQ-NeRF: Neural Reflectance Decomposition and Editing with Vector Quantization

    Authors: Hongliang Zhong, Jingbo Zhang, Jing Liao

    Abstract: We propose VQ-NeRF, a two-branch neural network model that incorporates Vector Quantization (VQ) to decompose and edit reflectance fields in 3D scenes. Conventional neural reflectance fields use only continuous representations to model 3D scenes, despite the fact that objects are typically composed of discrete materials in reality. This lack of discretization can result in noisy material decomposi… ▽ More

    Submitted 10 November, 2023; v1 submitted 18 October, 2023; originally announced October 2023.

    Comments: Accepted by TVCG. Project Page: https://jtbzhl.github.io/VQ-NeRF.github.io/

  32. arXiv:2310.10357  [pdf, other

    cs.RO

    BEVGPT: Generative Pre-trained Large Model for Autonomous Driving Prediction, Decision-Making, and Planning

    Authors: Pengqin Wang, Meixin Zhu, Hongliang Lu, Hui Zhong, Xianda Chen, Shaojie Shen, Xuesong Wang, Yinhai Wang

    Abstract: Prediction, decision-making, and motion planning are essential for autonomous driving. In most contemporary works, they are considered as individual modules or combined into a multi-task learning paradigm with a shared backbone but separate task heads. However, we argue that they should be integrated into a comprehensive framework. Although several recent approaches follow this scheme, they suffer… ▽ More

    Submitted 16 October, 2023; originally announced October 2023.

    Comments: AAAI2024

  33. arXiv:2310.03162  [pdf, other

    cs.CR

    Metaverse CAN: Embracing Continuous, Active, and Non-intrusive Biometric Authentication

    Authors: Hui Zhong, Chenpei Huang, Xinyue Zhang, Miao Pan

    Abstract: The Metaverse is a virtual world, an immersive experience, a new human-computer interaction, built upon various advanced technologies. How to protect Metaverse personal information and virtual properties is also facing new challenges, such as new attacks and new expectations of user experiences. While traditional methods (e.g., those employed in smartphone authentication) generally pass the basic… ▽ More

    Submitted 4 October, 2023; originally announced October 2023.

    Comments: 6 pages, 3 figures

  34. arXiv:2309.17336  [pdf, other

    cs.CV cs.RO

    Robust 3D Object Detection from LiDAR-Radar Point Clouds via Cross-Modal Feature Augmentation

    Authors: Jianning Deng, Gabriel Chan, Hantao Zhong, Chris Xiaoxuan Lu

    Abstract: This paper presents a novel framework for robust 3D object detection from point clouds via cross-modal hallucination. Our proposed approach is agnostic to either hallucination direction between LiDAR and 4D radar. We introduce multiple alignments on both spatial and feature levels to achieve simultaneous backbone refinement and hallucination generation. Specifically, spatial alignment is proposed… ▽ More

    Submitted 12 March, 2024; v1 submitted 29 September, 2023; originally announced September 2023.

    Comments: Accepted to ICRA 2024. 8 pages, 4 figures. Equal contribution for Gabriel Chan and Hantao Zhong, listed randomly

  35. arXiv:2309.16747  [pdf, other

    cs.LG cs.AI

    Harnessing Diverse Data for Global Disaster Prediction: A Multimodal Framework

    Authors: Gengyin Liu, Huaiyang Zhong

    Abstract: As climate change intensifies, the urgency for accurate global-scale disaster predictions grows. This research presents a novel multimodal disaster prediction framework, combining weather statistics, satellite imagery, and textual insights. We particularly focus on "flood" and "landslide" predictions, given their ties to meteorological and topographical factors. The model is meticulously crafted b… ▽ More

    Submitted 28 September, 2023; originally announced September 2023.

  36. arXiv:2309.15203  [pdf, other

    cs.CR cs.HC eess.SP

    Eve Said Yes: AirBone Authentication for Head-Wearable Smart Voice Assistant

    Authors: Chenpei Huang, Hui Zhong, Jie Lian, Pavana Prakash, Dian Shi, Yuan Xu, Miao Pan

    Abstract: Recent advances in machine learning and natural language processing have fostered the enormous prosperity of smart voice assistants and their services, e.g., Alexa, Google Home, Siri, etc. However, voice spoofing attacks are deemed to be one of the major challenges of voice control security, and never stop evolving such as deep-learning-based voice conversion and speech synthesis techniques. To so… ▽ More

    Submitted 26 September, 2023; originally announced September 2023.

    Comments: 13 pages, 12 figures

  37. arXiv:2309.09737  [pdf, other

    cs.CV cs.AI cs.LG cs.RO

    RaTrack: Moving Object Detection and Tracking with 4D Radar Point Cloud

    Authors: Zhijun Pan, Fangqiang Ding, Hantao Zhong, Chris Xiaoxuan Lu

    Abstract: Mobile autonomy relies on the precise perception of dynamic environments. Robustly tracking moving objects in 3D world thus plays a pivotal role for applications like trajectory prediction, obstacle avoidance, and path planning. While most current methods utilize LiDARs or cameras for Multiple Object Tracking (MOT), the capabilities of 4D imaging radars remain largely unexplored. Recognizing the c… ▽ More

    Submitted 11 March, 2024; v1 submitted 18 September, 2023; originally announced September 2023.

    Comments: Accepted to ICRA 2024. 8 pages, 4 figures. Co-first authorship for Zhijun Pan, Fangqiang Ding and Hantao Zhong, listed randomly. See demo vide at: https://www.youtube.com/watch?v=_uSpbxOlLGw

  38. arXiv:2309.08230  [pdf, other

    cs.CR

    A Duty to Forget, a Right to be Assured? Exposing Vulnerabilities in Machine Unlearning Services

    Authors: Hongsheng Hu, Shuo Wang, Jiamin Chang, Haonan Zhong, Ruoxi Sun, Shuang Hao, Haojin Zhu, Minhui Xue

    Abstract: The right to be forgotten requires the removal or "unlearning" of a user's data from machine learning models. However, in the context of Machine Learning as a Service (MLaaS), retraining a model from scratch to fulfill the unlearning request is impractical due to the lack of training data on the service provider's side (the server). Furthermore, approximate unlearning further embraces a complex tr… ▽ More

    Submitted 15 January, 2024; v1 submitted 15 September, 2023; originally announced September 2023.

    Comments: To Appear in the Network and Distributed System Security Symposium (NDSS) 2024, San Diego, CA, USA

  39. arXiv:2308.14613  [pdf

    cs.CV

    MS-Net: A Multi-modal Self-supervised Network for Fine-Grained Classification of Aircraft in SAR Images

    Authors: Bingying Yue, Jianhao Li, Hao Shi, Yupei Wang, Honghu Zhong

    Abstract: Synthetic aperture radar (SAR) imaging technology is commonly used to provide 24-hour all-weather earth observation. However, it still has some drawbacks in SAR target classification, especially in fine-grained classification of aircraft: aircrafts in SAR images have large intra-class diversity and inter-class similarity; the number of effective samples is insufficient and it's hard to annotate. T… ▽ More

    Submitted 28 August, 2023; originally announced August 2023.

  40. arXiv:2308.12714  [pdf, other

    cs.CV cs.AI

    VIGC: Visual Instruction Generation and Correction

    Authors: Bin Wang, Fan Wu, Xiao Han, Jiahui Peng, Huaping Zhong, Pan Zhang, Xiaoyi Dong, Weijia Li, Wei Li, Jiaqi Wang, Conghui He

    Abstract: The integration of visual encoders and large language models (LLMs) has driven recent progress in multimodal large language models (MLLMs). However, the scarcity of high-quality instruction-tuning data for vision-language tasks remains a challenge. The current leading paradigm, such as LLaVA, relies on language-only GPT-4 to generate data, which requires pre-annotated image captions and detection… ▽ More

    Submitted 4 February, 2024; v1 submitted 24 August, 2023; originally announced August 2023.

    Comments: Accepted by AAAI 2024, Project Website: https://opendatalab.github.io/VIGC, Code and Pretrained Model: https://github.com/opendatalab/VIGC

  41. PTransIPs: Identification of phosphorylation sites enhanced by protein PLM embeddings

    Authors: Ziyang Xu, Haitian Zhong, Bingrui He, Xueying Wang, Tianchi Lu

    Abstract: Phosphorylation is pivotal in numerous fundamental cellular processes and plays a significant role in the onset and progression of various diseases. The accurate identification of these phosphorylation sites is crucial for unraveling the molecular mechanisms within cells and during viral infections, potentially leading to the discovery of novel therapeutic targets. In this study, we develop PTrans… ▽ More

    Submitted 13 March, 2024; v1 submitted 8 August, 2023; originally announced August 2023.

  42. arXiv:2308.02356  [pdf, other

    cs.CV eess.IV

    T-UNet: Triplet UNet for Change Detection in High-Resolution Remote Sensing Images

    Authors: Huan Zhong, Chen Wu

    Abstract: Remote sensing image change detection aims to identify the differences between images acquired at different times in the same area. It is widely used in land management, environmental monitoring, disaster assessment and other fields. Currently, most change detection methods are based on Siamese network structure or early fusion structure. Siamese structure focuses on extracting object features at… ▽ More

    Submitted 4 August, 2023; originally announced August 2023.

    Comments: 21 pages, 11 figures, 6 tables

  43. arXiv:2308.02041  [pdf

    cs.CY cs.AI

    Regulating AI: Applying insights from behavioural economics and psychology to the application of article 5 of the EU AI Act

    Authors: Huixin Zhong, Eamonn O'Neill, Janina A. Hoffmann

    Abstract: Article 5 of the European Union's Artificial Intelligence Act is intended to regulate AI use to prevent potentially harmful consequences. Nevertheless, applying this legislation practically is likely to be challenging because of ambiguously used terminologies and because it fails to specify which manipulation techniques may be invoked by AI, potentially leading to significant harm. This paper aims… ▽ More

    Submitted 25 February, 2024; v1 submitted 24 July, 2023; originally announced August 2023.

    Comments: This paper was accepted for publication by AAAI 2024 paper on December of 2023

  44. arXiv:2306.13518  [pdf, other

    cs.CV cs.RO

    Segmentation and Tracking of Vegetable Plants by Exploiting Vegetable Shape Feature for Precision Spray of Agricultural Robots

    Authors: Nan Hu, Daobilige Su, Shuo Wang, Xuechang Wang, Huiyu Zhong, Zimeng Wang, Yongliang Qiao, Yu Tan

    Abstract: With the increasing deployment of agricultural robots, the traditional manual spray of liquid fertilizer and pesticide is gradually being replaced by agricultural robots. For robotic precision spray application in vegetable farms, accurate plant phenotyping through instance segmentation and robust plant tracking are of great importance and a prerequisite for the following spray action. Regarding t… ▽ More

    Submitted 26 June, 2023; v1 submitted 23 June, 2023; originally announced June 2023.

  45. arXiv:2306.06836  [pdf, other

    cs.LG cs.AI stat.ML

    Tackling Heavy-Tailed Rewards in Reinforcement Learning with Function Approximation: Minimax Optimal and Instance-Dependent Regret Bounds

    Authors: Jiayi Huang, Han Zhong, Liwei Wang, Lin F. Yang

    Abstract: While numerous works have focused on devising efficient algorithms for reinforcement learning (RL) with uniformly bounded rewards, it remains an open question whether sample or time-efficient algorithms for RL with large state-action space exist when the rewards are \emph{heavy-tailed}, i.e., with only finite $(1+ε)$-th moments for some $ε\in(0,1]$. In this work, we address the challenge of such r… ▽ More

    Submitted 7 March, 2024; v1 submitted 11 June, 2023; originally announced June 2023.

    Comments: NeurIPS 2023

  46. FollowNet: A Comprehensive Benchmark for Car-Following Behavior Modeling

    Authors: Xianda Chen, Meixin Zhu, Kehua Chen, Pengqin Wang, Hongliang Lu, Hui Zhong, Xu Han, Yinhai Wang

    Abstract: Car-following is a control process in which a following vehicle (FV) adjusts its acceleration to keep a safe distance from the lead vehicle (LV). Recently, there has been a booming of data-driven models that enable more accurate modeling of car-following through real-world driving datasets. Although there are several public datasets available, their formats are not always consistent, making it cha… ▽ More

    Submitted 25 May, 2023; originally announced June 2023.

  47. arXiv:2305.18258  [pdf, other

    cs.LG cs.AI cs.GT math.OC stat.ML

    Maximize to Explore: One Objective Function Fusing Estimation, Planning, and Exploration

    Authors: Zhihan Liu, Miao Lu, Wei Xiong, Han Zhong, Hao Hu, Shenao Zhang, Sirui Zheng, Zhuoran Yang, Zhaoran Wang

    Abstract: In online reinforcement learning (online RL), balancing exploration and exploitation is crucial for finding an optimal policy in a sample-efficient way. To achieve this, existing sample-efficient online RL algorithms typically consist of three components: estimation, planning, and exploration. However, in order to cope with general function approximators, most of them involve impractical algorithm… ▽ More

    Submitted 25 October, 2023; v1 submitted 29 May, 2023; originally announced May 2023.

  48. arXiv:2305.09659  [pdf, ps, other

    cs.LG cs.AI math.OC stat.ML

    Double Pessimism is Provably Efficient for Distributionally Robust Offline Reinforcement Learning: Generic Algorithm and Robust Partial Coverage

    Authors: Jose Blanchet, Miao Lu, Tong Zhang, Han Zhong

    Abstract: In this paper, we study distributionally robust offline reinforcement learning (robust offline RL), which seeks to find an optimal policy purely from an offline dataset that can perform well in perturbed environments. In specific, we propose a generic algorithm framework called Doubly Pessimistic Model-based Policy Optimization ($P^2MPO$), which features a novel combination of a flexible model est… ▽ More

    Submitted 22 August, 2023; v1 submitted 16 May, 2023; originally announced May 2023.

    Comments: V2 adds results on robust offline Markov games

  49. arXiv:2305.08841  [pdf, other

    cs.LG cs.AI math.OC stat.ML

    A Theoretical Analysis of Optimistic Proximal Policy Optimization in Linear Markov Decision Processes

    Authors: Han Zhong, Tong Zhang

    Abstract: The proximal policy optimization (PPO) algorithm stands as one of the most prosperous methods in the field of reinforcement learning (RL). Despite its success, the theoretical understanding of PPO remains deficient. Specifically, it is unclear whether PPO or its optimistic variants can effectively solve linear Markov decision processes (MDPs), which are arguably the simplest models in RL with func… ▽ More

    Submitted 8 June, 2023; v1 submitted 15 May, 2023; originally announced May 2023.

  50. arXiv:2304.06484  [pdf, other

    cs.CY cs.LG cs.SI econ.GN stat.AP

    Exploring Gender and Race Biases in the NFT Market

    Authors: Howard Zhong, Mark Hamilton

    Abstract: Non-Fungible Tokens (NFTs) are non-interchangeable assets, usually digital art, which are stored on the blockchain. Preliminary studies find that female and darker-skinned NFTs are valued less than their male and lighter-skinned counterparts. However, these studies analyze only the CryptoPunks collection. We test the statistical significance of race and gender biases in the prices of CryptoPunks a… ▽ More

    Submitted 29 March, 2023; originally announced April 2023.