Skip to main content

Showing 1–49 of 49 results for author: Tong, W

Searching in archive cs. Search in all archives.
.
  1. arXiv:2404.16821  [pdf, other

    cs.CV

    How Far Are We to GPT-4V? Closing the Gap to Commercial Multimodal Models with Open-Source Suites

    Authors: Zhe Chen, Weiyun Wang, Hao Tian, Shenglong Ye, Zhangwei Gao, Erfei Cui, Wenwen Tong, Kongzhi Hu, Jiapeng Luo, Zheng Ma, Ji Ma, Jiaqi Wang, Xiaoyi Dong, Hang Yan, Hewei Guo, Conghui He, Botian Shi, Zhenjiang Jin, Chao Xu, Bin Wang, Xingjian Wei, Wei Li, Wenjian Zhang, Bo Zhang, Pinlong Cai , et al. (10 additional authors not shown)

    Abstract: In this report, we introduce InternVL 1.5, an open-source multimodal large language model (MLLM) to bridge the capability gap between open-source and proprietary commercial models in multimodal understanding. We introduce three simple improvements: (1) Strong Vision Encoder: we explored a continuous learning strategy for the large-scale vision foundation model -- InternViT-6B, boosting its visual… ▽ More

    Submitted 29 April, 2024; v1 submitted 25 April, 2024; originally announced April 2024.

    Comments: Technical report

  2. arXiv:2402.13533  [pdf, other

    cs.LG cs.AI cs.CL cs.DC

    FinGPT-HPC: Efficient Pretraining and Finetuning Large Language Models for Financial Applications with High-Performance Computing

    Authors: Xiao-Yang Liu, Jie Zhang, Guoxuan Wang, Weiqing Tong, Anwar Walid

    Abstract: Large language models (LLMs) are computationally intensive. The computation workload and the memory footprint grow quadratically with the dimension (layer width). Most of LLMs' parameters come from the linear layers of the transformer structure and are highly redundant. These linear layers contribute more than 80% of the computation workload and 99% of the model size. To pretrain and finetune LLMs… ▽ More

    Submitted 21 February, 2024; originally announced February 2024.

  3. arXiv:2402.04991  [pdf, other

    cs.HC

    Exploring the Opportunity of Augmented Reality (AR) in Supporting Older Adults Explore and Learn Smartphone Applications

    Authors: Xiaofu Jin, Wai Tong, Xiaoying Wei, Xian Wang, Emily Kuang, Xiaoyu Mo, Huamin Qu, Mingming Fan

    Abstract: The global aging trend compels older adults to navigate the evolving digital landscape, presenting a substantial challenge in mastering smartphone applications. While Augmented Reality (AR) holds promise for enhancing learning and user experience, its role in aiding older adults' smartphone app exploration remains insufficiently explored. Therefore, we conducted a two-phase study: (1) a workshop w… ▽ More

    Submitted 7 February, 2024; originally announced February 2024.

  4. arXiv:2312.12381  [pdf, other

    cs.CR

    Blockchain-Based Identity Authentication Oriented to Multi-Cluster UAV Networking

    Authors: Zesong Dong, Wei Tong, Zhiwei Zhang, Jian Li, Weidong Yang, Yulong Shen

    Abstract: Unmanned Aerial Vehicle (UAV) networking is increasingly used in field environments such as power inspection, agricultural plant protection, and emergency rescue. To guarantee UAV networking security, UAV identity authentication attracts wide attention, especially in the field environment without perfect infrastructure. Some blockchain-based UAV identity authentication solutions are proposed to es… ▽ More

    Submitted 14 November, 2023; originally announced December 2023.

  5. arXiv:2312.09245  [pdf, other

    cs.CV

    DriveMLM: Aligning Multi-Modal Large Language Models with Behavioral Planning States for Autonomous Driving

    Authors: Wenhai Wang, Jiangwei Xie, ChuanYang Hu, Haoming Zou, Jianan Fan, Wenwen Tong, Yang Wen, Silei Wu, Hanming Deng, Zhiqi Li, Hao Tian, Lewei Lu, Xizhou Zhu, Xiaogang Wang, Yu Qiao, Jifeng Dai

    Abstract: Large language models (LLMs) have opened up new possibilities for intelligent agents, endowing them with human-like thinking and cognitive abilities. In this work, we delve into the potential of large language models (LLMs) in autonomous driving (AD). We introduce DriveMLM, an LLM-based AD framework that can perform close-loop autonomous driving in realistic simulators. To this end, (1) we bridge… ▽ More

    Submitted 25 December, 2023; v1 submitted 14 December, 2023; originally announced December 2023.

    Comments: Technical Report

  6. arXiv:2312.06200  [pdf, ps, other

    cs.IT

    Achieving the Fundamental Limit of Lossless Analog Compression via Polarization

    Authors: Shuai Yuan, Liuquan Yao, Yuan Li, Huazi Zhang, Jun Wang, Wen Tong, Zhiming Ma

    Abstract: In this paper, we study the lossless analog compression for i.i.d. nonsingular signals via the polarization-based framework. We prove that for nonsingular source, the error probability of maximum a posteriori (MAP) estimation polarizes under the Hadamard transform, which extends the polarization phenomenon to analog domain. Building on this insight, we propose partial Hadamard compression and deve… ▽ More

    Submitted 19 January, 2024; v1 submitted 11 December, 2023; originally announced December 2023.

    Comments: 48 pages, 5 figures. This work was presented in part at the 2023 IEEE Global Communications Conference

  7. arXiv:2311.13106  [pdf, other

    cs.NI

    Ten issues of NetGPT

    Authors: Wen Tong, Chenghui Peng, Tingting Yang, Fei Wang, Juan Deng, Rongpeng Li, Lu Yang, Honggang Zhang, Dong Wang, Ming Ai, Li Yang, Guangyi Liu, Yang Yang, Yao Xiao, Liexiang Yue, Wanfei Sun, Zexu Li, Wenwen Sun

    Abstract: With the rapid development and application of foundation models (FMs), it is foreseeable that FMs will play an important role in future wireless communications. As current Artificial Intelligence (AI) algorithms applied in wireless networks are dedicated models that aim for different neural network architectures and objectives, drawbacks in aspects of generality, performance gain, management, coll… ▽ More

    Submitted 21 November, 2023; originally announced November 2023.

  8. arXiv:2311.04320  [pdf, other

    cs.RO

    Proprioceptive Invariant Robot State Estimation

    Authors: Tzu-Yuan Lin, Tingjun Li, Wenzhe Tong, Maani Ghaffari

    Abstract: This paper reports on developing a real-time invariant proprioceptive robot state estimation framework called DRIFT. A didactic introduction to invariant Kalman filtering is provided to make this cutting-edge symmetry-preserving approach accessible to a broader range of robotics applications. Furthermore, this work dives into the development of a proprioceptive state estimation framework for dead… ▽ More

    Submitted 20 February, 2024; v1 submitted 7 November, 2023; originally announced November 2023.

  9. Augmenting Static Visualizations with PapARVis Designer

    Authors: Chen Zhu-Tian, Wai Tong, Qianwen Wang, Benjamin Bach, Huamin Qu

    Abstract: This paper presents an authoring environment for augmenting static visualizations with virtual content in augmented reality. Augmenting static visualizations can leverage the best of both physical and digital worlds, but its creation currently involves different tools and devices, without any means to explicitly design and debug both static and virtual content simultaneously. To address these issu… ▽ More

    Submitted 7 October, 2023; originally announced October 2023.

  10. Text-Only Domain Adaptation for End-to-End Speech Recognition through Down-Sampling Acoustic Representation

    Authors: Jiaxu Zhu, Weinan Tong, Yaoxun Xu, Changhe Song, Zhiyong Wu, Zhao You, Dan Su, Dong Yu, Helen Meng

    Abstract: Mapping two modalities, speech and text, into a shared representation space, is a research topic of using text-only data to improve end-to-end automatic speech recognition (ASR) performance in new domains. However, the length of speech representation and text representation is inconsistent. Although the previous method up-samples the text representation to align with acoustic modality, it may not… ▽ More

    Submitted 7 October, 2023; v1 submitted 4 September, 2023; originally announced September 2023.

    Comments: Proceedings of Interspeech. arXiv admin note: text overlap with arXiv:2309.01437

  11. arXiv:2306.02851  [pdf, other

    cs.CV cs.RO

    Scene as Occupancy

    Authors: Chonghao Sima, Wenwen Tong, Tai Wang, Li Chen, Silei Wu, Hanming Deng, Yi Gu, Lewei Lu, Ping Luo, Dahua Lin, Hongyang Li

    Abstract: Human driver can easily describe the complex traffic scene by visual system. Such an ability of precise perception is essential for driver's planning. To achieve this, a geometry-aware representation that quantizes the physical 3D scene into structured grid map with semantic labels per cell, termed as 3D Occupancy, would be desirable. Compared to the form of bounding box, a key insight behind occu… ▽ More

    Submitted 26 June, 2023; v1 submitted 5 June, 2023; originally announced June 2023.

    Comments: Project link: https://github.com/OpenDriveLab/OccNet

  12. arXiv:2303.10340  [pdf, other

    cs.CV

    3D Data Augmentation for Driving Scenes on Camera

    Authors: Wenwen Tong, Jiangwei Xie, Tianyu Li, Hanming Deng, Xiangwei Geng, Ruoyi Zhou, Dingchen Yang, Bo Dai, Lewei Lu, Hongyang Li

    Abstract: Driving scenes are extremely diverse and complicated that it is impossible to collect all cases with human effort alone. While data augmentation is an effective technique to enrich the training data, existing methods for camera data in autonomous driving applications are confined to the 2D image plane, which may not optimally increase data diversity in 3D real-world scenarios. To this end, we prop… ▽ More

    Submitted 18 March, 2023; originally announced March 2023.

  13. arXiv:2302.13549  [pdf

    cs.DS

    Random-Order Enumeration for Self-Reducible NP-Problems

    Authors: Pengyu Chen, Dongjing Miao, Weitian Tong, Zizheng Guo, Jianzhong Li, Zhipeng Cai

    Abstract: In plenty of data analysis tasks, a basic and time-consuming process is to produce a large number of solutions and feed them into downstream processing. Various enumeration algorithms have been developed for this purpose. An enumeration algorithm produces all solutions of a problem instance without repetition. To be a statistically meaningful representation of the solution space, solutions are req… ▽ More

    Submitted 27 February, 2023; originally announced February 2023.

  14. arXiv:2302.08743   

    cs.LG

    Multi-View Clustering from the Perspective of Mutual Information

    Authors: Fu Lele, Zhang Lei, Wang Tong, Chen Chuan, Zhang Chuanfu, Zheng Zibin

    Abstract: Exploring the complementary information of multi-view data to improve clustering effects is a crucial issue in multi-view clustering. In this paper, we propose a novel model based on information theory termed Informative Multi-View Clustering (IMVC), which extracts the common and view-specific information hidden in multi-view data and constructs a clustering-oriented comprehensive representation.… ▽ More

    Submitted 29 May, 2023; v1 submitted 17 February, 2023; originally announced February 2023.

    Comments: We think the paper writing isn't good enough, so we would like to withdraw the paper and renew the writing manner

  15. arXiv:2302.01966  [pdf, other

    cs.HC

    Towards an Understanding of Distributed Asymmetric Collaborative Visualization on Problem-solving

    Authors: Wai Tong, Meng Xia, Kam Kwai Wong, Doug A. Bowman, Ting-Chuen Pong, Huamin Qu, Yalong Yang

    Abstract: This paper provided empirical knowledge of the user experience for using collaborative visualization in a distributed asymmetrical setting through controlled user studies. With the ability to access various computing devices, such as Virtual Reality (VR) head-mounted displays, scenarios emerge when collaborators have to or prefer to use different computing environments in different places. However… ▽ More

    Submitted 3 February, 2023; originally announced February 2023.

    Comments: 11 pages, 12 figures, accepted at IEEE VR 2023

  16. arXiv:2211.06769  [pdf, other

    eess.IV cs.CV

    Realistic Bokeh Effect Rendering on Mobile GPUs, Mobile AI & AIM 2022 challenge: Report

    Authors: Andrey Ignatov, Radu Timofte, Jin Zhang, Feng Zhang, Gaocheng Yu, Zhe Ma, Hongbin Wang, Minsu Kwon, Haotian Qian, Wentao Tong, Pan Mu, Ziping Wang, Guangjing Yan, Brian Lee, Lei Fei, Huaijin Chen, Hyebin Cho, Byeongjun Kwon, Munchurl Kim, Mingyang Qian, Huixin Ma, Yanan Li, Xiaotao Wang, Lei Lei

    Abstract: As mobile cameras with compact optics are unable to produce a strong bokeh effect, lots of interest is now devoted to deep learning-based solutions for this task. In this Mobile AI challenge, the target was to develop an efficient end-to-end AI-based bokeh effect rendering approach that can run on modern smartphone GPUs using TensorFlow Lite. The participants were provided with a large-scale EBB!… ▽ More

    Submitted 7 November, 2022; originally announced November 2022.

    Comments: arXiv admin note: substantial text overlap with arXiv:2211.03885; text overlap with arXiv:2105.07809, arXiv:2211.04470, arXiv:2211.05256, arXiv:2211.05910

  17. arXiv:2209.15140  [pdf, other

    cs.RO eess.SY

    Fully Proprioceptive Slip-Velocity-Aware State Estimation for Mobile Robots via Invariant Kalman Filtering and Disturbance Observer

    Authors: Xihang Yu, Sangli Teng, Theodor Chakhachiro, Wenzhe Tong, Tingjun Li, Tzu-Yuan Lin, Sarah Koehler, Manuel Ahumada, Jeffrey M. Walls, Maani Ghaffari

    Abstract: This paper develops a novel slip estimator using the invariant observer design theory and Disturbance Observer (DOB). The proposed state estimator for mobile robots is fully proprioceptive and combines data from an inertial measurement unit and body velocity within a Right Invariant Extended Kalman Filter (RI-EKF). By embedding the slip velocity into $\mathrm{SE}_3(3)$ matrix Lie group, the develo… ▽ More

    Submitted 30 September, 2023; v1 submitted 29 September, 2022; originally announced September 2022.

    Comments: The work will be presented in IROS2023. github repository at https://github.com/UMich-CURLY/slip_detection_DOB. arXiv admin note: text overlap with arXiv:1805.10410 by other authors

  18. arXiv:2208.10603  [pdf, other

    cs.HC

    Exploring Interactions with Printed Data Visualizations in Augmented Reality

    Authors: Wai Tong, Zhutian Chen, Meng Xia, Leo Yu-Ho Lo, Linping Yuan, Benjamin Bach, Huamin Qu

    Abstract: This paper presents a design space of interaction techniques to engage with visualizations that are printed on paper and augmented through Augmented Reality. Paper sheets are widely used to deploy visualizations and provide a rich set of tangible affordances for interactions, such as touch, folding, tilting, or stacking. At the same time, augmented reality can dynamically update visualization cont… ▽ More

    Submitted 22 August, 2022; originally announced August 2022.

    Comments: 11 pages, 9 figures, 1 table, accepted at IEEE VIS 2022

  19. arXiv:2207.11238  [pdf

    cs.CV cs.AI cs.LG eess.IV

    Improved lightweight identification of agricultural diseases based on MobileNetV3

    Authors: Yuhang Jiang, Wenping Tong

    Abstract: At present, the identification of agricultural pests and diseases has the problem that the model is not lightweight enough and difficult to apply. Based on MobileNetV3, this paper introduces the Coordinate Attention block. The parameters of MobileNetV3-large are reduced by 22%, the model size is reduced by 19.7%, and the accuracy is improved by 0.92%. The parameters of MobileNetV3-small are reduce… ▽ More

    Submitted 19 July, 2022; originally announced July 2022.

    Comments: Accepted by CAIBDA 2022

  20. arXiv:2206.06897  [pdf, other

    cs.IT

    On the Message Passing Efficiency of Polar and Low-Density Parity-Check Decoders

    Authors: Dawei Yin, Yuan Li, Xianbin Wang, Jiajie Tong, Huazi Zhang, Jun Wang, Guanghui Wang, Jun Chen, Guiying Yan, Zhiming Ma, Wen Tong

    Abstract: This study focuses on the efficiency of message-passing-based decoding algorithms for polar and low-density parity-check (LDPC) codes. Both successive cancellation (SC) and belief propagation (BP) decoding algorithms are studied {in} the message-passing framework. Counter-intuitively, SC decoding demonstrates the highest decoding efficiency, although it was considered a weak decoder {in terms of}… ▽ More

    Submitted 20 April, 2023; v1 submitted 14 June, 2022; originally announced June 2022.

  21. arXiv:2205.14407  [pdf, ps, other

    cs.DS

    An efficient polynomial-time approximation scheme for parallel multi-stage open shops

    Authors: Jianming Dong, Ruyan Jin, Guohui Lin, Bing Su, Weitian Tong, Yao Xu

    Abstract: Various new scheduling problems have been arising from practical production processes and spawning new research areas in the scheduling field. We study the parallel multi-stage open shops problem, which generalizes the classic open shop scheduling and parallel machine scheduling problems. Given m identical k-stage open shops and a set of n jobs, we aim to process all jobs on these open shops with… ▽ More

    Submitted 28 May, 2022; originally announced May 2022.

  22. arXiv:2205.06523  [pdf, ps, other

    cs.IT

    Deterministic Identification over Channels without CSI

    Authors: Yuan Li, Xianbin Wang, Huazi Zhang, Jun Wang, Wen Tong, Guiying Yan, Zhiming Ma

    Abstract: Identification capacities of randomized and deterministic identification were proved to exceed channel capacity for Gaussian channels \emph{with} channel side information (CSI). In this work, we extend deterministic identification to the block fading channels without CSI by applying identification codes for both channel estimation and user identification. We prove that identification capacity is a… ▽ More

    Submitted 11 August, 2022; v1 submitted 13 May, 2022; originally announced May 2022.

  23. arXiv:2204.06049  [pdf, ps, other

    cs.IT

    On the Rate-Distortion-Perception Function

    Authors: Jun Chen, Lei Yu, Jia Wang, Wuxian Shi, Yiqun Ge, Wen Tong

    Abstract: Rate-distortion-perception theory generalizes Shannon's rate-distortion theory by introducing a constraint on the perceptual quality of the output. The perception constraint complements the conventional distortion constraint and aims to enforce distribution-level consistencies. In this new theory, the information-theoretic limit is characterized by the rate-distortion-perception function. Although… ▽ More

    Submitted 12 April, 2022; originally announced April 2022.

  24. arXiv:2204.00856  [pdf, other

    cs.HC

    ComputableViz: Mathematical Operators as a Formalism for Visualization Processing and Analysis

    Authors: Aoyu Wu, Wai Tong, Haotian Li, Dominik Moritz, Yong Wang, Huamin Qu

    Abstract: Data visualizations are created and shared on the web at an unprecedented speed, raising new needs and questions for processing and analyzing visualizations after they have been generated and digitized. However, existing formalisms focus on operating on a single visualization instead of multiple visualizations, making it challenging to perform analysis tasks such as sorting and clustering visualiz… ▽ More

    Submitted 2 April, 2022; originally announced April 2022.

    Comments: 15 pages, 12 figures. In the ACM Conference on Human Factors in Computing Systems (CHI) 2022

  25. arXiv:2203.00573  [pdf, other

    cs.LG cond-mat.dis-nn stat.ML

    Contrasting random and learned features in deep Bayesian linear regression

    Authors: Jacob A. Zavatone-Veth, William L. Tong, Cengiz Pehlevan

    Abstract: Understanding how feature learning affects generalization is among the foremost goals of modern deep learning theory. Here, we study how the ability to learn representations affects the generalization performance of a simple class of models: deep Bayesian linear neural networks trained on unstructured Gaussian data. By comparing deep random feature models to deep networks in which all layers are t… ▽ More

    Submitted 16 June, 2022; v1 submitted 1 March, 2022; originally announced March 2022.

    Comments: 35 pages, 7 figures. v2: minor typos corrected and references added; published in PRE

    Journal ref: Physical Review E 105, 064118 (2022)

  26. arXiv:2201.10929  [pdf, other

    cs.IT eess.SP

    Task-Oriented Image Semantic Communication Based on Rate-Distortion Theory

    Authors: Fangfang Liu, Wanjie Tong, Yang Yang, Zhengfen Sun, Caili Guo

    Abstract: Task-oriented image semantic communication is a new communication paradigm, which aims to transmit semantics for artificial intelligent (AI) tasks while ignoring the reconstruction quality of the images. However, in some applications, such as autonomous driving, both image reconstruction quality and the performance of the followed AI tasks must be simultaneously considered. To tackle this challeng… ▽ More

    Submitted 1 December, 2022; v1 submitted 26 January, 2022; originally announced January 2022.

    Comments: 17 pages, 8 figures

  27. arXiv:2201.07784  [pdf, other

    cs.IT

    On Distributed Lossy Coding of Symmetrically Correlated Gaussian Sources

    Authors: Siyao Zhou, Sadaf Salehkalaibar, Jingjing Qian, Jun Chen, Wuxian Shi, Yiqun Ge, Wen Tong

    Abstract: A distributed lossy compression network with $L$ encoders and a decoder is considered. Each encoder observes a source and sends a compressed version to the decoder. The decoder produces a joint reconstruction of target signals with the mean squared error distortion below a given threshold. It is assumed that the observed sources can be expressed as the sum of target signals and corruptive noises w… ▽ More

    Submitted 3 June, 2022; v1 submitted 19 January, 2022; originally announced January 2022.

  28. A polynomial-time approximation scheme for parallel two-stage flowshops under makespan constraint

    Authors: Weitian Tong, Yao Xu, Huili Zhang

    Abstract: As a hybrid of the Parallel Two-stage Flowshop problem and the Multiple Knapsack problem, we investigate the scheduling of parallel two-stage flowshops under makespan constraint, which was motivated by applications in cloud computing and introduced by Chen et al. [3] recently. A set of two-stage jobs are selected and scheduled on parallel two-stage flowshops to achieve the maximum total profit whi… ▽ More

    Submitted 18 May, 2022; v1 submitted 11 January, 2022; originally announced January 2022.

    Comments: Theoretical Computer Science (2022)

  29. arXiv:2201.01389  [pdf, other

    cs.IT cs.LG eess.SP

    Semantic Communications: Principles and Challenges

    Authors: Zhijin Qin, Xiaoming Tao, Jianhua Lu, Wen Tong, Geoffrey Ye Li

    Abstract: Semantic communication, regarded as the breakthrough beyond the Shannon paradigm, aims at the successful transmission of semantic information conveyed by the source rather than the accurate reception of each single symbol or bit regardless of its meaning. This article provides an overview on semantic communications. After a brief review of Shannon information theory, we discuss semantic communicat… ▽ More

    Submitted 27 June, 2022; v1 submitted 30 December, 2021; originally announced January 2022.

  30. arXiv:2112.10087  [pdf, other

    cs.CV

    Reasoning Structural Relation for Occlusion-Robust Facial Landmark Localization

    Authors: Congcong Zhu, Xiaoqiang Li, Jide Li, Songmin Dai, Weiqin Tong

    Abstract: In facial landmark localization tasks, various occlusions heavily degrade the localization accuracy due to the partial observability of facial features. This paper proposes a structural relation network (SRN) for occlusion-robust landmark localization. Unlike most existing methods that simply exploit the shape constraint, the proposed SRN aims to capture the structural relations among different fa… ▽ More

    Submitted 19 December, 2021; originally announced December 2021.

    Comments: Accepted by Pattern recognition

  31. arXiv:2110.12610  [pdf, other

    cs.IT eess.SP

    Antenna Array Enabled Space/Air/Ground Communications and Networking for 6G

    Authors: Zhenyu Xiao, Zhu Han, Arumugam Nallanathan, Octavia A. Dobre, Bruno Clerckx, Jinho Choi, Chong He, Wen Tong

    Abstract: Antenna arrays have a long history of more than 100 years and have evolved closely with the development of electronic and information technologies, playing an indispensable role in wireless communications and radar. With the rapid development of electronic and information technologies, the demand for all-time, all-domain, and full-space network services has exploded, and new communication requirem… ▽ More

    Submitted 26 March, 2022; v1 submitted 24 October, 2021; originally announced October 2021.

  32. Exploration of Artificial Intelligence-oriented Power System Dynamic Simulators

    Authors: Tannan Xiao, Ying Chen, Jianquan Wang, Shaowei Huang, Weilin Tong, Tirui He

    Abstract: With the rapid development of artificial intelligence (AI), it is foreseeable that the accuracy and efficiency of dynamic analysis for future power system will be greatly improved by the integration of dynamic simulators and AI. To explore the interaction mechanism of power system dynamic simulations and AI, a general design of an AI-oriented power system dynamic simulator is proposed, which consi… ▽ More

    Submitted 6 July, 2022; v1 submitted 3 October, 2021; originally announced October 2021.

    Comments: 10 pages, 8 figures, 1 table. Accepted by Journal of Modern Power System and Clean Energy

  33. arXiv:2109.11320  [pdf, other

    cs.IT

    Nine Challenges in Artificial Intelligence and Wireless Communications for 6G

    Authors: Wen Tong, Geoffrey Ye Li

    Abstract: In recent years, techniques developed in artificial intelligence (AI), especially those in machine learning (ML), have been successfully applied in various areas, leading to a widespread belief that AI will collectively play an important role in future wireless communications. To accomplish the aspiration, we present nine challenges to be addressed by the interdisciplinary areas of AI/ML and wirel… ▽ More

    Submitted 23 September, 2021; originally announced September 2021.

    Comments: 6 pages

  34. arXiv:2107.08607  [pdf, ps, other

    cs.IT cs.AR

    A unified polar decoder platform for low-power and low-cost devices

    Authors: Jiajie Tong, Qifan Zhang, Huazi Zhang, Rong Li, Jun Wang, Wen Tong

    Abstract: In this paper, we design a polar decoding platform for diverse application scenarios that require low-cost and low-power communications. Specifically, prevalent polar decoders such as successive cancellation (SC), SC-list (SCL) and Fano decoders are all supported under the same architecture. Unlike high-throughput or low-latency decoders that promote parallelism, this architecture promotes seriali… ▽ More

    Submitted 18 July, 2021; originally announced July 2021.

    Comments: 6 pages, 8 figures. Part of this paper was presented in an invited talk at the 2021 International Symposium on Information Theory (ISIT)

  35. arXiv:2107.08600  [pdf, ps, other

    cs.IT cs.AR

    Fast polar codes for terabits-per-second throughput communications

    Authors: Jiajie Tong, Xianbin Wang, Qifan Zhang, Huazi Zhang, Rong Li, Jun Wang, Wen Tong

    Abstract: Targeting high-throughput and low-power communications, we implement two successive cancellation (SC) decoders for polar codes. With $16nm$ ASIC technology, the area efficiency and energy efficiency are $4Tbps/mm^2$ and $0.63pJ/bit$, respectively, for the unrolled decoder, and $561Gbps/mm^2$ and $1.21pJ/bit$, respectively, for the recursive decoder. To achieve such a high throughput, a novel code… ▽ More

    Submitted 18 July, 2021; originally announced July 2021.

    Comments: 8 pages, 5 figures. Part of this paper was presented in an invited talk at the 2021 International Symposium on Information Theory (ISIT)

  36. arXiv:2104.01026  [pdf, other

    cs.CR

    SGBA: A Stealthy Scapegoat Backdoor Attack against Deep Neural Networks

    Authors: Ying He, Zhili Shen, Chang Xia, Jingyu Hua, Wei Tong, Sheng Zhong

    Abstract: Outsourced deep neural networks have been demonstrated to suffer from patch-based trojan attacks, in which an adversary poisons the training sets to inject a backdoor in the obtained model so that regular inputs can be still labeled correctly while those carrying a specific trigger are falsely given a target label. Due to the severity of such attacks, many backdoor detection and containment system… ▽ More

    Submitted 16 May, 2022; v1 submitted 2 April, 2021; originally announced April 2021.

  37. arXiv:2103.14300  [pdf, other

    cs.RO eess.SY

    Robotic Guide Dog: Leading a Human with Leash-Guided Hybrid Physical Interaction

    Authors: Anxing Xiao, Wenzhe Tong, Lizhi Yang, Jun Zeng, Zhongyu Li, Koushil Sreenath

    Abstract: An autonomous robot that is able to physically guide humans through narrow and cluttered spaces could be a big boon to the visually-impaired. Most prior robotic guiding systems are based on wheeled platforms with large bases with actuated rigid guiding canes. The large bases and the actuated arms limit these prior approaches from operating in narrow and cluttered environments. We propose a method… ▽ More

    Submitted 28 June, 2021; v1 submitted 26 March, 2021; originally announced March 2021.

    Comments: Accepted to 2021 International Conference on Robotics and Automation (ICRA 2021)

  38. arXiv:2103.14215  [pdf, other

    cs.IT

    The Complete Affine Automorphism Group of Polar Codes

    Authors: Yuan Li, Huazi Zhang, Rong Li, Jun Wang, Wen Tong, Guiying Yan, Zhiming Ma

    Abstract: Recently, a permutation-based successive cancellation (PSC) decoding framework for polar codes attaches much attention. It decodes several permuted codewords with independent successive cancellation (SC) decoders. Its latency thus can be reduced to that of SC decoding. However, the PSC framework is ineffective for permutations falling into the lower-triangular affine (LTA) automorphism group, as t… ▽ More

    Submitted 25 March, 2021; originally announced March 2021.

    Comments: 6 pages, 5 figures

  39. arXiv:2008.06678  [pdf, other

    cs.HC cs.GR

    MobileVisFixer: Tailoring Web Visualizations for Mobile Phones Leveraging an Explainable Reinforcement Learning Framework

    Authors: Aoyu Wu, Wai Tong, Tim Dwyer, Bongshin Lee, Petra Isenberg, Huamin Qu

    Abstract: We contribute MobileVisFixer, a new method to make visualizations more mobile-friendly. Although mobile devices have become the primary means of accessing information on the web, many existing visualizations are not optimized for small screens and can lead to a frustrating user experience. Currently, practitioners and researchers have to engage in a tedious and time-consuming process to ensure tha… ▽ More

    Submitted 15 August, 2020; originally announced August 2020.

    Comments: Accepted at IEEE VIS 2020 (Info VIS)

  40. arXiv:1905.10949  [pdf, other

    cs.LG cs.CL stat.ML

    QuesNet: A Unified Representation for Heterogeneous Test Questions

    Authors: Yu Yin, Qi Liu, Zhenya Huang, Enhong Chen, Wei Tong, Shijin Wang, Yu Su

    Abstract: Understanding learning materials (e.g. test questions) is a crucial issue in online learning systems, which can promote many applications in education domain. Unfortunately, many supervised approaches suffer from the problem of scarce human labeled data, whereas abundant unlabeled resources are highly underutilized. To alleviate this problem, an effective solution is to use pre-trained representat… ▽ More

    Submitted 26 May, 2019; originally announced May 2019.

  41. Volumetric Spline Parameterization for Isogeometric Analysis

    Authors: Maodong Pan, Falai Chen, Weihua Tong

    Abstract: Given the spline representation of the boundary of a three dimensional domain, constructing a volumetric spline parameterization of the domain (i.e., a map from a unit cube to the domain) with the given boundary is a fundamental problem in isogeometric analysis. A good domain parameterization should satisfy the following criteria: (1) the parameterization is a bijective map; and (2) the map has lo… ▽ More

    Submitted 2 February, 2019; originally announced February 2019.

  42. arXiv:1812.09353  [pdf, other

    cs.DS

    A local search $4/3$-approximation algorithm for the minimum $3$-path partition problem

    Authors: Yong Chen, Randy Goebel, Guohui Lin, Longcheng Liu, Bing Su, Weitian Tong, Yao Xu, An Zhang

    Abstract: Given a graph $G = (V, E)$, the $3$-path partition problem is to find a minimum collection of vertex-disjoint paths each of order at most $3$ to cover all the vertices of $V$. It is different from but closely related to the well-known $3$-set cover problem. The best known approximation algorithm for the $3$-path partition problem was proposed recently and has a ratio $13/9$. Here we present a loca… ▽ More

    Submitted 21 December, 2018; originally announced December 2018.

    Comments: 16 pages, 21 figures

  43. High-speed PAM4-based Optical SDM Interconnects with Directly Modulated Long-wavelength VCSEL

    Authors: Joris Van Kerrebrouck, Xiaodan Pang, Oskars Ozolins, Rui Lin, Aleksejs Udalcovs, Lu Zhang, Haolin Li, Silvia Spiga, Markus-Christian Amann, Lin Gan, Ming Tang, Songnian Fu, Richard Schatz, Gunnar Jacobsen, Sergei Popov, Deming Liu, Weijun Tong, Guy Torfs, Johan Bauwelinck, Jiajia Chen, Xin Yin

    Abstract: This paper reports the demonstration of high-speed PAM-4 transmission using a 1.5-μm single-mode vertical cavity surface emitting laser (SM-VCSEL) over multicore fiber with 7 cores over different distances. We have successfully generated up to 70 Gbaud 4-level pulse amplitude modulation (PAM-4) signals with a VCSEL in optical back-to-back, and transmitted 50 Gbaud PAM-4 signals over both 1-km disp… ▽ More

    Submitted 13 November, 2018; originally announced December 2018.

    Comments: 7 pages, accepted to publication in 'Journal of Lightwave Technology (JLT)

  44. arXiv:1811.04682  [pdf, other

    cs.CV cs.LG

    Learning Segmentation Masks with the Independence Prior

    Authors: Songmin Dai, Xiaoqiang Li, Lu Wang, Pin Wu, Weiqin Tong, Yimin Chen

    Abstract: An instance with a bad mask might make a composite image that uses it look fake. This encourages us to learn segmentation by generating realistic composite images. To achieve this, we propose a novel framework that exploits a new proposed prior called the independence prior based on Generative Adversarial Networks (GANs). The generator produces an image with multiple category-specific instance pro… ▽ More

    Submitted 13 November, 2018; v1 submitted 12 November, 2018; originally announced November 2018.

    Comments: 7+5 pages, 13 figures, Accepted to AAAI 2019

  45. arXiv:1704.05709  [pdf, ps, other

    cs.IT

    $β$-expansion: A Theoretical Framework for Fast and Recursive Construction of Polar Codes

    Authors: Gaoning He, Jean-Claude Belfiore, Xiaocheng Liu, Yiqun Ge, Ran Zhang, Ingmar Land, Ying Chen, Rong Li, Jun Wang, Ganghua Yang, Wen Tong

    Abstract: In this work, we introduce $β$-expansion, a notion borrowed from number theory, as a theoretical framework to study fast construction of polar codes based on a recursive structure of universal partial order (UPO) and polarization weight (PW) algorithm. We show that polar codes can be recursively constructed from UPO by continuously solving several polynomial equations at each recursive step. From… ▽ More

    Submitted 19 April, 2017; originally announced April 2017.

  46. arXiv:1610.09778  [pdf, other

    cs.LG cs.AI

    DPPred: An Effective Prediction Framework with Concise Discriminative Patterns

    Authors: Jingbo Shang, Meng Jiang, Wenzhu Tong, Jinfeng Xiao, Jian Peng, Jiawei Han

    Abstract: In the literature, two series of models have been proposed to address prediction problems including classification and regression. Simple models, such as generalized linear models, have ordinary performance but strong interpretability on a set of simple features. The other series, including tree-based models, organize numerical, categorical and high dimensional features into a comprehensive struct… ▽ More

    Submitted 30 October, 2016; originally announced October 2016.

  47. arXiv:1606.04157  [pdf, other

    cs.DS

    Single machine scheduling with job-dependent machine deterioration

    Authors: Wenchang Luo, Yao Xu, Weitian Tong, Guohui Lin

    Abstract: We consider the single machine scheduling problem with job-dependent machine deterioration. In the problem, we are given a single machine with an initial non-negative maintenance level, and a set of jobs each with a non-preemptive processing time and a machine deterioration. Such a machine deterioration quantifies the decrement in the machine maintenance level after processing the job. To avoid ma… ▽ More

    Submitted 13 June, 2016; originally announced June 2016.

    Comments: 15 pages

    Journal ref: Proceedings of ISAAC 2016, LIPIcs55, pages 1-13

  48. arXiv:1307.7089  [pdf, ps, other

    cs.DS

    An approximation algorithm for the Bandpass-2 problem

    Authors: Weitian Tong, Zhi-Zhong Chen, Lusheng Wang, Yinfeng Xu, Jiuping Xu, Randy Goebel, Guohui Lin

    Abstract: The general Bandpass-$B$ problem is NP-hard and can be approximated by a reduction into the weighted $B$-set packing problem, with a worst case performance ratio of $O(B^2)$. When $B = 2$, a maximum weight matching gives a 2-approximation to the problem. In this paper, we call the Bandpass-2 problem simply the Bandpass problem. The Bandpass problem can be viewed as a variation of the maximum trave… ▽ More

    Submitted 26 July, 2013; originally announced July 2013.

  49. arXiv:1304.3653  [pdf, ps, other

    cs.DS

    Algorithms for Cut Problems on Trees

    Authors: Iyad Kanj, Guohui Lin, Tian Liu, Weitian Tong, Ge Xia, Jinhui Xu, Boting Yang, Fenghui Zhang, Peng Zhang, Binhai Zhu

    Abstract: We study the {\sc multicut on trees} and the {\sc generalized multiway Cut on trees} problems. For the {\sc multicut on trees} problem, we present a parameterized algorithm that runs in time $O^{*}(ρ^k)$, where $ρ= \sqrt{\sqrt{2} + 1} \approx 1.555$ is the positive root of the polynomial $x^4-2x^2-1$. This improves the current-best algorithm of Chen et al. that runs in time $O^{*}(1.619^k)$. For t… ▽ More

    Submitted 12 April, 2013; originally announced April 2013.

    MSC Class: 68Q25