
Showing 1–50 of 360 results for author: Ma, T

Searching in archive cs.
  1. arXiv:2405.03119  [pdf, ps, other]

    cs.IT eess.SP

    DAFT-Spread Affine Frequency Division Multiple Access for Downlink Transmission

    Authors: Yiwei Tao, Miaowen Wen, Yao Ge, Tianqi Mao, Lixia Xiao, Jun Li

    Abstract: Affine frequency division multiplexing (AFDM) and orthogonal AFDM access (O-AFDMA) are promising techniques based on chirp signals, which are able to suppress the performance deterioration caused by Doppler shifts in high-mobility scenarios. However, the high peak-to-average power ratio (PAPR) in AFDM or O-AFDMA is still a crucial problem, which severely limits their practical applications. In thi…

    Submitted 5 May, 2024; originally announced May 2024.

  2. arXiv:2405.01668  [pdf, other]

    cs.CR cs.SE

    WitheredLeaf: Finding Entity-Inconsistency Bugs with LLMs

    Authors: Hongbo Chen, Yifan Zhang, Xing Han, Huanyao Rong, Yuheng Zhang, Tianhao Mao, Hang Zhang, XiaoFeng Wang, Luyi Xing, Xun Chen

    Abstract: Originating from semantic bugs, Entity-Inconsistency Bugs (EIBs) involve misuse of syntactically valid yet incorrect program entities, such as variable identifiers and function names, which often have security implications. Unlike straightforward syntactic vulnerabilities, EIBs are subtle and can remain undetected for years. Traditional detection methods, such as static analysis and dynamic testin…

    Submitted 2 May, 2024; originally announced May 2024.

  3. arXiv:2404.16271  [pdf]

    cs.CR cond-mat.mtrl-sci

    True random number generation using metastable 1T' molybdenum ditelluride

    Authors: Yang Liu, Pengyu Liu, Yingyi Wen, Zihan Liang, Songwei Liu, Lekai Song, Jingfang Pei, Xiaoyue Fan, Teng Ma, Gang Wang, Shuo Gao, Kong-Pang Pun, Xiaolong Chen, Guohua Hu

    Abstract: True random numbers play a critical role in secure cryptography. The generation relies on a stable and readily extractable entropy source. Here, from solution-processed structurally metastable 1T' MoTe2, we prove stable output of featureless, stochastic, and yet stable conductance noise at a broad temperature (down to 15 K) with minimal power consumption (down to 0.05 micro-W). Our characterizatio…

    Submitted 24 April, 2024; originally announced April 2024.

  4. arXiv:2404.15733  [pdf, other]

    cs.AR

    BlissCam: Boosting Eye Tracking Efficiency with Learned In-Sensor Sparse Sampling

    Authors: Yu Feng, Tianrui Ma, Yuhao Zhu, Xuan Zhang

    Abstract: Eye tracking is becoming an increasingly important task domain in emerging computing platforms such as Augmented/Virtual Reality (AR/VR). Today's eye tracking system suffers from long end-to-end tracking latency and can easily eat up half of the power budget of a mobile VR device. Most existing optimization efforts exclusively focus on the computation pipeline by optimizing the algorithm and/or de…

    Submitted 24 April, 2024; originally announced April 2024.

  5. arXiv:2404.12228  [pdf, other]

    cs.AI cs.LG

    Relationship Discovery for Drug Recommendation

    Authors: Xiang Li, Shunpan Liang, Yu Lei, Chen Li, Yulei Hou, Tengfei Ma

    Abstract: Medication recommendation systems are designed to deliver personalized drug suggestions that are closely aligned with individual patient needs. Previous studies have primarily concentrated on developing medication embeddings, achieving significant progress. Nonetheless, these approaches often fall short in accurately reflecting individual patient profiles, mainly due to challenges in distinguishin…

    Submitted 18 April, 2024; originally announced April 2024.

  6. arXiv:2404.11105  [pdf, other]

    cs.DB cs.DC

    XMiner: Efficient Directed Subgraph Matching with Pattern Reduction

    Authors: Pingpeng Yuan, Yujiang Wang, Tianyu Ma, Siyuan He, Ling Liu

    Abstract: Graph pattern matching, one of the fundamental graph mining problems, aims to extract structural patterns of interest from an input graph. The state-of-the-art graph matching algorithms and systems are mainly designed for undirected graphs. Directed graph matching is more complex than undirected graph matching because the edge direction must be taken into account before the exploration of each dir…

    Submitted 17 April, 2024; originally announced April 2024.

  7. arXiv:2404.06939  [pdf, other]

    cs.ET cs.AI

    Fast System Technology Co-Optimization Framework for Emerging Technology Based on Graph Neural Networks

    Authors: Tianliang Ma, Guangxi Fan, Xuguang Sun, Zhihui Deng, Kainlu Low, Leilai Shao

    Abstract: This paper proposes a fast system technology co-optimization (STCO) framework that optimizes power, performance, and area (PPA) for next-generation IC design, addressing the challenges and opportunities presented by novel materials and device architectures. We focus on accelerating the technology level of STCO using AI techniques, by employing graph neural network (GNN)-based approaches for both T…

    Submitted 10 April, 2024; originally announced April 2024.

    Comments: Accepted by the 61st Design Automation Conference (DAC)

  8. arXiv:2404.06772  [pdf, other]

    cs.RO

    Beyond Gait: Learning Knee Angle for Seamless Prosthesis Control in Multiple Scenarios

    Authors: Pengwei Wang, Yilong Chen, Wan Su, Jie Wang, Teng Ma, Haoyong Yu

    Abstract: Deep learning models have become a powerful tool in knee angle estimation for lower limb prostheses, owing to their adaptability across various gait phases and locomotion modes. Current methods utilize Multi-Layer Perceptrons (MLP), Long-Short Term Memory Networks (LSTM), and Convolutional Neural Networks (CNN), predominantly analyzing motion information from the thigh. Contrary to these approache…

    Submitted 10 April, 2024; originally announced April 2024.

    Comments: 8 pages, 6 figures, This work has been submitted to the IEEE-RAL for possible publication

  9. arXiv:2404.04969  [pdf, other]

    cs.LG cs.AI

    Temporal Generalization Estimation in Evolving Graphs

    Authors: Bin Lu, Tingyan Ma, Xiaoying Gan, Xinbing Wang, Yunqiang Zhu, Chenghu Zhou, Shiyu Liang

    Abstract: Graph Neural Networks (GNNs) are widely deployed in vast fields, but they often struggle to maintain accurate representations as graphs evolve. We theoretically establish a lower bound, proving that under mild conditions, representation distortion inevitably occurs over time. To estimate the temporal distortion without human annotation after deployment, one naive approach is to pre-train a recurre…

    Submitted 7 April, 2024; originally announced April 2024.

    Comments: Published as a conference paper at ICLR 2024

  10. arXiv:2404.03893  [pdf, other]

    cs.AI

    KGExplainer: Towards Exploring Connected Subgraph Explanations for Knowledge Graph Completion

    Authors: Tengfei Ma, Xiang Song, Wen Tao, Mufei Li, Jiani Zhang, Xiaoqin Pan, Jianxin Lin, Bosheng Song, Xiangxiang Zeng

    Abstract: Knowledge graph completion (KGC) aims to alleviate the inherent incompleteness of knowledge graphs (KGs), which is a critical task for various applications, such as recommendations on the web. Although knowledge graph embedding (KGE) models have demonstrated superior predictive performance on KGC tasks, these models infer missing links in a black-box manner that lacks transparency and accountabili…

    Submitted 5 April, 2024; originally announced April 2024.

    Comments: 13 pages, 7 figures, 11 tables. Under Review

  11. arXiv:2404.00474  [pdf, other]

    cs.LG cs.AI cs.CL stat.ML

    Linguistic Calibration of Language Models

    Authors: Neil Band, Xuechen Li, Tengyu Ma, Tatsunori Hashimoto

    Abstract: Language models (LMs) may lead their users to make suboptimal downstream decisions when they confidently hallucinate. This issue can be mitigated by having the LM verbally convey the probability that its claims are correct, but existing models cannot produce text with calibrated confidence statements. Through the lens of decision-making, we formalize linguistic calibration for long-form generation…

    Submitted 30 March, 2024; originally announced April 2024.

  12. arXiv:2403.19306  [pdf, other]

    cs.CV

    Sparse Generation: Making Pseudo Labels Sparse for weakly supervision with points

    Authors: Tian Ma, Chuyang Shang, Wanzhu Ren, Yuancheng Li, Jiiayi Yang, Jiali Qian

    Abstract: In recent years, research on point weakly supervised object detection (PWSOD) methods in the field of computer vision has attracted people's attention. However, existing pseudo labels generation methods perform poorly in a small amount of supervised annotation data and dense object detection tasks. We consider the generation of weakly supervised pseudo labels as the result of model's sparse output…

    Submitted 28 March, 2024; originally announced March 2024.

  13. arXiv:2403.17676  [pdf]

    physics.app-ph cs.ET

    Analysis on reservoir activation with the nonlinearity harnessed from solution-processed MoS2 devices

    Authors: Songwei Liu, Yang Liu, Yingyi Wen, Jingfang Pei, Pengyu Liu, Lekai Song, Xiaoyue Fan, Wenchen Yang, Danmei Pan, Teng Ma, Yue Lin, Gang Wang, Guohua Hu

    Abstract: Reservoir computing is a recurrent neural network that has been applied across various domains in machine learning. The implementation of reservoir computing, however, often demands heavy computations for activating the reservoir. Configuring physical reservoir networks and harnessing the nonlinearity from the underlying devices for activation is an emergent solution to address the computational c…

    Submitted 26 March, 2024; originally announced March 2024.

  14. arXiv:2403.01433  [pdf, other]

    cs.CE q-bio.NC

    BrainMass: Advancing Brain Network Analysis for Diagnosis with Large-scale Self-Supervised Learning

    Authors: Yanwu Yang, Chenfei Ye, Guinan Su, Ziyao Zhang, Zhikai Chang, Hairui Chen, Piu Chan, Yue Yu, Ting Ma

    Abstract: Foundation models pretrained on large-scale datasets via self-supervised learning demonstrate exceptional versatility across various tasks. Due to the heterogeneity and hard-to-collect medical data, this approach is especially beneficial for medical image analysis and neuroscience research, as it streamlines broad downstream tasks without the need for numerous costly annotations. However, there ha…

    Submitted 3 March, 2024; originally announced March 2024.

  15. arXiv:2403.00880  [pdf, other]

    cs.IR cs.AI

    Dual-Granularity Medication Recommendation Based on Causal Inference

    Authors: Shunpan Liang, Xiang Li, Xiang Li, Chen Li, Yu Lei, Yulei Hou, Tengfei Ma

    Abstract: As medical demands grow and machine learning technology advances, AI-based diagnostic and treatment systems are garnering increasing attention. Medication recommendation aims to integrate patients' long-term health records with medical knowledge, recommending accurate and safe medication combinations for specific conditions. However, most existing studies treat medication recommendation systems…

    Submitted 1 March, 2024; originally announced March 2024.

  16. Asphalt Concrete Characterization Using Digital Image Correlation: A Systematic Review of Best Practices, Applications, and Future Vision

    Authors: Siqi Wang, Zehui Zhu, Tao Ma, Jianwei Fan

    Abstract: Digital Image Correlation (DIC) is an optical technique that measures displacement and strain by tracking pattern movement in a sequence of captured images during testing. DIC has gained recognition in asphalt pavement engineering since the early 2000s. However, users often perceive the DIC technique as an out-of-box tool and lack a thorough understanding of its operational and measurement princip…

    Submitted 26 February, 2024; originally announced February 2024.

    Comments: Journal of Testing and Evaluation

  17. arXiv:2402.15185  [pdf, other]

    cs.IT eess.SP

    Pre-Chirp-Domain Index Modulation for Affine Frequency Division Multiplexing

    Authors: Guangyao Liu, Tianqi Mao, Ruiqi Liu, Zhenyu Xiao

    Abstract: Affine frequency division multiplexing (AFDM), tailored as a novel multicarrier technique utilizing chirp signals for high-mobility communications, exhibits marked advantages compared to traditional orthogonal frequency division multiplexing (OFDM). AFDM is based on the discrete affine Fourier transform (DAFT) with two modifiable parameters of the chirp signals, termed as the pre-chirp parameter a…

    Submitted 23 February, 2024; originally announced February 2024.

  18. arXiv:2402.12875  [pdf, other]

    cs.LG cs.CC stat.ML

    Chain of Thought Empowers Transformers to Solve Inherently Serial Problems

    Authors: Zhiyuan Li, Hong Liu, Denny Zhou, Tengyu Ma

    Abstract: Instructing the model to generate a sequence of intermediate steps, a.k.a., a chain of thought (CoT), is a highly effective method to improve the accuracy of large language models (LLMs) on arithmetic and symbolic reasoning tasks. However, the mechanism behind CoT remains unclear. This work provides a theoretical understanding of the power of CoT for decoder-only transformers through the lens of…

    Submitted 7 May, 2024; v1 submitted 20 February, 2024; originally announced February 2024.

    Comments: 38 pages. Accepted by ICLR 2024

  19. arXiv:2401.13307  [pdf, other]

    cs.CV

    ChatterBox: Multi-round Multimodal Referring and Grounding

    Authors: Yunjie Tian, Tianren Ma, Lingxi Xie, Jihao Qiu, Xi Tang, Yuan Zhang, Jianbin Jiao, Qi Tian, Qixiang Ye

    Abstract: In this study, we establish a baseline for a new task named multimodal multi-round referring and grounding (MRG), opening up a promising direction for instance-level multimodal dialogues. We present a new benchmark and an efficient vision-language model for this purpose. The new benchmark, named CB-300K, spans challenges including multi-round dialogue, complex spatial relationships among multiple…

    Submitted 24 January, 2024; originally announced January 2024.

    Comments: 17 pages, 6 tables, 9 figures. Code, data, and model are available at: https://github.com/sunsmarterjie/ChatterBox

  20. arXiv:2401.05011  [pdf, other]

    cs.CV

    Dual-Perspective Knowledge Enrichment for Semi-Supervised 3D Object Detection

    Authors: Yucheng Han, Na Zhao, Weiling Chen, Keng Teck Ma, Hanwang Zhang

    Abstract: Semi-supervised 3D object detection is a promising yet under-explored direction to reduce data annotation costs, especially for cluttered indoor scenes. A few prior works, such as SESS and 3DIoUMatch, attempt to solve this task by utilizing a teacher model to generate pseudo-labels for unlabeled samples. However, the availability of unlabeled samples in the 3D domain is relatively limited compared…

    Submitted 10 January, 2024; originally announced January 2024.

    Comments: Code is available at https://github.com/tingxueronghua/DPKE

  21. arXiv:2401.00283  [pdf, other]

    cs.IT eess.SP

    Near-Space Communications: the Last Piece of 6G Space-Air-Ground-Sea Integrated Network Puzzle

    Authors: Hongshan Liu, Tong Qin, Zhen Gao, Tianqi Mao, Keke Ying, Ziwei Wan, Li Qiao, Rui Na, Zhongxiang Li, Chun Hu, Yikun Mei, Tuan Li, Guanghui Wen, Lei Chen, Zhonghuai Wu, Ruiqi Liu, Gaojie Chen, Shuo Wang, Dezhi Zheng

    Abstract: This article presents a comprehensive study on the emerging near-space communications (NS-COM) within the context of space-air-ground-sea integrated network (SAGSIN). Specifically, we firstly explore the recent technical developments of NS-COM, followed by the discussions about motivations behind integrating NS-COM into SAGSIN. To further demonstrate the necessity of NS-COM, a comparative analysis…

    Submitted 4 March, 2024; v1 submitted 30 December, 2023; originally announced January 2024.

    Comments: 28 pages, 8 figures, 2 tables

  22. arXiv:2312.17670  [pdf, other]

    cs.CV cs.LG q-bio.QM q-bio.TO

    Benchmarking the CoW with the TopCoW Challenge: Topology-Aware Anatomical Segmentation of the Circle of Willis for CTA and MRA

    Authors: Kaiyuan Yang, Fabio Musio, Yihui Ma, Norman Juchler, Johannes C. Paetzold, Rami Al-Maskari, Luciano Höher, Hongwei Bran Li, Ibrahim Ethem Hamamci, Anjany Sekuboyina, Suprosanna Shit, Houjing Huang, Chinmay Prabhakar, Ezequiel de la Rosa, Diana Waldmannstetter, Florian Kofler, Fernando Navarro, Martin Menten, Ivan Ezhov, Daniel Rueckert, Iris Vos, Ynte Ruigrok, Birgitta Velthuis, Hugo Kuijf, Julien Hämmerli, et al. (59 additional authors not shown)

    Abstract: The Circle of Willis (CoW) is an important network of arteries connecting major circulations of the brain. Its vascular architecture is believed to affect the risk, severity, and clinical outcome of serious neuro-vascular diseases. However, characterizing the highly variable CoW anatomy is still a manual and time-consuming expert task. The CoW is usually imaged by two angiographic imaging modaliti…

    Submitted 29 April, 2024; v1 submitted 29 December, 2023; originally announced December 2023.

    Comments: 24 pages, 11 figures, 9 tables. Summary Paper for the MICCAI TopCoW 2023 Challenge

  23. arXiv:2312.16483  [pdf, ps, other]

    cs.LG cs.NE math.NA

    Expressivity and Approximation Properties of Deep Neural Networks with ReLU$^k$ Activation

    Authors: Juncai He, Tong Mao, Jinchao Xu

    Abstract: In this paper, we investigate the expressivity and approximation properties of deep neural networks employing the ReLU$^k$ activation function for $k \geq 2$. Although deep ReLU networks can approximate polynomials effectively, deep ReLU$^k$ networks have the capability to represent higher-degree polynomials precisely. Our initial contribution is a comprehensive, constructive proof for polynomial…

    Submitted 10 January, 2024; v1 submitted 27 December, 2023; originally announced December 2023.

  24. arXiv:2312.15746  [pdf, other]

    cs.IR cs.AI

    Large Language Models are Not Stable Recommender Systems

    Authors: Tianhui Ma, Yuan Cheng, Hengshu Zhu, Hui Xiong

    Abstract: With the significant successes of large language models (LLMs) in many natural language processing tasks, there is growing interest among researchers in exploring LLMs for novel recommender systems. However, we have observed that directly using LLMs as a recommender system is usually unstable due to its inherent position bias. To this end, we introduce exploratory research and find consistent patt…

    Submitted 25 December, 2023; originally announced December 2023.

  25. arXiv:2312.12784  [pdf, other]

    cs.LG

    Fast Cell Library Characterization for Design Technology Co-Optimization Based on Graph Neural Networks

    Authors: Tianliang Ma, Guangxi Fan, Zhihui Deng, Xuguang Sun, Kainlu Low, Leilai Shao

    Abstract: Design technology co-optimization (DTCO) plays a critical role in achieving optimal power, performance, and area (PPA) for advanced semiconductor process development. Cell library characterization is essential in DTCO flow, but traditional methods are time-consuming and costly. To overcome these challenges, we propose a graph neural network (GNN)-based machine learning model for rapid and accurate…

    Submitted 19 March, 2024; v1 submitted 20 December, 2023; originally announced December 2023.

  26. arXiv:2312.06682  [pdf, other]

    cs.AI cs.LG

    Learning to Denoise Unreliable Interactions for Link Prediction on Biomedical Knowledge Graph

    Authors: Tengfei Ma, Yujie Chen, Wen Tao, Dashun Zheng, Xuan Lin, Patrick Cheong-lao Pang, Yiping Liu, Yijun Wang, Bosheng Song, Xiangxiang Zeng

    Abstract: Link prediction in biomedical knowledge graphs (KGs) aims at predicting unknown interactions between entities, including drug-target interaction (DTI) and drug-drug interaction (DDI), which is critical for drug discovery and therapeutics. Previous methods prefer to utilize the rich semantic relations and topological structure of the KG to predict missing links, yielding promising outcomes. However…

    Submitted 9 December, 2023; originally announced December 2023.

  27. arXiv:2312.04810  [pdf, other]

    cs.CV

    RS-Corrector: Correcting the Racial Stereotypes in Latent Diffusion Models

    Authors: Yue Jiang, Yueming Lyu, Tianxiang Ma, Bo Peng, Jing Dong

    Abstract: Recent text-conditioned image generation models have demonstrated an exceptional capacity to produce diverse and creative imagery with high visual quality. However, when pre-trained on billion-sized datasets randomly collected from the Internet, where potential biased human preferences exist, these models tend to produce images with common and recurring stereotypes, particularly for certain racial…

    Submitted 20 December, 2023; v1 submitted 7 December, 2023; originally announced December 2023.

    Comments: 16 pages, 15 figures, conference

  28. arXiv:2312.04316  [pdf, other]

    cs.RO cs.AI cs.CV

    Towards Knowledge-driven Autonomous Driving

    Authors: Xin Li, Yeqi Bai, Pinlong Cai, Licheng Wen, Daocheng Fu, Bo Zhang, Xuemeng Yang, Xinyu Cai, Tao Ma, Jianfei Guo, Xing Gao, Min Dou, Yikang Li, Botian Shi, Yong Liu, Liang He, Yu Qiao

    Abstract: This paper explores the emerging knowledge-driven autonomous driving technologies. Our investigation highlights the limitations of current autonomous driving systems, in particular their sensitivity to data bias, difficulty in handling long-tail scenarios, and lack of interpretability. Conversely, knowledge-driven methods with the abilities of cognition, generalization and life-long learning emerg…

    Submitted 27 December, 2023; v1 submitted 7 December, 2023; originally announced December 2023.

  29. arXiv:2311.15030  [pdf, other]

    cs.RO

    Tuning-free Quasi-stiffness Control Framework of a Powered Transfemoral Prosthesis for Task-adaptive Walking

    Authors: Teng Ma, Shucong Yin, Zhimin Hou, Binxin Huang, Haoyong Yu, Chenglong Fu

    Abstract: Impedance-based control represents a prevalent strategy in the development of powered transfemoral prostheses. However, creating a task-adaptive, tuning-free controller that effectively generalizes across diverse locomotion modes and terrain conditions continues to be a significant challenge. This letter proposes a tuning-free and task-adaptive quasi-stiffness control framework for powered prosthe…

    Submitted 26 March, 2024; v1 submitted 25 November, 2023; originally announced November 2023.

    Comments: 8 pages, 10 figures. This work has been submitted to the IEEE-RAL for possible publication

  30. arXiv:2311.14333  [pdf, other]

    cs.LG

    Cycle Invariant Positional Encoding for Graph Representation Learning

    Authors: Zuoyu Yan, Tengfei Ma, Liangcai Gao, Zhi Tang, Chao Chen, Yusu Wang

    Abstract: Cycles are fundamental elements in graph-structured data and have demonstrated their effectiveness in enhancing graph learning models. To encode such information into a graph learning framework, prior works often extract a summary quantity, ranging from the number of cycles to the more sophisticated persistence diagram summaries. However, more detailed information, such as which edges are encoded…

    Submitted 30 November, 2023; v1 submitted 24 November, 2023; originally announced November 2023.

    Comments: Accepted as oral presentation in the Learning on Graphs Conference (LoG 2023)

  31. arXiv:2311.07277  [pdf, other]

    cs.SE cs.CL

    AdaCCD: Adaptive Semantic Contrasts Discovery Based Cross Lingual Adaptation for Code Clone Detection

    Authors: Yangkai Du, Tengfei Ma, Lingfei Wu, Xuhong Zhang, Shouling Ji

    Abstract: Code Clone Detection, which aims to retrieve functionally similar programs from large code bases, has been attracting increasing attention. Modern software often involves a diverse range of programming languages. However, current code clone detection methods are generally limited to only a few popular programming languages due to insufficient annotated data as well as their own model design constr…

    Submitted 6 March, 2024; v1 submitted 13 November, 2023; originally announced November 2023.

  32. arXiv:2311.05332  [pdf, other]

    cs.CV cs.AI cs.CL cs.RO

    On the Road with GPT-4V(ision): Early Explorations of Visual-Language Model on Autonomous Driving

    Authors: Licheng Wen, Xuemeng Yang, Daocheng Fu, Xiaofeng Wang, Pinlong Cai, Xin Li, Tao Ma, Yingxuan Li, Linran Xu, Dengke Shang, Zheng Zhu, Shaoyan Sun, Yeqi Bai, Xinyu Cai, Min Dou, Shuanglu Hu, Botian Shi, Yu Qiao

    Abstract: The pursuit of autonomous driving technology hinges on the sophisticated integration of perception, decision-making, and control systems. Traditional approaches, both data-driven and rule-based, have been hindered by their inability to grasp the nuance of complex driving environments and the intentions of other road users. This has been a significant bottleneck, particularly in the development of…

    Submitted 28 November, 2023; v1 submitted 9 November, 2023; originally announced November 2023.

  33. arXiv:2311.02316  [pdf, other]

    cs.LG cs.NE

    Self-Supervised Learning of Representations for Space Generates Multi-Modular Grid Cells

    Authors: Rylan Schaeffer, Mikail Khona, Tzuhsuan Ma, Cristóbal Eyzaguirre, Sanmi Koyejo, Ila Rani Fiete

    Abstract: To solve the spatial problems of mapping, localization and navigation, the mammalian lineage has developed striking spatial representations. One important spatial representation is the Nobel-prize winning grid cells: neurons that represent self-location, a local and aperiodic quantity, with seemingly bizarre non-local and spatially periodic activity patterns of a few discrete periods. Why has the…

    Submitted 3 November, 2023; originally announced November 2023.

  34. arXiv:2310.18087  [pdf, other]

    cs.CV

    A Chebyshev Confidence Guided Source-Free Domain Adaptation Framework for Medical Image Segmentation

    Authors: Jiesi Hu, Yanwu Yang, Xutao Guo, Jinghua Wang, Ting Ma

    Abstract: Source-free domain adaptation (SFDA) aims to adapt models trained on a labeled source domain to an unlabeled target domain without access to source data. In medical imaging scenarios, the practical significance of SFDA methods has been emphasized due to privacy concerns. Recent state-of-the-art SFDA methods primarily rely on self-training based on pseudo-labels (PLs). Unfortunately, PLs suffer…

    Submitted 27 October, 2023; originally announced October 2023.

  35. arXiv:2310.16853  [pdf, other]

    cs.PL cs.AI

    CP-BCS: Binary Code Summarization Guided by Control Flow Graph and Pseudo Code

    Authors: Tong Ye, Lingfei Wu, Tengfei Ma, Xuhong Zhang, Yangkai Du, Peiyu Liu, Shouling Ji, Wenhai Wang

    Abstract: Automatically generating function summaries for binaries is an extremely valuable but challenging task, since it involves translating the execution behavior and semantics of the low-level language (assembly code) into human-readable natural language. However, most current works on understanding assembly code are oriented towards generating function names, which involve numerous abbreviations that…

    Submitted 24 October, 2023; originally announced October 2023.

    Comments: EMNLP 2023 Main Conference

  36. arXiv:2310.12419  [pdf, other]

    cs.CR

    Toward Unbiased Multiple-Target Fuzzing with Path Diversity

    Authors: Huanyao Rong, Wei You, Xiaofeng Wang, Tianhao Mao

    Abstract: In this paper, we propose a novel directed fuzzing solution named AFLRun, which features a target path-diversity metric and unbiased energy assignment. Firstly, we develop a new coverage metric by maintaining an extra virgin map for each covered target to track the coverage status of seeds that hit the target. This approach enables the storage of waypoints into the corpus that hit a target through inte…

    Submitted 18 October, 2023; originally announced October 2023.

  37. arXiv:2310.09696  [pdf, other]

    cs.AI

    Progressive Evidence Refinement for Open-domain Multimodal Retrieval Question Answering

    Authors: Shuwen Yang, Anran Wu, Xingjiao Wu, Luwei Xiao, Tianlong Ma, Cheng Jin, Liang He

    Abstract: Pre-trained multimodal models have achieved significant success in retrieval-based question answering. However, current multimodal retrieval question-answering models face two main challenges. Firstly, utilizing compressed evidence features as input to the model results in the loss of fine-grained information within the evidence. Secondly, a gap exists between the feature extraction of evidence an…

    Submitted 14 October, 2023; originally announced October 2023.

  38. arXiv:2310.05627  [pdf, other]

    cs.CL cs.LG q-fin.ST

    Integrating Stock Features and Global Information via Large Language Models for Enhanced Stock Return Prediction

    Authors: Yujie Ding, Shuai Jia, Tianyi Ma, Bingcheng Mao, Xiuze Zhou, Liuliu Li, Dongming Han

    Abstract: The remarkable achievements and rapid advancements of Large Language Models (LLMs) such as ChatGPT and GPT-4 have showcased their immense potential in quantitative investment. Traders can effectively leverage these LLMs to analyze financial news and predict stock returns accurately. However, integrating LLMs into existing quantitative models presents two primary challenges: the insufficient utiliz…

    Submitted 9 October, 2023; originally announced October 2023.

    Comments: 8 pages, International Joint Conferences on Artificial Intelligence

    Journal ref: International Joint Conferences on Artificial Intelligence, 2023

  39. arXiv:2310.02594  [pdf, other]

    cs.CL

    I$^2$KD-SLU: An Intra-Inter Knowledge Distillation Framework for Zero-Shot Cross-Lingual Spoken Language Understanding

    Authors: Tianjun Mao, Chenghong Zhang

    Abstract: Spoken language understanding (SLU) typically includes two subtasks: intent detection and slot filling. Currently, it has achieved great success in high-resource languages, but it still remains challenging in low-resource languages due to the scarcity of labeled training data. Hence, there is a growing interest in zero-shot cross-lingual SLU. Despite the success of existing zero-shot cross-ling…

    Submitted 4 October, 2023; originally announced October 2023.

    Comments: 12 pages, 2 figures

  40. arXiv:2309.17036  [pdf, other]

    cs.RO cs.CV

    UniQuadric: A SLAM Backend for Unknown Rigid Object 3D Tracking and Light-Weight Modeling

    Authors: Linghao Yang, Yanmin Wu, Yu Deng, Rui Tian, Xinggang Hu, Tiefeng Ma

    Abstract: Tracking and modeling unknown rigid objects in the environment play a crucial role in autonomous unmanned systems and virtual-real interactive applications. However, many existing Simultaneous Localization, Mapping and Moving Object Tracking (SLAMMOT) methods focus solely on estimating specific object poses and lack estimation of object scales and are unable to effectively track unknown objects. I…

    Submitted 2 October, 2023; v1 submitted 29 September, 2023; originally announced September 2023.

  41. arXiv:2309.16292  [pdf, other]

    cs.RO cs.CL

    DiLu: A Knowledge-Driven Approach to Autonomous Driving with Large Language Models

    Authors: Licheng Wen, Daocheng Fu, Xin Li, Xinyu Cai, Tao Ma, Pinlong Cai, Min Dou, Botian Shi, Liang He, Yu Qiao

    Abstract: Recent advancements in autonomous driving have relied on data-driven approaches, which are widely adopted but face challenges including dataset bias, overfitting, and uninterpretability. Drawing inspiration from the knowledge-driven nature of human driving, we explore the question of how to instill similar capabilities into autonomous driving systems and summarize a paradigm that integrates an int…

    Submitted 21 February, 2024; v1 submitted 28 September, 2023; originally announced September 2023.

    Comments: Published as a conference paper at ICLR 2024

  42. arXiv:2309.09421  [pdf, other]

    cs.MM

    Unified Pretraining Target Based Video-music Retrieval With Music Rhythm And Video Optical Flow Information

    Authors: Tianjun Mao, Shansong Liu, Yunxuan Zhang, Dian Li, Ying Shan

    Abstract: Background music (BGM) can enhance the video's emotion. However, selecting an appropriate BGM often requires domain knowledge. This has led to the development of video-music retrieval techniques. Most existing approaches utilize pretrained video/music feature extractors trained with different target sets to obtain average video/music-level embeddings. The drawbacks are two-fold. One is that differ…

    Submitted 17 September, 2023; originally announced September 2023.

  43. arXiv:2309.07849  [pdf, other]

    cs.CV

    TFNet: Exploiting Temporal Cues for Fast and Accurate LiDAR Semantic Segmentation

    Authors: Rong Li, ShiJie Li, Xieyuanli Chen, Teli Ma, Juergen Gall, Junwei Liang

    Abstract: LiDAR semantic segmentation plays a crucial role in enabling autonomous driving and robots to understand their surroundings accurately and robustly. A multitude of methods exist within this domain, including point-based, range-image-based, polar-coordinate-based, and hybrid strategies. Among these, range-image-based techniques have gained widespread adoption in practical applications due to their…

    Submitted 14 April, 2024; v1 submitted 14 September, 2023; originally announced September 2023.

    Comments: Accepted by CVPR 2024 Workshop on Autonomous Driving

  44. arXiv:2309.06421  [pdf, other]

    eess.IV cs.CV

    AGMDT: Virtual Staining of Renal Histology Images with Adjacency-Guided Multi-Domain Transfer

    Authors: Tao Ma, Chao Zhang, Min Lu, Lin Luo

    Abstract: Renal pathology, as the gold standard of kidney disease diagnosis, requires doctors to analyze a series of tissue slices stained by H&E staining and special staining like Masson, PASM, and PAS, respectively. These special staining methods are costly, time-consuming, and hard to standardize for wide use, especially in primary hospitals. Advances in supervised learning methods have enabled the virtua…

    Submitted 17 September, 2023; v1 submitted 12 September, 2023; originally announced September 2023.

    Comments: BMVC 2023

  45. arXiv:2309.05683  [pdf, other]

    cs.LG cs.AI cs.RO

    EANet: Expert Attention Network for Online Trajectory Prediction

    Authors: Pengfei Yao, Tianlu Mao, Min Shi, Jingkai Sun, Zhaoqi Wang

    Abstract: Trajectory prediction plays a crucial role in autonomous driving. Existing mainstream research and continual learning-based methods all require training on complete datasets, leading to poor prediction accuracy when sudden changes in scenarios occur and failing to promptly respond and update the model. Whether these methods can make a prediction in real-time and use data instances to update the…

    Submitted 11 September, 2023; originally announced September 2023.

  46. Weakly Supervised Point Clouds Transformer for 3D Object Detection

    Authors: Zuojin Tang, Bo Sun, Tongwei Ma, Daosheng Li, Zhenhui Xu

    Abstract: The annotation of 3D datasets is required for semantic segmentation and object detection in scene understanding. In this paper we present a framework for the weak supervision of a point clouds transformer that is used for 3D object detection. The aim is to decrease the amount of supervision needed for training, given the high cost of annotating 3D datasets. We propose an Unsu…

    Submitted 7 September, 2023; originally announced September 2023.

    Comments: International Conference on Intelligent Transportation Systems (ITSC), 2022

    Report number: 3948-3955

    Journal ref: International Conference on Intelligent Transportation Systems (ITSC 2022)

  47. arXiv:2309.03548  [pdf, other]

    cs.CV

    Trash to Treasure: Low-Light Object Detection via Decomposition-and-Aggregation

    Authors: Xiaohan Cui, Long Ma, Tengyu Ma, Jinyuan Liu, Xin Fan, Risheng Liu

    Abstract: Object detection in low-light scenarios has attracted much attention in the past few years. A mainstream and representative scheme introduces enhancers as the pre-processing for regular detectors. However, because of the disparity in task objectives between the enhancer and detector, this paradigm cannot shine at its best ability. In this work, we try to arouse the potential of enhancer + detector…

    Submitted 7 September, 2023; originally announced September 2023.

  48. arXiv:2308.16781  [pdf, other]

    cs.AI cs.LG

    StratMed: Relevance Stratification between Biomedical Entities for Sparsity on Medication Recommendation

    Authors: Xiang Li, Shunpan Liang, Yulei Hou, Tengfei Ma

    Abstract: With the growing imbalance between limited medical resources and escalating demands, AI-based clinical tasks have become paramount. As a sub-domain, medication recommendation aims to amalgamate longitudinal patient history with medical knowledge, assisting physicians in prescribing safer and more accurate medication combinations. Existing works ignore the inherent long-tailed distribution of medic…

    Submitted 27 November, 2023; v1 submitted 31 August, 2023; originally announced August 2023.

  49. arXiv:2308.14329  [pdf, other]

    cs.RO cs.AI

    End-to-End Driving via Self-Supervised Imitation Learning Using Camera and LiDAR Data

    Authors: Jin Bok Park, Jinkyu Lee, Muhyun Back, Hyunmin Han, David T. Ma, Sang Min Won, Sung Soo Hwang, Il Yong Chun

    Abstract: In autonomous driving, the end-to-end (E2E) driving approach that predicts vehicle control signals directly from sensor data is rapidly gaining attention. To learn a safe E2E driving system, one needs an extensive amount of driving data and human intervention. Vehicle control data is constructed by many hours of human driving, and it is challenging to construct large vehicle control datasets. Ofte…

    Submitted 28 August, 2023; originally announced August 2023.

    Comments: 20 pages, 8 figures

  50. arXiv:2308.12549  [pdf, other]

    cs.CV cs.AI

    Synchronize Feature Extracting and Matching: A Single Branch Framework for 3D Object Tracking

    Authors: Teli Ma, Mengmeng Wang, Jimin Xiao, Huifeng Wu, Yong Liu

    Abstract: The Siamese network has been a de facto benchmark framework for 3D LiDAR object tracking with a shared-parametric encoder extracting features from template and search region, respectively. This paradigm relies heavily on an additional matching network to model the cross-correlation/similarity of the template and search region. In this paper, we forsake the conventional Siamese paradigm and propose a n…

    Submitted 24 August, 2023; originally announced August 2023.

    Comments: ICCV 2023