Skip to main content

Showing 1–50 of 105 results for author: An, S

Searching in archive cs. Search in all archives.
.
  1. 3SHNet: Boosting Image-Sentence Retrieval via Visual Semantic-Spatial Self-Highlighting

    Authors: Xuri Ge, Songpei Xu, Fuhai Chen, Jie Wang, Guoxin Wang, Shan An, Joemon M. Jose

    Abstract: In this paper, we propose a novel visual Semantic-Spatial Self-Highlighting Network (termed 3SHNet) for high-precision, high-efficiency and high-generalization image-sentence retrieval. 3SHNet highlights the salient identification of prominent objects and their spatial locations within the visual modality, thus allowing the integration of visual semantics-spatial interactions and maintaining indep… ▽ More

    Submitted 26 April, 2024; originally announced April 2024.

    Comments: Accepted Information Processing and Management (IP&M), 10 pages, 9 figures and 8 tables

    Journal ref: Information Processing & Management, Volume 61, Issue 4, July 2024, 103716

  2. arXiv:2404.16898  [pdf, other

    cs.LG cs.AI

    How to Parameterize Asymmetric Quantization Ranges for Quantization-Aware Training

    Authors: Jaeseong You, Minseop Park, Kyunggeun Lee, Seokjun An, Chirag Patel, Markus Nage

    Abstract: This paper investigates three different parameterizations of asymmetric uniform quantization for quantization-aware training: (1) scale and offset, (2) minimum and maximum, and (3) beta and gamma. We perform a comprehensive comparative analysis of these parameterizations' influence on quantization-aware training, using both controlled experiments and real-world large language models. Our particula… ▽ More

    Submitted 25 April, 2024; originally announced April 2024.

  3. arXiv:2404.16811  [pdf, other

    cs.CL cs.AI

    Make Your LLM Fully Utilize the Context

    Authors: Shengnan An, Zexiong Ma, Zeqi Lin, Nanning Zheng, Jian-Guang Lou

    Abstract: While many contemporary large language models (LLMs) can process lengthy input, they still struggle to fully utilize information within the long context, known as the lost-in-the-middle challenge. We hypothesize that it stems from insufficient explicit supervision during the long-context training, which fails to emphasize that any position in a long context can hold crucial information. Based on t… ▽ More

    Submitted 26 April, 2024; v1 submitted 25 April, 2024; originally announced April 2024.

    Comments: 19 pages, 7 figures, 3 tables, 9 examples

  4. arXiv:2404.05680  [pdf, other

    cs.CV

    SphereHead: Stable 3D Full-head Synthesis with Spherical Tri-plane Representation

    Authors: Heyuan Li, Ce Chen, Tianhao Shi, Yuda Qiu, Sizhe An, Guanying Chen, Xiaoguang Han

    Abstract: While recent advances in 3D-aware Generative Adversarial Networks (GANs) have aided the development of near-frontal view human face synthesis, the challenge of comprehensively synthesizing a full 3D head viewable from all angles still persists. Although PanoHead proves the possibilities of using a large-scale dataset with images of both frontal and back views for full-head synthesis, it often caus… ▽ More

    Submitted 8 April, 2024; originally announced April 2024.

    Comments: project page: https://lhyfst.github.io/spherehead

  5. arXiv:2404.01954  [pdf, other

    cs.CL cs.AI

    HyperCLOVA X Technical Report

    Authors: Kang Min Yoo, Jaegeun Han, Sookyo In, Heewon Jeon, Jisu Jeong, Jaewook Kang, Hyunwook Kim, Kyung-Min Kim, Munhyong Kim, Sungju Kim, Donghyun Kwak, Hanock Kwak, Se Jung Kwon, Bado Lee, Dongsoo Lee, Gichang Lee, Jooho Lee, Baeseong Park, Seongjin Shin, Joonsang Yu, Seolki Baek, Sumin Byeon, Eungsup Cho, Dooseok Choe, Jeesung Han , et al. (371 additional authors not shown)

    Abstract: We introduce HyperCLOVA X, a family of large language models (LLMs) tailored to the Korean language and culture, along with competitive capabilities in English, math, and coding. HyperCLOVA X was trained on a balanced mix of Korean, English, and code data, followed by instruction-tuning with high-quality human-annotated datasets while abiding by strict safety guidelines reflecting our commitment t… ▽ More

    Submitted 13 April, 2024; v1 submitted 2 April, 2024; originally announced April 2024.

    Comments: 44 pages; updated authors list and fixed author names

  6. arXiv:2403.17188  [pdf, other

    cs.CV cs.CR

    LOTUS: Evasive and Resilient Backdoor Attacks through Sub-Partitioning

    Authors: Siyuan Cheng, Guanhong Tao, Yingqi Liu, Guangyu Shen, Shengwei An, Shiwei Feng, Xiangzhe Xu, Kaiyuan Zhang, Shiqing Ma, Xiangyu Zhang

    Abstract: Backdoor attack poses a significant security threat to Deep Learning applications. Existing attacks are often not evasive to established backdoor detection techniques. This susceptibility primarily stems from the fact that these attacks typically leverage a universal trigger pattern or transformation function, such that the trigger can cause misclassification for any input. In response to this, re… ▽ More

    Submitted 25 March, 2024; originally announced March 2024.

    Comments: IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR 2024)

  7. arXiv:2403.11426  [pdf, other

    cs.DS cs.CG

    ETH-Tight Algorithm for Cycle Packing on Unit Disk Graphs

    Authors: Shinwoo An, Eunjin Oh

    Abstract: In this paper, we consider the Cycle Packing problem on unit disk graphs defined as follows. Given a unit disk graph G with n vertices and an integer k, the goal is to find a set of $k$ vertex-disjoint cycles of G if it exists. Our algorithm runs in time $2^{O(\sqrt k)}n^{O(1)}$. This improves the $2^{O(\sqrt k\log k)}n^{O(1)}$-time algorithm by Fomin et al. [SODA 2012, ICALP 2017]. Moreover, our… ▽ More

    Submitted 17 March, 2024; originally announced March 2024.

    Comments: In SoCG'24

  8. arXiv:2402.19431  [pdf, other

    cs.SE cs.AI cs.CL

    Compositional API Recommendation for Library-Oriented Code Generation

    Authors: Zexiong Ma, Shengnan An, Bing Xie, Zeqi Lin

    Abstract: Large language models (LLMs) have achieved exceptional performance in code generation. However, the performance remains unsatisfactory in generating library-oriented code, especially for the libraries not present in the training data of LLMs. Previous work utilizes API recommendation technology to help LLMs use libraries: it retrieves APIs related to the user requirements, then leverages them as c… ▽ More

    Submitted 29 February, 2024; originally announced February 2024.

    Journal ref: 32nd IEEE/ACM International Conference on Program Comprehension (ICPC 2024), Apr 2024, Lisboa, Portugal

  9. arXiv:2402.11811  [pdf, other

    cs.CL

    FIPO: Free-form Instruction-oriented Prompt Optimization with Preference Dataset and Modular Fine-tuning Schema

    Authors: Junru Lu, Siyu An, Min Zhang, Yulan He, Di Yin, Xing Sun

    Abstract: In the quest to facilitate the deep intelligence of Large Language Models (LLMs) accessible in final-end user-bot interactions, the art of prompt crafting emerges as a critical yet complex task for the average user. Contrast to previous model-oriented yet instruction-agnostic Automatic Prompt Optimization methodologies, yielding polished results for predefined target models while suffering rapid d… ▽ More

    Submitted 18 February, 2024; originally announced February 2024.

  10. arXiv:2402.05467  [pdf, other

    cs.AI cs.CL cs.CR

    Rapid Optimization for Jailbreaking LLMs via Subconscious Exploitation and Echopraxia

    Authors: Guangyu Shen, Siyuan Cheng, Kaiyuan Zhang, Guanhong Tao, Shengwei An, Lu Yan, Zhuo Zhang, Shiqing Ma, Xiangyu Zhang

    Abstract: Large Language Models (LLMs) have become prevalent across diverse sectors, transforming human life with their extraordinary reasoning and comprehension abilities. As they find increased use in sensitive tasks, safety concerns have gained widespread attention. Extensive efforts have been dedicated to aligning LLMs with human moral principles to ensure their safe deployment. Despite their potential,… ▽ More

    Submitted 8 February, 2024; originally announced February 2024.

  11. arXiv:2401.07571  [pdf, other

    cs.CV

    A Bi-Pyramid Multimodal Fusion Method for the Diagnosis of Bipolar Disorders

    Authors: Guoxin Wang, Sheng Shi, Shan An, Fengmei Fan, Wenshu Ge, Qi Wang, Feng Yu, Zhiren Wang

    Abstract: Previous research on the diagnosis of Bipolar disorder has mainly focused on resting-state functional magnetic resonance imaging. However, their accuracy can not meet the requirements of clinical diagnosis. Efficient multimodal fusion strategies have great potential for applications in multimodal data and can further improve the performance of medical diagnosis models. In this work, we utilize bot… ▽ More

    Submitted 15 January, 2024; originally announced January 2024.

    Comments: Accepted by IEEE ICASSP 2024

  12. arXiv:2401.03850  [pdf, other

    eess.AS cs.SD

    Inverse Nonlinearity Compensation of Hyperelastic Deformation in Dielectric Elastomer for Acoustic Actuation

    Authors: Jin Woo Lee, Gwang Seok An, Jeong-Yun Sun, Kyogu Lee

    Abstract: This paper delves into the analysis of nonlinear deformation induced by dielectric actuation in pre-stressed ideal dielectric elastomers. It formulates a nonlinear ordinary differential equation governing this deformation based on the hyperelastic model under dielectric stress. Through numerical integration and neural network approximations, the relationship between voltage and stretch is establis… ▽ More

    Submitted 8 January, 2024; originally announced January 2024.

  13. arXiv:2401.00496  [pdf, other

    cs.CV cs.AI cs.LG

    SAR-RARP50: Segmentation of surgical instrumentation and Action Recognition on Robot-Assisted Radical Prostatectomy Challenge

    Authors: Dimitrios Psychogyios, Emanuele Colleoni, Beatrice Van Amsterdam, Chih-Yang Li, Shu-Yu Huang, Yuchong Li, Fucang Jia, Baosheng Zou, Guotai Wang, Yang Liu, Maxence Boels, Jiayu Huo, Rachel Sparks, Prokar Dasgupta, Alejandro Granados, Sebastien Ourselin, Mengya Xu, An Wang, Yanan Wu, Long Bai, Hongliang Ren, Atsushi Yamada, Yuriko Harai, Yuto Ishikawa, Kazuyuki Hayashi , et al. (25 additional authors not shown)

    Abstract: Surgical tool segmentation and action recognition are fundamental building blocks in many computer-assisted intervention applications, ranging from surgical skills assessment to decision support systems. Nowadays, learning-based action recognition and segmentation approaches outperform classical methods, relying, however, on large, annotated datasets. Furthermore, action recognition and tool segme… ▽ More

    Submitted 23 January, 2024; v1 submitted 31 December, 2023; originally announced January 2024.

  14. arXiv:2312.14492  [pdf, other

    cs.CV

    Context Enhanced Transformer for Single Image Object Detection

    Authors: Seungjun An, Seonghoon Park, Gyeongnyeon Kim, Jeongyeol Baek, Byeongwon Lee, Seungryong Kim

    Abstract: With the increasing importance of video data in real-world applications, there is a rising need for efficient object detection methods that utilize temporal information. While existing video object detection (VOD) techniques employ various strategies to address this challenge, they typically depend on locally adjacent frames or randomly sampled images within a clip. Although recent Transformer-bas… ▽ More

    Submitted 26 December, 2023; v1 submitted 22 December, 2023; originally announced December 2023.

    Comments: Project page: https://ku-cvlab.github.io/CETR

  15. arXiv:2312.13783  [pdf, other

    cs.CV cs.AI cs.LG

    Few Shot Part Segmentation Reveals Compositional Logic for Industrial Anomaly Detection

    Authors: Soopil Kim, Sion An, Philip Chikontwe, Myeongkyun Kang, Ehsan Adeli, Kilian M. Pohl, Sang Hyun Park

    Abstract: Logical anomalies (LA) refer to data violating underlying logical constraints e.g., the quantity, arrangement, or composition of components within an image. Detecting accurately such anomalies requires models to reason about various component types through segmentation. However, curation of pixel-level annotations for semantic segmentation is both time-consuming and expensive. Although there are s… ▽ More

    Submitted 15 April, 2024; v1 submitted 21 December, 2023; originally announced December 2023.

    Comments: Accepted in AAAI2024

  16. arXiv:2312.06871  [pdf, other

    cs.AI cs.LG cs.MA

    Using Analytics on Student Created Data to Content Validate Pedagogical Tools

    Authors: John Kos, Kenneth Eaton, Sareen Zhang, Rahul Dass, Stephen Buckley, Sungeun An, Ashok Goel

    Abstract: Conceptual and simulation models can function as useful pedagogical tools, however it is important to categorize different outcomes when evaluating them in order to more meaningfully interpret results. VERA is a ecology-based conceptual modeling software that enables users to simulate interactions between biotics and abiotics in an ecosystem, allowing users to form and then verify hypothesis throu… ▽ More

    Submitted 11 December, 2023; originally announced December 2023.

    Comments: 16 pages, preprint

  17. arXiv:2312.03307  [pdf, other

    stat.ML cs.LG

    Balanced Marginal and Joint Distributional Learning via Mixture Cramer-Wold Distance

    Authors: Seunghwan An, Sungchul Hong, Jong-June Jeon

    Abstract: In the process of training a generative model, it becomes essential to measure the discrepancy between two high-dimensional probability distributions: the generative distribution and the ground-truth distribution of the observed dataset. Recently, there has been growing interest in an approach that involves slicing high-dimensional distributions, with the Cramer-Wold distance emerging as a promisi… ▽ More

    Submitted 6 December, 2023; originally announced December 2023.

  18. arXiv:2312.01729  [pdf, other

    cs.LG

    EdgeConvFormer: Dynamic Graph CNN and Transformer based Anomaly Detection in Multivariate Time Series

    Authors: Jie Liu, Qilin Li, Senjian An, Bradley Ezard, Ling Li

    Abstract: Transformer-based models for anomaly detection in multivariate time series can benefit from the self-attention mechanism due to its advantage in modeling long-term dependencies. However, Transformer-based anomaly detection models have problems such as a large amount of data being required for training, standard positional encoding is not suitable for multivariate time series data, and the interdep… ▽ More

    Submitted 4 December, 2023; originally announced December 2023.

  19. arXiv:2312.00050  [pdf, other

    cs.CR cs.AI cs.LG

    Elijah: Eliminating Backdoors Injected in Diffusion Models via Distribution Shift

    Authors: Shengwei An, Sheng-Yen Chou, Kaiyuan Zhang, Qiuling Xu, Guanhong Tao, Guangyu Shen, Siyuan Cheng, Shiqing Ma, Pin-Yu Chen, Tsung-Yi Ho, Xiangyu Zhang

    Abstract: Diffusion models (DM) have become state-of-the-art generative models because of their capability to generate high-quality images from noises without adversarial training. However, they are vulnerable to backdoor attacks as reported by recent studies. When a data input (e.g., some Gaussian noise) is stamped with a trigger (e.g., a white patch), the backdoored model always generates the target image… ▽ More

    Submitted 4 February, 2024; v1 submitted 27 November, 2023; originally announced December 2023.

    Comments: AAAI 2024

  20. arXiv:2311.03665  [pdf, other

    cs.CG cs.DS

    Faster Algorithms for Cycle Hitting Problems on Disk Graphs

    Authors: Shinwoo An, Kyungjin Cho, Eunjin Oh

    Abstract: In this paper, we consider three hitting problems on a disk intersection graph: Triangle Hitting Set, Feedback Vertex Set, and Odd Cycle Transversal. Given a disk intersection graph $G$, our goal is to compute a set of vertices hitting all triangles, all cycles, or all odd cycles, respectively. Our algorithms run in time $2^{\tilde O(k^{4/5})}n^{O(1)}$, $2^{\tilde O(k^{9/10})}n^{O(1)}$, and… ▽ More

    Submitted 6 November, 2023; originally announced November 2023.

    Comments: WADS 2023

  21. arXiv:2310.20689  [pdf, other

    cs.CL cs.AI

    Learning From Mistakes Makes LLM Better Reasoner

    Authors: Shengnan An, Zexiong Ma, Zeqi Lin, Nanning Zheng, Jian-Guang Lou, Weizhu Chen

    Abstract: Large language models (LLMs) recently exhibited remarkable reasoning capabilities on solving math problems. To further improve their reasoning capabilities, this work explores whether LLMs can LEarn from MistAkes (LEMA), akin to the human learning process. Consider a human student who failed to solve a math problem, he will learn from what mistake he has made and how to correct it. Mimicking this… ▽ More

    Submitted 29 March, 2024; v1 submitted 31 October, 2023; originally announced October 2023.

    Comments: 23 pages, 13 figures, 6 tables

  22. arXiv:2310.20187  [pdf, other

    cs.LG cs.AI

    Self-Supervised Pre-Training for Precipitation Post-Processor

    Authors: Sojung An, Junha Lee, Jiyeon Jang, Inchae Na, Wooyeon Park, Sujeong You

    Abstract: Obtaining a sufficient forecast lead time for local precipitation is essential in preventing hazardous weather events. Global warming-induced climate change increases the challenge of accurately predicting severe precipitation events, such as heavy rainfall. In this paper, we propose a deep learning-based precipitation post-processor for numerical weather prediction (NWP) models. The precipitation… ▽ More

    Submitted 19 February, 2024; v1 submitted 31 October, 2023; originally announced October 2023.

    Comments: 7 pages, 3 figures, 1 table, accepted to NeurIPS 2023 Workshop on Tackling Climate Change with Machine Learning at [this http URL](https://www.climatechange.ai/papers/neurips2023/18)

  23. arXiv:2310.16374  [pdf, other

    cs.LG stat.ML

    Joint Distributional Learning via Cramer-Wold Distance

    Authors: Seunghwan An, Jong-June Jeon

    Abstract: The assumption of conditional independence among observed variables, primarily used in the Variational Autoencoder (VAE) decoder modeling, has limitations when dealing with high-dimensional datasets or complex correlation structures among observed variables. To address this issue, we introduced the Cramer-Wold distance regularization, which can be computed in a closed-form, to facilitate joint dis… ▽ More

    Submitted 25 October, 2023; originally announced October 2023.

  24. arXiv:2310.15179  [pdf, other

    physics.ao-ph cs.AI cs.LG math.DS stat.OT

    Reducing Uncertainty in Sea-level Rise Prediction: A Spatial-variability-aware Approach

    Authors: Subhankar Ghosh, Shuai An, Arun Sharma, Jayant Gupta, Shashi Shekhar, Aneesh Subramanian

    Abstract: Given multi-model ensemble climate projections, the goal is to accurately and reliably predict future sea-level rise while lowering the uncertainty. This problem is important because sea-level rise affects millions of people in coastal communities and beyond due to climate change's impacts on polar ice sheets and the ocean. This problem is challenging due to spatial variability and unknowns such a… ▽ More

    Submitted 18 October, 2023; originally announced October 2023.

    Comments: 6 pages, 5 figures, I-GUIDE 2023 conference

    ACM Class: J.2; I.2.m; I.2.6; I.2.1; I.2

  25. arXiv:2310.11650  [pdf, other

    cs.IR cs.CV cs.MM

    VKIE: The Application of Key Information Extraction on Video Text

    Authors: Siyu An, Ye Liu, Haoyuan Peng, Di Yin

    Abstract: Extracting structured information from videos is critical for numerous downstream applications in the industry. In this paper, we define a significant task of extracting hierarchical key information from visual texts on videos. To fulfill this task, we decouple it into four subtasks and introduce two implementation solutions called PipVKIE and UniVKIE. PipVKIE sequentially completes the four subta… ▽ More

    Submitted 9 January, 2024; v1 submitted 17 October, 2023; originally announced October 2023.

  26. arXiv:2310.02690  [pdf, other

    eess.IV cs.CV

    Multi-Dimension-Embedding-Aware Modality Fusion Transformer for Psychiatric Disorder Clasification

    Authors: Guoxin Wang, Xuyang Cao, Shan An, Fengmei Fan, Chao Zhang, Jinsong Wang, Feng Yu, Zhiren Wang

    Abstract: Deep learning approaches, together with neuroimaging techniques, play an important role in psychiatric disorders classification. Previous studies on psychiatric disorders diagnosis mainly focus on using functional connectivity matrices of resting-state functional magnetic resonance imaging (rs-fMRI) as input, which still needs to fully utilize the rich temporal information of the time series of rs… ▽ More

    Submitted 4 October, 2023; originally announced October 2023.

  27. arXiv:2309.13450  [pdf

    cs.SE

    Conducting A/B Experiments with a Scalable Architecture

    Authors: Andrew Hornback, Sungeun An, Scott Bunin, Stephen Buckley, John Kos, Ashok Goel

    Abstract: A/B experiments are commonly used in research to compare the effects of changing one or more variables in two different experimental groups - a control group and a treatment group. While the benefits of using A/B experiments are widely known and accepted, there is less agreement on a principled approach to creating software infrastructure systems to assist in rapidly conducting such experiments. W… ▽ More

    Submitted 23 September, 2023; originally announced September 2023.

  28. arXiv:2309.05590  [pdf, other

    cs.CV cs.AI cs.MM

    Temporal Action Localization with Enhanced Instant Discriminability

    Authors: Dingfeng Shi, Qiong Cao, Yujie Zhong, Shan An, Jian Cheng, Haogang Zhu, Dacheng Tao

    Abstract: Temporal action detection (TAD) aims to detect all action boundaries and their corresponding categories in an untrimmed video. The unclear boundaries of actions in videos often result in imprecise predictions of action boundaries by existing methods. To resolve this issue, we propose a one-stage framework named TriDet. First, we propose a Trident-head to model the action boundary via an estimated… ▽ More

    Submitted 11 September, 2023; originally announced September 2023.

    Comments: An extended version of the CVPR paper arXiv:2303.07347, submitted to IJCV

  29. arXiv:2308.11912  [pdf, other

    cs.LG cs.CY

    Addressing Selection Bias in Computerized Adaptive Testing: A User-Wise Aggregate Influence Function Approach

    Authors: Soonwoo Kwon, Sojung Kim, Seunghyun Lee, Jin-Young Kim, Suyeong An, Kyuseok Kim

    Abstract: Computerized Adaptive Testing (CAT) is a widely used, efficient test mode that adapts to the examinee's proficiency level in the test domain. CAT requires pre-trained item profiles, for CAT iteratively assesses the student real-time based on the registered items' profiles, and selects the next item to administer using candidate items' profiles. However, obtaining such item profiles is a costly pro… ▽ More

    Submitted 23 August, 2023; originally announced August 2023.

    Comments: CIKM 2023

  30. arXiv:2308.08239  [pdf, other

    cs.CL

    MemoChat: Tuning LLMs to Use Memos for Consistent Long-Range Open-Domain Conversation

    Authors: Junru Lu, Siyu An, Mingbao Lin, Gabriele Pergola, Yulan He, Di Yin, Xing Sun, Yunsheng Wu

    Abstract: We propose MemoChat, a pipeline for refining instructions that enables large language models (LLMs) to effectively employ self-composed memos for maintaining consistent long-range open-domain conversations. We demonstrate a long-range open-domain conversation through iterative "memorization-retrieval-response" cycles. This requires us to carefully design tailored tuning instructions for each disti… ▽ More

    Submitted 22 August, 2023; v1 submitted 16 August, 2023; originally announced August 2023.

  31. arXiv:2307.06123  [pdf, other

    cs.CR cs.LG

    SoK: Comparing Different Membership Inference Attacks with a Comprehensive Benchmark

    Authors: Jun Niu, Xiaoyan Zhu, Moxuan Zeng, Ge Zhang, Qingyang Zhao, Chunhui Huang, Yangming Zhang, Suyu An, Yangzhong Wang, Xinghui Yue, Zhipeng He, Weihao Guo, Kuo Shen, Peng Liu, Yulong Shen, Xiaohong Jiang, Jianfeng Ma, Yuqing Zhang

    Abstract: Membership inference (MI) attacks threaten user privacy through determining if a given data example has been used to train a target model. However, it has been increasingly recognized that the "comparing different MI attacks" methodology used in the existing works has serious limitations. Due to these limitations, we found (through the experiments in this work) that some comparison results reporte… ▽ More

    Submitted 12 July, 2023; originally announced July 2023.

    Comments: 21 pages,15 figures

  32. arXiv:2305.16948  [pdf, other

    cs.LG cs.AI

    Meta-prediction Model for Distillation-Aware NAS on Unseen Datasets

    Authors: Hayeon Lee, Sohyun An, Minseon Kim, Sung Ju Hwang

    Abstract: Distillation-aware Neural Architecture Search (DaNAS) aims to search for an optimal student architecture that obtains the best performance and/or efficiency when distilling the knowledge from a given teacher model. Previous DaNAS methods have mostly tackled the search for the neural architecture for fixed datasets and the teacher, which are not generalized well on a new task consisting of an unsee… ▽ More

    Submitted 26 May, 2023; originally announced May 2023.

    Comments: ICLR 2023 (Notable-top-25%)

  33. arXiv:2305.16943  [pdf, other

    cs.LG

    DiffusionNAG: Predictor-guided Neural Architecture Generation with Diffusion Models

    Authors: Sohyun An, Hayeon Lee, Jaehyeong Jo, Seanie Lee, Sung Ju Hwang

    Abstract: Existing NAS methods suffer from either an excessive amount of time for repetitive sampling and training of many task-irrelevant architectures. To tackle such limitations of existing NAS methods, we propose a paradigm shift from NAS to a novel conditional Neural Architecture Generation (NAG) framework based on diffusion models, dubbed DiffusionNAG. Specifically, we consider the neural architecture… ▽ More

    Submitted 24 March, 2024; v1 submitted 26 May, 2023; originally announced May 2023.

    Comments: Accepted to ICLR 2024

  34. arXiv:2305.14210  [pdf, other

    cs.CL cs.AI

    Skill-Based Few-Shot Selection for In-Context Learning

    Authors: Shengnan An, Bo Zhou, Zeqi Lin, Qiang Fu, Bei Chen, Nanning Zheng, Weizhu Chen, Jian-Guang Lou

    Abstract: In-context learning is the paradigm that adapts large language models to downstream tasks by providing a few examples. Few-shot selection -- selecting appropriate examples for each test instance separately -- is important for in-context learning. In this paper, we propose Skill-KNN, a skill-based few-shot selection method for in-context learning. The key advantages of Skill-KNN include: (1) it add… ▽ More

    Submitted 10 October, 2023; v1 submitted 23 May, 2023; originally announced May 2023.

    Comments: Accepted by EMNLP 2023 main conference

  35. arXiv:2305.04835  [pdf, other

    cs.CL cs.AI

    How Do In-Context Examples Affect Compositional Generalization?

    Authors: Shengnan An, Zeqi Lin, Qiang Fu, Bei Chen, Nanning Zheng, Jian-Guang Lou, Dongmei Zhang

    Abstract: Compositional generalization--understanding unseen combinations of seen primitives--is an essential reasoning capability in human intelligence. The AI community mainly studies this capability by fine-tuning neural networks on lots of training samples, while it is still unclear whether and how in-context learning--the prevailing few-shot paradigm based on large language models--exhibits composition… ▽ More

    Submitted 8 June, 2023; v1 submitted 8 May, 2023; originally announced May 2023.

    Comments: ACL 2023 main conference, long paper

  36. arXiv:2305.01903  [pdf, other

    eess.SY cs.RO math.OC

    Input-Output Feedback Linearization Preserving Task Priority for Multivariate Nonlinear Systems Having Singular Input Gain Matrix

    Authors: Sang-ik An, Dongheui Lee, Gyunghoon Park

    Abstract: We propose an extension of the input-output feedback linearization for a class of multivariate systems that are not input-output linearizable in a classical manner. The key observation is that the usual input-output linearization problem can be interpreted as the problem of solving simultaneous linear equations associated with the input gain matrix: thus, even at points where the input gain matrix… ▽ More

    Submitted 4 May, 2023; v1 submitted 3 May, 2023; originally announced May 2023.

    Comments: This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible

  37. arXiv:2303.14587  [pdf, other

    cs.CV

    PAniC-3D: Stylized Single-view 3D Reconstruction from Portraits of Anime Characters

    Authors: Shuhong Chen, Kevin Zhang, Yichun Shi, Heng Wang, Yiheng Zhu, Guoxian Song, Sizhe An, Janus Kristjansson, Xiao Yang, Matthias Zwicker

    Abstract: We propose PAniC-3D, a system to reconstruct stylized 3D character heads directly from illustrated (p)ortraits of (ani)me (c)haracters. Our anime-style domain poses unique challenges to single-view reconstruction; compared to natural images of human heads, character portrait illustrations have hair and accessories with more complex and diverse geometry, and are shaded with non-photorealistic conto… ▽ More

    Submitted 25 March, 2023; originally announced March 2023.

    Comments: CVPR 2023, code release: https://github.com/ShuhongChen/panic3d-anime-reconstruction

  38. arXiv:2303.13071  [pdf, other

    cs.CV

    PanoHead: Geometry-Aware 3D Full-Head Synthesis in 360$^{\circ}$

    Authors: Sizhe An, Hongyi Xu, Yichun Shi, Guoxian Song, Umit Ogras, Linjie Luo

    Abstract: Synthesis and reconstruction of 3D human head has gained increasing interests in computer vision and computer graphics recently. Existing state-of-the-art 3D generative adversarial networks (GANs) for 3D human head synthesis are either limited to near-frontal views or hard to preserve 3D consistency in large view angles. We propose PanoHead, the first 3D-aware generative model that enables high-qu… ▽ More

    Submitted 23 March, 2023; originally announced March 2023.

    Comments: CVPR 2023. Project Page:https://sizhean.github.io/panohead

  39. Tracker Meets Night: A Transformer Enhancer for UAV Tracking

    Authors: Junjie Ye, Changhong Fu, Ziang Cao, Shan An, Guangze Zheng, Bowen Li

    Abstract: Most previous progress in object tracking is realized in daytime scenes with favorable illumination. State-of-the-arts can hardly carry on their superiority at night so far, thereby considerably blocking the broadening of visual tracking-related unmanned aerial vehicle (UAV) applications. To realize reliable UAV tracking at night, a spatial-channel Transformer-based low-light enhancer (namely SCT)… ▽ More

    Submitted 20 March, 2023; originally announced March 2023.

    Journal ref: IEEE Robotics and Automation Letters, 2022, vol. 7 No. 2

  40. arXiv:2302.11978  [pdf, other

    cs.LG cs.CL

    Does Deep Learning Learn to Abstract? A Systematic Probing Framework

    Authors: Shengnan An, Zeqi Lin, Bei Chen, Qiang Fu, Nanning Zheng, Jian-Guang Lou

    Abstract: Abstraction is a desirable capability for deep learning models, which means to induce abstract concepts from concrete instances and flexibly apply them beyond the learning context. At the same time, there is a lack of clear understanding about both the presence and further characteristics of this capability in deep learning models. In this paper, we introduce a systematic probing framework to expl… ▽ More

    Submitted 23 February, 2023; originally announced February 2023.

    Comments: ICLR 2023

  41. arXiv:2302.11737  [pdf, other

    stat.ML cs.LG

    Causally Disentangled Generative Variational AutoEncoder

    Authors: Seunghwan An, Kyungwoo Song, Jong-June Jeon

    Abstract: We present a new supervised learning technique for the Variational AutoEncoder (VAE) that allows it to learn a causally disentangled representation and generate causally disentangled outcomes simultaneously. We call this approach Causally Disentangled Generation (CDG). CDG is a generative model that accurately decodes an output based on a causally disentangled representation. Our research demonstr… ▽ More

    Submitted 8 October, 2023; v1 submitted 22 February, 2023; originally announced February 2023.

  42. arXiv:2302.11294  [pdf, other

    stat.ML cs.LG

    Distributional Learning of Variational AutoEncoder: Application to Synthetic Data Generation

    Authors: Seunghwan An, Jong-June Jeon

    Abstract: The Gaussianity assumption has been consistently criticized as a main limitation of the Variational Autoencoder (VAE) despite its efficiency in computational modeling. In this paper, we propose a new approach that expands the model capacity (i.e., expressive power of distributional family) without sacrificing the computational advantages of the VAE framework. Our VAE model's decoder is composed of… ▽ More

    Submitted 27 October, 2023; v1 submitted 22 February, 2023; originally announced February 2023.

  43. arXiv:2301.06241  [pdf, other

    cs.CR cs.LG

    BEAGLE: Forensics of Deep Learning Backdoor Attack for Better Defense

    Authors: Siyuan Cheng, Guanhong Tao, Yingqi Liu, Shengwei An, Xiangzhe Xu, Shiwei Feng, Guangyu Shen, Kaiyuan Zhang, Qiuling Xu, Shiqing Ma, Xiangyu Zhang

    Abstract: Deep Learning backdoor attacks have a threat model similar to traditional cyber attacks. Attack forensics, a critical counter-measure for traditional cyber attacks, is hence of importance for defending model backdoor attacks. In this paper, we propose a novel model backdoor forensics technique. Given a few attack samples such as inputs with backdoor triggers, which may represent different types of… ▽ More

    Submitted 15 January, 2023; originally announced January 2023.

  44. arXiv:2211.15929  [pdf, other

    cs.CR cs.LG

    Backdoor Vulnerabilities in Normally Trained Deep Learning Models

    Authors: Guanhong Tao, Zhenting Wang, Siyuan Cheng, Shiqing Ma, Shengwei An, Yingqi Liu, Guangyu Shen, Zhuo Zhang, Yunshu Mao, Xiangyu Zhang

    Abstract: We conduct a systematic study of backdoor vulnerabilities in normally trained Deep Learning models. They are as dangerous as backdoors injected by data poisoning because both can be equally exploited. We leverage 20 different types of injected backdoor attacks in the literature as the guidance and study their correspondences in normally trained models, which we call natural backdoor vulnerabilitie… ▽ More

    Submitted 28 November, 2022; originally announced November 2022.

  45. arXiv:2211.04698  [pdf, other

    cs.CL

    Unsupervised Extractive Summarization with Heterogeneous Graph Embeddings for Chinese Document

    Authors: Chen Lin, Ye Liu, Siyu An, Di Yin

    Abstract: In the scenario of unsupervised extractive summarization, learning high-quality sentence representations is essential to select salient sentences from the input document. Previous studies focus more on employing statistical approaches or pre-trained language models (PLMs) to extract sentence embeddings, while ignoring the rich information inherent in the heterogeneous types of interaction between… ▽ More

    Submitted 9 November, 2022; originally announced November 2022.

  46. arXiv:2210.12873  [pdf, other

    cs.CR cs.AI cs.LG

    FLIP: A Provable Defense Framework for Backdoor Mitigation in Federated Learning

    Authors: Kaiyuan Zhang, Guanhong Tao, Qiuling Xu, Siyuan Cheng, Shengwei An, Yingqi Liu, Shiwei Feng, Guangyu Shen, Pin-Yu Chen, Shiqing Ma, Xiangyu Zhang

    Abstract: Federated Learning (FL) is a distributed learning paradigm that enables different parties to train a model together for high quality and strong privacy protection. In this scenario, individual participants may get compromised and perform backdoor attacks by poisoning the data (or gradients). Existing work on robust aggregation and certified FL robustness does not study how hardening benign clients… ▽ More

    Submitted 27 February, 2023; v1 submitted 23 October, 2022; originally announced October 2022.

    Comments: Accepted by ICLR 2023. Code is available at https://github.com/KaiyuanZh/FLIP

  47. arXiv:2210.08394  [pdf, other

    cs.CV

    mRI: Multi-modal 3D Human Pose Estimation Dataset using mmWave, RGB-D, and Inertial Sensors

    Authors: Sizhe An, Yin Li, Umit Ogras

    Abstract: The ability to estimate 3D human body pose and movement, also known as human pose estimation (HPE), enables many applications for home-based health monitoring, such as remote rehabilitation training. Several possible solutions have emerged using sensors ranging from RGB cameras, depth sensors, millimeter-Wave (mmWave) radars, and wearable inertial sensors. Despite previous efforts on datasets and… ▽ More

    Submitted 15 October, 2022; originally announced October 2022.

    Comments: Thirty-sixth Conference on Neural Information Processing Systems (NeurIPS 2022). Project page: https://sizhean.github.io/mri

  48. arXiv:2209.02579  [pdf, other

    cs.HC

    Contextualizing Large-Scale Domain Knowledge for Conceptual Modeling and Simulation

    Authors: Sungeun An, Spencer Rugaber, Jennifer Hammock, Ashok K. Goel

    Abstract: We present an interactive modeling tool, VERA, that scaffolds the acquisition of domain knowledge involved in conceptual modeling and agent-based simulations. We describe the knowledge engineering process of contextualizing large-scale domain knowledge. Specifically, we use the ontology of biotic interactions in Global Biotic Interactions, and the trait data of species in Encyclopedia of Life to f… ▽ More

    Submitted 6 September, 2022; originally announced September 2022.

  49. arXiv:2209.02576  [pdf, other

    cs.HC

    Cognitive Assistance for Inquiry-Based Modeling

    Authors: Sungeun An, Robert Bates, Spencer Rugaber, Jennifer Hammock, Emily Weigel, Ashok K. Goel

    Abstract: Inquiry-based modeling is essential to scientific practice. However, modeling is difficult for novice scientists in part due to limited domain-specific knowledge and quantitative skills. VERA is an interactive tool that helps users construct conceptual models of ecological phenomena, run them as simulations, and examine their predictions. VERA provides cognitive scaffolding for modeling by supplyi… ▽ More

    Submitted 6 September, 2022; originally announced September 2022.

  50. arXiv:2208.06592  [pdf, other

    cs.CR cs.CV

    Confidence Matters: Inspecting Backdoors in Deep Neural Networks via Distribution Transfer

    Authors: Tong Wang, Yuan Yao, Feng Xu, Miao Xu, Shengwei An, Ting Wang

    Abstract: Backdoor attacks have been shown to be a serious security threat against deep learning models, and detecting whether a given model has been backdoored becomes a crucial task. Existing defenses are mainly built upon the observation that the backdoor trigger is usually of small size or affects the activation of only a few neurons. However, the above observations are violated in many cases especially… ▽ More

    Submitted 13 August, 2022; originally announced August 2022.