Showing 1–50 of 108 results for author: Woo, J

  1. arXiv:2405.06424  [pdf, other]

    cs.CL cs.AI cs.LG

    Improving Instruction Following in Language Models through Proxy-Based Uncertainty Estimation

    Authors: JoonHo Lee, Jae Oh Woo, Juree Seok, Parisa Hassanzadeh, Wooseok Jang, JuYoun Son, Sima Didari, Baruch Gutow, Heng Hao, Hankyu Moon, Wenjun Hu, Yeong-Dae Kwon, Taehee Lee, Seungjai Min

    Abstract: Assessing response quality to instructions in language models is vital but challenging due to the complexity of human language across different contexts. This complexity often results in ambiguous or inconsistent interpretations, making accurate assessment difficult. To address this issue, we propose a novel Uncertainty-aware Reward Model (URM) that introduces a robust uncertainty estimation for t…

    Submitted 19 May, 2024; v1 submitted 10 May, 2024; originally announced May 2024.

    Comments: Accepted to ICML 2024

  2. arXiv:2405.05107  [pdf, other]

    cs.ET cs.AR eess.SY

    Leveraging AES Padding: dBs for Nothing and FEC for Free in IoT Systems

    Authors: Jongchan Woo, Vipindev Adat Vasudevan, Benjamin D. Kim, Rafael G. L. D'Oliveira, Alejandro Cohen, Thomas Stahlbuhk, Ken R. Duffy, Muriel Médard

    Abstract: The Internet of Things (IoT) represents a significant advancement in digital technology, with its rapidly growing network of interconnected devices. This expansion, however, brings forth critical challenges in data security and reliability, especially under the threat of increasing cyber vulnerabilities. Addressing the security concerns, the Advanced Encryption Standard (AES) is commonly employed…

    Submitted 8 May, 2024; originally announced May 2024.

  3. arXiv:2405.04752  [pdf, other]

    eess.AS cs.SD

    HILCodec: High Fidelity and Lightweight Neural Audio Codec

    Authors: Sunghwan Ahn, Beom Jun Woo, Min Hyun Han, Chanyeong Moon, Nam Soo Kim

    Abstract: The recent advancement of end-to-end neural audio codecs enables compressing audio at very low bitrates while reconstructing the output audio with high fidelity. Nonetheless, such improvements often come at the cost of increased model complexity. In this paper, we identify and address the problems of existing neural audio codecs. We show that the performance of Wave-U-Net does not increase consist…

    Submitted 7 May, 2024; originally announced May 2024.

  4. arXiv:2404.07217  [pdf, other]

    eess.SP cs.AI cs.CV cs.LG

    Attention-aware Semantic Communications for Collaborative Inference

    Authors: Jiwoong Im, Nayoung Kwon, Taewoo Park, Jiheon Woo, Jaeho Lee, Yongjune Kim

    Abstract: We propose a communication-efficient collaborative inference framework in the domain of edge inference, focusing on the efficient use of vision transformer (ViT) models. The partitioning strategy of conventional collaborative inference fails to reduce communication cost because of the inherent architecture of ViTs maintaining consistent layer dimensions across the entire transformer encoder. Ther…

    Submitted 23 February, 2024; originally announced April 2024.

  5. arXiv:2403.04981  [pdf, other]

    cs.ET

    Paving the Way for Pass Disturb Free Vertical NAND Storage via A Dedicated and String-Compatible Pass Gate

    Authors: Zijian Zhao, Sola Woo, Khandker Akif Aabrar, Sharadindu Gopal Kirtania, Zhouhang Jiang, Shan Deng, Yi Xiao, Halid Mulaosmanovic, Stefan Duenkel, Dominik Kleimaier, Steven Soss, Sven Beyer, Rajiv Joshi, Scott Meninger, Mohamed Mohamed, Kijoon Kim, Jongho Woo, Suhwan Lim, Kwangsoo Kim, Wanki Kim, Daewon Ha, Vijaykrishnan Narayanan, Suman Datta, Shimeng Yu, Kai Ni

    Abstract: In this work, we propose a dual-port cell design to address the pass disturb in vertical NAND storage, which can pass signals through a dedicated and string-compatible pass gate. We demonstrate that: i) the pass disturb-free feature originates from weakening of the depolarization field by the pass bias at the high-${V}_{TH}$ (HVT) state and the screening of the applied field by channel at the low-…

    Submitted 7 March, 2024; originally announced March 2024.

    Comments: 29 pages, 7 figures

  6. arXiv:2402.18775   

    cs.RO eess.SY

    How to Evaluate Human-likeness of Interaction-aware Driver Models

    Authors: Jemin Woo, Changsun Ahn

    Abstract: This study proposes a method for qualitatively evaluating and designing human-like driver models for autonomous vehicles. While most existing research on human-likeness has been focused on quantitative evaluation, it is crucial to consider qualitative measures to accurately capture human perception. To this end, we conducted surveys utilizing both video study and human experience-based study. The…

    Submitted 3 March, 2024; v1 submitted 28 February, 2024; originally announced February 2024.

    Comments: This paper could benefit from further refinement to enhance the significance of its results

  7. arXiv:2402.06984  [pdf, other]

    cs.SD cs.CV cs.MM eess.AS eess.IV

    Speech motion anomaly detection via cross-modal translation of 4D motion fields from tagged MRI

    Authors: Xiaofeng Liu, Fangxu Xing, Jiachen Zhuo, Maureen Stone, Jerry L. Prince, Georges El Fakhri, Jonghye Woo

    Abstract: Understanding the relationship between tongue motion patterns during speech and their resulting speech acoustic outcomes -- i.e., articulatory-acoustic relation -- is of great importance in assessing speech quality and developing innovative treatment and rehabilitative strategies. This is especially important when evaluating and detecting abnormal articulatory features in patients with speech-rela…

    Submitted 10 February, 2024; originally announced February 2024.

    Comments: SPIE Medical Imaging 2024: Image Processing

  8. arXiv:2402.06982  [pdf, other]

    cs.CV cs.AI physics.med-ph

    Treatment-wise Glioblastoma Survival Inference with Multi-parametric Preoperative MRI

    Authors: Xiaofeng Liu, Nadya Shusharina, Helen A Shih, C. -C. Jay Kuo, Georges El Fakhri, Jonghye Woo

    Abstract: In this work, we aim to predict the survival time (ST) of glioblastoma (GBM) patients undergoing different treatments based on preoperative magnetic resonance (MR) scans. The personalized and precise treatment planning can be achieved by comparing the ST of different treatments. It is well established that both the current status of the patient (as represented by the MR scans) and the choice of tr…

    Submitted 10 February, 2024; originally announced February 2024.

    Comments: SPIE Medical Imaging 2024: Computer-Aided Diagnosis

  9. arXiv:2402.05876  [pdf, other]

    cs.LG cs.MA stat.ML

    Federated Offline Reinforcement Learning: Collaborative Single-Policy Coverage Suffices

    Authors: Jiin Woo, Laixi Shi, Gauri Joshi, Yuejie Chi

    Abstract: Offline reinforcement learning (RL), which seeks to learn an optimal policy using offline data, has garnered significant interest due to its potential in critical applications where online data collection is infeasible or expensive. This work explores the benefit of federated learning for offline RL, aiming at collaboratively leveraging offline datasets at multiple agents. Focusing on finite-horiz…

    Submitted 8 February, 2024; originally announced February 2024.

  10. arXiv:2402.00375  [pdf, other]

    eess.IV cs.CV

    Disentangled Multimodal Brain MR Image Translation via Transformer-based Modality Infuser

    Authors: Jihoon Cho, Xiaofeng Liu, Fangxu Xing, Jinsong Ouyang, Georges El Fakhri, Jinah Park, Jonghye Woo

    Abstract: Multimodal Magnetic Resonance (MR) Imaging plays a crucial role in disease diagnosis due to its ability to provide complementary information by analyzing a relationship between multimodal images on the same subject. Acquiring all MR modalities, however, can be expensive, and, during a scanning session, certain MR images may be missed depending on the study protocol. The typical solution would be t…

    Submitted 1 February, 2024; originally announced February 2024.

    Comments: 6 pages

  11. arXiv:2401.17571  [pdf, other]

    eess.IV cs.CV

    Is Registering Raw Tagged-MR Enough for Strain Estimation in the Era of Deep Learning?

    Authors: Zhangxing Bian, Ahmed Alshareef, Shuwen Wei, Junyu Chen, Yuli Wang, Jonghye Woo, Dzung L. Pham, Jiachen Zhuo, Aaron Carass, Jerry L. Prince

    Abstract: Magnetic Resonance Imaging with tagging (tMRI) has long been utilized for quantifying tissue motion and strain during deformation. However, a phenomenon known as tag fading, a gradual decrease in tag visibility over time, often complicates post-processing. The first contribution of this study is to model tag fading by considering the interplay between $T_1$ relaxation and the repeated application…

    Submitted 30 January, 2024; originally announced January 2024.

    Comments: Accepted to SPIE Medical Imaging 2024 (oral)

  12. arXiv:2312.12098  [pdf, other]

    cs.CV

    Domain Generalization in LiDAR Semantic Segmentation Leveraged by Density Discriminative Feature Embedding

    Authors: Jaeyeul Kim, Jungwan Woo, Jeonghoon Kim, Sunghoon Im

    Abstract: While significant progress has been achieved in LiDAR-based perception, domain generalization continues to present challenges, often resulting in reduced performance when encountering unfamiliar datasets due to domain discrepancies. One of the primary hurdles stems from the variability of LiDAR sensors, leading to inconsistencies in point cloud density distribution. Such inconsistencies can underm…

    Submitted 19 December, 2023; originally announced December 2023.

    Comments: under review

  13. arXiv:2311.04747  [pdf, other]

    cs.HC

    Exchanging... Watch out!

    Authors: Liu Yang, Jieyeon Woo, Catherine Achard, Catherine Pelachaud

    Abstract: During a conversation, individuals take turns speaking and engage in exchanges, which can occur smoothly or involve interruptions. Listeners have various ways of participating, such as displaying backchannels, signalling the aim to take a turn, waiting for the speaker to yield the floor, or even interrupting and taking over the conversation. These exchanges are commonplace in natural interaction…

    Submitted 8 November, 2023; originally announced November 2023.

  14. arXiv:2310.15850  [pdf, other]

    physics.med-ph cs.AI eess.SP

    Posterior Estimation for Dynamic PET imaging using Conditional Variational Inference

    Authors: Xiaofeng Liu, Thibault Marin, Tiss Amal, Jonghye Woo, Georges El Fakhri, Jinsong Ouyang

    Abstract: This work aims to efficiently estimate the posterior distribution of kinetic parameters for dynamic positron emission tomography (PET) imaging given a measurement of the time-activity curve. Considering the inherent information loss from parametric imaging to measurement space with the forward kinetic model, the inverse mapping is ambiguous. The conventional (but expensive) solution can be the Marko…

    Submitted 24 October, 2023; originally announced October 2023.

    Comments: Published on IEEE NSS&MIC

  15. arXiv:2310.09229  [pdf]

    cs.LG cs.DC

    Insuring Smiles: Predicting routine dental coverage using Spark ML

    Authors: Aishwarya Gupta, Rahul S. Bhogale, Priyanka Thota, Prathushkumar Dathuri, Jongwook Woo

    Abstract: Finding suitable health insurance coverage can be challenging for individuals and small enterprises in the USA. The Health Insurance Exchange Public Use Files (Exchange PUFs) dataset provided by CMS offers valuable information on health and dental policies [1]. In this paper, we leverage machine learning algorithms to predict if a health insurance plan covers routine dental services for adults. By…

    Submitted 13 October, 2023; originally announced October 2023.

    Comments: 4 pages, 13 figures, 5 tables

  16. arXiv:2310.07787  [pdf]

    cs.LG cs.DC

    Using Spark Machine Learning Models to Perform Predictive Analysis on Flight Ticket Pricing Data

    Authors: Philip Wong, Phue Thant, Pratiksha Yadav, Ruta Antaliya, Jongwook Woo

    Abstract: This paper discusses predictive performance and processes undertaken on flight pricing data utilizing r2 (r-squared) and RMSE that leverages a large dataset, originally from Expedia.com, consisting of approximately 20 million records or 4.68 gigabytes. The project aims to determine the best models usable in the real world to predict airline ticket fares for non-stop flights across the US. Therefore,…

    Submitted 11 October, 2023; originally announced October 2023.

    Comments: 4 pages, 13 figures, 1 table

  17. arXiv:2310.06076  [pdf]

    cs.DC

    CFPB Consumer Complaints Analysis Using Hadoop

    Authors: Dhwani Vaishnav, Manimozhi Neethinayagam, Akanksha S Khaire, Mansi Vivekanand Dhoke, Jongwook Woo

    Abstract: Consumer complaints are a crucial source of information for companies, policymakers, and consumers alike. They provide insight into the problems faced by consumers and help identify areas for improvement in products, services, and regulatory frameworks. This paper aims to analyze the Consumer Complaints Dataset provided by the Consumer Financial Protection Bureau (CFPB) and provide insights into the natur…

    Submitted 9 October, 2023; originally announced October 2023.

    Comments: 4 pages, 7 figures, 2 tables

  18. arXiv:2310.03200  [pdf]

    cs.IR cs.DC

    Amazon Books Rating prediction & Recommendation Model

    Authors: Hsiu-Ping Lin, Suman Chauhan, Yougender Chauhan, Nagender Chauhan, Jongwook Woo

    Abstract: This paper uses an Amazon dataset to predict the ratings of books listed on the Amazon website. As part of this project, we predicted the ratings of the books, and also built a recommendation cluster. This recommendation cluster provides the recommended books based on column values from the dataset, for instance, category, description, author, price, reviews etc. This paper provides a flow of handl…

    Submitted 4 October, 2023; originally announced October 2023.

    Comments: 5 pages, 4 figures, 8 tables

  19. arXiv:2309.14586  [pdf, other]

    cs.SD cs.AI cs.CV eess.AS eess.SP

    Speech Audio Synthesis from Tagged MRI and Non-Negative Matrix Factorization via Plastic Transformer

    Authors: Xiaofeng Liu, Fangxu Xing, Maureen Stone, Jiachen Zhuo, Sidney Fels, Jerry L. Prince, Georges El Fakhri, Jonghye Woo

    Abstract: The tongue's intricate 3D structure, comprising localized functional units, plays a crucial role in the production of speech. When measured using tagged MRI, these functional units exhibit cohesive displacements and derived quantities that facilitate the complex process of speech production. Non-negative matrix factorization-based approaches have been shown to estimate the functional units through…

    Submitted 25 September, 2023; originally announced September 2023.

    Comments: MICCAI 2023 (Oral presentation)

  20. arXiv:2309.08836  [pdf, other]

    cs.CL cs.AI cs.CY

    Bias and Fairness in Chatbots: An Overview

    Authors: Jintang Xue, Yun-Cheng Wang, Chengwei Wei, Xiaofeng Liu, Jonghye Woo, C. -C. Jay Kuo

    Abstract: Chatbots have been studied for more than half a century. With the rapid development of natural language processing (NLP) technologies in recent years, chatbots using large language models (LLMs) have received much attention. Compared with traditional ones, modern chatbots are more powerful and have been used in real-world applications. There are, however, bias and fairness concerns in mode…

    Submitted 10 December, 2023; v1 submitted 15 September, 2023; originally announced September 2023.

  21. arXiv:2309.08019  [pdf, other]

    cs.CR cs.IT cs.LG

    CRYPTO-MINE: Cryptanalysis via Mutual Information Neural Estimation

    Authors: Benjamin D. Kim, Vipindev Adat Vasudevan, Jongchan Woo, Alejandro Cohen, Rafael G. L. D'Oliveira, Thomas Stahlbuhk, Muriel Médard

    Abstract: The use of Mutual Information (MI) as a measure to evaluate the efficiency of cryptosystems has an extensive history. However, estimating MI between unknown random variables in a high-dimensional space is challenging. Recent advances in machine learning have enabled progress in estimating MI using neural networks. This work presents a novel application of MI estimation in the field of cryptography…

    Submitted 18 September, 2023; v1 submitted 14 September, 2023; originally announced September 2023.

  22. arXiv:2308.12646  [pdf, other]

    cs.HC cs.GR cs.LG

    The GENEA Challenge 2023: A large scale evaluation of gesture generation models in monadic and dyadic settings

    Authors: Taras Kucherenko, Rajmund Nagy, Youngwoo Yoon, Jieyeon Woo, Teodor Nikolov, Mihail Tsakov, Gustav Eje Henter

    Abstract: This paper reports on the GENEA Challenge 2023, in which participating teams built speech-driven gesture-generation systems using the same speech and motion dataset, followed by a joint evaluation. This year's challenge provided data on both sides of a dyadic interaction, allowing teams to generate full-body motion for an agent given its speech (text and audio) and the speech and motion of the int…

    Submitted 24 August, 2023; originally announced August 2023.

    Comments: The first three authors made equal contributions. Accepted for publication at the ACM International Conference on Multimodal Interaction (ICMI)

    ACM Class: I.3; I.2

  23. arXiv:2308.05063  [pdf, other]

    cs.CR cs.AR cs.IT eess.SY

    CERMET: Coding for Energy Reduction with Multiple Encryption Techniques -- $It's\ easy\ being\ green$

    Authors: Jongchan Woo, Vipindev Adat Vasudevan, Benjamin Kim, Alejandro Cohen, Rafael G. L. D'Oliveira, Thomas Stahlbuhk, Muriel Médard

    Abstract: This paper presents CERMET, an energy-efficient hardware architecture designed for hardware-constrained cryptosystems. CERMET employs a base cryptosystem in conjunction with network coding to provide both information-theoretic and computational security while reducing energy consumption per bit. This paper introduces the hardware architecture for the system and explores various optimizations to en…

    Submitted 9 August, 2023; originally announced August 2023.

  24. arXiv:2308.02949  [pdf, other]

    eess.IV cs.CV physics.med-ph

    MomentaMorph: Unsupervised Spatial-Temporal Registration with Momenta, Shooting, and Correction

    Authors: Zhangxing Bian, Shuwen Wei, Yihao Liu, Junyu Chen, Jiachen Zhuo, Fangxu Xing, Jonghye Woo, Aaron Carass, Jerry L. Prince

    Abstract: Tagged magnetic resonance imaging (tMRI) has been employed for decades to measure the motion of tissue undergoing deformation. However, registration-based motion estimation from tMRI is difficult due to the periodic patterns in these images, particularly when the motion is large. With larger motion, the registration approach gets trapped in a local optimum, leading to motion estimation errors. We…

    Submitted 5 August, 2023; originally announced August 2023.

    Comments: Accepted by MICCAI Workshop 2023: Time-Series Data Analytics and Learning (MTSAIL)

  25. arXiv:2307.13699  [pdf]

    cs.HC cs.AI cs.CL

    EFL Students' Attitudes and Contradictions in a Machine-in-the-loop Activity System

    Authors: David James Woo, Hengky Susanto, Kai Guo

    Abstract: This study applies Activity Theory and investigates the attitudes and contradictions of 67 English as a foreign language (EFL) students from four Hong Kong secondary schools towards machine-in-the-loop writing, where artificial intelligence (AI) suggests ideas during composition. Students answered an open-ended question about their feelings on writing with AI. Results revealed mostly positive atti…

    Submitted 13 July, 2023; originally announced July 2023.

    Comments: 38 pages, 4 figures

  26. arXiv:2307.10062  [pdf, other]

    cs.CV cs.LG

    Unsupervised Accuracy Estimation of Deep Visual Models using Domain-Adaptive Adversarial Perturbation without Source Samples

    Authors: JoonHo Lee, Jae Oh Woo, Hankyu Moon, Kwonho Lee

    Abstract: Deploying deep visual models can lead to performance drops due to the discrepancies between source and target distributions. Several approaches leverage labeled source data to estimate target domain accuracy, but accessing labeled source data is often prohibitively difficult due to data confidentiality or resource limitations on serving devices. Our work proposes a new framework to estimate model…

    Submitted 19 July, 2023; originally announced July 2023.

    Comments: Accepted to ICCV 2023

  27. Cases of EFL Secondary Students' Prompt Engineering Pathways to Complete a Writing Task with ChatGPT

    Authors: David James Woo, Kai Guo, Hengky Susanto

    Abstract: ChatGPT is a state-of-the-art (SOTA) chatbot. Although it has potential to support English as a foreign language (EFL) students' writing, to effectively collaborate with it, a student must learn to engineer prompts, that is, the skill of crafting appropriate instructions so that ChatGPT produces desired outputs. However, writing an appropriate prompt for ChatGPT is not straightforward for non-tech…

    Submitted 19 June, 2023; originally announced July 2023.

    Comments: 41 pages, 6 figures

  28. arXiv:2306.01888  [pdf]

    cs.CY cs.DC

    Consumer's Behavior Analysis of Electric Vehicle using Cloud Computing in the State of New York

    Authors: Jairo Juarez, Wendy Flores, Zhenfei Lu, Mako Hattori, Melissa Hernandez, Safir Larios-Ramirez, Jongwook Woo

    Abstract: Sales of Electric Vehicles (EVs) in the United States have grown fast in the past decade. We analyze the Electric Vehicle Drive Clean Rebate data from the New York State Energy Research and Development Authority (NYSERDA) to understand consumer behavior in EV purchasing and their potential environmental impact. Based on completed rebate applications since 2017, this dataset features the make and m…

    Submitted 2 June, 2023; originally announced June 2023.

    Comments: 4 pages, 6 figures

  29. arXiv:2306.01798  [pdf]

    cs.CY cs.AI

    Exploring EFL students' prompt engineering in human-AI story writing: an Activity Theory perspective

    Authors: David James Woo, Kai Guo, Hengky Susanto

    Abstract: This study applies Activity Theory to investigate how English as a foreign language (EFL) students prompt generative artificial intelligence (AI) tools during short story writing. Sixty-seven Hong Kong secondary school students created generative-AI tools using open-source language models and wrote short stories with them. The study collected and analyzed the students' generative-AI tools, short s…

    Submitted 10 February, 2024; v1 submitted 1 June, 2023; originally announced June 2023.

    Comments: 44 pages, 9 figures

  30. arXiv:2305.19404  [pdf, other]

    cs.CV cs.AI cs.LG physics.med-ph

    Incremental Learning for Heterogeneous Structure Segmentation in Brain Tumor MRI

    Authors: Xiaofeng Liu, Helen A. Shih, Fangxu Xing, Emiliano Santarnecchi, Georges El Fakhri, Jonghye Woo

    Abstract: Deep learning (DL) models for segmenting various anatomical structures have achieved great success via a static DL model that is trained in a single source domain. Yet, the static DL model is likely to perform poorly in a continually evolving environment, requiring appropriate model updates. In an incremental learning setting, we would expect that well-trained static models are updated, following…

    Submitted 30 May, 2023; originally announced May 2023.

    Comments: Early Accept to MICCAI 2023

  31. arXiv:2305.14589  [pdf, other]

    eess.IV cs.CV cs.LG physics.med-ph

    Attentive Continuous Generative Self-training for Unsupervised Domain Adaptive Medical Image Translation

    Authors: Xiaofeng Liu, Jerry L. Prince, Fangxu Xing, Jiachen Zhuo, Reese Timothy, Maureen Stone, Georges El Fakhri, Jonghye Woo

    Abstract: Self-training is an important class of unsupervised domain adaptation (UDA) approaches that are used to mitigate the problem of domain shift, when applying knowledge learned from a labeled source domain to unlabeled and heterogeneous target domains. While self-training-based UDA has shown considerable promise on discriminative tasks, including classification and segmentation, through reliable pseu…

    Submitted 23 May, 2023; originally announced May 2023.

    Comments: Accepted to Medical Image Analysis

  32. arXiv:2305.11310  [pdf, other]

    cs.HC cs.LG cs.SD eess.AS

    AMII: Adaptive Multimodal Inter-personal and Intra-personal Model for Adapted Behavior Synthesis

    Authors: Jieyeon Woo, Mireille Fares, Catherine Pelachaud, Catherine Achard

    Abstract: Socially Interactive Agents (SIAs) are physical or virtual embodied agents that display behavior similar to human multimodal behavior. Modeling SIAs' non-verbal behavior, such as speech and facial gestures, has always been a challenging task, given that a SIA can take the role of a speaker or a listener. A SIA must emit appropriate behavior adapted to its own speech, its previous behaviors (intra-…

    Submitted 18 May, 2023; originally announced May 2023.

    Comments: 8 pages, 1 figure

    MSC Class: 68T07 ACM Class: I.2.11

  33. arXiv:2305.10697  [pdf, other]

    cs.LG stat.ML

    The Blessing of Heterogeneity in Federated Q-Learning: Linear Speedup and Beyond

    Authors: Jiin Woo, Gauri Joshi, Yuejie Chi

    Abstract: When the data used for reinforcement learning (RL) are collected by multiple agents in a distributed manner, federated versions of RL algorithms allow collaborative learning without the need for agents to share their local data. In this paper, we consider federated Q-learning, which aims to learn an optimal Q-function by periodically aggregating local Q-estimates trained on local data alone. Focus…

    Submitted 12 December, 2023; v1 submitted 18 May, 2023; originally announced May 2023.

    Comments: Short version at ICML 2023

  34. Bitcoin Double-Spending Attack Detection using Graph Neural Network

    Authors: Changhoon Kang, Jongsoo Woo, James Won-Ki Hong

    Abstract: Bitcoin transactions include unspent transaction outputs (UTXOs) as their inputs and generate one or more newly owned UTXOs at specified addresses. Each UTXO can only be used as an input in a transaction once, and using it in two or more different transactions is referred to as a double-spending attack. Ultimately, due to the characteristics of the Bitcoin protocol, double-spending is impossible.…

    Submitted 26 April, 2023; originally announced April 2023.

    Comments: 3 pages, 1 table, Accepted as poster at IEEE ICBC 2023

  35. arXiv:2304.12233  [pdf, other]

    physics.chem-ph cs.AI cs.LG

    Diffusion-based Generative AI for Exploring Transition States from 2D Molecular Graphs

    Authors: Seonghwan Kim, Jeheon Woo, Woo Youn Kim

    Abstract: The exploration of transition state (TS) geometries is crucial for elucidating chemical reaction mechanisms and modeling their kinetics. Recently, machine learning (ML) models have shown remarkable performance for prediction of TS geometries. However, they require 3D conformations of reactants and products often with their appropriate orientations as input, which demands substantial efforts and co…

    Submitted 12 October, 2023; v1 submitted 20 April, 2023; originally announced April 2023.

  36. arXiv:2304.11276  [pdf]

    cs.CL

    The Role of AI in Human-AI Creative Writing for Hong Kong Secondary Students

    Authors: Hengky Susanto, David James Woo, Kai Guo

    Abstract: The recent advancement in Natural Language Processing (NLP) capability has led to the development of language models (e.g., ChatGPT) that are capable of generating human-like language. In this study, we explore how language models can be utilized to help the ideation aspect of creative writing. Our empirical findings show that language models play different roles in helping student writers to be mo…

    Submitted 21 April, 2023; originally announced April 2023.

    Journal ref: International Council of Teachers of English (ICTE) Newsletter (Spring 2023)

  37. arXiv:2304.03724  [pdf, other]

    physics.chem-ph cs.AI cs.LG

    GeoTMI: Predicting quantum chemical property with easy-to-obtain geometry via positional denoising

    Authors: Hyeonsu Kim, Jeheon Woo, Seonghwan Kim, Seokhyun Moon, Jun Hyeong Kim, Woo Youn Kim

    Abstract: As quantum chemical properties have a dependence on their geometries, graph neural networks (GNNs) using 3D geometric information have achieved high prediction accuracy in many tasks. However, they often require 3D geometries obtained from high-level quantum mechanical calculations, which are practically infeasible, limiting their applicability to real-world problems. To tackle this, we propose a…

    Submitted 14 December, 2023; v1 submitted 28 March, 2023; originally announced April 2023.

  38. arXiv:2304.03275  [pdf, other]

    cs.CV

    That's What I Said: Fully-Controllable Talking Face Generation

    Authors: Youngjoon Jang, Kyeongha Rho, Jong-Bin Woo, Hyeongkeun Lee, Jihwan Park, Youshin Lim, Byeong-Yeol Kim, Joon Son Chung

    Abstract: The goal of this paper is to synthesise talking faces with controllable facial motions. To achieve this goal, we propose two key ideas. The first is to establish a canonical space where every face has the same motion patterns but different identities. The second is to navigate a multimodal motion space that only represents motion-related features while eliminating identity information. To disentan…

    Submitted 18 September, 2023; v1 submitted 6 April, 2023; originally announced April 2023.

  39. arXiv:2304.02478  [pdf]

    cs.CL cs.AI cs.CY

    Exploring AI-Generated Text in Student Writing: How Does AI Help?

    Authors: David James Woo, Hengky Susanto, Chi Ho Yeung, Kai Guo, April Ka Yeng Fung

    Abstract: English as a foreign language (EFL) students' use of text generated from artificial intelligence (AI) natural language generation (NLG) tools may improve their writing quality. However, it remains unclear to what extent AI-generated text in these students' writing might lead to higher-quality writing. We explored 23 Hong Kong secondary school students' attempts to write stories comprising their own words…

    Submitted 31 December, 2023; v1 submitted 10 March, 2023; originally announced April 2023.

    Comments: 45 pages, 11 figures, 3 tables

    ACM Class: J.5; K.3.1

  40. arXiv:2303.17708  [pdf, other]

    cs.SE cs.LG

    Analysis of Failures and Risks in Deep Learning Model Converters: A Case Study in the ONNX Ecosystem

    Authors: Purvish Jajal, Wenxin Jiang, Arav Tewari, Erik Kocinare, Joseph Woo, Anusha Sarraf, Yung-Hsiang Lu, George K. Thiruvathukal, James C. Davis

    Abstract: Software engineers develop, fine-tune, and deploy deep learning (DL) models using a variety of development frameworks and runtime environments. DL model converters move models between frameworks and to runtime environments. Conversion errors compromise model quality and disrupt deployment. However, the failure characteristics of DL model converters are unknown, adding risk when using DL interopera…

    Submitted 24 April, 2024; v1 submitted 30 March, 2023; originally announced March 2023.

  41. arXiv:2303.10057  [pdf, other]

    eess.IV cs.LG physics.med-ph

    Posterior Estimation Using Deep Learning: A Simulation Study of Compartmental Modeling in Dynamic PET

    Authors: Xiaofeng Liu, Thibault Marin, Tiss Amal, Jonghye Woo, Georges El Fakhri, Jinsong Ouyang

    Abstract: Background: In medical imaging, images are usually treated as deterministic, while their uncertainties are largely underexplored. Purpose: This work aims at using deep learning to efficiently estimate posterior distributions of imaging parameters, which in turn can be used to derive the most probable parameters as well as their uncertainties. Methods: Our deep learning-based approaches are based o…

    Submitted 17 March, 2023; originally announced March 2023.

    Comments: Published in Medical Physics

  42. arXiv:2302.07203  [pdf, other]

    eess.IV cs.CV cs.SD eess.AS eess.SP

    Synthesizing audio from tongue motion during speech using tagged MRI via transformer

    Authors: Xiaofeng Liu, Fangxu Xing, Jerry L. Prince, Maureen Stone, Georges El Fakhri, Jonghye Woo

    Abstract: Investigating the relationship between internal tissue point motion of the tongue and oropharyngeal muscle deformation measured from tagged MRI and intelligible speech can aid in advancing speech motor control theories and developing novel treatment methods for speech-related disorders. However, elucidating the relationship between these two sources of information is challenging, due in part to th…

    Submitted 14 February, 2023; originally announced February 2023.

    Comments: SPIE Medical Imaging: Deep Dive Oral

  43. arXiv:2301.08959  [pdf, other]

    eess.IV cs.CV

    Successive Subspace Learning for Cardiac Disease Classification with Two-phase Deformation Fields from Cine MRI

    Authors: Xiaofeng Liu, Fangxu Xing, Hanna K. Gaggin, C. -C. Jay Kuo, Georges El Fakhri, Jonghye Woo

    Abstract: Cardiac cine magnetic resonance imaging (MRI) has been used to characterize cardiovascular diseases (CVD), often providing a noninvasive phenotyping tool. While recently flourished deep learning based approaches using cine MRI yield accurate characterization results, the performance is often degraded by small training samples. In addition, many deep learning models are deemed a "black box," for w…

    Submitted 21 January, 2023; originally announced January 2023.

    Comments: ISBI 2023

  44. arXiv:2301.07234  [pdf, other]

    eess.IV cs.CV

    DRIMET: Deep Registration for 3D Incompressible Motion Estimation in Tagged-MRI with Application to the Tongue

    Authors: Zhangxing Bian, Fangxu Xing, Jinglun Yu, Muhan Shao, Yihao Liu, Aaron Carass, Jiachen Zhuo, Jonghye Woo, Jerry L. Prince

    Abstract: Tagged magnetic resonance imaging (MRI) has been used for decades to observe and quantify the detailed motion of deforming tissue. However, this technique faces several challenges such as tag fading, large motion, long computation times, and difficulties in obtaining diffeomorphic incompressible flow fields. To address these issues, this paper presents a novel unsupervised phase-based 3D motion es…

    Submitted 30 April, 2023; v1 submitted 17 January, 2023; originally announced January 2023.

    Comments: Accepted to MIDL 2023 (oral)

  45. Scalable and Secure Row-Swap: Efficient and Safe Row Hammer Mitigation in Memory Systems

    Authors: Jeonghyun Woo, Gururaj Saileshwar, Prashant J. Nair

    Abstract: As Dynamic Random Access Memories (DRAM) scale, they are becoming increasingly susceptible to Row Hammer. By rapidly activating rows of DRAM cells (aggressor rows), attackers can exploit inter-cell interference through Row Hammer to flip bits in neighboring rows (victim rows). A recent work, called Randomized Row-Swap (RRS), proposed proactively swapping aggressor rows with randomly selected rows…

    Submitted 23 December, 2022; originally announced December 2022.

    Journal ref: The 29th IEEE International Symposium on High-Performance Computer Architecture (HPCA 2023)

  46. arXiv:2211.15075  [pdf, other]

    eess.AS cs.SD

    Inter-KD: Intermediate Knowledge Distillation for CTC-Based Automatic Speech Recognition

    Authors: Ji Won Yoon, Beom Jun Woo, Sunghwan Ahn, Hyeonseung Lee, Nam Soo Kim

    Abstract: Recently, the advance in deep learning has brought a considerable improvement in the end-to-end speech recognition field, simplifying the traditional pipeline while producing promising results. Among the end-to-end models, the connectionist temporal classification (CTC)-based model has attracted research interest due to its non-autoregressive nature. However, such CTC models require a heavy comput…

    Submitted 28 November, 2022; originally announced November 2022.

    Comments: Accepted by 2022 SLT Workshop

  47. arXiv:2210.02940  [pdf, other]

    cs.LG cs.AI stat.ML

    Communication-Efficient and Drift-Robust Federated Learning via Elastic Net

    Authors: Seonhyeong Kim, Jiheon Woo, Daewon Seo, Yongjune Kim

    Abstract: Federated learning (FL) is a distributed method to train a global model over a set of local clients while keeping data localized. It reduces the risks of privacy and security but faces important challenges including expensive communication costs and client drift issues. To address these issues, we propose FedElasticNet, a communication-efficient and drift-robust FL framework leveraging the elastic…

    Submitted 6 October, 2022; originally announced October 2022.

  48. arXiv:2209.07910  [pdf, other]

    cs.CV cs.AI cs.LG eess.IV

    Memory Consistent Unsupervised Off-the-Shelf Model Adaptation for Source-Relaxed Medical Image Segmentation

    Authors: Xiaofeng Liu, Fangxu Xing, Georges El Fakhri, Jonghye Woo

    Abstract: Unsupervised domain adaptation (UDA) has been a vital protocol for migrating information learned from a labeled source domain to facilitate the implementation in an unlabeled heterogeneous target domain. Although UDA is typically jointly trained on data from both domains, accessing the labeled source domain data is often restricted, due to concerns over patient data privacy or intellectual propert…

    Submitted 16 September, 2022; originally announced September 2022.

    Comments: Published in Medical Image Analysis (extension of MICCAI paper)

  49. arXiv:2208.07769  [pdf, other]

    cs.CV cs.AI cs.LG eess.IV eess.SP

    Unsupervised Domain Adaptation for Segmentation with Black-box Source Model

    Authors: Xiaofeng Liu, Chaehwa Yoo, Fangxu Xing, C. -C. Jay Kuo, Georges El Fakhri, Jonghye Woo

    Abstract: Unsupervised domain adaptation (UDA) has been widely used to transfer knowledge from a labeled source domain to an unlabeled target domain to counter the difficulty of labeling in a new domain. The training of conventional solutions usually relies on the existence of both source and target domain data. However, privacy of the large-scale and well-labeled data in the source domain and trained model…

    Submitted 16 August, 2022; originally announced August 2022.

    Comments: SPIE Medical Imaging 2022: Image Processing

  50. arXiv:2208.07754  [pdf, other]

    cs.CV cs.AI cs.LG eess.IV eess.SP

    Subtype-Aware Dynamic Unsupervised Domain Adaptation

    Authors: Xiaofeng Liu, Fangxu Xing, Jia You, Jun Lu, C. -C. Jay Kuo, Georges El Fakhri, Jonghye Woo

    Abstract: Unsupervised domain adaptation (UDA) has been successfully applied to transfer knowledge from a labeled source domain to target domains without their labels. Recently introduced transferable prototypical networks (TPN) further address class-wise conditional alignment. In TPN, while the closeness of class centers between source and target domains is explicitly enforced in a latent space, the unde…

    Submitted 16 August, 2022; originally announced August 2022.

    Comments: IEEE Transactions on Neural Networks and Learning Systems (TNNLS)